I Left Port 22 Open on the Internet for 54 Days. Here's Who Showed Up.

exu@feditown.com · 1 day

Edit: The post was probably heavily AI written and contains mistakes to that effect, which is unfortunate. The data in general is still interesting though.

vinyl@lemmy.world · 2 hours

tf is a web scraper engineer, gen ai?

uenticx@lemmy.world · 27 minutes

A professional web scraper and data extraction expert …

Looks like he’s a tool … maybe sed/awk?

MonkderVierte@lemmy.zip · 7 hours

deleted by creator

sakphul@discuss.tchncs.de · 8 hours

That was an interesting read. I didn’t have a lot If knowledge about this topic. So thanks for that!

Was ist using AI for assisted writing? Maybe. But I don’t have a Problem with that. I assume everyone ist using it to some extend (me included) and uses it as a tool as any other tool.

The Data is the actual interesting Thing. Would be cool If the author could share his data (of course IP’s or other personel information must be anonymized/hashed). If that is made up by AI…Well there goes your credibility. But I don’t assume that from the beginning.

Retro_unlimited@lemmy.world · 1 day

If they used AI, then I consider they lost all credibility.

KatherinaReichelt@feddit.org · 2 hours

I really struggle to see the point of posts like this. It is an interesting article about an interesting topic.

sircac@lemmy.world · 9 hours

For me it is not only that they used AI for the writing, is that they did not care to review/recheck/polishing it before releasing it to the public, so my effort in consuming it will be reciprocal

terabyterex@lemmy.world · 1 day

deleted by creator

magnue@lemmy.world · 1 day

Oh so you want them to do all that and gather all the data and do it themselves for free? What a dumb comment.

I’ve run a honeypot for the last month and the data is near-identical to this. It’s definitely credible.

MonkderVierte@lemmy.zip · 8 hours

If you publish something online, that’s also a responsibility. If they don’t want that, then they could just have made a comment somewhere “yo, i’ve had this container online on port 22 and this is what happened, yolo”.

Same for open source software btw.

MerryJaneDoe@lemmy.world · 12 hours

The issue with using AI is that the author doesn’t openly disclose the use at the beginning of the paper.

Yes, I know this particular write-up isn’t for official submission to an academic journal, but sharing methodology is important.

I would have no problem with AI-assisted writing IF the author credited the service used and, where applicable, included the prompts used.

It should be similar to documenting any sourced material. It’s not just about giving credit where credit is due. It’s also about accountability.

What a dumb comment.

Why is this necessary? Does this add anything at all to the conversation?

I’ve run a honeypot for the last month and the data is near-identical to this. It’s definitely credible.

Ah, well then. Problem solved. Someone on the internet said it’s credible, therefore it must be credible. Tell ya what - when you create a webpage to display your data and then provide an analysis of said data, I’ll consider you credible. Until then, though, you are just some short-tempered, rude, anonymous voice shouting into the void.

floquant@lemmy.dbzer0.com · 22 hours

Oh so you want them to do all that and gather all the data and do it themselves for free?

Yes, that is what 90% of the internet has been about since it became a thing. Doing everything for profit turns everything into shit.

cecilkorik@piefed.ca · 1 day

Near-identical doesn’t make it valuable. Plausible but incorrect is still incorrect. AI creates plausible and credible but incorrect data.

The plausibility and credibility is like a honeypot for your confidence. You read it, and understand it, and come to believe it. But it was false all along. You think you learned things. You actually learned nothing.

magnue@lemmy.world · 24 hours

Sounds like AI

NotSteve_@lemmy.ca · 23 hours

I initially disagreed but after actually reading the post, I’m with you. If it was only the article’s text that was generated and not the data or graphs then I don’t see why the whole thing would be written off. I mean, it’s really sad seeing people offload their writing to AI but I still found it interesting.

magnue@lemmy.world · 23 hours

Yeah I hate the slop but the data is good.

Fair Fairy@thelemmy.club · 18 hours

Honeypot as a Python script in a docker container?
Isn’t that not really a true isolation?

mal3oon@lemmy.world · 10 hours

endlessh + fail2ban

baller_w@lemmy.zip · 15 hours

Please say more.

I use both on a daily basis and from what I understand, there’s no implicit access from within a container. If you set it up right, there’s no access outside the container of any sort unless you explicitly say so.

trolololol@lemmy.world · 14 hours

Unless the container had a bug that they know but you don’t know.

Valmond@lemmy.dbzer0.com · 7 hours

Yeah the system isn’t protecting you (like it does preventing a normal user accessing another user), “only” the docker code does.

Or so I have understood it.

Glitchvid@lemmy.world · 23 hours

The Belgian traffic? Almost entirely from a single residential IP — one box that sent over 156,000 login attempts, more than the entire country of Germany. It just sat there, hammering echo “\x6F\x6B” over and over, every single second, for weeks. Relentless.

Had a funny similar thing, there’s some weird person/people that randomly probe and attack a specific game’s community hosted dedicated servers; and one week this specific IP address out of Virginia was just hammering one of mine, with what amounts to a specific byte sequence, then an incrementing number of the packet (until it wrapped around). Then it stopped. Weird shit.

frongt@lemmy.zip · 16 hours

It’s possible it was something misconfigured, a poorly-written script, or a bug in some software causing unexpected behavior. At the scale of the Internet, all of those are very possible.

It could also be the Internet equivalent of a numbers station.

Glitchvid@lemmy.world · 3 hours

It’s was a pretty specific non standard port on UDP. It’s not even doing proper scanning since the byte sequence used isn’t one that would trigger a response challenge/ack. My guess is someone trying to DOS using an older byte sequence that used to choke/kill the server software on older versions.

null@lemmy.zip · 18 hours

I’m kind of disappointed that bigboobz wasn’t on the top of the password list.

XLE@piefed.social · 1 day

Thanks for the warning OP

kratoz29@lemmy.zip · 23 hours

and contains mistakes to that effect

What mistakes?

AbidanYre@lemmy.world · 21 hours

At one point it said only 28 IPs came back and those 31 were clever. Or something to that effect.

magnue@lemmy.world · 1 day

Weird I did the exact same thing on a VPS. Basically the same data.

Phoenixz@lemmy.ca · 23 hours

So is there a socket container for this? Wi wouldn’t mind wasting some hacker assholes time with this

CalcProgrammer1@lemmy.today · 1 day

deleted by creator