Epstein Files Jan 30, 2026 Release - Archived from Justice.gov

xodoh74984@lemmy.world · edit-2 1 day ago


Wild_Cow_5769@lemmy.world · 22 minutes ago

@wild_cow_5769:matrix.org If someone has a group working on finding the dataset.

There are billions of people on earth. Someone downloaded dataset 9 before the link was taken down. We just have to find them :)

Wild_Cow_5769@lemmy.world · 43 minutes ago

Someone mentioned a matrix group. Can they DM and invite me. I want to help. Thx

DigitalForensick@lemmy.world · 3 hours ago

Holy shit

The entire Court Records and FOIA page is completely gone too! Fuckers!

Wild_Cow_5769@lemmy.world · 1 hour ago

I told you…

We need dataset 9…

DigitalForensick@lemmy.world · 1 hour ago

Have a scraper running on web.archive.org pulling all previously posted Court-Records and FOIA (docs,audio,etc.) from Jan 30th

Arthas@lemmy.world · 4 hours ago

Some bad news: it looks like the dataset 9 zip file link doesn’t work anymore. They appear to have removed the file, so my download stopped at 36gb. I’m not familiar with their site, so is it normal for them to remove files and maybe put them back at the same link location once they’ve reorganized them? Or do we have to scrape each pdf like another user has been doing?

DigitalForensick@lemmy.world · 3 hours ago

Does anyone have the OTHER data sets from before? I’ve been lasered in on DS1-DS12 but haven’t looked at the other documents at all

DigitalForensick@lemmy.world · 3 hours ago

this is ridiculous. Good thing we got in when we did!

Wild_Cow_5769@lemmy.world · 4 hours ago

All the zip files are gone on the DOJ website. The links are gone.

Wild_Cow_5769@lemmy.world · 4 hours ago

All the zip download links are gone on the DOJ website.

It’s only a matter of time before all the files just go poof.

DigitalForensick@lemmy.world · 5 hours ago

While I feel hopeful that we will be able to reconstruct the archive and create some sort of baseline that can be put back out there, I also can’t stop thinking about the “and then what” aspect here. We’ve seen our elected officials do nothing with this info over and over again and I’m worried this is going to repeat itself.

I’m fully open to input on this, but I think having a group path forward is useful here. These are the things I believe we can do to move the needle.

Right Now:

Create a clean Data Archive for each of the known datasets (01-12). Something that is actually organized and accessible.
Create a working Archive Directory containing an “itemized” reference list (SQL DB?) of the full Data Archive, with each document listed as a row with certain metadata. Imagining a GitHub repo that we can all contribute to as we work. – File number – Dir. Location – File type (image, legal record, flight log, email, video, etc.) – File Status (Redacted bool, Missing bool, Flagged bool)
Infill any MISSING records where possible.
Extract images from the .pdf format, break out the “Multi-File” pdfs, and rename images/docs by file number. (I made a quick script that does this reliably well.)
Determine which files were left as CSAM and “redact” them ourselves, removing any liability on our part.
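For the Archive Directory idea, here is a minimal sketch of what the "one row per document" table could look like in SQLite. The table name, the column names, and the extra `dataset` column are my own assumptions for illustration, not anything the group has agreed on.

```python
import sqlite3

# Hypothetical schema: names are illustrative, not an agreed standard.
SCHEMA = """
CREATE TABLE IF NOT EXISTS documents (
    file_number  TEXT PRIMARY KEY,   -- e.g. 'EFTA00326497'
    dataset      INTEGER NOT NULL,   -- 1 through 12
    dir_location TEXT,               -- relative path inside the archive
    file_type    TEXT,               -- image, legal record, flight log, email, video, ...
    redacted     INTEGER NOT NULL DEFAULT 0 CHECK (redacted IN (0, 1)),
    missing      INTEGER NOT NULL DEFAULT 0 CHECK (missing IN (0, 1)),
    flagged      INTEGER NOT NULL DEFAULT 0 CHECK (flagged IN (0, 1))
);
"""

def open_index(path=":memory:"):
    """Open (or create) the index database and make sure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn

def add_document(conn, file_number, dataset, dir_location=None, file_type=None,
                 redacted=False, missing=False, flagged=False):
    """Insert or overwrite one row; booleans are stored as 0/1."""
    conn.execute(
        "INSERT OR REPLACE INTO documents VALUES (?, ?, ?, ?, ?, ?, ?)",
        (file_number, dataset, dir_location, file_type,
         int(redacted), int(missing), int(flagged)),
    )
    conn.commit()

def missing_files(conn):
    """Return the file numbers currently marked as missing, sorted."""
    rows = conn.execute(
        "SELECT file_number FROM documents WHERE missing = 1 ORDER BY file_number"
    )
    return [r[0] for r in rows]
```

A single SQLite file is easy to pass around, but a plain CSV in the repo would diff better in Git. Either would work; this is just one option.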

What’s Next: Once we have the Archive and Archive Directory. We can begin safely and confidently walking through the Directory as a group effort and fill in as many files/blanks as possible.

Identify and un-redact all documents with garbage redactions (remember the copy/paste DOJ blunders from December), and identify poorly positioned redaction bars to uncover obfuscated names
LABELING! If we could start adding labels to each document in the form of tags that contain individuals, emails, locations, businesses - This would make it MUCH easier for people to “connect the dots”
Event Timeline… This will be hard, but if we can apply a timeline ID to each document, we can put the archive in order of events
Create some method for visualizing the timeline, searching, or making connections with labels.

We may not be detectives, legislators, or law men, but we are sleuth nerds, and the best thing we can do is get this data in a place that can allow others to push for justice and put an end to this crap once and for all. It’s lofty, I know, but enough is enough. …Thoughts?

Wild_Cow_5769@lemmy.world · edit-2 4 hours ago

GFD….

My 2 cents. As a father of only daughters…

If we don’t weed out this sick behavior as a society we never will.

My thoughts are enough is enough.

Once the files are gone there is little to 0 chance they are ever public again….

You expect me to believe that an “oh shit we messed up” was an accident?

It’s the perfect excuse… so no one looks at the files.

That’s my 2 cents.

acelee1012@lemmy.world · edit-2 4 hours ago

is anyone else having issues getting dataset 10 11* to start downloading? it has been sitting at 0 percent for a day while everything else is done and seeding. it shows connections to peers, rechecking does nothing, deleting and re-adding does nothing, asking tracker for more peers does nothing

Nomad64@lemmy.world · edit-2 2 hours ago

I have been seeding all of the datasets since Sunday. The copy of set 9 has been the busiest, with set 10 a distant second. I plan on seeding them for quite a while yet, and also picking up a consolidated torrent when that becomes available. Hopefully you are able to get connected via the Swarm.

dh007@lemmy.world · 3 hours ago

I’m getting errors for 1 and 8, all the rest went smooth.

acelee1012@lemmy.world · 3 hours ago

i am not seeing any errors, has just been stuck on downloading status with nothing going through. I originally added everything around the same time and all the other ones went through fine. I figured it was bugged or something so removed then readded it several times to no avail. I am not sure what else to try

acelee1012@lemmy.world · 3 hours ago

it’s really strange because on my other machine, everything’s going fine

captainmycaptain@lemmy.world · edit-2 5 hours ago

read the OP re. DS9 and DS10

acelee1012@lemmy.world · 5 hours ago

regardless of OP removing the magnet links or not, the torrents are still out there and that shouldn’t stop it. secondly, I meant 11

lukky@lemmy.zip · 3 hours ago

**what is the name of the software you use for torrents?**

acelee1012@lemmy.world · 3 hours ago

transmission

lukky@lemmy.zip · 3 hours ago

thank you

TurtleGreen@lemmy.world · 8 hours ago

where did the party move?

BWint@lemmy.world · 7 hours ago

Hasn’t moved AFAIK, just going slowly.

Wild_Cow_5769@lemmy.world · edit-2 6 hours ago

This entire thing smells funny. Even OP turned ghost on the threat of suspect images that no one has seen…

Ask yourself. How did the Times or whoever came up with this narrative even find these “suspect” images in a few hours when it seems no one in the world can even download the zip…

Wild_Cow_5769@lemmy.world · 6 hours ago

It’s still here. No one dropped a complete dataset 9 yet tho…

kongstrong@lemmy.world · edit-2 7 hours ago

PSA: paging bug has been fixed on the DOJ’s website. Website caps out at around 9600 for ~197k files, way less than the 520k in the less-complete dataset 9 torrent. Scraping the website now to find out which files they took offline.

Correction: 9600*50 files per page is in the 470k ballpark. Much more than 197k but still a lot less than the torrent’s 530k, let alone the expected 600k+ files that were supposed to be in there

lukky@lemmy.zip · 7 hours ago

can you explain to me what the problem is exactly? is it dataset 9? when i press the dataset 9 link on the DOJ site i see a download start with a 180gb zip file in the browser.

kongstrong@lemmy.world · 7 hours ago

yea for me it fails after anywhere between 200MB and 10-15GB. All the time.

Wild_Cow_5769@lemmy.world · 6 hours ago

Same. Every damn time.

lukky@lemmy.zip · 6 hours ago

And what is the solution ?

Wild_Cow_5769@lemmy.world · 4 hours ago

F if I know I’ve been messing with it for days. I’ve tried chunking. Different scripts. Different cookies.

kongstrong@lemmy.world · 5 hours ago

we’re working on some more complex solutions in an Element group. Not really sure where we stand at this moment, but it seems we can stitch a lot together from the large torrent files and from what we scraped from the DOJ’s website through a little bit of force.

Wild_Cow_5769@lemmy.world · 4 hours ago

Are u going after DS9?

kongstrong@lemmy.world · 4 hours ago

trying

captainmycaptain@lemmy.world · 5 hours ago

Check Available Pieces for the torrents. My guess is that you’ll see half of them are missing and UNavailable.

Wild_Cow_5769@lemmy.world · 12 hours ago

Let me ask a question.

For all the folks saying there are news reports of CSAM… Does that mean the news outlets got the full zip? How did they get it? No one else seems to be able to get it. Were they given it first?

If they don’t have the zip how did they even find it within hours of the files being released?

Did they provide proof where they redacted the “danger” and said look… here is the proof?

Seems rather suspect…

Considering the massive effort of regular people to comb through the files I would think the outcry would be gigantic….

DigitalForensick@lemmy.world · 6 hours ago

I’m not sure of the exact files that were reported by the NYT, but there certainly were some concerning images in the initial Jan 30 release, and it was certainly more than the reported 40. I saw others as well but I don’t remember what the file numbers were.

spoiler

[246249_247010]

From my own observation timeline on the images in question: Jan 30: Images were accessible through DOJ directly. File numbers wereskipped in the list, but were manually reachable through URL. All these photos were fully unredacted (uncensored). **Feb 1: ** Images were NOT accessible through DOJ anymore, returns “Page not found”. However images were (and still are) snapshotted via web.archive.org. Feb 2: Downloading the 87GB Set 9 appeared contain these images as well, meaning we likely all have them on our computers. yikes

These files were scrubbed from the DOJ website, along with many others.

I found many of the scrubbed files by parsing through the lists and finding large gaps in file numbers, where the preceding file did not contain multiple images/documents in one pdf. There are also tons of internal memos in the dataset that precede file groups and talk about the content ahead. These memos surrounded files that seemed like they were meant to be redacted, so it’s worth poking around. I didn’t go nuts, but these are things I found around them that were interesting and were also removed:

[EFTA00276493]: internal memo referring to Clinton photographed with “nude Gretchen”.
[EFTA00273790-EFTA276487]: (removed) looks like aerial Lidar scans of the full estate?
[EFTA00276220]: (removed) panoramic infrared / X-ray scan of a room

BWint@lemmy.world · 5 hours ago

One Redditor said that they reported more than 500 nude images to the DOJ, all from Dataset 9.

captainmycaptain@lemmy.world · 5 hours ago

I’m still waiting for just the first zip file to uncompress and it’s been HOURS. The ONLY reasonable explanation to bolster the NYT claim is that they put “AI” on the datasets running on a supercomputer, and “caught” the DOJ distributing CP! Show us the proof NYT! (redact faces and genitalia and show the images!) Then: CONVICT THEM ALL! LIFE IN PRISON FOR THE ENTIRE DOJ!!! ;-P

Wild_Cow_5769@lemmy.world · 5 hours ago

Or… wealthy people wanting the files off the internet.

BWint@lemmy.world · 6 hours ago

We didn’t have trouble getting Datasets 10, 11, or 12. I think Dataset 9 was probably delivered fine on Friday, so the NYTimes was able to grab a complete copy. Then, NYTimes started reporting the abusive material, which prompted the DOJ to yoink the ZIP, and it’s been screwy ever since.

I saw a post from a random Redditor confirming that they found abusive material, if that’s the concern. I doubt that the reports are fabricated, but I also agree that the reports are a great excuse for the DOJ to remove legitimate files.

kongstrong@lemmy.world · 9 hours ago

if what you’re saying is that CSAM seems like a very good excuse to redact a lot more of those files than they previously intended, I agree yes.

Wild_Cow_5769@lemmy.world · 7 hours ago

Yes…. It’s just an excuse to pull the files back and go after anyone who has them.

PeoplesElbow@lemmy.world · 22 hours ago

Ok everyone, I have done a complete indexing of the first 13,000 pages of the DOJ Data Set 9.

KEY FINDING: 3 files are listed but INACCESSIBLE

These appear in DOJ pagination but return error pages - potential evidence of removal:

EFTA00326497

EFTA00326501

EFTA00534391

You can try them yourself (they all fail):

https://www.justice.gov/epstein/files/DataSet 9/EFTA00326497.pdf

The 86GB torrent is 7x more complete than DOJ website

DOJ website exposes: 77,766 files

Torrent contains: 531,256 files

Page Range	Min EFTA	Max EFTA	New Files
0-499	EFTA00039025	EFTA00267311	21,842
500-999	EFTA00267314	EFTA00337032	18,983
1000-1499	EFTA00067524	EFTA00380774	14,396
1500-1999	EFTA00092963	EFTA00413050	2,709
2000-2499	EFTA00083599	EFTA00426736	4,432
2500-2999	EFTA00218527	EFTA00423620	4,515
3000-3499	EFTA00203975	EFTA00539216	2,692
3500-3999	EFTA00137295	EFTA00313715	329
4000-4499	EFTA00078217	EFTA00338754	706
4500-4999	EFTA00338134	EFTA00384534	2,825
5000-5499	EFTA00377742	EFTA00415182	1,353
5500-5999	EFTA00416356	EFTA00432673	1,214
6000-6499	EFTA00213187	EFTA00270156	501
6500-6999	EFTA00068280	EFTA00281003	554
7000-7499	EFTA00154989	EFTA00425720	106
7500-7999	(no new files - all wraps/redundant)
8000-8499	(no new files - all wraps/redundant)
8500-8999	EFTA00168409	EFTA00169291	10
9000-9499	EFTA00154873	EFTA00154974	35
9500-9999	EFTA00139661	EFTA00377759	324
10000-10499	EFTA00140897	EFTA01262781	240
10500-12999	(no new files - all wraps/redundant)

TOTAL UNIQUE FILES: 77,766

Pagination limit discovered: page 184,467,440,737,095,516 (2^64/100)

I searched random pages between 13k and this limit - NO new documents found. The pagination is an infinite loop. All work at: https://github.com/degenai/Dataset9
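If anyone wants to sanity-check the numbers in this post, a quick sketch like the one below re-adds the "New Files" column from the table above and recomputes the ratio and the pagination limit. The variable names are mine; the figures are copied from the post.

```python
# "New Files" column from the table above, in page-range order.
new_files = [21842, 18983, 14396, 2709, 4432, 4515, 2692, 329, 706, 2825,
             1353, 1214, 501, 554, 106, 10, 35, 324, 240]

total = sum(new_files)           # unique files exposed by the DOJ pagination
torrent_files = 531_256          # file count reported for the torrent
ratio = torrent_files / total    # how many times larger the torrent is
page_limit = 2**64 // 100        # integer part of 2^64 / 100

print(f"total unique files: {total:,}")
print(f"torrent / website:  {ratio:.2f}x")
print(f"website share:      {total / torrent_files:.1%}")
print(f"page limit:         {page_limit:,}")
```

The column sum comes out to 77,766, which matches the stated total, and the ratio is a little under 7x.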

PeoplesElbow@lemmy.world · 7 hours ago

DOJ Epstein Files: I found what’s around those 3 missing files (Part 2)

Follow-up to my Dataset 9 indexing post. I pulled the adjacent files from my local copy of the torrent. What I found is… notable.

TLDR

The 3 missing files aren’t random corruption. They all cluster around one event: Epstein’s girlfriend Karyna Shuliak leaving St. Thomas (the island) in April 2016. And one of the gaps sits directly next to an email where Epstein recommends her a novel about a sympathetic pedophile—two days before the book was publicly released.

The Big Finding: Duplicate Processing Batches

Two of the missing files (326497 and 534391) are the same document processed twice—once with redactions, once without—208,000 files apart in the index.

Redacted Batch	Unredacted Batch	Content
326494-326496	534388-534390	AmEx travel booking, staff emails
326497 - MISSING	534391 - MISSING	???
326498-326500	—	Email chain continues
326501 - MISSING	—	???
326502-326506	—	Reply + Invoice
—	534392	Epstein personal email

Random file corruption hitting the same logical document in two separate processing runs, 208,000 positions apart? That’s not how corruption works. That’s how removal works.

What’s Actually In These Files

I pulled everything around the gaps. It’s all one email chain from April 10, 2016:

The event: Karyna Shuliak (Epstein’s girlfriend) booked on Delta flight from Charlotte Amalie, St. Thomas → JFK on April 13, 2016.

St. Thomas is where you fly in/out to reach Little St. James. She was leaving the island.

The chain:

11:31 AM — AmEx Centurion (black card) sends confirmation to [email protected]
11:33 AM — Lesley Groff (Epstein’s executive assistant) forwards to Shuliak, CC’s staff
11:35 AM — Shuliak replies “Thanks so much”
3:52 PM — Epstein personally emails Shuliak
Next day — AmEx sends invoice

The unredacted batch (534xxx) reveals the email addresses that are blacked out in the redacted batch (326xxx):

Lesley Groff: [email protected]
Ann Rodriquez: [email protected]
Bella Klein: [email protected]
Karyna Shuliak: [email protected]

The Epstein Email (EFTA00534392)

The document immediately after missing file 534391:

From: "jeffrey E." <jeevacation@gmail.com>
To: Karyna Shuliak
Date: Sun, 10 Apr 2016 19:52:13 +0000

order http://softskull.com/dd-product/undone/

He’s telling her to buy a book. The same day she’s being booked to leave his island.

The Book

“Undone” by John Colapinto (Soft Skull Press)

On-sale date: April 12, 2016
Epstein’s email: April 10, 2016

He recommended it two days before public release.

Publisher’s description:

“Dez is a former lawyer and teacher—an ephebophile with a proclivity for teenage girls, hiding out in a trailer park with his latest conquest, Chloe. Having been in and out of courtrooms (and therapists’ offices) for a number of years, Dez is at odds with a society that persecutes him over his desires.”

The protagonist is a pedophile who resents society for judging him.

The author (John Colapinto) is a New Yorker staff writer, former Vanity Fair and Rolling Stone contributor. Exactly the media circles Epstein cultivated.

What’s Missing

So now we know the context:

EFTA00326497 — Between AmEx confirmation and Groff’s forward. Probably the PDF ticket attachment referenced in the emails.
EFTA00326501 — Between the forward chain and Shuliak’s reply. Unknown.
EFTA00534391 — Immediately before Epstein’s personal email about the pedo book. Unknown, but its position is notable.

Open Questions

How did Epstein have this book before release? Advance copy? Knows the author?
What is 534391? It sits between staff logistics emails and Epstein’s direct correspondence. Another Epstein email? An attachment?
Are there other Shuliak travel records with similar gaps? Is April 2016 unique or part of a pattern?
What else is in the corpus from [email protected]?

Verify It Yourself

Try the DOJ links (all return errors):

Check the torrent: Pull the EFTA numbers I listed. Confirm the gaps. Confirm the adjacencies.

Grep the corpus: Search for “QWURMO” (booking reference), “Shuliak”, “jeevacation”, “Colapinto”
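To confirm the gaps on a local copy, something like this minimal sketch would do. It assumes files are named `EFTA` plus eight digits (as in the DOJ URLs above); the function names are hypothetical and this is not the code from the repo.

```python
import re

# Assumes names like 'EFTA00326497.pdf' (EFTA followed by eight digits).
EFTA_RE = re.compile(r"EFTA(\d{8})")

def efta_numbers(names):
    """Collect the numeric part of every EFTA identifier found in the names."""
    nums = set()
    for name in names:
        m = EFTA_RE.search(name)
        if m:
            nums.add(int(m.group(1)))
    return nums

def find_gaps(names, start, end):
    """Return EFTA identifiers in [start, end] that do not appear in names."""
    present = efta_numbers(names)
    return [f"EFTA{n:08d}" for n in range(start, end + 1) if n not in present]
```

Feeding it the file names for 326494 through 326506 should list 326497 and 326501 if your copy matches mine. Keep in mind a gap is only a candidate: a multi-page PDF can consume several numbers, so check the preceding file before calling something missing.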

Summary

Three files missing from 531,256. All three cluster around one girlfriend’s April 2016 departure from St. Thomas. Same gaps appear in two processing batches 208,000 files apart. One gap sits adjacent to Epstein personally recommending a novel about a sympathetic pedophile, sent before the book was even publicly available.

This isn’t random corruption.

Full analysis + all code: https://github.com/degenai/Dataset9

If anyone has the torrent and wants to grep for Colapinto connections or other Shuliak trips, please do. This is open source for a reason.

sherbeticecream@lemmy.world · 16 minutes ago

Just skimming through: I have file 534391, but it shows ‘No Images Produced’. Not sure if that was your reason as well, and apologies in advance! Here’s an image of said file (https://lemmy.world/pictrs/image/d840f280-5e32-4417-a92e-ff281582080a.png)

kongstrong@lemmy.world · 9 hours ago

ysk the page limit has been fixed, it caps out around 9600 for a total of ~197k file entries. Way less than the largest torrent’s 530k. Scraping now to get a list of the files they kept on the DOJ so we can determine which files they don’t want out there. Would be a good lead to further investigate the torrent

PeoplesElbow@lemmy.world · 7 hours ago

Oh no…I didn’t know this, on one hand now i need to run another scan, but on the other it could reveal something, the torrent has 500k+ files so there is still a gap. I will run the scraper again and do a new analysis in the next day or two.

Wild_Cow_5769@lemmy.world · 21 hours ago

Just like I said… In NO way do I trust DOJ… Our only hope is if someone drops the full data set 9 somewhere.

PeoplesElbow@lemmy.world · 17 hours ago

My question is, why is the total download size so large and the range of displayed documents so small? Only 15% of the known documents are individually served on the site, and some aren’t seen until page 10,000

Wild_Cow_5769@lemmy.world · 12 hours ago

That’s why you need the full zip…

Moonsurfer_1@lemmy.world · 17 hours ago

It’s an effort to obscure for sure.

Wild_Cow_5769@lemmy.world · 12 hours ago

Yup… hopefully someone is able to get the full zip

Herschel@lemmy.world · 19 hours ago

Can anybody verify these hashes?

https://03c.de/?30a9ce3df3d88c3c#A6EKCNKa1NtfJShxAqMRkbVQewhJ2H2n4DfL6YhRSmUa

Arthas@lemmy.world · 1 day ago

I am downloading dataset 9 and should have the full 180gb zip done in a day. To confirm, the link on DOJ to the dataset 9 zip is now updated to be clean of CSAM or not? As much as I wish to help the cause, I do not want any of that type of material on my server unless permission has been given to host it for credible researchers only that need access to all files for their investigation, but I have no way of understanding what’s within legal rights to assist with redistributing the files to legitimate investigators and thus my plans to help create a torrent may be squashed. Please let me know.

kongstrong@lemmy.world · 9 hours ago

you’re not getting your connection cut off from the place where you’re downloading? That’s huge, could you let me know if you succeed?

Arthas@lemmy.world · 32 minutes ago

I was, and that is why it was taking so long for me to download as I use my custom downloader which uses various techniques to chunk the download. Unfortunately it seems like they’ve now removed the file completely so my downloader has no source to pull from and is stopped at 36gb.

BWint@lemmy.world · 1 day ago

Amazing - Once you have the 180GB Set 9 downloaded, I’ll seed.

At this point, my working assumption is that the version you’re downloading should be presumed to be free of CSAM, but we can’t know for sure until we check it. In addition, I assume that legitimate files were also removed from the version you’re downloading, but the legitimate files are preserved in the archives we already have (along with, tragically, the CSAM.)

I think that after you download the 180GB set, we should compare it to our existing files to identify files that were removed. Then, we can identify which of the removed files were CSAM, and which of the removed files were legitimate. Going to be a hell of a task…

Arthas@lemmy.world · 1 day ago

Ok great. As for comparing files. I would likely do a hash check. That shouldn’t be difficult to identify truly unique files. It’ll take a few days for a decent computer to generate all the hashes but it should be pretty automated. I’ll reach out once I have it completed.
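A rough sketch of the hash check, assuming both versions are extracted into two local folders. This is not my actual tooling; SHA-256 and the function names are just illustrative choices.

```python
import hashlib
from pathlib import Path

def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in 1 MiB chunks so large files never load fully into memory."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

def hash_tree(root):
    """Map each content hash to the relative paths that have that content."""
    root = Path(root)
    index = {}
    for p in sorted(root.rglob("*")):
        if p.is_file():
            index.setdefault(sha256_of(p), []).append(str(p.relative_to(root)))
    return index

def compare_trees(old_root, new_root):
    """Return (removed, added): paths whose content exists on only one side."""
    old, new = hash_tree(old_root), hash_tree(new_root)
    removed = sorted(path for h in old.keys() - new.keys() for path in old[h])
    added = sorted(path for h in new.keys() - old.keys() for path in new[h])
    return removed, added
```

Because the comparison is by content rather than by name, a file that was only renamed or moved will not show up as removed. A file that was re-redacted will show up on both lists, since its bytes changed.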

BWint@lemmy.world · 1 day ago

Thank you! I’m not very tech savvy, so I’m very little help in this whole process. Please LMK what you find.

o_derr889@lemmy.world · edit-2 1 day ago

someone posted the list of the original links. If it helps to cross reference I can check to see if I have it.

o_derr889@lemmy.world · 1 day ago

I have it as a text file. Shoot me a DM and I can send it to you.

thetrekkersparky@startrek.website · 1 day ago

From my understanding nobody knows. The DOJ said it was already removed, but the NYTimes claimed they found 40 images of CSAM. The DOJ said they immediately removed them Saturday, but a lot of files that didn’t contain CSAM were also removed. I’ve extracted the 101GB torrent and haven’t come across any yet, but there’s a ton of files in there too. People have yet to be able to download the entire ZIP and are trying to scrape everything individually as far as I know.

As for the legality, I’m not a lawyer and I don’t live in the States, but it’s all information that’s been released to the public by the US DOJ as required by a court order, so it’s a call that only you can make. With the amount of data that’s already disappeared I’m personally choosing to host it regardless, and I’ll seed whatever anyone else can salvage of dataset 9 too.

DigitalForensick@lemmy.world · 1 day ago

wondering the same thing myself. Not sure about the latest DS9 dump, but I’ve definitely seen some of the other leaks that included some CSAM. crazy that DOJ let that out the door. :/

Wild_Cow_5769@lemmy.world · 20 hours ago

Any luck?

Arthas@lemmy.world · 19 hours ago

yeah still chugging away slowly, it may take me a few days actually, it’s quite slow but so far it appears to be getting it.

CapableStaircase@lemmy.zip · 1 day ago

What’s your method for getting the zip file without being cut off by the CDN?

Arthas@lemmy.world · 30 minutes ago

I was being cut off, I manage it with chunking techniques. They unfortunately took down the file so now I have no source to pull from.

Arthas@lemmy.world · 22 hours ago

I have various chunking techniques that I use. I adaptively modify the request size of the chunks as I’ve noticed at times the CDN will give large amounts then micro amounts. I haven’t figured out the exact backoff rate but I have retry mechanisms in place. The CDN is very annoying but so far my methods are working, just slow.

Wild_Cow_5769@lemmy.world · 12 hours ago

I have tried dozens of different settings. Cookies. Etc. etc. I haven’t had much luck

Wild_Cow_5769@lemmy.world · edit-2 1 day ago

Good… I don’t trust what the DOJ says. If I see it with my own eyes that’s one thing, and I’ll promptly delete it. But I don’t believe anything the DOJ says.

Wild_Cow_5769@lemmy.world · 1 day ago

I’ve been trying all day to get chunks from that CDN…

o_derr889@lemmy.world · 1 day ago

I can also help seed. I’ve got lots of TBs free.

DigitalForensick@lemmy.world · 1 day ago

So what’s the consensus on what to do about all the fully uncensored CSAM the DOJ released on the 30th? Much of it has been removed as of today, but that shit is still fully up on archive.org… 🙄…Not Great…

Wild_Cow_5769@lemmy.world · edit-2 1 day ago

My two cents: I have nothing but daughters… all my children are daughters…

If we don’t take care of this weird sexual abuse problem among authority figures and the like now, we never will…

I don’t think I could sleep at night if I didn’t do my due diligence because someday time will just move on and all of us will be too old to do something about it…

We either take care of this now or we never will as a society…

Think about it: do you ever think there will be another point in the future to root out this kind of evil?

So I say release the files and let the chips fall where they fall but that’s just my two cents…

Would be one thing if this entire process felt like we could really trust justice to do the right thing…

Just look over there in the Epstein forum on Reddit. There are all kinds of pictures and names of really really wealthy people that can just easily buy their way out of trouble…

DigitalForensick@lemmy.world · 1 day ago

Hey that makes sense to me man.

I think there will be plenty of falling chips in the coming weeks. Once the data is aggregated and truly accessible and searchable… someone is going to make some AI something that can connect the dots faster than the justice system - because my god is it slow as molasses.

I’m so tired of waiting around.

Wild_Cow_5769@lemmy.world · 1 day ago

Thx…

BWint@lemmy.world · 1 day ago

It’s “great” that the DOJ removed CSAM at the same time as they were removing perfectly legitimate files that are in the public interest. That’s just really smart. Puts us all in a hell of a bind.

I can’t speak for others, but I’ll plan to preserve the 87GB Set 9, the 90GB Set 9, and Set 10, until we can get an updated “complete” (current) Set 9 that can be presumed to be free of CSAM. After that, we can try to identify the legitimate files that are missing from the “complete” Set 9, and preserve those while purging the CSAM.

Xenom0rph@lemmy.world · 18 hours ago

Where can I get magnet links or torrents for this 87GB and 90GB sets?

Moonsurfer_1@lemmy.world · 17 hours ago

90GB

The 90 GB is a de-duplicated merge of the 87 GB and 48 GB incomplete downloads.

Here’s the magnet link for the 90 GB file.

DigitalForensick@lemmy.world · 1 day ago

This seems like a valid plan - although I’m not that confident in the ‘purge’. It might be good to redact those images ourselves and then nobody is pressed to store them. Better to have a confidently safe dataset that can be passed around safely.

Also, it looks like they went back and repaired the shitty text redactions on docs that were released in late 2025, from what I can tell. I ran a script that auto-detects and removes “fake” redactions and it’s not getting any hits anymore, even on files that it flagged in the past. They are definitely trying to cover their tracks by the day.

Wild_Cow_5769@lemmy.world · 1 day ago

And have you seen this or are you speculating?

DigitalForensick@lemmy.world · 1 day ago

Without a timestamp on the photo it’s impossible to be 100% sure, but it was obvious enough for me to ask the question. :/ It seems like it was a mistake on their part because everything else has heavily redacted nudity. You can also see references in the internal memo docs preceding the content.

Wild_Cow_5769@lemmy.world · 1 day ago

There’s a lawsuit to try to have a judge give an injunction to the file release. There isn’t a lot of time left…

Once those files go away, do you honestly think anybody will ever get to see them again?

Wild_Cow_5769@lemmy.world · 2 days ago

As far as CSAM and the “don’t go looking for data set 9”…

Look I’ll be straight up.

If I find any CSAM it gets deleted…

But if you believe for 1 second that DOJ didn’t delete relevant files because they are protecting people then I have a time share to sell you at a cheap price on a beautiful scenic swamp in Florida…

MachineFab812@discuss.tchncs.de · edit-2 1 day ago

It’s literally left-in on purpose to try to have something over people that download and/or seed the torrents. We need a file-list to know what not to dl/seed, or a new torrent for that set.