Just saw this today - seems all the archiving activity has caught the attention of NCBI / NLM staff. Thankfully most of SRA (the Sequence Read Archive) and other genomic data is also mirrored in Europe.
From watching the ArchiveTeam’s Warrior URLs as they stream past, it looks like PubMed Central manuscripts are being archived, which is a good thing.
Yup, 🫡.
Thank you 🤗
Good that people are doing this
That’s a lot of data to be archiving! What’s the archiving action responsible for this, or what group? I work with SRA and GEO daily, so this is interesting to see on Lemmy.
It looks like ArchiveTeam’s Warrior was mostly capturing PubMed Central (PMC) articles. As far as I know, SRA and GEO aren’t being backed up by ArchiveTeam (that is a lot of data), but since SRA is largely also mirrored by ENA, backing it up wouldn’t seem to be a priority.
Didn’t know about ENA mirroring. Thanks! I’m tickled by the idea that all the paywalled journals are not backed up. If we ever have a planet-wide catastrophe, we’ll have to rebuild using the open articles only!