Just noticed this today - seems all the archiving activity has been noticed by NCBI / NLM staff. Thankfully most of SRA (the Sequence Read Archive) and other genomic data is also mirrored in Europe.

  • pansapiens@lemmy.sdf.orgOP
    link
    fedilink
    English
    arrow-up
    4
    ·
    17 days ago

    From watching the ArchiveTeam’s Warrior URLs as they stream past, it looks like PubMed Central manuscripts are being archived, which is a good thing.

  • taiidan@slrpnk.net
    link
    fedilink
    arrow-up
    1
    ·
    17 days ago

    That’s a lot of data to be archiving! What’s the archiving action responsible for this, or what group? I work with SRA and GEO daily for work, so this is interesting to see on lemmy.

    • pansapiens@lemmy.sdf.orgOP
      link
      fedilink
      English
      arrow-up
      3
      ·
      8 days ago

      It looks like ArchiveTeam’s Warrior was mostly capturing PubMedCentral (PMC) articles. As far as I know, SRA and GEO aren’t being backed up by ArchiveTeam (that is a lot of data), but since SRA is largely also mirrored by ENA, it wouldn’t seem a priority.

      • taiidan@slrpnk.net
        link
        fedilink
        arrow-up
        1
        ·
        7 days ago

        Didn’t know about ENA mirroring. Thanks! I’m tickled by the idea that all the paywalled journals are not backed up. If we ever have a planet wide catastrophe, we’ll have to rebuild using the open articles only!

    • pansapiens@lemmy.sdf.orgOP
      link
      fedilink
      English
      arrow-up
      1
      ·
      8 days ago

      It looks like ArchiveTeam’s Warrior was mostly capturing PubMedCentral (PMC) articles. As far as I know, SRA and GEO aren’t being backed up by ArchiveTeam (that is a lot of data), but since SRA is largely also mirrored by ENA, it wouldn’t seem a priority.