cross-posted from: https://lemmy.ml/post/15471632

Codeberg was asking about this. The linked toot by a commenter points to :

SEqlite

These are CC-BY-SA 4.0 remixes of the Stack Exchange Creative Commons Data Dumps. 100% Unendorsed by Stack Exchange, Inc.

They are minimal. They provide the data you probably care about and the data you need to comply with the original license in SQLite format.

  • @Miaou
    link
    English
    11 month ago

    They have already access to SO’s CC content, why would they get it from the fediverse?

    • @lambalicious@lemmy.sdf.org
      link
      fedilink
      English
      11 month ago

      They already have it.

      I said alternative to SO. As in, likely, a place to post new content (answers, comments). Nothing can really be done with the content OAI already got their hands on other than firing off a few well-placed EMP bombs.

      • @Miaou
        link
        English
        11 month ago

        Yes, but you mentioned importing old content is problematic, and I don’t see why?

        • @lambalicious@lemmy.sdf.org
          link
          fedilink
          English
          11 month ago

          Because to import old content, you have to respect the old license (or get every contributor of back-then to relicense). That would mean having a site with contents under differing licenses depending on date, which is something the corpos can use as an excuse to continue siphoning everything without consequence.

          I’m fine with a mirror / archive of SO. But it shoudl very definitively be a different thing than an active SO alternative, and their users and data storages should be also different.