• @Candelestine@lemmy.world
    link
    fedilink
    English
    -210 months ago

    You can anonymize anything, you just strip the identifying data out and do not include it when you give the info out. Like scrubbing the metadata off a pic before you post it online.

    The level of discourse in this sub has really fallen off a cliff recently…

    • @LaSaucisseMasquee
      link
      1010 months ago

      You can’t anonymize genetic data because, by essence, it identifies an individual.

      • ares35
        link
        fedilink
        710 months ago

        and, given enough samples, even those NOT in the database, are anyway by genetic relation.

      • @Candelestine@lemmy.world
        link
        fedilink
        English
        110 months ago

        By that logic you cannot anonymize a pic either. Yet everyone who has their photo taken cannot necessarily be identified in it.

        • @LaSaucisseMasquee
          link
          3
          edit-2
          10 months ago

          Can’t show a proof without doxxing me but I’ve written a patent to anonymize medical data (not genetic) and I’m a bioinformatician working with sequencing data.

          While you could probably achieve reasonable privacy levels by altering genetic data, we shouldn’t play with that under fallacious pretenses.

          You can use that data for medical research, of course… but also population profiling or stratification of customers if you are an insurance company.

        • Bipta
          link
          fedilink
          210 months ago

          Anonymized data has long been problematic and you definitely cannot meaningfully anonymize a picture in the truest sense of the word.

    • @gibmiser@lemmy.world
      link
      fedilink
      510 months ago

      So, paternity tests exist. Genetics, once enough data are collected, can absolutely be used to identify people.

      • @Candelestine@lemmy.world
        link
        fedilink
        English
        310 months ago

        Correct. Much like with a fingerprint, with both examples, you can determine with good accuracy if they are the same.

        In addition, as tools are advancing, we can extrapolate with very large data sets to identify you even without having seen your specific code, by tracing commonalities through any relatives of yours that voluntarily or involuntarily submitted their codes. However, this does still require those codes to be identifiable. A randomized set of random people’s codes could not be used for this, anymore than a database of fingerprints where all the labels were deleted could be used. It’s just a bunch of random fingerprints, with no names attached anywhere at all, its just a bunch of bleh.

        So, big concern in the hands of anyone who has not scrubbed the labels all off, which would overall render it much less useful.