Putting it all Together
After the EDNA software processes a users DNA the result is a large group of tiny 1-line files that would look similar to this…
(And around 3-million more files like the those shown above)
This process is run on servers that have no connection to the internet, no WiFi or bluetooth capability. The software also generates one larger file that is just a list of the cyan strings which begin EDNA_. This larger file is encrypted and placed on the Interplanetary File System (IPFS) and the decryption key for the file sent to the DNA owners EOS wallet.
EDNA waits till we have saved up 5,000 DNA owners data as represented in the tiny files. We then tumble these files all together and mix them before sending at random the small files likewise to the IPFS storage system. The odds of someone being able to reconstruct anyone’s DNA without the private keys are 3-million times 3 million repeated 5,000 times to one making the EDNA storage strategy probably one of the most secure methods ever created.
What is almost as important as the security built into this design is the added feature that the raw data does not need to be encrypted to be kept safe, and can therefore be used by EDNA’s internal research program to add value to the DNA kept within the EDNA system.
Just How Safe is This?
The graphic below illustrates a part of the human genome called Short Tandem Repeats (STR’s). Every human carries these, and it is from these sections of the DNA that forensic scientists discover someones identity in their DNA. In the US, the CODIS database stores 20 of these sections of DNA so that they can discover who left DNA at a crime scene. The odds of matching 20 of 20 of these STR’s and it not being the persons DNA are astronomical – and why DNA evidence in court is so convincing. When scientists match a large number of them (but not all 20) they know they are looking at a family member of the person they seek.
The graphic below illustrates STR #1 and #2. With 11 to 18 more digits in the “combination” (depending on which countries database is being used) you can see the science requires a good amount of intact data and quality DNA to “get a match”. Further the STR’s are spread widely across the DNA, and appear in most of the chromosomes as can be seen in the bottom graphic.
Why is this so Important?
Look again at the EDNA data storage strategy above. The letters in the first file listed (A-T) could correspond to the fist blue box in the blue section of 10 for the first person in the graphic. The other 9 base pairs would be stored in other files, mixed in with millions and millions of other base pairs, and no possible way to put them together as belonging to the same person without the private keys. Shown below is just how scattered across the chromosomes these STR’s are in the human genome.
This is how EDNA keeps your identity and family relations safe while your data is on chain.