Fig. 1 It's in the bank |
I. Background
In the previous post of this series the genomes, or segments thereof, of many organisms were shown to contain the "~32/35/25/6 pattern" of atom types Carbon, Hydrogen, Nitrogen, and Oxygen.
It is a "percentages pattern" detected in the nucleotides/base pairs 'A', 'C', 'G', 'T', and 'U' of their DNA/RNA in ribosomes (On The Origin of The Containment Entity - 15).
So, in today's post I am presenting an appendix (Link) with exclusively human DNA chromosomes revealed in recent research:
"Since its initial release in 2000, the human reference genome has covered only the euchromatic fraction of the genome, leaving important heterochromatic regions unfinished. Addressing the remaining 8% of the genome, the Telomere-to-Telomere (T2T) Consortium presents a complete 3.055 billion–base pair sequence of a human genome, T2T-CHM13, that includes gapless assemblies for all chromosomes except Y, correctserrors in the prior references, and introduces nearly 200 million base pairs of sequence containing 1956 gene predictions, 99 of which are predicted to be protein coding. The completed regions include all centromeric satellite arrays, recent segmental duplications, and the short arms of all five acrocentric chromosomes, unlocking these complex regions of the genome to variational and functional studies."
(Nurk et al., Science). After improving the recorded sequences in the human chromosome, those improvements were placed in the GenBank (Fig. 1).
The tables in today's appendix (Link) contain the atomic information "percentages pattern" but also as usual they contain a link to the GenBank database data for that chromosome.
To view the nucleotides in FASTA format click on the "FASTA" button at the upper left on the GenBank information page linked to.
As you can see, the "~32/35/25/6 pattern" appears in human chromosome DNA too.
II. The Tata Box Containment Entity
In genetics a "Tata box" begins with a promoter region of nucleotides (e.g. "TATAAA") and terminates with a terminator region of nucleotides (e.g. "TATCTC").
Fig. 2 The Tata box zone |
In today's appendix the tables include:
1) an HTML table detailing an analysis of the entire sequence of nucleotides, 2) an HTML table detailing an analysis of only the sequence of nucleotides within a Tata box region, and 3) an HTML table detailing an analysis of the sequence of nucleotides within the Tata box where chimera nucleotides, if any, have been excluded from the analysis.
The coding (programming) process is: 1) load all of the GenBank nucleotides into a structure, 2) isolate all of the Tata boxes (exclude "out-of-frame" nucleotides [meaning any nucleotides preceding the promoter as well as any following the terminator]), and 3) exclude any chimera nucleotides (i.e. "out-of-frame" by not being a part of a three-nucleotide codon).
III. The Atomic Makeup
The number of atoms in a given nucleotide ('A', 'C', 'G', 'T', and 'U') are:
'A' = "C5H5N5" (5 Carbon, 5 Hydrogen, 5 Nitrogen)
'C' = "C4H5N3O1" (4 Carbon, 5 Hydrogen, 3 Nitrogen, 1 Oxygen)
'T' = "C5H6N2O2" (5 Carbon, 6 Hydrogen, 2 Nitrogen, 2 Oxygen)
'G' = "C5H5N5O1" (5 Carbon, 5 Hydrogen, 5 Nitrogen, 1 Oxygen)
'U' = "C4H4N2O2" (4 Carbon, 4 Hydrogen, 2 Nitrogen, 2 Oxygen)
(Quantum Biology - 18). As you can see the analysis of the atom content of any given genome in the appendix (Link) is a function of simple arithmetic (it is not "rocket science").
The percentages are derived by dividing the count of each atom type by the total atom count.
For example:
Atom | Atom Count | Percent |
Carbon | 108,622,420 | 32.4217 |
Hydrogen | 119,321,446 | 35.6151 |
Nitrogen | 85,235,852 | 25.4412 |
Oxygen | 21,850,477 | 6.5219 |
Totals | 335,030,195 | ----- |
Atom | Atom Count | Percent |
Carbon | 77,347,794 | 32.4460 |
Hydrogen | 84,768,668 | 35.5589 |
Nitrogen | 60,976,224 | 25.5784 |
Oxygen | 15,296,473 | 6.4166 |
Totals | 238,389,159 | ----- |
Atom | Atom Count | Percent |
Carbon | 77,335,375 | 32.4460 |
Hydrogen | 84,754,884 | 35.5589 |
Nitrogen | 60,966,898 | 25.5786 |
Oxygen | 15,293,830 | 6.4165 |
Totals | 238,350,987 | ----- |
(Today's Appendix at Chromosome Y). For example in the "Full-Genome Table" the Carbon percentage (32.4217) is derived by dividing 108,622,420 by 335,030,195 then multiplying by 100.0 (i.e. change decimal into percent), and finally by rounding to 4 decimal places.
And so forth.
IV. Closing Comments
The big deal about atomic percentage patterns does not have to be explained in fourth-year Latin or Greek phrases.
It is a simple containment entity to see.
The inexplicable mystery is how these groups of just four atoms became brainy enough to make up the genes of you and I and everyone we know.
The previous post in this series is here.
"Astronomers have found by far the most distant and earliest quasar ever seen, a cosmic beacon shining so soon after the big bang that standard theory can’t explain how it was built ... Astronomers thought the black holes formed stepwise within early galaxies, as giant stars collapsed and merged, but quasars detected from when the universe was less than 1 billion years old have challenged the idea. 'We were already concerned,' says Anna-Christina Eilers of the Massachusetts Institute of Technology. The new one, dubbed UHZ-1, which blazed when the universe was less than 450 million years old, HAS MADE THAT SCENARIO UNTENABLE" (Science, emphasis added).
ReplyDelete