
Saturday, November 11, 2023

On The Origin Of A Genetic Constant - 9

Ye olde Snail's Pace

If you haven't read the previous posts in this series (On The Origin Of A Genetic Constant, 1, 2, 3, 4, 5, 6, 7, 8) then this post will be a primer.

To determine the genetic constants for DNA and RNA we begin with the basic Chemical Formula concept: 

"In chemistry, a chemical formula is a way of presenting information about the chemical proportions of atoms that constitute a particular chemical compound or molecule, using chemical element symbols, numbers ... [example glucose:] C6H12O6 (12 hydrogen atoms, six carbon and oxygen atoms)."

(Wikipedia, Chemical Formula). Now we apply it to the molecules of DNA and RNA:

"I. Where Did The ~(32/35/25/6) Originate?

That is a fair question, so this post will answer the question and will provide the updated exact values and where "~(32/35/25/6)" originated.

Let's start with the nucleotide, atom, and their quantities that are in BOTH DNA and RNA:

Nucleotides(ACG), atoms, and counts in both DNA and RNA:

A: (Adenine) 15 atoms (5 carbon, 5 hydrogen, 5 nitrogen, 0 oxygen)

C: (Cytosine) 13 atoms (4 carbon, 5 hydrogen, 3 nitrogen, 1 oxygen)

G: (Guanine) 16 atoms (5 carbon, 5 hydrogen, 5 nitrogen, 1 oxygen)

44 total atoms (14 carbon, 15 hydrogen, 13 nitrogen, 2 oxygen)

Percentages:
(carbon 31.8182, hydrogen 34.0909, nitrogen 29.5455, oxygen 4.5455)


Now let's add the missing ingredient ("T") needed to make the nucleotide group complete for DNA:

Additional nucleotide(T), atoms, and count (only in DNA):

T: (Thymine) 15 atoms, (5 carbon, 6 hydrogen, 2 nitrogen, 2 oxygen)
59 total atoms (19 carbon, 21 hydrogen, 15 nitrogen, 4 oxygen)

Percentages:
(carbon 32.2034, hydrogen 35.5932, nitrogen 25.4237, oxygen 6.7797)

That is the source for the DNA (ACGT) ~(32/35/25/6) genetic constant.

It is 19, 21, 15, and 4 divided by 59 (x 100.0) which determines those percentages of those atoms in the genomes of DNA."

(On The Origin Of A Genetic Constant - 5). To top it off, we note that RNA does not have "T" (thymine); instead it has "U" (uracil):

"Now let's add the missing ingredient ("U") needed to make the nucleotide group complete for RNA:

Additional nucleotide(U), atoms, and count (only in RNA):
U: (Uracil) 12 atoms, (4 carbon, 4 hydrogen, 2 nitrogen, 2 oxygen)
56 total atoms (18 carbon, 19 hydrogen, 15 nitrogen, 4 oxygen)

Percentages:
(carbon 32.1429, hydrogen 33.9286, nitrogen 26.7857, oxygen 7.1429)

That is the source for the RNA (ACGU) ~(32/33/26/7) genetic constant.

It is 18, 19, 15, and 4 divided by 56 (x 100.0) which determines those percentages of those atoms in the genomes of RNA."

(On The Origin Of A Genetic Constant - 6).
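The arithmetic in the two quoted passages can be checked with a few lines of Python. This is only a minimal sketch: the atom counts come from the chemical formulas quoted above, while the names BASE_ATOMS and constant_for are made up for this illustration and are not from the series.

```python
# Atom counts per nucleobase, taken from the chemical formulas quoted above.
# Outer keys are nucleotide letters; inner keys are atom symbols
# (so "C" means cytosine outside and carbon inside).
BASE_ATOMS = {
    "A": {"C": 5, "H": 5, "N": 5, "O": 0},  # adenine
    "C": {"C": 4, "H": 5, "N": 3, "O": 1},  # cytosine
    "G": {"C": 5, "H": 5, "N": 5, "O": 1},  # guanine
    "T": {"C": 5, "H": 6, "N": 2, "O": 2},  # thymine (DNA only)
    "U": {"C": 4, "H": 4, "N": 2, "O": 2},  # uracil (RNA only)
}


def constant_for(bases):
    """Percent of C, H, N and O atoms when each listed base is counted once."""
    totals = {atom: sum(BASE_ATOMS[b][atom] for b in bases) for atom in "CHNO"}
    all_atoms = sum(totals.values())
    return {atom: round(100.0 * n / all_atoms, 4) for atom, n in totals.items()}


dna_constant = constant_for("ACGT")  # 19, 21, 15 and 4 divided by 59
rna_constant = constant_for("ACGU")  # 18, 19, 15 and 4 divided by 56
print("DNA:", dna_constant)
print("RNA:", rna_constant)
```

Rounded to four places, the two results should agree with the DNA and RNA percentages quoted above.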

You may be wondering where the projected constant "~(32/35/25/6)" of DNA nucleotide carbon, hydrogen, nitrogen, and oxygen atoms came from, so here are the relevant Chemical Formulas:

[Image: screen shots of the chemical formulas, from the Wikipedia links above]

The calculations concerning the atomic genetic constant are accomplished by simple arithmetic (no algebra or high math or calculus is necessary).

To apply it to a GenBank single genome file (.gbff or .fasta): 

1) load the ACGT nucleotides (the ORIGIN section of a GenBank .gbff file, or the second line through the end of the file for a GenBank FASTA file);
2) determine the individual nucleotide counts of A, C, G, and T in that genome;
3) multiply those nucleotide counts by the individual carbon ('C'), hydrogen ('H'), nitrogen ('N'), and oxygen ('O') atom counts specified in the Chemical Formula for each nucleotide type (ACGT), for example: 'A' nucleotide count times carbon count (5), 'C' nucleotide count times carbon count (4), etc.;
4) add up (sum) the 'C', 'H', 'N', and 'O' quantities for each individual atom type;
5) add up (sum) all atoms; and
6) divide each atom category count ('C', 'H', 'N', and 'O') by the sum of all atoms, then multiply by 100 to derive the percentage of each atom type in that genome:

Example genome totals:

Atom        Atom Count    Percent
Carbon      1,071,062     32.3550
Hydrogen    1,178,966     35.6146
Nitrogen      847,036     25.5875
Oxygen        213,284      6.4429
Totals      3,310,348     ~100.00

Total carbon atoms (1,071,062) divided by total genome atoms (3,310,348) = 0.32355 (times 100 = 32.3550%), and so forth for the other three atom types.
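The six steps can be sketched in Python as follows. This is a rough illustration, not the program used for this series: the function names (fasta_sequence, atom_table) and the toy FASTA record are hypothetical, and the sketch assumes that any symbol other than A, C, G, or T is simply left out of the atom counts.

```python
from collections import Counter

# Atoms per nucleotide type in the order carbon, hydrogen, nitrogen, oxygen.
ATOMS_PER_BASE = {
    "A": (5, 5, 5, 0),
    "C": (4, 5, 3, 1),
    "G": (5, 5, 5, 1),
    "T": (5, 6, 2, 2),
}
ATOM_NAMES = ("Carbon", "Hydrogen", "Nitrogen", "Oxygen")


def fasta_sequence(text):
    """Step 1: join every non-header line of single-record FASTA text."""
    lines = (line.strip() for line in text.splitlines())
    return "".join(line for line in lines if not line.startswith(">")).upper()


def atom_table(sequence):
    """Steps 2-6: return ({atom: count}, {atom: percent}) for one sequence."""
    counts = Counter(sequence)  # step 2 (assumption: non-ACGT symbols are ignored)
    atom_counts = [
        sum(counts[base] * per_base[i] for base, per_base in ATOMS_PER_BASE.items())
        for i in range(len(ATOM_NAMES))
    ]  # steps 3 and 4
    total = sum(atom_counts)  # step 5
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T")
    percents = [100.0 * n / total for n in atom_counts]  # step 6
    return dict(zip(ATOM_NAMES, atom_counts)), dict(zip(ATOM_NAMES, percents))


# Hypothetical toy record: two each of A, C, G, T plus two unknown 'N' symbols.
toy_fasta = ">toy record (hypothetical)\nACGT\nACGTNN\n"
toy_counts, toy_percents = atom_table(fasta_sequence(toy_fasta))
for name in ATOM_NAMES:
    print(f"{name:9s} {toy_counts[name]:4d} {toy_percents[name]:8.4f}")
```

Because the toy record holds equal numbers of A, C, G, and T, its percentages land on the constant itself; a real genome will differ slightly, as in the example table above.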

Today's appendix gives an example of how the numbers come about (Example Appendix).

If you like, enjoy this arithmetic applied to over a million genomes in the previous appendices to the previous post (Appendix Low, Appendix High, and Appendix Average).

The next post in this series is here, the previous post in this series is here.


Once upon a time a polymath visited Harvard:


Appendix Genetic Constant - 9

This is an appendix to: On The Origin Of A Genetic Constant - 9


Genetic Constant ~(32/35/25/6) file analysis

GenBank File Genome link: CP002988.1

Nucleotide Counts:

Adenine ('A') count: 527,444
Cytosine ('C') count: 930,774
Guanine ('G') count: 931,003
Thymine ('T') count: 524,468

Total Nucleotide content: 2,913,689 bp

ACGT Atom Percents:

Adenine ('A'):
Carbon: 6.17, Hydrogen: 6.17, Nitrogen: 6.17, Oxygen: 0.00,
Cytosine ('C'):
Carbon: 8.70, Hydrogen: 10.88, Nitrogen: 6.53, Oxygen: 2.18,
Guanine ('G'):
Carbon: 10.88, Hydrogen: 10.88, Nitrogen: 10.88, Oxygen: 2.18,
Thymine ('T'):
Carbon: 6.13, Hydrogen: 7.36, Nitrogen: 2.45, Oxygen: 2.45,
        For totals: (see Atom percent sums below)

Atom percent sums (from Atom Data/Counts above):

Carbon: 31.8825% (6.17%+8.70%+10.88%+6.13%)
Hydrogen: 35.2846% (6.17%+10.88%+10.88%+7.36%)
Nitrogen: 26.0282% (6.17%+6.53%+10.88%+2.45%)
Oxygen: 6.80474% (0.00%+2.18%+2.18%+2.45%)

Total percent: 100%

Atom variation-from-constant percents:

Carbon: 0.320903%
Hydrogen: 0.308601%
Nitrogen: 0.604465%
Oxygen: 0.0250394%
total: 1.25901%
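The figures in this appendix can be reproduced from the four nucleotide counts alone. The sketch below is an illustration rather than the original program; the variable names are made up here, and it assumes the variation is the absolute difference between each atom percent and the rounded DNA constant (32.2034/35.5932/25.4237/6.7797).

```python
DNA_CONSTANT = {"Carbon": 32.2034, "Hydrogen": 35.5932, "Nitrogen": 25.4237, "Oxygen": 6.7797}

# Nucleotide counts listed above for CP002988.1.
nucleotide_counts = {"A": 527_444, "C": 930_774, "G": 931_003, "T": 524_468}

# Atoms per nucleotide in the order carbon, hydrogen, nitrogen, oxygen.
atoms_per_base = {"A": (5, 5, 5, 0), "C": (4, 5, 3, 1), "G": (5, 5, 5, 1), "T": (5, 6, 2, 2)}

# Sum each atom type over the four nucleotide types.
atom_counts = {
    name: sum(nucleotide_counts[b] * atoms_per_base[b][i] for b in nucleotide_counts)
    for i, name in enumerate(DNA_CONSTANT)
}
total_atoms = sum(atom_counts.values())
percents = {name: 100.0 * n / total_atoms for name, n in atom_counts.items()}

# Assumption: variation = absolute difference from the rounded constant.
variation = {name: abs(percents[name] - DNA_CONSTANT[name]) for name in DNA_CONSTANT}

for name in DNA_CONSTANT:
    print(f"{name}: {percents[name]:.4f}% (variation {variation[name]:.6f}%)")
print(f"total variation: {sum(variation.values()):.5f}%")
```

Under those assumptions the printed percents and variations should line up with the atom percent sums and variation-from-constant values listed above.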

Friday, November 10, 2023

On The Origin Of A Genetic Constant - 8

What is the big deal about a "genetic constant", you might wonder.

Well, the answer is the same as for any "fundamental physical constant or universal constant":

"There are many physical constants in science, some of the most widely recognized being the speed of light in vacuum c, the gravitational constant G, the Planck constant h, the electric constant ε0, and the elementary charge e. Physical constants can take many dimensional forms: the speed of light signifies a maximum speed for any object and its dimension is length divided by time; while the proton-to-electron mass ratio, is dimensionless.

The term "fundamental physical constant" is sometimes used to refer to universal-but-dimensioned physical constants such as those mentioned above. Increasingly, however, physicists reserve the expression for the narrower case of dimensionless universal physical constants, such as the fine-structure constant α, which characterizes the strength of the electromagnetic interaction."

(Wikipedia, Physical Constant). The concept is simple enough; however, there is another part of the story to consider.

One thing about all constants is that they are measurements.

The quality of everything used to make a measurement, as well as the skill of the one doing the measuring, can vary in degree of accuracy.

In the case of DNA gathering, analyzing, storing, and use:

"Despite how it is often portrayed, in the media and in courts, the forensic science of DNA is far from infallible." (Nature)

"It is extremely important to remember that DNA can easily be damaged even when it has been properly collected and packaged. Damage to DNA can occur from improper storage, exposure to direct sunlight, or simply by becoming too warm." (How To)

(cf. Guidelines for DNA Collection, The Atlantic, Puritan, DNA Evidence).

Measuring the length of a pencil is not the same in terms of difficulty as counting the atoms in a genome.

But even more difficult than that is accurately gathering those atoms.

It is well known that even gathering the nucleotide count ('A','C','G','T','U') is difficult because they are so tiny.

Those are some of the reasons that the genetic constant ~(32/35/25/6) discovered and discussed in this series is compelling.

The genomes where it is found come from around the globe, from the air, water, and land, and from many different scientists.

In today's appendices over a million such genomes documented in the GenBank are analyzed for three variation dimensions (Appendix Low, Appendix High, and Appendix Average).

The technique used is to gather the counts of the carbon, hydrogen, nitrogen and oxygen atoms in each 'A','C','G','T' nucleotide in each genome.

The variations from the ~(32/35/25/6) genetic constant are then detailed in terms of the lowest, highest, and average percents of variation.

That is, carbon variation from 32.2034%, hydrogen variation from 35.5932%, nitrogen variation from 25.4237%, and oxygen variation from 6.7797% (NOTE: the genetic constant values for DNA are calculated here and for RNA here).
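For readers who want to see the variation arithmetic spelled out, here is a small Python sketch. The function names (variation_summary, bucket) are hypothetical. It also rests on an assumption: that "lowest", "highest", and "average" mean the minimum, maximum, and mean of the four per-atom differences from the constant. That reading does reproduce the v= values shown in the appendices for LR862133.1 (0.0210% low, 0.5229% high).

```python
DNA_CONSTANT = {"Carbon": 32.2034, "Hydrogen": 35.5932, "Nitrogen": 25.4237, "Oxygen": 6.7797}


def variation_summary(percents):
    """Return (lowest, highest, average) absolute difference from the DNA constant."""
    differences = [abs(percents[atom] - DNA_CONSTANT[atom]) for atom in DNA_CONSTANT]
    return min(differences), max(differences), sum(differences) / len(differences)


def bucket(variation):
    """Place one variation value into the report's <1, <2, <3, <4, >=4 bins."""
    for limit in (1.0, 2.0, 3.0, 4.0):
        if variation < limit:
            return f"<{limit:.1f}%"
    return ">=4.0%"


# Atom percents reported for LR862133.1 in the appendices.
sample = {"Carbon": 32.4808, "Hydrogen": 35.8597, "Nitrogen": 24.9008, "Oxygen": 6.7587}
low, high, average = variation_summary(sample)
print(f"low {low:.4f}% ({bucket(low)})")
print(f"high {high:.4f}% ({bucket(high)})")
print(f"average {average:.4f}%")
```

Counting how many genomes fall into each bin then gives the kind of "variation count" summary shown at the end of each section of the appendices.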

These genome values are considered in sections of 100,000 (one hundred thousand) genomes and a final smaller quantity, the sum of all of the sections being 1,052,789 (one million fifty two thousand and seven hundred eighty nine) genomes.

Sample HTML tables are displayed in each section with a link to the GenBank record for user perusal.

What is so convincing about this ~(32/35/25/6) genetic constant is that it is found in so many different plants, animals, fish, humans, and microbes from so many different scientists, and from so many different countries.

In practical terms, one can detect collection, processing, and/or storage complications to the degree that the ACGT atom percentages vary from the ~(32.2034/35.5932/25.4237/6.7797) constant by a significant amount.

The next post in this series is here, the previous post in this series is here.



GC-8 Appendix LOW

This is an appendix to: On The Origin Of A Genetic Constant - 8


GenBank Genome Files
Low Variations
Analysis Report


after processing 100,000 genomes:

Sample genome HTML tables
showing LOW variation
summaries for this section:



GenBank Genome
Table: 1
(v= <1 [0.0210%])
URL Link: LR862133.1

Genome info for LR862133.1:

LR862133 18411140 bp DNA linear PLN 28-JUL-2020 Ananas comosus var. bracteatus genome assembly
chromosome: 5.

Nucleotide count: 18,405,240
Non-Nucleotide Count: 5,900

Atom Atom Count Percent
Carbon 88,541,178 32.4808
Hydrogen 97,752,075 35.8597
Nitrogen 67,878,531 24.9008
Oxygen 18,424,038 6.7587
Totals 272,595,822 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 100,000

variation count:
@ <2.0% = 0

variation count:
@ <3.0% = 0

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



after processing 200,000 genomes:

Sample genome HTML tables
showing LOW variation
summaries for this section:



GenBank Genome
Table: 100001
(v= <1 [0.0793%])
URL Link: OD565703.1

Genome info for OD565703.1:

OD565703 160245 bp DNA linear INV 11-DEC-2020 4 Tbi b3v08.

Nucleotide count: 154,008
Non-Nucleotide Count: 6,237

Atom Atom Count Percent
Carbon 743,062 32.5406
Hydrogen 819,390 35.8832
Nitrogen 568,034 24.8757
Oxygen 153,003 6.7004
Totals 2,283,489 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 100,000

variation count:
@ <2.0% = 0

variation count:
@ <3.0% = 0

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



after processing 300,000 genomes:

Sample genome HTML tables
showing LOW variation
summaries for this section:



GenBank Genome
Table: 200001
(v= <1 [0.0347%])
URL Link: EA695480.1

Genome info for EA695480.1:

EA695480 14337 bp DNA linear PAT 10-JUN-2008 Sequence 74359 from patent US 7365185.

Nucleotide count: 14,337

Atom Atom Count Percent
Carbon 68,750 32.3906
Hydrogen 75,474 35.5585
Nitrogen 54,448 25.6524
Oxygen 13,581 6.3985
Totals 212,253 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 100,000

variation count:
@ <2.0% = 0

variation count:
@ <3.0% = 0

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



after processing 400,000 genomes:

Sample genome HTML tables
showing LOW variation
summaries for this section:



GenBank Genome
Table: 300001
(v= <1 [0.0991%])
URL Link: MX519141.1

Genome info for MX519141.1:

MX519141 11326 bp DNA linear PAT 28-JUN-2021 Sequence 2781 from patent US 10961585.

Nucleotide count: 11,326

Atom Atom Count Percent
Carbon 53,376 32.0309
Hydrogen 59,147 35.4941
Nitrogen 42,571 25.5468
Oxygen 11,545 6.9282
Totals 166,639 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 100,000

variation count:
@ <2.0% = 0

variation count:
@ <3.0% = 0

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



after processing 500,000 genomes:

Sample genome HTML tables
showing LOW variation
summaries for this section:



GenBank Genome
Table: 400001
(v= <1 [0.0262%])
URL Link: MH064508.1

Genome info for MH064508.1:

MH064508 117444 bp DNA circular PLN 08-JUN-2019 Radula japonica voucher 16-8723 chloroplast complete genome.

Nucleotide count: 117,444

Atom Atom Count Percent
Carbon 569,471 32.6497
Hydrogen 628,230 36.0185
Nitrogen 428,692 24.5783
Oxygen 117,794 6.7535
Totals 1,744,187 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 100,000

variation count:
@ <2.0% = 0

variation count:
@ <3.0% = 0

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



after processing 600,000 genomes:

Sample genome HTML tables
showing LOW variation
summaries for this section:



GenBank Genome
Table: 500001
(v= <1 [0.0214%])
URL Link: AC133204.3

Genome info for AC133204.3:

AC133204 224181 bp DNA linear ROD 25-NOV-2003 Mus musculus BAC clone RP23-106N3 from chromosome 5 complete

Nucleotide count: 224,181

Atom Atom Count Percent
Carbon 1,071,062 32.3550
Hydrogen 1,178,966 35.6146
Nitrogen 847,036 25.5875
Oxygen 213,284 6.4429
Totals 3,310,348 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 100,000

variation count:
@ <2.0% = 0

variation count:
@ <3.0% = 0

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



after processing 700,000 genomes:

Sample genome HTML tables
showing LOW variation
summaries for this section:



GenBank Genome
Table: 600001
(v= <1 [0.0841%])
URL Link: JX684087.1

Genome info for JX684087.1:

JX684087 41468 bp DNA linear ENV 26-APR-2013 Uncultured organism clone FLSS-21 genomic sequence.

Nucleotide count: 41,468

Atom Atom Count Percent
Carbon 195,141 32.0406
Hydrogen 215,920 35.4524
Nitrogen 157,202 25.8114
Oxygen 40,779 6.6956
Totals 609,042 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 100,000

variation count:
@ <2.0% = 0

variation count:
@ <3.0% = 0

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



after processing 800,000 genomes:

Sample genome HTML tables
showing LOW variation
summaries for this section:



GenBank Genome
Table: 700001
(v= <1 [0.0457%])
URL Link: CP014211.1

Genome info for CP014211.1:

CP014211 4106512 bp DNA circular BCT 17-DEC-2019 Bordetella pertussis strain J377 chromosome complete genome.

Nucleotide count: 4,106,512

Atom Atom Count Percent
Carbon 19,140,791 31.7939
Hydrogen 21,196,927 35.2093
Nitrogen 15,755,921 26.1715
Oxygen 4,109,066 6.8254
Totals 60,202,705 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 100,000

variation count:
@ <2.0% = 0

variation count:
@ <3.0% = 0

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



after processing 900,000 genomes:

Sample genome HTML tables
showing LOW variation
summaries for this section:



GenBank Genome
Table: 800001
(v= <1 [0.0426%])
URL Link: CP057830.1

Genome info for CP057830.1:

CP057830 61278 bp DNA circular BCT 13-MAY-2022 Escherichia coli strain RHB13-C18 plasmid pRHB13-C18 2 complete

Nucleotide count: 61,278

Atom Atom Count Percent
Carbon 291,362 32.2487
Hydrogen 322,125 35.6536
Nitrogen 229,129 25.3606
Oxygen 60,869 6.7371
Totals 903,485 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 100,000

variation count:
@ <2.0% = 0

variation count:
@ <3.0% = 0

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



after processing 1,000,000 genomes:

Sample genome HTML tables
showing LOW variation
summaries for this section:



GenBank Genome
Table: 900001
(v= <1 [0.0212%])
URL Link: AC016793.3

Genome info for AC016793.3:

AC016793 96138 bp DNA linear HTG 15-DEC-1999 Drosophila melanogaster strain y chromosome 2 clone BACR33K22

Nucleotide count: 91,171
Non-Nucleotide Count: 4,967

Atom Atom Count Percent
Carbon 435,426 32.3266
Hydrogen 481,317 35.7336
Nitrogen 338,611 25.1389
Oxygen 91,606 6.8009
Totals 1,346,960 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 100,000

variation count:
@ <2.0% = 0

variation count:
@ <3.0% = 0

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



Total processed 1,052,789 genomes:


Variation totals for all sections:

variation count:
@ <1.0% = 1,052,789
(100.0000%)

variation count:
@ <2.0% = 0
(0.0000%)

variation count:
@ <3.0% = 0
(0.0000%)

variation count:
@ <4.0% = 0
(0.0000%)

variation count:
@ >=4.0% = 0
(0.0000%)




GC-8 Appendix HIGH

This is an appendix to: On The Origin Of A Genetic Constant - 8


GenBank Genome Files
High Variations
Analysis Report


after processing 100,000 genomes:

Sample genome HTML tables
showing HIGH variation
summaries for this section:



GenBank Genome
Table: 1
(v= <1 [0.5229%])
URL Link: LR862133.1

Genome info for LR862133.1:

LR862133 18411140 bp DNA linear PLN 28-JUL-2020 Ananas comosus var. bracteatus genome assembly
chromosome: 5.

Nucleotide count: 18,405,240
Non-Nucleotide Count: 5,900

Atom Atom Count Percent
Carbon 88,541,178 32.4808
Hydrogen 97,752,075 35.8597
Nitrogen 67,878,531 24.9008
Oxygen 18,424,038 6.7587
Totals 272,595,822 ~100.00





GenBank Genome
Table: 167
(v= >1 <2 [1.0428%])
URL Link: CU302273.5

Genome info for CU302273.5:

CU302273 185664 bp DNA linear MAM 02-NOV-2010 Pig DNA sequence from clone CH242-384E8 on chromosome X
complete

Nucleotide count: 185,664

Atom Atom Count Percent
Carbon 894,700 32.5329
Hydrogen 991,844 36.0653
Nitrogen 670,508 24.3809
Oxygen 193,085 7.0209
Totals 2,750,137 ~100.00





GenBank Genome
Table: 6234
(v= >2 <3 [2.1571%])
URL Link: LR905064.1

Genome info for LR905064.1:

LR905064 14426 bp DNA linear INV 24-NOV-2020 Darwinula stevensoni.

Nucleotide count: 14,426

Atom Atom Count Percent
Carbon 69,819 32.7286
Hydrogen 78,088 36.6048
Nitrogen 49,634 23.2666
Oxygen 15,786 7.3999
Totals 213,327 ~100.00





GenBank Genome
Table: 57068
(v= >3 <4 [3.0583%])
URL Link: MN883538.1

Genome info for MN883538.1:

MN883538 15792 bp DNA circular INV 29-MAR-2020 Florometra sp. BMK-2020 mitochondrion
complete genome.

Nucleotide count: 15,792

Atom Atom Count Percent
Carbon 77,125 32.7125
Hydrogen 86,480 36.6804
Nitrogen 52,730 22.3654
Oxygen 19,431 8.2416
Totals 235,766 ~100.00





GenBank Genome
Table: 92511
(v >=4 [4.6493%])
URL Link: MK361035.1

Genome info for MK361035.1:

MK361035 10487 bp DNA circular INV 25-NOV-2019 Beroe cucumis mitochondrion
complete genome.

Nucleotide count: 10,487

Atom Atom Count Percent
Carbon 51,400 32.9194
Hydrogen 58,411 37.4096
Nitrogen 32,437 20.7744
Oxygen 13,891 8.8966
Totals 156,139 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 94,709

variation count:
@ <2.0% = 4,853

variation count:
@ <3.0% = 341

variation count:
@ <4.0% = 96

variation count:
@ >=4.0% = 1



after processing 200,000 genomes:

Sample genome HTML tables
showing HIGH variation
summaries for this section:



GenBank Genome
Table: 100001
(v= <1 [0.5480%])
URL Link: OD565703.1

Genome info for OD565703.1:

OD565703 160245 bp DNA linear INV 11-DEC-2020 4 Tbi b3v08.

Nucleotide count: 154,008
Non-Nucleotide Count: 6,237

Atom Atom Count Percent
Carbon 743,062 32.5406
Hydrogen 819,390 35.8832
Nitrogen 568,034 24.8757
Oxygen 153,003 6.7004
Totals 2,283,489 ~100.00





GenBank Genome
Table: 100801
(v= >1 <2 [1.0340%])
URL Link: OD566503.1

Genome info for OD566503.1:

OD566503 117171 bp DNA linear INV 11-DEC-2020 4 Tbi b3v08.

Nucleotide count: 117,079
Non-Nucleotide Count: 92

Atom Atom Count Percent
Carbon 565,770 32.5943
Hydrogen 626,325 36.0829
Nitrogen 423,355 24.3897
Oxygen 120,345 6.9331
Totals 1,735,795 ~100.00





GenBank Genome
Table: 103116
(v= >2 <3 [2.2883%])
URL Link: OC917970.1

Genome info for OC917970.1:

OC917970 11508 bp DNA linear INV 25-JAN-2021 Oppiella nova.

Nucleotide count: 11,508

Atom Atom Count Percent
Carbon 55,876 32.6699
Hydrogen 62,421 36.4967
Nitrogen 39,569 23.1354
Oxygen 13,166 7.6980
Totals 171,032 ~100.00





GenBank Genome
Table: 105095
(v= >3 <4 [3.5560%])
URL Link: MZ359281.1

Genome info for MZ359281.1:

MZ359281 20974 bp DNA circular INV 15-SEP-2021 Meloidogyne exigua isolate Mex1 mitochondrion
complete genome.

Nucleotide count: 20,946
Non-Nucleotide Count: 28

Atom Atom Count Percent
Carbon 103,843 33.0443
Hydrogen 116,142 36.9580
Nitrogen 68,720 21.8677
Oxygen 25,549 8.1300
Totals 314,254 ~100.00





GenBank Genome
Table: 105813
(v >=4 [4.4645%])
URL Link: MZ636522.1

Genome info for MZ636522.1:

MZ636522 13647 bp DNA circular INV 31-AUG-2022 Dipetalonema gracile mitochondrion
complete genome.

Nucleotide count: 13,647

Atom Atom Count Percent
Carbon 67,325 32.7518
Hydrogen 76,012 36.9778
Nitrogen 43,084 20.9592
Oxygen 19,140 9.3111
Totals 205,561 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 89,207

variation count:
@ <2.0% = 7,575

variation count:
@ <3.0% = 3,117

variation count:
@ <4.0% = 93

variation count:
@ >=4.0% = 8



after processing 300,000 genomes:

Sample genome HTML tables
showing HIGH variation
summaries for this section:



GenBank Genome
Table: 200001
(v= <1 [0.3812%])
URL Link: EA695480.1

Genome info for EA695480.1:

EA695480 14337 bp DNA linear PAT 10-JUN-2008 Sequence 74359 from patent US 7365185.

Nucleotide count: 14,337

Atom Atom Count Percent
Carbon 68,750 32.3906
Hydrogen 75,474 35.5585
Nitrogen 54,448 25.6524
Oxygen 13,581 6.3985
Totals 212,253 ~100.00





GenBank Genome
Table: 200397
(v= >1 <2 [1.0746%])
URL Link: EA696781.1

Genome info for EA696781.1:

EA696781 11168 bp DNA linear PAT 10-JUN-2008 Sequence 75660 from patent US 7365185.

Nucleotide count: 11,168

Atom Atom Count Percent
Carbon 53,375 32.1786
Hydrogen 58,159 35.0628
Nitrogen 43,953 26.4983
Oxygen 10,384 6.2603
Totals 165,871 ~100.00





GenBank Genome
Table: 202972
(v= >2 <3 [2.1346%])
URL Link: EA740987.1

Genome info for EA740987.1:

EA740987 10907 bp DNA linear PAT 10-JUN-2008 Sequence 12433 from patent US 7368527.

Nucleotide count: 10,907

Atom Atom Count Percent
Carbon 52,319 32.3758
Hydrogen 58,691 36.3189
Nitrogen 37,635 23.2891
Oxygen 12,954 8.0161
Totals 161,599 ~100.00





GenBank Genome
Table: 203379
(v= >3 <4 [3.1033%])
URL Link: EA771733.1

Genome info for EA771733.1:

EA771733 18997 bp DNA linear PAT 10-JUN-2008 Sequence 17 from patent US 7381808.

Nucleotide count: 18,997

Atom Atom Count Percent
Carbon 94,686 32.8346
Hydrogen 104,992 36.4084
Nitrogen 64,366 22.3204
Oxygen 24,329 8.4366
Totals 288,373 ~100.00





GenBank Genome
Table: 235994
(v >=4 [4.1669%])
URL Link: FV536683.1

Genome info for FV536683.1:

FV536683 10359 bp DNA linear PAT 18-MAR-2010 Modified Microbial Nucleic Acid.

Nucleotide count: 10,359

Atom Atom Count Percent
Carbon 51,795 32.8822
Hydrogen 57,899 36.7573
Nitrogen 33,483 21.2568
Oxygen 14,340 9.1038
Totals 157,517 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 92,033

variation count:
@ <2.0% = 6,988

variation count:
@ <3.0% = 925

variation count:
@ <4.0% = 50

variation count:
@ >=4.0% = 4



after processing 400,000 genomes:

Sample genome HTML tables
showing HIGH variation
summaries for this section:



GenBank Genome
Table: 300001
(v= <1 [0.1725%])
URL Link: MX519141.1

Genome info for MX519141.1:

MX519141 11326 bp DNA linear PAT 28-JUN-2021 Sequence 2781 from patent US 10961585.

Nucleotide count: 11,326

Atom Atom Count Percent
Carbon 53,376 32.0309
Hydrogen 59,147 35.4941
Nitrogen 42,571 25.5468
Oxygen 11,545 6.9282
Totals 166,639 ~100.00





GenBank Genome
Table: 300024
(v= >1 <2 [1.1090%])
URL Link: MX519177.1

Genome info for MX519177.1:

MX519177 38856 bp DNA linear PAT 28-JUN-2021 Sequence 2817 from patent US 10961585.

Nucleotide count: 38,856

Atom Atom Count Percent
Carbon 187,629 32.5649
Hydrogen 207,908 36.0845
Nitrogen 140,094 24.3147
Oxygen 40,539 7.0359
Totals 576,170 ~100.00





GenBank Genome
Table: 301529
(v= >2 <3 [2.5213%])
URL Link: MI279662.1

Genome info for MI279662.1:

MI279662 11127 bp DNA linear PAT 12-FEB-2018 Sequence 21784 from patent US 9827332.

Nucleotide count: 11,127

Atom Atom Count Percent
Carbon 52,077 31.7514
Hydrogen 56,530 34.4664
Nitrogen 45,834 27.9450
Oxygen 9,574 5.8373
Totals 164,015 ~100.00





GenBank Genome
Table: 301748
(v= >3 <4 [3.9013%])
URL Link: MI380907.1

Genome info for MI380907.1:

MI380907 38032 bp DNA linear PAT 12-APR-2018 Sequence 170 from patent US 9862943.

Nucleotide count: 38,032

Atom Atom Count Percent
Carbon 181,944 32.5425
Hydrogen 207,959 37.1956
Nitrogen 120,331 21.5224
Oxygen 48,862 8.7395
Totals 559,096 ~100.00





GenBank Genome
Table: 317731
(v >=4 [6.2456%])
URL Link: MQ014117.1

Genome info for MQ014117.1:

MQ014117 22500 bp DNA linear PAT 18-MAY-2021 Sequence 23 from Patent EP3717009.

Nucleotide count: 22,500

Atom Atom Count Percent
Carbon 108,000 32.8767
Hydrogen 126,000 38.3562
Nitrogen 63,000 19.1781
Oxygen 31,500 9.5890
Totals 328,500 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 93,795

variation count:
@ <2.0% = 5,616

variation count:
@ <3.0% = 342

variation count:
@ <4.0% = 245

variation count:
@ >=4.0% = 2



after processing 500,000 genomes:

Sample genome HTML tables
showing HIGH variation
summaries for this section:



GenBank Genome
Table: 400001
(v= <1 [0.8454%])
URL Link: MH064508.1

Genome info for MH064508.1:

MH064508 117444 bp DNA circular PLN 08-JUN-2019 Radula japonica voucher 16-8723 chloroplast complete genome.

Nucleotide count: 117,444

Atom Atom Count Percent
Carbon 569,471 32.6497
Hydrogen 628,230 36.0185
Nitrogen 428,692 24.5783
Oxygen 117,794 6.7535
Totals 1,744,187 ~100.00





GenBank Genome
Table: 400014
(v= >1 <2 [1.1327%])
URL Link: MH085931.1

Genome info for MH085931.1:

MH085931 15361 bp DNA linear PLN 18-AUG-2018 Helianthus annuus V-type proton ATPase subunit a3 gene complete

Nucleotide count: 15,361

Atom Atom Count Percent
Carbon 74,272 32.5827
Hydrogen 82,261 36.0875
Nitrogen 55,371 24.2910
Oxygen 16,045 7.0389
Totals 227,949 ~100.00





GenBank Genome
Table: 403520
(v= >2 <3 [2.2745%])
URL Link: MZ748319.1

Genome info for MZ748319.1:

MZ748319 19912 bp DNA circular PLN 23-MAR-2022 Galdieria sp. strain ACUF 613 mitochondrion complete genome.

Nucleotide count: 19,912

Atom Atom Count Percent
Carbon 98,326 32.3620
Hydrogen 103,872 34.1873
Nitrogen 84,156 27.6982
Oxygen 17,478 5.7525
Totals 303,832 ~100.00





GenBank Genome
Table: 420094
(v >=4 [4.0557%])
URL Link: MN366148.1

Genome info for MN366148.1:

MN366148 10410 bp DNA linear PLN 10-FEB-2021 UNVERIFIED: Phalaenopsis aphrodite subsp. formosana clone Contig 17

Nucleotide count: 10,410

Atom Atom Count Percent
Carbon 48,410 32.3419
Hydrogen 56,312 37.6211
Nitrogen 31,984 21.3680
Oxygen 12,976 8.6690
Totals 149,682 ~100.00





GenBank Genome
Table: 453921
(v= >3 <4 [3.4655%])
URL Link: AC068123.5

Genome info for AC068123.5:

AC068123 98295 bp DNA linear PRI 09-MAY-2001 Homo sapiens BAC clone RP11-242E13 from Y complete sequence.

Nucleotide count: 98,295

Atom Atom Count Percent
Carbon 461,510 32.4371
Hydrogen 531,184 37.3341
Nitrogen 312,418 21.9582
Oxygen 117,672 8.2705
Totals 1,422,784 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 97,681

variation count:
@ <2.0% = 2,209

variation count:
@ <3.0% = 94

variation count:
@ <4.0% = 15

variation count:
@ >=4.0% = 1



after processing 600,000 genomes:

Sample genome HTML tables
showing HIGH variation
summaries for this section:



GenBank Genome
Table: 500001
(v= <1 [0.3368%])
URL Link: AC133204.3

Genome info for AC133204.3:

AC133204 224181 bp DNA linear ROD 25-NOV-2003 Mus musculus BAC clone RP23-106N3 from chromosome 5 complete

Nucleotide count: 224,181

Atom Atom Count Percent
Carbon 1,071,062 32.3550
Hydrogen 1,178,966 35.6146
Nitrogen 847,036 25.5875
Oxygen 213,284 6.4429
Totals 3,310,348 ~100.00





GenBank Genome
Table: 500405
(v= >1 <2 [1.0039%])
URL Link: AC135082.4

Genome info for AC135082.4:

AC135082 183170 bp DNA linear ROD 27-NOV-2003 Mus musculus BAC clone RP24-222O22 from chromosome 7 complete

Nucleotide count: 183,170

Atom Atom Count Percent
Carbon 879,496 32.4539
Hydrogen 976,306 36.0263
Nitrogen 661,774 24.4198
Oxygen 192,409 7.1000
Totals 2,709,985 ~100.00





GenBank Genome
Table: 516503
(v >=4 [4.5079%])
URL Link: MF597730.1

Genome info for MF597730.1:

MF597730 10830 bp DNA linear ROD 04-AUG-2017 Mus musculus chromosome X WGS contig MG104 gap closure genomic

Nucleotide count: 10,830

Atom Atom Count Percent
Carbon 52,672 31.9918
Hydrogen 54,788 33.2770
Nitrogen 49,280 29.9316
Oxygen 7,902 4.7995
Totals 164,642 ~100.00





GenBank Genome
Table: 516505
(v= >3 <4 [3.9959%])
URL Link: MF597734.1

Genome info for MF597734.1:

MF597734 13294 bp DNA linear ROD 04-AUG-2017 Mus musculus chromosome X WGS contig MG3683 gap closure genomic

Nucleotide count: 13,294

Atom Atom Count Percent
Carbon 65,471 32.3258
Hydrogen 68,099 33.6233
Nitrogen 59,585 29.4196
Oxygen 9,380 4.6313
Totals 202,535 ~100.00





GenBank Genome
Table: 540951
(v= >2 <3 [2.0659%])
URL Link: KT329717.1

Genome info for KT329717.1:

KT329717 11756 bp DNA linear PRI 10-JAN-2016 Macaca mulatta isolate Rh23717 6148-2 major histocompatibility

Nucleotide count: 11,756

Atom Atom Count Percent
Carbon 56,318 32.4110
Hydrogen 63,203 36.3733
Nitrogen 40,587 23.3578
Oxygen 13,654 7.8579
Totals 173,762 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 97,226

variation count:
@ <2.0% = 2,733

variation count:
@ <3.0% = 38

variation count:
@ <4.0% = 1

variation count:
@ >=4.0% = 2



after processing 700,000 genomes:

Sample genome HTML tables
showing HIGH variation
summaries for this section:



GenBank Genome
Table: 600001
(v= <1 [0.3877%])
URL Link: JX684087.1

Genome info for JX684087.1:

JX684087 41468 bp DNA linear ENV 26-APR-2013 Uncultured organism clone FLSS-21 genomic sequence.

Nucleotide count: 41,468

Atom Atom Count Percent
Carbon 195,141 32.0406
Hydrogen 215,920 35.4524
Nitrogen 157,202 25.8114
Oxygen 40,779 6.6956
Totals 609,042 ~100.00





GenBank Genome
Table: 600003
(v= >1 <2 [1.0473%])
URL Link: JX684089.1

Genome info for JX684089.1:

JX684089 34111 bp DNA linear ENV 26-APR-2013 Uncultured organism clone FLSS-23 genomic sequence.

Nucleotide count: 34,111

Atom Atom Count Percent
Carbon 164,141 32.5238
Hydrogen 182,123 36.0868
Nitrogen 123,023 24.3764
Oxygen 35,393 7.0130
Totals 504,680 ~100.00





GenBank Genome
Table: 600781
(v= >2 <3 [2.2021%])
URL Link: CP021571.1

Genome info for CP021571.1:

CP021571 11301 bp DNA circular ENV 08-JAN-2019 Unidentified plasmid plasmid MO1-2 2829 complete sequence.

Nucleotide count: 11,301

Atom Atom Count Percent
Carbon 54,070 32.5637
Hydrogen 60,864 36.6553
Nitrogen 38,558 23.2216
Oxygen 12,552 7.5594
Totals 166,044 ~100.00





GenBank Genome
Table: 606825
(v= >3 <4 [3.1466%])
URL Link: MK230775.1

Genome info for MK230775.1:

MK230775 11172 bp DNA linear ENV 31-JAN-2020 MAG: Viral metagenome isolate S10C.dsDNAvir NODE 200 sequence.

Nucleotide count: 11,172

Atom Atom Count Percent
Carbon 53,859 32.6588
Hydrogen 60,900 36.9283
Nitrogen 36,738 22.2771
Oxygen 13,417 8.1358
Totals 164,914 ~100.00





GenBank Genome
Table: 613004
(v >=4 [4.0198%])
URL Link: JQ316200.1

Genome info for JQ316200.1:

JQ316200 13641 bp DNA circular INV 24-MAR-2012 Wuchereria bancrofti mitochondrion complete genome.

Nucleotide count: 13,592
Non-Nucleotide Count: 49

Atom Atom Count Percent
Carbon 66,986 32.7725
Hydrogen 75,381 36.8797
Nitrogen 43,749 21.4039
Oxygen 18,281 8.9439
Totals 204,397 ~100.00
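
In this record the two counts add up to the stated length (13,592 + 49 = 13,641 bp). Under the per-base atom counts listed at the top of this post, the atom totals are consistent with the 49 non-nucleotide symbols contributing no atoms. Here is a minimal sketch of a tally that behaves this way. The function name `tally_atoms`, the toy sequence, and the choice to skip every symbol other than A, C, G, T are assumptions, not the author's code.

```python
# Atoms per base (carbon, hydrogen, nitrogen, oxygen), from the
# nucleotide list at the top of this post.
ATOMS = {
    "A": (5, 5, 5, 0),
    "C": (4, 5, 3, 1),
    "G": (5, 5, 5, 1),
    "T": (5, 6, 2, 2),
}

def tally_atoms(sequence):
    """Count atoms in a DNA string, skipping non-ACGT symbols."""
    totals = [0, 0, 0, 0]
    nucleotides = 0
    skipped = 0
    for symbol in sequence.upper():
        atoms = ATOMS.get(symbol)
        if atoms is None:
            skipped += 1          # e.g. 'N' placeholders
            continue
        nucleotides += 1
        for i, n in enumerate(atoms):
            totals[i] += n
    return nucleotides, skipped, dict(zip("CHNO", totals))

# Toy sequence (hypothetical), not taken from any GenBank record.
nucleotides, skipped, atoms = tally_atoms("ACGTNNACGT")
print(nucleotides, skipped, atoms)
```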

Variation Summary for this section:

variation count:
@ <1.0% = 83,232

variation count:
@ <2.0% = 15,014

variation count:
@ <3.0% = 1,522

variation count:
@ <4.0% = 197

variation count:
@ >=4.0% = 35



after processing 800,000 genomes:

Sample genome HTML tables
showing HIGH variation
summaries for this section:



GenBank Genome
Table: 700001
(v= <1 [0.7478%])
URL Link: CP014211.1

Genome info for CP014211.1:

CP014211 4106512 bp DNA circular BCT 17-DEC-2019 Bordetella pertussis strain J377 chromosome complete genome.

Nucleotide count: 4,106,512

Atom Atom Count Percent
Carbon 19,140,791 31.7939
Hydrogen 21,196,927 35.2093
Nitrogen 15,755,921 26.1715
Oxygen 4,109,066 6.8254
Totals 60,202,705 ~100.00





GenBank Genome
Table: 700008
(v= >1 <2 [1.0195%])
URL Link: CP014219.1

Genome info for CP014219.1:

CP014219 3959509 bp DNA circular BCT 30-APR-2020 Clostridium botulinum strain B515 chromosome complete genome.

Nucleotide count: 3,959,509

Atom Atom Count Percent
Carbon 19,232,417 32.7014
Hydrogen 21,235,757 36.1077
Nitrogen 14,352,653 24.4042
Oxygen 3,991,413 6.7867
Totals 58,812,240 ~100.00





GenBank Genome
Table: 700621
(v= >2 <3 [2.1696%])
URL Link: CP009334.1

Genome info for CP009334.1:

CP009334 349601 bp DNA circular BCT 26-JUL-2016 Bacillus thuringiensis strain HD1011 plasmid 2 complete sequence.

Nucleotide count: 349,601

Atom Atom Count Percent
Carbon 1,677,817 32.5787
Hydrogen 1,884,682 36.5954
Nitrogen 1,197,598 23.2541
Oxygen 389,952 7.5718
Totals 5,150,049 ~100.00





GenBank Genome
Table: 726833
(v= >3 <4 [3.1798%])
URL Link: FQ976896.2

Genome info for FQ976896.2:

FQ976896 40128 bp DNA linear HTG 15-JUL-2012 Homo sapiens chromosome 21 clone WI2-1537B11 *** SEQUENCING IN

Nucleotide count: 40,128

Atom Atom Count Percent
Carbon 187,936 32.4143
Hydrogen 216,061 37.2651
Nitrogen 128,969 22.2439
Oxygen 46,828 8.0767
Totals 579,794 ~100.00





GenBank Genome
Table: 786399
(v >=4 [8.6816%])
URL Link: CP045289.2

Genome info for CP045289.2:

CP045289 25142 bp DNA circular BCT 06-MAR-2020 Curtobacterium flaccumfaciens pv. flaccumfaciens strain P990

Nucleotide count: 25,142

Atom Atom Count Percent
Carbon 118,003 32.5860
Hydrogen 142,266 39.2861
Nitrogen 60,628 16.7421
Oxygen 41,231 11.3858
Totals 362,128 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 97,195

variation count:
@ <2.0% = 2,719

variation count:
@ <3.0% = 79

variation count:
@ <4.0% = 6

variation count:
@ >=4.0% = 1



after processing 900,000 genomes:

Sample genome HTML tables
showing HIGH variation
summaries for this section:



GenBank Genome
Table: 800001
(v= <1 [0.0631%])
URL Link: CP057830.1

Genome info for CP057830.1:

CP057830 61278 bp DNA circular BCT 13-MAY-2022 Escherichia coli strain RHB13-C18 plasmid pRHB13-C18 2 complete

Nucleotide count: 61,278

Atom Atom Count Percent
Carbon 291,362 32.2487
Hydrogen 322,125 35.6536
Nitrogen 229,129 25.3606
Oxygen 60,869 6.7371
Totals 903,485 ~100.00





GenBank Genome
Table: 800189
(v= >1 <2 [1.3256%])
URL Link: CP066236.1

Genome info for CP066236.1:

CP066236 73361 bp DNA circular BCT 24-DEC-2020 Acinetobacter baumannii strain G20AB009 plasmid pG20AB009-1

Nucleotide count: 73,361

Atom Atom Count Percent
Carbon 353,317 32.5786
Hydrogen 392,966 36.2345
Nitrogen 261,346 24.0981
Oxygen 76,879 7.0888
Totals 1,084,508 ~100.00





GenBank Genome
Table: 800233
(v= >2 <3 [2.0633%])
URL Link: CP051757.1

Genome info for CP051757.1:

CP051757 20671 bp DNA circular BCT 01-DEC-2020 Sarcina sp. JB2 plasmid p3 complete sequence.

Nucleotide count: 20,671

Atom Atom Count Percent
Carbon 100,807 32.8231
Hydrogen 112,193 36.5304
Nitrogen 71,745 23.3604
Oxygen 22,377 7.2860
Totals 307,122 ~100.00





GenBank Genome
Table: 819779
(v >=4 [4.6303%])
URL Link: CP095532.1

Genome info for CP095532.1:

CP095532 14577 bp DNA linear BCT 05-DEC-2022 Escherichia coli strain GN06156 plasmid unnamed1.

Nucleotide count: 14,577

Atom Atom Count Percent
Carbon 71,294 32.5927
Hydrogen 80,958 37.0107
Nitrogen 45,484 20.7934
Oxygen 21,006 9.6031
Totals 218,742 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 95,300

variation count:
@ <2.0% = 4,525

variation count:
@ <3.0% = 174

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 1



after processing 1,000,000 genomes:

Sample genome HTML tables
showing HIGH variation
summaries for this section:



GenBank Genome
Table: 900001
(v= <1 [0.2848%])
URL Link: AC016793.3

Genome info for AC016793.3:

AC016793 96138 bp DNA linear HTG 15-DEC-1999 Drosophila melanogaster strain y chromosome 2 clone BACR33K22

Nucleotide count: 91,171
Non-Nucleotide Count: 4,967

Atom Atom Count Percent
Carbon 435,426 32.3266
Hydrogen 481,317 35.7336
Nitrogen 338,611 25.1389
Oxygen 91,606 6.8009
Totals 1,346,960 ~100.00





GenBank Genome
Table: 900583
(v= >1 <2 [1.1079%])
URL Link: AC017850.1

Genome info for AC017850.1:

AC017850 10855 bp DNA linear HTG 09-DEC-1999 Drosophila melanogaster *** SEQUENCING IN PROGRESS ***.

Nucleotide count: 10,855

Atom Atom Count Percent
Carbon 52,185 32.4507
Hydrogen 57,939 36.0288
Nitrogen 39,103 24.3158
Oxygen 11,586 7.2046
Totals 160,813 ~100.00





GenBank Genome
Table: 901169
(v= >2 <3 [2.0179%])
URL Link: AC019006.3

Genome info for AC019006.3:

AC019006 149651 bp DNA linear HTG 16-MAR-2000 Homo sapiens chromosome 8 clone RP11-458N4 map 8 WORKING DRAFT

Nucleotide count: 148,150
Non-Nucleotide Count: 1,501

Atom Atom Count Percent
Carbon 724,599 32.4130
Hydrogen 772,412 34.5518
Nitrogen 613,462 27.4416
Oxygen 125,045 5.5936
Totals 2,235,518 ~100.00





GenBank Genome
Table: 909393
(v= >3 <4 [3.4764%])
URL Link: AC148092.3

Genome info for AC148092.3:

AC148092 63169 bp DNA linear HTG 29-JUL-2004 Mus musculus chromosome 17 clone RP24-300D15 map 17 *** SEQUENCING

Nucleotide count: 62,469
Non-Nucleotide Count: 700

Atom Atom Count Percent
Carbon 298,096 32.5091
Hydrogen 339,878 37.0657
Nitrogen 201,248 21.9473
Oxygen 77,739 8.4779
Totals 916,961 ~100.00





GenBank Genome
Table: 918032
(v >=4 [4.1558%])
URL Link: AC027353.4

Genome info for AC027353.4:

AC027353 101509 bp DNA linear HTG 26-SEP-2000 Homo sapiens chromosome 16 clone RP11-167D21 WORKING DRAFT

Nucleotide count: 71,328
Non-Nucleotide Count: 30,181

Atom Atom Count Percent
Carbon 349,107 32.2159
Hydrogen 363,652 33.5581
Nitrogen 320,538 29.5795
Oxygen 50,351 4.6464
Totals 1,083,648 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 98,836

variation count:
@ <2.0% = 1,090

variation count:
@ <3.0% = 54

variation count:
@ <4.0% = 16

variation count:
@ >=4.0% = 4



Total processed 1,052,789 genomes:


Variation totals for all sections:

variation count:
@ <1.0% = 990,538
(94.0870%)

variation count:
@ <2.0% = 54,687
(5.1945%)

variation count:
@ <3.0% = 6,763
(0.6424%)

variation count:
@ <4.0% = 740
(0.0703%)

variation count:
@ >=4.0% = 61
(0.0058%)
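
The parenthesised percentages are each bucket's share of the 1,052,789 genomes processed. Here is a short sketch that recomputes them from the counts above (variable names are illustrative):

```python
# Variation totals for all sections (HIGH report), from the list above.
totals = {
    "<1.0%": 990_538,
    "<2.0%": 54_687,
    "<3.0%": 6_763,
    "<4.0%": 740,
    ">=4.0%": 61,
}

processed = sum(totals.values())  # should equal the stated 1,052,789
shares = {label: 100.0 * n / processed for label, n in totals.items()}

for label, n in totals.items():
    print(f"@ {label:<7} = {n:>9,}  ({shares[label]:.4f}%)")
print(f"total processed: {processed:,}")
```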




GC-8 Appendix AVG

This is an appendix to: On The Origin Of A Genetic Constant - 8


GenBank Genome Files
Average Variations
Analysis Report


after processing 100,000 genomes:

Sample genome HTML tables
showing AVG variation
summaries for this section:



GenBank Genome
Table: 1
(v= <1 [0.2719%])
URL Link: LR862133.1

Genome info for LR862133.1:

LR862133 18411140 bp DNA linear PLN 28-JUL-2020 Ananas comosus var. bracteatus genome assembly chromosome: 5.

Nucleotide count: 18,405,240
Non-Nucleotide Count: 5,900

Atom Atom Count Percent
Carbon 88,541,178 32.4808
Hydrogen 97,752,075 35.8597
Nitrogen 67,878,531 24.9008
Oxygen 18,424,038 6.7587
Totals 272,595,822 ~100.00





GenBank Genome
Table: 6234
(v= >1 <2 [1.0785%])
URL Link: LR905064.1

Genome info for LR905064.1:

LR905064 14426 bp DNA linear INV 24-NOV-2020 Darwinula stevensoni.

Nucleotide count: 14,426

Atom Atom Count Percent
Carbon 69,819 32.7286
Hydrogen 78,088 36.6048
Nitrogen 49,634 23.2666
Oxygen 15,786 7.3999
Totals 213,327 ~100.00





GenBank Genome
Table: 92511
(v= >2 <3 [2.3246%])
URL Link: MK361035.1

Genome info for MK361035.1:

MK361035 10487 bp DNA circular INV 25-NOV-2019 Beroe cucumis mitochondrion complete genome.

Nucleotide count: 10,487

Atom Atom Count Percent
Carbon 51,400 32.9194
Hydrogen 58,411 37.4096
Nitrogen 32,437 20.7744
Oxygen 13,891 8.8966
Totals 156,139 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 99,560

variation count:
@ <2.0% = 439

variation count:
@ <3.0% = 1

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



after processing 200,000 genomes:

Sample genome HTML tables
showing AVG variation
summaries for this section:



GenBank Genome
Table: 100001
(v= <1 [0.3136%])
URL Link: OD565703.1

Genome info for OD565703.1:

OD565703 160245 bp DNA linear INV 11-DEC-2020 4 Tbi b3v08.

Nucleotide count: 154,008
Non-Nucleotide Count: 6,237

Atom Atom Count Percent
Carbon 743,062 32.5406
Hydrogen 819,390 35.8832
Nitrogen 568,034 24.8757
Oxygen 153,003 6.7004
Totals 2,283,489 ~100.00





GenBank Genome
Table: 103116
(v= >1 <2 [1.1441%])
URL Link: OC917970.1

Genome info for OC917970.1:

OC917970 11508 bp DNA linear INV 25-JAN-2021 Oppiella nova.

Nucleotide count: 11,508

Atom Atom Count Percent
Carbon 55,876 32.6699
Hydrogen 62,421 36.4967
Nitrogen 39,569 23.1354
Oxygen 13,166 7.6980
Totals 171,032 ~100.00





GenBank Genome
Table: 105813
(v= >2 <3 [2.2322%])
URL Link: MZ636522.1

Genome info for MZ636522.1:

MZ636522 13647 bp DNA circular INV 31-AUG-2022 Dipetalonema gracile mitochondrion complete genome.

Nucleotide count: 13,647

Atom Atom Count Percent
Carbon 67,325 32.7518
Hydrogen 76,012 36.9778
Nitrogen 43,084 20.9592
Oxygen 19,140 9.3111
Totals 205,561 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 96,781

variation count:
@ <2.0% = 3,211

variation count:
@ <3.0% = 8

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



after processing 300,000 genomes:

Sample genome HTML tables
showing AVG variation
summaries for this section:



GenBank Genome
Table: 200001
(v= <1 [0.2079%])
URL Link: EA695480.1

Genome info for EA695480.1:

EA695480 14337 bp DNA linear PAT 10-JUN-2008 Sequence 74359 from patent US 7365185.

Nucleotide count: 14,337

Atom Atom Count Percent
Carbon 68,750 32.3906
Hydrogen 75,474 35.5585
Nitrogen 54,448 25.6524
Oxygen 13,581 6.3985
Totals 212,253 ~100.00





GenBank Genome
Table: 202972
(v= >1 <2 [1.0673%])
URL Link: EA740987.1

Genome info for EA740987.1:

EA740987 10907 bp DNA linear PAT 10-JUN-2008 Sequence 12433 from patent US 7368527.

Nucleotide count: 10,907

Atom Atom Count Percent
Carbon 52,319 32.3758
Hydrogen 58,691 36.3189
Nitrogen 37,635 23.2891
Oxygen 12,954 8.0161
Totals 161,599 ~100.00





GenBank Genome
Table: 235994
(v= >2 <3 [2.0835%])
URL Link: FV536683.1

Genome info for FV536683.1:

FV536683 10359 bp DNA linear PAT 18-MAR-2010 Modified Microbial Nucleic Acid.

Nucleotide count: 10,359

Atom Atom Count Percent
Carbon 51,795 32.8822
Hydrogen 57,899 36.7573
Nitrogen 33,483 21.2568
Oxygen 14,340 9.1038
Totals 157,517 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 99,021

variation count:
@ <2.0% = 975

variation count:
@ <3.0% = 4

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



after processing 400,000 genomes:

Sample genome HTML tables
showing AVG variation
summaries for this section:



GenBank Genome
Table: 300001
(v= <1 [0.1358%])
URL Link: MX519141.1

Genome info for MX519141.1:

MX519141 11326 bp DNA linear PAT 28-JUN-2021 Sequence 2781 from patent US 10961585.

Nucleotide count: 11,326

Atom Atom Count Percent
Carbon 53,376 32.0309
Hydrogen 59,147 35.4941
Nitrogen 42,571 25.5468
Oxygen 11,545 6.9282
Totals 166,639 ~100.00





GenBank Genome
Table: 301529
(v= >1 <2 [1.2606%])
URL Link: MI279662.1

Genome info for MI279662.1:

MI279662 11127 bp DNA linear PAT 12-FEB-2018 Sequence 21784 from patent US 9827332.

Nucleotide count: 11,127

Atom Atom Count Percent
Carbon 52,077 31.7514
Hydrogen 56,530 34.4664
Nitrogen 45,834 27.9450
Oxygen 9,574 5.8373
Totals 164,015 ~100.00





GenBank Genome
Table: 317731
(v= >3 <4 [3.1228%])
URL Link: MQ014117.1

Genome info for MQ014117.1:

MQ014117 22500 bp DNA linear PAT 18-MAY-2021 Sequence 23 from Patent EP3717009.

Nucleotide count: 22,500

Atom Atom Count Percent
Carbon 108,000 32.8767
Hydrogen 126,000 38.3562
Nitrogen 63,000 19.1781
Oxygen 31,500 9.5890
Totals 328,500 ~100.00
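
The round numbers in this table invite a cross-check. If atoms are tallied per base as listed at the top of this post (A 5/5/5/0, C 4/5/3/1, G 5/5/5/1, T 5/6/2/2 for carbon/hydrogen/nitrogen/oxygen), then the four atom counts determine the four base counts. The sketch below does that back-calculation. It is offered as an assumption about how the tally works rather than as the author's method, and the function name is hypothetical.

```python
def bases_from_atoms(n, carbon, hydrogen, nitrogen, oxygen):
    """Recover (A, C, G, T) counts from atom counts for n nucleotides.

    Per-base atoms assumed (C, H, N, O):
    A=(5,5,5,0)  C=(4,5,3,1)  G=(5,5,5,1)  T=(5,6,2,2)
    """
    c = 5 * n - carbon          # only cytosine has 4 carbons, not 5
    t = hydrogen - 5 * n        # only thymine has 6 hydrogens, not 5
    g = oxygen - c - 2 * t      # oxygen: C and G carry 1, T carries 2
    a = n - c - g - t
    # Nitrogen was not used above, so it serves as an independent check.
    assert 5 * a + 3 * c + 5 * g + 2 * t == nitrogen, "inconsistent counts"
    return a, c, g, t

# Counts from the MQ014117.1 table above (22,500 nucleotides).
a, c, g, t = bases_from_atoms(22_500, 108_000, 126_000, 63_000, 31_500)
print(f"A={a:,} C={c:,} G={g:,} T={t:,}")
```

Under these assumptions the sequence would be 20% A, 20% C, 60% T and no G, which would account for a nitrogen share (19.1781%) well below the constant.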





GenBank Genome
Table: 374519
(v= >2 <3 [2.4758%])
URL Link: AP015624.1

Genome info for AP015624.1:

AP015624 19638 bp DNA linear PLN 03-DEC-2015 Vigna angularis var. angularis DNA scaffold: Scaffold 0580

Nucleotide count: 19,638

Atom Atom Count Percent
Carbon 94,239 31.7826
Hydrogen 98,264 33.1401
Nitrogen 90,066 30.3753
Oxygen 13,942 4.7020
Totals 296,511 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 99,401

variation count:
@ <2.0% = 595

variation count:
@ <3.0% = 3

variation count:
@ <4.0% = 1

variation count:
@ >=4.0% = 0



after processing 500,000 genomes:

Sample genome HTML tables
showing AVG variation
summaries for this section:



GenBank Genome
Table: 400001
(v= <1 [0.4358%])
URL Link: MH064508.1

Genome info for MH064508.1:

MH064508 117444 bp DNA circular PLN 08-JUN-2019 Radula japonica voucher 16-8723 chloroplast complete genome.

Nucleotide count: 117,444

Atom Atom Count Percent
Carbon 569,471 32.6497
Hydrogen 628,230 36.0185
Nitrogen 428,692 24.5783
Oxygen 117,794 6.7535
Totals 1,744,187 ~100.00





GenBank Genome
Table: 403520
(v= >1 <2 [1.2166%])
URL Link: MZ748319.1

Genome info for MZ748319.1:

MZ748319 19912 bp DNA circular PLN 23-MAR-2022 Galdieria sp. strain ACUF 613 mitochondrion complete genome.

Nucleotide count: 19,912

Atom Atom Count Percent
Carbon 98,326 32.3620
Hydrogen 103,872 34.1873
Nitrogen 84,156 27.6982
Oxygen 17,478 5.7525
Totals 303,832 ~100.00





GenBank Genome
Table: 420094
(v= >2 <3 [2.0278%])
URL Link: MN366148.1

Genome info for MN366148.1:

MN366148 10410 bp DNA linear PLN 10-FEB-2021 UNVERIFIED: Phalaenopsis aphrodite subsp. formosana clone Contig 17

Nucleotide count: 10,410

Atom Atom Count Percent
Carbon 48,410 32.3419
Hydrogen 56,312 37.6211
Nitrogen 31,984 21.3680
Oxygen 12,976 8.6690
Totals 149,682 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 99,890

variation count:
@ <2.0% = 109

variation count:
@ <3.0% = 1

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



after processing 600,000 genomes:

Sample genome HTML tables
showing AVG variation
summaries for this section:



GenBank Genome
Table: 500001
(v= <1 [0.1684%])
URL Link: AC133204.3

Genome info for AC133204.3:

AC133204 224181 bp DNA linear ROD 25-NOV-2003 Mus musculus BAC clone RP23-106N3 from chromosome 5 complete

Nucleotide count: 224,181

Atom Atom Count Percent
Carbon 1,071,062 32.3550
Hydrogen 1,178,966 35.6146
Nitrogen 847,036 25.5875
Oxygen 213,284 6.4429
Totals 3,310,348 ~100.00





GenBank Genome
Table: 516503
(v= >2 <3 [2.2540%])
URL Link: MF597730.1

Genome info for MF597730.1:

MF597730 10830 bp DNA linear ROD 04-AUG-2017 Mus musculus chromosome X WGS contig MG104 gap closure genomic

Nucleotide count: 10,830

Atom Atom Count Percent
Carbon 52,672 31.9918
Hydrogen 54,788 33.2770
Nitrogen 49,280 29.9316
Oxygen 7,902 4.7995
Totals 164,642 ~100.00





GenBank Genome
Table: 534038
(v= >1 <2 [1.0598%])
URL Link: AC140396.3

Genome info for AC140396.3:

AC140396 109575 bp DNA linear ROD 29-MAY-2004 Mus musculus BAC clone RP24-184N24 from chromosome Y complete

Nucleotide count: 109,575

Atom Atom Count Percent
Carbon 530,504 32.4881
Hydrogen 570,549 34.9405
Nitrogen 445,111 27.2586
Oxygen 86,754 5.3128
Totals 1,632,918 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 99,957

variation count:
@ <2.0% = 40

variation count:
@ <3.0% = 3

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



after processing 700,000 genomes:

Sample genome HTML tables
showing AVG variation
summaries for this section:



GenBank Genome
Table: 600001
(v= <1 [0.1939%])
URL Link: JX684087.1

Genome info for JX684087.1:

JX684087 41468 bp DNA linear ENV 26-APR-2013 Uncultured organism clone FLSS-21 genomic sequence.

Nucleotide count: 41,468

Atom Atom Count Percent
Carbon 195,141 32.0406
Hydrogen 215,920 35.4524
Nitrogen 157,202 25.8114
Oxygen 40,779 6.6956
Totals 609,042 ~100.00





GenBank Genome
Table: 600781
(v= >1 <2 [1.1010%])
URL Link: CP021571.1

Genome info for CP021571.1:

CP021571 11301 bp DNA circular ENV 08-JAN-2019 Unidentified plasmid plasmid MO1-2 2829 complete sequence.

Nucleotide count: 11,301

Atom Atom Count Percent
Carbon 54,070 32.5637
Hydrogen 60,864 36.6553
Nitrogen 38,558 23.2216
Oxygen 12,552 7.5594
Totals 166,044 ~100.00





GenBank Genome
Table: 613004
(v= >2 <3 [2.0099%])
URL Link: JQ316200.1

Genome info for JQ316200.1:

JQ316200 13641 bp DNA circular INV 24-MAR-2012 Wuchereria bancrofti mitochondrion complete genome.

Nucleotide count: 13,592
Non-Nucleotide Count: 49

Atom Atom Count Percent
Carbon 66,986 32.7725
Hydrogen 75,381 36.8797
Nitrogen 43,749 21.4039
Oxygen 18,281 8.9439
Totals 204,397 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 98,240

variation count:
@ <2.0% = 1,725

variation count:
@ <3.0% = 35

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



after processing 800,000 genomes:

Sample genome HTML tables
showing AVG variation
summaries for this section:



GenBank Genome
Table: 700001
(v= <1 [0.3967%])
URL Link: CP014211.1

Genome info for CP014211.1:

CP014211 4106512 bp DNA circular BCT 17-DEC-2019 Bordetella pertussis strain J377 chromosome complete genome.

Nucleotide count: 4,106,512

Atom Atom Count Percent
Carbon 19,140,791 31.7939
Hydrogen 21,196,927 35.2093
Nitrogen 15,755,921 26.1715
Oxygen 4,109,066 6.8254
Totals 60,202,705 ~100.00
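
For this genome the AVG report shows 0.3967%, where the HIGH report earlier showed 0.7478%. Both figures are consistent with taking the absolute deviation of each atom percentage from the DNA constant (32.2034 / 35.5932 / 25.4237 / 6.7797), then reporting the mean in this appendix and the maximum in the HIGH report. That definition is inferred from the numbers, not quoted from the source. A minimal sketch:

```python
from statistics import mean

# DNA genetic constant (percent), as given earlier in this post.
DNA_CONSTANT = (32.2034, 35.5932, 25.4237, 6.7797)   # C, H, N, O

# Atom counts for CP014211.1 (C, H, N, O), from the table above.
counts = (19_140_791, 21_196_927, 15_755_921, 4_109_066)

total = sum(counts)
percent = [100.0 * n / total for n in counts]
deviation = [abs(p - k) for p, k in zip(percent, DNA_CONSTANT)]

avg_variation = mean(deviation)    # assumed meaning of "AVG variation"
high_variation = max(deviation)    # assumed meaning of "HIGH variation"

print(f"AVG  {avg_variation:.4f}%")
print(f"HIGH {high_variation:.4f}%")
```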





GenBank Genome
Table: 700621
(v= >1 <2 [1.0848%])
URL Link: CP009334.1

Genome info for CP009334.1:

CP009334 349601 bp DNA circular BCT 26-JUL-2016 Bacillus thuringiensis strain HD1011 plasmid 2 complete sequence.

Nucleotide count: 349,601

Atom Atom Count Percent
Carbon 1,677,817 32.5787
Hydrogen 1,884,682 36.5954
Nitrogen 1,197,598 23.2541
Oxygen 389,952 7.5718
Totals 5,150,049 ~100.00





GenBank Genome
Table: 786399
(v >=4 [4.3408%])
URL Link: CP045289.2

Genome info for CP045289.2:

CP045289 25142 bp DNA circular BCT 06-MAR-2020 Curtobacterium flaccumfaciens pv. flaccumfaciens strain P990

Nucleotide count: 25,142

Atom Atom Count Percent
Carbon 118,003 32.5860
Hydrogen 142,266 39.2861
Nitrogen 60,628 16.7421
Oxygen 41,231 11.3858
Totals 362,128 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 99,913

variation count:
@ <2.0% = 86

variation count:
@ <3.0% = 0

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 1



after processing 900,000 genomes:

Sample genome HTML tables
showing AVG variation
summaries for this section:



GenBank Genome
Table: 800001
(v= <1 [0.0528%])
URL Link: CP057830.1

Genome info for CP057830.1:

CP057830 61278 bp DNA circular BCT 13-MAY-2022 Escherichia coli strain RHB13-C18 plasmid pRHB13-C18 2 complete

Nucleotide count: 61,278

Atom Atom Count Percent
Carbon 291,362 32.2487
Hydrogen 322,125 35.6536
Nitrogen 229,129 25.3606
Oxygen 60,869 6.7371
Totals 903,485 ~100.00





GenBank Genome
Table: 800233
(v= >1 <2 [1.0316%])
URL Link: CP051757.1

Genome info for CP051757.1:

CP051757 20671 bp DNA circular BCT 01-DEC-2020 Sarcina sp. JB2 plasmid p3 complete sequence.

Nucleotide count: 20,671

Atom Atom Count Percent
Carbon 100,807 32.8231
Hydrogen 112,193 36.5304
Nitrogen 71,745 23.3604
Oxygen 22,377 7.2860
Totals 307,122 ~100.00





GenBank Genome
Table: 819779
(v= >2 <3 [2.3151%])
URL Link: CP095532.1

Genome info for CP095532.1:

CP095532 14577 bp DNA linear BCT 05-DEC-2022 Escherichia coli strain GN06156 plasmid unnamed1.

Nucleotide count: 14,577

Atom Atom Count Percent
Carbon 71,294 32.5927
Hydrogen 80,958 37.0107
Nitrogen 45,484 20.7934
Oxygen 21,006 9.6031
Totals 218,742 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 99,821

variation count:
@ <2.0% = 178

variation count:
@ <3.0% = 1

variation count:
@ <4.0% = 0

variation count:
@ >=4.0% = 0



after processing 1,000,000 genomes:

Sample genome HTML tables
showing AVG variation
summaries for this section:



GenBank Genome
Table: 900001
(v= <1 [0.1424%])
URL Link: AC016793.3

Genome info for AC016793.3:

AC016793 96138 bp DNA linear HTG 15-DEC-1999 Drosophila melanogaster strain y chromosome 2 clone BACR33K22

Nucleotide count: 91,171
Non-Nucleotide Count: 4,967

Atom Atom Count Percent
Carbon 435,426 32.3266
Hydrogen 481,317 35.7336
Nitrogen 338,611 25.1389
Oxygen 91,606 6.8009
Totals 1,346,960 ~100.00





GenBank Genome
Table: 901169
(v= >1 <2 [1.1137%])
URL Link: AC019006.3

Genome info for AC019006.3:

AC019006 149651 bp DNA linear HTG 16-MAR-2000 Homo sapiens chromosome 8 clone RP11-458N4 map 8 WORKING DRAFT

Nucleotide count: 148,150
Non-Nucleotide Count: 1,501

Atom Atom Count Percent
Carbon 724,599 32.4130
Hydrogen 772,412 34.5518
Nitrogen 613,462 27.4416
Oxygen 125,045 5.5936
Totals 2,235,518 ~100.00





GenBank Genome
Table: 918032
(v= >2 <3 [2.0842%])
URL Link: AC027353.4

Genome info for AC027353.4:

AC027353 101509 bp DNA linear HTG 26-SEP-2000 Homo sapiens chromosome 16 clone RP11-167D21 WORKING DRAFT

Nucleotide count: 71,328
Non-Nucleotide Count: 30,181

Atom Atom Count Percent
Carbon 349,107 32.2159
Hydrogen 363,652 33.5581
Nitrogen 320,538 29.5795
Oxygen 50,351 4.6464
Totals 1,083,648 ~100.00





GenBank Genome
Table: 994274
(v= >3 <4 [3.0781%])
URL Link: LN006378.1

Genome info for LN006378.1:

LN006378 14473 bp DNA linear INV 07-OCT-2014 Spirometra erinaceieuropaei genome assembly S erinaceieuropaei

Nucleotide count: 11,935
Non-Nucleotide Count: 2,538

Atom Atom Count Percent
Carbon 54,428 32.2485
Hydrogen 65,229 38.6480
Nitrogen 32,519 19.2674
Oxygen 16,601 9.8361
Totals 168,777 ~100.00

Variation Summary for this section:

variation count:
@ <1.0% = 99,924

variation count:
@ <2.0% = 72

variation count:
@ <3.0% = 3

variation count:
@ <4.0% = 1

variation count:
@ >=4.0% = 0



Total processed 1,052,789 genomes:


Variation totals for all sections:

variation count:
@ <1.0% = 1,045,196
(99.2788%)

variation count:
@ <2.0% = 7,529
(0.7151%)

variation count:
@ <3.0% = 61
(0.0058%)

variation count:
@ <4.0% = 2
(0.0002%)

variation count:
@ >=4.0% = 1
(0.0001%)