Saturday, December 9, 2023

On The Origin Of Another Genetic Constant - 4

New and Improved!

This new constant discovery applies only to "TATA" boxes in genomes where transcription from DNA to mRNA takes place (TATA box).

The type of tata box featured today has a "TATAAA" promoter and a "TATCTC" terminator (see example below).

I processed about 9,500 FASTA files, excluding those without any tata boxes in them, to produce today's appendices.

The appendices contain the results and the variation quantities from the ~(32/35/25/6) genetic constant in human and various other genomes.

How the constant was isolated is shown here (On The Origin Of A Genetic Constant - 9). 

The big news is that the tata box examples in the appendices match the variation values of entire genomes that were featured in previous posts (On The Origin Of A Genetic Constant, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10).

Here is the list of appendices (001, 002, 003, 004, 005, 006, 007, 008, 009, 010, 011, 012).

A sample tata box:

From organism Xenopus tropicalis: TATAAACTGGCACAGTTGCTATGGGTCCCTGAGCAGGGAGGCCGGG
ACACATGGGACCTTGGTTATGGCCCAGTCAGGGTTATACTCATAGGTTACACTGGGTAGGGTCAGGGCTGAGCAGCTTGA
GCTCCAGCGAGGAGATTCAGTGCGATTACTGCCCAGAGTAGGGACCCAGTGTACAGACCTAGGGCCAAAGAGACTTGGTG
GGGGGGTGACCCGTCATTACTGATCTAATTCTCTGCGCATTGATTGGCCCTGAGGGGCTGAGAATTGTGGGAAGGTATAA
CTGTAAGTGCCCCCTGTGTGAGGTACTTGCCCTGCTGCCCTGAGATGTAATAGTTAATAAGTCCTTGTTTGAGTTCAGAA
ACCCCTGCTGCCCATTATTGGTGTCTCAGAGTGAGAGATATAGTTGCACTACACCCTCCCCCCAATACATACCTGAGAGA
TTGCCCTGGTCTATGGAGTCCAGCCCCTTACAGTTCCTAATAACTAGGCGATTCCTTTACTTACTGAATGTCACCTTAGT
GAATGTCACCCTGAGGCGCAAATCAAACAGCGTTAAACTTTGCGCGACTCCGGTAGAAAACAAACGCGCTGCCGCCCTTG
TGGCCCCGGCACCAACTTTCCCTTTTATTCCCATTTCTTGTCTCAGTCGCTCCCCAAACAGCATTTATGTGCGGTAATGT
TATGCCCCCCCCATTAATGAAGGGAATGCGGAGACATAATAACAGGGTCCCCCAGGGGCCCCCCGGCTGCTTAGCATGTG
ACCGAGTCGCTGCGATTCCCCCGCTGTATAATGAGCCGCTACTTCAGGCAGAATGCGATATCAAAGGCGCCGGCGCTGCG
ACTCTGTGTCAGATTGCGCTTTCTGCCCCCTCCCTTCTACGTGAACTTGTGCCCCTGTCAGACATCTTAGGGAGGCTTGA
AACATAGGGCCCCCCCTCCTCGATACCTGCCCCCCCCGCTCCCTGGGGCCCCAACGGGACTCGCTTTGTGCAGCTTTTAA
CTTGATAAAGTTTAACGTTCTGAGCGAAGTGAGACCCCCCCGAGAAATGCTGCCCCCCAGTGTCACTCATCCAAAGGTTT
AACATTCTGAGCGAAGTGAGACCCCCGAGAAAATGCCTTGACCCCCCAGTGTCACTCATCCAAAGGTTTAACGTTCTGAG
CGAAGTGAGACCCCCCCAGATAAATGCTTGCCCCCAGTGTCACCTCTCAAAGGGTTTAACGTTCTGAGCGAAGTAGGAGG
ACCCCCAGAGAAATGCTGCCCCCTCCAGTGTTTCACTCAGTCCCAAAGTTACGTTCTGAGCGAAGTGAGACCCCCCCCAG
CGAGAAATGCTGCCCCCAGTGTCACTCTCCAAGGTTTAACGTTCTAGAGCGAAAGTGAGGACCCCCCGAGAAATGTGCCC
CCCAGTGTCAACTCATCCAAAGGTTTAACGTTTGACGAAGTGAAACCCCCCCCCCCGAGAAATGCTGCCCCCCAGTGTCA
CTCATCCAAAGGTTTAACATTCTGAGCGAAGTGAGACCCCCCCGAAATGCTGCCCCCCAGTGTCACTCATCCAAAGGTTT
AACGTTCTGAGCGAAGTGAGACCCCCCCCCGAGAAATGCTGCCCCCCAGTGTCACATCATCCAAAAGTTTAACTGTTCTC
GAGCGAAGTGAGACCCCCCGCCGAGAAATGCTGCCCCCCAGTGTCACTCAATCAAAGGTTAACGTTCTGAGCGAAGTGGA
CCCCCCCAGAGAAATGGGCTTGCCCCCCCAGTTGTCACTCATCAAAGGTTTAACGTACTGAGCGAAGTAGAGACCCCCCC
CCGAGAAAATGCTGCCGCCCAGTGGTGTCACTCATCCAAAGTTTCAACGTTCTGAGCGAAGTGAGACCCCCCCAGAGAAA
TGCCTGCCCCAGTGTCCAACTCACTCCAAAGGTTTAACGTTCTGAAGTGAAGTGAGACCCCCCCCAGAGAAAATGCTGCC
CCCAGTGTACTCATCCAAAGTTTAACGTCTGAGCGAGTGAGACCCCCCCCGAGAAATGCTGCCCCAGTGTCCACTCATCA
AAGGTTTAACGTTGCTGAGCGAAGTGAGACCCCCCCCAGAGAAATGGCTGCCCCAGTGTCACTCATCCAAAGTTTAACGT
CTGAGCGAAGTGAAGACCCCCCCGAGAAATGCTGCCCCCCAGTTACTCATCCAAAGGTTTAAGTTCTGACCGAAGTTGAG
ACCCCCCCCGAGACATGCTGCCCCCAGTTGTCACTCATCCAAGGTTAACATTCTGAAGACGAAGTGAGACCCCCCCAGAG
AAATCTGCCCAGTGGTCACTATTCCAAAGTTTAACGTTCTTGAGCGAAGTGAGACCCCCTCCCAGAGAAATGCTGCCCAG
TGTCACTCATCCAATAGGTTTAACATTCTGAGCGAGTGAGACCCCCAGAGGAGAATGCTTGCCCCCAGTGTCACTCATCC
AAAGGTTTAACTTCTGAGCGAAGTGAGACCCCCCAGAGAAAATTGCTGCCCCCAGTGTCACTCATCAAATAGGTTTTAAC
GTTACTGACGAAGTGAGACCCCCCCCCGAGAAATGCTAGCCCCCCAGTGTCACTCATCCAAGGTTTAACGTTCTTGATGC
GAGTTGAGACCCCCCCGATTAAATGCTGCCCCCCAGTGTCATCATCCAAAGGTTTAACGTCTCTGAGCGAAGTGCAGACC
CCCCCGAAGAAAAAGCTGGCCCCCCGAGTTCACTCATCCAAGGTTTTACGTTCTGAGCGAAGTGAGAACCCCCCCGAGAA
TGCTGCCCCCCAGTGTCACTCATCAAAGTTTAACGTCTGAGCGAAGTGAGACCCGCCCCCAGAGAACTGCTGCCCCCCAG
TGGTCACTCATTCCAAAGGTGTAACGTGCTGAGCGAAGTGGACCCCCCCAGAAGAAAAATGCTGCCCCAGTCGTGTCACT
CTCCAAAGGTTTTAACGTTTGAGCGAAGTGGAACCCCCCCGGAAATGCTGCCCCCAGTGTCATCATTCAAGGTTTACGTT
CTGAGCGAAGTTGAGACCCCTCCGAGAGAAATGCTGCCCCCAGTGTCATCATCGAAAAAATGTTTTTAACGTTCTGAGCG
AAGTGAGACCCCCCAGGAAACTGCTGCCCCAGTGTCACTCATCAAGGTTTAACGTTCTGAGCAAGTGAGACCCCCCCCCT
GAGAAATGCTGCCCCCAGTGTCACCTCATCCAAAGTTTAACATTCTAGCGAAGTGGACCCCCCGAGTTAAATGCTGCCCC
CAGTGTCACTCATCCAAAGGTTAAGTGTTCTGAGCGAAGTGAGACCCCCCGAGAAATGCTGCCCCCCAGTGTCACTCATA
CAAAGGTTTAACGTTTCGAGCGAAGTGAGACCCCCCAGAGAAATGCTGCCCCCCAGGTCACTATCCAAAGGTTTAACGTT
CTGGAGCGAAGTGGACCCCCCAGAGAAAATGCTGCCCCCAGTGTCACTCATCCAAAGGTTTAACGTTACCTGGAGCGAAG
TTGAGACCCCCCCAGAGAAATGACTGCCCCCATGTCACTCATCCAAGTGGCCCCATCTCTTCCTGTATTTATACAGACTG
TGTGAGGTGCGGAACTACTACTCCCTGCCTCACAGGAGCTTACACTTTACTCAGCTCCACCCCATACCCCCCCTAATGTC
ACCCAGTGTCGGACTGGGACCCCGAGGGCCACCAGAAAACCTTAGACCCTAGTCCCCCTCTCCCAACTA
TATCTC

The video below points out the dynamics involved.

The previous post in this series is here.



No comments:

Post a Comment