Wednesday, June 14, 2023

It's In The GenBank - 7


Fig. 1 Nucleotides

I. In General

The "ABC's" ("ACGT") of our genes are depicted in Fig. 1.

DNA genes are made of these molecules, excluding Uracil, which is in RNA.

The vast GenBank stores files of 'sequences' of these nucleotides (GenBank FTP download site).

The GenBank "Readme" file explains technical aspects of the site (Readme.genbank link to text file).

The process which a 'submitter' goes through to upload data into the database is explained here.

II. Accuracy Tests

Today's post contains a list of HTML tables in several appendices which show the results of Dredd Blog tests done on many of the files in the GenBank (Appendix A, B, C, D, E, F, G, H, I, J, and K).

Fig. 2 Amino Acids

While the GenBank files are generally accurate, some of them have errors in the CDS->Translation data.

That is where, in the GBFF files, gene locations like "CDS 15..345" (which refers to nucleotide locations "fifteen through three-hundred and forty-five").

Codons within that range are 'translated' into an amino-acid sequence.

Amino acids are 'generated' by ribosomes based on the three nucleotide codons within the 15.345 range.

The "translation" results in a list of amino acids that are indicated by the codons in the 15..345 range of nucleotides (the 20 available amino acids are shown in Fig. 2).

In the CDS 15..345 example, that hypothetical list is 110 "codons" that are each made up of three of the ACGT molecules shown in Fig. 1.

The arithmetic is simple (345−15) ÷ 3 = 110 codons; so that CDS reference should equate to 110 amino acids (one amino acid per codon).

The "translation key" ("/translation=") in the GBFF files would refer back to the 110 letters (amino acids) each of which is derived from a three-letter codon by the sequencing machines and researcher's processing of the genetic material.

III. Dredd Blog Test Methods

The test procedures are: 

1) proceed through the GBFF file searching for a CDS key that is not a 'complement', 'join', 'assembly_gap', or 'gap' (only "CDS digit..digit" references such as "CDS 15..345" are tested);

2) find the subsequent matching "/translation=" sequence of amino acids

3) proceed through the list one letter (one amino acid) at a time;

4) extract three nucleotides (one codon) from the specified DNA location for each amino acid;

5) see if that exact codon is associated with the exact specified amino acid in the Genetic Code.

If the information in the file is accurate, move on, but if there is a mismatch either in the number of CDS specifications compared to translation specifications, or in a codon to amino acid specification, add those errors to the list of errors being produced (The CDS-Translation count (column 2 in the HTML table is not a total CDS-Translation count, it is only a count of those with mismatches).

IV. Closing Comments

Here is a depiction of the CDS hypothetical example above:

...
CDS 15..345
...
/translation="MNPLAQPVIYSTIFAGTLITALSSHWFFTWVGLEMNMLAFIPVLTKKMNPRSTE
AAIKYFLTQATASMILLMAILFNNMLSGQWTMTNTTNQYSSLMIMMAMAMKLGMAP"
...
ORIGIN
AAAAATAGCCAGACATGGTGGCGGGCACCTGTAATCCCAGCTACTCGGGAAGCTGAGGC
AAGAGAATCACTTGAAACCCAGGAGGCAGAGTTTGCAGTGAGCTGAGATTATGCCACTG
CACTCCAACTTGGGCAACAGAGCGAGACTGTCTGAGAAAAAAAATACACAAAAGCACAC
ATTTTTTTATATTACAGTTGTGTAGATCAGAAGTTGGACACAGGTCTCACTGGAATAAA
ATCAAGAAGTCAGCAGGACCAGTTCCTTCTGAAGGTTTTGGGGAGAATCTGTTTCCTTA
CTGCTGCAGCTTCTAGAGGTTGCCTGTATTCCTTGGCTTGTGACCCTAA
TCT
TCATCTTCAAAGCCACCACTGGCAAATTTAGTCCCTTTTACGCCTGACATTT...
//

(The underlined letters (nucleotides) represent the three-letter codons in the 15..345 range).

The letters following the  "/translation=" key represent amino acids (1 letter - 1 amino acid).

The Appendix files have a link to the GenBank genome sequence so that you can see the official file that was tested in each case.

If a person used a word processor to test these files it would take years to to all of them in GenBank.

The program I wrote to do this does it is seconds (less than a minute).

The next post in this series is here, the previous post in this series is here.

Appendix K GenBank - 7

This is an appendix to: It's In The GenBank - 7


GBFF '.seq' File Analysis
of Translation | CDS Data
and Amino Acid - Codon
Mismatches

GenBank File
and/or Version
Translation|
CDS Counts
AA - Codon
Mismatches
'gbbct369'
CP041584.1
835|835 9013
CP041585.1 1250|1250 14839
CP041586.1 1056|1056 14302
CP041587.1 1794|1794 105
CP041617.1 830|830 111
CP041626.1 929|929 1476
CP041633.1 1430|1431 4730
CP041637.1 1619|1619 2768
CP041638.1 1582|1582 1869
'gbbct591'
CP073284.1
1098|1098 2050
CP073349.1 1050|1050 6281
CP073350.1 1053|1053 6197
CP073355.1 1178|1178 4520
CP073366.1 1628|1628 1788
'gbbct10'
AP013064.1
29|29 101
AP013065.1 54|54 9
AP013066.1 1528|1528 212
AP013067.1 31|31 4
AP013104.1 47|47 15
AP013293.1 146|146 9
AP013294.1 1696|1696 408
AP014520.1 1167|1167 149
AP014521.1 317|317 38
AP014523.1 811|811 167
AP014524.1 1258|1258 154
AP014525.1 710|710 178
AP014563.1 727|727 152
AP014567.1 967|967 53180
AP014568.1 1158|1158 186
AP014569.1 1046|1046 177
AP014571.1 950|950 51
AP014572.1 940|940 146
AP014576.1 1318|1318 112
AP014577.1 819|819 94
AP014578.1 763|763 74
AP014579.1 662|662 895
'gbbct492'
CP058312.1
1424|1424 7126
CP058315.1 1084|1084 3591
CP058316.1 1660|1660 3534
CP058321.1 1778|1778 4142
CP058334.1 1620|1620 1468
CP058335.1 1899|1899 7721
CP058336.1 809|809 6020
CP058340.1 734|734 5221
CP058347.1 1303|1303 3007
CP058348.1 1980|1980 1236
CP058364.1 1656|1656 3585
CP058409.1 1513|1513 851
'gbbct493'
CP058529.1
1806|1806 4404
CP058534.1 1631|1631 2000
CP058557.1 1511|1511 4797
CP058559.1 1671|1671 3256
CP058562.1 1231|1231 6500
CP058613.1 1327|1327 7242
CP058615.1 1442|1442 7780
CP058624.1 1310|1310 4153
CP058627.1 1618|1618 1451
CP058666.1 985|985 92
CP058668.1 1299|1299 4642
CP058669.1 1241|1241 4704
'gbbct404'
CP046120.1
1678|1678 25646
CP046121.1 1508|1508 29257
CP046146.1 1416|1416 2247
CP046147.1 1419|1419 2088
CP046162.1 1620|1620 158
CP046242.1 1439|1439 4625
CP046243.1 703|703 7846
CP046246.1 1117|1117 9488
CP046250.1 1419|1419 24204
CP046276.1 592|592 1820
'gbbct335'
CP035491.1
1655|1655 25099
CP035493.1 1710|1710 19268
CP035495.1 1764|1764 35566
CP035497.1 783|783 934
CP035504.1 1206|1206 13971
CP035510.1 1228|1228 3056
CP035533.1 1870|1870 20785
'gbbct264'
CP027017.1
1259|1259 7392
CP027018.1 1442|1443 7108
CP027034.1 1793|1793 5133
CP027061.1 1910|1910 7000
CP027062.1 1491|1491 899
CP027116.1 1849|1849 4597
CP027123.1 1912|1912 205
'gbbct608'
CP075681.1
1778|1778 7189
CP075684.1 1767|1767 7035
CP075685.1 1760|1760 7129
'gbbct379'
CP042808.1
1212|1212 2572
CP042812.1 1331|1331 181
CP042813.1 1412|1412 12057
CP042815.1 1425|1425 12531
CP042816.1 1385|1385 12738
CP042817.1 1516|1516 12213
CP042818.1 1521|1521 11603
'gbbct64'
CP001750.1
1105|1105 223
CP001752.1 530|530 206
CP001758.1 1262|1262 91
CP001759.1 519|519 185
CP001779.1 693|693 61
CP001781.1 1321|1321 168
CP001785.1 1027|1027 525
CP001787.1 833|833 95
CP001791.1 1815|1815 249
CP001792.1 1573|1573 80
CP001801.1 1129|1129 125
CP001807.1 1448|1448 61
CP001816.1 1098|1098 154
CP001818.1 740|740 324
CP001820.1 924|924 61
CP001821.1 1750|1750 658
CP001827.1 712|712 134
CP001828.1 1625|1625 182
CP001830.1 1962|1962 253
CP001834.1 1241|1241 2999
CP001837.1 1310|1310 201
CP001839.1 897|897 312
CP001840.1 858|858 131
CP001841.1 1679|1679 269
CP001844.2 1319|1319 240
CP001845.1 1023|1023 130
CP001849.1 688|688 110
CP001850.2 805|805 116
CP001857.1 882|882 131
CP001859.1 1007|1007 106
CP001860.1 1833|1833 152
CP001868.2 1487|1487 214
'gbbct181'
CP017478.1
1391|1391 2185
CP017484.1 1141|1141 8499
CP017485.1 1144|1144 7726
CP017486.1 1165|1165 8669
CP017487.1 1157|1157 8079
CP017488.1 1156|1156 8078
CP017489.1 1143|1143 7914
CP017490.1 1157|1157 8079
CP017491.1 1253|1253 4750
CP017492.1 1133|1133 8642
CP017493.1 1134|1134 8253
CP017494.1 1133|1133 8647
CP017495.1 1157|1157 8082
CP017496.1 1174|1174 8435
CP017497.1 1155|1155 8665
CP017498.1 1095|1095 7465
CP017499.1 1231|1231 7862
CP017500.1 1130|1130 7607
CP017501.1 1255|1255 4421
CP017502.1 1127|1127 7417
CP017503.1 1095|1095 7489
CP017504.1 1368|1368 6626
CP017505.1 1305|1305 6147
CP017506.1 1305|1305 5818
CP017507.1 1142|1142 7724
CP017508.1 1143|1143 7912
CP017509.1 1258|1258 9096
CP017510.1 1175|1175 8436
CP017511.1 1178|1178 8626
CP017512.1 1179|1179 8626
CP017513.1 1174|1174 8426
CP017514.1 1095|1095 7488
CP017515.1 1127|1127 7759
CP017516.1 1131|1131 7610
CP017517.1 1133|1133 8645
CP017518.1 1132|1132 8455
CP017519.1 1370|1370 6113
CP017520.1 1399|1399 4969
CP017521.1 1334|1334 6401
CP017522.1 1153|1153 8454
CP017523.1 1235|1235 5038
CP017524.1 1358|1358 5259
CP017525.1 1179|1179 7813
CP017526.1 1135|1135 8702
CP017527.1 1374|1374 6359
CP017528.1 1274|1274 5572
CP017529.1 1333|1333 6495
CP017530.1 1244|1244 5994
CP017531.1 1285|1285 5994
CP017532.1 1436|1436 5577
CP017533.1 1235|1235 5075
CP017534.1 1359|1359 6472
CP017535.1 1287|1287 5354
CP017536.1 1335|1335 6512
CP017537.1 1371|1371 6260
CP017538.1 1389|1390 6312
CP017539.1 1306|1306 5377
CP017540.1 1235|1235 4625
CP017541.1 1372|1372 5435
CP017542.1 1339|1339 5246
CP017543.1 1374|1374 6026
CP017544.1 1371|1371 5874
CP017545.1 1209|1209 7680
CP017546.1 1338|1338 5560
CP017547.1 1318|1318 8493
CP017548.1 1368|1368 6120
CP017549.1 1310|1310 6317
CP017550.1 1357|1357 5913
CP017551.1 1397|1397 6816
CP017552.1 1343|1343 5958
'gbbct171'
CP016370.1
1892|1892 3022
CP016371.1 1754|1754 4234
CP016379.1 1469|1469 5942
CP016382.1 830|830 609
CP016394.1 977|977 180
CP016432.1 1044|1044 73
'gbbct86'
CP006591.1
1476|1476 276
CP006592.1 1444|1444 280
CP006593.1 1500|1500 271
CP006594.1 1018|1018 186
CP006596.2 1327|1327 1463
CP006597.1 1460|1460 277
CP006598.1 1422|1422 274
CP006599.1 1480|1480 252
CP006600.1 1938|1938 353
CP006603.1 1048|1048 179
CP006604.1 524|524 49
CP006610.2 712|712 109
CP006615.1 1063|1063 158
CP006616.1 636|636 207
CP006617.1 638|638 212
CP006619.1 1447|1447 112
CP006620.1 1445|1445 232
CP006630.1 1286|1286 195
CP006645.1 1024|1024 155
CP006646.1 953|953 256
CP006647.2 397|397 75
CP006649.1 1070|1070 122
CP006650.1 1669|1669 165
CP006670.1 1285|1285 198
CP006671.1 459|459 82
CP006673.1 460|460 86
CP006674.1 462|462 70
CP006676.1 459|459 72
CP006680.1 451|451 65
CP006681.1 568|568 1933
CP006682.1 578|578 1733
CP006683.1 804|804 130
CP006686.1 834|834 115
CP006689.1 836|836 116
CP006690.1 1229|1229 4815
CP006691.1 841|841 235
CP006696.1 1352|1352 245
CP006702.2 736|736 4907
CP006706.1 1213|1213 180
CP006707.2 875|875 105
CP006708.2 838|838 100
'gbbct212'
CP021040.1
1762|1762 132
CP021047.1 1762|1762 132
CP021059.1 1155|1155 157
CP021081.1 1368|1368 2044
CP021105.1 1310|1310 3027
CP021106.3 1475|1475 2236
CP021114.1 1399|1399 2657
CP021129.1 273|273 3612
CP021130.1 1308|1308 206
CP021138.1 1358|1358 7593
'gbbct664'
CP084878.1
1174|1174 3700
CP084916.1 1025|1025 13973
CP084921.1 1861|1861 5427
CP085036.1 1386|1386 1055
CP085037.1 1813|1813 2828
CP085043.1 1826|1826 8448
CP085046.1 1118|1118 2295
CP085047.1 1096|1096 2810
CP085048.1 1096|1096 2740
CP085049.1 1092|1092 3110
CP085050.1 1097|1097 3259
CP085051.1 1096|1096 2741
'gbbct502'
CP059902.1
1813|1813 2299
CP059964.1 836|837 9518
CP059965.1 843|843 9286
CP059966.1 753|754 7864
CP059967.1 818|818 8615
CP059968.1 842|843 8613
CP059969.1 765|766 11961
CP059970.1 876|877 10697
CP059971.1 740|741 9894
CP059972.1 798|799 9466
CP059986.1 1842|1842 4091
CP059987.1 1342|1342 1823
'gbbct117'
CP010346.1
1395|1395 277
CP010350.1 1555|1555 135
CP010368.1 1710|1710 136
CP010397.1 1660|1660 148
CP010401.1 606|606 103
CP010402.1 1302|1302 210
CP010406.1 1917|1917 410
CP010411.1 1173|1173 264
CP010412.1 925|925 203
CP010432.1 966|966 3315
CP010433.1 797|797 138
CP010435.1 746|746 133
CP010436.1 744|744 132
CP010437.1 961|961 164
'gbbct79'
CP003700.1
1100|1100 94
CP003703.1 383|383 1753
CP003704.1 784|784 107
CP003708.1 164|164 30
CP003709.1 819|819 581
CP003723.1 1071|1071 90
CP003726.1 1331|1331 194
CP003731.1 501|501 1386
CP003732.1 1356|1356 512
CP003736.1 883|883 82
CP003745.1 1182|1182 137
CP003770.1 304|304 654
CP003771.1 286|286 568
CP003772.1 309|309 612
CP003773.1 302|302 642
CP003784.1 603|603 105
CP003787.1 1070|1070 126
CP003789.1 668|668 99
CP003790.1 567|567 117
CP003791.1 533|533 90
CP003792.1 530|530 96
CP003793.1 569|569 123
CP003794.1 533|533 92
CP003795.1 547|547 86
CP003796.1 537|537 85
CP003797.1 577|577 114
CP003798.1 537|537 93
CP003799.1 973|973 176
CP003800.1 829|829 63
CP003801.1 740|740 109
CP003802.1 352|352 1087
CP003806.1 336|336 38
CP003808.1 1378|1378 228
CP003809.1 701|701 50
CP003810.1 974|974 100
CP003835.1 155|155 25
CP003838.1 1898|1898 449
CP003839.1 921|921 57
CP003840.1 828|828 63
'gbbct305'
CP031838.1
1399|1399 4772
CP031839.1 1405|1405 4235
CP031842.1 1491|1491 4787
CP031843.2 965|965 8791
CP031844.2 933|933 7491
'gbbct349'
CP038439.1
1497|1497 7874
CP038440.1 1366|1366 1358
'gbbct244'
CP025057.1
589|589 2061
CP025074.1 1731|1731 11047
CP025076.1 1073|1073 8878
CP025077.1 1235|1235 7642
CP025082.1 1452|1452 1913
CP025086.1 1722|1722 2763
CP025095.1 1229|1229 9334
CP025116.1 1323|1323 1002
CP025118.1 1524|1524 3308
CP025119.1 1579|1579 3994
CP025120.1 1294|1294 1691
CP025135.1 1494|1494 32752
'gbbct637'
CP080623.1
1775|1775 2286
CP080624.1 1677|1677 1045
CP080627.1 1554|1554 2777
CP080636.1 1577|1577 4357
CP080649.1 1650|1650 551
CP080651.2 740|740 758
CP080759.1 1332|1332 4861
CP080760.1 1778|1778 5203
CP080761.1 1812|1812 11185
CP080770.1 1306|1306 3353
CP080953.1 1172|1172 3183

Appendix J GenBank - 7

This is an appendix to: It's In The GenBank - 7


GBFF '.seq' File Analysis
of Translation | CDS Data
and Amino Acid - Codon
Mismatches

GenBank File
and/or Version
Translation|
CDS Counts
AA - Codon
Mismatches
'gbbct127'
CP011361.3
1267|1267 1457
CP011364.1 1596|1596 163
CP011366.1 1452|1452 211
CP011367.1 1352|1352 205
CP011368.1 283|283 951
CP011372.1 766|766 79
CP011373.1 1296|1296 124
CP011374.1 876|876 95
CP011376.1 1090|1090 131
CP011377.1 909|909 95
CP011378.1 1008|1008 110
CP011379.1 923|923 94
CP011380.2 962|962 3195
CP011381.2 934|934 5326
CP011386.1 923|923 154
CP011387.1 1242|1242 260
CP011389.1 1543|1543 412
CP011391.1 1271|1271 233
CP011393.1 844|844 247
CP011397.1 1419|1419 249
CP011402.1 949|949 176
CP011403.1 848|848 170
CP011414.1 867|867 82
CP011415.1 867|867 82
CP011419.1 1043|1043 103
CP011452.2 1565|1565 224
'gbbct258'
CP026503.1
1108|1108 10152
CP026515.1 743|743 5793
'gbbct302'
CP031470.1
1041|1041 36388
CP031471.1 1139|1139 26717
CP031513.1 875|875 1871
CP031517.1 1096|1096 2120
CP031518.1 1263|1263 1253
CP031537.1 1402|1402 6743
CP031551.1 1071|1071 2546
CP031552.1 1092|1092 4041
CP031553.1 1067|1067 4023
CP031554.1 1071|1071 2548
CP031556.1 1049|1049 5046
CP031558.1 688|688 7497
'gbbct466'
CP054424.1
1777|1777 3528
CP054476.1 1450|1450 742
CP054477.1 1207|1207 4889
CP054482.1 1196|1196 2518
CP054494.1 1321|1321 4218
CP054495.1 1321|1321 4218
CP054496.1 1321|1321 4218
CP054497.1 1321|1321 4217
CP054498.1 1320|1320 4221
CP054499.1 1321|1321 4218
CP054500.1 1321|1321 4217
CP054501.1 1320|1320 4338
CP054502.1 1321|1321 4219
CP054503.1 1321|1321 4219
CP054504.1 1321|1321 4219
CP054505.1 1321|1321 4221
CP054506.1 1321|1321 4218
CP054507.1 1439|1439 5866
CP054508.1 1438|1438 5866
CP054509.1 1438|1438 6175
CP054510.1 1437|1437 6180
CP054511.1 1438|1438 5866
CP054512.1 1435|1435 6202
CP054513.1 1439|1439 5868
CP054514.1 1439|1439 5868
CP054516.1 1439|1439 5870
'gbbct321'
CP033720.1
920|920 1143
CP033740.1 1435|1435 7068
CP033808.1 1039|1039 5132
CP033809.1 1085|1085 6833
CP033810.1 995|995 4486
'gbbct70'
CP002547.1
1503|1503 258
CP002549.1 495|495 64
CP002551.1 1198|1198 172
CP002557.1 884|884 66
CP002558.1 874|874 63
CP002559.1 1056|1056 163
CP002562.1 983|983 82
CP002563.1 1240|1240 217
CP002565.1 1438|1438 326
CP002570.1 888|888 84
CP002571.1 788|788 185
CP002572.1 783|783 175
CP002573.1 1477|1477 418
CP002586.1 512|512 204
CP002588.1 1064|1064 125
CP002589.1 1200|1200 153
CP002590.1 1188|1188 456
CP002605.1 833|833 173
CP002606.1 829|829 82
CP002608.1 506|506 98
CP002609.1 1033|1033 149
CP002616.1 1492|1492 286
CP002618.1 1536|1536 292
CP002621.1 1270|1270 225
CP002627.1 1747|1747 411
CP002628.1 929|929 128
CP002629.1 1498|1498 192
CP002630.1 1157|1157 257
CP002631.1 1277|1277 85
CP002633.1 978|978 108
CP002634.1 2034|2034 487
CP002637.1 1140|1140 192
CP002640.1 913|913 90
CP002641.1 988|988 97
CP002643.1 1302|1302 185
CP002644.1 958|958 88
CP002648.1 284|284 30
CP002651.1 940|940 120
CP002652.1 1106|1106 201
CP002656.1 985|985 264
CP002659.1 864|864 47
CP002660.1 1887|1887 335
CP002663.1 1759|1759 274
CP002665.1 1556|1556 667
CP002667.1 1261|1261 88
CP002669.1 411|411 1268
CP002670.1 963|963 156
CP002671.1 452|452 47
CP002672.1 450|450 50
CP002673.1 465|465 48
CP002674.1 453|453 52
CP002675.1 447|447 46
CP002676.1 451|451 51
CP002677.1 468|468 48
CP002678.1 459|459 50
CP002679.1 451|451 48
CP002680.1 463|463 48
CP002681.1 451|451 49
CP002682.1 452|452 53
CP002683.1 1120|1120 116
CP002689.1 802|802 19
CP002690.1 1038|1038 355
'gbbct350'
CP038451.1
866|866 10022
CP038460.1 1262|1262 150
CP038461.1 1268|1268 156
CP038462.1 1608|1608 54753
CP038470.1 776|776 2155
CP038492.1 1853|1853 16866
CP038517.1 1787|1787 1874
CP038612.1 1345|1345 187
CP038613.1 1935|1935 227
CP038631.1 1109|1109 4458
CP038642.1 1470|1471 3664
CP038644.1 1723|1723 9482
'gbbct614'
CP076291.1
1788|1788 2904
'gbbct96'
CP007436.1
1778|1778 425
CP007443.1 1012|1012 101
CP007445.1 1477|1477 127
CP007446.1 1125|1125 108
CP007447.1 1415|1415 236
CP007454.1 1237|1237 154
CP007456.1 994|994 134
CP007457.1 785|785 132
CP007459.1 1468|1468 282
CP007460.1 1468|1468 282
CP007461.1 1469|1469 282
CP007462.1 1391|1391 269
CP007470.1 837|837 45
CP007471.1 873|873 42
CP007472.1 895|895 40
CP007473.1 499|499 51
CP007474.1 505|505 63
CP007475.1 445|445 57
CP007476.1 451|451 56
CP007477.1 504|504 60
CP007478.1 442|442 56
CP007479.1 447|447 49
CP007480.1 493|493 51
CP007481.1 438|438 92
CP007482.1 786|786 94
CP007490.1 725|725 172
CP007492.1 1364|1364 267
CP007493.1 968|968 293
CP007494.1 2125|2125 604
CP007495.1 2103|2103 551
CP007497.1 857|857 69
CP007499.1 1331|1331 198
CP007502.1 1160|1160 273
'gbbct583'
CP072038.1
1350|1350 2304
CP072040.1 1171|1171 2211
CP072042.1 1178|1178 1767
CP072043.1 1247|1247 1679
CP072044.1 1318|1318 1098
CP072049.1 1341|1341 1410
CP072050.1 1106|1106 3433
CP072051.1 1259|1259 2958
CP072052.1 1118|1118 1085
CP072107.1 1008|1008 786
CP072110.1 1379|1379 815
CP072111.1 1277|1277 4080
CP072112.1 948|948 3167
CP072113.1 1490|1490 4543
CP072115.1 955|955 1464
CP072116.1 1408|1408 5467
'gbbct4'
AJ235270.1
131|131 10
AJ235271.1 154|154 11
AJ235272.1 75|75 11
AJ235273.1 69|69 10
AJ248283.1 148|148 47
AJ248284.1 192|192 45
AJ248285.1 143|143 38
AJ248286.2 157|157 32
AJ248287.2 138|138 29
AJ248288.1 95|95 21
AJ344068.1 57|57 15
AJ576039.1 46|46 10
AJ749949.2 870|870 2874
AJ938182.1 1224|1224 1546
AL445563.1 192|192 655
AL445564.1 122|122 431
AL445565.1 89|89 380
AL591973.1 172|172 35
AL591974.1 107|107 131
AL591975.1 183|183 149
AL591976.1 195|195 462
AL591977.1 203|203 124
AL591978.1 206|206 44
AL591979.1 52|52 10
AL591980.1 16|16 4
AL591981.1 65|65 14
AL591982.1 76|76 13
AL591983.1 59|59 122
AL591984.1 46|46 8
AL596163.1 208|208 241
AL596164.1 156|156 31
AL596165.1 157|157 87
AL596166.1 210|210 625
AL596167.1 212|212 46
AL596168.1 208|208 102
AL596169.1 33|33 4
AL596170.1 57|57 10
AL596171.1 61|61 8
AL596172.1 60|60 15
AL596173.1 50|50 11
AL596174.1 13|13 1
AL766843.1 121|121 20
AL766844.1 177|177 142
AL766845.1 148|148 437
AL766846.1 179|179 490
AL766847.1 125|125 13
AL766848.1 51|51 143
AL766849.1 27|27 154
AL766850.1 27|27 376
AL766851.1 22|22 1
AL766852.1 28|28 2
AL766853.1 19|19 0
AL766854.1 28|28 1
AL766855.1 25|25 3
AL766856.1 42|42 360
AL935263.2 1514|1514 280
AL939104.1 155|155 154
AL939105.1 146|146 565
AL939106.1 146|146 411
AL939107.1 130|130 182
AL939108.1 137|137 42
AL939109.1 71|71 27
AL939110.1 108|108 38
AL939111.1 131|131 53
AL939112.1 149|149 53
AL939113.1 112|112 158
AL939114.1 124|124 47
AL939115.1 81|81 33
AL939116.1 113|113 74
AL939117.1 119|119 41
AL939118.1 107|107 30168
AL939119.1 154|154 61
AL939120.1 141|141 64
AL939121.1 184|184 75
AL939122.1 181|181 71
AL939123.1 152|152 68
AL939124.1 172|172 64
AL939125.1 163|163 89
AL939126.1 137|137 61
AL939127.1 118|118 165
AL939128.1 136|136 109
AL939129.1 129|129 58
AL939130.1 131|131 160
AL939131.1 121|121 156
AL939132.1 179|179 532
AL954747.1 1167|1167 5000
'gbbct352'
CP039138.1
1527|1527 2995
CP039139.1 1497|1497 2235
CP039143.1 1337|1337 6551
CP039156.1 1410|1410 3668
CP039157.1 1493|1494 8376
CP039160.1 1361|1362 5505
CP039162.1 1385|1386 2964
CP039164.1 1406|1407 5832
CP039167.1 1448|1448 6279
CP039227.1 2122|2122 495
CP039234.1 1921|1921 450
CP039240.1 2028|2028 458
CP039251.1 1303|1303 2311
CP039261.1 874|874 10280
CP039266.1 997|997 7864
CP039268.1 1192|1193 7568
'gbbct498'
CP059276.1
841|841 9266
CP059297.1 1496|1496 3293
CP059318.1 1787|1787 6596
CP059323.1 1205|1205 23332
CP059324.1 1208|1208 24915
CP059325.1 1306|1306 23638
CP059326.1 1092|1092 19478
CP059327.1 1113|1113 16974
CP059328.1 1106|1106 18686
CP059329.1 1167|1167 17407
CP059330.1 1196|1196 23740
CP059331.1 1090|1090 17884
CP059343.1 1236|1236 415
CP059344.1 1920|1920 4752
CP059360.1 812|812 3370
CP059362.1 890|890 5417
CP059364.1 935|935 10511
CP059365.1 809|809 4629
CP059366.1 436|436 1236
CP059368.1 466|467 1208
CP059369.1 809|809 3958
CP059371.1 815|815 5990
CP059372.1 808|808 4029
CP059373.1 735|735 2305
CP059374.1 905|905 5683
CP059375.1 842|842 13538
'gbbct418'
CP047931.1
1480|1480 434
CP047932.1 1394|1394 399
CP047933.1 1502|1502 529
CP047936.1 1652|1652 547
CP047937.1 1301|1301 451
CP047938.1 1567|1567 561
CP047939.1 1518|1518 503
CP047940.1 1547|1547 578
CP047941.1 1436|1436 474
CP047942.1 1508|1508 477
CP047943.1 1379|1379 400
CP047944.1 1465|1465 516
CP047945.1 1379|1379 435
CP047946.1 1839|1839 609
CP047948.1 1378|1378 460
CP047949.1 1455|1455 480
CP047950.1 1375|1375 489
CP047951.1 1494|1494 506
CP047952.1 1359|1359 423
CP047953.1 1358|1358 465
CP047954.1 1231|1231 436
CP047955.1 1510|1510 484
CP047956.1 1474|1474 474
CP047957.1 1465|1465 512
CP047958.1 1485|1485 530
CP047959.1 1687|1687 513
CP047960.1 1425|1425 439
CP047961.1 1304|1304 397
CP048001.1 905|905 5014
CP048014.1 1242|1242 4330
CP048020.1 1373|1373 23826
CP048021.1 1571|1571 1725
CP048029.1 1373|1373 19298
CP048030.1 1484|1484 3357
CP048032.1 1465|1465 3231
'gbbct282'
CP029185.2
1571|1571 5504
CP029186.1 1787|1787 4511
CP029187.1 1414|1414 1596
CP029198.1 1352|1352 125
CP029202.1 672|672 2320
CP029210.1 1783|1783 2522
CP029257.1 923|923 3357
'gbbct160'
CP014949.1
1287|1287 6765
CP014984.1 1441|1441 9317
CP014985.1 1493|1493 6102
CP014990.2 1782|1782 4749
'gbbct75'
CP003147.1
1125|1126 157730
CP003152.1 1046|1046 317
CP003153.1 1672|1672 182
CP003155.1 1587|1587 136
CP003157.1 997|997 101
CP003166.2 1323|1323 4443
CP003167.1 1440|1440 103
CP003168.1 764|764 74
CP003171.1 1611|1611 206
CP003179.1 1719|1719 426
CP003182.2 1251|1251 428
CP003184.1 1275|1275 271
CP003191.1 1620|1620 235
CP003194.1 1302|1302 188
CP003195.1 1117|1117 325
CP003196.1 1139|1139 327
CP003197.1 1123|1123 325
CP003198.1 1001|1001 297
CP003199.1 571|571 1960
CP003206.1 1201|1201 321
CP003207.1 1102|1102 305
CP003208.1 1098|1098 285
CP003209.1 1170|1170 309
CP003210.1 1161|1161 306
CP003211.1 1115|1115 297
CP003212.1 1101|1101 304
CP003213.1 1099|1099 294
CP003214.1 1125|1125 304
CP003215.1 1118|1118 300
CP003216.1 1178|1178 325
CP003217.1 1078|1078 298
CP003222.2 1250|1250 96
CP003229.1 852|852 381
CP003230.1 1155|1155 146
CP003231.1 276|276 938
CP003236.1 1761|1761 263
CP003243.1 1305|1305 187
'gbbct615'
CP076336.1
769|769 1062
CP076337.1 767|767 1806
CP076338.1 711|711 1089
CP076361.1 1604|1604 1729
CP076408.1 1809|1809 2888
CP076413.1 1643|1643 1098
CP076414.1 1779|1779 4858
CP076415.1 1670|1670 32772
CP076443.1 678|678 1900
CP076444.2 839|839 4990
CP076448.1 1657|1657 8784
'gbbct299'
CP031156.1
1064|1064 4516
CP031185.1 1165|1165 193
CP031188.1 1318|1318 1467
CP031192.1 1736|1736 7250
CP031196.1 1288|1288 6179
CP031219.1 1343|1343 189
CP031221.1 619|619 10738
CP031222.1 1656|1656 877
CP031230.1 1684|1684 4496
CP031250.1 842|842 7532
CP031265.1 1364|1364 4034
CP031266.1 1155|1155 10011
CP031269.1 1171|1171 18952
CP031271.1 1323|1323 4834
CP031274.1 1159|1159 1695
CP031275.1 1326|1326 2821
CP031277.1 1101|1101 3340
CP031280.1 1190|1190 4766
CP031290.1 1327|1327 5485
CP031303.1 1828|1828 4230
'gbbct491'
CP058293.1
764|764 6000
CP058295.1 871|871 10491
CP058299.1 778|778 7664
'gbbct248'
CP025395.1
1263|1263 240
CP025408.1 1888|1889 4175
CP025420.1 1156|1156 122
CP025425.1 1371|1371 5254
CP025430.1 1811|1811 3430
CP025437.1 1663|1663 12440
CP025438.1 1459|1459 2217
CP025440.1 1458|1458 2274
CP025442.1 1498|1498 3351
CP025443.1 1494|1494 3253
CP025471.1 888|888 1985
CP025473.1 1207|1207 11798
CP025474.1 716|716 6386
CP025476.1 969|969 8912
CP025495.1 1496|1496 3975
CP025512.1 1685|1685 1684
'gbbct72'
CP002838.1
1007|1007 996
CP002839.1 1777|1777 145
CP002840.1 190|190 11
CP002841.1 115|115 10
CP002842.1 44|44 1
CP002843.1 1066|1066 124
CP002844.1 1205|1205 225
CP002850.1 917|917 122
CP002857.1 994|994 324
CP002858.1 1188|1188 3709
CP002865.1 794|794 102
CP002868.1 1489|1489 125
CP002872.1 1086|1086 112
CP002873.1 1143|1143 110
CP002874.1 1368|1368 168
CP002888.1 944|944 136
CP002898.1 656|656 45
CP002899.1 608|608 38
CP002901.1 1748|1748 553
CP002903.1 1190|1190 136
CP002912.1 669|669 92
CP002913.1 964|964 157
CP002916.1 801|801 111
CP002917.1 1500|1500 521
CP002918.1 33|33 4
CP002919.1 1269|1269 397
CP002920.1 1022|1022 144
CP002924.1 1015|1015 302
CP002925.1 1137|1137 96
CP002927.1 1778|1778 420
CP002930.1 1955|1955 434
CP002933.1 409|409 99
CP002952.1 1133|1133 506
CP002953.1 743|743 119
CP002972.1 1630|1630 146
CP002976.1 1743|1743 173
CP002980.1 703|703 138
CP002982.1 754|754 145
'gbbct144'
CP013107.1
1852|1852 307
CP013114.1 1299|1299 186
CP013131.1 988|988 40
CP013132.1 1210|1210 178
CP013133.1 565|565 1800
CP013137.1 1300|1300 542
CP013138.1 1571|1571 1441
'gbbct101'
CP008776.1
931|931 109
CP008796.1 741|741 85
CP008804.1 908|908 55
CP008813.1 1025|1025 110
CP008816.1 598|599 95616
CP008821.1 1391|1391 269
CP008836.1 1425|1425 259
CP008837.1 1425|1425 258
CP008855.1 1503|1503 312
'gbbct437'
CP050462.1
1799|1799 3095
CP050485.1 1807|1807 6022
CP050491.1 1209|1209 1762
CP050497.1 1063|1063 20280
CP050505.1 965|965 11891
CP050511.1 1660|1660 25444
CP050541.1 1106|1106 6443
CP050690.1 1300|1300 3655
CP050691.1 1252|1252 4331
CP050695.1 1751|1751 6817
'gbbct361'
CP040518.1
1679|1679 9564
CP040556.1 1111|1111 15252
CP040560.1 1346|1346 5036
'gbbct545'
CP066377.1
1832|1832 2975
CP066421.1 1422|1422 7320
CP066427.1 1297|1297 5116
CP066434.1 1339|1339 6759
CP066451.1 1288|1288 5669
CP066466.1 1295|1295 5346
CP066486.1 620|620 33280
CP066487.1 617|617 31861
CP066541.1 881|881 3225
CP066583.1 1267|1267 8904
CP066598.1 1305|1305 6064
CP066629.1 1322|1322 5702
CP066645.1 1290|1290 5802
CP066659.1 1332|1332 7053
CP066667.1 1309|1309 5789
'gbbct530'
CP064343.1
1310|1310 7467
'gbbct232'
CP023561.1
1360|1360 4134
CP023563.1 1659|1659 7409
CP023565.1 1092|1092 5260
CP023566.1 1032|1032 5278
'gbbct542'
CP066060.1
1296|1296 6581
CP066065.1 834|834 1510
CP066078.1 1051|1051 6463
'gbbct440'
CP050993.1
1602|1602 9593
CP051004.1 940|940 2840
CP051005.1 1398|1398 5708
CP051130.1 1465|1466 485
CP051144.1 1451|1451 3260
CP051147.1 1303|1303 6188
CP051148.1 1139|1139 5970
CP051151.1 826|826 684
CP051153.1 1259|1259 3628
CP051162.1 1595|1595 13797
CP051165.2 1367|1367 3653
CP051169.1 1846|1846 8455
'gbbct184'
CP017696.1
780|781 4797
CP017706.1 960|961 14414
CP017730.1 448|448 737
CP017731.1 448|448 736
CP017732.1 450|450 172
CP017733.1 450|450 253
CP017737.1 441|441 3062
CP017738.1 442|442 3276
CP017739.1 443|443 3204
CP017740.1 451|451 603
CP017741.1 450|450 666
CP017742.1 451|451 605
CP017743.1 447|447 250
CP017745.1 452|452 278
CP017746.1 452|452 214
CP017767.1 1056|1056 2913
CP017774.1 1629|1629 1774
'gbbct601'
CP074438.1
1499|1499 24549
CP074441.1 608|608 738
CP074573.1 1460|1460 4833
CP074575.1 1030|1030 1030
'gbbct337'
CP035806.1
1568|1568 7178
CP035807.1 1530|1530 6196
CP035810.1 1491|1491 6243
CP035897.1 1045|1045 10293
CP035914.1 1344|1344 259
'gbbct511'
CP061169.1
1632|1632 2318
CP061176.1 1778|1778 5215
CP061202.1 1643|1643 5301
CP061208.1 945|945 77
CP061274.1 1406|1406 4781
CP061280.1 1070|1070 1327
CP061285.1 1018|1018 680
CP061287.1 1136|1136 408
CP061289.1 1107|1107 655
CP061291.1 906|906 516
CP061292.1 1029|1029 305
CP061293.1 943|943 381
CP061294.1 940|940 702
CP061295.1 1065|1065 552
CP061296.1 910|910 2384
CP061299.1 909|909 578
CP061302.1 1050|1050 972
CP061304.1 967|967 918
'gbbct655'
CP083539.1
1521|1521 1415
CP083553.1 1871|1871 3238
CP083560.1 1849|1849 2043
CP083564.1 1578|1578 922
CP083571.1 1545|1545 1302
CP083592.1 1082|1082 2481
CP083594.1 1075|1075 2058
CP083595.1 1131|1131 2398
CP083598.1 1119|1119 2189
CP083602.1 1121|1121 1976
CP083604.1 1120|1120 2030
CP083608.1 1075|1075 2951
CP083628.1 1732|1732 3106
'gbbct336'
CP035544.1
1375|1376 13076
CP035632.1 1380|1381 11418
CP035643.1 1040|1040 4297
CP035648.1 1392|1392 8453
CP035654.1 1388|1388 10101
CP035660.1 1355|1355 13622
CP035666.1 1439|1439 12843
CP035670.1 1361|1361 22911
CP035671.1 1414|1414 9993
CP035730.1 1349|1349 10015
'gbbct36'
AP024371.1
413|413 73
AP024372.1 415|415 73
AP024373.1 415|415 73
AP024374.1 414|414 74
AP024375.1 412|412 73
AP024391.1 415|415 73
AP024392.1 414|414 73
AP024393.1 412|412 73
AP024394.1 413|413 73
AP024395.1 414|414 73
AP024396.1 414|414 74
AP024397.1 414|414 74
AP024398.1 413|413 73
AP024399.1 413|413 73
AP024400.1 410|410 76
AP024401.1 406|406 70
AP024480.1 1085|1085 273
AP024505.1 187|187 44
AP024515.1 1470|1470 179
AP024516.1 34|34 1
AP024517.1 22|22 1
'gbbct97'
CP007503.1
889|889 117
CP007504.1 972|972 165
CP007514.1 1376|1376 645
CP007519.1 841|841 211
CP007520.1 350|350 1329
CP007521.1 303|303 923
CP007522.1 763|763 101
CP007524.1 1131|1131 273
CP007525.1 1461|1461 283
CP007526.1 1469|1469 285
CP007535.2 1692|1692 108
CP007536.1 1515|1515 528
CP007537.1 945|945 111
CP007541.1 344|344 84
CP007542.1 1689|1689 393
'gbbct218'
CP021885.1
1210|1210 10372
CP021886.1 877|877 7860
CP021912.1 1506|1506 1552
CP021976.1 1827|1827 5215
CP021987.1 259|259 3498
CP021988.1 301|301 5350
CP021989.1 231|231 10882
CP021990.1 307|307 4774
CP021991.1 300|300 6073
CP021992.1 1729|1729 3715
CP021996.1 489|489 4615
'gbbct465'
CP054256.1
446|446 16751
CP054257.1 1149|1149 2873
CP054306.1 1673|1673 10304
'gbbct578'
CP071445.1
1384|1384 1760
CP071459.1 828|828 3688
CP071460.1 821|821 5090
CP071463.1 1736|1736 4752
CP071487.1 1050|1050 9242
CP071489.1 1033|1033 11891
CP071490.1 1091|1091 8849
CP071491.1 1291|1291 13349
CP071505.1 1174|1174 8985
CP071508.1 1178|1178 8983
CP071512.1 1177|1177 9174
CP071517.1 1687|1687 9966
CP071518.1 1585|1585 15988
CP071519.1 778|778 6170
CP071527.1 1478|1478 2258
'gbbct333'
CP035267.1
1018|1018 4829
CP035278.1 489|489 71
CP035280.1 1546|1546 4932
CP035281.1 1191|1191 1755
CP035283.1 1470|1470 2887
CP035284.1 1313|1313 5366
CP035298.1 1007|1007 18609
CP035299.1 1111|1111 265
CP035300.1 1569|1569 8852
CP035303.1 497|497 6712
CP035305.1 1619|1619 11026
CP035309.1 1136|1136 2921
'gbbct63'
CP001643.1
1507|1507 340
CP001672.1 1181|1181 107
CP001673.1 1333|1333 330
CP001674.1 1350|1350 136
CP001677.5 554|554 101
CP001678.1 1529|1529 163
CP001680.1 701|701 139
CP001682.1 661|661 102
CP001684.1 1428|1428 200
CP001685.1 1022|1022 99
CP001686.1 1353|1353 485
CP001687.1 1501|1501 118
CP001688.1 1545|1545 140
CP001690.1 1447|1447 198
CP001696.1 781|781 62
CP001698.1 1153|1153 160
CP001706.1 1401|1401 432
CP001707.1 1404|1404 125
CP001708.1 920|920 58
CP001710.1 901|901 195
CP001712.1 1522|1522 219
CP001713.1 552|552 96
CP001719.1 1096|1096 80
CP001721.1 675|675 104
CP001722.1 885|885 121
CP001726.1 1537|1537 212
CP001731.1 1474|1474 390
CP001733.2 946|946 62
CP001734.1 1229|1229 122
CP001742.1 714|714 426
CP001743.1 1513|1513 248