Sequences submitted to GenBank are revised for a number of reasons.
Each revision keeps the original accession number with a version number appended, e.g. .2, .3, etc.
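The accession/version convention can be handled mechanically. The following is a minimal sketch, not a GenBank tool: the function names are hypothetical, the pattern only covers the one- or two-letter accession prefixes used on this page, and a token with no version suffix is assumed to be version 1.

```python
import re
from collections import defaultdict

# One or two letters, five or six digits, then an optional ".N" version suffix.
ACCESSION_RE = re.compile(r"^([A-Z]{1,2}\d{5,6})(?:\.(\d+))?")


def split_accession(token):
    """Split a token such as 'JF742196.2(Nepal)' into ('JF742196', 2).

    The version is None when the token carries no '.N' suffix.
    """
    match = ACCESSION_RE.match(token)
    if match is None:
        raise ValueError(f"not an accession: {token!r}")
    accession, version = match.groups()
    return accession, int(version) if version else None


def latest_versions(tokens):
    """Map each accession to the highest version seen among the tokens."""
    latest = defaultdict(int)
    for token in tokens:
        accession, version = split_accession(token)
        # Assumption: an unversioned token is treated as version 1.
        latest[accession] = max(latest[accession], version or 1)
    return dict(latest)


records = ["GU990521.3", "GU990521.2", "GU990521.1",
           "JF742196.2(Nepal)", "JF742196.1(Nepal)"]
print(latest_versions(records))
```

Grouping by the bare accession makes it easy to see which records have been revised more than once, such as GU990521 above.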
JF742196.2(Nepal) Wang_H Haplogroup R8a1a1a 24-JAN-2013 and 6 others   Corrected sequences
JF742196.1(Nepal) Wang_H Haplogroup R8a1a1a 31-MAR-2012

AY882385.2(Yemen) Achilli-Rengo Haplogroup U3b1a 11-JAN-2013   Corrected sequence
AY882385.1(Yemen) Achilli-Rengo Haplogroup U3b1a 15-APR-2005

JX266265.2 Mielnik-Sikorska Haplogroup L2a1l2a1 27-AUG-2012   Extra bytes removed
JX266265.1 Mielnik-Sikorska Haplogroup L2a1l2a1 20-AUG-2012

JX266266.2 Mielnik-Sikorska Haplogroup G2a 27-AUG-2012   Extra bytes removed
JX266266.1 Mielnik-Sikorska Haplogroup G2a 20-AUG-2012

JX266267.2 Mielnik-Sikorska Haplogroup A 27-AUG-2012   Extra bytes removed
JX266267.1 Mielnik-Sikorska Haplogroup A 20-AUG-2012

GQ214520.2(New Zealand) Haplogroup U5a2a 08-AUG-2012 and 7 others   Corrected sequences submitted
GQ214520.1(New Zealand) Haplogroup U5a2a 07-JUN-2010

AY275527.2 Cabrera Haplogroup U6b 18-JUL-2012
AY275527.1 Cabrera Haplogroup U6b 20-OCT-2003 and 5 others   Corrected sequences submitted

GQ398480.2(Ecuador) Cardoso A2 16-MAR-2012   Extra mutation removed
GQ398480.1(Ecuador) Cardoso A2 01-DEC-2009 and 10 others

JQ611709.2 FTDNA Haplogroup U8a 13-FEB-2012   Corrected sequence submitted
JQ611709.1 FTDNA Haplogroup U8a 12-FEB-2012

HQ593807.3(Italian) Zaragoza 08-NOV-2011   Extra bytes removed
HQ593807.2(Italian) Zaragoza 13-JAN-2011

HQ593810.3(Italian) Zaragoza 08-NOV-2011   Extra bytes removed
HQ593810.2(Italian) Zaragoza 13-JAN-2011

HQ840646.2(Scottish) FTDNA T2b 24-JAN-2011   Corrected sequence submitted
HQ840646.1(Scottish) FTDNA T2b 09-JAN-2011

HQ593808.2(Italian) Zaragoza 13-JAN-2011 and 5 others   Corrected sequences submitted
HQ593808.1(Italian) Zaragoza 09-JAN-2011

HM575425.2(Sweden) FTDNA K1a10 14-JUL-2010   Corrected sequence submitted
HM575425.1(Sweden) FTDNA K1a10 05-JUL-2010

HM130562.2(German) FTDNA Haplogroup U5b2a2b1 25-OCT-2010   Mutations were missed
HM130562.1(German) FTDNA Haplogroup U5b2 04-MAY-2010

GU990521.3 FTDNA Haplogroup U9 20-APR-2010   Now correct
GU990521.2 FTDNA Haplogroup U9 12-APR-2010   Mutation corrected, but fresh error added
GU990521.1 FTDNA Haplogroup U9 20-MAR-2010

GU004259.2 FTDNA Haplogroup T2a1a 09-FEB-2010   '524.1C' error corrected
GU004259.1 FTDNA Haplogroup T2a1a 24-OCT-2009

GQ983100.2 Santoro H5 13-OCT-2010   Sequence revised
GQ983100.1 Santoro H5 28-SEP-2010

GQ983072.2 Santoro H5 13-OCT-2010   Sequence revised
GQ983072.1 Santoro H5 28-SEP-2010

GQ214521.2(Fiji) Corser Q2a 02-SEP-2010   'G16569-' mutation introduced
GQ214521.1(Fiji) Corser Q2a 07-JUN-2010

GQ214522.2(Fiji) Corser Q2a 02-SEP-2010   New mutations added
GQ214522.1(Fiji) Corser Q2a 07-JUN-2010

GQ214523.2(Kiribati) Corser B4a1a1a1 02-SEP-2010   Spurious mutations removed
GQ214523.1(Kiribati) Corser B4a1a1a1 07-JUN-2010

GQ214526.2(New Guinea) Corser Q2 02-SEP-2010   Spurious mutations removed
GQ214526.1(New Guinea) Corser Q2 07-JUN-2010

GQ200592.2 FTDNA Haplogroup U4a3 16-FEB-2010   '524.1C' error corrected
GQ200592.1 FTDNA Haplogroup U4a3 03-JUN-2009

GQ129176.2 Pala Haplogroup U5b3e 23-JUN-2009   Revised sequence (minor)
GQ129176.1 Pala Haplogroup U5b3e 17-JUN-2009   '3107n'

FJ968772.2 Zhao Haplogroup M9 20-NOV-2009 and 3 others   Extra base at '16570' removed
FJ968772.1 Zhao Haplogroup M9 16-NOV-2009

FJ770972.2(Nepal) Fornarino Haplogroup M3c1a 20-AUG-2009   Correction made, but sequence incomplete
FJ770972.1(Nepal) Fornarino Haplogroup M3c1a 10-AUG-2009

FJ770948.2(India) Fornarino Haplogroup M53 20-AUG-2009   Assembly error corrected
FJ770948.1(India) Fornarino Haplogroup M53 10-AUG-2009

FJ878777.2 FTDNA Haplogroup T1b 10-APR-2009   Mutation corrected
FJ878777.1 FTDNA Haplogroup T1b 08-APR-2010

FJ544236.2 Zhao Haplogroup M9 20-NOV-2009   Revised sequence (minor)
FJ544236.1 Zhao Haplogroup M9 16-NOV-2009   '3106-7' 'CC' error

FJ527772.2 Alvarez Haplogroup H2a5 12-JUN-2009 and 7 others   Revised sequences (minor)
FJ527772.1 Alvarez Haplogroup H2a5 20-APR-2009   '3106-7' 'CC' error

FJ467993.2(India) Thangaraj2009 Haplogroup R8 18-SEP-2009   Missing mutation added
FJ467993.1(India) Thangaraj2009 Haplogroup R8 30-AUG-2009

FJ467952.2(India) Thangaraj2009 Haplogroup R8 18-SEP-2009   Missing mutation added
FJ467952.1(India) Thangaraj2009 Haplogroup R8 30-AUG-2009

FJ460547.2(Tunisia) Costa Haplogroup HV1 12-JAN-2010   Missing mutation added
FJ460547.1(Tunisia) Costa Haplogroup HV1 31-DEC-2008

FJ449703.2 FTDNA Haplogroup W6 04-FEB-2010   '524.1C' error corrected
FJ449703.1 FTDNA Haplogroup W6 26-NOV-2008

FJ445408.2 FTDNA Haplogroup J2b 05-MAR-2009   Submission error (minor)
FJ445408.1 FTDNA Haplogroup J2b 18-NOV-2008   'Data entry error'

FJ441666.2(France) FTDNA Haplogroup X2b 01-JUN-2009   Submission error (minor)
FJ441666.1(France) FTDNA Haplogroup X2b 18-NOV-2008   'Data entry error'

FJ004829.2(Ori74) Chaubey Haplogroup R5a2b2 17-SEP-2008   Revised sequence (major)
FJ004829.1(Ori74) Chaubey Haplogroup R5a2b2 24-AUG-2008   'Confusion over sequence'

EU872029.2(India) Bhat Haplogroup U2 18-FEB-2009 and 3 others   Record now withdrawn
EU872029.1(India) Bhat Haplogroup U2 30-SEP-2008   'Contamination'

EU744541.2 FTDNA Haplogroup H* 14-JUL-2008   Submission error (minor)
EU744541.1 FTDNA Haplogroup H* 03-JUN-2008   'Data entry error'

EU725607.2(Inuit) Gilbert Haplogroup A2 22-JAN-2009 and 14 others   Revised sequences (minor)
EU725607.1(Inuit) Gilbert Haplogroup A2 30-MAY-2008

EU719115.2 FTDNA Haplogroup H2a2b1 10-FEB-2010   '524.1C' error corrected
EU719115.1 FTDNA Haplogroup H2a2b1 21-MAY-2008

EU682506.2 FTDNA Haplogroup U5b2 30-SEP-2010   '524.1C' error corrected
EU682506.1 FTDNA Haplogroup U5b2 13-MAY-2008

EU682394.2 FTDNA Haplogroup T2b 10-FEB-2010   '524.1C' error corrected
EU682394.1 FTDNA Haplogroup T2b 13-MAY-2008

EU677750.2 FTDNA Haplogroup H5 09-FEB-2010   '524.1C' error corrected
EU677750.1 FTDNA Haplogroup H5 04-MAY-2008

EU573192.2 FTDNA Haplogroup T2a1a 09-FEB-2010   '524.1C' error corrected
EU573192.1 FTDNA Haplogroup T2a1a 06-APR-2008

EU547188.2(Poland) FTDNA Haplogroup L2a1 18-MAR-2008   Submission error (minor)
EU547188.1(Poland) FTDNA Haplogroup L2a1 16-MAR-2008   'Data entry error'

EU545451.3(Russia) Grzybowski Haplogroup U4a1a 19-OCT-2010 and 9 others   Revised sequences
EU545451.2(Russia) Grzybowski Haplogroup U4a1a 14-OCT-2010
EU545451.1(Russia) Grzybowski Haplogroup U4a1a 22-JUL-2008

EU545415.2(Belarus) Grzybowski Haplogroup U4b1a 14-OCT-2010 and 25 others   Revised sequences
EU545415.1(Belarus) Grzybowski Haplogroup U4b1a 22-JUL-2008

EU482374.2(Tubalar) Volodko Haplogroup A 28-NOV-2008   Revised sequence (major)
EU482374.1(Tubalar) Volodko Haplogroup A 14-MAY-2008

EU445683.2(Italy) Brisighelli Haplogroup U7a2a 12-JUN-2009 and 8 others   Revised sequences (minor)
EU445683.1(Italy) Brisighelli Haplogroup U7a2a 31-JAN-2009   '3106-7' 'CC' error

EU443605.2 FTDNA Haplogroup H2a2b1 30-NOV-2009   Heteroplasmy A73R added
EU443605.1 FTDNA Haplogroup H2a2b1 20-FEB-2008

EU431080.2(USA) Achilli Haplogroup A2 01-AUG-2008   Revised sequence (major)
EU431080.1(USA) Achilli Haplogroup A2 18-MAR-2008

EU156036.2 FTDNA Haplogroup H10 05-FEB-2010   '524.1C' error corrected
EU156036.1 FTDNA Haplogroup H10 24-SEP-2007

EU130575.2 FTDNA Haplogroup H2a2 17-SEP-2007   Revised sequence
EU130575.1 FTDNA Haplogroup H2a2 12-SEP-2007

EU095548.2(Waunana) Tamm Haplogroup B2 20-MAR-2008   Revised sequence (minor)
EU095548.1(Waunana) Tamm Haplogroup B2 10-SEP-2007

EU095545.2(Kogui) Tamm Haplogroup A2 20-MAR-2008   Revised sequence (minor)
EU095545.1(Kogui) Tamm Haplogroup A2 10-SEP-2007

EU095535.2(Coreguaje) Tamm Haplogroup B2 20-MAR-2008   Revised sequence (minor)
EU095535.1(Coreguaje) Tamm Haplogroup B2 10-SEP-2007

EF452295.2 FTDNA Haplogroup U4b2a 03-FEB-2010   '524.1C' error corrected
EF452295.1 FTDNA Haplogroup U4b2a 05-MAR-2007

EF397754.2 FTDNA Haplogroup U5a2d 30-NOV-2009   Heteroplasmy G8155R added
EF397754.1 FTDNA Haplogroup U5a2d 14-FEB-2007

EF222024.1 FTDNA 27-JAN-2007   Withdrawn - a duplicate

EF061150.2(PNG) Friedlaender Haplogroup E1b 31-AUG-2009   Changed to 3106n
EF061150.1(PNG) Friedlaender Haplogroup E1b 28-FEB-2007

EF060364.2 La Morgia Haplogroup U4a 20-AUG-2009   Extra part removed
EF060364.1 La Morgia Haplogroup U4a 31-OCT-2007

DQ830736.2 FTDNA Haplogroup K1c2 18-AUG-2008   Revised sequence (minor)
DQ830736.1 FTDNA Haplogroup K1c2 08-JUL-2006

DQ408672.2(Karnataka) Thangaraj Haplogroup M34 25-APR-2006 and 8 others   Revised sequences
DQ408672.1(Karnataka) Thangaraj Haplogroup M34 07-MAR-2006

DQ404440.3(Australia) Pellekaan Haplogroup S1 27-SEP-2006 and 7 others   Revised sequences
DQ404440.1(Australia) Pellekaan Haplogroup S1 10-APR-2006

DQ358973.2 Detjen Haplogroup J1c1 06-MAR-2006 and 4 others   Revised sequences
DQ358973.1 Detjen Haplogroup J1c1 24-JAN-2006

DQ341068.2(Ethiopia) Torroni Haplogroup L3i 05-MAY-2009   Revised sequence
DQ341068.1(Ethiopia) Torroni Haplogroup L3i 23-JUN-2006

DQ112686.2(Dominican Rep) Kivisild Haplogroup L0 18-OCT-2006 and 276 others   Revised sequences
DQ112686.1(Dominican Rep) Kivisild Haplogroup L0 11-JUL-2005

AY963586.3(Italy) Bandelt Haplogroup I3a 29-JUN-2009   Revised sequence
AY963586.1(Italy) Bandelt Haplogroup I3a 16-JUN-2005

AY963573.2(China) Macaulay Haplogroup D4 20-SEP-2005 and 12 others   Revised sequences
AY963573.1(China) Macaulay Haplogroup D4 18-MAY-2005

AY956413.2(PNG) Friedlaender Haplogroup Q2b 31-AUG-2009   Spurious mutation deleted
AY956413.1(PNG) Friedlaender Haplogroup Q2b 18-MAY-2005

AY950289.2(Andaman) Kumarsamy Haplogroup F1 23-MAY-2005 and 10 others   Revised sequences
AY950289.1(Andaman) Kumarsamy Haplogroup F1 20-MAY-2005

AY882391.2(Pakistan) Achilli-Rengo Haplogroup U7a 20-AUG-2009   Mutation added
AY882391.1(Pakistan) Achilli-Rengo Haplogroup U7a 15-APR-2005

AY615360.2(Tofalar) Starikovskaya Haplogroup C 02-JUN-2005   Revised sequence
AY615360.1(Tofalar) Starikovskaya Haplogroup C 01-JUN-2004

AY519484.2(Buriat) Starikovskaya Haplogroup B 02-JUN-2005 and 12 others   Revised sequences
AY519484.1(Buriat) Starikovskaya Haplogroup B 17-JAN-2004

AY495105.2(European) Coble Haplogroup H7 07-JAN-2008 and 6 others   Revised sequences
AY495105.1(European) Coble Haplogroup H7 06-FEB-2004

AY339409.2(Finland) Moilanen Haplogroup H13 10-OCT-2007 and 61 others   Revised sequences
AY339409.1(Finland) Moilanen Haplogroup H13 31-AUG-2003

AY255134.2(Chinese) Kong Haplogroup D4j 04-OCT-2006   Revised sequence
AY255134.1(Chinese) Kong Haplogroup D4j 17-JUL-2003

AY195745.2(Caucasian) Mishmar Haplogroup T2b 10-JUN-2004 and 34 others   Revised sequences
AY195745.1(Caucasian) Mishmar Haplogroup T2b 09-APR-2003

AM260602.2 Annunen-Rasila Haplogroup H13 08-AUG-2006   Revised sequence (minor)
AM260602.1 Annunen-Rasila Haplogroup H13 01-AUG-2006

AF381984.2(Morocco) Maca-Meyer Haplogroup M1 19-JUN-2006 and 4 others   Revised sequences
AF381984.1(Morocco) Maca-Meyer Haplogroup M1 28-DEC-2001

OTHER PROBLEMS:

A) LENGTH MISTAKES:

The following sequences all have length mistakes:

AP008336 TCsq0077(Japanese) Tanaka Haplogroup M7 16-JUL-2005 - Sequence is 1 base too long. "g"
AP008866 JDsq0048(Japanese) Tanaka Haplogroup D 16-JUL-2005 - Sequence is 8 bases too long. "GATCACAG"
DQ523681(Sardinia) Fraumene Haplogroup H1 03-OCT-2006 - Sequence is one base too short.
EF060364 La Morgia Haplogroup U4a 31-OCT-2007 CORRECTED 20-AUG-2009 - Sequence is 51 bases too long. "gatcacaggt ctatcaccct attaaccact cacgggagct ctccatgcat t"
EF556162 Behar2008 Haplogroup H* 22-APR-2008 - Sequence is one base too long. "g"
EU742151 Feder Haplogroup N1b2 22-JUN-2008 - Sequence is 4 bases too long. "GATC"
GQ895152 Qin Haplogroup A4 30-MAR-2010 - Sequence is 10 bases too short.
FJ748746 Ji Haplogroup .. 01-JUL-2010 - Sequence is 1 base too short.
HQ593807.2(Italian) Zaragoza 13-JAN-2011   Corrected 08-NOV-2011
HQ593810.2(Italian) Zaragoza 13-JAN-2011   Have many extra bases

B) DUPLICATED SEQUENCES

The following sequences have been duplicated by 'Hartmann', having been previously published by 'Kivisild'.

DQ112773.2(Brazil) Kivisild Haplogroup D1 18-OCT-2006
EU597510(Karitiana, Brazil HGDP01000) Hartmann Haplogroup D1 06-APR-2008

DQ112952.2(Asia) Kivisild Haplogroup M2 18-OCT-2006
EU597516(Sindhi, Pakistan HGDP00167) Hartmann Haplogroup M2 06-APR-2008

DQ112784.2(Asia) Kivisild Haplogroup M* 18-OCT-2006
EU597554(Cambodia HGDP00714) Hartmann Haplogroup M 06-APR-2008

DQ112765.3(Pakistan) Kivisild Haplogroup U9 18-OCT-2006
EU597540(Pathan, Pakistan HGDP00214) Hartmann Haplogroup U9b 06-APR-2008

DQ112790.2(America) Kivisild Haplogroup B 18-OCT-2006
EU597569(Colombia HGDP00709) Hartmann Haplogroup B 06-APR-2008

DQ112791.2(America) Kivisild Haplogroup B 18-OCT-2006
EU597580(Colombia HGDP00710) Hartmann Haplogroup B 06-APR-2008

DQ112885.2(Oceania) Kivisild Haplogroup Q1 18-OCT-2006
DQ112886.2(Oceania) Kivisild Haplogroup Q1 18-OCT-2006
DQ112887.2(Oceania) Kivisild Haplogroup Q1 18-OCT-2006
EU597543(Melanesia HGDP00789) Hartmann Haplogroup Q1 06-APR-2008

C) PROTEIN LENGTH DIFFERENCES

The mtDNA has 13 genes for producing proteins:

name   coding area   length  amino acids                       comment
NAD1   3307-4260      954    318 codons + 'T' at 4261          Stops with 'TAA'
NAD2   4470-5510     1041    347 codons + 'T' at 5511          Stops with 'TAG'
COX1   5904-7442     1539    513 codons + 'AGA' at 7443-7445   Stops with 'AGA'
COX2   7586-8266      681    227 codons + 'T' at 8267          Stops with 'TAG', 1.5% 'TAA'
ATP8   8366-8569      204     68 codons + 'T' at 8570          Stops with 'TAG', ATP6 from 8527 onwards
ATP6   8527-9204      678    226 codons + 'T' at 9205
COX3   9207-9989      783    261 codons + 'T' at 9990
NAD3   10059-10403    345    115 codons + 'T' at 10404
NAD4L  10470-10763    294     98 codons + 'T' at 10764
NAD4   10760-12136   1377    459 codons + 'T' at 12137
NAD5   12337-14145   1809    603 codons + 'T' at 14146         98.5% TAA, 1.5% TAG
NAD6   14673-14152    522    174 codons + 'T' at 14151         Reversed & complemented
CYTB   14747-15886   1140    380 codons + 'T' at 15887

Code letters for amino acids:

A - Alanine (Ala)
C - Cysteine (Cys)
D - Aspartic Acid (Asp)
E - Glutamic Acid (Glu)
F - Phenylalanine (Phe)
G - Glycine (Gly)
H - Histidine (His)
I - Isoleucine (Ile)
K - Lysine (Lys)
L - Leucine (Leu)
M - Methionine (Met)
N - Asparagine (Asn)
P - Proline (Pro)
Q - Glutamine (Gln)
R - Arginine (Arg)
S - Serine (Ser)
T - Threonine (Thr)
V - Valine (Val)
W - Tryptophan (Trp)
Y - Tyrosine (Tyr)

Each protein should conform to the expected pattern of codons. However, a number of GenBank sequences give different protein lengths. Most of these result from sequencing errors, but some changes are physiological. The mutations and GenBank sequences with protein length differences are:

Mutations    Gene        GenBank sequence                   Author           Error or Physiological
---------    ----        ----------------                   ------           ----------------------
3307.1       NAD1        EU431080.2(USA)                    Achilli          Physiological
                         GQ377757(Canada)                   FTDNA            "
T3308C       NAD1        69 sequences e.g. AF346986(Ibo)    Ingman           Physiological
3312.1       NAD1        EF657310 mtDNA170(Asia)            Herrnstadt       Error
3571.1       NAD1        EF660993(Italy)                    Gasparre         Error
4511.1       NAD2        EU443512                           Kumar            Error
5436-        NAD2        JF742208                           Wang_H           Error
5436.1       NAD2        DQ246818                           Rajkumar         Error
6077insGTC   COX1        FJ625852                           Cerny            Physiological ?
G6322-       COX1        EU443478                           Kumar            Error
G7444A       COX1        AF347006(Saami) V7                 Ingman           Physiological ?
                         AM260606-AM260612 V7               Annunen-Rasila
                         AP009431(Japan) D4e2c              Kazuno
                         AP010714(Japan) D4e2               Rabadan
                         AP010747(Japan) M7a1a1a            Rabadan
                         AY339446-AY339450 V7               Moilanen
                         AY922286 M                         Sun
                         DQ112737.2 L1b                     Kivisild
                         DQ112936.2(Europe) V7              Kivisild
                         DQ282408(Hispanic) A2              Parsons
                         DQ282507(Hispanic) L3e             Parsons
                         EF184618-EF184619 L2a              Gonder
                         EF657747(Europe) H                 Herrnstadt
                         EF657594(Europe) H3                Herrnstadt
                         EU092805(L456) L3f                 Behar
                         EU092893(L554) L1b                 Behar
                         EU567454(Russia) V7                Malyarchuk
                         FJ348217/FJ348223 W                Irene
                         FJ467943(India) R8a                Thangaraj2009
A7445C       COX1        EU482325(Yukaghir) D4j             Volodko          Pathological ?
A7445G       COX1        EU571946(Hungary) U4b1a3           Maasz            Pathological ?
8495.1A      ATP8        GQ214523(Kiribati) B4a1a1a1        Corser           Error
A8508-       "           "                                  "                "
C8488-       ATP8        EF184640(Tanzania)                 Gonder           Error
8527.1A      ATP8/ATP6   EU443477                           Kumar            Physiological
T9205C       ATP6        EU431081                           Achilli          Physiological
                         EU600328                           Shlush           "
T9959-       COX3        EU443476                           Kumar            Error
A10116-      NAD3        EF660930(Italy)                    Gasparre         Error
T10117-      "           "                                  "                "
T10390-      "           KC622235(Khoisan)                  Barbieri         Physiological(?)
T10404-      NAD3        EF184634(Tanzania)                 Gonder           Error
A11038-      NAD4        EF660995(Italy)                    Gasparre         Error
C11085-      NAD4        EF660994(Italy)                    Gasparre         Error
A11086-      "           "                                  "                "
A11376-      NAD4        EU443512                           Kumar            Error
T12338C      NAD5        GQ999958 All F2                    Yu               Physiological
12617.1      NAD5        EU443497                           Kumar            Error
13235.1      NAD5        EF660996(Italy)                    Gasparre         Error
14189.1      NAD6        FJ383217                           Rao              Error
15719.1      CYTB        EU443443                           Kumar            Error
"            "           & EU443444                         Kumar            Error
A15788-      "           EF184610(S. Africa)                Gonder           Error
C15789-      "           "                                  "                "
C15790-      "           "                                  "                "
A15791-      "           "                                  "                "
T15792-      "           "                                  "                "

Discussion:

3307.1 EU431080.2(USA) Achilli, GQ377757(Canada) FTDNA
NAD1 mutation - gene starts at Codon 3.
This has the same result as the more common mutation T3308C - GenBank gives this translation: MANLLLLIVPILIAMAFLMLTERKILGYMQLRKGPNVVGPYGLLQPFADAMKLFTK EPLKPATSTITLYITAPTLALTIALLLWTPLPMPNPLVNLNLGLLFILATSSLAVYSI LWSGWASNSNYALIGALRAVAQTISYEVTLAIILLSTLLMSGSFNLSTLITTQEHLWL LLPSWPLAMMWFISTLAETNRTPFDLAEGESELVSGFNIEYAAGPFALFFMAEYTNII MMNTLTTTIFLGTTYDALSPELYTTYFVTKTLLLTSLFLWIRTAYPRFRYDQLMHLLW KNFLPLTLALLMWYVSMPITISSIPPQT And normal CRS is: (All CRS sequences from NC_012920) MPMANLLLLIVPILIAMAFLMLTERKILGYMQLRKGPNVVGPYGLLQPFADAMKLFTK EPLKPATSTITLYITAPTLALTIALLLWTPLPMPNPLVNLNLGLLFILATSSLAVYSI LWSGWASNSNYALIGALRAVAQTISYEVTLAIILLSTLLMSGSFNLSTLITTQEHLWL LLPSWPLAMMWFISTLAETNRTPFDLAEGESELVSGFNIEYAAGPFALFFMAEYTNII MMNTLTTTIFLGTTYDALSPELYTTYFVTKTLLLTSLFLWIRTAYPRFRYDQLMHLLW KNFLPLTLALLMWYVSMPITISSIPPQT ...... T3308C AF346986(Ibo) Ingman NAD1 Gene starts at codon 3: GenBank gives this translation: TPMANLLLLIVPILIAMAFLMLTERKILGYMQLRKGPNVVGPYGLLQPFADAMKLFTK EPLKPATSTITLYITAPTLALTIALLLWTPLPMPNPLVNLNLGLLFILATSSLAVYSI LWSGWASNSNYALIGALRAVAQTISYEVTLAIILLSTLLMSGSFNLSTLITTQEHLWL LLPSWPLAMMWFISTLAETNRTPFDLAEGESELVSGFNIEYAAGPFALFFMAEYTNII MMNTLTTTIFLGTTYDALSPELYTTYFVTKTLLLTSLFLWIRTAYPRFRYDQLMHLLW KNFLPLTLALLMWYVSMPITISSIPPQT And normal CRS is: MPMANLLLLIVPILIAMAFLMLTERKILGYMQLRKGPNVVGPYGLLQPFADAMKLFTK EPLKPATSTITLYITAPTLALTIALLLWTPLPMPNPLVNLNLGLLFILATSSLAVYSI LWSGWASNSNYALIGALRAVAQTISYEVTLAIILLSTLLMSGSFNLSTLITTQEHLWL LLPSWPLAMMWFISTLAETNRTPFDLAEGESELVSGFNIEYAAGPFALFFMAEYTNII MMNTLTTTIFLGTTYDALSPELYTTYFVTKTLLLTSLFLWIRTAYPRFRYDQLMHLLW KNFLPLTLALLMWYVSMPITISSIPPQT ...... 3312.1 EF657310 mtDNA170(Asia) NAD1 - erroneous at codon 3. GenBank does not give a translation. ...... 3571.1 EF660993(Italy) Gasparre NAD1 - erroneous at codon 89. GenBank does not give a translation. ...... 4511.1 EU443512 Kumar NAD2 - Erroneous at codon 14. 
- GenBank gives this translation: MPWPNPSSTLPSFAGTLITALSSHWFFTWVGLEMNMLAFIPVLTKKMNPRSTEAAIK YFLTQATASMILLMAILFNNMLSGQWTMTNTTNQYSSLMIMMAMAMKLGMAPFHFWVP EVTQGTPLTSGLLLLTWQKLAPISIMYQISPSLNVSLLLTLSILSIMAGSWGGLNQTQ LRKILAYSSITHMGWMMAVLPYNPNMTILNLTIYIILTTTAFLLLNLNSSTTTLLLSR TWNKLTWLTPLIPSTLLSLGGLPPLTGFLPKWAIIEEFTKNNSLIIPTIMATITLLNL YFYLRLIYSTSITLLPMSNNVKMKWQFEHTKPTPFLPTLIALTTLLLPISPFMLMIL But CRS is: MNPLAQPVIYSTIFAGTLITALSSHWFFTWVGLEMNMLAFIPVLTKKMNPRSTEAAIK YFLTQATASMILLMAILFNNMLSGQWTMTNTTNQYSSLMIMMAMAMKLGMAPFHFWVP EVTQGTPLTSGLLLLTWQKLAPISIMYQISPSLNVSLLLTLSILSIMAGSWGGLNQTQ LRKILAYSSITHMGWMMAVLPYNPNMTILNLTIYIILTTTAFLLLNLNSSTTTLLLSR TWNKLTWLTPLIPSTLLSLGGLPPLTGFLPKWAIIEEFTKNNSLIIPTIMATITLLNL YFYLRLIYSTSITLLPMSNNVKMKWQFEHTKPTPFLPTLIALTTLLLPISPFMLMIL ...... 5436- JF742208 Wang_h NAD2 - erroneous at codon 323 - GenBank gives this translation: MNPLAQPVIYSTIFAGTLITALSSHWFFTWVGLEMNMLAFIPVLTKKMNPRSTEAAIK YFLTQATASMILLMAILFNNMLSGQWTMTNTTNQYSSLMIMMAMAMKLGMAPFHFWVP EVTQGTPLTSGLLLLTWQKLAPISIMYQISPSLNVSLLLTLSILSIMAGSWGGLNQTQ LRKILAYSSITHMGWMMAVLPYNPNMTILNLTIYIILTTTAFLLLNLNSSTTTLLLSR TWNKLTWLTPLIPSTLLSLGGLPPLTGFLPKWAIIEEFTKNNSLIIPTIMATITLLNL YFYLRLIYSTSITLLPMSNNVKMKWQFEHTKPPHSSPHSSPLPRYSYLSPLLY But CRS is: MNPLAQPVIYSTIFAGTLITALSSHWFFTWVGLEMNMLAFIPVLTKKMNPRSTEAAIK YFLTQATASMILLMAILFNNMLSGQWTMTNTTNQYSSLMIMMAMAMKLGMAPFHFWVP EVTQGTPLTSGLLLLTWQKLAPISIMYQISPSLNVSLLLTLSILSIMAGSWGGLNQTQ LRKILAYSSITHMGWMMAVLPYNPNMTILNLTIYIILTTTAFLLLNLNSSTTTLLLSR TWNKLTWLTPLIPSTLLSLGGLPPLTGFLPKWAIIEEFTKNNSLIIPTIMATITLLNL YFYLRLIYSTSITLLPMSNNVKMKWQFEHTKPTPFLPTLIALTTLLLPISPFMLMIL ..... 5436.1 DQ246818 Rajkumar NAD2 - erroneous at codon 323. 
- GenBank gives this translation: MNPLAQPVIYSTIFAGTLITALSSHWFFTWVGLEMNMLAFIPVLTKKMNPRSTEAAIK YFLTQATASMILLMAILFNNMLSGQWTMTNTTNQYSSLMIMMAMAMKLGMAPFHFWVP EVTQGTPLTSGLLLLTWQKLAPISIMYQISPSLNVSLLLTFSILSIMAGSWGGLNQTQ LRKILAYSSITHMGWMMAVLPYNPNMTILNLTIYIILTTTAFLLLNLNSSTTTLLLSR TWNKLTWLTPLIPSTLLSLGGLPPLTGFLPKWAIIEEFTKNNSLIIPTIMATITLLNL YFYLRLIYSTSITLLPMSNNVKMKWQFEHTKPSPIPPHTHRPYHATPTYLPFYTNNLM EI But CRS is: MNPLAQPVIYSTIFAGTLITALSSHWFFTWVGLEMNMLAFIPVLTKKMNPRSTEAAIK YFLTQATASMILLMAILFNNMLSGQWTMTNTTNQYSSLMIMMAMAMKLGMAPFHFWVP EVTQGTPLTSGLLLLTWQKLAPISIMYQISPSLNVSLLLTLSILSIMAGSWGGLNQTQ LRKILAYSSITHMGWMMAVLPYNPNMTILNLTIYIILTTTAFLLLNLNSSTTTLLLSR TWNKLTWLTPLIPSTLLSLGGLPPLTGFLPKWAIIEEFTKNNSLIIPTIMATITLLNL YFYLRLIYSTSITLLPMSNNVKMKWQFEHTKPTPFLPTLIALTTLLLPISPFMLMIL ....... 6077.1 'GTC' FJ625852 Cerny COX1 - insertion after codon 58 - GenBank gives this translation: MFADRWLFSTNHKDIGTLYLLFGAWAGVLGTALSLLIRAELGQPGNLLGNDHIYNVIVV TAHAFVMIFFMVMPIMIGGFGNWLVPLMIGAPDMAFPRMNNMSFWLLPPSLLLLLASA MVEAGAGTGWTVYPPLAGNYSHPGASVDLTIFSLHLAGVSSILGAINFITTIINMKPP AMTQYQTPLFVWSVLITAVLLLLSLPVLAAGITMLLTDRNLNTTFFDPAGGGDPILYQ HLFWFFGHPEVYILILPGFGMISHIVTYYSGKKEPFGYMGMVWAMMSIGFLGFIVWAH HMFTVGMDVDTRAYFTSATMIIAIPTGVKVFSWLATLHGSNMKWSAAVLWALGFIFLF TVGGLTGIVLANSSLDIVLHDTYYVVAHFHYVLSMGAVFAIMGGFIHWFPLFSGYTLD QTYAKIHFTIMFIGVNLTFFPQHFLGLSGMPRRYSDYPDAYTTWNILSSVGSFISLTA VMLMIFMIWEAFASKRKVLMVEEPSMNLEWLYGCPPPYHTFEEPVYMKS BUT CRS is: MFADRWLFSTNHKDIGTLYLLFGAWAGVLGTALSLLIRAELGQPGNLLGNDHIYNVIV TAHAFVMIFFMVMPIMIGGFGNWLVPLMIGAPDMAFPRMNNMSFWLLPPSLLLLLASA MVEAGAGTGWTVYPPLAGNYSHPGASVDLTIFSLHLAGVSSILGAINFITTIINMKPP AMTQYQTPLFVWSVLITAVLLLLSLPVLAAGITMLLTDRNLNTTFFDPAGGGDPILYQ HLFWFFGHPEVYILILPGFGMISHIVTYYSGKKEPFGYMGMVWAMMSIGFLGFIVWAH HMFTVGMDVDTRAYFTSATMIIAIPTGVKVFSWLATLHGSNMKWSAAVLWALGFIFLF TVGGLTGIVLANSSLDIVLHDTYYVVAHFHYVLSMGAVFAIMGGFIHWFPLFSGYTLD QTYAKIHFTIMFIGVNLTFFPQHFLGLSGMPRRYSDYPDAYTTWNILSSVGSFISLTA VMLMIFMIWEAFASKRKVLMVEEPSMNLEWLYGCPPPYHTFEEPVYMKS ....... G6322- EU443478 Kumar COX1 - erroneous at 140th. 
codon. GenBank does not give a translation. ....... G7444A AF347006(Saami) COX1 - altered STOP from 'AGA' to 'AAA' - GenBank gives this translation: MFADRWLFSTNHKDIGTLYLLFGAWAGVLGTALSLLIRAELGQPGNLLGNDHIYNVIV TAHAFVMIFFMVMPIMIGGFGNWLVPLMIGAPDMAFPRMNNMSFWLLPPSLLLLLASA MVEAGAGTGWTVYPPLAGNYSHPGASVDLTIFSLHLAGVSSILGAINFITTIINMKPP AMTQYQTPLFVWSVLITAVLLLLSLPVLAAGITMLLTDRNLNTTFFDPAGGGDPILYQ HLFWFFGHPEVYILILPGFGMISHIVTYYSGKKEPFGYMGMVWAMMSIGFLGFIVWAH HMFTVGMDVDTRAYFTSATMIIAIPTGVKVFSWLATLHGSNMKWSAAVLWALGFIFLF TVGGLTGIVLANSSLDIVLHDTYYVVAHFHYVLSMGAVFAIMGGFIHWFPLFSGYTLD QTYAKIHFTIMFIGVNLTFFPQHFLGLSGMPRRYSDYPDAYTTWNILSSVGSFISLTA VMLMIFMIWEAFASKRKVLMVEEPSMNLEWLYGCPPPYHTFEEPVYMKSKQK ....... A7445C EU482325(Yukaghir) COX1 - altered STOP from 'AGA' to 'AGC' - GenBank gives this translation: MFADRWLFSTNHKDIGTLYLLFGAWAGVLGTALSLLIRAELGQPGNLLGNDHIYNVIV TAHAFVMIFFMVMPIMIGGFGNWLVPLMIGAPDMAFPRMNNMSFWLLPPSLLLLLASA MVEAGAGTGWTVYPPLAGNYSHPGASVDLTIFSLHLAGVSSILGAINFITTIINMKPP AMTQYQTPLFVWSVLITAVLLLLSLPVLAAGITMLLTDRNLNTTFFDPAGGGDPILYQ HLFWFFGHPEVYILILPGFGMISHIVTYYSGKKEPFGYMGMVWAMMSIGFLGFIVWAH HMFTVGMDVDTRAYFTSATMIIAIPTGVKVFSWLATLHGSNMKWSAAVLWALGFIFLF TVGGLTGIVLANSSLDIVLHDTYYVVAHFHYVLSMGAVFAIMGGFIHWFPLFSGYTLD QTYAKIHFTIMFIGVNLTFFPQHFLGLSGMPRRYSDYPDAYTTWNILSSVGSFISLTA VMLMIFMIWEAFASKRKVLMVEEPSMNLEWLYGCPPPYHTFEEPVYMKSSQK ....... A7445G EU571946(Hungary) COX1 - altered STOP from 'AGA' to 'AGG' - GenBank gives this translation: MFADRWLFSTNHKDIGTLYLLFGAWAGVLGTALSLLIRAELGQPGNLLGNDHIYNVIV TAHAFVMIFFMVMPIMIGGFGNWLVPLMIGAPDMAFPRMNNMSFWLLPPSLLLLLASA MVEAGAGTGWTVYPPLAGNYSHPGASVDLTIFSLHLAGVSSILGAINFITTIINMKPP AMTQYQTPLFVWSVLITAVLLLLSLPVLAAGITMLLTDRNLNTTFFDPAGGGDPILYQ HLFWFFGHPEVYILILPGFGMISHIVTYYSGKKEPFGYMGMVWAMMSIGFLGFIVWAH HMFTVGMDVDTRAYFTSATMIIAIPTGVKVFSWLATLHGSNMKWSAAVLWALGFIFLF TVGGLTGIVLANSSLDIVLHDTYYVVAHFHYVLSMGAVFAIMGGFIHWFPLFSGYTLD QTYAKIHFTIMFIGVNLTFFPQHFLGLSGMPRRYSDYPDAYTTWNILSSVGSFISLTA VMLMIFMIWEAFASKRKVLMVEEPSMNLEWLYGCPPPYHTFEEPVYMKS ....... 
8495.1A A8508- GQ214523(Kiribati) ATP8 - erroneous 5- amino acid block - GenBank gives this translation: MPQLNTTVWPTMITPMLLTLFLITQLKMLNTNYHLPPSPKPMKNKKLYKPWEPKWTKI CSLHSLPPQS which has the stretch (NKKLY): ASN-LYS-LYS-LEU-TYR whilst the CRS has (MKNYN): MET-LYS-ASN-TYR-ASN ....... C8488- EF184640(Tanzania) Gonder ATP8 - erroneous at 41st. codon GenBank does not give a translation. ....... 8527.1A EU443477 affects ATP8 & ATP6. '8527' is 54th codon of ATP8 and 1st.of ATP6 GenBank has ATP8 as: MPQLNTTVWPAMITPMLLTLFLITQLKMLNTNYHLPPSPKPMKMKSYNKPWEPKMNEN LFASFIAPTIL ATP6 is unaffected: MNENLFASFIAPTILGLPAAVLIILFPPLLIPTSKYLINNRLITTQQWLIKLTSKQMM AMHNTKGRTWSLMLVSLIIFIATTNLLGLLPHSFTPTTQLSMNLAMAIPLWAGAVIMG FRSKIKNALAHFLPQGTPTPLIPMLVIIETISLLIQPMALAVRLTANITAGHLLMHLI GSATLAMSTINLPSTLIIFTILILLTILEIAVALIQAYVFTLLVSLYLHDNT But CRS is: ATP8 MPQLNTTVWPTMITPMLLTLFLITQLKMLNTNYHLPPSPKPMKMKNYNKPWEPKWTKI CSLHSLPPQS and ATP6 MNENLFASFIAPTILGLPAAVLIILFPPLLIPTSKYLINNRLITTQQWLIKLTSKQMM TMHNTKGRTWSLMLVSLIIFIATTNLLGLLPHSFTPTTQLSMNLAMAIPLWAGTVIMG FRSKIKNALAHFLPQGTPTPLIPMLVIIETISLLIQPMALAVRLTANITAGHLLMHLI GSATLAMSTINLPSTLIIFTILILLTILEIAVALIQAYVFTLLVSLYLHDNT ....... T9205C EU431081 Achilli & EU600328 Shlush GenBank has ATP6 as: MNENLFASFIAPTILGLPAAVLIILFPPLLIPTSKYLINNRLITTQQWLIKLTSKQMM TMHNTKGRTWSLMLVSLIIFIATTNLLGLLPYSFTPTTQLSMNLAMAIPLWAGAVIMG FRSKIKNALAHFLPQGTPTPLIPMLVIIETISLLIQPMALAVRLTANITAGHLLMHLI GSATLAMSTINLPSTLIIFTILILLTILEIAVALIQAYVFTLLVSLYLHDNTQWPTNH MPIM But CRS is: MNENLFASFIAPTILGLPAAVLIILFPPLLIPTSKYLINNRLITTQQWLIKLTSKQMM TMHNTKGRTWSLMLVSLIIFIATTNLLGLLPHSFTPTTQLSMNLAMAIPLWAGTVIMG FRSKIKNALAHFLPQGTPTPLIPMLVIIETISLLIQPMALAVRLTANITAGHLLMHLI GSATLAMSTINLPSTLIIFTILILLTILEIAVALIQAYVFTLLVSLYLHDNT ....... T9959- EU443476 Kumar COX3 - erroneous at 251st codon. - last 10 amino acids are changed. GenBank gives this translation. 
MTHQSHAYHMVKPSPWPLTGALSALLMTSGLAMWFHFHSMTLLMLGLLTNTLTMYQWW RDVTRESTYQGHHTPPVQKGLRYGMILFITSEVFFFAGFFWAFYHSSLAPTPQLGGHW PPTGITPLNPLEVPLLNTSVLLASGVSITWAHHSLMENNRNQMIQALLITILLGLYFT LLQASEYFESPFTISDGIYGSTFFVATGFHGLHVIIGSTFLTICFIRQLMFHFTSKHH FGFEAAAWYWHFVDVVWLFCMSPSIDE But CRS is: MTHQSHAYHMVKPSPWPLTGALSALLMTSGLAMWFHFHSMTLLMLGLLTNTLTMYQWW RDVTRESTYQGHHTPPVQKGLRYGMILFITSEVFFFAGFFWAFYHSSLAPTPQLGGHW PPTGITPLNPLEVPLLNTSVLLASGVSITWAHHSLMENNRNQMIQALLITILLGLYFT LLQASEYFESPFTISDGIYGSTFFVATGFHGLHVIIGSTFLTICFIRQLMFHFTSKHH FGFEAAAWYWHFVDVVWLFLYVSIYWWGS ....... A10116- EF660930(Italy) Gasparre T10117- EF660930(Italy) Gasparre NAD3 - erroneous at 20th codon. GenBank does not give a translation. ...... T10390- KC622235(Khoisan) Barbieri - GenBank gives this translation: MNFALILMINTLLALLLMIITFWLPQLNGYMEKSTPYECGFDPM SPARVPFSMKFFLVAITFLLFDLEIALLLPLPWALQTTNLPLMV MSSLLLIIILALSLAX (premature 'TAG' STOP) T10404- EF184634(Tanzania) Gonder NAD3 - erroneous at terminal codon. - GenBank gives this translation: MNFALILMINTLLALLLMIITFWLPQLNGYMEKSTPYECGFDPM SPARVPFSMKFFLVAITFLLFDLEIALLLPLPWALQTTNLPLMV MSSLLLIIILALSLAYEWLQKGLDWAE But CRS is: MNFALILMINTLLALLLMIITFWLPQLNGYMEKSTPYECGFDPM SPARVPFSMKFFLVAITFLLFDLEIALLLPLPWALQTTNLPLMV MSSLLLIIILALSLAYEWLQKGLDWTE ...... A11038- EF660995(Italy) Gasparre NAD4 - erroneous at 93rd codon. GenBank does not give a translation. ...... C11085- EF660994(Italy) Gasparre A11086- EF660994(Italy) Gasparre NAD4 - erroneous at 109th codon. GenBank does not give a translation. ...... A11376- EU443512 Kumar NAD4 - erroneous at 206th codon. GenBank does not give a translation. ...... T12338C - GQ999958 Yu NAD5 - error in 1st codon. Leads to the peptide being 2 codons shorter, as it starts at codon 3. Possibly a LHON mutation! 
GenBank gives this translation: MHTTMTTLTLTSLIPPILTTLVNPNKKNSYPHYVKSIVASTFII SLFPTTMFMCLDQEVIISNWHWATTQTTQLSLSFKLDYFSMMFIPVALFVTWSIMEFS LWYMNSDPNINQFFKYLLIFLITMLILVTANNLFQLFIGWEGVGIMSFLLISWWYARA DANTAAIQAILYNRIGDIGFILALAWFILHSNSWDPQQMALLNANPSLTPLLGLLLAA AGKSAQLGLHPWLPSAMEGPTPVSALLHSSTMVVAGIFLLIRFHPLAENSPLIQTLTL CLGAITTLFAAVCALTQNDIKKIVAFSTSSQLGLMMVTIGINQPHLAFLHICTHAFFK AMLFMCSGSIIHNLNNEQDIRKMGGLLKTMPLTSTSLTIGSLALAGMPFLTGFYSKDH IIETANMSYTNAWALSITLIATSLTSAYSTRMILLTLTGQPRFPTLTNINENNPTLLN PIKRLTAGSLFAGFLITNNISPASPFQTTIPLYLKLTALAVTFLGLLTALDLNYLTNK LKMKSPLCTFYFSNMLGFYPTITHRTIPYLGLLTSQNLPLLLLDLAWLEKLLPKTISQ HQISTSIITSTQKGMIKLYFLSFFFPLILTLLLIT CRS starts: MTMHTTMTTLTLTSLIPPILTTLVNPNKKNSYPHYVKSIVASTF ... ....... 12617.1 EU443497 Kumar NAD5 - erroneous at codon 94 GenBank does not give a translation .......... 13235.1 EF660996(Italy) Gasparre NAD5 - erroneous at codon 300 GenBank does not give a translation. ......... 14189.1 FJ383217 Rao NAD6 - erroneous at codon 162 - GenBank gives this translation: MMYALFLLSVGLVMGFVGFSSKPSPIYGGLVLIVSGVVGCVIIL NFGGGYMGLMVFLIYLGGMMVVFGYTTAMAIEEYPEAWGSGVEV LVSVLVGLAMEVGLVLWVKEYDGVVVVVNFNSVGSWMIYEGEGS GLIREDPIGAGALYDYGRWLVVVTGWTLFVWCMYCNWDCSGE But CRS is: MMYALFLLSVGLVMGFVGFSSKPSPIYGGLVLIVSGVVGCVIIL NFGGGYMGLMVFLIYLGGMMVVFGYTTAMAIEEYPEAWGSGVEV LVSVLVGLAMEVGLVLWVKEYDGVVVVVNFNSVGSWMIYEGEGS GLIREDPIGAGALYDYGRWLVVVTGWTLFVGVYIVIEIARGN ......... 15719.1 EU443443 & EU443444 Kumar CYTB - erroneous at codon 325 GenBank does not offer a translation ...... A15788- EF184610(S. Africa) Gonder C15789- EF184610(S. Africa) Gonder C15790- EF184610(S. Africa) Gonder A15791- EF184610(S. Africa) Gonder T15792- EF184610(S. Africa) Gonder CYTB - erroneous at 348th. codon. GenBank does not give a translation. ................
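The codon positions and frame effects quoted in the discussion follow from simple arithmetic on the coding ranges in section C. The sketch below is illustrative only: the function names are hypothetical, the mutation patterns cover just the notations used on this page, and each dotted insertion entry (such as 5436.1 or 8495.1A) is assumed to add exactly one base.

```python
import re

# Coding ranges (first base, last base) copied from the gene table in section C.
# NAD6 is listed high-to-low because it is read reversed and complemented.
GENES = {
    "NAD1": (3307, 4260), "NAD2": (4470, 5510), "COX1": (5904, 7442),
    "COX2": (7586, 8266), "ATP8": (8366, 8569), "ATP6": (8527, 9204),
    "COX3": (9207, 9989), "NAD3": (10059, 10403), "NAD4L": (10470, 10763),
    "NAD4": (10760, 12136), "NAD5": (12337, 14145), "NAD6": (14673, 14152),
    "CYTB": (14747, 15886),
}

SUBSTITUTION = re.compile(r"^[ACGT]\d+[ACGTRYKMSWN]$")  # e.g. T3308C
DELETION = re.compile(r"^[ACGT]?\d+-$")                 # e.g. G6322- or 5436-
INSERTION_DOT = re.compile(r"^\d+\.\d+[ACGT]*$")        # e.g. 5436.1 or 8495.1A
INSERTION_INS = re.compile(r"^\d+ins([ACGT]+)$")        # e.g. 6077insGTC


def coding_length(gene):
    """Number of bases in the coding range, counting both end points."""
    start, end = GENES[gene]
    return abs(end - start) + 1


def codon_number(gene, position):
    """1-based codon that contains a position, counted from the gene start."""
    start, _ = GENES[gene]
    return abs(position - start) // 3 + 1


def length_change(mutation):
    """Bases added (+) or removed (-) by one mutation entry."""
    if SUBSTITUTION.match(mutation):
        return 0
    if DELETION.match(mutation):
        return -1
    named = INSERTION_INS.match(mutation)
    if named:
        return len(named.group(1))
    if INSERTION_DOT.match(mutation):
        return 1  # assumption: one inserted base per dotted entry
    raise ValueError(f"unrecognised mutation: {mutation!r}")


def keeps_reading_frame(mutations):
    """True when the net length change is a multiple of three."""
    return sum(length_change(m) for m in mutations) % 3 == 0


print(coding_length("NAD1"), codon_number("NAD2", 4511))
print(keeps_reading_frame(["8495.1A", "A8508-"]))
print(keeps_reading_frame(["A15788-", "C15789-", "C15790-", "A15791-", "T15792-"]))
```

Under these assumptions, the paired insertion and deletion in GQ214523 cancel out, which fits the short altered block reported for ATP8, whereas the five consecutive deletions in EF184610 shift the CYTB reading frame.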