BLASTX nr result
ID: Forsythia23_contig00035728
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00035728
         (651 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                    Score   E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_011100484.1| PREDICTED: endonuclease III homolog 1, chlor...   224   2e-56
emb|CDP20302.1| unnamed protein product [Coffea canephora]            211   2e-52
ref|XP_008222537.1| PREDICTED: endonuclease III homolog 1, chlor...   209   8e-52
ref|XP_009344368.1| PREDICTED: endonuclease III homolog 1, chlor...   202   2e-49
ref|XP_009363648.1| PREDICTED: endonuclease III homolog 1, chlor...   202   2e-49
ref|XP_008369130.1| PREDICTED: endonuclease III homolog 1, chlor...   201   2e-49
ref|XP_012830760.1| PREDICTED: endonuclease III homolog 1, chlor...   200   6e-49
gb|EYU42853.1| hypothetical protein MIMGU_mgv1a009936mg [Erythra...   200   6e-49
ref|XP_007034070.1| DNA glycosylase superfamily protein isoform ...   198   2e-48
ref|XP_007034069.1| DNA glycosylase superfamily protein isoform ...   198   2e-48
gb|KHG16754.1| Endonuclease III-like protein 1 [Gossypium arboreum]   196   1e-47
gb|EYU42852.1| hypothetical protein MIMGU_mgv1a009936mg [Erythra...   193   8e-47
ref|XP_012454114.1| PREDICTED: endonuclease III homolog 1, chlor...   192   1e-46
ref|XP_012454117.1| PREDICTED: endonuclease III homolog 1, chlor...   192   1e-46
gb|KJB71883.1| hypothetical protein B456_011G146700 [Gossypium r...   192   1e-46
ref|XP_012454115.1| PREDICTED: endonuclease III homolog 1, chlor...   192   1e-46
ref|XP_002264475.3| PREDICTED: endonuclease III homolog 1, chlor...   192   2e-46
ref|XP_009588403.1| PREDICTED: endonuclease III homolog 1, chlor...   192   2e-46
ref|XP_009781250.1| PREDICTED: endonuclease III homolog 1, chlor...
191 4e-46 ref|XP_007034068.1| DNA glycosylase superfamily protein isoform ... 191 4e-46 >ref|XP_011100484.1| PREDICTED: endonuclease III homolog 1, chloroplastic-like [Sesamum indicum] Length = 359 Score = 224 bits (572), Expect = 2e-56 Identities = 129/205 (62%), Positives = 146/205 (71%), Gaps = 4/205 (1%) Frame = -1 Query: 606 VASPRFIVSFSFNKMPTTRISASKLGMPPSSREPENPAPESSNASSXXXXXXXXXXXXXX 427 V SP+ S SF KM TTR+S++K G S E ENP E++ Sbjct: 16 VTSPKITPSSSFIKMQTTRLSSAK-GQTYPSDERENPGSETATDDCSKKVRVFVRRKRAT 74 Query: 426 KTLEITPTAVEAETRDQKLCSPPEIEDFAFGKDSSYSRLRQPP----PANWEKVLQEIRK 259 KT+ T + E DQK CS PEIEDFA+GKDSS+ PP PANWEKVL+ IR Sbjct: 75 KTVLTTVEKGKHEILDQKPCSLPEIEDFAYGKDSSF-----PPSTHTPANWEKVLEGIRT 129 Query: 258 MRSSEGAPVDSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRLLQNDLLTA 79 MRSSE APVDSMGCEKAGSSLPPKERRFAVL SSLLSSQTKDHVTHGAIQRLLQN+LLTA Sbjct: 130 MRSSENAPVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDHVTHGAIQRLLQNNLLTA 189 Query: 78 EAIDRSDEVTIKNLIYPVGFYTRKA 4 EAI+++DE TIK+LIYPVGFY+RKA Sbjct: 190 EAIEQADEGTIKDLIYPVGFYSRKA 214 >emb|CDP20302.1| unnamed protein product [Coffea canephora] Length = 363 Score = 211 bits (538), Expect = 2e-52 Identities = 124/204 (60%), Positives = 141/204 (69%), Gaps = 9/204 (4%) Frame = -1 Query: 588 IVSFS---FNKMPTTRISASKLGMPPSSREPENPAPESSNASSXXXXXXXXXXXXXXKTL 418 IVSFS KM TR S+ P+S +NP ES N S K+L Sbjct: 18 IVSFSNRTIRKMSQTRFSSKT--QIPTSNHKKNPGDESCNGGSAANIRVFVRKKRAKKSL 75 Query: 417 EITPTA-VEAET-RDQKLCSPPEIEDFAFGKDSSYSRLRQPPPANWEKVLQEIRKMRSSE 244 EI+P V+AE R Q+LCSPP+IEDFA+GK YS Q P NWEKVL+ IR+MRSSE Sbjct: 76 EISPKEEVKAEEPRQQQLCSPPDIEDFAYGKKCGYSYSTQAPE-NWEKVLEGIRRMRSSE 134 Query: 243 GAPVDSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHVTHG----AIQRLLQNDLLTAE 76 APVDSMGCEKAG+SLPPKERRFAVL SSLLSSQTKDHVTHG A+QRLLQN LL + Sbjct: 135 DAPVDSMGCEKAGNSLPPKERRFAVLVSSLLSSQTKDHVTHGKFKCAVQRLLQNGLLNPD 194 Query: 75 AIDRSDEVTIKNLIYPVGFYTRKA 4 A+D ++E TIKNLIYPVGFYTRKA Sbjct: 195 ALDNTEEATIKNLIYPVGFYTRKA 218 >ref|XP_008222537.1| PREDICTED: endonuclease III homolog 1, chloroplastic [Prunus 
mume] Length = 358 Score = 209 bits (533), Expect = 8e-52 Identities = 124/214 (57%), Positives = 145/214 (67%), Gaps = 4/214 (1%) Frame = -1 Query: 633 MFALFRNTPVASPRFIVSFSFNKMPTTRISA--SKLGMPPSSREPENPAPESSNASSXXX 460 MF + ++P I FN+MP TR SA SK +P S EP NP E SN S Sbjct: 1 MFQIRVSSPSTVAFAIGRIQFNRMPKTRFSAIQSKTEIPTS--EP-NPGSEGSNDVSAPE 57 Query: 459 XXXXXXXXXXXKTLEITPTAVEAETRDQKLCSPPEIEDFAFGKDSSYSRLRQP--PPANW 286 KT E+ +E + KL PP+IE+FA+ K S+ + PPANW Sbjct: 58 LRVFTRRKRLKKT-EVQKIHLEVKPHAPKLAEPPDIEEFAYTKVSASTNSIDTGKPPANW 116 Query: 285 EKVLQEIRKMRSSEGAPVDSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHVTHGAIQR 106 EKVL+ IRKMRSSEGAPVDSMGCEKAGS+LPPKERRFAVL SSLLSSQTKDHV HGAIQR Sbjct: 117 EKVLEGIRKMRSSEGAPVDSMGCEKAGSALPPKERRFAVLVSSLLSSQTKDHVNHGAIQR 176 Query: 105 LLQNDLLTAEAIDRSDEVTIKNLIYPVGFYTRKA 4 LLQN+LL+A++ID++DE TIK LIYPVGFYTRKA Sbjct: 177 LLQNNLLSADSIDKADEATIKGLIYPVGFYTRKA 210 >ref|XP_009344368.1| PREDICTED: endonuclease III homolog 1, chloroplastic-like [Pyrus x bretschneideri] Length = 360 Score = 202 bits (513), Expect = 2e-49 Identities = 117/196 (59%), Positives = 136/196 (69%), Gaps = 6/196 (3%) Frame = -1 Query: 573 FNKMPTTRISA----SKLGMPPSSREPENPAPESSNASSXXXXXXXXXXXXXXKTLEITP 406 FN+M TR S SK +P S +P NP E+ +A+ T E+ Sbjct: 21 FNRMSKTRFSVKPIQSKTEIPTSDPDP-NPGSENGSAAELRVFSRRKRVKR---TEEVQK 76 Query: 405 TAVEAETRDQKLCSPPEIEDFAFGK--DSSYSRLRQPPPANWEKVLQEIRKMRSSEGAPV 232 +EA+ QKL P+IE+FA+ K S+ S PPANWEKVL IRKMRSSE APV Sbjct: 77 LQLEAKPIAQKLAVLPDIEEFAYKKVNTSTNSIDTGKPPANWEKVLDGIRKMRSSEDAPV 136 Query: 231 DSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRLLQNDLLTAEAIDRSDEV 52 DSMGCEKAGSSLPPKERRFAVL SSLLSSQTKDHV HGAIQRLLQNDLL+A++ID++DE Sbjct: 137 DSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDHVNHGAIQRLLQNDLLSADSIDKADEA 196 Query: 51 TIKNLIYPVGFYTRKA 4 TIK+LIYPVGFYTRKA Sbjct: 197 TIKSLIYPVGFYTRKA 212 >ref|XP_009363648.1| PREDICTED: endonuclease III homolog 1, chloroplastic [Pyrus x bretschneideri] Length = 371 Score = 202 bits (513), Expect = 2e-49 Identities = 117/196 (59%), Positives = 136/196 (69%), Gaps = 
6/196 (3%) Frame = -1 Query: 573 FNKMPTTRISA----SKLGMPPSSREPENPAPESSNASSXXXXXXXXXXXXXXKTLEITP 406 FN+M TR S SK +P S +P NP E+ +A+ T E+ Sbjct: 32 FNRMSKTRFSVKPIQSKTEIPTSDPDP-NPGSENGSAAELRVFSRRKRVKR---TEEVQK 87 Query: 405 TAVEAETRDQKLCSPPEIEDFAFGK--DSSYSRLRQPPPANWEKVLQEIRKMRSSEGAPV 232 +EA+ QKL P+IE+FA+ K S+ S PPANWEKVL IRKMRSSE APV Sbjct: 88 LQLEAKPIAQKLAVLPDIEEFAYKKVNTSTNSIDTGKPPANWEKVLDGIRKMRSSEDAPV 147 Query: 231 DSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRLLQNDLLTAEAIDRSDEV 52 DSMGCEKAGSSLPPKERRFAVL SSLLSSQTKDHV HGAIQRLLQNDLL+A++ID++DE Sbjct: 148 DSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDHVNHGAIQRLLQNDLLSADSIDKADEA 207 Query: 51 TIKNLIYPVGFYTRKA 4 TIK+LIYPVGFYTRKA Sbjct: 208 TIKSLIYPVGFYTRKA 223 >ref|XP_008369130.1| PREDICTED: endonuclease III homolog 1, chloroplastic [Malus domestica] Length = 360 Score = 201 bits (512), Expect = 2e-49 Identities = 117/197 (59%), Positives = 136/197 (69%), Gaps = 6/197 (3%) Frame = -1 Query: 573 FNKMPTTRISA----SKLGMPPSSREPENPAPESSNASSXXXXXXXXXXXXXXKTLEITP 406 FN+M TR S SK +P S +P NP E+ +A+ T E+ Sbjct: 21 FNRMSKTRFSVKPIQSKTEIPTSDPDP-NPGSENGSAAELRVFSRRKRVKR---TEEVQK 76 Query: 405 TAVEAETRDQKLCSPPEIEDFAFGK--DSSYSRLRQPPPANWEKVLQEIRKMRSSEGAPV 232 +EA+ QKL P+IE+FA+ K S+ S PPANWEKVL IRKMRSSE APV Sbjct: 77 LQLEAKPIAQKLAVLPDIEEFAYKKVNTSTNSIDTGKPPANWEKVLDGIRKMRSSEDAPV 136 Query: 231 DSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRLLQNDLLTAEAIDRSDEV 52 DSMGCEKAGSSLPPKERRFAVL SSLLSSQTKDHV HGAIQRLLQN LL+A++ID++DE Sbjct: 137 DSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDHVNHGAIQRLLQNGLLSADSIDKADEA 196 Query: 51 TIKNLIYPVGFYTRKAG 1 TIK+LIYPVGFYTRKAG Sbjct: 197 TIKSLIYPVGFYTRKAG 213 >ref|XP_012830760.1| PREDICTED: endonuclease III homolog 1, chloroplastic-like [Erythranthe guttatus] Length = 309 Score = 200 bits (508), Expect = 6e-49 Identities = 120/213 (56%), Positives = 144/213 (67%), Gaps = 1/213 (0%) Frame = -1 Query: 639 QNMFALFRNTPVASPRFIVSFSFNKMPTTRIS-ASKLGMPPSSREPENPAPESSNASSXX 463 +N+ +L R T SP+ S S KM TR+S A+K + PS + + +S+ S+ Sbjct: 7 
KNICSLVRLT---SPKIKPSSSSIKMRITRLSSAAKREINPSIQSKNSGCETASDESAQK 63 Query: 462 XXXXXXXXXXXXKTLEITPTAVEAETRDQKLCSPPEIEDFAFGKDSSYSRLRQPPPANWE 283 KT++ + E D+K CS PEIEDFA+G +S SRL + P NWE Sbjct: 64 SVRVYIRKKRSNKTVQPIVEEINPEIIDEKPCSLPEIEDFAYGNGNSVSRLTKLPE-NWE 122 Query: 282 KVLQEIRKMRSSEGAPVDSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRL 103 KVL+ IR MRSSE APVDSMGCEKAGSSLPPKERRFAVL SSLLSSQTKD VTHGAIQRL Sbjct: 123 KVLEGIRTMRSSEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDQVTHGAIQRL 182 Query: 102 LQNDLLTAEAIDRSDEVTIKNLIYPVGFYTRKA 4 L+ DLLTAEAID ++E IK LIYPVGFY+RKA Sbjct: 183 LEKDLLTAEAIDGANEGAIKELIYPVGFYSRKA 215 >gb|EYU42853.1| hypothetical protein MIMGU_mgv1a009936mg [Erythranthe guttata] Length = 319 Score = 200 bits (508), Expect = 6e-49 Identities = 120/213 (56%), Positives = 144/213 (67%), Gaps = 1/213 (0%) Frame = -1 Query: 639 QNMFALFRNTPVASPRFIVSFSFNKMPTTRIS-ASKLGMPPSSREPENPAPESSNASSXX 463 +N+ +L R T SP+ S S KM TR+S A+K + PS + + +S+ S+ Sbjct: 7 KNICSLVRLT---SPKIKPSSSSIKMRITRLSSAAKREINPSIQSKNSGCETASDESAQK 63 Query: 462 XXXXXXXXXXXXKTLEITPTAVEAETRDQKLCSPPEIEDFAFGKDSSYSRLRQPPPANWE 283 KT++ + E D+K CS PEIEDFA+G +S SRL + P NWE Sbjct: 64 SVRVYIRKKRSNKTVQPIVEEINPEIIDEKPCSLPEIEDFAYGNGNSVSRLTKLPE-NWE 122 Query: 282 KVLQEIRKMRSSEGAPVDSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRL 103 KVL+ IR MRSSE APVDSMGCEKAGSSLPPKERRFAVL SSLLSSQTKD VTHGAIQRL Sbjct: 123 KVLEGIRTMRSSEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDQVTHGAIQRL 182 Query: 102 LQNDLLTAEAIDRSDEVTIKNLIYPVGFYTRKA 4 L+ DLLTAEAID ++E IK LIYPVGFY+RKA Sbjct: 183 LEKDLLTAEAIDGANEGAIKELIYPVGFYSRKA 215 >ref|XP_007034070.1| DNA glycosylase superfamily protein isoform 4 [Theobroma cacao] gi|508713099|gb|EOY04996.1| DNA glycosylase superfamily protein isoform 4 [Theobroma cacao] Length = 336 Score = 198 bits (504), Expect = 2e-48 Identities = 113/214 (52%), Positives = 142/214 (66%), Gaps = 4/214 (1%) Frame = -1 Query: 633 MFALFRNTPVASPRFIVSFSFN-KMPTTRISASKLGMPPSSREPE---NPAPESSNASSX 466 M+A+ R+ P+ + N KMP TR++ L ++ P NP E+++ S Sbjct: 1 
MYAVPRSFPLGFGVGLGGMKLNSKMPKTRLAFKTLSSSSTTEVPSSDPNPGSETTDNVSV 60 Query: 465 XXXXXXXXXXXXXKTLEITPTAVEAETRDQKLCSPPEIEDFAFGKDSSYSRLRQPPPANW 286 KT+++ +AE + KLC P+IE+FA+ K S L PANW Sbjct: 61 PAVRVFTRKKRVKKTVDVVQEIPKAENKGLKLCGLPDIEEFAYKKVDGPS-LSGNAPANW 119 Query: 285 EKVLQEIRKMRSSEGAPVDSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHVTHGAIQR 106 EKVL+ IRKMRS+E APVD+MGCEKAGS LPPKERRFAVL SSLLSSQTKDHVTHGAIQR Sbjct: 120 EKVLEGIRKMRSAEDAPVDTMGCEKAGSVLPPKERRFAVLISSLLSSQTKDHVTHGAIQR 179 Query: 105 LLQNDLLTAEAIDRSDEVTIKNLIYPVGFYTRKA 4 L+QN L+T +AID++DE TIK+LIYPVGFYTRKA Sbjct: 180 LIQNCLMTPDAIDKADEATIKDLIYPVGFYTRKA 213 >ref|XP_007034069.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao] gi|508713098|gb|EOY04995.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao] Length = 364 Score = 198 bits (504), Expect = 2e-48 Identities = 113/214 (52%), Positives = 142/214 (66%), Gaps = 4/214 (1%) Frame = -1 Query: 633 MFALFRNTPVASPRFIVSFSFN-KMPTTRISASKLGMPPSSREPE---NPAPESSNASSX 466 M+A+ R+ P+ + N KMP TR++ L ++ P NP E+++ S Sbjct: 1 MYAVPRSFPLGFGVGLGGMKLNSKMPKTRLAFKTLSSSSTTEVPSSDPNPGSETTDNVSV 60 Query: 465 XXXXXXXXXXXXXKTLEITPTAVEAETRDQKLCSPPEIEDFAFGKDSSYSRLRQPPPANW 286 KT+++ +AE + KLC P+IE+FA+ K S L PANW Sbjct: 61 PAVRVFTRKKRVKKTVDVVQEIPKAENKGLKLCGLPDIEEFAYKKVDGPS-LSGNAPANW 119 Query: 285 EKVLQEIRKMRSSEGAPVDSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHVTHGAIQR 106 EKVL+ IRKMRS+E APVD+MGCEKAGS LPPKERRFAVL SSLLSSQTKDHVTHGAIQR Sbjct: 120 EKVLEGIRKMRSAEDAPVDTMGCEKAGSVLPPKERRFAVLISSLLSSQTKDHVTHGAIQR 179 Query: 105 LLQNDLLTAEAIDRSDEVTIKNLIYPVGFYTRKA 4 L+QN L+T +AID++DE TIK+LIYPVGFYTRKA Sbjct: 180 LIQNCLMTPDAIDKADEATIKDLIYPVGFYTRKA 213 >gb|KHG16754.1| Endonuclease III-like protein 1 [Gossypium arboreum] Length = 384 Score = 196 bits (497), Expect = 1e-47 Identities = 115/213 (53%), Positives = 130/213 (61%), Gaps = 22/213 (10%) Frame = -1 Query: 576 SFNKMPTTRISASKLGMPPSSREPENPAPESSNASSXXXXXXXXXXXXXXKTLEITPTAV 397 S NKMP TR S L S + P NP +N S KTL++ Sbjct: 26 
SNNKMPKTRFSVKALSSSSSDQNP-NPGSGLTNNVSLPQVRVFTRKKRLKKTLDVVKENP 84 Query: 396 EAETRDQKLCSPPEIEDFAFGKDSSYSR----------------LRQP------PPANWE 283 + E D CS P+IE+FA+ K +R + P PPANWE Sbjct: 85 KPENEDHNSCSLPDIEEFAYKKVDGPARSGKSKSACDELNVGVGIASPIGVGGKPPANWE 144 Query: 282 KVLQEIRKMRSSEGAPVDSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRL 103 KVL+ IRKMRS E APVD+MGCEKAGS LPPKERRFAVL SSLLSSQTKDHVTHGAIQRL Sbjct: 145 KVLEGIRKMRSLEDAPVDTMGCEKAGSVLPPKERRFAVLVSSLLSSQTKDHVTHGAIQRL 204 Query: 102 LQNDLLTAEAIDRSDEVTIKNLIYPVGFYTRKA 4 QN LLT +AID++DE TIKNLIYPVGFYTRKA Sbjct: 205 SQNCLLTPDAIDKADEATIKNLIYPVGFYTRKA 237 >gb|EYU42852.1| hypothetical protein MIMGU_mgv1a009936mg [Erythranthe guttata] Length = 327 Score = 193 bits (490), Expect = 8e-47 Identities = 120/221 (54%), Positives = 144/221 (65%), Gaps = 9/221 (4%) Frame = -1 Query: 639 QNMFALFRNTPVASPRFIVSFSFNKMPTTRIS-ASKLGMPPSSREPENPAPESSNASSXX 463 +N+ +L R T SP+ S S KM TR+S A+K + PS + + +S+ S+ Sbjct: 7 KNICSLVRLT---SPKIKPSSSSIKMRITRLSSAAKREINPSIQSKNSGCETASDESAQK 63 Query: 462 XXXXXXXXXXXXKTLEITPTAVEAETRDQKL--------CSPPEIEDFAFGKDSSYSRLR 307 KT++ + E D+K CS PEIEDFA+G +S SRL Sbjct: 64 SVRVYIRKKRSNKTVQPIVEEINPEIIDEKQLSSYLTQPCSLPEIEDFAYGNGNSVSRLT 123 Query: 306 QPPPANWEKVLQEIRKMRSSEGAPVDSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHV 127 + P NWEKVL+ IR MRSSE APVDSMGCEKAGSSLPPKERRFAVL SSLLSSQTKD V Sbjct: 124 KLPE-NWEKVLEGIRTMRSSEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDQV 182 Query: 126 THGAIQRLLQNDLLTAEAIDRSDEVTIKNLIYPVGFYTRKA 4 THGAIQRLL+ DLLTAEAID ++E IK LIYPVGFY+RKA Sbjct: 183 THGAIQRLLEKDLLTAEAIDGANEGAIKELIYPVGFYSRKA 223 >ref|XP_012454114.1| PREDICTED: endonuclease III homolog 1, chloroplastic-like isoform X3 [Gossypium raimondii] gi|763804947|gb|KJB71885.1| hypothetical protein B456_011G146700 [Gossypium raimondii] Length = 387 Score = 192 bits (488), Expect = 1e-46 Identities = 114/213 (53%), Positives = 130/213 (61%), Gaps = 22/213 (10%) Frame = -1 Query: 576 SFNKMPTTRISASKLGMPPSSREPENPAPESSNASSXXXXXXXXXXXXXXKTLEITPTAV 397 S NKMP TR SA L S + P NP +N S KTL++ 
Sbjct: 26 SNNKMPKTRFSAKALSSSSSDQNP-NPGSGLTNNVSLPQVRVFTRKKRLKKTLDVVKENP 84 Query: 396 EAETRDQKLCSPPEIEDFAFGKDSSYSR----------------LRQP------PPANWE 283 + E D CS P+IE+FA+ K +R + P PANWE Sbjct: 85 KPENEDHNSCSLPDIEEFAYKKVDGPARSGKLKSACDELNVGVSIASPIGVGGKAPANWE 144 Query: 282 KVLQEIRKMRSSEGAPVDSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRL 103 KVL+ IRKMRS E APVD+MGCEKAGS LPPKERRFAVL SSLLSSQTKDHVTHGAIQRL Sbjct: 145 KVLEGIRKMRSLEDAPVDTMGCEKAGSVLPPKERRFAVLVSSLLSSQTKDHVTHGAIQRL 204 Query: 102 LQNDLLTAEAIDRSDEVTIKNLIYPVGFYTRKA 4 QN LLT +AID++DE TIK+LIYPVGFYTRKA Sbjct: 205 SQNCLLTPDAIDKADEATIKDLIYPVGFYTRKA 237 >ref|XP_012454117.1| PREDICTED: endonuclease III homolog 1, chloroplastic-like isoform X6 [Gossypium raimondii] gi|763804946|gb|KJB71884.1| hypothetical protein B456_011G146700 [Gossypium raimondii] Length = 356 Score = 192 bits (488), Expect = 1e-46 Identities = 114/213 (53%), Positives = 130/213 (61%), Gaps = 22/213 (10%) Frame = -1 Query: 576 SFNKMPTTRISASKLGMPPSSREPENPAPESSNASSXXXXXXXXXXXXXXKTLEITPTAV 397 S NKMP TR SA L S + P NP +N S KTL++ Sbjct: 26 SNNKMPKTRFSAKALSSSSSDQNP-NPGSGLTNNVSLPQVRVFTRKKRLKKTLDVVKENP 84 Query: 396 EAETRDQKLCSPPEIEDFAFGKDSSYSR----------------LRQP------PPANWE 283 + E D CS P+IE+FA+ K +R + P PANWE Sbjct: 85 KPENEDHNSCSLPDIEEFAYKKVDGPARSGKLKSACDELNVGVSIASPIGVGGKAPANWE 144 Query: 282 KVLQEIRKMRSSEGAPVDSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRL 103 KVL+ IRKMRS E APVD+MGCEKAGS LPPKERRFAVL SSLLSSQTKDHVTHGAIQRL Sbjct: 145 KVLEGIRKMRSLEDAPVDTMGCEKAGSVLPPKERRFAVLVSSLLSSQTKDHVTHGAIQRL 204 Query: 102 LQNDLLTAEAIDRSDEVTIKNLIYPVGFYTRKA 4 QN LLT +AID++DE TIK+LIYPVGFYTRKA Sbjct: 205 SQNCLLTPDAIDKADEATIKDLIYPVGFYTRKA 237 >gb|KJB71883.1| hypothetical protein B456_011G146700 [Gossypium raimondii] Length = 357 Score = 192 bits (488), Expect = 1e-46 Identities = 114/213 (53%), Positives = 130/213 (61%), Gaps = 22/213 (10%) Frame = -1 Query: 576 SFNKMPTTRISASKLGMPPSSREPENPAPESSNASSXXXXXXXXXXXXXXKTLEITPTAV 397 S NKMP TR SA L S + P NP +N S KTL++ Sbjct: 26 
SNNKMPKTRFSAKALSSSSSDQNP-NPGSGLTNNVSLPQVRVFTRKKRLKKTLDVVKENP 84 Query: 396 EAETRDQKLCSPPEIEDFAFGKDSSYSR----------------LRQP------PPANWE 283 + E D CS P+IE+FA+ K +R + P PANWE Sbjct: 85 KPENEDHNSCSLPDIEEFAYKKVDGPARSGKLKSACDELNVGVSIASPIGVGGKAPANWE 144 Query: 282 KVLQEIRKMRSSEGAPVDSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRL 103 KVL+ IRKMRS E APVD+MGCEKAGS LPPKERRFAVL SSLLSSQTKDHVTHGAIQRL Sbjct: 145 KVLEGIRKMRSLEDAPVDTMGCEKAGSVLPPKERRFAVLVSSLLSSQTKDHVTHGAIQRL 204 Query: 102 LQNDLLTAEAIDRSDEVTIKNLIYPVGFYTRKA 4 QN LLT +AID++DE TIK+LIYPVGFYTRKA Sbjct: 205 SQNCLLTPDAIDKADEATIKDLIYPVGFYTRKA 237 >ref|XP_012454115.1| PREDICTED: endonuclease III homolog 1, chloroplastic-like isoform X4 [Gossypium raimondii] gi|763804943|gb|KJB71881.1| hypothetical protein B456_011G146700 [Gossypium raimondii] Length = 384 Score = 192 bits (488), Expect = 1e-46 Identities = 114/213 (53%), Positives = 130/213 (61%), Gaps = 22/213 (10%) Frame = -1 Query: 576 SFNKMPTTRISASKLGMPPSSREPENPAPESSNASSXXXXXXXXXXXXXXKTLEITPTAV 397 S NKMP TR SA L S + P NP +N S KTL++ Sbjct: 26 SNNKMPKTRFSAKALSSSSSDQNP-NPGSGLTNNVSLPQVRVFTRKKRLKKTLDVVKENP 84 Query: 396 EAETRDQKLCSPPEIEDFAFGKDSSYSR----------------LRQP------PPANWE 283 + E D CS P+IE+FA+ K +R + P PANWE Sbjct: 85 KPENEDHNSCSLPDIEEFAYKKVDGPARSGKLKSACDELNVGVSIASPIGVGGKAPANWE 144 Query: 282 KVLQEIRKMRSSEGAPVDSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRL 103 KVL+ IRKMRS E APVD+MGCEKAGS LPPKERRFAVL SSLLSSQTKDHVTHGAIQRL Sbjct: 145 KVLEGIRKMRSLEDAPVDTMGCEKAGSVLPPKERRFAVLVSSLLSSQTKDHVTHGAIQRL 204 Query: 102 LQNDLLTAEAIDRSDEVTIKNLIYPVGFYTRKA 4 QN LLT +AID++DE TIK+LIYPVGFYTRKA Sbjct: 205 SQNCLLTPDAIDKADEATIKDLIYPVGFYTRKA 237 >ref|XP_002264475.3| PREDICTED: endonuclease III homolog 1, chloroplastic-like isoform X1 [Vitis vinifera] Length = 376 Score = 192 bits (487), Expect = 2e-46 Identities = 105/209 (50%), Positives = 133/209 (63%), Gaps = 22/209 (10%) Frame = -1 Query: 561 PTTRISASKLGMPPSSREPENPAPESSNASSXXXXXXXXXXXXXXKTLEITPTAVEAETR 382 P +R ++S + P+ + + E+ N S +E ++AE + 
Sbjct: 21 PMSRATSSSKPLLPALQSKTSAHEETPNGVSGSEVRVFVRKKRVKMAVETPEKEIKAEPQ 80 Query: 381 DQKLCSPPEIEDFAFGKDSSYSRLRQ-------PP---------------PANWEKVLQE 268 QK+C P+IE+F + K + LR+ PP PANWEK+L+ Sbjct: 81 QQKICELPDIEEFTYRKGKRSTHLRKSKPTSDVPPGGTEITSSIRPAAELPANWEKILEG 140 Query: 267 IRKMRSSEGAPVDSMGCEKAGSSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRLLQNDL 88 IRKMRSSE APVDSMGCEKAGSSLPP+ERRFAVL SSLLSSQTKD+VTHGAIQRLLQN L Sbjct: 141 IRKMRSSEDAPVDSMGCEKAGSSLPPRERRFAVLVSSLLSSQTKDNVTHGAIQRLLQNGL 200 Query: 87 LTAEAIDRSDEVTIKNLIYPVGFYTRKAG 1 L A+AID++DE T+K+LIYPVGFY+RKAG Sbjct: 201 LVADAIDKADEATVKSLIYPVGFYSRKAG 229 >ref|XP_009588403.1| PREDICTED: endonuclease III homolog 1, chloroplastic {ECO:0000255|HAMAP-Rule:MF_03183} isoform X2 [Nicotiana tomentosiformis] Length = 368 Score = 192 bits (487), Expect = 2e-46 Identities = 118/230 (51%), Positives = 135/230 (58%), Gaps = 21/230 (9%) Frame = -1 Query: 630 FALFRNTPVAS--PRFIVSFSFNKMPTTRISASKLGMPPSSREPENPAPESSNASSXXXX 457 F L RNT + P I + S KMP TR S + + NP E S SS Sbjct: 3 FCLVRNTALLPLVPTRIQTISSAKMPITRSSFKR-----ETPSETNPGSEGSGGSSVPEL 57 Query: 456 XXXXXXXXXXKTLEITPTAVEAETRDQKLCSPPEIEDFAFGKDSSYSRL----------- 310 KT+E+ V+ E+ QK P+IEDF++ KD YS+ Sbjct: 58 RVFVRRKRVKKTVEVIAKEVKEESSAQKFVKLPDIEDFSYLKDHMYSQSTPAKTVHLTGE 117 Query: 309 --------RQPPPANWEKVLQEIRKMRSSEGAPVDSMGCEKAGSSLPPKERRFAVLASSL 154 + PP NWEKVL+ IRKMRSSE APVDSMGCEKAGSSLP KERRFAVL SSL Sbjct: 118 KSLSQLTRKVRPPPNWEKVLEGIRKMRSSEDAPVDSMGCEKAGSSLPAKERRFAVLVSSL 177 Query: 153 LSSQTKDHVTHGAIQRLLQNDLLTAEAIDRSDEVTIKNLIYPVGFYTRKA 4 LSSQTKD V HGAIQRLLQN LL +AID +E TIK+LIYPVGFYTRKA Sbjct: 178 LSSQTKDQVNHGAIQRLLQNGLLAPDAIDTKNEDTIKSLIYPVGFYTRKA 227 >ref|XP_009781250.1| PREDICTED: endonuclease III homolog 1, chloroplastic-like isoform X2 [Nicotiana sylvestris] Length = 369 Score = 191 bits (484), Expect = 4e-46 Identities = 117/229 (51%), Positives = 136/229 (59%), Gaps = 22/229 (9%) Frame = -1 Query: 624 LFRNT---PVASPRFIVSFSFNKMPTTRISASKLGMPPSSREPENPAPESSNASSXXXXX 454 L RNT P+ P I + S KMP R S + + +NP E S SS 
Sbjct: 5 LLRNTALLPLFVPLRIQTISSAKMPRNRSSFKR-----ETPFDKNPGSEGSGGSSVPEFR 59 Query: 453 XXXXXXXXXKTLEITPTAVEAETRDQKLCSPPEIEDFAFGKDSSYSRL------------ 310 KTLE+ V+ E+ +K P+IEDF++ KD YS+ Sbjct: 60 VFVRKKRVKKTLEVMDKEVKEESSGKKFVKLPDIEDFSYVKDDMYSQSTPAETVHLTGEK 119 Query: 309 -------RQPPPANWEKVLQEIRKMRSSEGAPVDSMGCEKAGSSLPPKERRFAVLASSLL 151 + PP NWEKVL+ IRKMRSSE APVDSMGCEKAGSSLP KERRFAVL SSLL Sbjct: 120 ALSQLTQKVRPPLNWEKVLEGIRKMRSSEDAPVDSMGCEKAGSSLPAKERRFAVLVSSLL 179 Query: 150 SSQTKDHVTHGAIQRLLQNDLLTAEAIDRSDEVTIKNLIYPVGFYTRKA 4 SSQTKD V HGAIQRLLQN LL +AID ++E TIK+LIYPVGFYTRKA Sbjct: 180 SSQTKDQVNHGAIQRLLQNGLLAPDAIDTANEETIKSLIYPVGFYTRKA 228 >ref|XP_007034068.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao] gi|508713097|gb|EOY04994.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao] Length = 359 Score = 191 bits (484), Expect = 4e-46 Identities = 115/236 (48%), Positives = 145/236 (61%), Gaps = 26/236 (11%) Frame = -1 Query: 633 MFALFRNTPVASPRFIVSFSFN-KMPTTRISASKLGMPPSSREPE---NPAPESSNASSX 466 M+A+ R+ P+ + N KMP TR++ L ++ P NP E+++ S Sbjct: 1 MYAVPRSFPLGFGVGLGGMKLNSKMPKTRLAFKTLSSSSTTEVPSSDPNPGSETTDNVSV 60 Query: 465 XXXXXXXXXXXXXKTLEITPTAVEAETRDQKLCSPPEIEDFAF---------GKDSSYSR 313 KT+++ +AE + KLC P+IE+FA+ GK S S Sbjct: 61 PAVRVFTRKKRVKKTVDVVQEIPKAENKGLKLCGLPDIEEFAYKKVDGPSLSGKSKSTSD 120 Query: 312 -------LRQP------PPANWEKVLQEIRKMRSSEGAPVDSMGCEKAGSSLPPKERRFA 172 + P PANWEKVL+ IRKMRS+E APVD+MGCEKAGS LPPKERRFA Sbjct: 121 EINVGTGIASPVGIGGNAPANWEKVLEGIRKMRSAEDAPVDTMGCEKAGSVLPPKERRFA 180 Query: 171 VLASSLLSSQTKDHVTHGAIQRLLQNDLLTAEAIDRSDEVTIKNLIYPVGFYTRKA 4 VL SSLLSSQTKDHVTHGAIQRL+QN L+T +AID++DE TIK+LIYPVGFYTRKA Sbjct: 181 VLISSLLSSQTKDHVTHGAIQRLIQNCLMTPDAIDKADEATIKDLIYPVGFYTRKA 236