BLASTX nr result
ID: Mentha25_contig00027272
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00027272
         (517 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_004508835.1| PREDICTED: protein ROS1-like [Cicer arietinu...   287   9e-76
ref|XP_007155390.1| hypothetical protein PHAVU_003G197200g [Phas...   285   5e-75
ref|XP_003525486.1| PREDICTED: uncharacterized protein LOC100802...   283   2e-74
ref|XP_007036109.1| DNA glycosylase superfamily protein isoform ...   282   3e-74
ref|XP_003608916.1| Ultraviolet N-glycosylase/AP lyase [Medicago...   282   3e-74
ref|XP_002321564.2| hypothetical protein POPTR_0015s08260g [Popu...   281   5e-74
ref|XP_002511456.1| Endonuclease III, putative [Ricinus communis...   281   9e-74
ref|XP_006476719.1| PREDICTED: protein ROS1-like isoform X2 [Cit...   278   7e-73
ref|XP_006476718.1| PREDICTED: protein ROS1-like isoform X1 [Cit...   278   7e-73
ref|XP_004236146.1| PREDICTED: endonuclease III-like [Solanum ly...   278   7e-73
ref|XP_006345014.1| PREDICTED: protein ROS1-like [Solanum tubero...   277   9e-73
ref|XP_006439743.1| hypothetical protein CICLE_v10021561mg [Citr...   277   9e-73
gb|EXB42063.1| Protein ROS1 [Morus notabilis]                         276   2e-72
ref|XP_006476720.1| PREDICTED: protein ROS1-like isoform X3 [Cit...   268   4e-70
ref|XP_007036110.1| DNA glycosylase superfamily protein isoform ...   268   6e-70
ref|XP_007036108.1| DNA glycosylase superfamily protein isoform ...   268   6e-70
emb|CBI15085.3| unnamed protein product [Vitis vinifera]              267   1e-69
ref|XP_002283633.1| PREDICTED: endonuclease III-like [Vitis vini...   267   1e-69
ref|NP_566893.1| DNA glycosylase superfamily protein [Arabidopsi...   267   1e-69
ref|XP_006404333.1| hypothetical protein EUTSA_v10010580mg [Eutr...   266   2e-69

>ref|XP_004508835.1| PREDICTED: protein ROS1-like [Cicer arietinum]
 gi|502152248|ref|XP_004508836.1| PREDICTED: protein ROS1-like [Cicer arietinum]
 Length = 285

 Score = 287 bits (735), Expect = 9e-76
 Identities = 136/171 (79%), Positives = 153/171 (89%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLV+TILSQNTTE NS +AFASLK SFPTW+HV AESK +E+AIRCGGLAPTK+SC
Sbjct: 88 VLDGLVRTILSQNTTESNSNKAFASLKSSFPTWEHVHGAESKELENAIRCGGLAPTKASC 147

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IKNLLR LLEKRGK C+EYLR+LSV ++KAELSL KGIGPKTV+CVLMFNLQ DDFPVDT
Sbjct: 148 IKNLLRCLLEKRGKFCLEYLRDLSVAQIKAELSLFKGIGPKTVACVLMFNLQQDDFPVDT 207

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
H+F+IAK +GWVP VAD TYLHLN+RIP+ELKFDLNCLLYTHGK C +C
Sbjct: 208 HIFEIAKTIGWVPAVADRNKTYLHLNQRIPNELKFDLNCLLYTHGKFCSKC 258

>ref|XP_007155390.1| hypothetical protein PHAVU_003G197200g [Phaseolus vulgaris]
 gi|561028744|gb|ESW27384.1| hypothetical protein PHAVU_003G197200g [Phaseolus vulgaris]
 Length = 282

 Score = 285 bits (729), Expect = 5e-75
 Identities = 131/171 (76%), Positives = 155/171 (90%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLV+T+LSQNTTE NS++AF SLK SFPTW+HV AESK VE+AIRCGGLAPTK+SC
Sbjct: 85 VLDGLVRTVLSQNTTEANSQKAFVSLKSSFPTWEHVFGAESKDVENAIRCGGLAPTKASC 144

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IKN+LR L E+RG+LC+EYLR+LSV+E KAELSL KGIGPKTV+CVLMFNLQ DDFPVDT
Sbjct: 145 IKNMLRCLRERRGQLCLEYLRDLSVDEAKAELSLFKGIGPKTVACVLMFNLQQDDFPVDT 204

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
H+F+I+K MGWVP VAD +YLHLN+RIP+ELKFDLNCL++THGK+CR+C
Sbjct: 205 HIFEISKTMGWVPSVADRNKSYLHLNQRIPNELKFDLNCLMFTHGKLCRKC 255

>ref|XP_003525486.1| PREDICTED: uncharacterized protein LOC100802952 [Glycine max]
 Length = 284

 Score = 283 bits (724), Expect = 2e-74
 Identities = 132/171 (77%), Positives = 156/171 (91%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLV+T+LSQNTTE NS++AFASLK SFP+W+ VL AESK VE+AIRCGGLAPTK+SC
Sbjct: 85 VLDGLVRTVLSQNTTEANSQKAFASLKSSFPSWEQVLWAESKDVENAIRCGGLAPTKASC 144

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IKN+LR L E+RG+LC+EYLR+LSV+EVKAELSL KGIGPKTV+CVLMFNLQ DDFPVDT
Sbjct: 145 IKNVLRCLRERRGELCLEYLRDLSVDEVKAELSLFKGIGPKTVACVLMFNLQQDDFPVDT 204

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
H+F+IAK MGWVP VA+ +YLHLN+R+P+ELKFDLNCLLYTHGK+C +C
Sbjct: 205 HIFEIAKTMGWVPAVANRNKSYLHLNQRVPNELKFDLNCLLYTHGKLCHQC 255

>ref|XP_007036109.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao]
 gi|508773354|gb|EOY20610.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao]
 Length = 292

 Score = 282 bits (722), Expect = 3e-74
 Identities = 130/171 (76%), Positives = 155/171 (90%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLVKT+LSQNTTELNS++AFASLK +FPTW+ VL AESK +E+AIRCGGLAP K+SC
Sbjct: 90 VLDGLVKTVLSQNTTELNSQKAFASLKSAFPTWEDVLAAESKNLENAIRCGGLAPRKASC 149

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IKN+LR L E++GKLC EYLR+LS++E+KAELS KG+GPKTV+CVLMFNLQ DDFPVDT
Sbjct: 150 IKNVLRCLHERKGKLCFEYLRDLSIDEIKAELSNFKGVGPKTVACVLMFNLQQDDFPVDT 209

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
HVF+IA+ +GWVP AD K TYLHLN+RIP++LKFDLNCLLYTHGK+CR+C
Sbjct: 210 HVFEIARAIGWVPATADRKKTYLHLNRRIPNKLKFDLNCLLYTHGKLCRKC 260

>ref|XP_003608916.1| Ultraviolet N-glycosylase/AP lyase [Medicago truncatula]
 gi|355509971|gb|AES91113.1| Ultraviolet N-glycosylase/AP lyase [Medicago truncatula]
 Length = 280

 Score = 282 bits (722), Expect = 3e-74
 Identities = 134/171 (78%), Positives = 150/171 (87%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLV+TILSQNTTE NS +AFASLK FPTW+HV AESK +E+AIRCGGLAPTK+ C
Sbjct: 83 VLDGLVRTILSQNTTEANSNKAFASLKSLFPTWEHVHGAESKELENAIRCGGLAPTKAKC 142

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IKNLL LLE++GK+C+EYLR+LSV+EVKAELSL KGIGPKTVSCVLMFNLQ DDFPVDT
Sbjct: 143 IKNLLSCLLERKGKMCLEYLRDLSVDEVKAELSLFKGIGPKTVSCVLMFNLQLDDFPVDT 202

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
H+F+IAK MGWVP AD TYLHLN+RIP ELKFDLNCLLYTHGK+C C
Sbjct: 203 HIFEIAKTMGWVPAAADRNKTYLHLNQRIPDELKFDLNCLLYTHGKLCSNC 253

>ref|XP_002321564.2| hypothetical protein POPTR_0015s08260g [Populus trichocarpa]
 gi|550322300|gb|EEF05691.2| hypothetical protein POPTR_0015s08260g [Populus trichocarpa]
 Length = 306

 Score = 281 bits (720), Expect = 5e-74
 Identities = 130/171 (76%), Positives = 153/171 (89%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLVKT+LSQNTTE+NS+RAF +LK +FPTW++VL AESK++EDAIRCGGLAPTK++C
Sbjct: 109 VLDGLVKTVLSQNTTEVNSQRAFLNLKSAFPTWENVLAAESKFIEDAIRCGGLAPTKAAC 168

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
I+N+L L+EK G+LC+EYLR+L V E+KAELS KGIGPKTV+CVLMFNLQ DDFPVDT
Sbjct: 169 IRNILSSLMEKNGRLCLEYLRDLPVAEIKAELSHFKGIGPKTVACVLMFNLQKDDFPVDT 228

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
HVF+IAK +GWVP VAD TYLHLN RIP ELKFDLNCLLYTHGK+CR+C
Sbjct: 229 HVFEIAKAIGWVPPVADRNKTYLHLNHRIPKELKFDLNCLLYTHGKLCRKC 279

>ref|XP_002511456.1| Endonuclease III, putative [Ricinus communis]
 gi|223550571|gb|EEF52058.1| Endonuclease III, putative [Ricinus communis]
 Length = 291

 Score = 281 bits (718), Expect = 9e-74
 Identities = 128/171 (74%), Positives = 154/171 (90%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLVKT+LSQNTTE+NS+RAF +LK FPTWQ VL AE K++E+AIRCGGLAP K+SC
Sbjct: 86 VLDGLVKTVLSQNTTEVNSQRAFDNLKSDFPTWQDVLAAEPKWIENAIRCGGLAPAKASC 145

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IKN+L LLEK+GK+C+EYLR++SV+E+KAELS KG+GPKTV+CVLMF+LQ +DFPVDT
Sbjct: 146 IKNILNCLLEKKGKICLEYLRDMSVDEIKAELSQFKGVGPKTVACVLMFHLQQEDFPVDT 205

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
HVF+IAK +GWVP VAD TYLHLN+RIP+ELKFDLNCLLYTHGK+CR+C
Sbjct: 206 HVFEIAKALGWVPEVADRNKTYLHLNQRIPNELKFDLNCLLYTHGKLCRKC 256

>ref|XP_006476719.1| PREDICTED: protein ROS1-like isoform X2 [Citrus sinensis]
 Length = 278

 Score = 278 bits (710), Expect = 7e-73
 Identities = 128/171 (74%), Positives = 151/171 (88%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLVKT+LSQNTTE NS +AFASLK +FPTW+HVL AE K +E+AIRCGGLAPTK++C
Sbjct: 82 VLDGLVKTVLSQNTTEANSLKAFASLKSTFPTWEHVLAAEQKCIENAIRCGGLAPTKAAC 141

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IKN+L+ LLE +GKLC+EYLR LS++E+KAELS +GIGPKTV+CVLMF+LQ DDFPVDT
Sbjct: 142 IKNILKCLLESKGKLCLEYLRGLSIDEIKAELSRFRGIGPKTVACVLMFHLQQDDFPVDT 201

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
HVF+I+K +GWVP AD TYLHLN+RIP ELKFDLNCLLYTHGK+CR C
Sbjct: 202 HVFEISKAIGWVPTAADRNKTYLHLNQRIPKELKFDLNCLLYTHGKLCRNC 252

>ref|XP_006476718.1| PREDICTED: protein ROS1-like isoform X1 [Citrus sinensis]
 Length = 281

 Score = 278 bits (710), Expect = 7e-73
 Identities = 128/171 (74%), Positives = 151/171 (88%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLVKT+LSQNTTE NS +AFASLK +FPTW+HVL AE K +E+AIRCGGLAPTK++C
Sbjct: 82 VLDGLVKTVLSQNTTEANSLKAFASLKSTFPTWEHVLAAEQKCIENAIRCGGLAPTKAAC 141

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IKN+L+ LLE +GKLC+EYLR LS++E+KAELS +GIGPKTV+CVLMF+LQ DDFPVDT
Sbjct: 142 IKNILKCLLESKGKLCLEYLRGLSIDEIKAELSRFRGIGPKTVACVLMFHLQQDDFPVDT 201

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
HVF+I+K +GWVP AD TYLHLN+RIP ELKFDLNCLLYTHGK+CR C
Sbjct: 202 HVFEISKAIGWVPTAADRNKTYLHLNQRIPKELKFDLNCLLYTHGKLCRNC 252

>ref|XP_004236146.1| PREDICTED: endonuclease III-like [Solanum lycopersicum]
 Length = 301

 Score = 278 bits (710), Expect = 7e-73
 Identities = 128/171 (74%), Positives = 148/171 (86%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGL+ TILSQNTTE NS++AFASLK SFPTW+ VL A++K VED IRCGGLAPTK+SC
Sbjct: 104 VLDGLINTILSQNTTEANSQKAFASLKSSFPTWECVLAADAKLVEDTIRCGGLAPTKTSC 163

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IK +L LL+K+G LC+EYLRELS+EE+K ELS +GIGPKTV+CVLMF LQ DDFPVDT
Sbjct: 164 IKGILSSLLQKKGNLCLEYLRELSIEEIKRELSCFRGIGPKTVACVLMFQLQRDDFPVDT 223

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
H+FQIAK + WVP ADVK TY+HLN+RIP ELKFDLNCL+YTHGKVCR C
Sbjct: 224 HIFQIAKTLHWVPAAADVKKTYIHLNRRIPDELKFDLNCLIYTHGKVCREC 274

>ref|XP_006345014.1| PREDICTED: protein ROS1-like [Solanum tuberosum]
 Length = 301

 Score = 277 bits (709), Expect = 9e-73
 Identities = 128/171 (74%), Positives = 148/171 (86%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGL+ TILSQNTTE NS++AFASLK SFPTW+ VL A++K VED IRCGGLAPTK+SC
Sbjct: 104 VLDGLINTILSQNTTEANSQKAFASLKSSFPTWECVLAADAKLVEDTIRCGGLAPTKTSC 163

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IK +L LL+K+G LC+EYLRELS+EE+K ELS +GIGPKTV+CVLMF LQ DDFPVDT
Sbjct: 164 IKGILSSLLQKKGNLCLEYLRELSIEEIKRELSCFRGIGPKTVACVLMFQLQRDDFPVDT 223

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
H+FQIAK + WVP ADVK TY+HLN+RIP ELKFDLNCL+YTHGKVCR C
Sbjct: 224 HIFQIAKTLHWVPAAADVKKTYIHLNQRIPDELKFDLNCLIYTHGKVCREC 274

>ref|XP_006439743.1| hypothetical protein CICLE_v10021561mg [Citrus clementina]
 gi|557542005|gb|ESR52983.1| hypothetical protein CICLE_v10021561mg [Citrus clementina]
 Length = 281

 Score = 277 bits (709), Expect = 9e-73
 Identities = 128/171 (74%), Positives = 151/171 (88%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLVKT+LSQNTTE NS +AFASLK +FPTW+HVL AE K +E+AIRCGGLAPTK++C
Sbjct: 82 VLDGLVKTLLSQNTTEANSLKAFASLKSTFPTWEHVLAAEQKCIENAIRCGGLAPTKAAC 141

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IKN+L+ LLE +GKLC+EYLR LS++E+KAELS +GIGPKTV+CVLMF+LQ DDFPVDT
Sbjct: 142 IKNILKCLLESKGKLCLEYLRGLSIDEIKAELSRFRGIGPKTVACVLMFHLQQDDFPVDT 201

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
HVF+I+K +GWVP AD TYLHLN+RIP ELKFDLNCLLYTHGK+CR C
Sbjct: 202 HVFEISKAIGWVPTAADRNKTYLHLNQRIPKELKFDLNCLLYTHGKLCRNC 252

>gb|EXB42063.1| Protein ROS1 [Morus notabilis]
 Length = 308

 Score = 276 bits (706), Expect = 2e-72
 Identities = 129/171 (75%), Positives = 150/171 (87%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLV T+LSQNTTE NS+RAFASLK +FPTW+ VL A+SK +EDAIRCGGLAP K+SC
Sbjct: 112 VLDGLVMTVLSQNTTEANSQRAFASLKSAFPTWEQVLNADSKCIEDAIRCGGLAPKKASC 171

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IKN LR LLE++GKLC+EYL + SV+EVKAELS KGIGPKTV+CVLMF+LQ DDFPVDT
Sbjct: 172 IKNTLRSLLERKGKLCLEYLLDFSVDEVKAELSCFKGIGPKTVACVLMFHLQQDDFPVDT 231

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
HVF+IAK +GW+P AD YLHLN+RIP+ELKFDLNCLLYTHGK+CR+C
Sbjct: 232 HVFEIAKALGWLPAGADRNKAYLHLNQRIPNELKFDLNCLLYTHGKMCRKC 282

>ref|XP_006476720.1| PREDICTED: protein ROS1-like isoform X3 [Citrus sinensis]
 Length = 258

 Score = 268 bits (686), Expect = 4e-70
 Identities = 125/170 (73%), Positives = 148/170 (87%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLVKT+LSQNTTE NS +AFASLK +FPTW+HVL AE K +E+AIRCGGLAPTK++C
Sbjct: 82 VLDGLVKTVLSQNTTEANSLKAFASLKSTFPTWEHVLAAEQKCIENAIRCGGLAPTKAAC 141

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IKN+L+ LLE +GKLC+EYLR LS++E+KAELS +GIGPKTV+CVLMF+LQ DDFPVDT
Sbjct: 142 IKNILKCLLESKGKLCLEYLRGLSIDEIKAELSRFRGIGPKTVACVLMFHLQQDDFPVDT 201

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRR 512
HVF+I+K +GWVP AD TYLHLN+RIP ELKFDLNCLLYTHG + R
Sbjct: 202 HVFEISKAIGWVPTAADRNKTYLHLNQRIPKELKFDLNCLLYTHGNILPR 251

>ref|XP_007036110.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao]
 gi|508773355|gb|EOY20611.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao]
 Length = 264

 Score = 268 bits (685), Expect = 6e-70
 Identities = 125/164 (76%), Positives = 148/164 (90%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLVKT+LSQNTTELNS++AFASLK +FPTW+ VL AESK +E+AIRCGGLAP K+SC
Sbjct: 90 VLDGLVKTVLSQNTTELNSQKAFASLKSAFPTWEDVLAAESKNLENAIRCGGLAPRKASC 149

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IKN+LR L E++GKLC EYLR+LS++E+KAELS KG+GPKTV+CVLMFNLQ DDFPVDT
Sbjct: 150 IKNVLRCLHERKGKLCFEYLRDLSIDEIKAELSNFKGVGPKTVACVLMFNLQQDDFPVDT 209

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTH 494
HVF+IA+ +GWVP AD K TYLHLN+RIP++LKFDLNCLLYTH
Sbjct: 210 HVFEIARAIGWVPATADRKKTYLHLNRRIPNKLKFDLNCLLYTH 253

>ref|XP_007036108.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao]
 gi|508773353|gb|EOY20609.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao]
 Length = 446

 Score = 268 bits (685), Expect = 6e-70
 Identities = 125/164 (76%), Positives = 148/164 (90%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLVKT+LSQNTTELNS++AFASLK +FPTW+ VL AESK +E+AIRCGGLAP K+SC
Sbjct: 90 VLDGLVKTVLSQNTTELNSQKAFASLKSAFPTWEDVLAAESKNLENAIRCGGLAPRKASC 149

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IKN+LR L E++GKLC EYLR+LS++E+KAELS KG+GPKTV+CVLMFNLQ DDFPVDT
Sbjct: 150 IKNVLRCLHERKGKLCFEYLRDLSIDEIKAELSNFKGVGPKTVACVLMFNLQQDDFPVDT 209

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTH 494
HVF+IA+ +GWVP AD K TYLHLN+RIP++LKFDLNCLLYTH
Sbjct: 210 HVFEIARAIGWVPATADRKKTYLHLNRRIPNKLKFDLNCLLYTH 253

>emb|CBI15085.3| unnamed protein product [Vitis vinifera]
 Length = 310

 Score = 267 bits (683), Expect = 1e-69
 Identities = 125/171 (73%), Positives = 147/171 (85%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLV ILSQNTT++NS+RAFASLK +FPTWQ VL A+SK +E+AIRCGGLA TK+SC
Sbjct: 107 VLDGLVSIILSQNTTDVNSQRAFASLKSAFPTWQDVLAADSKSIENAIRCGGLAVTKASC 166

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IK +L LLE++GKLC+EYLR+L+V+E+K ELS KGIGPKTV+CVLMF+LQ DDFPVDT
Sbjct: 167 IKKMLSCLLERKGKLCLEYLRDLTVDEIKTELSHFKGIGPKTVACVLMFHLQRDDFPVDT 226

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
HV QI K +GWVP VAD K YLHLN+RIP ELKFDLNCLL+THGK+C C
Sbjct: 227 HVIQIGKAIGWVPAVADRKKAYLHLNRRIPDELKFDLNCLLFTHGKLCHEC 277

>ref|XP_002283633.1| PREDICTED: endonuclease III-like [Vitis vinifera]
 Length = 310

 Score = 267 bits (683), Expect = 1e-69
 Identities = 125/171 (73%), Positives = 147/171 (85%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLV ILSQNTT++NS+RAFASLK +FPTWQ VL A+SK +E+AIRCGGLA TK+SC
Sbjct: 107 VLDGLVSIILSQNTTDVNSQRAFASLKSAFPTWQDVLAADSKSIENAIRCGGLAVTKASC 166

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IK +L LLE++GKLC+EYLR+L+V+E+K ELS KGIGPKTV+CVLMF+LQ DDFPVDT
Sbjct: 167 IKKMLSCLLERKGKLCLEYLRDLTVDEIKTELSHFKGIGPKTVACVLMFHLQRDDFPVDT 226

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
HV QI K +GWVP VAD K YLHLN+RIP ELKFDLNCLL+THGK+C C
Sbjct: 227 HVIQIGKAIGWVPAVADRKKAYLHLNRRIPDELKFDLNCLLFTHGKLCHEC 277

>ref|NP_566893.1| DNA glycosylase superfamily protein [Arabidopsis thaliana]
 gi|332644814|gb|AEE78335.1| DNA glycosylase superfamily protein [Arabidopsis thaliana]
 Length = 293

 Score = 267 bits (682), Expect = 1e-69
 Identities = 126/171 (73%), Positives = 144/171 (84%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLVK +LSQNTTE NS+RAFASLK +FP W VL AESK +E+AIRCGGLAP K+ C
Sbjct: 96 VLDGLVKILLSQNTTESNSQRAFASLKATFPKWDDVLNAESKSIENAIRCGGLAPKKAVC 155

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IKN+L L +RG+LC+EYLR LSVEEVK ELS KG+GPKTVSCVLMFNLQH+DFPVDT
Sbjct: 156 IKNILNRLQNERGRLCLEYLRGLSVEEVKTELSHFKGVGPKTVSCVLMFNLQHNDFPVDT 215

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
HVF+IAK +GWVP AD TY+HLN++IP ELKFDLNCLLYTHGK+C C
Sbjct: 216 HVFEIAKALGWVPKTADRNKTYVHLNRKIPDELKFDLNCLLYTHGKICSNC 266

>ref|XP_006404333.1| hypothetical protein EUTSA_v10010580mg [Eutrema salsugineum]
 gi|557105452|gb|ESQ45786.1| hypothetical protein EUTSA_v10010580mg [Eutrema salsugineum]
 Length = 302

 Score = 266 bits (681), Expect = 2e-69
 Identities = 127/171 (74%), Positives = 145/171 (84%)
 Frame = +3

Query: 3 VLDGLVKTILSQNTTELNSERAFASLKESFPTWQHVLEAESKYVEDAIRCGGLAPTKSSC 182
VLDGLVK +LSQNTTE+NS+RAFASLK +FP W+ VL AE K +E+AIRCGGLAP K+ C
Sbjct: 102 VLDGLVKILLSQNTTEINSQRAFASLKAAFPKWEDVLGAEPKSIENAIRCGGLAPKKAVC 161

Query: 183 IKNLLRVLLEKRGKLCMEYLRELSVEEVKAELSLLKGIGPKTVSCVLMFNLQHDDFPVDT 362
IKN+L L +RG+LC+EYLR LSVEEVK ELS KGIGPKTVSCVLMFNLQH+DFPVDT
Sbjct: 162 IKNILSRLQSERGRLCLEYLRGLSVEEVKTELSHFKGIGPKTVSCVLMFNLQHNDFPVDT 221

Query: 363 HVFQIAKRMGWVPGVADVKTTYLHLNKRIPSELKFDLNCLLYTHGKVCRRC 515
HVF+IAK +GWVP AD TY+HLN+RIP ELKFDLNCLLYTHGK+C C
Sbjct: 222 HVFEIAKAIGWVPKTADRNKTYVHLNRRIPDELKFDLNCLLYTHGKLCSNC 272