BLASTX nr result
ID: Rauwolfia21_contig00004988
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00004988 (2214 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002263612.1| PREDICTED: uncharacterized protein LOC100256...  482  e-133
emb|CAN68394.1| hypothetical protein VITISV_042519 [Vitis vinifera]  479  e-132
ref|XP_002303719.2| hypothetical protein POPTR_0003s15520g [Popu...  477  e-132
ref|XP_002303720.2| hypothetical protein POPTR_0003s15520g [Popu...  477  e-132
ref|XP_004232605.1| PREDICTED: uncharacterized protein LOC101246...  467  e-129
gb|EMJ13510.1| hypothetical protein PRUPE_ppa006731mg [Prunus pe...  466  e-128
gb|EXB83232.1| Putative Glutamine amidotransferase [Morus notabi...  465  e-128
ref|XP_006364818.1| PREDICTED: uncharacterized protein LOC102594...  461  e-127
ref|XP_006468594.1| PREDICTED: uncharacterized protein LOC102614...  454  e-125
gb|EOX96813.1| DNA glycosylase superfamily protein, putative [Th...  450  e-123
gb|EMJ12930.1| hypothetical protein PRUPE_ppa006139mg [Prunus pe...  442  e-121
ref|XP_003531474.1| PREDICTED: uncharacterized protein LOC100791...  441  e-121
gb|EOY14286.1| DNA glycosylase superfamily protein isoform 1 [Th...  440  e-120
ref|XP_006448576.1| hypothetical protein CICLE_v10015639mg [Citr...  439  e-120
ref|XP_003546838.1| PREDICTED: uncharacterized protein LOC100811...  438  e-120
gb|ESW21148.1| hypothetical protein PHAVU_005G045900g [Phaseolus...  437  e-120
ref|XP_004295546.1| PREDICTED: uncharacterized protein LOC101298...  435  e-119
ref|XP_002530111.1| DNA-3-methyladenine glycosylase, putative [R...
435 e-119 ref|XP_002315089.2| methyladenine glycosylase family protein [Po... 434 e-119 ref|XP_002312220.1| methyladenine glycosylase family protein [Po... 432 e-118 >ref|XP_002263612.1| PREDICTED: uncharacterized protein LOC100256507 [Vitis vinifera] gi|297738175|emb|CBI27376.3| unnamed protein product [Vitis vinifera] Length = 398 Score = 482 bits (1240), Expect = e-133 Identities = 263/407 (64%), Positives = 298/407 (73%), Gaps = 1/407 (0%) Frame = -3 Query: 2026 MCSSKSKPQGSTAAADVV-AVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVP 1850 MCSSKSK D+ + + INGRP LQP CNR P LER + Sbjct: 1 MCSSKSKLH---QGIDITPSKAQINGRPALQPTCNRIPSLERHHSFKKISPKSPTSPLPA 57 Query: 1849 LSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSNKT 1670 P KSPR PA KRGNDPNGLNSS+EKV LTP+ + K+ Sbjct: 58 SPPPPTTIINTTKTKPSLTPPASPNLKSPRQPALKRGNDPNGLNSSLEKV-LTPRGTTKS 116 Query: 1669 VAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRI 1490 + KK+K C G S D S L N SSSLIVEAPGSIAAARREQ+AIMQVQRKMRI Sbjct: 117 SSSPKKTKK-CSAGL--APSSDTSSL-NYSSSLIVEAPGSIAAARREQMAIMQVQRKMRI 172 Query: 1489 AHYGRTKSAKYEGKIVPLDSSATAAVKEERRCHFITLNSDPIYIAYHDEEWGVPVHEDKM 1310 AHYGRTKSAKYE KI P+D +EE+RC FIT NSDP Y+ YHDEEWGVPVH+DK Sbjct: 173 AHYGRTKSAKYEEKIGPVDP-LVITTREEKRCSFITPNSDPSYVEYHDEEWGVPVHDDKR 231 Query: 1309 LFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGIELSQV 1130 LFELLV+TGAQVGSDW TVLKKRQ++RDA + +DAEIV K+SEKK+ +I YGI+LSQV Sbjct: 232 LFELLVMTGAQVGSDWTTVLKKRQEYRDALSGYDAEIVGKFSEKKITSISAYYGIDLSQV 291 Query: 1129 RGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISKGMVRR 950 RGVVDN++RILEIKREFGSF KY+W FVN+KPI TQYKSC KIPVKTSKSE+ISK MVRR Sbjct: 292 RGVVDNSNRILEIKREFGSFHKYIWGFVNHKPITTQYKSCHKIPVKTSKSESISKDMVRR 351 Query: 949 GFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPAL 809 GFRLVGPTVI+SFM+AAGLTNDHLI+CPRHL+C AL+SH PAVAPAL Sbjct: 352 GFRLVGPTVIYSFMQAAGLTNDHLISCPRHLQCIALSSHQPAVAPAL 398 >emb|CAN68394.1| hypothetical protein VITISV_042519 [Vitis vinifera] Length = 398 Score = 479 bits (1234), Expect = e-132 Identities = 262/407 
(64%), Positives = 297/407 (72%), Gaps = 1/407 (0%) Frame = -3 Query: 2026 MCSSKSKPQGSTAAADVV-AVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVP 1850 MCSSKSK D+ + + INGRP LQP CNR P LER + Sbjct: 1 MCSSKSKLH---QGIDITPSKAQINGRPALQPTCNRIPSLERHHSFKKISPKSPTSPLPA 57 Query: 1849 LSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSNKT 1670 P KSPR PA KRGNDPNGLNSS+EKV LTP+ + K+ Sbjct: 58 SLPPPTTIINTTKTKPSLTPPASPNLKSPRQPALKRGNDPNGLNSSLEKV-LTPRGTTKS 116 Query: 1669 VAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRI 1490 + KK+K C G S D S L N SSS IVEAPGSIAAARREQ+AIMQVQRKMRI Sbjct: 117 SSSPKKTKK-CSAGL--APSSDTSSL-NYSSSFIVEAPGSIAAARREQMAIMQVQRKMRI 172 Query: 1489 AHYGRTKSAKYEGKIVPLDSSATAAVKEERRCHFITLNSDPIYIAYHDEEWGVPVHEDKM 1310 AHYGRTKSAKYE KI P+D +EE+RC FIT NSDP Y+ YHDEEWGVPVH+DK Sbjct: 173 AHYGRTKSAKYEEKISPVDP-LVITTREEKRCSFITPNSDPSYVEYHDEEWGVPVHDDKR 231 Query: 1309 LFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGIELSQV 1130 LFELLV+TGAQVGSDW TVLKKRQ++RDAF+ +DAEIV K+SEKK+ +I YGI+LSQV Sbjct: 232 LFELLVMTGAQVGSDWTTVLKKRQEYRDAFSGYDAEIVGKFSEKKITSISAYYGIDLSQV 291 Query: 1129 RGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISKGMVRR 950 RGVVDN++RILEIKREFGSF KY+W FVN+KPI TQ KSC KIPVKTSKSE+ISK MVRR Sbjct: 292 RGVVDNSNRILEIKREFGSFHKYIWGFVNHKPITTQXKSCHKIPVKTSKSESISKDMVRR 351 Query: 949 GFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPAL 809 GFRLVGPTVI+SFM+AAGLTNDHLI+CPRHL+C AL+SH PAVAPAL Sbjct: 352 GFRLVGPTVIYSFMQAAGLTNDHLISCPRHLQCIALSSHQPAVAPAL 398 >ref|XP_002303719.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] gi|550343248|gb|EEE78698.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] Length = 420 Score = 477 bits (1228), Expect = e-132 Identities = 256/413 (61%), Positives = 308/413 (74%), Gaps = 9/413 (2%) Frame = -3 Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQ--- 1856 MCSSKS+ ST+ ++ INGRPVLQP N+ P LER N + Sbjct: 1 MCSSKSRLNQSTSNI-ATTIAKINGRPVLQPKSNQVPSLERHNSLKKNSPPKSPTREPAG 59 Query: 1855 
--VPLSSPSXXXXXXXXXXXXXXXXXXXXXK-SPRPPATKRGNDPNGLNSSVEKVVLTPK 1685 VPL P+ SPRPPA KRGN+P GLN+S EKV LTP+ Sbjct: 60 PPVPLMQPACNAAGTKTRLPSALSPPISPKLKSPRPPAVKRGNEPGGLNTSAEKV-LTPR 118 Query: 1684 CSNK-TVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQV 1508 + K T + VKKSK S G + S+D +K SSSL+VEAPGSIAAARREQVA+MQ Sbjct: 119 STTKVTTSTVKKSKKSSTAGVPH--SVDTFAMK-YSSSLLVEAPGSIAAARREQVAVMQE 175 Query: 1507 QRKMRIAHYGRTKSAKYEGKIVPLDSSATAAV-KEERRCHFITLNSDPIYIAYHDEEWGV 1331 QRKMRIAHYGRTKSAKY+GKIVP +S AT+ + +EE+RC FIT NSDP+Y+AYHDEEWGV Sbjct: 176 QRKMRIAHYGRTKSAKYQGKIVPANSPATSTITREEKRCSFITPNSDPVYVAYHDEEWGV 235 Query: 1330 PVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDY 1151 PVH+DK+LFELL LTGAQVGS+W +VLKKR+ FR+AF+ FDAEIV+K++EKK+ +I +Y Sbjct: 236 PVHDDKLLFELLALTGAQVGSEWTSVLKKREAFREAFSGFDAEIVAKFTEKKIASISAEY 295 Query: 1150 GIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETI 971 G+++SQVRGVVDN++RILE+KREFGSFD+YLW +VN+KPI+TQYKSC KIPVKTSKSETI Sbjct: 296 GLDISQVRGVVDNSNRILEVKREFGSFDEYLWGYVNHKPISTQYKSCQKIPVKTSKSETI 355 Query: 970 SKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSP-AVAP 815 SK MV+RGFR VGPTVIHSFM+A GL+NDHLI CPRHL+C ALAS P VAP Sbjct: 356 SKDMVKRGFRFVGPTVIHSFMQAGGLSNDHLITCPRHLQCIALASQLPRTVAP 408 >ref|XP_002303720.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] gi|550343247|gb|EEE78699.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] Length = 417 Score = 477 bits (1228), Expect = e-132 Identities = 256/413 (61%), Positives = 308/413 (74%), Gaps = 9/413 (2%) Frame = -3 Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQ--- 1856 MCSSKS+ ST+ ++ INGRPVLQP N+ P LER N + Sbjct: 1 MCSSKSRLNQSTSNI-ATTIAKINGRPVLQPKSNQVPSLERHNSLKKNSPPKSPTREPAG 59 Query: 1855 --VPLSSPSXXXXXXXXXXXXXXXXXXXXXK-SPRPPATKRGNDPNGLNSSVEKVVLTPK 1685 VPL P+ SPRPPA KRGN+P GLN+S EKV LTP+ Sbjct: 60 PPVPLMQPACNAAGTKTRLPSALSPPISPKLKSPRPPAVKRGNEPGGLNTSAEKV-LTPR 118 Query: 1684 CSNK-TVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQV 1508 + K T + VKKSK S G + 
S+D +K SSSL+VEAPGSIAAARREQVA+MQ Sbjct: 119 STTKVTTSTVKKSKKSSTAGVPH--SVDTFAMK-YSSSLLVEAPGSIAAARREQVAVMQE 175 Query: 1507 QRKMRIAHYGRTKSAKYEGKIVPLDSSATAAV-KEERRCHFITLNSDPIYIAYHDEEWGV 1331 QRKMRIAHYGRTKSAKY+GKIVP +S AT+ + +EE+RC FIT NSDP+Y+AYHDEEWGV Sbjct: 176 QRKMRIAHYGRTKSAKYQGKIVPANSPATSTITREEKRCSFITPNSDPVYVAYHDEEWGV 235 Query: 1330 PVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDY 1151 PVH+DK+LFELL LTGAQVGS+W +VLKKR+ FR+AF+ FDAEIV+K++EKK+ +I +Y Sbjct: 236 PVHDDKLLFELLALTGAQVGSEWTSVLKKREAFREAFSGFDAEIVAKFTEKKIASISAEY 295 Query: 1150 GIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETI 971 G+++SQVRGVVDN++RILE+KREFGSFD+YLW +VN+KPI+TQYKSC KIPVKTSKSETI Sbjct: 296 GLDISQVRGVVDNSNRILEVKREFGSFDEYLWGYVNHKPISTQYKSCQKIPVKTSKSETI 355 Query: 970 SKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSP-AVAP 815 SK MV+RGFR VGPTVIHSFM+A GL+NDHLI CPRHL+C ALAS P VAP Sbjct: 356 SKDMVKRGFRFVGPTVIHSFMQAGGLSNDHLITCPRHLQCIALASQLPRTVAP 408 >ref|XP_004232605.1| PREDICTED: uncharacterized protein LOC101246304 [Solanum lycopersicum] Length = 395 Score = 467 bits (1202), Expect = e-129 Identities = 263/417 (63%), Positives = 302/417 (72%), Gaps = 11/417 (2%) Frame = -3 Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXL---- 1859 MC+SK+K Q S +S INGRPVLQP+ N PL ERRN Sbjct: 1 MCNSKTKLQSSAQT-----LSQINGRPVLQPHSNIVPLYERRNSLKKTTHTAAPVTANGS 55 Query: 1858 -QVPLSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGN--DPNGLNSSVEKVVLTP 1688 +V +SS + SPR PA KRGN DPNGL+SS EK+V Sbjct: 56 TKVKMSSSTTPPVSPKMK-------------SPRLPAIKRGNNIDPNGLSSSAEKIVTPK 102 Query: 1687 KCSNKTVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQV 1508 +NK +KK K S G G + +S++NS LK SSSLIVEAPGSIAAARREQVAI QV Sbjct: 103 GTANKAPILLKKPKKSSG-GLASPSSVENSSLK-YSSSLIVEAPGSIAAARREQVAIAQV 160 Query: 1507 QRKMRIAHYGRTKSAKYEGKIVPLDSSATAAV----KEERRCHFITLNSDPIYIAYHDEE 1340 QRKM+IAHYGRTKSAKYEGK+ LD S +AV +E++RC FIT NSDP+YIAYHDEE Sbjct: 161 QRKMKIAHYGRTKSAKYEGKVSSLDPSFASAVIPNPREDKRCSFITPNSDPLYIAYHDEE 220 Query: 1339 
WGVPVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTIC 1160 WGVPVH+D +LFELLVLTGAQVGSDW +VLKKRQ+FRDAF+ FD EIVSKY+EKK+ + Sbjct: 221 WGVPVHDDNLLFELLVLTGAQVGSDWTSVLKKRQEFRDAFSGFDPEIVSKYNEKKITSTS 280 Query: 1159 NDYGIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKS 980 +YGIELSQ+RG VDN++RILEIK+ FGSFDKYLW FVN KPIATQYK+C KIPVKTSKS Sbjct: 281 VEYGIELSQIRGAVDNSTRILEIKKTFGSFDKYLWGFVNNKPIATQYKACNKIPVKTSKS 340 Query: 979 ETISKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPAL 809 ETISK MV+RGFR VGPTVIHSFM+AAGLTNDHLI CPRHL C ALA+ PA PAL Sbjct: 341 ETISKDMVKRGFRYVGPTVIHSFMQAAGLTNDHLIACPRHLPCVALAT-QPA-PPAL 395 >gb|EMJ13510.1| hypothetical protein PRUPE_ppa006731mg [Prunus persica] Length = 397 Score = 466 bits (1199), Expect = e-128 Identities = 255/407 (62%), Positives = 296/407 (72%), Gaps = 2/407 (0%) Frame = -3 Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVPL 1847 MCSSK K Q +T+ + +N RPVLQP N+ P LE+R +P Sbjct: 1 MCSSKPKLQRTTSVPP--STPKMNRRPVLQPTGNQFPSLEQRKSLKKSSQEPLAPTPLPS 58 Query: 1846 SSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSNKTV 1667 PS SPRPPA KRG DPN LNSS EKVV TP+C+ K Sbjct: 59 PLPSAKTKASLSPPISPKLP------SPRPPAFKRGKDPNELNSSAEKVV-TPRCTTKFT 111 Query: 1666 APVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRIA 1487 + VKKSK S G ++ A S LKNISS LIVEAPGSIAAARREQVA MQ QRKMRIA Sbjct: 112 SSVKKSKKSSG--SVAAAPSAESILKNISS-LIVEAPGSIAAARREQVATMQEQRKMRIA 168 Query: 1486 HYGRTKSAKYEGKIVPLDSSATAAV-KEERRCHFITLNSDPIYIAYHDEEWGVPVHEDKM 1310 HYGRTKSAK EGK+VPLD+S T +++RRC FIT NSDPIY+AYHDEEWGVPVH+D + Sbjct: 169 HYGRTKSAKNEGKVVPLDASPTTDFGRDQRRCTFITPNSDPIYVAYHDEEWGVPVHDDNL 228 Query: 1309 LFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGIELSQV 1130 L ELLVLTGAQVGSDW +VL+KRQ R++F+ FDA+ V+K+SE+K+ ++ +D GI++S V Sbjct: 229 LLELLVLTGAQVGSDWTSVLRKRQALRESFSGFDADGVAKFSERKITSVSSDSGIDISLV 288 Query: 1129 RGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISKGMVRR 950 RG VDNA RIL+IKRE GSFDKYLW FVN+KPI+TQYKSC KIPVK SKSE+ISK MVRR Sbjct: 289 
RGAVDNAKRILQIKREVGSFDKYLWGFVNHKPISTQYKSCHKIPVKNSKSESISKDMVRR 348 Query: 949 GFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAA-LASHSPAVAPA 812 GFRLVGPTVIHSFM+AAGLTNDHLI CPRHL+CAA LAS P APA Sbjct: 349 GFRLVGPTVIHSFMQAAGLTNDHLITCPRHLQCAASLASSPPVAAPA 395 >gb|EXB83232.1| Putative Glutamine amidotransferase [Morus notabilis] Length = 394 Score = 465 bits (1196), Expect = e-128 Identities = 262/409 (64%), Positives = 295/409 (72%), Gaps = 3/409 (0%) Frame = -3 Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVPL 1847 MCSSK K T A INGRPVLQP CNR LERR PL Sbjct: 1 MCSSKPKTLLGTNTI-TSAEPKINGRPVLQPTCNRVSSLERR--MSLKKTTPKSPTSPPL 57 Query: 1846 SSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPN-GLNSSVEKVVLTPKCSNKT 1670 + P SPRPPA KRG DPN LNSS EKV LTP+C K+ Sbjct: 58 ALP-IQNGACKTKPSTLSPPVSPKLPSPRPPAIKRGKDPNYELNSSAEKV-LTPRCIIKS 115 Query: 1669 VAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRI 1490 + +KKSK CG + +L NS SSLIVEAPGSIAAARREQVAIMQ QRK+RI Sbjct: 116 TSSIKKSKK-CGGAGVVAETLKNS------SSLIVEAPGSIAAARREQVAIMQEQRKIRI 168 Query: 1489 AHYGRTKSAKYEGKIVP--LDSSATAAVKEERRCHFITLNSDPIYIAYHDEEWGVPVHED 1316 AHYGRTKSAK+EGK+V LDSS KE++RC +IT NSDPIY+AYHDEEWGVPVH+D Sbjct: 169 AHYGRTKSAKFEGKVVAPMLDSSVG---KEQKRCSYITPNSDPIYVAYHDEEWGVPVHDD 225 Query: 1315 KMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGIELS 1136 K+LFELLVLTGAQVGSDW +VLKKR+ FR+AF+ FDAE VSKY+EKK+ +I DYGIELS Sbjct: 226 KLLFELLVLTGAQVGSDWTSVLKKREIFRNAFSGFDAEAVSKYNEKKITSIGADYGIELS 285 Query: 1135 QVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISKGMV 956 +RG VDNA+RILEIK+EFGS +KYLW FVN K I+TQYKSC KIPVKTSKSE+ISK MV Sbjct: 286 LIRGAVDNANRILEIKKEFGSLNKYLWGFVNNKLISTQYKSCQKIPVKTSKSESISKDMV 345 Query: 955 RRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPAL 809 RRGFR VGPTVI+SFM+AAGLTNDHLI CPRHL+C ALAS P+VAPAL Sbjct: 346 RRGFRFVGPTVIYSFMQAAGLTNDHLITCPRHLQCLALASQLPSVAPAL 394 >ref|XP_006364818.1| PREDICTED: uncharacterized protein LOC102594852 [Solanum tuberosum] Length = 395 Score = 461 bits (1185), Expect = e-127 Identities = 
259/412 (62%), Positives = 298/412 (72%), Gaps = 6/412 (1%) Frame = -3 Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVPL 1847 MC+SK+K Q S +S INGRPVLQP+ N PL ERRN Sbjct: 1 MCNSKTKLQSSPQT-----LSQINGRPVLQPHSNIVPLYERRNSLKKTTNTA-------- 47 Query: 1846 SSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGN--DPNGLNSSVEKVVLTPKCSNK 1673 +S + KSPR PA KRGN DPNGL+SS EK+V +NK Sbjct: 48 ASVTANGSTKVKTSSSTTPPVSPKMKSPRLPAIKRGNNIDPNGLSSSAEKIVTPKGTANK 107 Query: 1672 TVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMR 1493 +KK K S G G + ++NS LK SSSLIVEAPGSIAAARREQVAI QVQRKM+ Sbjct: 108 APILLKKPKKSSG-GLASPPYVENSSLK-YSSSLIVEAPGSIAAARREQVAIAQVQRKMK 165 Query: 1492 IAHYGRTKSAKYEGKIVPLDSSATAAV----KEERRCHFITLNSDPIYIAYHDEEWGVPV 1325 IAHYGRTKSAKYEGK+ LD S +AV +EE+RC FIT NSDP+YIAYHDEEWGVPV Sbjct: 166 IAHYGRTKSAKYEGKVSSLDPSFASAVIPNPREEKRCSFITPNSDPLYIAYHDEEWGVPV 225 Query: 1324 HEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGI 1145 H+D +LFELLVLTGAQVGSDW +VL+KRQ+FRDAF+ FD EIVSKY+EKK+ + +YGI Sbjct: 226 HDDNLLFELLVLTGAQVGSDWTSVLRKRQEFRDAFSGFDPEIVSKYNEKKITSTSVEYGI 285 Query: 1144 ELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISK 965 ELSQ+RG VDN++RILEIK+ F SF+KYLW FVN KPIATQYK+C KIPVKTSKSETISK Sbjct: 286 ELSQIRGAVDNSTRILEIKKTFDSFNKYLWGFVNNKPIATQYKACNKIPVKTSKSETISK 345 Query: 964 GMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPAL 809 MV+RGFR VGPTVIHSFM+AAGLTNDHLI CPRHL+C ALA+ PA PAL Sbjct: 346 DMVKRGFRYVGPTVIHSFMQAAGLTNDHLIACPRHLQCMALAT-QPA-PPAL 395 >ref|XP_006468594.1| PREDICTED: uncharacterized protein LOC102614205 [Citrus sinensis] Length = 375 Score = 454 bits (1167), Expect = e-125 Identities = 249/408 (61%), Positives = 289/408 (70%), Gaps = 2/408 (0%) Frame = -3 Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVPL 1847 MCSSKSK +T INGRPVLQP N+ P LE+RN + Sbjct: 1 MCSSKSKLHSAT---------QINGRPVLQPTSNQVPSLEKRNSIKKTGSPKSPITTDNV 51 Query: 1846 SSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSNKTV 1667 +S S SPRP A KRGNDPN LN+S EK+ +TPK K 
Sbjct: 52 NSKSFTKSLLSPPVSPKLK-------SPRPAAVKRGNDPNVLNTSAEKI-MTPK---KLA 100 Query: 1666 APVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRIA 1487 + VKK KN + VA SSLIVEAPGSIAAARRE VAIMQ QRK+RIA Sbjct: 101 SLVKKPKN------VGVAPC-------YDSSLIVEAPGSIAAARREHVAIMQEQRKLRIA 147 Query: 1486 HYGRTKSAKYEGKIVPLDSSATAAV--KEERRCHFITLNSDPIYIAYHDEEWGVPVHEDK 1313 HYGRTKSAK+EGK+ LDS A +EE+RC FIT NSDPIY+AYHDEEWGVPVH+DK Sbjct: 148 HYGRTKSAKFEGKVPGLDSFANGDNNDREEKRCSFITPNSDPIYVAYHDEEWGVPVHDDK 207 Query: 1312 MLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGIELSQ 1133 +LFELLVLT AQVGSDW +VLKKRQ FR+AF+ FDAE+V+K++EKKM ++ +Y I+LSQ Sbjct: 208 LLFELLVLTAAQVGSDWTSVLKKRQAFREAFSGFDAEVVAKFTEKKMTSLSANYAIDLSQ 267 Query: 1132 VRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISKGMVR 953 VRG+VDN+ RILE+K++FGSFDKYLW FVN+KPI TQY+S KIPVKTSKSE ISK MV+ Sbjct: 268 VRGIVDNSIRILEVKKQFGSFDKYLWGFVNHKPINTQYRSSQKIPVKTSKSEAISKDMVK 327 Query: 952 RGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPAL 809 +GFR VGPTVIHSFM+AAGLTNDHLI C RHL+C ALASH PAVAPAL Sbjct: 328 KGFRFVGPTVIHSFMQAAGLTNDHLITCTRHLQCTALASHQPAVAPAL 375 >gb|EOX96813.1| DNA glycosylase superfamily protein, putative [Theobroma cacao] Length = 398 Score = 450 bits (1158), Expect = e-123 Identities = 245/408 (60%), Positives = 292/408 (71%), Gaps = 2/408 (0%) Frame = -3 Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVPL 1847 MC SK K + A VA INGRPVLQP N+ ++RN PL Sbjct: 1 MCCSKFKLHKDSNIASTVA--EINGRPVLQPPSNQITSSDKRNSLKKISSNSPAL-SAPL 57 Query: 1846 SSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSNKTV 1667 + SPRP A KRG D N LNSS EKV+ P+C+ K Sbjct: 58 QLSNSRARAVKATMPSLSPPISPK--SPRPTALKRGKDSNELNSSSEKVI-APRCNVKLD 114 Query: 1666 APVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRIA 1487 + VKK KN+ G G + + S+D K SS +++EAPGSIAAARREQVA++Q QRKMRIA Sbjct: 115 SKVKKPKNASG-GGVALTSVD---AKYSSSFMVLEAPGSIAAARREQVAMIQEQRKMRIA 170 Query: 1486 HYGRTKSAKYEGKIVPLDSSA--TAAVKEERRCHFITLNSDPIYIAYHDEEWGVPVHEDK 1313 HYGRTKSAKYE K+V LDSSA TAA 
+++RRC FIT+NSDP+Y AYHDEEWGV VH+DK Sbjct: 171 HYGRTKSAKYERKMVGLDSSAARTAARQDQRRCSFITVNSDPVYAAYHDEEWGVAVHDDK 230 Query: 1312 MLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGIELSQ 1133 +LFEL+VL GAQVGSDW +VLKKRQDFR+AF+ FDAE+++ +SEK + +I +DYGI++SQ Sbjct: 231 LLFELVVLIGAQVGSDWTSVLKKRQDFREAFSGFDAEVIAGFSEKNILSISSDYGIDVSQ 290 Query: 1132 VRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISKGMVR 953 VR VDNA+RILE+++EFGSF+ YLW FVN+KPI TQYKSC KIPVKTSKSE ISK MVR Sbjct: 291 VRAAVDNANRILEVRKEFGSFNNYLWGFVNHKPIVTQYKSCHKIPVKTSKSEAISKDMVR 350 Query: 952 RGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPAL 809 RGFR VGPTVIHS M+AAGLTNDHL CPRHL+C ALAS P VAPAL Sbjct: 351 RGFRFVGPTVIHSLMQAAGLTNDHLSTCPRHLQCIALASQFPTVAPAL 398 >gb|EMJ12930.1| hypothetical protein PRUPE_ppa006139mg [Prunus persica] Length = 426 Score = 442 bits (1138), Expect = e-121 Identities = 242/429 (56%), Positives = 293/429 (68%), Gaps = 23/429 (5%) Frame = -3 Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQ-VP 1850 MCSSK+K +VA INGRPVLQP CNR P L+RRN +P Sbjct: 1 MCSSKAKVTIGVEVTPMVA--RINGRPVLQPTCNRVPSLDRRNSIKKISTPRAPPPPPLP 58 Query: 1849 LSSPSXXXXXXXXXXXXXXXXXXXXXK-SPRPPATKRGNDPNGLNSSVEKVVLTPKCSNK 1673 SS S SPRPPA KRGNDPNGLNSS EKVV + Sbjct: 59 TSSASSTSPRISNKASSLLTPPISPKSKSPRPPAIKRGNDPNGLNSSSEKVVTPGGTTRA 118 Query: 1672 TVAPVKKSKN----SCGVGNINV--------------ASLDNSPLKNISSSLIVEAPGSI 1547 + KKSK+ S GV + +SL+ + SSSLI EAPGSI Sbjct: 119 KILERKKSKSFKRASVGVDGASADLHHHGDFSAGGFSSSLNIEASLSYSSSLITEAPGSI 178 Query: 1546 AAARREQVAIMQVQRKMRIAHYGRTKSAKYEGKIVPLDSSATAAVK---EERRCHFITLN 1376 AA RREQ+A+ QRKMRIAHYGR+KSA +E ++VP+D+S K EE+RC FIT N Sbjct: 179 AAVRREQMALQHAQRKMRIAHYGRSKSANFE-RVVPVDASGNIEAKGAEEEKRCSFITAN 237 Query: 1375 SDPIYIAYHDEEWGVPVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIV 1196 SDPIY+AYHDEEWGVPVH+DKMLFELLVL+GAQVGSDW ++LKKRQDFR+AF++FDAEIV Sbjct: 238 SDPIYVAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRNAFSDFDAEIV 297 Query: 1195 SKYSEKKMNTICNDYGIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYK 1016 + +++K+M +I 
++YGI++S+VRGVVDN++RILEIK+EFGSFDKY+W FVN KPI+ QYK Sbjct: 298 ANFTDKQMVSIGSEYGIDISRVRGVVDNSNRILEIKKEFGSFDKYIWGFVNQKPISPQYK 357 Query: 1015 SCLKIPVKTSKSETISKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALAS 836 KIPVKTSKSE+ISK MVRRGFR VGPTV+HSFM+A+GLTNDHLI C RHL+C LA+ Sbjct: 358 LGYKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQASGLTNDHLITCHRHLQCTLLAA 417 Query: 835 HSPAVAPAL 809 P + L Sbjct: 418 RRPTLEEVL 426 >ref|XP_003531474.1| PREDICTED: uncharacterized protein LOC100791725 [Glycine max] Length = 400 Score = 441 bits (1135), Expect = e-121 Identities = 241/406 (59%), Positives = 292/406 (71%), Gaps = 7/406 (1%) Frame = -3 Query: 2026 MCSSKSKPQ---GSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQ 1856 MCSSK+K + AA +V+ INGRPVLQP CNR P LERRN Sbjct: 1 MCSSKTKVTVGLEAVVAAAKPSVARINGRPVLQPTCNRVPNLERRNSIKKVAPPKS---- 56 Query: 1855 VPLSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSN 1676 LS PS SPR PATKRGND NGLNSS EK+V+ P+ S Sbjct: 57 --LSPPSPPLPSKTSLTPPVSPKLK----SPRLPATKRGNDNNGLNSSYEKIVI-PRSST 109 Query: 1675 KTVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKM 1496 KT +K S G+ AS++ S + SSSLI ++PGSIAA RREQ+A+ Q QRKM Sbjct: 110 KTPTLERKKSKSFKEGSCVSASIEAS--LSYSSSLITDSPGSIAAVRREQMALQQAQRKM 167 Query: 1495 RIAHYGRTKSAKYEGKIVPLDSSATAAV----KEERRCHFITLNSDPIYIAYHDEEWGVP 1328 +IAHYGR+KSAK+E ++VPLD S T+ +EE+RC FIT NSDPIYIAYHDEEWGVP Sbjct: 168 KIAHYGRSKSAKFE-RVVPLDPSNTSLASKPTEEEKRCSFITPNSDPIYIAYHDEEWGVP 226 Query: 1327 VHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYG 1148 VH+DKMLFELLVL+GAQVGSDW + LKKR DFR AF+EFDAE V+ ++K+M +I ++YG Sbjct: 227 VHDDKMLFELLVLSGAQVGSDWTSTLKKRLDFRAAFSEFDAETVANLTDKQMMSISSEYG 286 Query: 1147 IELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETIS 968 I++S+VRGVVDNA++ILEIK++FGSFDKY+W FVN+KPI+TQYK KIPVKTSKSE+IS Sbjct: 287 IDISRVRGVVDNANQILEIKKDFGSFDKYIWGFVNHKPISTQYKFGHKIPVKTSKSESIS 346 Query: 967 KGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHS 830 K MVRRGFR VGPTV+HSFM+ +GLTNDHLI C RHL+C LA+ S Sbjct: 347 KDMVRRGFRFVGPTVVHSFMQTSGLTNDHLITCHRHLQCTLLAARS 
392 >gb|EOY14286.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] Length = 409 Score = 440 bits (1131), Expect = e-120 Identities = 237/405 (58%), Positives = 293/405 (72%), Gaps = 6/405 (1%) Frame = -3 Query: 2026 MCSSKSKPQGSTAAADVV-AVSHINGRPVLQPNCNRSPLLERRN---XXXXXXXXXXXXL 1859 MCSS +K TA ++ AV+ INGRPVLQP CNR P L+RRN L Sbjct: 1 MCSSNAKV---TAGVEITPAVARINGRPVLQPTCNRVPSLDRRNSLKKIPPLSPPTPPSL 57 Query: 1858 QVPLSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCS 1679 L + S KSPRP A KRG+DPN LN+S EK V+TP+ Sbjct: 58 ASTLPATSATVGNGGRAKASLTPPISPKSKSPRPAAIKRGSDPNALNTSSEK-VMTPRNI 116 Query: 1678 NKTVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRK 1499 KT+ K G+GN + ++ P + SSSLIVEAPGSIAA RREQ+A+ Q QRK Sbjct: 117 TKTLERKKSKSFKEGMGNGLSSWIE--PSLSYSSSLIVEAPGSIAAVRREQMALQQAQRK 174 Query: 1498 MRIAHYGRTKSAKYEGKIVPLDSSA--TAAVKEERRCHFITLNSDPIYIAYHDEEWGVPV 1325 M+IAHYGR+KSAK+E K+VPL++S+ T +EE+RC FIT NSDP+Y+AYHDEEWGVPV Sbjct: 175 MKIAHYGRSKSAKFESKVVPLNTSSAMTKPDEEEKRCSFITPNSDPVYVAYHDEEWGVPV 234 Query: 1324 HEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGI 1145 H+D MLFELLVL+GAQVGSDW ++LKKRQDFRDAF+ FDAE V+K+++K+M TI ++YGI Sbjct: 235 HDDSMLFELLVLSGAQVGSDWISILKKRQDFRDAFSGFDAETVAKFTDKEMTTISSEYGI 294 Query: 1144 ELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISK 965 ++S+V GVVDN++RILE+K +FGSFDKY+W FVN+K I+TQYK KIPVKTSKSE+ISK Sbjct: 295 DISRVLGVVDNSNRILEVKGQFGSFDKYIWGFVNHKAISTQYKFGHKIPVKTSKSESISK 354 Query: 964 GMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHS 830 M+RRGFR VGPTV+HSFM+AAGLTNDHLI C RHL C LA+ S Sbjct: 355 DMLRRGFRCVGPTVVHSFMQAAGLTNDHLITCHRHLPCTLLAASS 399 >ref|XP_006448576.1| hypothetical protein CICLE_v10015639mg [Citrus clementina] gi|557551187|gb|ESR61816.1| hypothetical protein CICLE_v10015639mg [Citrus clementina] Length = 375 Score = 439 bits (1128), Expect = e-120 Identities = 241/408 (59%), Positives = 288/408 (70%), Gaps = 2/408 (0%) Frame = -3 Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVPL 1847 
MCSSKSK +T INGRPVLQP N+ P LE+R+ + Sbjct: 1 MCSSKSKLHSAT---------QINGRPVLQPTSNQVPSLEKRSSIKKTGSPKSPITTNNV 51 Query: 1846 SSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSNKTV 1667 +S S SPRP A KRGNDPN LN+S EK+ +TPK K Sbjct: 52 NSKSFTKSLLSPPVSPKLK-------SPRPAAVKRGNDPNVLNTSAEKI-MTPK---KLA 100 Query: 1666 APVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRIA 1487 + VKK KN+ + +P + SSLIVEAPGSIAAARRE VAIMQ QRK+RIA Sbjct: 101 SFVKKPKNA-----------EVAPCYD--SSLIVEAPGSIAAARREHVAIMQEQRKLRIA 147 Query: 1486 HYGRTKSAKYEGKIVPLDSSATAAV--KEERRCHFITLNSDPIYIAYHDEEWGVPVHEDK 1313 HYGRTKSAK+EGK+ LDS A +EE+RC FIT NSDP Y+AYHDEEWGVPVH+DK Sbjct: 148 HYGRTKSAKFEGKVPGLDSFANGDNNDREEKRCSFITPNSDPKYVAYHDEEWGVPVHDDK 207 Query: 1312 MLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGIELSQ 1133 +LFELLVLT AQVGSDW +VLKKR+ FR+AF+ FDAE+V+K++EKK+ ++ +Y I+LSQ Sbjct: 208 LLFELLVLTAAQVGSDWTSVLKKRRAFREAFSGFDAEVVAKFTEKKITSLSANYAIDLSQ 267 Query: 1132 VRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISKGMVR 953 VRG+VDN+ RILE+K++FGSFDKYLW FVN+K I TQY+S KIP KTSKSE ISK MV+ Sbjct: 268 VRGIVDNSIRILEVKKQFGSFDKYLWGFVNHKTINTQYRSSQKIPAKTSKSEAISKDMVK 327 Query: 952 RGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPAL 809 +GFR VGPTVIHSFM+AAGL+NDHLI C RHL+C ALASH PAVAPAL Sbjct: 328 KGFRFVGPTVIHSFMQAAGLSNDHLITCTRHLQCTALASHQPAVAPAL 375 >ref|XP_003546838.1| PREDICTED: uncharacterized protein LOC100811352 [Glycine max] Length = 400 Score = 438 bits (1126), Expect = e-120 Identities = 239/411 (58%), Positives = 291/411 (70%), Gaps = 12/411 (2%) Frame = -3 Query: 2026 MCSSKSK--------PQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXX 1871 MC SK+K +T +V+ INGRPVLQP CNR P LERRN Sbjct: 1 MCGSKTKVTIGLEVIAAAATTTTAKPSVARINGRPVLQPTCNRVPNLERRNSIKKVAPAK 60 Query: 1870 XXXLQVPLSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLT 1691 LS PS SPR PATKRGND NGLNSS EK+V+ Sbjct: 61 S------LSPPSPPLPSKTSLTPPVSPKSK----SPRLPATKRGNDNNGLNSSYEKIVI- 109 Query: 1690 PKCSNKTVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQ 1511 P+ S KT +K S G+ 
AS++ S + SSSLI ++PGSIAA RREQ+A+ Q Sbjct: 110 PRSSIKTPTLERKKSKSFKEGSCVSASIEAS--LSYSSSLITDSPGSIAAVRREQMALQQ 167 Query: 1510 VQRKMRIAHYGRTKSAKYEGKIVPLDSSATAAV----KEERRCHFITLNSDPIYIAYHDE 1343 QRKM+IAHYGR+KSAK+E ++VPLD S T+ +EE+RC FIT NSDPIYIAYHDE Sbjct: 168 AQRKMKIAHYGRSKSAKFE-RVVPLDPSNTSLASKPTEEEKRCSFITANSDPIYIAYHDE 226 Query: 1342 EWGVPVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTI 1163 EWGVPVH+DKMLFELLVL+GAQVGSDW + LKKR DFR AF+EFDAE V+ ++K+M +I Sbjct: 227 EWGVPVHDDKMLFELLVLSGAQVGSDWTSTLKKRLDFRAAFSEFDAETVANLTDKQMMSI 286 Query: 1162 CNDYGIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSK 983 ++YGI++S+VRGVVDNA++ILEIK++FGSFDKY+W FVN+KP++TQYK KIPVKTSK Sbjct: 287 SSEYGIDISRVRGVVDNANQILEIKKDFGSFDKYIWGFVNHKPLSTQYKFGHKIPVKTSK 346 Query: 982 SETISKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHS 830 SE+ISK MVRRGFR VGPTV+HSFM+A+GLTNDHLI C RHL+C LA+ S Sbjct: 347 SESISKDMVRRGFRYVGPTVVHSFMQASGLTNDHLITCHRHLQCTLLAARS 397 >gb|ESW21148.1| hypothetical protein PHAVU_005G045900g [Phaseolus vulgaris] Length = 405 Score = 437 bits (1124), Expect = e-120 Identities = 239/411 (58%), Positives = 292/411 (71%), Gaps = 14/411 (3%) Frame = -3 Query: 2026 MCSSKSKP----QGSTAAADVVA-----VSHINGRPVLQPNCNRSPLLERRNXXXXXXXX 1874 MCSSK+K +G AAA + V+ INGRPVLQP CNR P LERRN Sbjct: 1 MCSSKAKVTVGIEGVVAAATTTSTVMPSVARINGRPVLQPTCNRVPNLERRNSIKKVQPP 60 Query: 1873 XXXXL-QVPLSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVV 1697 PLSS + SPR PA KRGND NGLN+S EK+ Sbjct: 61 KSLSPPSPPLSSKTSLTPPVSPKSK-----------SPRLPAVKRGNDNNGLNTSYEKIA 109 Query: 1696 LTPKCSNKTVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAI 1517 + PK S+K +K S G+ AS + S + +SSLI ++PGSIAA RREQ+A+ Sbjct: 110 I-PKSSSKAPTLERKKSKSFKEGSCAPASTEAS--FSYASSLITDSPGSIAAVRREQMAL 166 Query: 1516 MQVQRKMRIAHYGRTKSAKYEGKIVPLDSSATAAV----KEERRCHFITLNSDPIYIAYH 1349 Q QRKM+IAHYGR+KSAK+E ++VPLD S T +EE+RC FIT NSDPIYIAYH Sbjct: 167 QQAQRKMKIAHYGRSKSAKFE-RVVPLDPSTTTLTSKPTEEEKRCSFITANSDPIYIAYH 225 Query: 1348 
DEEWGVPVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMN 1169 DEEWGVPVH+DKMLFELLVL+GAQVGSDW + LKKRQDFR AF++FDAE V+ ++K+M Sbjct: 226 DEEWGVPVHDDKMLFELLVLSGAQVGSDWTSTLKKRQDFRAAFSDFDAETVANLTDKQMM 285 Query: 1168 TICNDYGIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKT 989 +I ++YGI++S+VRGVVDNA++ILEIK++FGSFDKY+W FVN+KPI+TQYK KIPVKT Sbjct: 286 SISSEYGIDISRVRGVVDNANQILEIKKDFGSFDKYIWGFVNHKPISTQYKFGHKIPVKT 345 Query: 988 SKSETISKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALAS 836 SKSE+ISK MVRRG+R VGPTV+HSFM+AAGLTNDHLI C RHL+C LA+ Sbjct: 346 SKSESISKDMVRRGYRFVGPTVVHSFMQAAGLTNDHLITCHRHLQCTLLAA 396 >ref|XP_004295546.1| PREDICTED: uncharacterized protein LOC101298985 [Fragaria vesca subsp. vesca] Length = 410 Score = 435 bits (1119), Expect = e-119 Identities = 234/410 (57%), Positives = 289/410 (70%), Gaps = 12/410 (2%) Frame = -3 Query: 2026 MCSSKSKPQGSTAAADVVA-VSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVP 1850 MCSSK+K T ++ VS INGRPVLQP CNR P L+RRN L + Sbjct: 1 MCSSKAKV---TMGIEITPLVSRINGRPVLQPTCNRVPSLDRRNSLKKLSTPPPPPLPLS 57 Query: 1849 LSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKR-GNDPNGLNSSVEKVVLTPKCSNK 1673 +S + KSPRPPA KR GNDPNGLNSS EKVV + Sbjct: 58 NASSTSTSPRISTKASLTTPPVSPKSKSPRPPAIKRSGNDPNGLNSSSEKVVTPGGTTRA 117 Query: 1672 TVAPVKKSK---------NSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVA 1520 V KKSK N+ G ++ AS++ S + SSSLI EAPG+IAA RREQ+A Sbjct: 118 KVLERKKSKSFKLGVGADNAHDHGRLSSASIEAS--LSYSSSLITEAPGTIAAGRREQMA 175 Query: 1519 IMQVQRKMRIAHYGRTKSAKYEGKIVPLDS-SATAAVKEERRCHFITLNSDPIYIAYHDE 1343 + QRKMRIAHYGR+ SA +E ++ P+D+ A ++ +RC FIT NSDPIY+AYHD+ Sbjct: 176 LQHAQRKMRIAHYGRSNSANFE-RVAPIDTMEAKGGEEDHKRCSFITANSDPIYVAYHDQ 234 Query: 1342 EWGVPVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTI 1163 EWGVPVH+DKMLFELLVL+GAQVGSDW ++LKKRQDFRDAF+ FDAE V+ ++K+M +I Sbjct: 235 EWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRDAFSGFDAEAVANLTDKQMISI 294 Query: 1162 CNDYGIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSK 983 C++YGI++S+VRGVVDN++RILE+KREFGSF KY+W FVN+KPI+ QYK KIPVKTSK Sbjct: 295 
CSEYGIDISRVRGVVDNSNRILEVKREFGSFHKYIWGFVNHKPISPQYKQGYKIPVKTSK 354 Query: 982 SETISKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASH 833 SE+ISK MVRRGFR VGPTV+HSFM+A+GLTNDHL C RHL+C LA+H Sbjct: 355 SESISKDMVRRGFRFVGPTVVHSFMQASGLTNDHLTTCHRHLQCTLLAAH 404 >ref|XP_002530111.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] gi|223530365|gb|EEF32255.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] Length = 403 Score = 435 bits (1119), Expect = e-119 Identities = 241/409 (58%), Positives = 287/409 (70%), Gaps = 12/409 (2%) Frame = -3 Query: 2026 MCSSKSK--PQGSTAAAD----VVAVSHINGRPVLQPNCNRSPLLERRN--XXXXXXXXX 1871 MCSSKSK G+ AAA+ ++ INGRPVLQP ++ P LERRN Sbjct: 1 MCSSKSKLHHHGAAAAANHHIPASTIAKINGRPVLQPKSDQVPTLERRNSLKKNSPKSPI 60 Query: 1870 XXXLQVPLSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLT 1691 PL KSPRPPA KRGND N LNSS EK + Sbjct: 61 IQPPAAPLPLLPTTTTIKPKQPSSLSPPISPKLKSPRPPALKRGNDLNTLNSSAEKFLTP 120 Query: 1690 PKCSNKTVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQ 1511 K + T+ KKS + V + + N SSSLIVEAPGSIAAARRE VA MQ Sbjct: 121 RKAVSTTLKKSKKSSPATPV------VAETCTVLNYSSSLIVEAPGSIAAARREHVATMQ 174 Query: 1510 VQRKMRIAHYGRTKS---AKYEGKIVPLDS-SATAAVKEERRCHFITLNSDPIYIAYHDE 1343 QRK+R AHYGR S +K + KIVP+DS +ATA +EERRC FIT +SDPIY+AYHD+ Sbjct: 175 EQRKLRTAHYGRVNSGSKSKRDAKIVPVDSPAATAVPQEERRCSFITPSSDPIYVAYHDQ 234 Query: 1342 EWGVPVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTI 1163 EWGVPVH+DKMLFELLVLTGAQ+GSDW +VLKKR+ FR+AF+ FDAEIV+K+SEKK +I Sbjct: 235 EWGVPVHDDKMLFELLVLTGAQIGSDWTSVLKKREAFREAFSGFDAEIVAKFSEKKTTSI 294 Query: 1162 CNDYGIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSK 983 +YG+E+SQVRGVVDN++RIL++K+EFGSFDKYLW FVN+KPI TQY+S KIPVKTSK Sbjct: 295 SAEYGMEISQVRGVVDNSNRILQVKKEFGSFDKYLWGFVNHKPITTQYRSSNKIPVKTSK 354 Query: 982 SETISKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALAS 836 SETISK MV+RGFR VGPTV+HSFM+AAGL+NDHLI+C RH +C ALAS Sbjct: 355 SETISKDMVKRGFRYVGPTVMHSFMQAAGLSNDHLISCSRHHQCLALAS 403 >ref|XP_002315089.2| methyladenine glycosylase family 
protein [Populus trichocarpa] gi|550330066|gb|EEF01260.2| methyladenine glycosylase family protein [Populus trichocarpa] Length = 411 Score = 434 bits (1117), Expect = e-119 Identities = 236/417 (56%), Positives = 285/417 (68%), Gaps = 12/417 (2%) Frame = -3 Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVPL 1847 MCSS +K +T AV+ INGRPVLQP CNR P LER N PL Sbjct: 1 MCSSNAKV--TTGVEITPAVARINGRPVLQPTCNRVPTLERHNSLKKTAPKSPPPPPPPL 58 Query: 1846 SSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSNKTV 1667 P+ SPR PA KRG+D N LNSS +KVV+ + + Sbjct: 59 PPPTSANKTNKASPPLSPKSK-----SPRLPAIKRGSDANSLNSSSDKVVIPRSTAKTPI 113 Query: 1666 APVKKSKN--SCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMR 1493 KKSK+ VG+ ++S + L + SSSLIVEAPGSIAA RREQ+A+ QRKMR Sbjct: 114 LERKKSKSFKETSVGSGALSSSIEASL-SYSSSLIVEAPGSIAAVRREQMALQHAQRKMR 172 Query: 1492 IAHYGRTKSAKYEGKIVPLDSSATAAVK---EERRCHFITLNS-------DPIYIAYHDE 1343 IAHYGR+KS+++E K+VP+DSS K EE+RC FIT NS +PIY+AYHD+ Sbjct: 173 IAHYGRSKSSRFEAKVVPVDSSINVTTKTDEEEKRCSFITANSGKEKYEMNPIYVAYHDK 232 Query: 1342 EWGVPVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTI 1163 EWGVPVH+DKMLFELLVL+GAQVGSDW ++LKKRQDFRDAF+ FDAEIV+ +EK+M +I Sbjct: 233 EWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRDAFSGFDAEIVANITEKQMMSI 292 Query: 1162 CNDYGIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSK 983 +YGIE+S+VRGVVDN+ RILEIK+EFGSFD+Y+W FVN KP + QYK KIPVKTSK Sbjct: 293 SAEYGIEISRVRGVVDNSKRILEIKKEFGSFDRYIWTFVNNKPFSNQYKFGHKIPVKTSK 352 Query: 982 SETISKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPA 812 SETISK MVRRGFR VGPT++HSFM+A GLTNDHLI C RHL C +A+ P A A Sbjct: 353 SETISKDMVRRGFRFVGPTMVHSFMQAVGLTNDHLITCHRHLPCTLMAARRPTEAQA 409 >ref|XP_002312220.1| methyladenine glycosylase family protein [Populus trichocarpa] gi|118486806|gb|ABK95238.1| unknown [Populus trichocarpa] gi|222852040|gb|EEE89587.1| methyladenine glycosylase family protein [Populus trichocarpa] Length = 403 Score = 432 bits (1112), Expect = e-118 Identities = 240/406 (59%), Positives = 
284/406 (69%), Gaps = 9/406 (2%) Frame = -3 Query: 2026 MCSSKSKPQGSTAAADVV-AVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVP 1850 MCS K+K T D+ AV+ INGRPVLQP CN LERRN P Sbjct: 1 MCSFKAKV---TTGVDITPAVARINGRPVLQPTCNLVSTLERRN---------SLKKTAP 48 Query: 1849 LSS--PSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSN 1676 SS P KSPR PA KRG+D N LNSS EKVV+ + Sbjct: 49 KSSPPPPPPPPTFSNKTNKASPPLSPMSKSPRLPAIKRGSDANSLNSSSEKVVIPRNTTK 108 Query: 1675 KTVAPVKKSKN--SCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQR 1502 KKSK+ VG +S + L + SSSLIVEAPGSIAA RREQ+A+ QR Sbjct: 109 TPTLERKKSKSFKESSVGRGVHSSFIEASL-SYSSSLIVEAPGSIAAVRREQMALQHAQR 167 Query: 1501 KMRIAHYGRTKSAKYEGKIVPLDSSATAAVK----EERRCHFITLNSDPIYIAYHDEEWG 1334 KMRIAHYGR+KSA++E ++VP DSS + A K EE+RC FIT NSDPIY+AYHDEEWG Sbjct: 168 KMRIAHYGRSKSARFEDQVVPNDSSISMATKTDQEEEKRCSFITANSDPIYVAYHDEEWG 227 Query: 1333 VPVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICND 1154 VPVH+DKMLFELLVL+GAQVGSDW ++LKKRQDFRDAF+ FDAEIV+ SEK++ +I + Sbjct: 228 VPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRDAFSGFDAEIVANISEKQIMSISAE 287 Query: 1153 YGIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSET 974 YGI++S+VRGVVDN++RILEIK+EFGSFD+Y+W FVN KPI+T YK KIPVKTSKSET Sbjct: 288 YGIDMSRVRGVVDNSNRILEIKKEFGSFDRYIWTFVNNKPISTSYKFGHKIPVKTSKSET 347 Query: 973 ISKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALAS 836 ISK MVRRGFR VGPT++HSFM+AAGLTNDHLI C RHL C +A+ Sbjct: 348 ISKDMVRRGFRFVGPTMVHSFMQAAGLTNDHLITCHRHLPCTLMAA 393