BLASTX nr result
ID: Zingiber25_contig00014214
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Zingiber25_contig00014214 (1209 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_002303719.2| hypothetical protein POPTR_0003s15520g [Popu...  363   8e-98
ref|XP_002303720.2| hypothetical protein POPTR_0003s15520g [Popu...  363   8e-98
ref|XP_006468594.1| PREDICTED: uncharacterized protein LOC102614...  355   2e-95
gb|EMJ13510.1| hypothetical protein PRUPE_ppa006731mg [Prunus pe...  348   3e-93
gb|ESW21148.1| hypothetical protein PHAVU_005G045900g [Phaseolus...  348   3e-93
gb|EOX96813.1| DNA glycosylase superfamily protein, putative [Th...  348   3e-93
ref|XP_003545728.1| PREDICTED: uncharacterized protein LOC100793...  348   3e-93
gb|AFK37052.1| unknown [Medicago truncatula]                         346   1e-92
ref|XP_003546838.1| PREDICTED: uncharacterized protein LOC100811...  346   1e-92
ref|XP_003595924.1| DNA-3-methyladenine glycosylase [Medicago tr...  346   1e-92
ref|XP_003531474.1| PREDICTED: uncharacterized protein LOC100791...  345   2e-92
ref|XP_002263612.1| PREDICTED: uncharacterized protein LOC100256...  345   3e-92
ref|XP_004295546.1| PREDICTED: uncharacterized protein LOC101298...  343   8e-92
ref|XP_006448576.1| hypothetical protein CICLE_v10015639mg [Citr...  343   1e-91
ref|XP_003531809.1| PREDICTED: uncharacterized protein LOC100793...  342   1e-91
emb|CAN68394.1| hypothetical protein VITISV_042519 [Vitis vinifera]  342   1e-91
ref|XP_002312220.1| methyladenine glycosylase family protein [Po...  342   2e-91
ref|XP_002315089.2| methyladenine glycosylase family protein [Po...  342   2e-91
gb|EOY14286.1| DNA glycosylase superfamily protein isoform 1 [Th...
342 2e-91 gb|EMJ12930.1| hypothetical protein PRUPE_ppa006139mg [Prunus pe... 341 3e-91 >ref|XP_002303719.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] gi|550343248|gb|EEE78698.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] Length = 420 Score = 363 bits (932), Expect = 8e-98 Identities = 198/346 (57%), Positives = 244/346 (70%), Gaps = 11/346 (3%) Frame = +3 Query: 102 PTKLSPPVSPKPKQAKA-APQRGIESNGLSTSSDKLAVPKPTPTKLPRPVMKRSMSSVGA 278 P+ LSPP+SPK K + A +RG E GL+TS++K+ P+ T TK+ +K+S S A Sbjct: 79 PSALSPPISPKLKSPRPPAVKRGNEPGGLNTSAEKVLTPRST-TKVTTSTVKKSKKSSTA 137 Query: 279 G------QLAVSSESMSLLGFDRAPGSIAAAQREHAVLAQAKRKMRIAHYGRTP-AKLDG 437 G A+ S L+ APGSIAAA+RE + Q +RKMRIAHYGRT AK G Sbjct: 138 GVPHSVDTFAMKYSSSLLV---EAPGSIAAARREQVAVMQEQRKMRIAHYGRTKSAKYQG 194 Query: 438 KVVPVDSSTPNDANSQEEKRCSFITSNSDSVYVAYHDEEWGVPVHDDKMLFELLVLAGVQ 617 K+VP +S + ++EEKRCSFIT NSD VYVAYHDEEWGVPVHDDK+LFELL L G Q Sbjct: 195 KIVPANSPATSTI-TREEKRCSFITPNSDPVYVAYHDEEWGVPVHDDKLLFELLALTGAQ 253 Query: 618 VGLDWTTILXXXXXXXXXXXXXDAELVAMFTERQMASVCAACVLDIGKVRGVIDNAKRIL 797 VG +WT++L DAE+VA FTE+++AS+ A LDI +VRGV+DN+ RIL Sbjct: 254 VGSEWTSVLKKREAFREAFSGFDAEIVAKFTEKKIASISAEYGLDISQVRGVVDNSNRIL 313 Query: 798 EVRREFASFANYLWGFINHKPLSPNYRSCRKIPVKTSKSESISKDMVRRGFRFVGPTVIH 977 EV+REF SF YLWG++NHKP+S Y+SC+KIPVKTSKSE+ISKDMV+RGFRFVGPTVIH Sbjct: 314 EVKREFGSFDEYLWGYVNHKPISTQYKSCQKIPVKTSKSETISKDMVKRGFRFVGPTVIH 373 Query: 978 SFMQAAGLTNDHLVSCPRHLHCSMTTITNANDYP---A*PSLCKLI 1106 SFMQA GL+NDHL++CPRHL C I A+ P A PS KLI Sbjct: 374 SFMQAGGLSNDHLITCPRHLQC----IALASQLPRTVAPPSQKKLI 415 >ref|XP_002303720.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] gi|550343247|gb|EEE78699.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] Length = 417 Score = 363 bits (932), Expect = 8e-98 Identities = 198/346 (57%), Positives = 244/346 (70%), Gaps = 11/346 (3%) Frame = +3 Query: 102 PTKLSPPVSPKPKQAKA-APQRGIESNGLSTSSDKLAVPKPTPTKLPRPVMKRSMSSVGA 278 P+ LSPP+SPK K + A +RG E GL+TS++K+ P+ 
T TK+ +K+S S A Sbjct: 79 PSALSPPISPKLKSPRPPAVKRGNEPGGLNTSAEKVLTPRST-TKVTTSTVKKSKKSSTA 137 Query: 279 G------QLAVSSESMSLLGFDRAPGSIAAAQREHAVLAQAKRKMRIAHYGRTP-AKLDG 437 G A+ S L+ APGSIAAA+RE + Q +RKMRIAHYGRT AK G Sbjct: 138 GVPHSVDTFAMKYSSSLLV---EAPGSIAAARREQVAVMQEQRKMRIAHYGRTKSAKYQG 194 Query: 438 KVVPVDSSTPNDANSQEEKRCSFITSNSDSVYVAYHDEEWGVPVHDDKMLFELLVLAGVQ 617 K+VP +S + ++EEKRCSFIT NSD VYVAYHDEEWGVPVHDDK+LFELL L G Q Sbjct: 195 KIVPANSPATSTI-TREEKRCSFITPNSDPVYVAYHDEEWGVPVHDDKLLFELLALTGAQ 253 Query: 618 VGLDWTTILXXXXXXXXXXXXXDAELVAMFTERQMASVCAACVLDIGKVRGVIDNAKRIL 797 VG +WT++L DAE+VA FTE+++AS+ A LDI +VRGV+DN+ RIL Sbjct: 254 VGSEWTSVLKKREAFREAFSGFDAEIVAKFTEKKIASISAEYGLDISQVRGVVDNSNRIL 313 Query: 798 EVRREFASFANYLWGFINHKPLSPNYRSCRKIPVKTSKSESISKDMVRRGFRFVGPTVIH 977 EV+REF SF YLWG++NHKP+S Y+SC+KIPVKTSKSE+ISKDMV+RGFRFVGPTVIH Sbjct: 314 EVKREFGSFDEYLWGYVNHKPISTQYKSCQKIPVKTSKSETISKDMVKRGFRFVGPTVIH 373 Query: 978 SFMQAAGLTNDHLVSCPRHLHCSMTTITNANDYP---A*PSLCKLI 1106 SFMQA GL+NDHL++CPRHL C I A+ P A PS KLI Sbjct: 374 SFMQAGGLSNDHLITCPRHLQC----IALASQLPRTVAPPSQKKLI 415 >ref|XP_006468594.1| PREDICTED: uncharacterized protein LOC102614205 [Citrus sinensis] Length = 375 Score = 355 bits (912), Expect = 2e-95 Identities = 184/332 (55%), Positives = 234/332 (70%), Gaps = 2/332 (0%) Frame = +3 Query: 57 KNAARADADHNAPVTPTKLSPPVSPKPKQAK-AAPQRGIESNGLSTSSDKLAVPKPTPTK 233 K+ D ++ T + LSPPVSPK K + AA +RG + N L+TS++K+ PK + Sbjct: 43 KSPITTDNVNSKSFTKSLLSPPVSPKLKSPRPAAVKRGNDPNVLNTSAEKIMTPKKLASL 102 Query: 234 LPRPVMKRSMSSVGAGQLAVSSESMSLLGFDRAPGSIAAAQREHAVLAQAKRKMRIAHYG 413 + +P +VG SS + APGSIAAA+REH + Q +RK+RIAHYG Sbjct: 103 VKKP------KNVGVAPCYDSSLIVE------APGSIAAARREHVAIMQEQRKLRIAHYG 150 Query: 414 RTP-AKLDGKVVPVDSSTPNDANSQEEKRCSFITSNSDSVYVAYHDEEWGVPVHDDKMLF 590 RT AK +GKV +DS D N +EEKRCSFIT NSD +YVAYHDEEWGVPVHDDK+LF Sbjct: 151 RTKSAKFEGKVPGLDSFANGDNNDREEKRCSFITPNSDPIYVAYHDEEWGVPVHDDKLLF 210 Query: 591 ELLVLAGVQVGLDWTTILXXXXXXXXXXXXXDAELVAMFTERQMASVCAACVLDIGKVRG 770 ELLVL QVG DWT++L DAE+VA 
FTE++M S+ A +D+ +VRG Sbjct: 211 ELLVLTAAQVGSDWTSVLKKRQAFREAFSGFDAEVVAKFTEKKMTSLSANYAIDLSQVRG 270 Query: 771 VIDNAKRILEVRREFASFANYLWGFINHKPLSPNYRSCRKIPVKTSKSESISKDMVRRGF 950 ++DN+ RILEV+++F SF YLWGF+NHKP++ YRS +KIPVKTSKSE+ISKDMV++GF Sbjct: 271 IVDNSIRILEVKKQFGSFDKYLWGFVNHKPINTQYRSSQKIPVKTSKSEAISKDMVKKGF 330 Query: 951 RFVGPTVIHSFMQAAGLTNDHLVSCPRHLHCS 1046 RFVGPTVIHSFMQAAGLTNDHL++C RHL C+ Sbjct: 331 RFVGPTVIHSFMQAAGLTNDHLITCTRHLQCT 362 >gb|EMJ13510.1| hypothetical protein PRUPE_ppa006731mg [Prunus persica] Length = 397 Score = 348 bits (893), Expect = 3e-93 Identities = 193/352 (54%), Positives = 241/352 (68%), Gaps = 9/352 (2%) Frame = +3 Query: 18 TLQKSLSLPSSFAKNAARADADHNAPVTPTK--LSPPVSPK-PKQAKAAPQRGIESNGLS 188 +L++ SL S + A P TK LSPP+SPK P A +RG + N L+ Sbjct: 36 SLEQRKSLKKSSQEPLAPTPLPSPLPSAKTKASLSPPISPKLPSPRPPAFKRGKDPNELN 95 Query: 189 TSSDKLAVPKPTPTKLPRPVMKRSMSSVGAGQLAVSSESM-----SLLGFDRAPGSIAAA 353 +S++K+ P+ T TK V K+S S G+ A S+ES+ SL+ APGSIAAA Sbjct: 96 SSAEKVVTPRCT-TKFTSSV-KKSKKSSGSVAAAPSAESILKNISSLIV--EAPGSIAAA 151 Query: 354 QREHAVLAQAKRKMRIAHYGRTP-AKLDGKVVPVDSSTPNDANSQEEKRCSFITSNSDSV 530 +RE Q +RKMRIAHYGRT AK +GKVVP+D+S D ++++RC+FIT NSD + Sbjct: 152 RREQVATMQEQRKMRIAHYGRTKSAKNEGKVVPLDASPTTDFG-RDQRRCTFITPNSDPI 210 Query: 531 YVAYHDEEWGVPVHDDKMLFELLVLAGVQVGLDWTTILXXXXXXXXXXXXXDAELVAMFT 710 YVAYHDEEWGVPVHDD +L ELLVL G QVG DWT++L DA+ VA F+ Sbjct: 211 YVAYHDEEWGVPVHDDNLLLELLVLTGAQVGSDWTSVLRKRQALRESFSGFDADGVAKFS 270 Query: 711 ERQMASVCAACVLDIGKVRGVIDNAKRILEVRREFASFANYLWGFINHKPLSPNYRSCRK 890 ER++ SV + +DI VRG +DNAKRIL+++RE SF YLWGF+NHKP+S Y+SC K Sbjct: 271 ERKITSVSSDSGIDISLVRGAVDNAKRILQIKREVGSFDKYLWGFVNHKPISTQYKSCHK 330 Query: 891 IPVKTSKSESISKDMVRRGFRFVGPTVIHSFMQAAGLTNDHLVSCPRHLHCS 1046 IPVK SKSESISKDMVRRGFR VGPTVIHSFMQAAGLTNDHL++CPRHL C+ Sbjct: 331 IPVKNSKSESISKDMVRRGFRLVGPTVIHSFMQAAGLTNDHLITCPRHLQCA 382 >gb|ESW21148.1| hypothetical protein PHAVU_005G045900g [Phaseolus vulgaris] Length = 405 Score = 348 bits (892), Expect = 3e-93 Identities = 181/322 (56%), 
Positives = 233/322 (72%), Gaps = 7/322 (2%) Frame = +3 Query: 105 TKLSPPVSPKPKQAKA-APQRGIESNGLSTSSDKLAVPKPTPTKLPRPVMKRSMSSVGAG 281 T L+PPVSPK K + A +RG ++NGL+TS +K+A+PK + +K P K+S S Sbjct: 74 TSLTPPVSPKSKSPRLPAVKRGNDNNGLNTSYEKIAIPKSS-SKAPTLERKKSKSFKEGS 132 Query: 282 QLAVSSESM----SLLGFDRAPGSIAAAQREHAVLAQAKRKMRIAHYGRTPAKLDGKVVP 449 S+E+ S L D +PGSIAA +RE L QA+RKM+IAHYGR+ + +VVP Sbjct: 133 CAPASTEASFSYASSLITD-SPGSIAAVRREQMALQQAQRKMKIAHYGRSKSAKFERVVP 191 Query: 450 VDSSTPNDAN--SQEEKRCSFITSNSDSVYVAYHDEEWGVPVHDDKMLFELLVLAGVQVG 623 +D ST + ++EEKRCSFIT+NSD +Y+AYHDEEWGVPVHDDKMLFELLVL+G QVG Sbjct: 192 LDPSTTTLTSKPTEEEKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVG 251 Query: 624 LDWTTILXXXXXXXXXXXXXDAELVAMFTERQMASVCAACVLDIGKVRGVIDNAKRILEV 803 DWT+ L DAE VA T++QM S+ + +DI +VRGV+DNA +ILE+ Sbjct: 252 SDWTSTLKKRQDFRAAFSDFDAETVANLTDKQMMSISSEYGIDISRVRGVVDNANQILEI 311 Query: 804 RREFASFANYLWGFINHKPLSPNYRSCRKIPVKTSKSESISKDMVRRGFRFVGPTVIHSF 983 +++F SF Y+WGF+NHKP+S Y+ KIPVKTSKSESISKDMVRRG+RFVGPTV+HSF Sbjct: 312 KKDFGSFDKYIWGFVNHKPISTQYKFGHKIPVKTSKSESISKDMVRRGYRFVGPTVVHSF 371 Query: 984 MQAAGLTNDHLVSCPRHLHCSM 1049 MQAAGLTNDHL++C RHL C++ Sbjct: 372 MQAAGLTNDHLITCHRHLQCTL 393 >gb|EOX96813.1| DNA glycosylase superfamily protein, putative [Theobroma cacao] Length = 398 Score = 348 bits (892), Expect = 3e-93 Identities = 192/361 (53%), Positives = 240/361 (66%), Gaps = 4/361 (1%) Frame = +3 Query: 9 LKKTLQKS--LSLPSSFAKNAARADADHNAPVTPTKLSPPVSPKPKQAKAAPQRGIESNG 182 LKK S LS P + + ARA T LSPP+SPK + A +RG +SN Sbjct: 43 LKKISSNSPALSAPLQLSNSRARA-----VKATMPSLSPPISPKSPRPTAL-KRGKDSNE 96 Query: 183 LSTSSDKLAVPKPTPTKLPRPVMK-RSMSSVGAGQLAVSSESMSLLGFDRAPGSIAAAQR 359 L++SS+K+ P+ KL V K ++ S G +V ++ S APGSIAAA+R Sbjct: 97 LNSSSEKVIAPRCN-VKLDSKVKKPKNASGGGVALTSVDAKYSSSFMVLEAPGSIAAARR 155 Query: 360 EHAVLAQAKRKMRIAHYGRTP-AKLDGKVVPVDSSTPNDANSQEEKRCSFITSNSDSVYV 536 E + Q +RKMRIAHYGRT AK + K+V +DSS A Q+++RCSFIT NSD VY Sbjct: 156 EQVAMIQEQRKMRIAHYGRTKSAKYERKMVGLDSSAARTAARQDQRRCSFITVNSDPVYA 215 Query: 537 
AYHDEEWGVPVHDDKMLFELLVLAGVQVGLDWTTILXXXXXXXXXXXXXDAELVAMFTER 716 AYHDEEWGV VHDDK+LFEL+VL G QVG DWT++L DAE++A F+E+ Sbjct: 216 AYHDEEWGVAVHDDKLLFELVVLIGAQVGSDWTSVLKKRQDFREAFSGFDAEVIAGFSEK 275 Query: 717 QMASVCAACVLDIGKVRGVIDNAKRILEVRREFASFANYLWGFINHKPLSPNYRSCRKIP 896 + S+ + +D+ +VR +DNA RILEVR+EF SF NYLWGF+NHKP+ Y+SC KIP Sbjct: 276 NILSISSDYGIDVSQVRAAVDNANRILEVRKEFGSFNNYLWGFVNHKPIVTQYKSCHKIP 335 Query: 897 VKTSKSESISKDMVRRGFRFVGPTVIHSFMQAAGLTNDHLVSCPRHLHCSMTTITNANDY 1076 VKTSKSE+ISKDMVRRGFRFVGPTVIHS MQAAGLTNDHL +CPRHL C I A+ + Sbjct: 336 VKTSKSEAISKDMVRRGFRFVGPTVIHSLMQAAGLTNDHLSTCPRHLQC----IALASQF 391 Query: 1077 P 1079 P Sbjct: 392 P 392 >ref|XP_003545728.1| PREDICTED: uncharacterized protein LOC100793449 [Glycine max] Length = 398 Score = 348 bits (892), Expect = 3e-93 Identities = 184/353 (52%), Positives = 241/353 (68%), Gaps = 9/353 (2%) Frame = +3 Query: 30 SLSLPSSFAKNAARADADHNAPV-TPTKLSPPVSPKPKQAKAAP-QRGIESNGLSTSSDK 203 +L +S K + ++ + + P+ + T L+PPVSPK K + P +RG ESNGL++SS+K Sbjct: 38 NLERRNSIKKLSPKSRSPPSPPLLSKTSLTPPVSPKSKSPRPPPIKRGNESNGLNSSSEK 97 Query: 204 LAVPKPTPTKLPRPVMKRSMS----SVGAGQLAVSSE---SMSLLGFDRAPGSIAAAQRE 362 + P+ T K P K+S S S GA L+ S+E S S +PGSIAA +RE Sbjct: 98 IVTPRNT-IKTPTLERKKSKSFKEGSCGALGLSASTEASLSYSSTLITESPGSIAAVRRE 156 Query: 363 HAVLAQAKRKMRIAHYGRTPAKLDGKVVPVDSSTPNDANSQEEKRCSFITSNSDSVYVAY 542 L A+RKM+IAHYGR+ + +V+P++ ST + + EEKRCSFIT+NSD +Y+AY Sbjct: 157 QMALQHAQRKMKIAHYGRSKSAKFARVIPLEPSTNLTSKTSEEKRCSFITANSDPIYIAY 216 Query: 543 HDEEWGVPVHDDKMLFELLVLAGVQVGLDWTTILXXXXXXXXXXXXXDAELVAMFTERQM 722 HDEEWGVPVHDDKMLFELLVL+G QVG DWT+IL DA +A T++QM Sbjct: 217 HDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRTAFSEFDAATLANLTDKQM 276 Query: 723 ASVCAACVLDIGKVRGVIDNAKRILEVRREFASFANYLWGFINHKPLSPNYRSCRKIPVK 902 S+ +DI +VRGV+DNA RIL + ++F SF Y+W F+NHKP+S Y+ KIPVK Sbjct: 277 VSISMEYDIDISRVRGVVDNANRILAINKDFGSFDKYIWDFVNHKPISTQYKFGHKIPVK 336 Query: 903 TSKSESISKDMVRRGFRFVGPTVIHSFMQAAGLTNDHLVSCPRHLHCSMTTIT 1061 TSKSESISKDM+RRGFR VGPTV+HSFMQAAGLTNDHL++C RHL C++ T Sbjct: 
337 TSKSESISKDMIRRGFRCVGPTVLHSFMQAAGLTNDHLITCHRHLQCTLLAST 389 >gb|AFK37052.1| unknown [Medicago truncatula] Length = 390 Score = 346 bits (888), Expect = 1e-92 Identities = 183/357 (51%), Positives = 242/357 (67%), Gaps = 10/357 (2%) Frame = +3 Query: 9 LKKTLQKSLS-LPSSFAKNAARADADHNAPVTPTKLSPPVSPKPKQAKA----APQRGIE 173 +KK+ KSLS LP N + L+PP+SPKPK + A +RG + Sbjct: 46 IKKSTPKSLSPLPLPNKTNTS-------------SLTPPISPKPKSPTSTRPLAIKRGND 92 Query: 174 SNGLSTSSDKLAVPKPTPTKLPRPVMKRSMS-SVGAGQLAVSSESMSLLG--FDRAPGSI 344 +NGL+ S +K+++PK + P ++R S S G + + S+S +PGSI Sbjct: 93 NNGLNLSCEKISIPKNI---MKTPTLERKKSKSFKEGSFGIEAASLSYSSSLITDSPGSI 149 Query: 345 AAAQREHAVLAQAKRKMRIAHYGRTPAKLDGKVVPVDSSTPNDAN--SQEEKRCSFITSN 518 AA +RE L QA+RKM+IAHYGR+ + +V P+D S+ D+ +QEEKRCSFIT+N Sbjct: 150 AAVRREQVALQQAQRKMKIAHYGRSKSAKFERVFPIDPSSALDSKITNQEEKRCSFITTN 209 Query: 519 SDSVYVAYHDEEWGVPVHDDKMLFELLVLAGVQVGLDWTTILXXXXXXXXXXXXXDAELV 698 SD +Y+AYHDEEWGVPVHDDKMLFELL+L+G QVG DWT+ L DAE+V Sbjct: 210 SDPIYIAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSTLKKRLDFRAAFSEFDAEIV 269 Query: 699 AMFTERQMASVCAACVLDIGKVRGVIDNAKRILEVRREFASFANYLWGFINHKPLSPNYR 878 A T++QM S+ + +DI KVRGV+DNA +IL+VR+ F SF Y+WGF+NHKP+S Y+ Sbjct: 270 ANLTDKQMMSISSEYGIDISKVRGVVDNANQILQVRKGFGSFDKYIWGFVNHKPISNQYK 329 Query: 879 SCRKIPVKTSKSESISKDMVRRGFRFVGPTVIHSFMQAAGLTNDHLVSCPRHLHCSM 1049 KIPVKTSKSESISKDM++RGFR+VGPTV+HSFMQAAGLTNDHL++C RHL C++ Sbjct: 330 FGHKIPVKTSKSESISKDMIKRGFRYVGPTVVHSFMQAAGLTNDHLITCHRHLQCTL 386 >ref|XP_003546838.1| PREDICTED: uncharacterized protein LOC100811352 [Glycine max] Length = 400 Score = 346 bits (888), Expect = 1e-92 Identities = 186/350 (53%), Positives = 243/350 (69%), Gaps = 10/350 (2%) Frame = +3 Query: 30 SLSLPSSFAKNAARADADHNAPVTPTK--LSPPVSPKPKQAKA-APQRGIESNGLSTSSD 200 +L +S K A +P P+K L+PPVSPK K + A +RG ++NGL++S + Sbjct: 46 NLERRNSIKKVAPAKSLSPPSPPLPSKTSLTPPVSPKSKSPRLPATKRGNDNNGLNSSYE 105 Query: 201 KLAVPKPTPTKLPRPVMKRSMS-----SVGAGQLAVSSESMSLLGFDRAPGSIAAAQREH 365 K+ +P+ + K P K+S S V A A S S SL+ +PGSIAA +RE Sbjct: 106 
KIVIPRSS-IKTPTLERKKSKSFKEGSCVSASIEASLSYSSSLI--TDSPGSIAAVRREQ 162 Query: 366 AVLAQAKRKMRIAHYGRTPAKLDGKVVPVDSSTPNDAN--SQEEKRCSFITSNSDSVYVA 539 L QA+RKM+IAHYGR+ + +VVP+D S + A+ ++EEKRCSFIT+NSD +Y+A Sbjct: 163 MALQQAQRKMKIAHYGRSKSAKFERVVPLDPSNTSLASKPTEEEKRCSFITANSDPIYIA 222 Query: 540 YHDEEWGVPVHDDKMLFELLVLAGVQVGLDWTTILXXXXXXXXXXXXXDAELVAMFTERQ 719 YHDEEWGVPVHDDKMLFELLVL+G QVG DWT+ L DAE VA T++Q Sbjct: 223 YHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSTLKKRLDFRAAFSEFDAETVANLTDKQ 282 Query: 720 MASVCAACVLDIGKVRGVIDNAKRILEVRREFASFANYLWGFINHKPLSPNYRSCRKIPV 899 M S+ + +DI +VRGV+DNA +ILE++++F SF Y+WGF+NHKPLS Y+ KIPV Sbjct: 283 MMSISSEYGIDISRVRGVVDNANQILEIKKDFGSFDKYIWGFVNHKPLSTQYKFGHKIPV 342 Query: 900 KTSKSESISKDMVRRGFRFVGPTVIHSFMQAAGLTNDHLVSCPRHLHCSM 1049 KTSKSESISKDMVRRGFR+VGPTV+HSFMQA+GLTNDHL++C RHL C++ Sbjct: 343 KTSKSESISKDMVRRGFRYVGPTVVHSFMQASGLTNDHLITCHRHLQCTL 392 >ref|XP_003595924.1| DNA-3-methyladenine glycosylase [Medicago truncatula] gi|355484972|gb|AES66175.1| DNA-3-methyladenine glycosylase [Medicago truncatula] Length = 390 Score = 346 bits (888), Expect = 1e-92 Identities = 183/357 (51%), Positives = 242/357 (67%), Gaps = 10/357 (2%) Frame = +3 Query: 9 LKKTLQKSLS-LPSSFAKNAARADADHNAPVTPTKLSPPVSPKPKQAKA----APQRGIE 173 +KK+ KSLS LP N + L+PP+SPKPK + A +RG + Sbjct: 46 IKKSTPKSLSPLPLPNKTNTS-------------SLTPPISPKPKSPTSTRPLAIKRGND 92 Query: 174 SNGLSTSSDKLAVPKPTPTKLPRPVMKRSMS-SVGAGQLAVSSESMSLLG--FDRAPGSI 344 +NGL+ S +K+++PK + P ++R S S G + + S+S +PGSI Sbjct: 93 NNGLNLSCEKISIPKNI---MKTPTLERKKSKSFKEGSFGIEAASLSYSSSLITDSPGSI 149 Query: 345 AAAQREHAVLAQAKRKMRIAHYGRTPAKLDGKVVPVDSSTPNDANS--QEEKRCSFITSN 518 AA +RE L QA+RKM+IAHYGR+ + +V P+D S+ D+ + QEEKRCSFIT+N Sbjct: 150 AAVRREQVALQQAQRKMKIAHYGRSKSAKFERVFPIDPSSALDSKTTNQEEKRCSFITTN 209 Query: 519 SDSVYVAYHDEEWGVPVHDDKMLFELLVLAGVQVGLDWTTILXXXXXXXXXXXXXDAELV 698 SD +Y+AYHDEEWGVPVHDDKMLFELL+L+G QVG DWT+ L DAE+V Sbjct: 210 SDPIYIAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSTLKKRLDFRAAFSEFDAEIV 269 Query: 699 
AMFTERQMASVCAACVLDIGKVRGVIDNAKRILEVRREFASFANYLWGFINHKPLSPNYR 878 A T++QM S+ + +DI KVRGV+DNA +IL+VR+ F SF Y+WGF+NHKP+S Y+ Sbjct: 270 ANLTDKQMMSISSEYGIDISKVRGVVDNANQILQVRKGFGSFDKYIWGFVNHKPISNQYK 329 Query: 879 SCRKIPVKTSKSESISKDMVRRGFRFVGPTVIHSFMQAAGLTNDHLVSCPRHLHCSM 1049 KIPVKTSKSESISKDM++RGFR+VGPTV+HSFMQAAGLTNDHL++C RHL C++ Sbjct: 330 FGHKIPVKTSKSESISKDMIKRGFRYVGPTVVHSFMQAAGLTNDHLITCHRHLQCTL 386 >ref|XP_003531474.1| PREDICTED: uncharacterized protein LOC100791725 [Glycine max] Length = 400 Score = 345 bits (886), Expect = 2e-92 Identities = 186/350 (53%), Positives = 242/350 (69%), Gaps = 10/350 (2%) Frame = +3 Query: 30 SLSLPSSFAKNAARADADHNAPVTPTK--LSPPVSPKPKQAKA-APQRGIESNGLSTSSD 200 +L +S K A +P P+K L+PPVSPK K + A +RG ++NGL++S + Sbjct: 41 NLERRNSIKKVAPPKSLSPPSPPLPSKTSLTPPVSPKLKSPRLPATKRGNDNNGLNSSYE 100 Query: 201 KLAVPKPTPTKLPRPVMKRSMS-----SVGAGQLAVSSESMSLLGFDRAPGSIAAAQREH 365 K+ +P+ + TK P K+S S V A A S S SL+ +PGSIAA +RE Sbjct: 101 KIVIPRSS-TKTPTLERKKSKSFKEGSCVSASIEASLSYSSSLI--TDSPGSIAAVRREQ 157 Query: 366 AVLAQAKRKMRIAHYGRTPAKLDGKVVPVDSSTPNDAN--SQEEKRCSFITSNSDSVYVA 539 L QA+RKM+IAHYGR+ + +VVP+D S + A+ ++EEKRCSFIT NSD +Y+A Sbjct: 158 MALQQAQRKMKIAHYGRSKSAKFERVVPLDPSNTSLASKPTEEEKRCSFITPNSDPIYIA 217 Query: 540 YHDEEWGVPVHDDKMLFELLVLAGVQVGLDWTTILXXXXXXXXXXXXXDAELVAMFTERQ 719 YHDEEWGVPVHDDKMLFELLVL+G QVG DWT+ L DAE VA T++Q Sbjct: 218 YHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSTLKKRLDFRAAFSEFDAETVANLTDKQ 277 Query: 720 MASVCAACVLDIGKVRGVIDNAKRILEVRREFASFANYLWGFINHKPLSPNYRSCRKIPV 899 M S+ + +DI +VRGV+DNA +ILE++++F SF Y+WGF+NHKP+S Y+ KIPV Sbjct: 278 MMSISSEYGIDISRVRGVVDNANQILEIKKDFGSFDKYIWGFVNHKPISTQYKFGHKIPV 337 Query: 900 KTSKSESISKDMVRRGFRFVGPTVIHSFMQAAGLTNDHLVSCPRHLHCSM 1049 KTSKSESISKDMVRRGFRFVGPTV+HSFMQ +GLTNDHL++C RHL C++ Sbjct: 338 KTSKSESISKDMVRRGFRFVGPTVVHSFMQTSGLTNDHLITCHRHLQCTL 387 >ref|XP_002263612.1| PREDICTED: uncharacterized protein LOC100256507 [Vitis vinifera] gi|297738175|emb|CBI27376.3| unnamed protein product [Vitis vinifera] Length = 398 Score = 345 bits 
(884), Expect = 3e-92 Identities = 179/321 (55%), Positives = 220/321 (68%), Gaps = 2/321 (0%) Frame = +3 Query: 87 NAPVTPTKLSPPVSPKPKQAKA-APQRGIESNGLSTSSDKLAVPKPTPTKLPRPVMKRSM 263 N T L+PP SP K + A +RG + NGL++S +K+ P+ T P + Sbjct: 67 NTTKTKPSLTPPASPNLKSPRQPALKRGNDPNGLNSSLEKVLTPRGTTKSSSSPKKTKKC 126 Query: 264 SSVGAGQLAVSSESMSLLGFDRAPGSIAAAQREHAVLAQAKRKMRIAHYGRTP-AKLDGK 440 S+ A SS + S APGSIAAA+RE + Q +RKMRIAHYGRT AK + K Sbjct: 127 SAGLAPSSDTSSLNYSSSLIVEAPGSIAAARREQMAIMQVQRKMRIAHYGRTKSAKYEEK 186 Query: 441 VVPVDSSTPNDANSQEEKRCSFITSNSDSVYVAYHDEEWGVPVHDDKMLFELLVLAGVQV 620 + PVD P ++EEKRCSFIT NSD YV YHDEEWGVPVHDDK LFELLV+ G QV Sbjct: 187 IGPVD---PLVITTREEKRCSFITPNSDPSYVEYHDEEWGVPVHDDKRLFELLVMTGAQV 243 Query: 621 GLDWTTILXXXXXXXXXXXXXDAELVAMFTERQMASVCAACVLDIGKVRGVIDNAKRILE 800 G DWTT+L DAE+V F+E+++ S+ A +D+ +VRGV+DN+ RILE Sbjct: 244 GSDWTTVLKKRQEYRDALSGYDAEIVGKFSEKKITSISAYYGIDLSQVRGVVDNSNRILE 303 Query: 801 VRREFASFANYLWGFINHKPLSPNYRSCRKIPVKTSKSESISKDMVRRGFRFVGPTVIHS 980 ++REF SF Y+WGF+NHKP++ Y+SC KIPVKTSKSESISKDMVRRGFR VGPTVI+S Sbjct: 304 IKREFGSFHKYIWGFVNHKPITTQYKSCHKIPVKTSKSESISKDMVRRGFRLVGPTVIYS 363 Query: 981 FMQAAGLTNDHLVSCPRHLHC 1043 FMQAAGLTNDHL+SCPRHL C Sbjct: 364 FMQAAGLTNDHLISCPRHLQC 384 >ref|XP_004295546.1| PREDICTED: uncharacterized protein LOC101298985 [Fragaria vesca subsp. 
vesca] Length = 410 Score = 343 bits (880), Expect = 8e-92 Identities = 188/358 (52%), Positives = 234/358 (65%), Gaps = 17/358 (4%) Frame = +3 Query: 27 KSLSLPSSFAKNAARADADHNAPVTPTKLS---PPVSPKPKQAK--AAPQRGIESNGLST 191 K LS P + A + +P TK S PPVSPK K + A + G + NGL++ Sbjct: 44 KKLSTPPPPPLPLSNASSTSTSPRISTKASLTTPPVSPKSKSPRPPAIKRSGNDPNGLNS 103 Query: 192 SSDKLAVPKPTPTK--LPRPVMKRSMSSVGA------GQLAVSSESMSLLG----FDRAP 335 SS+K+ P T L R K VGA G+L+ +S SL AP Sbjct: 104 SSEKVVTPGGTTRAKVLERKKSKSFKLGVGADNAHDHGRLSSASIEASLSYSSSLITEAP 163 Query: 336 GSIAAAQREHAVLAQAKRKMRIAHYGRTPAKLDGKVVPVDSSTPNDANSQEEKRCSFITS 515 G+IAA +RE L A+RKMRIAHYGR+ + +V P+D+ ++ KRCSFIT+ Sbjct: 164 GTIAAGRREQMALQHAQRKMRIAHYGRSNSANFERVAPIDTMEAK-GGEEDHKRCSFITA 222 Query: 516 NSDSVYVAYHDEEWGVPVHDDKMLFELLVLAGVQVGLDWTTILXXXXXXXXXXXXXDAEL 695 NSD +YVAYHD+EWGVPVHDDKMLFELLVL+G QVG DWT+IL DAE Sbjct: 223 NSDPIYVAYHDQEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRDAFSGFDAEA 282 Query: 696 VAMFTERQMASVCAACVLDIGKVRGVIDNAKRILEVRREFASFANYLWGFINHKPLSPNY 875 VA T++QM S+C+ +DI +VRGV+DN+ RILEV+REF SF Y+WGF+NHKP+SP Y Sbjct: 283 VANLTDKQMISICSEYGIDISRVRGVVDNSNRILEVKREFGSFHKYIWGFVNHKPISPQY 342 Query: 876 RSCRKIPVKTSKSESISKDMVRRGFRFVGPTVIHSFMQAAGLTNDHLVSCPRHLHCSM 1049 + KIPVKTSKSESISKDMVRRGFRFVGPTV+HSFMQA+GLTNDHL +C RHL C++ Sbjct: 343 KQGYKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQASGLTNDHLTTCHRHLQCTL 400 >ref|XP_006448576.1| hypothetical protein CICLE_v10015639mg [Citrus clementina] gi|557551187|gb|ESR61816.1| hypothetical protein CICLE_v10015639mg [Citrus clementina] Length = 375 Score = 343 bits (879), Expect = 1e-91 Identities = 176/318 (55%), Positives = 227/318 (71%), Gaps = 2/318 (0%) Frame = +3 Query: 99 TPTKLSPPVSPKPKQAK-AAPQRGIESNGLSTSSDKLAVPKPTPTKLPRPVMKRSMSSVG 275 T + LSPPVSPK K + AA +RG + N L+TS++K+ PK + + +P Sbjct: 57 TKSLLSPPVSPKLKSPRPAAVKRGNDPNVLNTSAEKIMTPKKLASFVKKPKN-------- 108 Query: 276 AGQLAVSSESMSLLGFDRAPGSIAAAQREHAVLAQAKRKMRIAHYGRTP-AKLDGKVVPV 452 ++A +S ++ APGSIAAA+REH + Q +RK+RIAHYGRT AK +GKV + Sbjct: 109 
-AEVAPCYDSSLIV---EAPGSIAAARREHVAIMQEQRKLRIAHYGRTKSAKFEGKVPGL 164 Query: 453 DSSTPNDANSQEEKRCSFITSNSDSVYVAYHDEEWGVPVHDDKMLFELLVLAGVQVGLDW 632 DS D N +EEKRCSFIT NSD YVAYHDEEWGVPVHDDK+LFELLVL QVG DW Sbjct: 165 DSFANGDNNDREEKRCSFITPNSDPKYVAYHDEEWGVPVHDDKLLFELLVLTAAQVGSDW 224 Query: 633 TTILXXXXXXXXXXXXXDAELVAMFTERQMASVCAACVLDIGKVRGVIDNAKRILEVRRE 812 T++L DAE+VA FTE+++ S+ A +D+ +VRG++DN+ RILEV+++ Sbjct: 225 TSVLKKRRAFREAFSGFDAEVVAKFTEKKITSLSANYAIDLSQVRGIVDNSIRILEVKKQ 284 Query: 813 FASFANYLWGFINHKPLSPNYRSCRKIPVKTSKSESISKDMVRRGFRFVGPTVIHSFMQA 992 F SF YLWGF+NHK ++ YRS +KIP KTSKSE+ISKDMV++GFRFVGPTVIHSFMQA Sbjct: 285 FGSFDKYLWGFVNHKTINTQYRSSQKIPAKTSKSEAISKDMVKKGFRFVGPTVIHSFMQA 344 Query: 993 AGLTNDHLVSCPRHLHCS 1046 AGL+NDHL++C RHL C+ Sbjct: 345 AGLSNDHLITCTRHLQCT 362 >ref|XP_003531809.1| PREDICTED: uncharacterized protein LOC100793991 [Glycine max] Length = 400 Score = 342 bits (878), Expect = 1e-91 Identities = 181/349 (51%), Positives = 236/349 (67%), Gaps = 9/349 (2%) Frame = +3 Query: 30 SLSLPSSFAKNAARADADHNAPV-TPTKLSPPVSPKPKQAKAAP-QRGIESNGLSTSSDK 203 +L +S K + ++ + P+ + T L+P VSPK K + P +RG ES GL++SS+K Sbjct: 39 NLERRNSIKKLSPKSPCPPSPPLPSKTSLAPLVSPKSKSPRPPPIKRGNESTGLNSSSEK 98 Query: 204 LAVPK---PTPT---KLPRPVMKRSMSSVGAGQLAVSSESMSLLGFDRAPGSIAAAQREH 365 + P+ TPT K + +RS ++G +S S S +PGSIAA +RE Sbjct: 99 IVTPRNTIKTPTLERKKSKSFKERSYDALGLSASTEASLSYSSNLITESPGSIAAVRREQ 158 Query: 366 AVLAQAKRKMRIAHYGRTPAKLDGKVVPVD-SSTPNDANSQEEKRCSFITSNSDSVYVAY 542 L A+RKM+IAHYGR+ + +VVP+D SS S+EEKRCSFIT+NSD +Y+AY Sbjct: 159 MALQHAQRKMKIAHYGRSKSAKFERVVPLDPSSNLTSKTSEEEKRCSFITANSDPIYIAY 218 Query: 543 HDEEWGVPVHDDKMLFELLVLAGVQVGLDWTTILXXXXXXXXXXXXXDAELVAMFTERQM 722 HDEEWGVPVHDDKMLFELLVL+G QVG DWT+IL D +A T++QM Sbjct: 219 HDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRAAFSEFDVATLANLTDKQM 278 Query: 723 ASVCAACVLDIGKVRGVIDNAKRILEVRREFASFANYLWGFINHKPLSPNYRSCRKIPVK 902 S+ +DI +VRGV+DNA RILE+ ++F SF Y+WGF+NHKP+S Y+ KIPVK Sbjct: 279 VSISLEYGIDISQVRGVVDNANRILEINKDFGSFDKYIWGFVNHKPISTQYKFGHKIPVK 338 Query: 903 
TSKSESISKDMVRRGFRFVGPTVIHSFMQAAGLTNDHLVSCPRHLHCSM 1049 TSKSESISKDM+RRGFR VGPTV+HSFMQAAGLTNDHL++C RHL C++ Sbjct: 339 TSKSESISKDMIRRGFRCVGPTVLHSFMQAAGLTNDHLITCHRHLQCTL 387 >emb|CAN68394.1| hypothetical protein VITISV_042519 [Vitis vinifera] Length = 398 Score = 342 bits (878), Expect = 1e-91 Identities = 178/321 (55%), Positives = 219/321 (68%), Gaps = 2/321 (0%) Frame = +3 Query: 87 NAPVTPTKLSPPVSPKPKQAKA-APQRGIESNGLSTSSDKLAVPKPTPTKLPRPVMKRSM 263 N T L+PP SP K + A +RG + NGL++S +K+ P+ T P + Sbjct: 67 NTTKTKPSLTPPASPNLKSPRQPALKRGNDPNGLNSSLEKVLTPRGTTKSSSSPKKTKKC 126 Query: 264 SSVGAGQLAVSSESMSLLGFDRAPGSIAAAQREHAVLAQAKRKMRIAHYGRTP-AKLDGK 440 S+ A SS + S APGSIAAA+RE + Q +RKMRIAHYGRT AK + K Sbjct: 127 SAGLAPSSDTSSLNYSSSFIVEAPGSIAAARREQMAIMQVQRKMRIAHYGRTKSAKYEEK 186 Query: 441 VVPVDSSTPNDANSQEEKRCSFITSNSDSVYVAYHDEEWGVPVHDDKMLFELLVLAGVQV 620 + PVD P ++EEKRCSFIT NSD YV YHDEEWGVPVHDDK LFELLV+ G QV Sbjct: 187 ISPVD---PLVITTREEKRCSFITPNSDPSYVEYHDEEWGVPVHDDKRLFELLVMTGAQV 243 Query: 621 GLDWTTILXXXXXXXXXXXXXDAELVAMFTERQMASVCAACVLDIGKVRGVIDNAKRILE 800 G DWTT+L DAE+V F+E+++ S+ A +D+ +VRGV+DN+ RILE Sbjct: 244 GSDWTTVLKKRQEYRDAFSGYDAEIVGKFSEKKITSISAYYGIDLSQVRGVVDNSNRILE 303 Query: 801 VRREFASFANYLWGFINHKPLSPNYRSCRKIPVKTSKSESISKDMVRRGFRFVGPTVIHS 980 ++REF SF Y+WGF+NHKP++ +SC KIPVKTSKSESISKDMVRRGFR VGPTVI+S Sbjct: 304 IKREFGSFHKYIWGFVNHKPITTQXKSCHKIPVKTSKSESISKDMVRRGFRLVGPTVIYS 363 Query: 981 FMQAAGLTNDHLVSCPRHLHC 1043 FMQAAGLTNDHL+SCPRHL C Sbjct: 364 FMQAAGLTNDHLISCPRHLQC 384 >ref|XP_002312220.1| methyladenine glycosylase family protein [Populus trichocarpa] gi|118486806|gb|ABK95238.1| unknown [Populus trichocarpa] gi|222852040|gb|EEE89587.1| methyladenine glycosylase family protein [Populus trichocarpa] Length = 403 Score = 342 bits (877), Expect = 2e-91 Identities = 189/368 (51%), Positives = 249/368 (67%), Gaps = 22/368 (5%) Frame = +3 Query: 12 KKTLQKSLSLPSSFAK-NAARADADHNAPVTP----------TKLSPPVSPKPKQAKA-A 155 + LQ + +L S+ + N+ + A ++P P K SPP+SP K + A Sbjct: 24 
RPVLQPTCNLVSTLERRNSLKKTAPKSSPPPPPPPPTFSNKTNKASPPLSPMSKSPRLPA 83 Query: 156 PQRGIESNGLSTSSDKLAVPKPTPTKLPRPVMKRSMS----SVGAG---QLAVSSESMSL 314 +RG ++N L++SS+K+ +P+ T TK P K+S S SVG G +S S S Sbjct: 84 IKRGSDANSLNSSSEKVVIPRNT-TKTPTLERKKSKSFKESSVGRGVHSSFIEASLSYSS 142 Query: 315 LGFDRAPGSIAAAQREHAVLAQAKRKMRIAHYGRTP-AKLDGKVVPVDSSTP--NDANSQ 485 APGSIAA +RE L A+RKMRIAHYGR+ A+ + +VVP DSS + + Sbjct: 143 SLIVEAPGSIAAVRREQMALQHAQRKMRIAHYGRSKSARFEDQVVPNDSSISMATKTDQE 202 Query: 486 EEKRCSFITSNSDSVYVAYHDEEWGVPVHDDKMLFELLVLAGVQVGLDWTTILXXXXXXX 665 EEKRCSFIT+NSD +YVAYHDEEWGVPVHDDKMLFELLVL+G QVG DWT+IL Sbjct: 203 EEKRCSFITANSDPIYVAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFR 262 Query: 666 XXXXXXDAELVAMFTERQMASVCAACVLDIGKVRGVIDNAKRILEVRREFASFANYLWGF 845 DAE+VA +E+Q+ S+ A +D+ +VRGV+DN+ RILE+++EF SF Y+W F Sbjct: 263 DAFSGFDAEIVANISEKQIMSISAEYGIDMSRVRGVVDNSNRILEIKKEFGSFDRYIWTF 322 Query: 846 INHKPLSPNYRSCRKIPVKTSKSESISKDMVRRGFRFVGPTVIHSFMQAAGLTNDHLVSC 1025 +N+KP+S +Y+ KIPVKTSKSE+ISKDMVRRGFRFVGPT++HSFMQAAGLTNDHL++C Sbjct: 323 VNNKPISTSYKFGHKIPVKTSKSETISKDMVRRGFRFVGPTMVHSFMQAAGLTNDHLITC 382 Query: 1026 PRHLHCSM 1049 RHL C++ Sbjct: 383 HRHLPCTL 390 >ref|XP_002315089.2| methyladenine glycosylase family protein [Populus trichocarpa] gi|550330066|gb|EEF01260.2| methyladenine glycosylase family protein [Populus trichocarpa] Length = 411 Score = 342 bits (876), Expect = 2e-91 Identities = 191/364 (52%), Positives = 244/364 (67%), Gaps = 24/364 (6%) Frame = +3 Query: 30 SLSLPSSFAKNAARADADHNAPVTP-------TKLSPPVSPKPKQAKA-APQRGIESNGL 185 +L +S K A ++ P+ P K SPP+SPK K + A +RG ++N L Sbjct: 36 TLERHNSLKKTAPKSPPPPPPPLPPPTSANKTNKASPPLSPKSKSPRLPAIKRGSDANSL 95 Query: 186 STSSDKLAVPKPTPTKLPRPVMKRSMS----SVGAGQLAVSSE---SMSLLGFDRAPGSI 344 ++SSDK+ +P+ T K P K+S S SVG+G L+ S E S S APGSI Sbjct: 96 NSSSDKVVIPRST-AKTPILERKKSKSFKETSVGSGALSSSIEASLSYSSSLIVEAPGSI 154 Query: 345 AAAQREHAVLAQAKRKMRIAHYGRTPA-KLDGKVVPVDSS-TPNDANSQEEKRCSFITSN 518 AA +RE L A+RKMRIAHYGR+ + + + KVVPVDSS +EEKRCSFIT+N Sbjct: 155 
AAVRREQMALQHAQRKMRIAHYGRSKSSRFEAKVVPVDSSINVTTKTDEEEKRCSFITAN 214 Query: 519 S-------DSVYVAYHDEEWGVPVHDDKMLFELLVLAGVQVGLDWTTILXXXXXXXXXXX 677 S + +YVAYHD+EWGVPVHDDKMLFELLVL+G QVG DWT+IL Sbjct: 215 SGKEKYEMNPIYVAYHDKEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRDAFS 274 Query: 678 XXDAELVAMFTERQMASVCAACVLDIGKVRGVIDNAKRILEVRREFASFANYLWGFINHK 857 DAE+VA TE+QM S+ A ++I +VRGV+DN+KRILE+++EF SF Y+W F+N+K Sbjct: 275 GFDAEIVANITEKQMMSISAEYGIEISRVRGVVDNSKRILEIKKEFGSFDRYIWTFVNNK 334 Query: 858 PLSPNYRSCRKIPVKTSKSESISKDMVRRGFRFVGPTVIHSFMQAAGLTNDHLVSCPRHL 1037 P S Y+ KIPVKTSKSE+ISKDMVRRGFRFVGPT++HSFMQA GLTNDHL++C RHL Sbjct: 335 PFSNQYKFGHKIPVKTSKSETISKDMVRRGFRFVGPTMVHSFMQAVGLTNDHLITCHRHL 394 Query: 1038 HCSM 1049 C++ Sbjct: 395 PCTL 398 >gb|EOY14286.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] Length = 409 Score = 342 bits (876), Expect = 2e-91 Identities = 183/352 (51%), Positives = 232/352 (65%), Gaps = 4/352 (1%) Frame = +3 Query: 6 PLKKTLQKSLSLPSSFAKNAARADADHNAPVTPTKLSPPVSPKPKQAK-AAPQRGIESNG 182 P +L +L S+ N RA A L+PP+SPK K + AA +RG + N Sbjct: 52 PTPPSLASTLPATSATVGNGGRAKAS---------LTPPISPKSKSPRPAAIKRGSDPNA 102 Query: 183 LSTSSDKLAVPKPTPTKLPRPVMKRSMSSVGAGQLAVSSESMSLLG--FDRAPGSIAAAQ 356 L+TSS+K+ P+ L R K +G G + S+S APGSIAA + Sbjct: 103 LNTSSEKVMTPRNITKTLERKKSKSFKEGMGNGLSSWIEPSLSYSSSLIVEAPGSIAAVR 162 Query: 357 REHAVLAQAKRKMRIAHYGRTP-AKLDGKVVPVDSSTPNDANSQEEKRCSFITSNSDSVY 533 RE L QA+RKM+IAHYGR+ AK + KVVP+++S+ +EEKRCSFIT NSD VY Sbjct: 163 REQMALQQAQRKMKIAHYGRSKSAKFESKVVPLNTSSAMTKPDEEEKRCSFITPNSDPVY 222 Query: 534 VAYHDEEWGVPVHDDKMLFELLVLAGVQVGLDWTTILXXXXXXXXXXXXXDAELVAMFTE 713 VAYHDEEWGVPVHDD MLFELLVL+G QVG DW +IL DAE VA FT+ Sbjct: 223 VAYHDEEWGVPVHDDSMLFELLVLSGAQVGSDWISILKKRQDFRDAFSGFDAETVAKFTD 282 Query: 714 RQMASVCAACVLDIGKVRGVIDNAKRILEVRREFASFANYLWGFINHKPLSPNYRSCRKI 893 ++M ++ + +DI +V GV+DN+ RILEV+ +F SF Y+WGF+NHK +S Y+ KI Sbjct: 283 KEMTTISSEYGIDISRVLGVVDNSNRILEVKGQFGSFDKYIWGFVNHKAISTQYKFGHKI 342 Query: 894 PVKTSKSESISKDMVRRGFRFVGPTVIHSFMQAAGLTNDHLVSCPRHLHCSM 
1049 PVKTSKSESISKDM+RRGFR VGPTV+HSFMQAAGLTNDHL++C RHL C++ Sbjct: 343 PVKTSKSESISKDMLRRGFRCVGPTVVHSFMQAAGLTNDHLITCHRHLPCTL 394 >gb|EMJ12930.1| hypothetical protein PRUPE_ppa006139mg [Prunus persica] Length = 426 Score = 341 bits (875), Expect = 3e-91 Identities = 190/373 (50%), Positives = 245/373 (65%), Gaps = 26/373 (6%) Frame = +3 Query: 9 LKKTLQKSLSLPSSFAKNAARADADHNAPVTPTKLSPPVSPKPKQAKA-APQRGIESNGL 185 +KK P ++A + + + + L+PP+SPK K + A +RG + NGL Sbjct: 43 IKKISTPRAPPPPPLPTSSASSTSPRISNKASSLLTPPISPKSKSPRPPAIKRGNDPNGL 102 Query: 186 STSSDKLAVPKPTPTKLPRPVMKRSMS----SVGA----------GQLAVSSESMSL--- 314 ++SS+K+ P T T+ K+S S SVG G + S SL Sbjct: 103 NSSSEKVVTPGGT-TRAKILERKKSKSFKRASVGVDGASADLHHHGDFSAGGFSSSLNIE 161 Query: 315 --LGFD-----RAPGSIAAAQREHAVLAQAKRKMRIAHYGRTPAKLDGKVVPVDSSTPND 473 L + APGSIAA +RE L A+RKMRIAHYGR+ + +VVPVD+S + Sbjct: 162 ASLSYSSSLITEAPGSIAAVRREQMALQHAQRKMRIAHYGRSKSANFERVVPVDASGNIE 221 Query: 474 AN-SQEEKRCSFITSNSDSVYVAYHDEEWGVPVHDDKMLFELLVLAGVQVGLDWTTILXX 650 A ++EEKRCSFIT+NSD +YVAYHDEEWGVPVHDDKMLFELLVL+G QVG DWT+IL Sbjct: 222 AKGAEEEKRCSFITANSDPIYVAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKK 281 Query: 651 XXXXXXXXXXXDAELVAMFTERQMASVCAACVLDIGKVRGVIDNAKRILEVRREFASFAN 830 DAE+VA FT++QM S+ + +DI +VRGV+DN+ RILE+++EF SF Sbjct: 282 RQDFRNAFSDFDAEIVANFTDKQMVSIGSEYGIDISRVRGVVDNSNRILEIKKEFGSFDK 341 Query: 831 YLWGFINHKPLSPNYRSCRKIPVKTSKSESISKDMVRRGFRFVGPTVIHSFMQAAGLTND 1010 Y+WGF+N KP+SP Y+ KIPVKTSKSESISKDMVRRGFRFVGPTV+HSFMQA+GLTND Sbjct: 342 YIWGFVNQKPISPQYKLGYKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQASGLTND 401 Query: 1011 HLVSCPRHLHCSM 1049 HL++C RHL C++ Sbjct: 402 HLITCHRHLQCTL 414