BLASTX nr result
ID: Mentha26_contig00044200
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00044200
         (718 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EYU24594.1| hypothetical protein MIMGU_mgv1a007518mg [Mimulus...   309   5e-82
gb|EYU21733.1| hypothetical protein MIMGU_mgv1a024334mg [Mimulus...   231   1e-58
ref|XP_006341344.1| PREDICTED: uncharacterized protein LOC102594...   192   1e-46
ref|XP_004235942.1| PREDICTED: uncharacterized protein LOC101247...   189   8e-46
gb|EPS59186.1| hypothetical protein M569_15624 [Genlisea aurea]       180   5e-43
ref|XP_006473998.1| PREDICTED: uncharacterized protein LOC102607...   164   3e-38
ref|XP_007204814.1| hypothetical protein PRUPE_ppa026720mg [Prun...   163   6e-38
ref|XP_006453620.1| hypothetical protein CICLE_v10008612mg [Citr...   162   8e-38
ref|XP_006453617.1| hypothetical protein CICLE_v10008612mg [Citr...   162   8e-38
ref|XP_004158792.1| PREDICTED: probable GMP synthase [glutamine-...   162   1e-37
ref|XP_004136097.1| PREDICTED: probable GMP synthase [glutamine-...   162   1e-37
ref|XP_007011936.1| DNA glycosylase superfamily protein isoform ...   161   2e-37
ref|XP_006857230.1| hypothetical protein AMTR_s00065p00208780 [A...   158   2e-36
ref|XP_007135822.1| hypothetical protein PHAVU_010G161200g [Phas...   157   4e-36
ref|XP_004287130.1| PREDICTED: uncharacterized protein LOC101313...   155   1e-35
ref|XP_002324538.1| methyladenine glycosylase family protein [Po...   150   3e-34
ref|XP_003530263.1| PREDICTED: uncharacterized protein LOC100805...   148   2e-33
ref|XP_006350100.1| PREDICTED: uncharacterized protein LOC102595...   148   2e-33
ref|XP_006350099.1| PREDICTED: uncharacterized protein LOC102595...
148 2e-33 ref|XP_002309346.1| methyladenine glycosylase family protein [Po... 147 3e-33 >gb|EYU24594.1| hypothetical protein MIMGU_mgv1a007518mg [Mimulus guttatus] Length = 404 Score = 309 bits (792), Expect = 5e-82 Identities = 161/226 (71%), Positives = 176/226 (77%), Gaps = 4/226 (1%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGNKARSVELRKPVVKPKSENTQKPPPTGEAKGNK-- 222 MSGPP+VKSMNFAEPEAR VLGPAGNKARSVELRKP++K KSE TQKP EAKGN Sbjct: 1 MSGPPLVKSMNFAEPEARPVLGPAGNKARSVELRKPILKQKSEKTQKPLDADEAKGNTAP 60 Query: 223 SPAALKSPGMNAEKIPSPVGSRKSGGGAASILRQRQPNLSLNXXXXXXXXXXXXXXXXXT 402 SPAA SP M EKIPSPVG +K+ AASILRQRQPNLS+N T Sbjct: 61 SPAAFLSPEMKTEKIPSPVGFKKNASSAASILRQRQPNLSMNASCSSDASTDSSHSRAST 120 Query: 403 GRLSRRNSG--PTLRRKPQCSSKGEKFEMVEGYVRSIGSESDDASLDGSLVKKRCAWVTS 576 GRL RR++ P LRRK QCS KGE+ EM+EG +++GSESD LDGSLVKKRCAWVTS Sbjct: 121 GRLLRRSATFTPPLRRKHQCSPKGERIEMIEGNGKNVGSESDGVVLDGSLVKKRCAWVTS 180 Query: 577 NTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 NTDPLYAAFHDEEWGLPVHDDKKLFELLS STALAE++WPVILSKR Sbjct: 181 NTDPLYAAFHDEEWGLPVHDDKKLFELLSLSTALAELSWPVILSKR 226 >gb|EYU21733.1| hypothetical protein MIMGU_mgv1a024334mg [Mimulus guttatus] Length = 390 Score = 231 bits (590), Expect = 1e-58 Identities = 131/225 (58%), Positives = 152/225 (67%), Gaps = 3/225 (1%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGNKARSVELRKPVVKPKSENTQKPPPTGEAKGNKSP 228 MSGPP VK M AE EAR VLGP GNKARSVELRKP++K KSE Q+ ++KG KSP Sbjct: 1 MSGPPRVKLMTSAELEARPVLGPTGNKARSVELRKPMLKSKSEKAQRAQDVDDSKGKKSP 60 Query: 229 AALKSPGMNAEKIPSPVGSRKSGGGAASILRQRQPNLSLNXXXXXXXXXXXXXXXXXTGR 408 AL+ P EKIPSPVG K+G AAS QR ++SLN TGR Sbjct: 61 TALQLPETKPEKIPSPVGFMKNGRSAASFFMQR--SMSLNVSCSSDASSDSSHSRASTGR 118 Query: 409 LSRRNSGPT--LRRKPQCSSKGEKFEMVEGYVRSIGSESDDASLDGS-LVKKRCAWVTSN 579 +S R+ PT L+R Q S K E+ E + +G E + +DG+ +VKKRCAWVT+N Sbjct: 119 ISWRSGTPTPPLKRNQQSSFKRERIEKI------VGGEGE--VVDGAAVVKKRCAWVTAN 170 Query: 580 TDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 TDPLYAAFHDEEWGL VHDDKKLFELLSFSTALAE+TWPVILSKR Sbjct: 171 
TDPLYAAFHDEEWGLAVHDDKKLFELLSFSTALAELTWPVILSKR 215 >ref|XP_006341344.1| PREDICTED: uncharacterized protein LOC102594169 [Solanum tuberosum] Length = 399 Score = 192 bits (487), Expect = 1e-46 Identities = 121/236 (51%), Positives = 139/236 (58%), Gaps = 14/236 (5%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGNKARSVELRKPVVKPKSENTQKPPPTGEAKGNKSP 228 MSG P VK MN A+ E R+VLGPAGNKARSVELRKPV KP +K + E+KG K Sbjct: 1 MSGGPRVKLMNNADSEVRSVLGPAGNKARSVELRKPVEKP----IKKAAESEESKGKKFE 56 Query: 229 AALKSPGMNAEKIPSPVG-SRKSGGGAASILRQRQ--------PNLSLNXXXXXXXXXXX 381 P A PV S+K GG SILRQ+Q PNLSLN Sbjct: 57 GTDSVPQSRA-----PVAASKKCGGAVPSILRQQQDHRSLMMRPNLSLNASCSSDASTDS 111 Query: 382 XXXXXXT-GRLSRRNSGPTLRRKPQCSS----KGEKFEMVEGYVRSIGSESDDASLDGSL 546 T G+LSR + PT R+ QCSS K EK G +S+ S D S+ Sbjct: 112 SHSRASTTGKLSRGSVTPTAGRRKQCSSPKVVKSEKIGKTVGEGQSLASSPTPG--DASV 169 Query: 547 VKKRCAWVTSNTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 +KKRCAWVT NTDP YAAFHDEEWG+ +HDDKKLFELLS TALAE++WP ILSKR Sbjct: 170 MKKRCAWVTPNTDPSYAAFHDEEWGVSIHDDKKLFELLSLCTALAELSWPAILSKR 225 >ref|XP_004235942.1| PREDICTED: uncharacterized protein LOC101247118 [Solanum lycopersicum] Length = 395 Score = 189 bits (480), Expect = 8e-46 Identities = 119/235 (50%), Positives = 135/235 (57%), Gaps = 13/235 (5%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGNKARSVELRKPVVKPKSENTQKPPPTGEAKGNKSP 228 MSG P VK MN A+ E R+VLGPAGNKARSVELRKPV KP +K + E+KG K Sbjct: 1 MSGGPRVKLMNNADSEVRSVLGPAGNKARSVELRKPVEKP----VKKAAESEESKGKKFE 56 Query: 229 AALKSPGMNAEKIPSPVGSRKSGGGAASILRQRQ--------PNLSLNXXXXXXXXXXXX 384 P A RK GG SILRQ+Q PNLSLN Sbjct: 57 GTDSVPQSRA---------RKCGGAVPSILRQQQDHRSLMMRPNLSLNASCSSDASTDSS 107 Query: 385 XXXXXT-GRLSRRNSGPTLRRKPQCSS----KGEKFEMVEGYVRSIGSESDDASLDGSLV 549 T G++SR + PT R+ QCSS K EK G S+ S D S++ Sbjct: 108 HSRASTTGKMSRGSVTPTAGRRKQCSSPKVVKSEKIGKTVGEGESLASSPTPD--DASVM 165 Query: 550 KKRCAWVTSNTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 KKRCAWVT NTDP YAAFHDEEWG+ VHDDKKLFELLS TALAE++WP ILSKR Sbjct: 166 
KKRCAWVTPNTDPSYAAFHDEEWGVSVHDDKKLFELLSLCTALAELSWPAILSKR 220 >gb|EPS59186.1| hypothetical protein M569_15624 [Genlisea aurea] Length = 351 Score = 180 bits (456), Expect = 5e-43 Identities = 100/215 (46%), Positives = 134/215 (62%), Gaps = 2/215 (0%) Frame = +1 Query: 76 MNFAEPEARAVLGPAGNKARSVELRKPVVKPKSENTQKPPPTGEAKGNKSPAALKSPGMN 255 MN EPE R VL PAGNK+RSV+ RKPV K K +++ +AKG K P+ K P + Sbjct: 1 MNLTEPEERPVLVPAGNKSRSVDFRKPVKKEKEKDSSAGD---DAKGKKFPSPAKLPEIA 57 Query: 256 AEKIPSPVGSRKSGGGAASILRQRQPNLSLNXXXXXXXXXXXXXXXXXTGRLSRRNSGPT 435 AE++PS ++ A SIL+ RQ N+S + TGR+ R+NS P Sbjct: 58 AERVPSGEAFGRNRKNACSILKCRQNNMSASCSSDASTDSSHSKAS--TGRIIRQNSAPA 115 Query: 436 --LRRKPQCSSKGEKFEMVEGYVRSIGSESDDASLDGSLVKKRCAWVTSNTDPLYAAFHD 609 L R+ Q SS + E + + +++ + G +KKRCAW+TSNTDPLYAAFHD Sbjct: 116 RYLERRRQRSSTDD-----EKLFKILAPDAELSGGGGHSIKKRCAWITSNTDPLYAAFHD 170 Query: 610 EEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 +EWG+P+HDDKKLFEL S+STALAE+TWP IL++R Sbjct: 171 QEWGIPIHDDKKLFELFSYSTALAELTWPAILARR 205 >ref|XP_006473998.1| PREDICTED: uncharacterized protein LOC102607933 [Citrus sinensis] Length = 385 Score = 164 bits (415), Expect = 3e-38 Identities = 108/229 (47%), Positives = 124/229 (54%), Gaps = 7/229 (3%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGNKARSVELRKPVVKPKSENTQKPPPTGEAKGNKSP 228 MSG V+SMN AE E R VLGPAGNK S+ KP KP + + P A+ K+ Sbjct: 1 MSGATRVRSMNVAESETRPVLGPAGNKTGSLSAWKPASKPSRKVEKSPVEVNAAEEKKT- 59 Query: 229 AALKSPGMNAEKIPSPVGSRKSGG-GAASILRQR----QPNLSLNXXXXXXXXXXXXXXX 393 SP A P+ S KS SILR+ Q NLSLN Sbjct: 60 ---LSPSSKAATPPASKLSPKSHSLSVPSILRRHEQLLQSNLSLNASCSSDASTDSFHSR 116 Query: 394 XXTGRLSRRNSGPTLRRKPQCSSKGEKFEMVEGYVRSIGSES--DDASLDGSLVKKRCAW 567 TGRL+R NS +RRKP S RS+ S+ D DGS KKRCAW Sbjct: 117 ASTGRLTRSNS-VGIRRKPFPSKP-----------RSVVSDGGLDSPPPDGSQTKKRCAW 164 Query: 568 VTSNTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 VT NTDP YAAFHDEEWG+PVHDDKKLFELL S AL+E+TWP I+SKR Sbjct: 165 VTPNTDPCYAAFHDEEWGVPVHDDKKLFELLVLSGALSELTWPAIMSKR 213 >ref|XP_007204814.1| hypothetical protein 
PRUPE_ppa026720mg [Prunus persica] gi|462400345|gb|EMJ06013.1| hypothetical protein PRUPE_ppa026720mg [Prunus persica] Length = 378 Score = 163 bits (412), Expect = 6e-38 Identities = 105/230 (45%), Positives = 128/230 (55%), Gaps = 8/230 (3%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGNKARSVELRKPVVKP--KSEN-TQKPPPTGEAKGN 219 MSG P V+S+N A+ E+R VLGPAGNKA + RKPV KP K+E +K E K Sbjct: 1 MSGAPRVRSINVADSESRPVLGPAGNKAGTFSARKPVSKPLRKAEKLAEKVASAEEKKTR 60 Query: 220 KSPAALKSPGMNAEKIPSPVGSRKSGGGAASILRQRQP----NLSLNXXXXXXXXXXXXX 387 +S SP +++ +PS +LR+ + N SLN Sbjct: 61 QSSMLTTSPQLHSPSVPS-------------VLRRHEQLLHSNFSLNASCSSDASTDSFH 107 Query: 388 XXXXTGRLSRRNSGPTLRRKPQCSSKGEKFEMVEGYVRSIGSESD-DASLDGSLVKKRCA 564 TGRL+R NS + RRK S RS+ S+ D+ DGS KKRCA Sbjct: 108 SRASTGRLTRSNSAGS-RRKQYVSKP-----------RSVVSDGGLDSPPDGSQSKKRCA 155 Query: 565 WVTSNTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 WVT NTDP YAAFHDEEWGLPVHDDKKLFELL S ALAE++WP ILSK+ Sbjct: 156 WVTPNTDPCYAAFHDEEWGLPVHDDKKLFELLVLSGALAELSWPAILSKK 205 >ref|XP_006453620.1| hypothetical protein CICLE_v10008612mg [Citrus clementina] gi|567923232|ref|XP_006453622.1| hypothetical protein CICLE_v10008612mg [Citrus clementina] gi|557556846|gb|ESR66860.1| hypothetical protein CICLE_v10008612mg [Citrus clementina] gi|557556848|gb|ESR66862.1| hypothetical protein CICLE_v10008612mg [Citrus clementina] Length = 385 Score = 162 bits (411), Expect = 8e-38 Identities = 108/229 (47%), Positives = 123/229 (53%), Gaps = 7/229 (3%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGNKARSVELRKPVVKPKSENTQKPPPTGEAKGNKSP 228 MSG V+SMN AE E R VLGPAGNK S+ KP KP + + P A+ K+ Sbjct: 1 MSGATRVRSMNVAESETRPVLGPAGNKTGSLSAWKPASKPSRKIEKSPVEVNAAEEKKT- 59 Query: 229 AALKSPGMNAEKIPSPVGSRKSGG-GAASILRQR----QPNLSLNXXXXXXXXXXXXXXX 393 SP A P+ S KS SILR+ Q NLSLN Sbjct: 60 ---LSPSSKAATPPASKLSPKSHSLSVPSILRRHEQLLQSNLSLNASCSSDASTDSFHSR 116 Query: 394 XXTGRLSRRNSGPTLRRKPQCSSKGEKFEMVEGYVRSIGSES--DDASLDGSLVKKRCAW 567 GRL+R NS +RRKP S RS+ S+ D DGS KKRCAW Sbjct: 117 
ASIGRLTRSNS-VGIRRKPFPSKP-----------RSVVSDGGLDSPPPDGSQTKKRCAW 164 Query: 568 VTSNTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 VT NTDP YAAFHDEEWG+PVHDDKKLFELL S AL+E+TWP ILSKR Sbjct: 165 VTPNTDPCYAAFHDEEWGVPVHDDKKLFELLVLSGALSELTWPAILSKR 213 >ref|XP_006453617.1| hypothetical protein CICLE_v10008612mg [Citrus clementina] gi|567923230|ref|XP_006453621.1| hypothetical protein CICLE_v10008612mg [Citrus clementina] gi|557556843|gb|ESR66857.1| hypothetical protein CICLE_v10008612mg [Citrus clementina] gi|557556847|gb|ESR66861.1| hypothetical protein CICLE_v10008612mg [Citrus clementina] Length = 271 Score = 162 bits (411), Expect = 8e-38 Identities = 108/229 (47%), Positives = 123/229 (53%), Gaps = 7/229 (3%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGNKARSVELRKPVVKPKSENTQKPPPTGEAKGNKSP 228 MSG V+SMN AE E R VLGPAGNK S+ KP KP + + P A+ K+ Sbjct: 1 MSGATRVRSMNVAESETRPVLGPAGNKTGSLSAWKPASKPSRKIEKSPVEVNAAEEKKT- 59 Query: 229 AALKSPGMNAEKIPSPVGSRKSGG-GAASILRQR----QPNLSLNXXXXXXXXXXXXXXX 393 SP A P+ S KS SILR+ Q NLSLN Sbjct: 60 ---LSPSSKAATPPASKLSPKSHSLSVPSILRRHEQLLQSNLSLNASCSSDASTDSFHSR 116 Query: 394 XXTGRLSRRNSGPTLRRKPQCSSKGEKFEMVEGYVRSIGSES--DDASLDGSLVKKRCAW 567 GRL+R NS +RRKP S RS+ S+ D DGS KKRCAW Sbjct: 117 ASIGRLTRSNS-VGIRRKPFPSKP-----------RSVVSDGGLDSPPPDGSQTKKRCAW 164 Query: 568 VTSNTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 VT NTDP YAAFHDEEWG+PVHDDKKLFELL S AL+E+TWP ILSKR Sbjct: 165 VTPNTDPCYAAFHDEEWGVPVHDDKKLFELLVLSGALSELTWPAILSKR 213 >ref|XP_004158792.1| PREDICTED: probable GMP synthase [glutamine-hydrolyzing]-like [Cucumis sativus] Length = 371 Score = 162 bits (409), Expect = 1e-37 Identities = 109/231 (47%), Positives = 133/231 (57%), Gaps = 9/231 (3%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGNKARSVELRKPVVKPKSENTQKPPPTGEAKGNKSP 228 MSGPP ++SMN A+ ++R VLGP GNKAR+VE RKP VKP + +KP E+K + P Sbjct: 1 MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKK-LEKPRQEVESKDKRVP 59 Query: 229 AALKSPGMNAEKIPSPVGSRKSGGGAASILRQRQP-----NLSLNXXXXXXXXXXXXXXX 393 L P +PS +LRQ+ 
NLS+N Sbjct: 60 --LSPP--QCVTVPS-------------VLRQQDRHQAILNLSMNASCSSDASSDSFNSR 102 Query: 394 XXTGRLSRRNSGPTLRRKPQCSS-KGEKFEMVEGYVRSIGSESDDASLD--GSLV-KKRC 561 + R +R+ GP LRRK QCS+ KG + V +G ES +D G L KKRC Sbjct: 103 ASSARGTRQR-GPNLRRK-QCSTVKG-----ADKAVEKVGVESVAVVVDTVGCLESKKRC 155 Query: 562 AWVTSNTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 AWVT NTDP YAAFHDEEWG+PVHDDKKLFELL S ALAE+TWP IL+KR Sbjct: 156 AWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKR 206 >ref|XP_004136097.1| PREDICTED: probable GMP synthase [glutamine-hydrolyzing]-like [Cucumis sativus] Length = 380 Score = 162 bits (409), Expect = 1e-37 Identities = 109/231 (47%), Positives = 133/231 (57%), Gaps = 9/231 (3%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGNKARSVELRKPVVKPKSENTQKPPPTGEAKGNKSP 228 MSGPP ++SMN A+ ++R VLGP GNKAR+VE RKP VKP + +KP E+K + P Sbjct: 1 MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKK-LEKPRQEVESKDKRVP 59 Query: 229 AALKSPGMNAEKIPSPVGSRKSGGGAASILRQRQP-----NLSLNXXXXXXXXXXXXXXX 393 L P +PS +LRQ+ NLS+N Sbjct: 60 --LSPP--QCVTVPS-------------VLRQQDRHQAILNLSMNASCSSDASSDSFNSR 102 Query: 394 XXTGRLSRRNSGPTLRRKPQCSS-KGEKFEMVEGYVRSIGSESDDASLD--GSLV-KKRC 561 + R +R+ GP LRRK QCS+ KG + V +G ES +D G L KKRC Sbjct: 103 ASSARGTRQR-GPNLRRK-QCSTVKG-----ADKAVEKVGVESVAVVVDTVGCLESKKRC 155 Query: 562 AWVTSNTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 AWVT NTDP YAAFHDEEWG+PVHDDKKLFELL S ALAE+TWP IL+KR Sbjct: 156 AWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKR 206 >ref|XP_007011936.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|590572766|ref|XP_007011937.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|590572769|ref|XP_007011938.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|590572773|ref|XP_007011939.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782299|gb|EOY29555.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782300|gb|EOY29556.1| DNA glycosylase superfamily protein isoform 1 
[Theobroma cacao] gi|508782301|gb|EOY29557.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782302|gb|EOY29558.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] Length = 379 Score = 161 bits (407), Expect = 2e-37 Identities = 104/227 (45%), Positives = 125/227 (55%), Gaps = 5/227 (2%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGNKARSVELRKPVVKPKSENTQKPPPTGEAKGNKSP 228 MSG P ++SMN A+ EAR VLGPAGNKA S+ RKP KP + + P A+ K Sbjct: 1 MSGAPRMRSMNVADSEARPVLGPAGNKAGSLSARKPASKPLRKVEKSPVEVTVAEEKK-- 58 Query: 229 AALKSPGMNAEKIPSPVGSRKSGGGAASILRQRQP----NLSLNXXXXXXXXXXXXXXXX 396 AL S +N+ + + S+LR+ + NLSLN Sbjct: 59 -ALPSSTVNS------LSPKTHSVSVPSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRA 111 Query: 397 XTGRLSRRNSGPTLRRKPQCSSKGEKFEMVEGYVRSIGSESD-DASLDGSLVKKRCAWVT 573 TGRL R NS RRKP S RS+ S+ D+ DGS KKRCAWVT Sbjct: 112 STGRLIRSNSVGN-RRKPYASKP-----------RSVVSDGGLDSPPDGSHQKKRCAWVT 159 Query: 574 SNTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 NTDP Y AFHDEEWG+PVHDD+KLFELL S AL+E+TWP ILSKR Sbjct: 160 PNTDPSYVAFHDEEWGVPVHDDRKLFELLVLSGALSELTWPAILSKR 206 >ref|XP_006857230.1| hypothetical protein AMTR_s00065p00208780 [Amborella trichopoda] gi|548861313|gb|ERN18697.1| hypothetical protein AMTR_s00065p00208780 [Amborella trichopoda] Length = 397 Score = 158 bits (400), Expect = 2e-36 Identities = 103/228 (45%), Positives = 121/228 (53%), Gaps = 6/228 (2%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGNKARSVELRKPVVKP--KSENTQKPPPTGEAKGNK 222 MSGPP ++SMN A+ E R VLGPAGNKARS+ RKP KP K E + PP + Sbjct: 1 MSGPPKIRSMNVADAEVRPVLGPAGNKARSIATRKPASKPLRKQEKPEITPPPSNKASVE 60 Query: 223 SPAALKSPGMNAEKIPSPVGSRKSGGGAASILRQRQ----PNLSLNXXXXXXXXXXXXXX 390 P K+P P P R A+ ILR+++ NLSLN Sbjct: 61 EP---KTPPAVVSSQPMPPSPR-----ASLILRRQELLLHSNLSLNASCSSDASSDSVYS 112 Query: 391 XXXTGRLSRRNSGPTLRRKPQCSSKGEKFEMVEGYVRSIGSESDDASLDGSLVKKRCAWV 570 TG++ R S P +RK Q K K V E K+RC WV Sbjct: 113 RASTGKIFR--SSPGSKRK-QTGPKPVKVAPATAVVLPTPLEG----------KRRCHWV 159 Query: 571 
TSNTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 T+NT+P YAAFHDEEWGLPVHDDKKLFELL S ALAE+TWP ILSKR Sbjct: 160 TANTEPCYAAFHDEEWGLPVHDDKKLFELLVLSGALAELTWPSILSKR 207 >ref|XP_007135822.1| hypothetical protein PHAVU_010G161200g [Phaseolus vulgaris] gi|561008867|gb|ESW07816.1| hypothetical protein PHAVU_010G161200g [Phaseolus vulgaris] Length = 380 Score = 157 bits (396), Expect = 4e-36 Identities = 101/226 (44%), Positives = 127/226 (56%), Gaps = 4/226 (1%) Frame = +1 Query: 49 MSGPPMVKSMNFA--EPEARAVLGPAGNKARS-VELRKPVVKPKSENTQKPPPTGEAKGN 219 MSGPP V+SMN A +P+AR VL PAGNK R+ V++RKPV K E +KP A N Sbjct: 1 MSGPPRVRSMNVAVADPDARPVLVPAGNKVRAAVDVRKPVKKSTPEAEKKPV----AHSN 56 Query: 220 KSPAALKSPGMNAEKIPSPVGSRKSGGGAASILRQRQPNLSLNXXXXXXXXXXXXXXXXX 399 P + +P P R+ A + N S + Sbjct: 57 APPQCIS--------VPPPFILRRQERHQAVLKNLSSMNASYSSDASSTDSSTHSSGASS 108 Query: 400 TGRLSRRNSGPTLRRKPQCSSKGEKFEMVEGYVRSIGSESDDASLDGSLV-KKRCAWVTS 576 +G+++RR S RK QC K +K + ++G SDDA L SL KKRCAWVT Sbjct: 109 SGKVARRVS--VQLRKKQCGPKTDKVS-----IDNVGG-SDDADLSDSLEGKKRCAWVTP 160 Query: 577 NTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 NT+P YAAFHD EWG+PVHDD+KLFE+LSFS ALAE+TWP IL+KR Sbjct: 161 NTEPCYAAFHDNEWGVPVHDDRKLFEVLSFSGALAELTWPTILNKR 206 >ref|XP_004287130.1| PREDICTED: uncharacterized protein LOC101313540 [Fragaria vesca subsp. 
vesca] Length = 429 Score = 155 bits (393), Expect = 1e-35 Identities = 100/229 (43%), Positives = 122/229 (53%), Gaps = 7/229 (3%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGNKARSVELRKPVVKP--KSENTQKPPPTGEAKGNK 222 MSG P VKS+N A E+R+VLGPAGNK + RKP KP K+E + + E K + Sbjct: 1 MSGAPRVKSINVANSESRSVLGPAGNKGGAFSARKPATKPLRKTEKMVEEFTSAEDKKTQ 60 Query: 223 SPAALK-SPGMNAEKIPSPVGSRKSGGGAASILRQRQPNLSLNXXXXXXXXXXXXXXXXX 399 + L SP +++ +PS + + + Q N SLN Sbjct: 61 QSSKLSTSPQLHSLSVPSVLRRHE---------QLLQSNFSLNASCSSDASTDSFHSRAS 111 Query: 400 TGRLSRRNSGPTLRRKPQCSSKGEKFEMVEGYVRSIGSESDDASLD----GSLVKKRCAW 567 TGRL R NS + R++ YV S D LD GS KKRCAW Sbjct: 112 TGRLIRSNSVGSRRKQ---------------YVSKPRSVVSDGGLDSPPGGSQSKKRCAW 156 Query: 568 VTSNTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 VT NTDP Y AFHDEEWGLPVHDDKKLFELL S ALAE++WP+ILSKR Sbjct: 157 VTPNTDPCYVAFHDEEWGLPVHDDKKLFELLVLSGALAELSWPLILSKR 205 >ref|XP_002324538.1| methyladenine glycosylase family protein [Populus trichocarpa] gi|222865972|gb|EEF03103.1| methyladenine glycosylase family protein [Populus trichocarpa] Length = 380 Score = 150 bits (380), Expect = 3e-34 Identities = 99/227 (43%), Positives = 122/227 (53%), Gaps = 5/227 (2%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGN-KARSVELRKPVVKPKSENTQKPPPTGEAKGNKS 225 MSG P V+SMN A+ EAR+VLGP GN KA + RKPV K +S +K P E K + Sbjct: 1 MSGAPRVRSMNVADSEARSVLGPTGNNKAGPLSARKPVSK-QSRKVEKSPE--EVKLGEE 57 Query: 226 PAALKSPGMNAEKIPSPVGSRKSGGGAASILRQRQ----PNLSLNXXXXXXXXXXXXXXX 393 L P + + + +S+LR+ + NLSLN Sbjct: 58 KKTLTVPAVGT------LSPKSHSLNISSVLRRHELLLHSNLSLNASCSSDASTDSFHSR 111 Query: 394 XXTGRLSRRNSGPTLRRKPQCSSKGEKFEMVEGYVRSIGSESDDASLDGSLVKKRCAWVT 573 TGRL+R NS T R++ + +V G ES S D S KK CAWVT Sbjct: 112 ASTGRLTRSNSAGTRRKQYVLRPRS--------FVSEGGLESPP-SPDDSQSKKSCAWVT 162 Query: 574 SNTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 NTDP YA FHDEEWG+P+HDD+KLFELL S ALAE+TWP ILSKR Sbjct: 163 PNTDPCYATFHDEEWGVPIHDDRKLFELLVLSGALAELTWPAILSKR 209 >ref|XP_003530263.1| PREDICTED: uncharacterized protein 
LOC100805836 [Glycine max] Length = 377 Score = 148 bits (374), Expect = 2e-33 Identities = 103/228 (45%), Positives = 123/228 (53%), Gaps = 6/228 (2%) Frame = +1 Query: 49 MSGPPMVKSMNFA--EPEARAVLGPAGNKARSV-ELRKPVVKPKSENTQKPPPTGEAKGN 219 MSGPP V+SMN A + +AR VL PAGNK R V E RKPV K +E +KP Sbjct: 1 MSGPPRVRSMNVAVADADARPVLVPAGNKVRPVVEGRKPVKKSSTETEKKPVA------- 53 Query: 220 KSPAALKSPGMNAEKIPSPVGSRKSGGGAASILRQRQPNLSLNXXXXXXXXXXXXXXXXX 399 SP + +P+ SR+ A + N S + Sbjct: 54 HSPQCVS--------VPAVAISRQQEHHQAVLKSMSSMNASFSSDTSSTDSSTHSSGASS 105 Query: 400 TGRLSRRNSGPTLRRKPQCSSKGEKF--EMVEGYVRSIGSESDDASLDGSLV-KKRCAWV 570 +G+++RR S RK Q K EK + V G SDDA L SL KKRCAWV Sbjct: 106 SGKVTRRVS--VALRKKQVGPKTEKASCDNVAG--------SDDADLSDSLEGKKRCAWV 155 Query: 571 TSNTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 T NT+P Y AFHDEEWG+PVHDD+KLFELLSFS ALAE+TWP ILSKR Sbjct: 156 TPNTEPCYIAFHDEEWGVPVHDDRKLFELLSFSGALAELTWPTILSKR 203 >ref|XP_006350100.1| PREDICTED: uncharacterized protein LOC102595001 isoform X2 [Solanum tuberosum] Length = 343 Score = 148 bits (373), Expect = 2e-33 Identities = 100/228 (43%), Positives = 125/228 (54%), Gaps = 6/228 (2%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGNKA-RSVELRKPVVKP--KSENTQKPPPTGEAKGN 219 MSG V+SMN A+ EAR VLG AGNKA RS RK V KP K +++ G+ G+ Sbjct: 1 MSGASRVRSMNVADSEARPVLGLAGNKAQRSPGSRKSVSKPTRKIVKSKEELEMGDKNGH 60 Query: 220 KSPAALKSPGMNAEKIPSPVGSRKSGGGAASILRQRQP---NLSLNXXXXXXXXXXXXXX 390 + SP + + +PS ILR+++ N SL+ Sbjct: 61 QP-----SPSLLSFDVPS-------------ILRRQESLYSNFSLSASCSSDASTDSFHS 102 Query: 391 XXXTGRLSRRNSGPTLRRKPQCSSKGEKFEMVEGYVRSIGSESDDASLDGSLVKKRCAWV 570 TGR+ R NS T R+ Q +SK + R + + D+S+DGS KKRCAWV Sbjct: 103 SASTGRIYRMNS--TSSRRKQLASKSK---------RIVSDDISDSSIDGSQSKKRCAWV 151 Query: 571 TSNTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 T NTDP YA FHDEEWG+PVHDDKKLFELL ALAE+TWP IL KR Sbjct: 152 TPNTDPSYANFHDEEWGVPVHDDKKLFELLVLCGALAELTWPSILCKR 199 >ref|XP_006350099.1| PREDICTED: uncharacterized protein LOC102595001 isoform X1 [Solanum tuberosum] Length = 
372 Score = 148 bits (373), Expect = 2e-33 Identities = 100/228 (43%), Positives = 125/228 (54%), Gaps = 6/228 (2%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGNKA-RSVELRKPVVKP--KSENTQKPPPTGEAKGN 219 MSG V+SMN A+ EAR VLG AGNKA RS RK V KP K +++ G+ G+ Sbjct: 1 MSGASRVRSMNVADSEARPVLGLAGNKAQRSPGSRKSVSKPTRKIVKSKEELEMGDKNGH 60 Query: 220 KSPAALKSPGMNAEKIPSPVGSRKSGGGAASILRQRQP---NLSLNXXXXXXXXXXXXXX 390 + SP + + +PS ILR+++ N SL+ Sbjct: 61 QP-----SPSLLSFDVPS-------------ILRRQESLYSNFSLSASCSSDASTDSFHS 102 Query: 391 XXXTGRLSRRNSGPTLRRKPQCSSKGEKFEMVEGYVRSIGSESDDASLDGSLVKKRCAWV 570 TGR+ R NS T R+ Q +SK + R + + D+S+DGS KKRCAWV Sbjct: 103 SASTGRIYRMNS--TSSRRKQLASKSK---------RIVSDDISDSSIDGSQSKKRCAWV 151 Query: 571 TSNTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 T NTDP YA FHDEEWG+PVHDDKKLFELL ALAE+TWP IL KR Sbjct: 152 TPNTDPSYANFHDEEWGVPVHDDKKLFELLVLCGALAELTWPSILCKR 199 >ref|XP_002309346.1| methyladenine glycosylase family protein [Populus trichocarpa] gi|222855322|gb|EEE92869.1| methyladenine glycosylase family protein [Populus trichocarpa] Length = 381 Score = 147 bits (372), Expect = 3e-33 Identities = 100/228 (43%), Positives = 117/228 (51%), Gaps = 6/228 (2%) Frame = +1 Query: 49 MSGPPMVKSMNFAEPEARAVLGPAGNKARS--VELRKPVVKPKSENTQKPPPTGEAKGNK 222 MSG P V+SMN A+ EAR VLGP GN RKP K ++ + P EAK + Sbjct: 1 MSGAPRVRSMNVADSEARPVLGPTGNTKAGPLTSARKPASKQLRKDGKSPE---EAKLGE 57 Query: 223 SPAALKSPGMNAEKIPSPVGSRKSGGGAASILRQRQP----NLSLNXXXXXXXXXXXXXX 390 L P + + + G +S+LR+ + NLSLN Sbjct: 58 EKKVLTVPTVGN------LSPKSLSGNFSSVLRRHEQLLHSNLSLNASCSSDASTDSFHS 111 Query: 391 XXXTGRLSRRNSGPTLRRKPQCSSKGEKFEMVEGYVRSIGSESDDASLDGSLVKKRCAWV 570 TGRL R N+ T R+ Q SK V S G S DGS KK CAWV Sbjct: 112 RASTGRLIRSNNVGT--RRKQYVSKPRS-------VVSDGGLESLPSSDGSQSKKSCAWV 162 Query: 571 TSNTDPLYAAFHDEEWGLPVHDDKKLFELLSFSTALAEITWPVILSKR 714 T NTDP Y AFHDEEWGLPVHDD+KLFELL S ALAE+TWP ILSKR Sbjct: 163 TPNTDPCYTAFHDEEWGLPVHDDRKLFELLVLSGALAELTWPAILSKR 210
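
The per-hit statistics in the report above (Score, Expect, Identities, Positives, Gaps) can be pulled out of the text with a regular expression, even when the original line breaks have been lost. The following is a minimal Python sketch, not part of the BLAST distribution: the names `STATS_RE` and `parse_hit_stats` are hypothetical, and the pattern assumes the legacy BLAST 2.2.x text layout seen in this report. Variants such as `Expect(2) =` are not handled.

```python
import re

# Hypothetical pattern for one per-hit statistics header in legacy BLAST text.
# Whitespace is matched loosely so flattened (single-line) reports also work.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"
)


def parse_hit_stats(text):
    """Return one dict per 'Score = ...' header found in a BLAST text report."""
    hits = []
    for m in STATS_RE.finditer(text):
        evalue = m["evalue"]
        # Legacy BLAST may print very small E-values as 'e-100' (no mantissa).
        if evalue.lower().startswith("e"):
            evalue = "1" + evalue
        length = int(m["length"])
        ident = int(m["ident"])
        hits.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": float(evalue),
            "identities": ident,
            "positives": int(m["pos"]),
            "align_len": length,
            # The Gaps field is omitted by BLAST when the alignment has none.
            "gaps": int(m["gaps"]) if m["gaps"] else 0,
            # Recomputed with one decimal; the report prints an integer percent.
            "pct_identity": round(100.0 * ident / length, 1),
        })
    return hits


# Statistics header of the top hit (gb|EYU24594.1|), copied from the report.
sample = (
    "Score = 309 bits (792), Expect = 5e-82 "
    "Identities = 161/226 (71%), Positives = 176/226 (77%), "
    "Gaps = 4/226 (1%) Frame = +1"
)

for hit in parse_hit_stats(sample):
    print(hit)
```

Running `parse_hit_stats` over the whole report text should yield one entry per alignment, in the order in which the hits appear.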