BLASTX nr result
ID: Mentha29_contig00009730
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00009730
         (1495 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...   306   2e-80
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...   290   2e-75
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   279   2e-72
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...   269   3e-69
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...   265   3e-68
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   262   3e-67
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   256   2e-65
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]             254   5e-65
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   253   1e-64
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...   239   2e-60
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               236   2e-59
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       234   1e-58
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   230   1e-57
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...   228   5e-57
gb|ABD96948.1| hypothetical protein [Cleome spinosa]                  219   3e-54
dbj|BAB08270.1| non-LTR retroelement reverse transcriptase-like ...   217   9e-54
ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein A...   216   2e-53
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...
216 3e-53 dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] 216 3e-53 gb|AAC95175.1| putative non-LTR retroelement reverse transcripta... 215 5e-53 >ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max] Length = 506 Score = 306 bits (783), Expect = 2e-80 Identities = 167/458 (36%), Positives = 251/458 (54%), Gaps = 7/458 (1%) Frame = +2 Query: 2 NGGSHGFIRGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLA 181 NG + +RGLRQGDP+SP LF+ ME L+R ++ D F +HPKC T+L Sbjct: 14 NGYKTEIMGARRGLRQGDPISPMLFVIVMECLNRYLYKMQKDGDFNYHPKCDKLKITNLC 73 Query: 182 FADDLLLFGRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADDLLLF RGD S+ ++ A + F+ +GL +N K + G+ KR ILE+ GF Sbjct: 74 FADDLLLFSRGDKISVGMMMRAYESFSKATGLLVNPQKCSLLCAGIDAVTKREILEVSGF 133 Query: 362 PEGTLPVKYLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVE 541 EG LP KYLG+P+ SK L+T YSPL+ +I I+ W+ LS AGRL+L+ SV+ + Sbjct: 134 QEGQLPFKYLGVPVTSKKLSTIHYSPLIDKIVGKIKHWTARLLSYAGRLQLVNSVMFALT 193 Query: 542 CYWLQALPLPGTVIARITKMLRKFLWR-----DSQCPVSWKTVCLPRHEGGLGLRDLAVW 706 YWL P P +V+ +I + R FLW + PV+WK +C PR GGL + D+ +W Sbjct: 194 NYWLNCFPFPKSVLQKIEAICRIFLWTGGFEGSRKSPVAWKQICSPRSCGGLNIIDIDIW 253 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLISD 886 NKA K LWN+ +K DSLW+KWI A Y++ ++ D+ M IL+ R+ L Sbjct: 254 NKANLMKLLWNLSSKEDSLWVKWIQAYYVKRSELMHIEMKNTDSWIMKAILKQREDL--- 310 Query: 887 CEGNLNDAKAKLVGWFAGKGTSEAYEHFRAKGEKKFWYKAVWRSYIPPKFSVTLWLAMHG 1066 +++ + ++ G + Y + G++K W ++ + P+ + LWLA HG Sbjct: 311 --EKIDNMEELMIRGSINMG--KLYRKLQDCGQRKEWKNLLYGNTARPRANFILWLACHG 366 Query: 1067 RLKTFDRM-KHSDI-ARGCVLCDNADETHDHLFFKCDKAMGVWSGICSWLRCRNQMTTIS 1240 RL T DR+ K+ I + C C + +E+ +HLFF CD + VW + W++ R+ + Sbjct: 367 RLSTKDRLCKYGMIDDKSCCFC-SEEESMNHLFFVCDNSKRVWMEVLQWVQIRHDPSDWP 425 Query: 1241 SAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARNLK 1354 + + G G +A+ T+ +W RN K Sbjct: 426 NELHWLTHHTKGKGTRAAVLKMAIAETIYEIWNIRNNK 463 >ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max] Length 
= 939 Score = 290 bits (741), Expect = 2e-75 Identities = 160/456 (35%), Positives = 246/456 (53%), Gaps = 7/456 (1%) Frame = +2 Query: 2 NGGSHGFIRGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLA 181 NG + +RG+RQGDP+SP LF+ MEYL+R++ F +H KC T+L Sbjct: 456 NGRFTRRLEARRGIRQGDPISPLLFILVMEYLNRILSQLDKIPNFNYHSKCEKMKITNLC 515 Query: 182 FADDLLLFGRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADDLLLF RGD S++++ D + F + GL +N SK +I+ G V K +L + GF Sbjct: 516 FADDLLLFSRGDIGSVQIMLDKFNTFLRSMGLHVNPSKCNIYCGSVDINVKEQLLLISGF 575 Query: 362 PEGTLPVKYLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVE 541 EG +P +YLG+PL+SK L Y L+ +I I WS LS AGR++LI+SV+ Sbjct: 576 KEGKMPFRYLGIPLSSKKLNIKHYQVLIDKIVGRITHWSAGLLSYAGRVQLIQSVIFATI 635 Query: 542 CYWLQALPLPGTVIARITKMLRKFLWRDS-----QCPVSWKTVCLPRHEGGLGLRDLAVW 706 +W+Q LPLP VI RI + R FLW + + P++W+ VC P+ GGL + +LA+W Sbjct: 636 NFWMQCLPLPKFVIMRINAICRSFLWIGNSNISRKSPIAWEKVCSPKINGGLNIINLAIW 695 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLISD 886 NK K LWN+ K+D+LWIKW+H Y+RG IW + + M++++++R L+ Sbjct: 696 NKISILKLLWNVCNKSDNLWIKWLHTYYIRGQSIWSMVLKKSHSWIMSSMMKLRPLLL-- 753 Query: 887 CEGNLNDAKAKLVGWFAGKGTSEAYEHFRAKGEKKFWYKAVWRSYIPPKFSVTLWLAMHG 1066 ++++ F K + Y + EK W + + P+ LW A H Sbjct: 754 ------QYQSRMQDVFKMK---KIYLALFEESEKMSWRTLMCNNLARPRALFCLWQACHF 804 Query: 1067 RLKTFDRMKH--SDIARGCVLCDNADETHDHLFFKCDKAMGVWSGICSWLRCRNQMTTIS 1240 RL + DR+ ++ C C + E+H+HLFF C + +W+ + +WL+ + +T S Sbjct: 805 RLASKDRLIKFGLNVDANCAFCSSM-ESHEHLFFGCIELKTIWTAVLNWLQIIHMPSTWS 863 Query: 1241 SAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARN 1348 + R+ G G A T+ ++W RN Sbjct: 864 EELNWITRKCKGKGWRAMLLKCAFTETIYHIWAYRN 899 >emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1110 Score = 279 bits (714), Expect = 2e-72 Identities = 168/470 (35%), Positives = 238/470 (50%), Gaps = 11/470 (2%) Frame = +2 Query: 26 RGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLAFADDLLLF 205 + ++GLRQGDPMSP LF CMEYLSR + F HPKC + THL FADDLL+F Sbjct: 636 QARKGLRQGDPMSPFLFALCMEYLSRCLEELKGSPDFNFHPKCERLNITHLMFADDLLMF 695 Query: 206 GRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGFPEGTLPVK 385 R D S+ + A F+ SGL + KS+I+ GV R + + G LP + Sbjct: 696 CRADKSSLDHMNVAFQKFSHASGLAASHEKSNIYFCGVDDETARELADYVHMQLGELPFR 755 Query: 386 YLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALP 565 YLG+PL SK LT PL+ I N Q W LS AGRL+LI+S+L ++ YW P Sbjct: 756 YLGVPLTSKKLTYAQCKPLVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFP 815 Query: 566 LPGTVIARITKMLRKFLW-----RDSQCPVSWKTVCLPRHEGGLGLRDLAVWNKALHSKT 730 L VI + K+ RKFLW + PV+W T+ P+ GG + ++ WN+A K Sbjct: 816 LSKKVIQAVEKVCRKFLWTGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLKL 875 Query: 731 LWNIHAKADSLWIKWIHAEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLISDCEGNLNDA 910 LW I K D LW++WIH+ Y++ DI + + I++ RD L N+ D Sbjct: 876 LWAIEFKRDKLWVRWIHSYYIKRQDILTVNISNQTTWILRKIVKARDHL-----SNIGDW 930 Query: 911 KAKLVG-WFAGKGTSEAYEHFRAKGEKKFWYKAVWRSYIPPKFSVTLWLAMHGRLKTFDR 1087 +G F+ K +AY+ GE+ W + + +Y PK LW+ +H RL T DR Sbjct: 931 DEICIGDKFSMK---KAYKKISENGERVRWRRLICNNYATPKSKFILWMMLHERLPTVDR 987 Query: 1088 MKHSDIA--RGCVLCDNADETHDHLFFKCDKAMGVWSGICSWLRCRNQMTT---ISSAVR 1252 + + LC N ET HLFF C + GVWS IC +R N + I S+V Sbjct: 988 ISRWGVQCDLNYRLCRNDGETIQHLFFSCSYSAGVWSKICYIMRFPNSGVSHQEIISSVC 1047 Query: 1253 RFQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVEKKPFEASHVIKEI 1402 R+K G I+ + V +W+ RN + + + + V+++I Sbjct: 1048 GQARKKKGKLIV-----MLYTEFVYAIWKQRNKRTFTGENKDENEVLRKI 1092 >ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max] Length = 947 Score = 269 bits (687), Expect = 3e-69 Identities = 142/402 (35%), Positives = 220/402 (54%), Gaps = 10/402 (2%) Frame = +2 Query: 23 IRGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLAFADDLLL 202 I 
+RG+RQGDP+SP LF+ MEYL+RL+ D F HH KC THL FADD+LL Sbjct: 463 IAAKRGIRQGDPISPLLFVVMMEYLNRLLVKLQLDLNFNHHAKCEKLGITHLTFADDVLL 522 Query: 203 FGRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGFPEGTLPV 382 F RGD S+ ++ ++ F+ T+GL +N +K I+ GGV K I ++ + EG LPV Sbjct: 523 FCRGDVMSVEMMLHVINKFSATTGLVVNPNKCRIYFGGVDGTTKNKIQQISSYEEGQLPV 582 Query: 383 KYLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQAL 562 +YLG+PL SK L Y PL+ +I I+ W++ L+ GR++++ + + +W+Q L Sbjct: 583 RYLGVPLTSKKLNIKYYLPLIDKITTRIRHWTSKLLNMTGRVQMVNCTITAIVQFWMQCL 642 Query: 563 PLPGTVIARITKMLRKFLWRDS-----QCPVSWKTVCLPRHEGGLGLRDLAVWNKALHSK 727 P+P +VI +I M R F+W S + P++W +VC P+ +GGL + +L VWN Sbjct: 643 PIPMSVIKKIDSMCRSFVWSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWNHITVLN 702 Query: 728 TLWNIHAKADSLWIKWIHAEYLRGLDIWEFPYPRRDAPHMTNILRIRD---QLISDCEGN 898 LWN+ K D+LW+KWIHA Y++ + + + N+L R+ L + Sbjct: 703 CLWNLCKKVDNLWVKWIHAHYIKNSSVMNTMVTNNFSWVLKNVLSQREYIHTLQPVWDEL 762 Query: 899 LNDAKAKLVGWFAGKGTSEAYEHFRAKGEKKFWYKAVWRSYIPPKFSVTLWLAMHGRLKT 1078 LN + K+ +AY+ + ++ W + ++ P+ T WLA HGRL T Sbjct: 763 LNSERFKM---------KKAYDKM-MEADRVHWSGLMRKNCARPRAIHTTWLACHGRLGT 812 Query: 1079 FDRMKHSDIARGCV--LCDNADETHDHLFFKCDKAMGVWSGI 1198 DR+ + + LC +ET +H+ F C A +WS + Sbjct: 813 KDRLVRFGMITDKIWSLCKEVEETQNHILFSCKVATDIWSNV 854 >ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max] Length = 514 Score = 265 bits (678), Expect = 3e-68 Identities = 142/405 (35%), Positives = 220/405 (54%), Gaps = 9/405 (2%) Frame = +2 Query: 2 NGGSHGFIRGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLA 181 NG + + G+ QGDP+SP LF+ MEY +R++ + +F HH +C THL+ Sbjct: 117 NGELSNVLETKIGIWQGDPISPLLFVLMMEYFNRIMVKMQRNPSFNHHSQCERLGITHLS 176 Query: 182 FADDLLLFGRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADD+ L RGD S++++ A F+ ++GL IN +K +F GG+ + I ++ GF Sbjct: 177 FADDVFLLCRGDKKSIKMIIKAFSFFSKSTGLQINPAKCKVFCGGLNCDSIQVITKITGF 236 Query: 362 PEGTLPVKYLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVE 541 EGTLPV+YLG+PL+ K L Y PL+ +I I+ WS+ LS 
AGR++L+RS++ + Sbjct: 237 EEGTLPVRYLGVPLSCKKLNVHHYLPLVEKIVGKIRHWSSKLLSIAGRIQLVRSIITAIA 296 Query: 542 CYWLQALPLPGTVIARITKMLRKFLWRDS-----QCPVSWKTVCLPRHEGGLGLRDLAVW 706 YW+ P+P VI +I + R F+W S + V+WK VC P GGL L +L +W Sbjct: 297 QYWMSVFPMPKKVIQKIDSICRSFIWSGSAEVKRKSLVAWKQVCKPARCGGLNLINLELW 356 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLISD 886 N K LWNI +K D+LW+KWIHA +L+G ++ + ++++ R Q Sbjct: 357 NVTAMLKCLWNICSKEDNLWVKWIHAYFLKGDNVMSATIKSNSTWILKSVMKQRPQ---- 412 Query: 887 CEGNLNDAKAKLVGWFAGKGTS--EAYEHFRAKGEKKFWYKAVWRSYIPPKFSVTLWLAM 1060 +N+ + + + S + Y K W++ + + P+ +VTLWLA Sbjct: 413 ----VNNLQLVWIEMLRKRKFSMKQVYMELVEDHNKIDWFRLLRYNRARPRANVTLWLAC 468 Query: 1061 HGRLKTFDRMKHSDIARG--CVLCDNADETHDHLFFKCDKAMGVW 1189 RL T R+K+ ++ + C LC DE DHL F C +W Sbjct: 469 QNRLATKTRLKNMNMIQCSLCSLCKEQDEDLDHLMFSCRVTKAIW 513 >gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana] gi|20197043|gb|AAM14892.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 1412 Score = 262 bits (669), Expect = 3e-67 Identities = 166/470 (35%), Positives = 234/470 (49%), Gaps = 15/470 (3%) Frame = +2 Query: 38 GLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLAFADDLLLFGRGD 217 GLRQG +SP LF+ CM LS ++ + F +HP+C THL FADD+++F G Sbjct: 910 GLRQGCSLSPYLFVICMNVLSAMLDKGAVEKRFGYHPRCRNMGLTHLCFADDIMVFSAGS 969 Query: 218 PDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGFPEGTLPVKYLGL 397 S+ + F SGL I+ KS +F+ + +IL F F G+LPV+YLGL Sbjct: 970 AHSLEGVLAIFKDFAAFSGLNISLEKSTLFMASISSETCASILARFPFDSGSLPVRYLGL 1029 Query: 398 PLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGT 577 PL +K +T D PLL +I + I W N LS AGRL+L+ SV+ + +W+ A LP Sbjct: 1030 PLMTKRMTLADCLPLLEKIRSRISSWKNRFLSYAGRLQLLNSVISSLTKFWISAFRLPRA 1089 Query: 578 VIARITKMLRKFLWRDS-----QCPVSWKTVCLPRHEGGLGLRDLAVWNKALHSKTLWNI 742 I I ++ FLW + + V+W VC P+ EGGLGLR L NK K +W + Sbjct: 1090 CIREIEQISAAFLWSGTDLNPHKAKVAWHDVCKPKSEGGLGLRSLVDANKICCFKLIWRL 1149 Query: 743 HAKADSLWIKWIHAEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLISD--CEGNLNDAKA 916 + 
SLW+ WI +R + E R H +IL ++ + C G + Sbjct: 1150 VSAKHSLWVNWIQNNLIR--TVAEALSSHRRRSHRDDILNDIEEELEKLLCRGICTEQDR 1207 Query: 917 KLV----GWFAGKGTS-EAYEHFRAKGEKKFWYKAVWRSYIPPKFSVTLWLAMHGRLKTF 1081 L G F K S E + R +G K W+KA+W S PKF+ WLA H RL T Sbjct: 1208 SLCRSIGGQFKAKFFSPEIWHQIREQGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTTG 1267 Query: 1082 DRMK--HSDIARGCVLCDNADETHDHLFFKCDKAMGVWSGIC-SWLRCRNQMTTISSAVR 1252 D+M + I+ CVLC+ + E+ DHLFF C+ + +W + L CR TT A+ Sbjct: 1268 DKMASWNRGISSVCVLCNISAESRDHLFFSCNFSSHIWDRLTRRLLLCR--YTTNFPALL 1325 Query: 1253 RFQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVEKKPFEASHVIKEI 1402 + SG R AT+ LW+ RN + P + H+IK I Sbjct: 1326 LLLSGQDFSGTKRFLLRYVFQATIHTLWRERNKRRHGDLPIPSDHIIKFI 1375 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1114 Score = 256 bits (653), Expect = 2e-65 Identities = 157/470 (33%), Positives = 229/470 (48%), Gaps = 13/470 (2%) Frame = +2 Query: 32 QRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLAFADDLLLFGR 211 Q+GLRQGDP+SP LF MEYLSR + D F HPKC THL FADDLL+F R Sbjct: 641 QKGLRQGDPLSPFLFALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMFAR 700 Query: 212 GDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGFPEGTLPVKYL 391 D S+ + A + F+ SGL + KS I+ GGV E + + P G+LP +YL Sbjct: 701 ADASSISKIMAAFNSFSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFRYL 760 Query: 392 GLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLP 571 G+PLASK L PL+ +I Q W LS AGRL+L++++L ++ YW Q PLP Sbjct: 761 GVPLASKKLNFSQCKPLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLP 820 Query: 572 GTVIARITKMLRKFLWRDS-----QCPVSWKTVCLPRHEGGLGLRDLAVWNKALHSKTLW 736 +I + RKFLW + + PV+W + P+ GGL + ++ +WNKA K LW Sbjct: 821 KKLIKAVETTCRKFLWTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILKLLW 880 Query: 737 NIHAKADSLWIKWIHAEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLISDCEGNLNDAKA 916 I K D LW++W++A Y++ +I + + I R+ L Sbjct: 881 AITFKQDKLWVRWVNAYYIKRQNIENVTVSSNTSWILRKIFESRELL------------T 928 Query: 917 KLVGWFA-----GKGTSEAYEHFRAKGEKKFWYKAVWRSYIPPKFSVTLWLAMHGRLKTF 1081 + GW A + Y+ + E W + + + PK LWLAM RL T Sbjct: 929 
RTGGWEAVSNHMNFSIKKTYKLLQEDYENVVWKRLICNNKATPKSQFILWLAMLNRLATA 988 Query: 1082 DRMK--HSDIARGCVLCDNADETHDHLFFKCDKAMGVWSGICSWLRCRNQMTTISSAVRR 1255 +R+ + D++ C +C N ET HLFF C + +W + +L + Q + A + Sbjct: 989 ERVSRWNRDVSPLCKMCGNEIETIQHLFFNCIYSKEIWGKVLLYLNLQPQAD--AQAKKE 1046 Query: 1256 FQREKAGSGIIRKAKWVAL-GATVQYLWQARNLKYVEKKPFEASHVIKEI 1402 +KA S R +V + +V +W RN K + +K I Sbjct: 1047 LAIKKARSTKDRNKLYVMMFTESVYAIWLLRNAKVFRGIEINQNQAVKSI 1096 >gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana] Length = 653 Score = 254 bits (650), Expect = 5e-65 Identities = 160/474 (33%), Positives = 229/474 (48%), Gaps = 10/474 (2%) Frame = +2 Query: 2 NGGSHGFIRGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLA 181 NG GF + +RGLRQG +SP LF+ M+ LS+L+ F +H +C THL+ Sbjct: 170 NGELVGFFQSKRGLRQGCSLSPYLFVMSMDVLSKLLDQAASAKKFGYHSRCKELSLTHLS 229 Query: 182 FADDLLLFGRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADDL++ G S+ + + D+F SGL I+ KS I+L GV I + F Sbjct: 230 FADDLMVLSDGKVRSIDGIVEVFDIFAKFSGLKISMEKSTIYLAGVTEDVYHEIQNRYQF 289 Query: 362 PEGTLPVKYLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVE 541 G LPV+YLGLPL +K LT DYSPLL I I W+ LS AGRL LI SVL + Sbjct: 290 DVGQLPVRYLGLPLVTKRLTATDYSPLLEHIKKKIGTWTTRYLSYAGRLNLITSVLWSIC 349 Query: 542 CYWLQALPLPGTVIARITKMLRKFLW-----RDSQCPVSWKTVCLPRHEGGLGLRDLAVW 706 +WL A LP I I K+ FLW + V W VC P+ EGGLGLR L Sbjct: 350 NFWLAAFRLPRECIREIDKICSAFLWSGPDLNPRKTRVCWGDVCKPKQEGGLGLRSLKEM 409 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLISD 886 N+ K +W I + +SLW++WI L+ W + +M ++L Sbjct: 410 NEVSCLKLIWRIVSHTNSLWVRWIEQYLLKHDTFWSV----QTTTNMDSVL--------- 456 Query: 887 CEGNLNDAKAKLVGWFAGKGTSEAYEHFRAKGEKKFWYKAVWRSYIPPKFSVTLWLAMHG 1066 G ++ K T + + R W+ +W ++ PKFS WLA+ Sbjct: 457 WRGRNDEYMPKF-------STRDTWNQTRNTSTPVTWHMGIWFAHATPKFSFCAWLAVQN 509 Query: 1067 RLKTFDRMK--HSDIARGCVLCDNADETHDHLFFKCDKAMGVWSGICSWL---RCRNQMT 1231 RL T D+M + ++ CVLC+N ET +HLFF C +W + + + + Sbjct: 510 RLSTGDKMLQWNRRLSPTCVLCNNNIETRNHLFFSCCYTAEIWENLAKNIYKAKFSTNWS 569 Query: 
1232 TISSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVEKKPFEASHVI 1393 TI ++V R + S + R AT+ +W RN + ++ A+H+I Sbjct: 570 TILTSVSTTWRNRTESFLAR----YIFQATIHTIWHERNGRRHGERSNSATHLI 619 >ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca subsp. vesca] Length = 958 Score = 253 bits (647), Expect = 1e-64 Identities = 166/479 (34%), Positives = 234/479 (48%), Gaps = 12/479 (2%) Frame = +2 Query: 2 NGGSHGFIRGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDST-FIHHPKCSTTDTTHL 178 NG GF +RGLRQGDP+SP LF+ ME LS I R + S F +H +C + +HL Sbjct: 464 NGELAGFFARRRGLRQGDPLSPYLFVIAMEVLSLCIQRRINCSPCFRYHWRCDQLNLSHL 523 Query: 179 AFADDLLLFGRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFG 358 FADDLL+F GD +S+R L DA F S L N S+S IFL GV ++L++ Sbjct: 524 CFADDLLMFCNGDENSVRTLHDAFSNFESLSSLKANVSESKIFLAGVDGNSSDSVLQVTN 583 Query: 359 FPEGTLPVKYLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGV 538 F GT PV+YLG+PL + L D SPLL +I I+ W N LS AGRL+LI+SVL + Sbjct: 584 FSLGTCPVRYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSI 643 Query: 539 ECYWLQALPLPGTVIARITKMLRKFLWRD-----SQCPVSWKTVCLPRHEGGLGLRDLAV 703 + YW L LP V+ I K LR FLW + V+W +CLP+ EGGLG++DL Sbjct: 644 QVYWASHLILPKKVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHC 703 Query: 704 WNKALHSKTLWNIHAKADSLWIKWIHAEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLIS 883 WNKAL +WN+ + + + W W+ L+G W P P + + +L+IR+ S Sbjct: 704 WNKALMISHIWNLVSSSSNFWTDWVKVYLLKGNSFWNAPLPSICSWNWRKLLKIRELCCS 763 Query: 884 DCEGNLNDAKAKLVGWFAGKGTSEAYEHFRAKGEKKFWYKAVWRSYI--PPKFSVTLWLA 1057 + D G+ TS ++++ G W S I S + L Sbjct: 764 FFVNIIGD----------GRATSLWFDNWHPLGP----LTLRWSSNIIGESGLSKSAMLT 809 Query: 1058 MHGRLKTFDRMKHSDIARGCV----LCDNADETHDHLFFKCDKAMGVWSGICSWLRCRNQ 1225 +G T +R V L ETH+HLFF C + G+W+ + S Sbjct: 810 PNGFYSTSSAWNTLRPSRFIVPWYRLVWFVAETHNHLFFDCAYSFGIWTHVLSKCDVSKP 869 Query: 1226 MTTISSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVEKKPFEASHVIKEI 1402 + S + G+ + +AL A V +W+ RN + + + V K I Sbjct: 870 LLPWSDFIFWVATNWKGNSLPVVILKLALQAVVYAIWRERNNRRFRNESLPPAVVFKGI 928 >ref|XP_004253224.1| 
PREDICTED: uncharacterized protein LOC101268376 [Solanum lycopersicum] Length = 717 Score = 239 bits (610), Expect = 2e-60 Identities = 118/276 (42%), Positives = 168/276 (60%), Gaps = 5/276 (1%) Frame = +2 Query: 2 NGGSHGFIRGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLA 181 NG + +GLRQGDPMSP LF MEYLSRL+ D +F +HPK + D THL Sbjct: 436 NGQNTQRFDAAKGLRQGDPMSPFLFAIAMEYLSRLLKGLKEDKSFKYHPKYAKLDVTHLC 495 Query: 182 FADDLLLFGRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADDLLLF RGD +S++ L+ F+ SGL N +KS I+ GGV+ ++ I++ G+ Sbjct: 496 FADDLLLFSRGDLNSIKALQKCFTEFSQASGLQANLNKSSIYCGGVQMEVRQQIIQQLGY 555 Query: 362 PEGTLPVKYLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVE 541 LP KYLG+PL+SK L T + PL+ ++ I W+ LS AGR +L+++VL GV+ Sbjct: 556 TIEELPFKYLGVPLSSKKLNTIQWYPLIEKVMARINSWTAKKLSYAGRAQLVKTVLFGVQ 615 Query: 542 CYWLQALPLPGTVIARITKMLRKFLWR-----DSQCPVSWKTVCLPRHEGGLGLRDLAVW 706 W Q +P +I I + R +LW + ++W VC P++EGGLGL +L +W Sbjct: 616 ALWAQLFIIPAKIIKLIEGLCRSYLWSGVGYVTKKALIAWDKVCSPKYEGGLGLINLKIW 675 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGLDIWE 814 N++ +K W++ K D LWIKWIHA Y++G W+ Sbjct: 676 NRSAVTKLCWDLANKEDKLWIKWIHAYYIKGQREWK 711 >gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] Length = 872 Score = 236 bits (603), Expect = 2e-59 Identities = 165/557 (29%), Positives = 255/557 (45%), Gaps = 93/557 (16%) Frame = +2 Query: 2 NGGSHGFIRGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLA 181 NG G+ + +RGLRQG +SP LF+ CM+ LS+++ F HPKC THL+ Sbjct: 290 NGDLVGYFQSKRGLRQGCSLSPYLFVICMDVLSKMLDKAAGVRKFGFHPKCQRLGLTHLS 349 Query: 182 FADDLLLFGRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADDL++ G S+ + + D F SGL I+ KS +++ GV P K+ I F F Sbjct: 350 FADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVSPIIKQEIAAKFLF 409 Query: 362 PEGTLPVKYLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVE 541 G LPV+YLGLPL +K LT+ DYSPLL QI I W+ S AGR LI+SVL + Sbjct: 410 DVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGRFNLIKSVLWSIC 469 Query: 542 
CYWLQALPLPGTVIARITKMLRKFLWRDSQ-----CPVSWKTVCLPRHEGGLGLRDL--- 697 +WL A LP I I K+ FLW S+ +SW VC P+ EGGLGLR+L Sbjct: 470 NFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISWDIVCKPKAEGGLGLRNLKEA 529 Query: 698 -----------------AVWNK-----ALHSKTLWNIHAKADSLWIKWIHAEYLRGLDI- 808 ++W K + K++W++ K + WI + L+ D+ Sbjct: 530 NDVSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIWSL--KQSTSMGSWIWRKILKIRDVA 587 Query: 809 --------------------W----------------EFPYPRRDAP-----------HM 847 W + PR + H Sbjct: 588 KSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPREASVADAWTRRSRRRHR 647 Query: 848 TNILRIRDQLISDCEGNLNDAKAKLVGWFAGKG--------TSEAYEHFRAKGEKKFWYK 1003 T++L +++++ + +DA+ ++ + GK T + + +A W+K Sbjct: 648 TSLLNEIEEMMAYQRIHHSDAEDTVL--WRGKNDVFKPHFSTRDTWHLIKATSSTVSWHK 705 Query: 1004 AVWRSYIPPKFSVTLWLAMHGRLKTFDRM----KHSDIARGCVLCDNADETHDHLFFKCD 1171 VW + PK+++ WLA+H RL T DRM ++ CVLC N +T +HLFF C Sbjct: 706 GVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGNCVLCTNNSKTLEHLFFSCS 765 Query: 1172 KAMGVWSGICSWL---RCRNQMTTISSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQA 1342 A VW+ + + R + + + + + +++ + R AT+ ++W+ Sbjct: 766 YASTVWAALAKGIWKTRYSTRWSHLLTHISTHFQDRVEGFLTR----YIFQATIYHVWRE 821 Query: 1343 RNLKYVEKKPFEASHVI 1393 RN + + P + VI Sbjct: 822 RNGRRHDAAPNTPATVI 838 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 234 bits (596), Expect = 1e-58 Identities = 120/294 (40%), Positives = 165/294 (56%), Gaps = 5/294 (1%) Frame = +2 Query: 2 NGGSHGFIRGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLA 181 NGG+ GF + +GLRQGDP+SP LF+ ME S L+H+R +HPK S +HL Sbjct: 637 NGGNGGFFKSTKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPKASNLSISHLM 696 Query: 182 FADDLLLFGRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADD+++F G S+ + + LD F SGL +NK KSH++L G+ E A +GF Sbjct: 697 FADDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGLNQLESNA-NAAYGF 755 Query: 362 PEGTLPVKYLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVE 541 P GTLP++YLGLPL ++ L +Y PLL +I + W N LS AGR++LI SV+ G Sbjct: 756 PIGTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSI 
815 Query: 542 CYWLQALPLPGTVIARITKMLRKFLW-----RDSQCPVSWKTVCLPRHEGGLGLRDLAVW 706 +W+ LP I RI + +FLW + VSW +CLP+ EGGLGLR L W Sbjct: 816 NFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEW 875 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGLDIWEFPYPRRDAPHMTNILRIR 868 NK L + +W + DSLW W H +L W + D+ +L +R Sbjct: 876 NKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTWKRLLSLR 929 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 230 bits (586), Expect = 1e-57 Identities = 116/275 (42%), Positives = 162/275 (58%), Gaps = 5/275 (1%) Frame = +2 Query: 2 NGGSHGFIRGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLA 181 +G G+ +G +GLRQGDP+SP+LF+ ME LSRL+ + D + +HPK S + LA Sbjct: 636 SGSLCGYFKGSKGLRQGDPLSPSLFVIAMEILSRLLENKFSDGSIGYHPKASEVRISSLA 695 Query: 182 FADDLLLFGRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADDL++F G S+R ++ L+ F SGL +N KS ++ G+ +K L FGF Sbjct: 696 FADDLMIFYDGKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTL-AFGF 754 Query: 362 PEGTLPVKYLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVE 541 GT P +YLGLPL + L DYS L+ +IA W+ LS AGRL+LI SV+ Sbjct: 755 VNGTFPFRYLGLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTV 814 Query: 542 CYWLQALPLPGTVIARITKMLRKFLW-----RDSQCPVSWKTVCLPRHEGGLGLRDLAVW 706 +WL + LP + I +M +FLW R VSW+ CLP+ EGGLGLR+ W Sbjct: 815 NFWLSSFILPKCCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTW 874 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGLDIW 811 NK L+ + +W + A+ DSLW+ W HA LR ++ W Sbjct: 875 NKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFW 909 >gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana] Length = 629 Score = 228 bits (581), Expect = 5e-57 Identities = 166/556 (29%), Positives = 246/556 (44%), Gaps = 92/556 (16%) Frame = +2 Query: 2 NGGSHGFIRGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLA 181 NG G+ R RG+RQG +SP LF+ ME LS+++ F HPKC THL Sbjct: 48 NGELAGYFRSARGIRQGCALSPYLFVISMEVLSKMLDQAAGGKRFGFHPKCKNLGLTHLC 107 Query: 182 
FADDLLLFGRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADDL++ G S+ + + +++F SGL IN K+ ++ GV + ++ + F Sbjct: 108 FADDLMILTDGKVRSVDGIVEVMNLFAKRSGLQINMEKTTLYTAGVSDHNRYMMISRYPF 167 Query: 362 PEGTLPVKYLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVE 541 G LPV+YLGLPL +K LT D SPL QI N I W++ LS AGRL LI SVL Sbjct: 168 GLGQLPVRYLGLPLVTKRLTKEDLSPLFEQIRNRIGTWTSRYLSFAGRLNLISSVLWSTM 227 Query: 542 CYWLQALPLPGTVIARITKMLRKFLWRDSQ-----CPVSWKTVCLPRHEGGLGLRDLA-- 700 +W+ A LP + I + FLW + VSW +C P+ EGGLGLR L Sbjct: 228 NFWMSAFRLPSACLKEINSICSAFLWSGPELHRRKAKVSWDDICKPKQEGGLGLRSLTEA 287 Query: 701 --------VWNKALHSKTLW------NIHAKADSLWI--------KWIHAEYLRGLDIWE 814 +W + +LW N+ K +S W W+ + L+ + + Sbjct: 288 NVVSVLKLIWRVTSNDDSLWVKWSKMNL-LKQESFWSLTPNSSLGSWMWKKMLKYRETAK 346 Query: 815 FPYPRRDAP----------------HMTNILRIRDQL---------ISDCEGN------- 898 P+ R + H+ ++ R Q+ +++ N Sbjct: 347 -PFSRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQIDLGISRNKTVAEAWSNRRRRKHR 405 Query: 899 ---LNDAKAKL-------------VGWFAGKG--------TSEAYEHFRAKGEKKFWYKA 1006 LND +A L + GKG T + + R K + WYK Sbjct: 406 TEQLNDIEAALNQKYQTRNLLREDATLWRGKGDVFKTSFSTKDTWNQVRKKSNEVAWYKG 465 Query: 1007 VWRSYIPPKFSVTLWLAMHGRLKTFDRMK----HSDIARGCVLCDNADETHDHLFFKCDK 1174 VW S+ PK+ WLA+ RL T RM+ SD+ C C + ET DHLFF C Sbjct: 466 VWFSHSTPKYQFCTWLALRNRLSTGYRMQLWNNGSDVK--CTFCSTSIETRDHLFFSCSY 523 Query: 1175 AMGVWSGICSWL---RCRNQMTTISSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQAR 1345 A +W+ I + R TI + + Q ++ S + R TV +W+ R Sbjct: 524 ASAIWTAIAKNVLQHRFSTDWQTIVNYISETQTDRIRSFLSR----YIFQLTVHTVWKER 579 Query: 1346 NLKYVEKKPFEASHVI 1393 N + ++P ++++I Sbjct: 580 NDRRHGEEPRTSANLI 595 >gb|ABD96948.1| hypothetical protein [Cleome spinosa] Length = 539 Score = 219 bits (557), Expect = 3e-54 Identities = 142/416 (34%), Positives = 203/416 (48%), Gaps = 20/416 (4%) Frame = +2 Query: 2 NGGSHGFIRGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLA 181 NG G+ +G+RGLRQGDP+SP LF+ ME LSR++ +S HPKC + THLA Sbjct: 90 NGELAGYFKGRRGLRQGDPLSPYLFIMSMEVLSRMLDRCAAESRLSLHPKCHSPVITHLA 149 Query: 182 
FADDLLLFGRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADD+++F G+ S+ +++ LD F+ SGL +N K+ IFL G+ E + + GF Sbjct: 150 FADDIMIFTSGETRSLLEVKNTLDSFSRASGLYLNTEKTEIFLRGLNGTEASTLCAVIGF 209 Query: 362 PEGTLPVKYLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVE 541 G LPV+YLG+ L+ LT DY PLL ++ I W+ LS AGRL+L+ +V+ G+ Sbjct: 210 TRGYLPVRYLGVSLSPVRLTKSDYQPLLDRVKAKINSWTTRYLSYAGRLQLVGTVIYGMV 269 Query: 542 CYWLQALPLPGTVIARITKMLRKFLW-RDSQCPVSWKTVCLPRHEGGLGLRDLAVWNKAL 718 W LP ++ ++ FLW + VSW T C PR EGGLGLR +A +N Sbjct: 270 NAWGMIFMLPKFFTKQVDRLCAGFLWGAGTTHRVSWDTCCRPRKEGGLGLRKIAEFN--- 326 Query: 719 HSKTLWNIHAKADSLWIKWIHAEYLRGL--------------DIWEFPYPRRDAPHMTNI 856 + W I+ ++++ R L D W FP R D + + Sbjct: 327 --QDPWTIYGSL----LRYVGLTGPRSLRIPLPSSVSQAVAGDSWIFPGVRSD--RLQQV 378 Query: 857 LRIRDQLISDCEGNLNDA---KAKLVGWFAGKGTSEAYEHFRAKGEKKFWYKAVWRSYIP 1027 L + +D+ K K + +S + R W VW Sbjct: 379 LAHISTIPPPSPDGPSDSALWKYKEEDFRPYFSSSRTWNLTRTVHVIAPWSSIVWFPLAI 438 Query: 1028 PKFSVTLWLAMHGRLKTFDRMKHSDIARG--CVLCDNADETHDHLFFKCDKAMGVW 1189 P+ + W M RL T DR++ I C LCD DE+H HLFF C A +W Sbjct: 439 PRHAFLHWQVMLFRLPTKDRLQQWGITSDATCRLCDGEDESHQHLFFGCTYASHLW 494 >dbj|BAB08270.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 489 Score = 217 bits (553), Expect = 9e-54 Identities = 120/308 (38%), Positives = 164/308 (53%), Gaps = 6/308 (1%) Frame = +2 Query: 2 NGGSHGFIRGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLA 181 NG G+ RGLRQG +SP LF+ M LS+L+ T F +HP+C THL+ Sbjct: 25 NGELAGYFNSTRGLRQGCSLSPYLFVVSMNVLSKLLDKATGQRRFGYHPRCKQMGLTHLS 84 Query: 182 FADDLLLFGRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADDL++ G S+ + + + F SGL I+ KS ++ G+ + ++ F F Sbjct: 85 FADDLMVLSDGKVRSIEGIVEVFETFAKCSGLRISMEKSTVYFAGLSHTSPQEVMAHFPF 144 Query: 362 PEGTLPVKYLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVE 541 GTLPV+YLGLPL +K L++ DY PL+ I I WS LS AGRL LI SVL + Sbjct: 145 AVGTLPVRYLGLPLVTKQLSSTDYLPLIEHIKKKIGSWSARFLSYAGRLNLISSVLWSIC 204 Query: 542 
CYWLQALPLPGTVIARITKMLRKFLW-----RDSQCPVSWKTVCLPRHEGGLGLRDLAVW 706 +W+ A LP I I KM +LW S+ ++W VC P+ EGGLGLR L Sbjct: 205 NFWMGAFRLPRECIREIDKMCSAYLWSGGDLNTSKAKIAWTDVCKPKDEGGLGLRSLKEA 264 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGLDIWEFPYPRRDAPHM-TNILRIRDQLIS 883 N K +W I + ADSLW+KWIHA L+ + W M +L+ RD I Sbjct: 265 NDVSCLKLIWRIISHADSLWVKWIHATLLKQVSFWAVRENTSLGSWMWKKVLKFRDAAIQ 324 Query: 884 DCEGNLND 907 C+ +N+ Sbjct: 325 LCKAEVNN 332 >ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine max] Length = 316 Score = 216 bits (550), Expect = 2e-53 Identities = 114/294 (38%), Positives = 167/294 (56%), Gaps = 5/294 (1%) Frame = +2 Query: 65 PTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLAFADDLLLFGRGDPDSMRVLRD 244 P ++L C + +R + + D+ F HP C+ +HLAFADD++L RGD M + Sbjct: 21 PFIYL-CFVWSTRDMSSFKDDANFKFHPNCAGIQLSHLAFADDIMLLSRGDIPYMSTMFA 79 Query: 245 ALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGFPEGTLPVKYLGLPLASKSLTT 424 L F SGL+I+ KS I+ G+RP+E I +L GF G P +YLG PL S L Sbjct: 80 KLQHFCRVSGLSISSDKSAIYSAGIRPYELSHIQQLTGFSLGGFPFRYLGAPLLSSRLNV 139 Query: 425 PDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVECYWLQALPLPGTVIARITKML 604 Y+PLL +I IQ W+ +LS G+LELI++V+QG+ +W++ PLP +V+ RI Sbjct: 140 CHYAPLLYKIVGLIQGWNKKSLSYVGKLELIKAVIQGIMNFWMRIFPLPQSVLDRINASC 199 Query: 605 RKFLWRDSQCP-----VSWKTVCLPRHEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWI 769 FLW + V+W VC P+ EGGLGL +L WN AL S LW+ H K DSL + Sbjct: 200 CNFLWSKADIGKNKPLVAWPVVCSPKQEGGLGLFNLKDWNLALLSHILWDFHCKKDSLRV 259 Query: 770 KWIHAEYLRGLDIWEFPYPRRDAPHMTNILRIRDQLISDCEGNLNDAKAKLVGW 931 +W+H Y R D W + ++ + I++IRD +IS E ++ + K ++ W Sbjct: 260 RWVHHYYFRRSDEWNYNISSSNSVLIKKIIQIRDFIISK-ELSMEETKKRIQSW 312 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 216 bits (549), Expect = 3e-53 Identities = 113/277 (40%), Positives = 157/277 (56%), Gaps = 6/277 (2%) Frame = +2 Query: 2 NGGSHGFIRGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIH-HPKCSTTDTTHL 178 NG + GF R +GLRQGDP+SP LF+ ME S+L+++R +DS +IH HPK +HL 
Sbjct: 497 NGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSR-YDSGYIHYHPKAGDLSISHL 555 Query: 179 AFADDLLLFGRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFG 358 FADD+++F G SM + + LD F SGL +NK KS +F G+ +R +G Sbjct: 556 MFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL-DLSERITSAAYG 614 Query: 359 FPEGTLPVKYLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGV 538 FP GT P++YLGLPL + L DY PLL +++ ++ W + LS AGR +LI SV+ G+ Sbjct: 615 FPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGL 674 Query: 539 ECYWLQALPLPGTVIARITKMLRKFLWRDS-----QCPVSWKTVCLPRHEGGLGLRDLAV 703 +W+ LP I +I + KFLW S VSW CLP+ EGGLG R Sbjct: 675 INFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGE 734 Query: 704 WNKALHSKTLWNIHAKADSLWIKWIHAEYLRGLDIWE 814 WNK L + +W + + SLW +W L W+ Sbjct: 735 WNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQ 771 Score = 58.5 bits (140), Expect = 7e-06 Identities = 45/169 (26%), Positives = 69/169 (40%), Gaps = 4/169 (2%) Frame = +2 Query: 938 GKGTSEAYEHFRAKGEKKFWYKAVWRSYIPPKFSVTLWLAMHGRLKTFDRMKHSDIARG- 1114 G ++ +E R + K W K+VW PK + W A RL T R+ + Sbjct: 891 GFSAAKTWEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSA 950 Query: 1115 -CVLCDNADETHDHLFFKCDKAMGVWSGICSWLRCRNQMTTISSAVRRFQREK--AGSGI 1285 C LC ET DHL CD + VW + L R ++ + + + R+ A + Sbjct: 951 ECCLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQSTAAAPSL 1010 Query: 1286 IRKAKWVALGATVQYLWQARNLKYVEKKPFEASHVIKEIKLDVYRVLYS 1432 +RK V V LW+ RNL S V + + ++ V+ S Sbjct: 1011 LRK---VVAQLVVYNLWRQRNLVLHSSLRVSCSVVFRLVDRELRNVILS 1056 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 216 bits (549), Expect = 3e-53 Identities = 113/277 (40%), Positives = 157/277 (56%), Gaps = 6/277 (2%) Frame = +2 Query: 2 NGGSHGFIRGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIH-HPKCSTTDTTHL 178 NG + GF R +GLRQGDP+SP LF+ ME S+L+++R +DS +IH HPK +HL Sbjct: 497 NGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSR-YDSGYIHYHPKAGDLSISHL 555 Query: 179 AFADDLLLFGRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFG 358 FADD+++F G SM + + LD F SGL +NK KS +F G+ 
+R +G Sbjct: 556 MFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL-DLSERITSAAYG 614 Query: 359 FPEGTLPVKYLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGV 538 FP GT P++YLGLPL + L DY PLL +++ ++ W + LS AGR +LI SV+ G+ Sbjct: 615 FPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGL 674 Query: 539 ECYWLQALPLPGTVIARITKMLRKFLWRDS-----QCPVSWKTVCLPRHEGGLGLRDLAV 703 +W+ LP I +I + KFLW S VSW CLP+ EGGLG R Sbjct: 675 INFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGE 734 Query: 704 WNKALHSKTLWNIHAKADSLWIKWIHAEYLRGLDIWE 814 WNK L + +W + + SLW +W L W+ Sbjct: 735 WNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQ 771 >gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1352 Score = 215 bits (547), Expect = 5e-53 Identities = 111/275 (40%), Positives = 154/275 (56%), Gaps = 5/275 (1%) Frame = +2 Query: 2 NGGSHGFIRGQRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFIHHPKCSTTDTTHLA 181 NG GF R +RGLRQG +SP L++ CM LS ++ + +HP+C + THL Sbjct: 790 NGELSGFFRSERGLRQGCSLSPYLYVICMNVLSCMLDKAAVEKKISYHPRCRNMNLTHLC 849 Query: 182 FADDLLLFGRGDPDSMRVLRDALDVFTMTSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADD+++F G S++ + F S L I+ KS IF+ G+ P K +IL+ F F Sbjct: 850 FADDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISPNAKTSILQQFPF 909 Query: 362 PEGTLPVKYLGLPLASKSLTTPDYSPLLAQIANFIQRWSNSNLSRAGRLELIRSVLQGVE 541 GTLPVKYLGLPL +K +T DY PL+ +I I W+N LS AGRL+LI+SVL + Sbjct: 910 ELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQLIKSVLSSIT 969 Query: 542 CYWLQALPLPGTVIARITKMLRKFLW-----RDSQCPVSWKTVCLPRHEGGLGLRDLAVW 706 +WL LP + I KM FLW + ++W VC + EGGLGL+ L Sbjct: 970 NFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKEEGGLGLKPLKEA 1029 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGLDIW 811 N+ K +W I + DSLW+KW++ +R W Sbjct: 1030 NEVSLLKLIWRILSARDSLWVKWVNKHLIRKETFW 1064 Score = 70.5 bits (171), Expect = 2e-09 Identities = 57/200 (28%), Positives = 87/200 (43%), Gaps = 2/200 (1%) Frame = +2 Query: 617 WRDSQCPVSWKTVCLPRHEGGLGLRDLAVWNKALHSKTLWNIHAKADSLWIKWIHAEYLR 796 W D CP+ L +H G G DL + N A ++ + N H 
+ K A++L Sbjct: 1104 WHDHWCPLGR----LHQHMGSRGTIDLGIPNNATVAEVM-NTHRR------KRHRADFLN 1152 Query: 797 GLDIWEFPYPRRDAPHMTNILRIRDQLISDCEGNLNDAKAKLVGWFAGKGTSEAYEHFRA 976 + + R+D +G+ + K K + + +S+ ++ R+ Sbjct: 1153 QIKS-QIELARQDR---------------STDGDRSLWKQKEDTFKSSFSSSKTWQQIRS 1196 Query: 977 KGEKKFWYKAVWRSYIPPKFSVTLWLAMHGRLKTFDRM-KHSDIAR-GCVLCDNADETHD 1150 + WY+ VW S PK+S WLA H RL T D++ K + AR CV C ET D Sbjct: 1197 ISLRCDWYRGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWNSGARYDCVFCGEELETRD 1256 Query: 1151 HLFFKCDKAMGVWSGICSWL 1210 HLFF C + VW + L Sbjct: 1257 HLFFSCPYSSHVWFSLTKGL 1276