BLASTX nr result
ID: Mentha23_contig00016214
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00016214
         (2757 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]            309   4e-81
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...  300   2e-78
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...  280   7e-78
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...  295   8e-77
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]              286   2e-76
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...  289   5e-75
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...  234   3e-74
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]      277   9e-74
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...  271   4e-72
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...  262   2e-71
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...  276   5e-71
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...  264   1e-70
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...  271   1e-69
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...  259   2e-69
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...  265   4e-69
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...  269   6e-69
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]          269   6e-69
gb|ABD96948.1| hypothetical protein [Cleome spinosa]                 269   6e-69
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta... 
258 7e-68 gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana] 257 2e-67 >gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana] Length = 653 Score = 309 bits (792), Expect = 4e-81 Identities = 194/569 (34%), Positives = 287/569 (50%), Gaps = 35/569 (6%) Frame = +2 Query: 221 PLLALMLFIK*SQKILISRMALFLQKLISLAQSVFIKGRTIMDNFYLAQELIRTYERKSG 400 PL + K KI+ +R+ + L K I+ Q+ F+K R +++N LA EL++ Y ++S Sbjct: 58 PLSCCNVIYKIISKIIANRLKMVLPKFIAGNQTAFVKDRLLIENLLLATELVKDYHKES- 116 Query: 401 ITARCMVKIDLRKAYDCISWDFLRDVLHGLNFHPCFVYWILTYVISATFSITINGGSHGF 580 +++RC +KID+ KA++ + W F+R++L ++F FV+WI+ + +A+FS+ +NG GF Sbjct: 117 VSSRCAIKIDISKAFNSVQWSFIRNILLSMDFPMEFVHWIMLCISTASFSVQVNGELVGF 176 Query: 581 VRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLL 760 + KRGLRQG +SP LF+ M+ LS+L+ F +H +C THL+FADDL++ Sbjct: 177 FQSKRGLRQGCSLSPYLFVMSMDVLSKLLDQAASAKKFGYHSRCKELSLTHLSFADDLMV 236 Query: 761 FGRGDPDSMRVLRDALDEFTVTSGLTINKSKSHIFLGGVRPFEKRAIMELFGFPEGTLPV 940 G S+ + + D F SGL I+ KS I+L GV I + F G LPV Sbjct: 237 LSDGKVRSIDGIVEVFDIFAKFSGLKISMEKSTIYLAGVTEDVYHEIQNRYQFDVGQLPV 296 Query: 941 KYLGLPLASKSLTIPDYSPLLAQISNFIQRWSNSNLSSAGWLELIRSVLQGVGCYWLQAL 1120 +YLGLPL +K LT DYSPLL I I W+ LS AG L LI SVL + +WL A Sbjct: 297 RYLGLPLVTKRLTATDYSPLLEHIKKKIGTWTTRYLSYAGRLNLITSVLWSICNFWLAAF 356 Query: 1121 PLPGTVINRITKMLRKFLWCN-----SQCLVSWKTVCLPRGEGGLGLRDL---------- 1255 LP I I K+ FLW + V W VC P+ EGGLGLR L Sbjct: 357 RLPRECIREIDKICSAFLWSGPDLNPRKTRVCWGDVCKPKQEGGLGLRSLKEMNEVSCLK 416 Query: 1256 AVW-----NNPFIRRPCGTYMPKQTPFGSNGSTLSTSEDRTFGKG----------TSEAY 1390 +W N R Y+ K F S +T T+ D +G T + + Sbjct: 417 LIWRIVSHTNSLWVRWIEQYLLKHDTFWSVQTT--TNMDSVLWRGRNDEYMPKFSTRDTW 474 Query: 1391 EHFRAKGEKKFWYKAVWRSYIPPKFSVTLWLALHGRLKTFDRMKYSD--IARGCVLCEST 1564 R W+ +W ++ PKFS WLA+ RL T D+M + ++ CVLC + Sbjct: 475 NQTRNTSTPVTWHMGIWFAHATPKFSFCAWLAVQNRLSTGDKMLQWNRRLSPTCVLCNNN 534 Query: 1565 DETHDHLFFKFEKALAVWSGICSWL---RCRNQMTTIPSVVRRFQREKAGSGIIRKAKWV 1735 ET +HLFF +W + + + +TI + V R + S + R Sbjct: 535 
IETRNHLFFSCCYTAEIWENLAKNIYKAKFSTNWSTILTSVSTTWRNRTESFLAR----Y 590 Query: 1736 ALGATVQYLWHARNLKYVEKKPFEASHVI 1822 AT+ +WH RN + ++ A+H+I Sbjct: 591 IFQATIHTIWHERNGRRHGERSNSATHLI 619 >emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1110 Score = 300 bits (769), Expect = 2e-78 Identities = 192/582 (32%), Positives = 279/582 (47%), Gaps = 45/582 (7%) Frame = +2 Query: 221 PLLALMLFIK*SQKILISRMALFLQKLISLAQSVFIKGRTIMDNFYLAQELIRTYERKSG 400 P+ + K K+L +RM + ++++ AQS FI GR I DN LA ELIR Y RK Sbjct: 516 PIACCTVIYKIISKMLTNRMKGIIGEVVNEAQSGFIPGRHIADNILLASELIRGYTRKH- 574 Query: 401 ITARCMVKIDLRKAYDCISWDFLRDVLHGLNFHPCFVYWILTYVISATFSITINGGSHGF 580 ++ RC++K+D+RKAYD + W FL +L+ F FV WI+ V + ++S+ +NG Sbjct: 575 MSPRCIMKVDIRKAYDSVEWSFLETLLYEFGFPSRFVGWIMECVSTVSYSVLVNGIPTQP 634 Query: 581 VRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLL 760 + ++GLRQGDPMSP LF CMEYLSR + F HPKC + THL FADDLL+ Sbjct: 635 FQARKGLRQGDPMSPFLFALCMEYLSRCLEELKGSPDFNFHPKCERLNITHLMFADDLLM 694 Query: 761 FGRGDPDSMRVLRDALDEFTVTSGLTINKSKSHIFLGGVRPFEKRAIMELFGFPEGTLPV 940 F R D S+ + A +F+ SGL + KS+I+ GV R + + G LP Sbjct: 695 FCRADKSSLDHMNVAFQKFSHASGLAASHEKSNIYFCGVDDETARELADYVHMQLGELPF 754 Query: 941 KYLGLPLASKSLTIPDYSPLLAQISNFIQRWSNSNLSSAGWLELIRSVLQGVGCYWLQAL 1120 +YLG+PL SK LT PL+ I+N Q W LS AG L+LI+S+L + YW Sbjct: 755 RYLGVPLTSKKLTYAQCKPLVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIF 814 Query: 1121 PLPGTVINRITKMLRKFLWC-----NSQCLVSWKTVCLPRGEGGLGLRDLAVWNNP---- 1273 PL VI + K+ RKFLW + V+W T+ P+ GG + ++ WN Sbjct: 815 PLSKKVIQAVEKVCRKFLWTGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLK 874 Query: 1274 ------------FIRRPCGTYMPKQTPFGSNGSTLSTSEDRTFGK--------------- 1372 ++R Y+ +Q N S +T R K Sbjct: 875 LLWAIEFKRDKLWVRWIHSYYIKRQDILTVNISNQTTWILRKIVKARDHLSNIGDWDEIC 934 Query: 1373 -----GTSEAYEHFRAKGEKKFWYKAVWRSYIPPKFSVTLWLALHGRLKTFDRMKYSDIA 1537 +AY+ GE+ W + + +Y PK LW+ LH RL T DR+ + Sbjct: 935 IGDKFSMKKAYKKISENGERVRWRRLICNNYATPKSKFILWMMLHERLPTVDRISRWGVQ 994 Query: 1538 
--RGCVLCESTDETHDHLFFKFEKALAVWSGICSWLRCRNQMTTIPSVVRRFQREKAGSG 1711 LC + ET HLFF + VWS IC +R N + ++ G Sbjct: 995 CDLNYRLCRNDGETIQHLFFSCSYSAGVWSKICYIMRFPNSGVSHQEII----SSVCGQA 1050 Query: 1712 IIRKAKWVALGAT--VQYLWHARNLKYVEKKPFEASHVIKEI 1831 +K K + + T V +W RN + + + + V+++I Sbjct: 1051 RKKKGKLIVMLYTEFVYAIWKQRNKRTFTGENKDENEVLRKI 1092 Score = 87.8 bits (216), Expect = 2e-14 Identities = 41/87 (47%), Positives = 59/87 (67%) Frame = +1 Query: 1 IRTALFDIGDEKAPGPDGYTSAFFKKNWDLVRDDVVASVDEFFSK*IILRKLNHTVVSLI 180 I AL IG++KAPG DG+ + FFKK+W ++ ++ A + EFF+ + R +N VV+L+ Sbjct: 443 IDEALAGIGNDKAPGLDGFNAYFFKKSWGSIKQEIYAGIQEFFNNSRMHRPINCIVVTLL 502 Query: 181 PKTTHDPGVGDFRPIACTNVVYKIITK 261 PK H V +FRPIAC V+YKII+K Sbjct: 503 PKVQHATRVKEFRPIACCTVIYKIISK 529 >ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum lycopersicum] Length = 717 Score = 280 bits (717), Expect(2) = 7e-78 Identities = 143/339 (42%), Positives = 203/339 (59%), Gaps = 5/339 (1%) Frame = +2 Query: 290 LQKLISLAQSVFIKGRTIMDNFYLAQELIRTYERKSGITARCMVKIDLRKAYDCISWDFL 469 + +IS +Q+ FI GR I DN LA EL++ Y RK+ ++ RCM+KIDL KAYD + W FL Sbjct: 347 IHTIISDSQAGFIPGRKIGDNIILAHELVKAYTRKN-VSPRCMLKIDLHKAYDSVEWPFL 405 Query: 470 RDVLHGLNFHPCFVYWILTYVISATFSITINGGSHGFVRGKRGLRQGDPMSPTLFLFCME 649 V+ GL F F W++ V + ++I +NG + +GLRQGDPMSP LF ME Sbjct: 406 EQVMEGLGFPDLFTKWVMKCVKTVNYTIVVNGQNTQRFDAAKGLRQGDPMSPFLFAIAME 465 Query: 650 YLSRLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLLFGRGDPDSMRVLRDALDEFTVTS 829 YLSRL+ D +F +HPK + D THL FADDLLLF RGD +S++ L+ EF+ S Sbjct: 466 YLSRLLKGLKEDKSFKYHPKYAKLDVTHLCFADDLLLFSRGDLNSIKALQKCFTEFSQAS 525 Query: 830 GLTINKSKSHIFLGGVRPFEKRAIMELFGFPEGTLPVKYLGLPLASKSLTIPDYSPLLAQ 1009 GL N +KS I+ GGV+ ++ I++ G+ LP KYLG+PL+SK L + PL+ + Sbjct: 526 GLQANLNKSSIYCGGVQMEVRQQIIQQLGYTIEELPFKYLGVPLSSKKLNTIQWYPLIEK 585 Query: 1010 ISNFIQRWSNSNLSSAGWLELIRSVLQGVGCYWLQALPLPGTVINRITKMLRKFLW---- 1177 + I W+ LS AG +L+++VL GV W Q +P +I I + R +LW Sbjct: 586 VMARINSWTAKKLSYAGRAQLVKTVLFGVQALWAQLFIIPAKIIKLIEGLCRSYLWSGVG 645 Query: 
1178 -CNSQCLVSWKTVCLPRGEGGLGLRDLAVWNNPFIRRPC 1291 + L++W VC P+ EGGLGL +L +WN + + C Sbjct: 646 YVTKKALIAWDKVCSPKYEGGLGLINLKIWNRSAVTKLC 684 Score = 40.4 bits (93), Expect(2) = 7e-78 Identities = 16/34 (47%), Positives = 22/34 (64%) Frame = +1 Query: 1267 QSLHSKTLWNIHAKTDSLWIKWIHAEYLRG*DIW 1368 +S +K W++ K D LWIKWIHA Y++G W Sbjct: 677 RSAVTKLCWDLANKEDKLWIKWIHAYYIKGQREW 710 >gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana] gi|20197043|gb|AAM14892.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 1412 Score = 295 bits (755), Expect = 8e-77 Identities = 188/604 (31%), Positives = 283/604 (46%), Gaps = 55/604 (9%) Frame = +2 Query: 185 RRLMTRELEISGPLLALMLFIK*SQKILISRMALFLQKLISLAQSVFIKGRTIMDNFYLA 364 +R +E++ P+ + K K+L +R+ L + I+ QS FI R +M+N LA Sbjct: 785 KRTYAKEMKDYRPISCCNVLYKAISKLLANRLKCLLPEFIAPNQSAFISDRLLMENLLLA 844 Query: 365 QELIRTYERKSGITARCMVKIDLRKAYDCISWDFLRDVLHGLNFHPCFVYWILTYVISAT 544 EL++ Y K G++ RC +KIDL KA+D + W FL + L L+ F++WI + +A+ Sbjct: 845 SELVKDYH-KDGLSPRCAMKIDLSKAFDSVQWPFLLNTLAALDIPEKFIHWINLCISTAS 903 Query: 545 FSITINGGSHGFVRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFMHHPKCSTTD 724 FS+ +NG LRQG +SP LF+ CM LS ++ + F +HP+C Sbjct: 904 FSVQVNG-----------LRQGCSLSPYLFVICMNVLSAMLDKGAVEKRFGYHPRCRNMG 952 Query: 725 TTHLAFADDLLLFGRGDPDSMRVLRDALDEFTVTSGLTINKSKSHIFLGGVRPFEKRAIM 904 THL FADD+++F G S+ + +F SGL I+ KS +F+ + +I+ Sbjct: 953 LTHLCFADDIMVFSAGSAHSLEGVLAIFKDFAAFSGLNISLEKSTLFMASISSETCASIL 1012 Query: 905 ELFGFPEGTLPVKYLGLPLASKSLTIPDYSPLLAQISNFIQRWSNSNLSSAGWLELIRSV 1084 F F G+LPV+YLGLPL +K +T+ D PLL +I + I W N LS AG L+L+ SV Sbjct: 1013 ARFPFDSGSLPVRYLGLPLMTKRMTLADCLPLLEKIRSRISSWKNRFLSYAGRLQLLNSV 1072 Query: 1085 LQGVGCYWLQALPLPGTVINRITKMLRKFLWCNS-----QCLVSWKTVCLPRGEGGLGLR 1249 + + +W+ A LP I I ++ FLW + + V+W VC P+ EGGLGLR Sbjct: 1073 ISSLTKFWISAFRLPRACIREIEQISAAFLWSGTDLNPHKAKVAWHDVCKPKSEGGLGLR 1132 Query: 1250 DLA----------VWNNPFIRRPCGTYMPKQTPFGSNGSTLS------------------ 1345 L +W + + + LS Sbjct: 1133 
SLVDANKICCFKLIWRLVSAKHSLWVNWIQNNLIRTVAEALSSHRRRSHRDDILNDIEEE 1192 Query: 1346 ----------TSEDRTFGKG----------TSEAYEHFRAKGEKKFWYKAVWRSYIPPKF 1465 T +DR+ + + E + R +G K W+KA+W S PKF Sbjct: 1193 LEKLLCRGICTEQDRSLCRSIGGQFKAKFFSPEIWHQIREQGLVKQWHKAIWFSGATPKF 1252 Query: 1466 SVTLWLALHGRLKTFDRMK--YSDIARGCVLCESTDETHDHLFFKFEKALAVWSGICSWL 1639 + WLA H RL T D+M I+ CVLC + E+ DHLFF + +W + L Sbjct: 1253 TFISWLAAHDRLTTGDKMASWNRGISSVCVLCNISAESRDHLFFSCNFSSHIWDRLTRRL 1312 Query: 1640 RCRNQMTTIPSVVRRFQREKAGSGIIRKAKWVALGATVQYLWHARNLKYVEKKPFEASHV 1819 T P+++ + SG R AT+ LW RN + P + H+ Sbjct: 1313 LLCRYTTNFPALLLLLSGQDF-SGTKRFLLRYVFQATIHTLWRERNKRRHGDLPIPSDHI 1371 Query: 1820 IKEI 1831 IK I Sbjct: 1372 IKFI 1375 Score = 81.3 bits (199), Expect = 2e-12 Identities = 35/87 (40%), Positives = 56/87 (64%) Frame = +1 Query: 1 IRTALFDIGDEKAPGPDGYTSAFFKKNWDLVRDDVVASVDEFFSK*IILRKLNHTVVSLI 180 + F I K+PGPDGYT FF++ W ++ +V ++ FF+ + + LN T+++LI Sbjct: 724 VMKVFFSIPLNKSPGPDGYTVEFFRETWSVIGQEVTMAIKSFFTYGFLPKGLNSTILALI 783 Query: 181 PKTTHDPGVGDFRPIACTNVVYKIITK 261 PK T+ + D+RPI+C NV+YK I+K Sbjct: 784 PKRTYAKEMKDYRPISCCNVLYKAISK 810 >gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] Length = 872 Score = 286 bits (731), Expect(2) = 2e-76 Identities = 155/392 (39%), Positives = 221/392 (56%), Gaps = 5/392 (1%) Frame = +2 Query: 110 LPWMNSFQNESSSGNLTTPSFRSSQRRLMTRELEISGPLLALMLFIK*SQKILISRMALF 289 LP + FQ + + ++L +E+ P+ + K KI+ +R+ L Sbjct: 141 LPVQSFFQKGFLPKGINSIILALIPKKLAAKEMRDYRPISCCNVLYKVISKIIANRLKLL 200 Query: 290 LQKLISLAQSVFIKGRTIMDNFYLAQELIRTYERKSGITARCMVKIDLRKAYDCISWDFL 469 L + I+ QS F+K R +++N LA EL++ Y + S I+ARC +KID+ KA+D + W FL Sbjct: 201 LPRFIAENQSAFVKDRLLIENLLLATELVKDYHKDS-ISARCAIKIDISKAFDSVQWSFL 259 Query: 470 RDVLHGLNFHPCFVYWILTYVISATFSITINGGSHGFVRGKRGLRQGDPMSPTLFLFCME 649 + L +NF P F++WI + +A+FS+ +NG G+ + KRGLRQG +SP LF+ CM+ Sbjct: 260 TNTLVAMNFSPTFIHWINLCITTASFSVQVNGDLVGYFQSKRGLRQGCSLSPYLFVICMD 319 Query: 650 YLSRLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLLFGRGDPDSMRVLRDALDEFTVTS 829 LS+++ F HPKC 
THL+FADDL++ G S+ + + DEF S Sbjct: 320 VLSKMLDKAAGVRKFGFHPKCQRLGLTHLSFADDLMVLSDGKTRSIEGILEVFDEFCKRS 379 Query: 830 GLTINKSKSHIFLGGVRPFEKRAIMELFGFPEGTLPVKYLGLPLASKSLTIPDYSPLLAQ 1009 GL I+ KS +++ GV P K+ I F F G LPV+YLGLPL +K LT DYSPLL Q Sbjct: 380 GLRISLEKSTLYMAGVSPIIKQEIAAKFLFDVGQLPVRYLGLPLVTKRLTSADYSPLLEQ 439 Query: 1010 ISNFIQRWSNSNLSSAGWLELIRSVLQGVGCYWLQALPLPGTVINRITKMLRKFLWCNSQ 1189 I I W+ S AG LI+SVL + +WL A LP I I K+ FLW S+ Sbjct: 440 IKKRIATWTFRFFSFAGRFNLIKSVLWSICNFWLAAFRLPRQCIREIDKLCSSFLWSGSE 499 Query: 1190 -----CLVSWKTVCLPRGEGGLGLRDLAVWNN 1270 +SW VC P+ EGGLGLR+L N+ Sbjct: 500 MSSHKAKISWDIVCKPKAEGGLGLRNLKEAND 531 Score = 30.4 bits (67), Expect(2) = 2e-76 Identities = 11/29 (37%), Positives = 17/29 (58%) Frame = +1 Query: 1282 KTLWNIHAKTDSLWIKWIHAEYLRG*DIW 1368 K +W I + ++SLW KW+ +R IW Sbjct: 536 KLVWRIISNSNSLWTKWVAEYLIRKKSIW 564 Score = 86.7 bits (213), Expect = 5e-14 Identities = 37/87 (42%), Positives = 57/87 (65%) Frame = +1 Query: 1 IRTALFDIGDEKAPGPDGYTSAFFKKNWDLVRDDVVASVDEFFSK*IILRKLNHTVVSLI 180 I+T LF + +K+PGPDGYTS F+K WD++ + V FF K + + +N +++LI Sbjct: 105 IKTVLFSMPKDKSPGPDGYTSEFYKATWDIIGQEFTLPVQSFFQKGFLPKGINSIILALI 164 Query: 181 PKTTHDPGVGDFRPIACTNVVYKIITK 261 PK + D+RPI+C NV+YK+I+K Sbjct: 165 PKKLAAKEMRDYRPISCCNVLYKVISK 191 Score = 69.3 bits (168), Expect = 9e-09 Identities = 45/177 (25%), Positives = 81/177 (45%), Gaps = 17/177 (9%) Frame = +2 Query: 1343 STSEDRTFGKGTSEAYE-HF---------RAKGEKKFWYKAVWRSYIPPKFSVTLWLALH 1492 S +ED +G ++ ++ HF +A W+K VW + PK+++ WLA+H Sbjct: 666 SDAEDTVLWRGKNDVFKPHFSTRDTWHLIKATSSTVSWHKGVWFRHATPKYALCTWLAIH 725 Query: 1493 GRLKTFDRM----KYSDIARGCVLCESTDETHDHLFFKFEKALAVWSGICSWL---RCRN 1651 RL T DRM ++ CVLC + +T +HLFF A VW+ + + R Sbjct: 726 NRLPTGDRMLKWNSSGSVSGNCVLCTNNSKTLEHLFFSCSYASTVWAALAKGIWKTRYST 785 Query: 1652 QMTTIPSVVRRFQREKAGSGIIRKAKWVALGATVQYLWHARNLKYVEKKPFEASHVI 1822 + + + + + +++ + R AT+ ++W RN + + P + VI Sbjct: 786 RWSHLLTHISTHFQDRVEGFLTR----YIFQATIYHVWRERNGRRHDAAPNTPATVI 838 >ref|XP_006584238.1| PREDICTED: 
uncharacterized protein LOC102664824 [Glycine max] Length = 939 Score = 289 bits (739), Expect = 5e-75 Identities = 179/527 (33%), Positives = 274/527 (51%), Gaps = 39/527 (7%) Frame = +2 Query: 314 QSVFIKGRTIMDNFYLAQELIRTYERKSGITARCMVKIDLRKAYDCISWDFLRDVLHGLN 493 Q+ F+ G+ + D+ LA EL+R YERK G T +CM++ID++KAYD + WD L +L L Sbjct: 375 QAAFVPGQQLHDHVMLAFELLRGYERKHG-TPKCMLQIDIQKAYDTVHWDALEHILRELG 433 Query: 494 FHPCFVYWILTYVISATFSITINGGSHGFVRGKRGLRQGDPMSPTLFLFCMEYLSRLIHA 673 F F+ WI+ V S T+ ING + +RG+RQGDP+SP LF+ MEYL+R++ Sbjct: 434 FPDQFIKWIMIAVRSVTYVFNINGRFTRRLEARRGIRQGDPISPLLFILVMEYLNRILSQ 493 Query: 674 RTHDSTFMHHPKCSTTDTTHLAFADDLLLFGRGDPDSMRVLRDALDEFTVTSGLTINKSK 853 F +H KC T+L FADDLLLF RGD S++++ D + F + GL +N SK Sbjct: 494 LDKIPNFNYHSKCEKMKITNLCFADDLLLFSRGDIGSVQIMLDKFNTFLRSMGLHVNPSK 553 Query: 854 SHIFLGGVRPFEKRAIMELFGFPEGTLPVKYLGLPLASKSLTIPDYSPLLAQISNFIQRW 1033 +I+ G V K ++ + GF EG +P +YLG+PL+SK L I Y L+ +I I W Sbjct: 554 CNIYCGSVDINVKEQLLLISGFKEGKMPFRYLGIPLSSKKLNIKHYQVLIDKIVGRITHW 613 Query: 1034 SNSNLSSAGWLELIRSVLQGVGCYWLQALPLPGTVINRITKMLRKFLWCNSQCL-----V 1198 S LS AG ++LI+SV+ +W+Q LPLP VI RI + R FLW + + + Sbjct: 614 SAGLLSYAGRVQLIQSVIFATINFWMQCLPLPKFVIMRINAICRSFLWIGNSNISRKSPI 673 Query: 1199 SWKTVCLPRGEGGLGLRDLAVWN----------------NPFIRRPCGTYMPKQTPFG-- 1324 +W+ VC P+ GGL + +LA+WN N +I+ Y+ Q+ + Sbjct: 674 AWEKVCSPKINGGLNIINLAIWNKISILKLLWNVCNKSDNLWIKWLHTYYIRGQSIWSMV 733 Query: 1325 --SNGSTLSTSEDR---TFGKGTSEAYEHFRAK---------GEKKFWYKAVWRSYIPPK 1462 + S + +S + + S + F+ K EK W + + P+ Sbjct: 734 LKKSHSWIMSSMMKLRPLLLQYQSRMQDVFKMKKIYLALFEESEKMSWRTLMCNNLARPR 793 Query: 1463 FSVTLWLALHGRLKTFDRM-KYS-DIARGCVLCESTDETHDHLFFKFEKALAVWSGICSW 1636 LW A H RL + DR+ K+ ++ C C S E+H+HLFF + +W+ + +W Sbjct: 794 ALFCLWQACHFRLASKDRLIKFGLNVDANCAFCSSM-ESHEHLFFGCIELKTIWTAVLNW 852 Query: 1637 LRCRNQMTTIPSVVRRFQREKAGSGIIRKAKWVALGATVQYLWHARN 1777 L+ + +T + R+ G G A T+ ++W RN Sbjct: 853 LQIIHMPSTWSEELNWITRKCKGKGWRAMLLKCAFTETIYHIWAYRN 899 >gb|AAD21699.1| Contains reverse transcriptase domain (rvt) 
PF|00078 [Arabidopsis thaliana] Length = 1253 Score = 234 bits (597), Expect(2) = 3e-74 Identities = 125/341 (36%), Positives = 191/341 (56%), Gaps = 5/341 (1%) Frame = +2 Query: 260 KILISRMALFLQKLISLAQSVFIKGRTIMDNFYLAQELIRTYERKSGITARCMVKIDLRK 439 ++L +R+ L ++IS QS F+ GR + +N LA EL++ Y R++ I R M+K+DLRK Sbjct: 491 RLLTNRLQCLLSQVISPFQSAFLPGRFLAENVLLATELVQGYNRQN-IDPRGMLKVDLRK 549 Query: 440 AYDCISWDFLRDVLHGLNFHPCFVYWILTYVISATFSITINGGSHGFVRGKRGLRQGDPM 619 A+D I WDF+ L + FVYWI + + TFS+ +NG + GF + RGLRQG+P+ Sbjct: 550 AFDSIRWDFIISALKAIGIPDRFVYWITQCISTPTFSVCVNGNTGGFFKSTRGLRQGNPL 609 Query: 620 SPTLFLFCMEYLSRLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLLFGRGDPDSMRVLR 799 SP LF+ ME S L+++R +HPK S +HL FADD+++F G S+ + Sbjct: 610 SPFLFVLAMEVFSSLLNSRFQAGYIHYHPKTSPLSISHLMFADDIMVFFDGGSSSLHGIS 669 Query: 800 DALDEFTVTSGLTINKSKSHIFLGGVRPFEKRAIMELFGFPEGTLPVKYLGLPLASKSLT 979 +AL++F SGL +N+ K+H++L G+ E I ++ L Sbjct: 670 EALEDFAFWSGLVLNREKTHLYLAGLDRIEASTI---------------------ARKLR 708 Query: 980 IPDYSPLLAQISNFIQRWSNSNLSSAGWLELIRSVLQGVGCYWLQALPLPGTVINRITKM 1159 I +Y PLL +++ + WS LS AG ++LI SV+ G+ +W+ LP + RI + Sbjct: 709 IAEYGPLLEKLAKRFRSWSVKCLSFAGRVQLIASVISGIINFWISTFILPKGCVKRIEAL 768 Query: 1160 LRKFLWCNS-----QCLVSWKTVCLPRGEGGLGLRDLAVWN 1267 +FLW + V+W VCLP+ EGG+GLR V N Sbjct: 769 CARFLWSGNIDVKKGAKVAWSEVCLPKEEGGVGLRRFTVLN 809 Score = 74.3 bits (181), Expect(2) = 3e-74 Identities = 34/91 (37%), Positives = 55/91 (60%), Gaps = 4/91 (4%) Frame = +1 Query: 1 IRTALFDIGDEKAPGPDGYTSAFFKKNWDLVRDDVVASVDEFFSK*IILRKLNHTVVSLI 180 I++A F K GPDG+ FFK+ W ++ +V +V EFF+ ++L++ N T + LI Sbjct: 401 IKSAFFSFPSNKTSGPDGFPVEFFKETWSVIGTEVTDAVSEFFTSSVLLKQWNATTLVLI 460 Query: 181 PKTTHDPGVGDFRPIACTN----VVYKIITK 261 PK T+ + DFRPI+C + +YK+I + Sbjct: 461 PKITNASKMNDFRPISCNDFGPITLYKVIAR 491 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 277 bits (709), Expect(2) = 9e-74 Identities = 146/360 (40%), Positives = 210/360 (58%), Gaps = 5/360 (1%) Frame = +2 Query: 221 
PLLALMLFIK*SQKILISRMALFLQKLISLAQSVFIKGRTIMDNFYLAQELIRTYERKSG 400 P+ L K ++L R+ L +IS AQS F+ GR++ +N LA +L+ Y S Sbjct: 525 PISCLNTLYKVIARLLTDRLQRLLSGVISSAQSAFLPGRSLAENVLLATDLVHGYNW-SN 583 Query: 401 ITARCMVKIDLRKAYDCISWDFLRDVLHGLNFHPCFVYWILTYVISATFSITINGGSHGF 580 I+ R M+K+DL+KA+D + W+F+ L L F+ WI + + TF+++INGG+ GF Sbjct: 584 ISPRGMLKVDLKKAFDSVRWEFVIAALRALAIPEKFINWISQCISTPTFTVSINGGNGGF 643 Query: 581 VRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLL 760 + +GLRQGDP+SP LF+ ME S L+H+R +HPK S +HL FADD+++ Sbjct: 644 FKSTKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPKASNLSISHLMFADDVMI 703 Query: 761 FGRGDPDSMRVLRDALDEFTVTSGLTINKSKSHIFLGGVRPFEKRAIMELFGFPEGTLPV 940 F G S+ + + LD+F SGL +NK KSH++L G+ E A +GFP GTLP+ Sbjct: 704 FFDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGLNQLESNA-NAAYGFPIGTLPI 762 Query: 941 KYLGLPLASKSLTIPDYSPLLAQISNFIQRWSNSNLSSAGWLELIRSVLQGVGCYWLQAL 1120 +YLGLPL ++ L I +Y PLL +I+ + W N LS AG ++LI SV+ G +W+ Sbjct: 763 RYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTF 822 Query: 1121 PLPGTVINRITKMLRKFLWCNS-----QCLVSWKTVCLPRGEGGLGLRDLAVWNNPFIRR 1285 LP I RI + +FLW + VSW +CLP+ EGGLGLR L WN R Sbjct: 823 LLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNKTLSMR 882 Score = 29.6 bits (65), Expect(2) = 9e-74 Identities = 10/34 (29%), Positives = 16/34 (47%) Frame = +1 Query: 1267 QSLHSKTLWNIHAKTDSLWIKWIHAEYLRG*DIW 1368 ++L + +W + DSLW W H +L W Sbjct: 877 KTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFW 910 Score = 81.3 bits (199), Expect = 2e-12 Identities = 37/87 (42%), Positives = 55/87 (63%) Frame = +1 Query: 1 IRTALFDIGDEKAPGPDGYTSAFFKKNWDLVRDDVVASVDEFFSK*IILRKLNHTVVSLI 180 IR ALF + K+ GPDG+T+ FF +W +V +V ++ EFFS +L++ N T + LI Sbjct: 452 IRAALFSLPRNKSCGPDGFTAEFFIDSWSIVGAEVTDAIKEFFSSGCLLKQWNATTIVLI 511 Query: 181 PKTTHDPGVGDFRPIACTNVVYKIITK 261 PK + DFRPI+C N +YK+I + Sbjct: 512 PKIVNPTCTSDFRPISCLNTLYKVIAR 538 >ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca subsp. 
vesca] Length = 958 Score = 271 bits (693), Expect(2) = 4e-72 Identities = 151/355 (42%), Positives = 203/355 (57%), Gaps = 6/355 (1%) Frame = +2 Query: 221 PLLALMLFIK*SQKILISRMALFLQKLISLAQSVFIKGRTIMDNFYLAQELIRTYERKSG 400 P+ F K K+L +R+ L ++ +QS FI GR I DN LAQE+I Y + G Sbjct: 352 PISCCNTFYKIIAKLLANRLKGTLHLIVGPSQSTFIPGRRIGDNILLAQEIICDYHKADG 411 Query: 401 ITARCMVKIDLRKAYDCISWDFLRDVLHGLNFHPCFVYWILTYVISATFSITINGGSHGF 580 RC +D+ KA D + WDF+ L N + WI + + SA FS+ +NG GF Sbjct: 412 -QPRCTFMVDMMKANDTVEWDFIIATLQAFNIPSTLIGWIKSCISSAKFSVCVNGELAGF 470 Query: 581 VRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDST-FMHHPKCSTTDTTHLAFADDLL 757 +RGLRQGDP+SP LF+ ME LS I R + S F +H +C + +HL FADDLL Sbjct: 471 FARRRGLRQGDPLSPYLFVIAMEVLSLCIQRRINCSPCFRYHWRCDQLNLSHLCFADDLL 530 Query: 758 LFGRGDPDSMRVLRDALDEFTVTSGLTINKSKSHIFLGGVRPFEKRAIMELFGFPEGTLP 937 +F GD +S+R L DA F S L N S+S IFL GV +++++ F GT P Sbjct: 531 MFCNGDENSVRTLHDAFSNFESLSSLKANVSESKIFLAGVDGNSSDSVLQVTNFSLGTCP 590 Query: 938 VKYLGLPLASKSLTIPDYSPLLAQISNFIQRWSNSNLSSAGWLELIRSVLQGVGCYWLQA 1117 V+YLG+PL + L + D SPLL +I I+ W N LS AG L+LI+SVL + YW Sbjct: 591 VRYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVYWASH 650 Query: 1118 LPLPGTVINRITKMLRKFLW---CNSQCL--VSWKTVCLPRGEGGLGLRDLAVWN 1267 L LP V+ I K LR FLW C+ + V+W +CLP+ EGGLG++DL WN Sbjct: 651 LILPKKVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHCWN 705 Score = 30.4 bits (67), Expect(2) = 4e-72 Identities = 10/39 (25%), Positives = 19/39 (48%) Frame = +1 Query: 1252 LGCVEQSLHSKTLWNIHAKTDSLWIKWIHAEYLRG*DIW 1368 L C ++L +WN+ + + + W W+ L+G W Sbjct: 701 LHCWNKALMISHIWNLVSSSSNFWTDWVKVYLLKGNSFW 739 Score = 89.4 bits (220), Expect = 8e-15 Identities = 51/140 (36%), Positives = 74/140 (52%), Gaps = 1/140 (0%) Frame = +1 Query: 1 IRTALFDIGDEKAPGPDGYTSAFFKKNWDLVRDDVVAS-VDEFFSK*IILRKLNHTVVSL 177 IR F + K+PGPDG+ FF+K W ++ D+VVA+ V EFFS +L +LN T+++L Sbjct: 278 IRAVFFSMNPNKSPGPDGFNGCFFQKAWLVIGDNVVAAAVKEFFSYGSLLMELNSTIITL 337 Query: 178 IPKTTHDPGVGDFRPIACTNVVYKIITKNSNF*NGTFFAETHLSGSICLYQGPNYYG*FL 357 +PK + + 
DFRPI+C N YKII K L G++ L GP+ F+ Sbjct: 338 VPKVANPTTMSDFRPISCCNTFYKIIAK---------LLANRLKGTLHLIVGPS-QSTFI 387 Query: 358 PCPGAHQNVREEEWYHCTLH 417 P N+ + C H Sbjct: 388 PGRRIGDNILLAQEIICDYH 407 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 262 bits (670), Expect(2) = 2e-71 Identities = 140/354 (39%), Positives = 204/354 (57%), Gaps = 5/354 (1%) Frame = +2 Query: 221 PLLALMLFIK*SQKILISRMALFLQKLISLAQSVFIKGRTIMDNFYLAQELIRTYERKSG 400 P+ K K+L R+ L IS +QS F+KGR + +N LA EL++ + + + Sbjct: 524 PISCCNAIYKVISKLLARRLENILPLWISPSQSAFVKGRLLTENVLLATELVQGFGQ-AN 582 Query: 401 ITARCMVKIDLRKAYDCISWDFLRDVLHGLNFHPCFVYWILTYVISATFSITINGGSHGF 580 I++R ++K+DLRKA+D + W F+ + L N P FV WI + S +FSI ++G G+ Sbjct: 583 ISSRGVLKVDLRKAFDSVGWGFIIETLKAANAPPRFVNWIKQCITSTSFSINVSGSLCGY 642 Query: 581 VRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLL 760 +G +GLRQGDP+SP+LF+ ME LSRL+ + D + +HPK S + LAFADDL++ Sbjct: 643 FKGSKGLRQGDPLSPSLFVIAMEILSRLLENKFSDGSIGYHPKASEVRISSLAFADDLMI 702 Query: 761 FGRGDPDSMRVLRDALDEFTVTSGLTINKSKSHIFLGGVRPFEKRAIMELFGFPEGTLPV 940 F G S+R ++ L+ F SGL +N KS ++ G+ +K + FGF GT P Sbjct: 703 FYDGKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTL-AFGFVNGTFPF 761 Query: 941 KYLGLPLASKSLTIPDYSPLLAQISNFIQRWSNSNLSSAGWLELIRSVLQGVGCYWLQAL 1120 +YLGLPL + L DYS L+ +I+ W+ LS AG L+LI SV+ +WL + Sbjct: 762 RYLGLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTVNFWLSSF 821 Query: 1121 PLPGTVINRITKMLRKFLWCN-----SQCLVSWKTVCLPRGEGGLGLRDLAVWN 1267 LP + I +M +FLW N VSW+ CLP+ EGGLGLR+ WN Sbjct: 822 ILPKCCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTWN 875 Score = 36.6 bits (83), Expect(2) = 2e-71 Identities = 13/34 (38%), Positives = 22/34 (64%) Frame = +1 Query: 1267 QSLHSKTLWNIHAKTDSLWIKWIHAEYLRG*DIW 1368 ++L+ + +W + A+ DSLW+ W HA LR + W Sbjct: 876 KTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFW 909 Score = 88.6 bits (218), Expect = 1e-14 Identities = 38/87 (43%), Positives = 58/87 (66%) Frame = +1 Query: 1 
IRTALFDIGDEKAPGPDGYTSAFFKKNWDLVRDDVVASVDEFFSK*IILRKLNHTVVSLI 180 I++ F + K+PGPDGYTS FFKK W +V ++A+V EFF +L + N T V+++ Sbjct: 451 IKSEFFALPSNKSPGPDGYTSEFFKKTWSIVGPSLIAAVQEFFRSGRLLGQWNSTAVTMV 510 Query: 181 PKTTHDPGVGDFRPIACTNVVYKIITK 261 PK + + +FRPI+C N +YK+I+K Sbjct: 511 PKKPNADRITEFRPISCCNAIYKVISK 537 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1114 Score = 276 bits (705), Expect = 5e-71 Identities = 185/581 (31%), Positives = 274/581 (47%), Gaps = 44/581 (7%) Frame = +2 Query: 221 PLLALMLFIK*SQKILISRMALFLQKLISLAQSVFIKGRTIMDNFYLAQELIRTYERKSG 400 P+ K KIL R+ + +++ AQ+ FI R I DN LA ELIR Y R+ Sbjct: 519 PIACCSTLYKIISKILTKRLQAVITEVVDCAQTGFIPERHIGDNILLATELIRGYNRRH- 577 Query: 401 ITARCMVKIDLRKAYDCISWDFLRDVLHGLNFHPCFVYWILTYVISATFSITINGGSHGF 580 ++ RC++K+D+RKAYD + W FL +L L F F+ WI+ V + ++SI +NG Sbjct: 578 VSPRCVIKVDIRKAYDSVEWVFLESMLKELGFPSMFIRWIMACVKTVSYSILLNGIPSIP 637 Query: 581 VRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLL 760 ++GLRQGDP+SP LF MEYLSR + D F HPKC THL FADDLL+ Sbjct: 638 FDAQKGLRQGDPLSPFLFALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLM 697 Query: 761 FGRGDPDSMRVLRDALDEFTVTSGLTINKSKSHIFLGGVRPFEKRAIMELFGFPEGTLPV 940 F R D S+ + A + F+ SGL + KS I+ GGV E + + P G+LP Sbjct: 698 FARADASSISKIMAAFNSFSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPF 757 Query: 941 KYLGLPLASKSLTIPDYSPLLAQISNFIQRWSNSNLSSAGWLELIRSVLQGVGCYWLQAL 1120 +YLG+PLASK L PL+ +I+ Q W LS AG L+L++++L + YW Q Sbjct: 758 RYLGVPLASKKLNFSQCKPLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIF 817 Query: 1121 PLPGTVINRITKMLRKFLWCNS-----QCLVSWKTVCLPRGEGGLGLRDLAVWNNP---- 1273 PLP +I + RKFLW + + V+W + P+ GGL + ++ +WN Sbjct: 818 PLPKKLIKAVETTCRKFLWTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILK 877 Query: 1274 ------------FIRRPCGTYMPKQ----TPFGSNGS----TLSTSEDRTFGKGTSEA-- 1387 ++R Y+ +Q SN S + S + G EA Sbjct: 878 LLWAITFKQDKLWVRWVNAYYIKRQNIENVTVSSNTSWILRKIFESRELLTRTGGWEAVS 937 Query: 1388 ----------YEHFRAKGEKKFWYKAVWRSYIPPKFSVTLWLALHGRLKTFDRMK--YSD 1531 Y+ + E W + + + PK LWLA+ RL T +R+ D 
Sbjct: 938 NHMNFSIKKTYKLLQEDYENVVWKRLICNNKATPKSQFILWLAMLNRLATAERVSRWNRD 997 Query: 1532 IARGCVLCESTDETHDHLFFKFEKALAVWSGICSWLRCRNQMTTIPSVVRRFQREKAGSG 1711 ++ C +C + ET HLFF + +W + +L + Q + +KA S Sbjct: 998 VSPLCKMCGNEIETIQHLFFNCIYSKEIWGKVLLYLNLQPQADA--QAKKELAIKKARST 1055 Query: 1712 IIRKAKWVAL-GATVQYLWHARNLKYVEKKPFEASHVIKEI 1831 R +V + +V +W RN K + +K I Sbjct: 1056 KDRNKLYVMMFTESVYAIWLLRNAKVFRGIEINQNQAVKSI 1096 Score = 80.5 bits (197), Expect = 4e-12 Identities = 38/87 (43%), Positives = 55/87 (63%) Frame = +1 Query: 1 IRTALFDIGDEKAPGPDGYTSAFFKKNWDLVRDDVVASVDEFFSK*IILRKLNHTVVSLI 180 I AL DI D KAPG DG+ S FFKK+W +++ ++ + +FF + + +N T V+LI Sbjct: 446 IDQALADIDDTKAPGLDGFNSVFFKKSWLVIKQEIYEGILDFFENGFMHKPINCTAVTLI 505 Query: 181 PKTTHDPGVGDFRPIACTNVVYKIITK 261 PK D+RPIAC + +YKII+K Sbjct: 506 PKIDEAKHAKDYRPIACCSTLYKIISK 532 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 264 bits (674), Expect(2) = 1e-70 Identities = 144/367 (39%), Positives = 209/367 (56%), Gaps = 5/367 (1%) Frame = +2 Query: 185 RRLMTRELEISGPLLALMLFIK*SQKILISRMALFLQKLISLAQSVFIKGRTIMDNFYLA 364 ++ RE++ P+ + K KI+ +R+ L L K I+ QS F+K R +++N LA Sbjct: 519 KKTEAREMKDYRPISCCNVLYKVISKIIANRLKLVLPKFIAGNQSAFVKDRLLIENLLLA 578 Query: 365 QELIRTYERKSGITARCMVKIDLRKAYDCISWDFLRDVLHGLNFHPCFVYWILTYVISAT 544 EL++ Y K I+ RC +KID+ KA+D + W FL +V L F F++WI + +A+ Sbjct: 579 TELVKDYH-KDTISTRCAIKIDISKAFDSVQWPFLINVFTILGFPREFIHWINICITTAS 637 Query: 545 FSITINGGSHGFVRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFMHHPKCSTTD 724 FS+ +NG G+ + RGLRQG +SP LF+ CM+ LS+++ F +HPKC T Sbjct: 638 FSVQVNGELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMG 697 Query: 725 TTHLAFADDLLLFGRGDPDSMRVLRDALDEFTVTSGLTINKSKSHIFLGGVRPFEKRAIM 904 THL+FADDL++ G S+ + DEF SGL I+ KS ++L G+ + + Sbjct: 698 LTHLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVA 757 Query: 905 ELFGFPEGTLPVKYLGLPLASKSLTIPDYSPLLAQISNFIQRWSNSNLSSAGWLELIRSV 1084 + F F G LPV+YLGLPL +K L+ D PLL Q+ I W++ LS AG L LI SV 
Sbjct: 758 DRFPFSSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSV 817 Query: 1085 LQGVGCYWLQALPLPGTVINRITKMLRKFLWC-----NSQCLVSWKTVCLPRGEGGLGLR 1249 L + +WL A LP I + KM FLW +++ +SW VC P+ EGGLGLR Sbjct: 818 LWSICNFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLR 877 Query: 1250 DLAVWNN 1270 L N+ Sbjct: 878 SLKEAND 884 Score = 32.7 bits (73), Expect(2) = 1e-70 Identities = 11/29 (37%), Positives = 17/29 (58%) Frame = +1 Query: 1282 KTLWNIHAKTDSLWIKWIHAEYLRG*DIW 1368 K +W I + ++SLW+KW+ LR W Sbjct: 889 KLVWKIVSHSNSLWVKWVDQHLLRNASFW 917 Score = 93.2 bits (230), Expect = 6e-16 Identities = 40/87 (45%), Positives = 61/87 (70%) Frame = +1 Query: 1 IRTALFDIGDEKAPGPDGYTSAFFKKNWDLVRDDVVASVDEFFSK*IILRKLNHTVVSLI 180 IR LF + +K+PGPDGYTS FFK W+++ D+ +V FF+K + + +N T+++LI Sbjct: 458 IRKVLFRMPSDKSPGPDGYTSEFFKATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALI 517 Query: 181 PKTTHDPGVGDFRPIACTNVVYKIITK 261 PK T + D+RPI+C NV+YK+I+K Sbjct: 518 PKKTEAREMKDYRPISCCNVLYKVISK 544 Score = 79.3 bits (194), Expect = 9e-12 Identities = 61/237 (25%), Positives = 96/237 (40%), Gaps = 26/237 (10%) Frame = +2 Query: 1232 GGLGLRDLAV---------WNNPFIRRPCG-TYMPKQTPFGSNGSTLSTSEDRTFGKGTS 1381 G GL DL + W N RR Y + + T + +ED+ +G S Sbjct: 972 GDRGLIDLGISRRMTVEEAWTNRRQRRHRNDVYNVIEDALKKSWDTRTETEDKVLWRGKS 1031 Query: 1382 EAYE----------HFRAKGEKKFWYKAVWRSYIPPKFSVTLWLALHGRLKTFDRM--KY 1525 + + H R+ + W+K +W S+ PK+S WLA HGRL T DRM Sbjct: 1032 DVFRTTFSTRDTWHHTRSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWA 1091 Query: 1526 SDIARGCVLCESTDETHDHLFFKFEKALAVWSGICSWLRCRNQMTTIPSVVRRFQREKAG 1705 + IA C+ C+ T ET DHLFF +W + + + S++ + Sbjct: 1092 NGIATDCIFCQGTLETRDHLFFTCSFTSVIWVDLARGIFKTQYTSHWQSIIEAITNSQH- 1150 Query: 1706 SGIIRKAKW----VALGATVQYLWHARNLKYVEKKPFEASHVIKEIKLDVYRVLYSL 1864 + +W AT+ +W RN + + P AS ++ I + L S+ Sbjct: 1151 ----HRVEWFLRRYVFQATIYIVWRERNGRRHGEPPNTASQLVGWIDKQIRNQLSSI 1203 >gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score: 72.31) [Arabidopsis thaliana] Length = 928 Score = 271 bits 
(693), Expect = 1e-69 Identities = 178/562 (31%), Positives = 271/562 (48%), Gaps = 10/562 (1%) Frame = +2 Query: 200 RELEISGPLLALMLFIK*SQKILISRMALFLQKLISLAQSVFIKGRTIMDNFYLAQELIR 379 +E++ P+ + K KI+ +R+ L L K I QS F+K R +++N LA E+++ Sbjct: 412 KEMKDYRPISCCNVLYKVISKIIANRLKLVLPKFIVGNQSAFVKDRLLIENVLLATEIVK 471 Query: 380 TYERKSGITARCMVKIDLRKAYDCISWDFLRDVLHGLNFHPCFVYWILTYVISATFSITI 559 Y + S +++RC +KID+ KA+D + W FL +VL +NF P F +WI + +A+FS+ + Sbjct: 472 DYHKDS-VSSRCALKIDISKAFDSVQWKFLINVLEAMNFPPEFTHWITLCITTASFSVQV 530 Query: 560 NGGSHGFVRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFMHHPKCSTTDTTHLA 739 NG G R LRQG +SP LF+ M+ LS+++ F +HPKC THL+ Sbjct: 531 NGELAGVFSSARELRQGCSLSPYLFVISMDVLSKMLDKAVGARQFGYHPKCRAIGLTHLS 590 Query: 740 FADDLLLFGRGDPDSMRVLRDALDEFTVTSGLTINKSKSHIFLGGVRPFEKRAIMELFGF 919 FADDL++ G S+ + L EF SGL I+ KS ++L GV+ + I++ F F Sbjct: 591 FADDLMILSDGKVRSIDGIVKVLYEFAKWSGLKISMEKSTMYLAGVQASVYQEIVQKFSF 650 Query: 920 PEGTLPVKYLGLPLASKSLTIPDYSPLLAQISNFIQRWSNSNLSSAGWLELIRSVLQGVG 1099 G LPV+YLGLPL SK LT D PL+ Q+ I+ W++ LS AG L LI S L + Sbjct: 651 DVGKLPVRYLGLPLVSKRLTASDCLPLIEQLRKKIEAWTSRFLSFAGRLNLISSTLWSIC 710 Query: 1100 CYWLQALPLPGTVINRITKMLRKFLW-----CNSQCLVSWKTVCLPRGEGGLGLRDLAVW 1264 +W+ A LP I I K+ FLW +++ VSW+ +C P+ E W Sbjct: 711 NFWMAAFRLPRACIREIDKLCSAFLWSGTELSSNKAKVSWEAICKPKKE---------AW 761 Query: 1265 NNPFIRRPCGTYMPKQTPFGSNGSTLSTSEDRTFGKGTSEAYEHFRAKGEKKFWYKAVWR 1444 + G + +TP Sbjct: 762 HK-------GVWFAHETP------------------------------------------ 772 Query: 1445 SYIPPKFSVTLWLALHGRLKTFDRMKYSDI--ARGCVLCESTDETHDHLFFKFEKALAVW 1618 K S +WLA+ +L T RM++ ++ + GCVLC + ET DHLFF +W Sbjct: 773 -----KHSFCVWLAIWNKLSTGQRMQHWNLQSSVGCVLCNNNLETRDHLFFSCAYTSGIW 827 Query: 1619 SGICSWLRCRNQMT---TIPSVVRRFQREKAGSGIIRKAKWVALGATVQYLWHARNLKYV 1789 + L R+ T TI S V ++ + R L A+V +W RN + Sbjct: 828 EALAKNLLQRSYTTDWQTIISYVSGQCHDRVSCFLARS----VLQASVYTIWRERNGRRH 883 Query: 1790 EKKPFEASHVIKEIKLDVYRVL 1855 + P A+ +I+ I + +L Sbjct: 884 GETPNPAARLIQWIDKHIRNML 905 Score = 83.2 bits (204), Expect = 6e-13 
Identities = 33/87 (37%), Positives = 60/87 (68%) Frame = +1 Query: 1 IRTALFDIGDEKAPGPDGYTSAFFKKNWDLVRDDVVASVDEFFSK*IILRKLNHTVVSLI 180 I +F + ++K+PGPDGYT+ F+K W+++ + + ++ FF+K + + +N T+++LI Sbjct: 346 IHKVVFSMPNDKSPGPDGYTAEFYKGAWNIIGAEFILAIQSFFAKGFLPKGINSTILALI 405 Query: 181 PKTTHDPGVGDFRPIACTNVVYKIITK 261 PK + D+RPI+C NV+YK+I+K Sbjct: 406 PKKKEAKEMKDYRPISCCNVLYKVISK 432 >gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1352 Score = 259 bits (662), Expect(2) = 2e-69 Identities = 140/365 (38%), Positives = 212/365 (58%), Gaps = 10/365 (2%) Frame = +2 Query: 191 LMTRELEISG-----PLLALMLFIK*SQKILISRMALFLQKLISLAQSVFIKGRTIMDNF 355 L++++ E+SG P+ + K K++ +R+ L I+ QS FIK R +M+N Sbjct: 663 LISKKHEVSGMKDYRPISCCNVLYKIVSKLMANRLKEILPASIAPNQSAFIKDRLMMENL 722 Query: 356 YLAQELIRTYERKSGITARCMVKIDLRKAYDCISWDFLRDVLHGLNFHPCFVYWILTYVI 535 LA EL++ Y ++S I++R +KID+ KA+D + W FL +VL ++ F++WI + Sbjct: 723 LLASELVKDYHKES-ISSRSALKIDISKAFDFVQWPFLINVLKAIHLPEMFIHWIELCIG 781 Query: 536 SATFSITINGGSHGFVRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFMHHPKCS 715 +A+FS+ +NG GF R +RGLRQG +SP L++ CM LS ++ + +HP+C Sbjct: 782 TASFSVQVNGELSGFFRSERGLRQGCSLSPYLYVICMNVLSCMLDKAAVEKKISYHPRCR 841 Query: 716 TTDTTHLAFADDLLLFGRGDPDSMRVLRDALDEFTVTSGLTINKSKSHIFLGGVRPFEKR 895 + THL FADD+++F G S++ ++F S L I+ KS IF+ G+ P K Sbjct: 842 NMNLTHLCFADDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISPNAKT 901 Query: 896 AIMELFGFPEGTLPVKYLGLPLASKSLTIPDYSPLLAQISNFIQRWSNSNLSSAGWLELI 1075 +I++ F F GTLPVKYLGLPL +K +T DY PL+ +I I W+N LS AG L+LI Sbjct: 902 SILQQFPFELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQLI 961 Query: 1076 RSVLQGVGCYWLQALPLPGTVINRITKMLRKFLWC-----NSQCLVSWKTVCLPRGEGGL 1240 +SVL + +WL LP + I KM FLW + ++W VC + EGGL Sbjct: 962 KSVLSSITNFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKEEGGL 1021 Query: 1241 GLRDL 1255 GL+ L Sbjct: 1022 GLKPL 1026 Score = 33.1 bits (74), Expect(2) = 2e-69 Identities = 11/29 (37%), Positives = 17/29 (58%) Frame = +1 Query: 1282 
KTLWNIHAKTDSLWIKWIHAEYLRG*DIW 1368 K +W I + DSLW+KW++ +R W Sbjct: 1036 KLIWRILSARDSLWVKWVNKHLIRKETFW 1064 Score = 78.2 bits (191), Expect = 2e-11 Identities = 37/83 (44%), Positives = 56/83 (67%), Gaps = 2/83 (2%) Frame = +1 Query: 19 DIGDE--KAPGPDGYTSAFFKKNWDLVRDDVVASVDEFFSK*IILRKLNHTVVSLIPKTT 192 DI +E K+PGPDGYT FFK W ++ D+V ++ FF K + + +N T+++LI K Sbjct: 609 DIKEEAHKSPGPDGYTVEFFKTAWPVLGRDLVIAIQSFFLKGFLPKGINTTILALISKKH 668 Query: 193 HDPGVGDFRPIACTNVVYKIITK 261 G+ D+RPI+C NV+YKI++K Sbjct: 669 EVSGMKDYRPISCCNVLYKIVSK 691 Score = 63.9 bits (154), Expect = 4e-07 Identities = 47/158 (29%), Positives = 67/158 (42%), Gaps = 22/158 (13%) Frame = +2 Query: 1232 GGLGLRDLAVWNNPFIRRPCGTYMPKQ----------TPFGSNGSTLSTSEDRTFGK--- 1372 G G DL + NN + T+ K+ + ST DR+ K Sbjct: 1119 GSRGTIDLGIPNNATVAEVMNTHRRKRHRADFLNQIKSQIELARQDRSTDGDRSLWKQKE 1178 Query: 1373 -------GTSEAYEHFRAKGEKKFWYKAVWRSYIPPKFSVTLWLALHGRLKTFDRM-KYS 1528 +S+ ++ R+ + WY+ VW S PK+S WLA H RL T D++ K++ Sbjct: 1179 DTFKSSFSSSKTWQQIRSISLRCDWYRGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWN 1238 Query: 1529 DIAR-GCVLCESTDETHDHLFFKFEKALAVWSGICSWL 1639 AR CV C ET DHLFF + VW + L Sbjct: 1239 SGARYDCVFCGEELETRDHLFFSCPYSSHVWFSLTKGL 1276 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 265 bits (678), Expect(2) = 4e-69 Identities = 142/360 (39%), Positives = 201/360 (55%), Gaps = 5/360 (1%) Frame = +2 Query: 203 ELEISGPLLALMLFIK*SQKILISRMALFLQKLISLAQSVFIKGRTIMDNFYLAQELIRT 382 E++ P+ + K KIL +R+ L L I QS F+K R +M+N LA EL++ Sbjct: 822 EMKDYRPISCCNVLYKVISKILANRLKLLLPSFILQNQSAFVKERLLMENVLLATELVKD 881 Query: 383 YERKSGITARCMVKIDLRKAYDCISWDFLRDVLHGLNFHPCFVYWILTYVISATFSITIN 562 Y ++S +T RC +KID+ KA+D + W FL + L LNF F +WI + +ATFS+ +N Sbjct: 882 YHKES-VTPRCAMKIDISKAFDSVQWQFLLNTLEALNFPETFRHWIKLCISTATFSVQVN 940 Query: 563 GGSHGFVRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFMHHPKCSTTDTTHLAF 742 G GF RGLRQG +SP LF+ CM LS +I +HPKC THL F Sbjct: 941 
GELAGFFGSSRGLRQGCALSPYLFVICMNVLSHMIDEAAVHRNIGYHPKCEKIGLTHLCF 1000 Query: 743 ADDLLLFGRGDPDSMRVLRDALDEFTVTSGLTINKSKSHIFLGGVRPFEKRAIMELFGFP 922 ADDL++F G S+ + + EF SGL I+ KS I+L GV ++ + F F Sbjct: 1001 ADDLMVFVDGHQWSIEGVINVFKEFAGRSGLQISLEKSTIYLAGVSASDRVQTLSSFPFA 1060 Query: 923 EGTLPVKYLGLPLASKSLTIPDYSPLLAQISNFIQRWSNSNLSSAGWLELIRSVLQGVGC 1102 G LPV+YLGLPL +K +T DYSPL+ + I W+ +LS AG L L+ SV+ + Sbjct: 1061 NGQLPVRYLGLPLLTKQMTTADYSPLIEAVKTKISSWTARSLSYAGRLALLNSVIVSIAN 1120 Query: 1103 YWLQALPLPGTVINRITKMLRKFLWCN-----SQCLVSWKTVCLPRGEGGLGLRDLAVWN 1267 +W+ A LP I I K+ FLW + ++W ++C P+ EGGLG++ LA N Sbjct: 1121 FWMSAYRLPAGCIREIEKLCSAFLWSGPVLNPKKAKIAWSSICQPKKEGGLGIKSLAEAN 1180 Score = 26.2 bits (56), Expect(2) = 4e-69 Identities = 10/32 (31%), Positives = 15/32 (46%) Frame = +1 Query: 1282 KTLWNIHAKTDSLWIKWIHAEYLRG*DIW*RN 1377 K +W + + SLW+ WI +R W N Sbjct: 1186 KLIWRLLSTQPSLWVTWIWTFIIRKGTFWSAN 1217 Score = 88.2 bits (217), Expect = 2e-14 Identities = 40/87 (45%), Positives = 58/87 (66%) Frame = +1 Query: 1 IRTALFDIGDEKAPGPDGYTSAFFKKNWDLVRDDVVASVDEFFSK*IILRKLNHTVVSLI 180 I+ LF + + K+PGPDGYTS FFK W L D +A++ FF K + + LN T+++LI Sbjct: 755 IQKVLFAMPNNKSPGPDGYTSEFFKATWSLTGPDFIAAIQSFFVKGFLPKGLNATILALI 814 Query: 181 PKTTHDPGVGDFRPIACTNVVYKIITK 261 PK + D+RPI+C NV+YK+I+K Sbjct: 815 PKKDEAIEMKDYRPISCCNVLYKVISK 841 Score = 68.6 bits (166), Expect = 2e-08 Identities = 34/94 (36%), Positives = 47/94 (50%), Gaps = 2/94 (2%) Frame = +2 Query: 1376 TSEAYEHFRAKGEKKFWYKAVWRSYIPPKFSVTLWLALHGRLKTFDRMKYSDIAR--GCV 1549 T + + R ++ WYK VW Y PK+S LWL + RL T DR+K + + C Sbjct: 1338 TKVTWNNVRTHQPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSGQLVTCT 1397 Query: 1550 LCESTDETHDHLFFKFEKALAVWSGICSWLRCRN 1651 LC + +ET DHLFF + VW + L N Sbjct: 1398 LCNNAEETRDHLFFSCQYTSYVWEALTQRLLSTN 1431 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 269 bits (687), Expect = 6e-69 Identities = 142/361 (39%), Positives = 210/361 (58%), Gaps = 6/361 (1%) Frame = +2 Query: 
221 PLLALMLFIK*SQKILISRMALFLQKLISLAQSVFIKGRTIMDNFYLAQELIRTYERKSG 400 P+ L K K+L SR+ L +I +QS F+ GR++ +N LA E++ Y R + Sbjct: 385 PISCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLPGRSLAENVLLATEMVHGYNRLN- 443 Query: 401 ITARCMVKIDLRKAYDCISWDFLRDVLHGLNFHPCFVYWILTYVISATFSITINGGSHGF 580 I+ R M+K+DL+KA+D + W+F+ L L ++ WI + + +F+I++NG + GF Sbjct: 444 ISPRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCITTPSFTISVNGATGGF 503 Query: 581 VRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFMH-HPKCSTTDTTHLAFADDLL 757 R +GLRQGDP+SP LF+ ME S+L+++R +DS ++H HPK +HL FADD++ Sbjct: 504 FRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSR-YDSGYIHYHPKAGDLSISHLMFADDVM 562 Query: 758 LFGRGDPDSMRVLRDALDEFTVTSGLTINKSKSHIFLGGVRPFEKRAIMELFGFPEGTLP 937 +F G SM + + LD+F SGL +NK KS +F G+ +R +GFP GT P Sbjct: 563 IFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL-DLSERITSAAYGFPAGTFP 621 Query: 938 VKYLGLPLASKSLTIPDYSPLLAQISNFIQRWSNSNLSSAGWLELIRSVLQGVGCYWLQA 1117 ++YLGLPL + L I DY PLL ++S ++ W + LS AG +LI SV+ G+ +W+ Sbjct: 622 IRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMST 681 Query: 1118 LPLPGTVINRITKMLRKFLWCNS-----QCLVSWKTVCLPRGEGGLGLRDLAVWNNPFIR 1282 LP I +I + KFLW S VSW CLP+ EGGLG R WN + Sbjct: 682 FLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNKTLLL 741 Query: 1283 R 1285 R Sbjct: 742 R 742 Score = 78.6 bits (192), Expect = 1e-11 Identities = 34/87 (39%), Positives = 56/87 (64%) Frame = +1 Query: 1 IRTALFDIGDEKAPGPDGYTSAFFKKNWDLVRDDVVASVDEFFSK*IILRKLNHTVVSLI 180 I+ A + K GPDGY+ FF+ W ++ +V+A++ EFF +L++ N T + LI Sbjct: 312 IKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFDSGQLLKQWNATTLVLI 371 Query: 181 PKTTHDPGVGDFRPIACTNVVYKIITK 261 PKT++ + +FRPI+C N +YK+I+K Sbjct: 372 PKTSNACTISEFRPISCLNTLYKVISK 398 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 269 bits (687), Expect = 6e-69 Identities = 142/361 (39%), Positives = 210/361 (58%), Gaps = 6/361 (1%) Frame = +2 Query: 221 PLLALMLFIK*SQKILISRMALFLQKLISLAQSVFIKGRTIMDNFYLAQELIRTYERKSG 400 P+ L K K+L SR+ L +I +QS F+ GR++ +N LA E++ Y R + Sbjct: 385 
PISCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLPGRSLAENVLLATEMVHGYNRLN- 443 Query: 401 ITARCMVKIDLRKAYDCISWDFLRDVLHGLNFHPCFVYWILTYVISATFSITINGGSHGF 580 I+ R M+K+DL+KA+D + W+F+ L L ++ WI + + +F+I++NG + GF Sbjct: 444 ISPRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCITTPSFTISVNGATGGF 503 Query: 581 VRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFMH-HPKCSTTDTTHLAFADDLL 757 R +GLRQGDP+SP LF+ ME S+L+++R +DS ++H HPK +HL FADD++ Sbjct: 504 FRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSR-YDSGYIHYHPKAGDLSISHLMFADDVM 562 Query: 758 LFGRGDPDSMRVLRDALDEFTVTSGLTINKSKSHIFLGGVRPFEKRAIMELFGFPEGTLP 937 +F G SM + + LD+F SGL +NK KS +F G+ +R +GFP GT P Sbjct: 563 IFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL-DLSERITSAAYGFPAGTFP 621 Query: 938 VKYLGLPLASKSLTIPDYSPLLAQISNFIQRWSNSNLSSAGWLELIRSVLQGVGCYWLQA 1117 ++YLGLPL + L I DY PLL ++S ++ W + LS AG +LI SV+ G+ +W+ Sbjct: 622 IRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMST 681 Query: 1118 LPLPGTVINRITKMLRKFLWCNS-----QCLVSWKTVCLPRGEGGLGLRDLAVWNNPFIR 1282 LP I +I + KFLW S VSW CLP+ EGGLG R WN + Sbjct: 682 FLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNKTLLL 741 Query: 1283 R 1285 R Sbjct: 742 R 742 Score = 78.6 bits (192), Expect = 1e-11 Identities = 34/87 (39%), Positives = 56/87 (64%) Frame = +1 Query: 1 IRTALFDIGDEKAPGPDGYTSAFFKKNWDLVRDDVVASVDEFFSK*IILRKLNHTVVSLI 180 I+ A + K GPDGY+ FF+ W ++ +V+A++ EFF +L++ N T + LI Sbjct: 312 IKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFDSGQLLKQWNATTLVLI 371 Query: 181 PKTTHDPGVGDFRPIACTNVVYKIITK 261 PKT++ + +FRPI+C N +YK+I+K Sbjct: 372 PKTSNACTISEFRPISCLNTLYKVISK 398 >gb|ABD96948.1| hypothetical protein [Cleome spinosa] Length = 539 Score = 269 bits (687), Expect = 6e-69 Identities = 166/492 (33%), Positives = 247/492 (50%), Gaps = 52/492 (10%) Frame = +2 Query: 299 LISLAQSVFIKGRTIMDNFYLAQELIRTYERKSGITARCMVKIDLRKAYDCISWDFLRDV 478 + S Q F++GR +++N LA EL+ Y R + + R M+KIDLRKA+D +SW+F+ + Sbjct: 4 IFSPNQGAFLEGRLMVENVLLATELVHEYNRPN-TSKRAMLKIDLRKAFDTVSWEFITKI 62 Query: 479 LHGLNFHPCFVYWILTYVISATFSITINGGSHGFVRGKRGLRQGDPMSPTLFLFCMEYLS 658 + LN 
FV W+ + + FS++ING G+ +G+RGLRQGDP+SP LF+ ME LS Sbjct: 63 MQALNLPRTFVTWVKVCMETPKFSVSINGELAGYFKGRRGLRQGDPLSPYLFIMSMEVLS 122 Query: 659 RLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLLFGRGDPDSMRVLRDALDEFTVTSGLT 838 R++ +S HPKC + THLAFADD+++F G+ S+ +++ LD F+ SGL Sbjct: 123 RMLDRCAAESRLSLHPKCHSPVITHLAFADDIMIFTSGETRSLLEVKNTLDSFSRASGLY 182 Query: 839 INKSKSHIFLGGVRPFEKRAIMELFGFPEGTLPVKYLGLPLASKSLTIPDYSPLLAQISN 1018 +N K+ IFL G+ E + + GF G LPV+YLG+ L+ LT DY PLL ++ Sbjct: 183 LNTEKTEIFLRGLNGTEASTLCAVIGFTRGYLPVRYLGVSLSPVRLTKSDYQPLLDRVKA 242 Query: 1019 FIQRWSNSNLSSAGWLELIRSVLQGVGCYWLQALPLPGTVINRITKMLRKFLW-CNSQCL 1195 I W+ LS AG L+L+ +V+ G+ W LP ++ ++ FLW + Sbjct: 243 KINSWTTRYLSYAGRLQLVGTVIYGMVNAWGMIFMLPKFFTKQVDRLCAGFLWGAGTTHR 302 Query: 1196 VSWKTVCLPRGEGGLGLRDLAVWN-NPF-----------------IRRPCG--------- 1294 VSW T C PR EGGLGLR +A +N +P+ +R P Sbjct: 303 VSWDTCCRPRKEGGLGLRKIAEFNQDPWTIYGSLLRYVGLTGPRSLRIPLPSSVSQAVAG 362 Query: 1295 --------------------TYMPKQTPFGSNGSTLSTSEDRTFGK--GTSEAYEHFRAK 1408 + +P +P G + S L ++ F +S + R Sbjct: 363 DSWIFPGVRSDRLQQVLAHISTIPPPSPDGPSDSALWKYKEEDFRPYFSSSRTWNLTRTV 422 Query: 1409 GEKKFWYKAVWRSYIPPKFSVTLWLALHGRLKTFDRMKYSDIARG--CVLCESTDETHDH 1582 W VW P+ + W + RL T DR++ I C LC+ DE+H H Sbjct: 423 HVIAPWSSIVWFPLAIPRHAFLHWQVMLFRLPTKDRLQQWGITSDATCRLCDGEDESHQH 482 Query: 1583 LFFKFEKALAVW 1618 LFF A +W Sbjct: 483 LFFGCTYASHLW 494 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 258 bits (658), Expect(2) = 7e-68 Identities = 142/357 (39%), Positives = 202/357 (56%), Gaps = 5/357 (1%) Frame = +2 Query: 200 RELEISGPLLALMLFIK*SQKILISRMALFLQKLISLAQSVFIKGRTIMDNFYLAQELIR 379 RE++ P+ + K KIL +R+ L K I QS F+K R +++N LA EL++ Sbjct: 245 REIKDYRPISCCNVLYKAISKILANRLKRILPKFIVGNQSAFVKDRLLIENVLLATELVK 304 Query: 380 TYERKSGITARCMVKIDLRKAYDCISWDFLRDVLHGLNFHPCFVYWILTYVISATFSITI 559 Y + S I+ RC +KID+ KA+D + W FL VL +NF F++WI + +A+FSI + Sbjct: 305 DYHKDS-ISTRCAMKIDISKAFDSLQWSFLTHVLAAMNFPGEFIHWISLCMSTASFSIQV 363 Query: 560 
NGGSHGFVRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFMHHPKCSTTDTTHLA 739 NG G+ R RGLRQG +SP LF+ M+ LSR++ F +HP+C T THL Sbjct: 364 NGELAGYFRSARGLRQGCSLSPYLFVISMDVLSRMLDKAAGAREFGYHPRCKTLGLTHLC 423 Query: 740 FADDLLLFGRGDPDSMRVLRDALDEFTVTSGLTINKSKSHIFLGGVRPFEKRAIMELFGF 919 FADDL++ G S+ + L++F GL I K+ ++L GV ++ + + F Sbjct: 424 FADDLMILTDGKIRSVDGIVKVLNQFAAKLGLKICMEKTTLYLAGVSDHSRQLMSSRYSF 483 Query: 920 PEGTLPVKYLGLPLASKSLTIPDYSPLLAQISNFIQRWSNSNLSSAGWLELIRSVLQGVG 1099 G LPV+YLGLPL +K LT DYSPL+ QI I W++ LS AG L LI SVL + Sbjct: 484 GVGKLPVRYLGLPLVTKRLTTSDYSPLIDQIRRRIGMWTSRYLSFAGRLSLINSVLWSIT 543 Query: 1100 CYWLQALPLPGTVINRITKMLRKFLWCN-----SQCLVSWKTVCLPRGEGGLGLRDL 1255 +W+ A LP IN I ++ LW + VSW +C P+ EGGLGL+ L Sbjct: 544 NFWMNAFRLPRECINEINRISSALLWSGPELNPKKAKVSWDEICKPKKEGGLGLQSL 600 Score = 29.6 bits (65), Expect(2) = 7e-68 Identities = 10/29 (34%), Positives = 15/29 (51%) Frame = +1 Query: 1282 KTLWNIHAKTDSLWIKWIHAEYLRG*DIW 1368 K +W + + DSLW+KW L+ W Sbjct: 610 KLIWRLLSCQDSLWVKWTRMNLLKKESFW 638 Score = 90.1 bits (222), Expect = 5e-15 Identities = 36/87 (41%), Positives = 62/87 (71%) Frame = +1 Query: 1 IRTALFDIGDEKAPGPDGYTSAFFKKNWDLVRDDVVASVDEFFSK*IILRKLNHTVVSLI 180 I+ +F + +K+PGPDGYTS F+K +W+++ D+V+ ++ FF+K + + +N T+++LI Sbjct: 179 IKKVIFSMPKDKSPGPDGYTSEFYKASWEIIGDEVIIAIQSFFAKGFLPKGVNSTILALI 238 Query: 181 PKTTHDPGVGDFRPIACTNVVYKIITK 261 PK + D+RPI+C NV+YK I+K Sbjct: 239 PKKKEAREIKDYRPISCCNVLYKAISK 265 Score = 78.2 bits (191), Expect = 2e-11 Identities = 45/154 (29%), Positives = 74/154 (48%), Gaps = 2/154 (1%) Frame = +2 Query: 1376 TSEAYEHFRAKGEKKFWYKAVWRSYIPPKFSVTLWLALHGRLKTFDRMK--YSDIARGCV 1549 T + + H R ++ W+K VW ++ PKFS WLA+ RL T DRM + CV Sbjct: 762 TKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPTTCV 821 Query: 1550 LCESTDETHDHLFFKFEKALAVWSGICSWLRCRNQMTTIPSVVRRFQREKAGSGIIRKAK 1729 C S ET DHLFF+ + +W+ I + +++ +T S V + + I Sbjct: 822 FCSSPMETRDHLFFQCCYSSEIWTSIAKNV-YKDRFSTKWSAVVNYISDSQPDRIQSFLS 880 Query: 1730 WVALGATVQYLWHARNLKYVEKKPFEASHVIKEI 1831 ++ +W RN + +K 
AS++I++I Sbjct: 881 RYTFQVSIHSIWRERNSRRHGEKSRSASNLIRQI 914 >gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana] Length = 740 Score = 257 bits (657), Expect(2) = 2e-67 Identities = 137/350 (39%), Positives = 197/350 (56%), Gaps = 5/350 (1%) Frame = +2 Query: 221 PLLALMLFIK*SQKILISRMALFLQKLISLAQSVFIKGRTIMDNFYLAQELIRTYERKSG 400 P+ + K KI+ +R+ + L I QS F++ R +++N LA EL++ Y + S Sbjct: 104 PISCCNVIYKVISKIIANRLKVMLPTFILQNQSAFVRERLLIENVLLATELVKDYHKDS- 162 Query: 401 ITARCMVKIDLRKAYDCISWDFLRDVLHGLNFHPCFVYWILTYVISATFSITINGGSHGF 580 I+ RC +KID+ KA+D + W FL + L LNF F +WI + +ATFS+ +NG GF Sbjct: 163 ISPRCAMKIDISKAFDSVQWQFLLNTLEALNFPENFCHWIKLCISTATFSVQVNGELAGF 222 Query: 581 VRGKRGLRQGDPMSPTLFLFCMEYLSRLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLL 760 KRGLRQG +SP LF+ CM LS +I +HPKC THL FADDL++ Sbjct: 223 FGSKRGLRQGCALSPYLFVICMNVLSHMIDVAAVHRNIGYHPKCKKLSLTHLCFADDLMV 282 Query: 761 FGRGDPDSMRVLRDALDEFTVTSGLTINKSKSHIFLGGVRPFEKRAIMELFGFPEGTLPV 940 F G S+ + + EF SGL I+ KS ++L GV + I+ F F G LPV Sbjct: 283 FIDGQQRSVEGVINIFKEFAGKSGLHISLEKSTLYLAGVSELNRNNILSAFPFASGQLPV 342 Query: 941 KYLGLPLASKSLTIPDYSPLLAQISNFIQRWSNSNLSSAGWLELIRSVLQGVGCYWLQAL 1120 +YLGLPL +K +T DYSPLL ++ + I W+ +LS AG L LI SV+ + +W+ A Sbjct: 343 RYLGLPLLTKQMTTADYSPLLDKVRSKISSWTARSLSYAGRLALINSVIVSLSNFWMSAY 402 Query: 1121 PLPGTVINRITKMLRKFLWCN-----SQCLVSWKTVCLPRGEGGLGLRDL 1255 LP I I K+ FLW + ++W ++C + EGGLG++ L Sbjct: 403 RLPAGCIKEIEKLCSAFLWSGPELNPKKAKITWTSLCKLKQEGGLGIKSL 452 Score = 28.5 bits (62), Expect(2) = 2e-67 Identities = 9/32 (28%), Positives = 16/32 (50%) Frame = +1 Query: 1282 KTLWNIHAKTDSLWIKWIHAEYLRG*DIW*RN 1377 K +W + ++ SLW+ W+ +R W N Sbjct: 462 KLIWRLVSRQSSLWVNWVWTYIIRKGSFWSAN 493 Score = 85.1 bits (209), Expect = 2e-13 Identities = 38/83 (45%), Positives = 54/83 (65%) Frame = +1 Query: 13 LFDIGDEKAPGPDGYTSAFFKKNWDLVRDDVVASVDEFFSK*IILRKLNHTVVSLIPKTT 192 LF + K PGPDGYTS FFK W + D +A++ FF K + + LN T+++LIPK Sbjct: 35 LFAMPSNKFPGPDGYTSEFFKATWSITGQDFIAAIKSFFIKGFLPKGLNATILALIPKKD 94 Query: 193 HDPGVGDFRPIACTNVVYKIITK 261 + 
D+RPI+C NV+YK+I+K Sbjct: 95 EATLMRDYRPISCCNVIYKVISK 117