BLASTX nr result
ID: Ephedra25_contig00009718
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00009718
         (2033 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

gb|AAD32906.1| putative retroelement pol polyprotein [Arabidopsi...  239  3e-60
emb|CAN77122.1| hypothetical protein VITISV_013624 [Vitis vinifera]  235  5e-59
gb|AGW47867.1| polyprotein [Phaseolus vulgaris]                      234  1e-58
dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsi...  233  3e-58
gb|ACL97384.1| Gag-Pol polyprotein [Medicago truncatula]             232  4e-58
dbj|BAA11674.1| unnamed protein product [Nicotiana tabacum]          232  5e-58
emb|CAN61272.1| hypothetical protein VITISV_039063 [Vitis vinifera]  230  2e-57
gb|ACL97385.1| Gag-Pol polyprotein [Medicago truncatula]             229  4e-57
emb|CAN68235.1| hypothetical protein VITISV_037104 [Vitis vinifera]  228  6e-57
gb|ACL97386.1| Gag-Pol polyprotein [Medicago truncatula]             227  1e-56
gb|ACL97383.1| Gag-Pol polyprotein [Medicago truncatula]             227  1e-56
ref|XP_005715938.1| unnamed protein product [Chondrus crispus] g...  202  2e-56
gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabi...  224  8e-56
emb|CAA31653.1| polyprotein [Arabidopsis thaliana]                   224  1e-55
gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabi...  224  1e-55
pir||S23319 hypothetical protein 2 - Arabidopsis thaliana retrot...  223  2e-55
ref|XP_002064813.1| GK15001 [Drosophila willistoni] gi|194160898...  223  2e-55
gb|AAD17409.1| putative retroelement pol polyprotein [Arabidopsi...
223 2e-55 gb|EOY11267.1| Uncharacterized protein TCM_026511 [Theobroma cacao] 222 4e-55 emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana] 222 4e-55 >gb|AAD32906.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 822 Score = 239 bits (611), Expect = 3e-60 Identities = 140/416 (33%), Positives = 224/416 (53%), Gaps = 6/416 (1%) Frame = -2 Query: 1330 LDSGCISHMTGNESILTNITWKEDNNSIKIAGTDMLTIRGFGDTEIILPNFKLQLKNDLL 1151 +DSGC +HMT NE + T I ++ I++ ++ G GD E++ K +++ LL Sbjct: 32 IDSGCTNHMTPNEKLFTKIN-RDFKVPIRVGNGAVMMSEGKGDIEVMTRKDKRGIRDVLL 90 Query: 1150 VHRLRRNLISVKKLVMAG--ITIEGNLTKFHDDQLRLFYNKKLISTSIGSS*LYLLKSDT 977 V +L +NL+SV ++++ G +T++ N HD + ++++ S + L+ + Sbjct: 91 VPKLGKNLLSVPQMIINGYQVTLKNNYCTIHDSARKKIGEVEMVNKS------FHLRWLS 144 Query: 976 LRESNLANQRNTWKQWNQNLGHVNDIYLNEIYKKVNGKNLP--NTQE-ICESCAKGKMSR 806 E+ + + + W++ LGH L + K LP N +E CESC K SR Sbjct: 145 NEETAMVAKDEATELWHKRLGHTGHSNLKILQSKEMVTGLPKFNVEEGKCESCILSKHSR 204 Query: 805 SPFI-QSNTKTSRILELIHSAICGPMPTQSYQGYRYFMSFIDDFSRYSFVYLLKSRDEVF 629 PF +S T+ LELIHS +CGPM S G RY ++FIDD +R +VY LK++ EVF Sbjct: 205 DPFPKESETRAKHKLELIHSDVCGPMQNSSINGSRYILTFIDDATRMVWVYFLKAKSEVF 264 Query: 628 EKFKEYRKLMQNKKGVKLACLRSNGGLEYKSTEFLAYYQNKGITREIIAPYSPQ*NGLAE 449 + FK+++ L++N ++ LR + G EY S EF + + GI R++ A YSPQ N ++E Sbjct: 265 QTFKKFKNLVENNANCRIKKLRIDRGTEYLSKEFSEFLEGNGIERQLTAAYSPQQNEVSE 324 Query: 448 *KI*TLMECVRCLLDNVGLSKKWWGDALLTANYTTNYWPNSNNDMKIPK*VWKKEKINIQ 269 + +L+E R ++ L K W +A+ A Y N P K P W K ++ Sbjct: 325 RRNRSLVEMARAMIKAKDLPLKLWAEAVHVAAYAQNRTPTRTLKNKTPLEAWSDSKPSVS 384 Query: 268 HLKIFGDEYFCLVPKELRKKLDPKFRSGIFLGYPLGTKGYKLMDCDTKKIIVSRDI 101 H+K+FG + +P E R+K D K + IF+GY TKGY++ KI +SRD+ Sbjct: 385 HMKVFGSICYVHIPDEKRRKWDDKSKRAIFVGYSSQTKGYRVYLLKENKIDISRDV 440 >emb|CAN77122.1| hypothetical protein VITISV_013624 [Vitis vinifera] Length = 1269 Score = 235 bits (600), Expect = 5e-59 Identities = 155/502 (30%), Positives = 252/502 (50%), Gaps = 10/502 (1%) Frame = -2 Query: 1549 
HNPKNCWND---LTNPNAVELKERAKRRMRPARKEANITKVINPNTIALITEHINDSNKY 1379 H K+CW+ L N N +++ R +K++ P A +TE + +++ Sbjct: 245 HAEKDCWHKGKPLFNCNFCNKLGHSEKYCRAKKKQSQH----QPEQHASVTEEDKNDDEH 300 Query: 1378 TPELNICIDDN-YSI*CLDSGCISHMTGNESILTNITWKEDNNSIKIAGTDMLTIRGFGD 1202 + + + + +DSGC SHMT + SI T+I + +K+ +++ +G G Sbjct: 301 LFMASQALSSHELNTWLIDSGCTSHMTKHLSIFTSID-RSVQPKVKLGNGEVVQAKGKGT 359 Query: 1201 TEIILPNFKLQLKNDLLVHRLRRNLISVKKLVMAG--ITIEGNLTKFHDDQLRLFYNKKL 1028 I + N L + L +NL+SV +++ G ++ + N D + K+ Sbjct: 360 IAISTKRGTKIVTNVLYIPDLDQNLLSVAQMLRNGYAVSFKENFCFISD-----VHGTKI 414 Query: 1027 ISTSIGSS*LYLLKSDTLRESNLANQRNTWKQWNQNLGHVNDIYLNEIYKKVNGKNLPNT 848 + + YL K D + + + + W++ GH N L + + +++P Sbjct: 415 AKIKMNGNSFYL-KLDLVEGHVFSAKIDESVVWHKRYGHFNLKSLRFMQEAGMVEDMPEI 473 Query: 847 Q---EICESCAKGKMSRSPFIQSNTK-TSRILELIHSAICGPMPTQSYQGYRYFMSFIDD 680 + CESC GK R PF Q+ +K + LELIHS ICGPM T S YF FIDD Sbjct: 474 SVNAQTCESCELGKQQRQPFPQNMSKRATHKLELIHSDICGPMSTTSLSNNVYFALFIDD 533 Query: 679 FSRYSFVYLLKSRDEVFEKFKEYRKLMQNKKGVKLACLRSNGGLEYKSTEFLAYYQNKGI 500 FSR ++VY LK++ +V FK ++K+++ + G + LR++ G EY S EF + Q GI Sbjct: 534 FSRMTWVYFLKTKSQVLSMFKSFKKMVETQSGQNVKVLRTDNGGEYTSKEFSVFCQEAGI 593 Query: 499 TREIIAPYSPQ*NGLAE*KI*TLMECVRCLLDNVGLSKKWWGDALLTANYTTNYWPNSNN 320 ++ APYSPQ NG+++ K T+ME RC+L L K W +A+ T+ Y N P + Sbjct: 594 VHQLTAPYSPQQNGVSKRKNRTVMEMARCMLFEKKLPKLLWAEAVNTSVYLLNRLPTKSV 653 Query: 319 DMKIPK*VWKKEKINIQHLKIFGDEYFCLVPKELRKKLDPKFRSGIFLGYPLGTKGYKLM 140 K P W K +++HLK+FG + VP R KLD + G+F+GY +KGY++ Sbjct: 654 QSKTPIEAWSGVKPSVKHLKVFGSFCYLHVPSVKRGKLDERAEKGVFVGYVAESKGYRIY 713 Query: 139 DCDTKKIIVSRDISNKQNELDN 74 KI++SRD+ +N N Sbjct: 714 SLSRMKIVISRDVHFDENSYWN 735 >gb|AGW47867.1| polyprotein [Phaseolus vulgaris] Length = 1471 Score = 234 bits (597), Expect = 1e-58 Identities = 154/468 (32%), Positives = 256/468 (54%), Gaps = 12/468 (2%) Frame = -2 Query: 1372 ELNICIDDNYSI*CLDSGCISHMTGNESILTNITWKEDNNSIKIAGTDMLTIRGFGDTEI 1193 E+NI +N ++ LDSG +HM G+E + ++ ED + + + ++G G Sbjct: 347 
EVNI---NNDTLWYLDSGASNHMCGHEYLFKDMQKIEDGH-VSFGDASKVEVKGRGTVCY 402 Query: 1192 ILPNFKL-QLKNDLLVHRLRRNLISVKKLVMAGITIEGNLTKFHDDQLRLFYNKK---LI 1025 + + + L++ V L+ N++S+ +L G +I F D+ NK+ + Sbjct: 403 LQKDGLIGSLQDVYYVPDLKTNILSMGQLTEKGYSI------FLKDRFLHLKNKQGCLVA 456 Query: 1024 STSIGSS*LYLLKSDTLRESNL-ANQRNTWKQWNQNLGHVNDIYLNEIYKKVNGKNLPNT 848 + + +Y L ++RE L N + W+ GH++ L E+ KK LPN Sbjct: 457 RIEMARNRMYKLNLRSIREKCLQVNIEDKASLWHLRFGHLHHGGLKELAKKNMVHGLPNM 516 Query: 847 Q---EICESCAKGKMSRSPFIQ-SNTKTSRILELIHSAICGPMPTQSYQGYRYFMSFIDD 680 + CE C K R+ F + + + LELIH+ ICGP+ +S+ G RYF++FIDD Sbjct: 517 DYEGKFCEECVLSKHVRTSFPKKAQYWAKQPLELIHTDICGPITPESFSGKRYFITFIDD 576 Query: 679 FSRYSFVYLLKSRDEVFEKFKEYRKLMQNKKGVKLACLRSNGGLEYKSTEFLAYYQNKGI 500 FSR ++VY LK + E FE FK+++ +++ ++ +RS+ G EY ST F+ Y + +GI Sbjct: 577 FSRKTWVYFLKEKSEAFEVFKKFKVMVERTTDKQIKAVRSDRGGEYTSTTFMEYCEEQGI 636 Query: 499 TREIIAPYSPQ*NGLAE*KI*TLMECVRCLLDNVGLSKKWWGDALLTANYTTNYWPNSNN 320 R + APY+PQ NG+AE K T+++ VR +L + + K++W +A+ A Y N P+ Sbjct: 637 RRFLTAPYTPQQNGVAERKNRTILDMVRSMLKSKKMPKEFWAEAVQCAIYVQNRCPHVKL 696 Query: 319 DMKIPK*VWKKEKINIQHLKIFGDEYFCLVPKELRKKLDPKFRSGIFLGYPLGTKGYKLM 140 D + P+ W +K + HLK+FG + VP + R KL+ K + +F+GY TKGYKL+ Sbjct: 697 DDQTPQEAWSGQKPTVSHLKVFGSVAYAHVPDQRRTKLEDKSKRYVFIGYDEKTKGYKLL 756 Query: 139 DCDTKKIIVSRDIS-NKQNELDNLLENDESQILIPETKDIPD--NSDT 5 D +KK+ VSRD+ N+ +E D N+ S+++I + P NS+T Sbjct: 757 DPISKKVTVSRDVQINEASEWD---WNNSSEVMIEVGESSPTSINSET 801 >dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsis thaliana] Length = 1499 Score = 233 bits (593), Expect = 3e-58 Identities = 143/443 (32%), Positives = 238/443 (53%), Gaps = 9/443 (2%) Frame = -2 Query: 1330 LDSGCISHMTGNESILTNITWKEDNNSIKIAGTDMLTIRGFGDTEIILPNFKLQLKNDLL 1151 +DSGC +HM+ + + + I+I + G GD + +K+ L Sbjct: 326 VDSGCTNHMSKDVRHFIALD-RSKKIIIRIGNGGKVVSEGKGDIRVSTNKGDHVIKDVLY 384 Query: 1150 VHRLRRNLISVKKLVMAG--ITIEGNLTKFHDDQLRLFYNKKLISTSIGSS*LYLLKSDT 977 V L RNL+SV +++ G + E N D +K++ + ++ + Sbjct: 385 
VPELARNLLSVSQMISNGYRVIFEDNKCVIQD-----LKGRKILDIKMKDRSFPIIWKKS 439 Query: 976 LRESNLANQRNTWKQ--WNQNLGHVNDIYLNEIYKKVNGKNLPNTQEI---CESCAKGKM 812 E+ +A + + W++ GHVN + + + LP + I C +C GK Sbjct: 440 REETYMAFEEKEEQTDLWHKRFGHVNYDKIETMQTLKIVEKLPKFEVIKGICAACEMGKQ 499 Query: 811 SRSPFIQ-SNTKTSRILELIHSAICGPMPTQSYQGYRYFMSFIDDFSRYSFVYLLKSRDE 635 SR F + S + T++ LELIHS +CGPM T+S G RYF++FIDDFSR ++VY LK++ E Sbjct: 500 SRRSFPKKSQSNTNKTLELIHSDVCGPMQTESINGSRYFLTFIDDFSRMTWVYFLKNKSE 559 Query: 634 VFEKFKEYRKLMQNKKGVKLACLRSNGGLEYKSTEFLAYYQNKGITREIIAPYSPQ*NGL 455 V KFK ++ ++N+ ++ LR++GG E+ S EF+ Q GI EI PYSPQ NG+ Sbjct: 560 VITKFKIFKPYVENQSESRIKRLRTDGGGEFLSREFIKLCQESGIHHEITTPYSPQQNGV 619 Query: 454 AE*KI*TLMECVRCLLDNVGLSKKWWGDALLTANYTTNYWPNSNNDMKI-PK*VWKKEKI 278 AE + TL+E R +++ LS K+W +A+ T+ Y N P+ + + + P +W +K Sbjct: 620 AERRNRTLVEMARSMIEEKKLSNKFWAEAIATSTYLQNRLPSKSLEKGVTPMEIWSGKKP 679 Query: 277 NIQHLKIFGDEYFCLVPKELRKKLDPKFRSGIFLGYPLGTKGYKLMDCDTKKIIVSRDIS 98 ++ HLK+FG + +P E R+KLD K + GIF+GY +KGY++ + +KI VS+D++ Sbjct: 680 SVDHLKVFGCVCYIHIPDEKRRKLDTKAKQGIFVGYSNESKGYRVFLLNEEKIEVSKDVT 739 Query: 97 NKQNELDNLLENDESQILIPETK 29 + + + E E + ++ K Sbjct: 740 FDEKKTWSHDEKGERKAILSLVK 762 >gb|ACL97384.1| Gag-Pol polyprotein [Medicago truncatula] Length = 1305 Score = 232 bits (592), Expect = 4e-58 Identities = 165/538 (30%), Positives = 267/538 (49%), Gaps = 22/538 (4%) Frame = -2 Query: 1549 HNPKNCWNDLTNPNAVELKERAKRRMRPARKEANITKVINPNTIALITEHINDSNKYTPE 1370 H K CWN +K+ ++ + + + + I + S+K Sbjct: 238 HVKKECWN---------IKKNGEKNSEASTSQGCVASTSDDGEI--LYSEAATSSKGERR 286 Query: 1369 LNICIDDNYSI*CLDSGCISHMTGN-------ESILTNITWKEDNNSIKIAGTDMLTIRG 1211 LN + +DSG HMT + E I + ++++++IAG + ++ Sbjct: 287 LN-------DVWIMDSGATWHMTPHRDWFFSYEPISEGSVYMGNDHALEIAGVGTIRLKM 339 Query: 1210 FGDTEIILPNFKLQLKNDLLVHRLRRNLISVKKLVMAGITI--EGNLTKFHDDQLRLFYN 1037 T +++ V L++NL+SV +L G I E + K L + Sbjct: 340 HDGTV-------RKIQGVRHVKGLKKNLLSVGQLDDLGCKIHTESGILKVVKGNLVVMKA 392 Query: 1036 
KKLISTSIGSS*LYLLKSDTLRESNL----ANQRNTWKQWNQNLGHVNDIYLNEIYKK-- 875 +K+ S LY+L DTL+E++ A+Q T W+Q LGH+++ L + ++ Sbjct: 393 EKITSN------LYMLLGDTLQEADASVAAASQEETTMMWHQRLGHMSERGLKVLVERNL 446 Query: 874 ---VNGKNLPNTQEICESCAKGKMSRSPFIQSNTKTSRILELIHSAICGPMPTQSYQGYR 704 + NLP CE C K R F + T++ IL+LIHS + P S G R Sbjct: 447 LHGLKTVNLP----FCEHCVISKQHRLKFARVTTRSKHILDLIHSDVW-ESPELSLGGAR 501 Query: 703 YFMSFIDDFSRYSFVYLLKSRDEVFEKFKEYRKLMQNKKGVKLACLRSNGGLEYKSTEFL 524 YF+SFIDD+SR +VY +K + +VF FK ++ ++ + G K+ CLR++ G EY EFL Sbjct: 502 YFVSFIDDYSRRLWVYPIKKKSDVFPVFKAFKAQIELETGKKIKCLRTDNGGEYVDGEFL 561 Query: 523 AYYQNKGITREIIAPYSPQ*NGLAE*KI*TLMECVRCLLDNVGLSKKWWGDALLTANYTT 344 A+ + +GI R+ ++PQ NG+AE TL+E R +L G++K +W +A+ TA Y Sbjct: 562 AFCKQEGIVRQFTVAHTPQQNGVAERMNRTLLERTRAMLKTAGMAKSFWAEAVKTACYVI 621 Query: 343 NYWPNSNNDMKIPK*VWKKEKINIQHLKIFGDEYFCLVPKELRKKLDPKFRSGIFLGYPL 164 N P++ D+K P +WK + ++ L +FG + + + R KLDPK R IFLGY Sbjct: 622 NRSPSTAIDLKTPMEMWKGKPVDYSSLHVFGCPVYVMYNSQERTKLDPKSRKCIFLGYAD 681 Query: 163 GTKGYKLMDCDTKKIIVSRDISNKQNELDNLLEND----ESQILIPETKDIPDNSDTA 2 KGY+L D +K++VSRD+ +NEL + +ND E+ I+ E K +S A Sbjct: 682 NVKGYRLWDPTARKVVVSRDVVFAENELQSKQKNDSTSKETAIVQMEEKSKESDSSEA 739 >dbj|BAA11674.1| unnamed protein product [Nicotiana tabacum] Length = 1338 Score = 232 bits (591), Expect = 5e-58 Identities = 153/449 (34%), Positives = 237/449 (52%), Gaps = 14/449 (3%) Frame = -2 Query: 1405 EHINDSNKYTPELNICIDDNY-------SI*CLDSGCISHMTGNESILTNITWKEDNNSI 1247 E +D E N+ DD+ +DSG H T + ++ T D + Sbjct: 262 ESSDDETNSFGEFNVVYDDDIINLTTQEMTWVIDSGATIHATPRRELFSSYTLG-DFGRV 320 Query: 1246 KIAGTDMLTIRGFGDTEIILPN-FKLQLKNDLLVHRLRRNLISVKKLVMAGITIEGNLTK 1070 K+ + T+ G GD + N KL L++ V +R NLISV KL EG Sbjct: 321 KMGNANFSTVVGKGDVCLETMNGMKLLLRDVRHVPDMRLNLISVDKL-----DEEGYCNT 375 Query: 1069 FHDDQLRLFYNKKLISTSIGSS*LYLLKSDTLRES-NLANQRNTWKQWNQNLGHVNDIYL 893 FH+ Q +L +++ S LY+ ++ ++ N+A + K W++ LGH+++ + Sbjct: 376 FHNGQWKLTKGSLMVARGTKQSKLYVTQASISQQVINVAENDSNIKLWHRRLGHMSEKSM 435 Query: 892 
NEIYKKVNGKNLPNTQEI----CESCAKGKMSRSPFIQ-SNTKTSRILELIHSAICGPMP 728 + KK LP +I C C GK +R F + ++ +L+L+HS +CGP Sbjct: 436 ARLVKK---NALPGLNQIQLKKCADCLAGKQNRVSFKRFPPSRRQNVLDLVHSDVCGPFK 492 Query: 727 TQSYQGYRYFMSFIDDFSRYSFVYLLKSRDEVFEKFKEYRKLMQNKKGVKLACLRSNGGL 548 +S G RYF++FIDD SR ++VY LK++D+VF+ FK++ L++ + G KL C+R++ G Sbjct: 493 -KSLGGARYFVTFIDDHSRKTWVYTLKTKDQVFQVFKQFLTLVERETGKKLKCIRTDNGG 551 Query: 547 EYKSTEFLAYYQNKGITREIIAPYSPQ*NGLAE*KI*TLMECVRCLLDNVGLSKKWWGDA 368 EY+ +F AY + GI + P +PQ NGLAE TL+E RCLL + L K +WG+A Sbjct: 552 EYQG-QFDAYCKEHGIRHQFTPPKTPQLNGLAERMNRTLIERTRCLLSHSKLPKAFWGEA 610 Query: 367 LLTANYTTNYWPNSNNDMKIPK*VWKKEKINIQHLKIFGDEYFCLVPKELRKKLDPKFRS 188 L+TA Y N+ P K P+ +W I+ L++FG + + VPK+ R KLD K R Sbjct: 611 LVTAAYVLNHSPCVPLQYKAPEKIWLGRDISYDQLRVFGCKAYVHVPKDERSKLDVKTRE 670 Query: 187 GIFLGYPLGTKGYKLMDCDTKKIIVSRDI 101 +F+GY GYK D KK++ SRD+ Sbjct: 671 CVFIGYGQDMLGYKFYDPVEKKLVRSRDV 699 >emb|CAN61272.1| hypothetical protein VITISV_039063 [Vitis vinifera] Length = 1643 Score = 230 bits (586), Expect = 2e-57 Identities = 143/415 (34%), Positives = 228/415 (54%), Gaps = 4/415 (0%) Frame = -2 Query: 1330 LDSGCISHMTGNESILTNITWKEDNNSIKIAGTDMLTIRGFGDTEIILPNFKLQLKNDLL 1151 LD+G HM + + T T+KE N S+K+ L ++G G +I + + ++ N Sbjct: 622 LDTGASYHMAYSRDLFT--TFKEWNGSVKLGDDGELGVKGSGSVQIKMYDGLVRTLNAWY 679 Query: 1150 VHRLRRNLISVKKLVMAGITIEGNLTKFHDDQLRLFYNKKLISTSIGSS*LYLLKSDTLR 971 V LR+NLISV L G T G+ LR+ ++ +Y L ++ Sbjct: 680 VPGLRKNLISVGTLDKNGYTFSGS-----GGVLRVSKGALVVMKGRLQHGIYTLMGSSVL 734 Query: 970 ESNLANQRNTWKQWNQNLGHVNDIYLNEIYKK--VNGKNLPNTQEICESCAKGKMSRSPF 797 + + N + W++ LGH+++ L+ + K+ ++G + CE+C GK R F Sbjct: 735 GTAAVEEDNCTELWHRRLGHMSEKGLSILSKQGLLSGAETGKLK-FCETCVMGKQRRVKF 793 Query: 796 IQSNTKTSRILELIHSAICGPMPTQSYQGYRYFMSFIDDFSRYSFVYLLKSRDEVFEKFK 617 + T+ +LE IHS + GP P +S+ G RY+++FIDDFSR +VY LK++DEVF KFK Sbjct: 794 SMGSHTTNGVLEYIHSDLWGPSPVESHSGCRYYVTFIDDFSRKVWVYFLKAKDEVFGKFK 853 Query: 616 EYRKLMQNKKGVKLACLRSNGGLEYKSTEFLAYYQNKGITREIIAPYSPQ*NGLAE*KI* 437 E++ +++ + G + LR++ 
GLE+ + +F + + +GI R ++PQ NG+AE Sbjct: 854 EWKTMVEKRTGKVVKTLRTDNGLEFCNKDFDEFCRKEGIVRHRTVRHTPQQNGVAERMNQ 913 Query: 436 TLMECVRCLLDNVGLSKKWWGDALLTANYTTNYWPNSNNDMKIPK*VWKKEKINIQHLKI 257 TL++ RC+ + GLSKK+W +A+ TA Y N P++ D K P+ VW + N LKI Sbjct: 914 TLVQRARCMRIDAGLSKKFWAEAVNTAAYLVNRSPSTAIDFKTPQEVWSGKPSNYSGLKI 973 Query: 256 FGDEYFCLVPKELRKKLDPKFRSGIFLGYPLGTKGYKLMDCD--TKKIIVSRDIS 98 FG + V KL+P+ IFLGY G KGY+L + T K I+SRD++ Sbjct: 974 FGCPAYAHVSD---GKLEPRAMKCIFLGYATGVKGYRLWCTEDRTPKFIISRDVT 1025 >gb|ACL97385.1| Gag-Pol polyprotein [Medicago truncatula] Length = 1305 Score = 229 bits (583), Expect = 4e-57 Identities = 154/465 (33%), Positives = 244/465 (52%), Gaps = 22/465 (4%) Frame = -2 Query: 1330 LDSGCISHMTGN-------ESILTNITWKEDNNSIKIAGTDMLTIRGFGDTEIILPNFKL 1172 +DSG HMT + E I + ++++++IAG + ++ T Sbjct: 293 MDSGATWHMTPHRDWFYSYEPISEGSVYMGNDHALEIAGVGTIRLKMHDGTV-------R 345 Query: 1171 QLKNDLLVHRLRRNLISVKKLVMAGITI--EGNLTKFHDDQLRLFYNKKLISTSIGSS*L 998 +++ V L++NL+SV +L G I E + K L + +K+ S L Sbjct: 346 KIQGVRHVKGLKKNLLSVGQLDDLGCKIHSESGILKVVKGNLVVMKAEKITSN------L 399 Query: 997 YLLKSDTLRESNL----ANQRNTWKQWNQNLGHVNDIYLNEIYKK-----VNGKNLPNTQ 845 Y+L DTL+E++ A+Q T W+Q LGH+++ L + ++ + NLP Sbjct: 400 YMLLGDTLQEADASVAAASQEETTMMWHQRLGHMSERGLKVLVERNLLHGLKTVNLP--- 456 Query: 844 EICESCAKGKMSRSPFIQSNTKTSRILELIHSAICGPMPTQSYQGYRYFMSFIDDFSRYS 665 CE C K R F + T++ IL+LIHS + P S G RYF+SFIDD+SR Sbjct: 457 -FCEHCVISKQHRLKFARVTTRSKHILDLIHSDVW-ESPKLSLGGARYFVSFIDDYSRRL 514 Query: 664 FVYLLKSRDEVFEKFKEYRKLMQNKKGVKLACLRSNGGLEYKSTEFLAYYQNKGITREII 485 +VY +K + +VF FK ++ ++ + G K+ CLR++ G EY EFLA+ + +GI R+ Sbjct: 515 WVYPIKKKSDVFPVFKAFKAQIELETGKKIKCLRTDNGGEYVDGEFLAFCKQEGIVRQFT 574 Query: 484 APYSPQ*NGLAE*KI*TLMECVRCLLDNVGLSKKWWGDALLTANYTTNYWPNSNNDMKIP 305 ++PQ NG+AE TL+E R +L ++K +W +A+ TA Y N P++ D+K P Sbjct: 575 VAHTPQQNGVAERMNRTLLERTRAMLKTAEMAKSFWAEAVKTACYVINRSPSTTIDLKTP 634 Query: 304 K*VWKKEKINIQHLKIFGDEYFCLVPKELRKKLDPKFRSGIFLGYPLGTKGYKLMDCDTK 125 +WK + ++ L +FG + + + R KLDPK R IFLGY KGY+L D + 
Sbjct: 635 MEMWKGKPVDYSSLHVFGCPVYVMYNSQERTKLDPKSRKCIFLGYADNVKGYRLWDPTAR 694 Query: 124 KIIVSRDISNKQNELDNLLEND----ESQILIPETKDIPDNSDTA 2 K++VSRD+ +NEL + +ND E+ IL E K +S A Sbjct: 695 KVVVSRDVVFAENELQSEQKNDSTFKETAILQIEEKSKESDSSEA 739 >emb|CAN68235.1| hypothetical protein VITISV_037104 [Vitis vinifera] Length = 2041 Score = 228 bits (582), Expect = 6e-57 Identities = 155/498 (31%), Positives = 241/498 (48%), Gaps = 10/498 (2%) Frame = -2 Query: 1549 HNPKNCWND---LTNPNAVELKERAKRRMRPARKEANITKVINPNTIALIT-EHINDSNK 1382 H K+CW+ L N N +++ R +K++ P A +T E ND Sbjct: 978 HAEKDCWHKGKPLFNCNFCNKLGHSEKYCRAKKKQSQQ----QPEQHASVTXEEKNDDEH 1033 Query: 1381 YTPELNICIDDNYSI*CLDSGCISHMTGNESILTNITWKEDNNSIKIAGTDMLTIRGFGD 1202 + +DSG SHMT + SI T+I + +K+ + + +G G Sbjct: 1034 LFMASQALSSHELNTWLIDSGXTSHMTKHLSIFTSID-RSVQPKVKLGNGEXVQAKGKGT 1092 Query: 1201 TEIILPNFKLQLKNDLLVHRLRRNLISVKKLVMAG--ITIEGNLTKFHDDQLRLFYNKKL 1028 I + N L + L +NL+SV +++ G ++ + N D K+ Sbjct: 1093 IAISTKRGTKIVTNVLYIPDLDQNLLSVAQMLRNGYXVSFKENFCFISDVHGTEIXKIKM 1152 Query: 1027 ISTSIGSS*LYLLKSDTLRESNLANQRNTWKQWNQNLGHVNDIYLNEIYKKVNGKNLPNT 848 S + LK D + + + + W++ H N L + + +++P Sbjct: 1153 NGNS------FYLKLDLVEGHVFSAKIDESVVWHKRYXHFNLKSLRFMQEAXMVEDMPEI 1206 Query: 847 Q---EICESCAKGKMSRSPFIQSNTK-TSRILELIHSAICGPMPTQSYQGYRYFMSFIDD 680 + CESC GK R PF Q+ +K + LELIHS ICGPM T S YF FIDD Sbjct: 1207 SVNAQTCESCELGKQQRQPFPQNMSKRATHKLELIHSDICGPMSTTSLSNNVYFALFIDD 1266 Query: 679 FSRYSFVYLLKSRDEVFEKFKEYRKLMQNKKGVKLACLRSNGGLEYKSTEFLAYYQNKGI 500 FSR ++VY LK++ +V FK ++K+++ + G + LR++ G EY S EF + Q GI Sbjct: 1267 FSRMTWVYFLKTKSQVLSMFKSFKKMVETQSGQXVKVLRTDNGGEYTSKEFSVFCQEAGI 1326 Query: 499 TREIIAPYSPQ*NGLAE*KI*TLMECVRCLLDNVGLSKKWWGDALLTANYTTNYWPNSNN 320 ++ APYSPQ NG++E K T+ME RC+L L K W +A+ T+ Y N P + Sbjct: 1327 VHQLTAPYSPQXNGVSERKNRTVMEMARCMLFEKKLPKLLWAEAVNTSVYLLNRLPTKSV 1386 Query: 319 DMKIPK*VWKKEKINIQHLKIFGDEYFCLVPKELRKKLDPKFRSGIFLGYPLGTKGYKLM 140 K P W K +++HLK+FG + VP R KLD + G+F+GY +KGY++ Sbjct: 1387 
QSKTPIEAWSGVKPSVKHLKVFGSFCYLHVPSVKRGKLDERAEKGVFVGYAAESKGYRIY 1446 Query: 139 DCDTKKIIVSRDISNKQN 86 KI++SRD+ +N Sbjct: 1447 SLSRMKIVISRDVHFDEN 1464 >gb|ACL97386.1| Gag-Pol polyprotein [Medicago truncatula] Length = 1305 Score = 227 bits (579), Expect = 1e-56 Identities = 163/538 (30%), Positives = 265/538 (49%), Gaps = 22/538 (4%) Frame = -2 Query: 1549 HNPKNCWNDLTNPNAVELKERAKRRMRPARKEANITKVINPNTIALITEHINDSNKYTPE 1370 H K CWN +K+ ++ + + + + I + S+K Sbjct: 238 HVKKECWN---------IKKNGEKNSEASTSQGCVASTSDDGEI--LYSEAATSSKGERR 286 Query: 1369 LNICIDDNYSI*CLDSGCISHMTGN-------ESILTNITWKEDNNSIKIAGTDMLTIRG 1211 LN + +DSG HMT + E I + ++++++IAG + ++ Sbjct: 287 LN-------DVWIMDSGATWHMTPHRDWFYSYEPISEGSVYMGNDHALEIAGVGTIRLKM 339 Query: 1210 FGDTEIILPNFKLQLKNDLLVHRLRRNLISVKKLVMAGITI--EGNLTKFHDDQLRLFYN 1037 T +++ V L++NL+SV +L G I E + K L + Sbjct: 340 HDGTV-------RKIQGVRHVKGLKKNLLSVGQLDDLGCKIHTESGILKVVKGNLVVMKA 392 Query: 1036 KKLISTSIGSS*LYLLKSDTLRESNL----ANQRNTWKQWNQNLGHVNDIYLNEIYKK-- 875 +K+ S LY+L DTL+E++ A+Q T W+Q LGH+++ L + ++ Sbjct: 393 EKITSN------LYMLLGDTLQEADASVAAASQEETTMMWHQRLGHMSERGLKVLVERNL 446 Query: 874 ---VNGKNLPNTQEICESCAKGKMSRSPFIQSNTKTSRILELIHSAICGPMPTQSYQGYR 704 + NLP CE C K R F + T++ IL+LIHS + P S G R Sbjct: 447 LHGLKTVNLP----FCEHCVMSKQHRLKFARVTTRSKHILDLIHSDVW-ESPEISLGGAR 501 Query: 703 YFMSFIDDFSRYSFVYLLKSRDEVFEKFKEYRKLMQNKKGVKLACLRSNGGLEYKSTEFL 524 YF+SFIDD+SR +VY +K + +VF FK ++ ++ + K+ CLR++ G EY EFL Sbjct: 502 YFVSFIDDYSRRLWVYPIKKKSDVFPVFKAFKAQIELETEKKIKCLRTDNGGEYVDGEFL 561 Query: 523 AYYQNKGITREIIAPYSPQ*NGLAE*KI*TLMECVRCLLDNVGLSKKWWGDALLTANYTT 344 A+ + +GI R+ ++PQ NG+AE TL+E R +L G++K +W +A TA Y Sbjct: 562 AFCKQEGIVRQFTVAHTPQQNGVAERMNRTLLERTRAMLKTAGMAKSFWAEAAKTACYVI 621 Query: 343 NYWPNSNNDMKIPK*VWKKEKINIQHLKIFGDEYFCLVPKELRKKLDPKFRSGIFLGYPL 164 N P++ D+K P +WK + ++ L +FG + + + + KLDPK R IFLGY Sbjct: 622 NRSPSTAIDLKTPMEMWKGKPVDYSSLHVFGCPVYVMYNSQEKTKLDPKSRKCIFLGYAD 681 Query: 163 GTKGYKLMDCDTKKIIVSRDISNKQNELDNLLEND----ESQILIPETKDIPDNSDTA 2 KGY+L D +K++VSRD+ +NEL 
+ +ND E+ I+ E K +S A Sbjct: 682 NVKGYRLWDPTARKVVVSRDVVFAENELQSEQKNDSTSKETAIVQMEEKSKESDSSEA 739 >gb|ACL97383.1| Gag-Pol polyprotein [Medicago truncatula] Length = 1305 Score = 227 bits (579), Expect = 1e-56 Identities = 163/538 (30%), Positives = 266/538 (49%), Gaps = 22/538 (4%) Frame = -2 Query: 1549 HNPKNCWNDLTNPNAVELKERAKRRMRPARKEANITKVINPNTIALITEHINDSNKYTPE 1370 H K CWN +K+ ++ + + + + I + S+K Sbjct: 238 HVKKECWN---------IKKNGEKNSEASTSQGCVASTSDDGEI--LYSEAATSSKGERR 286 Query: 1369 LNICIDDNYSI*CLDSGCISHMTGN-------ESILTNITWKEDNNSIKIAGTDMLTIRG 1211 LN + +DSG HMT + E I + ++++++IAG + ++ Sbjct: 287 LN-------DVWIMDSGATWHMTPHRDWFFSYEPISEGSVYMGNDHALEIAGVGTIRLKM 339 Query: 1210 FGDTEIILPNFKLQLKNDLLVHRLRRNLISVKKLVMAGITI--EGNLTKFHDDQLRLFYN 1037 T +++ V L++NL+SV +L G I E + K L + Sbjct: 340 HDGTV-------RKIQGVRHVKGLKKNLLSVGQLDDLGCKIHTESGILKVVKGNLVVMKA 392 Query: 1036 KKLISTSIGSS*LYLLKSDTLRESNLA----NQRNTWKQWNQNLGHVNDIYLNEIYKK-- 875 +K+ S LY+L DTL+E++ + +Q T W+Q LGH+++ L + ++ Sbjct: 393 EKITSN------LYMLLGDTLQEADASVAASSQEETTMMWHQRLGHMSERGLKVLAERNL 446 Query: 874 ---VNGKNLPNTQEICESCAKGKMSRSPFIQSNTKTSRILELIHSAICGPMPTQSYQGYR 704 + NLP CE C K R F + T++ IL+LIHS + P S G R Sbjct: 447 LHGLKAVNLP----FCEHCVISKQHRLKFARVTTRSKHILDLIHSDVW-ESPEISLGGAR 501 Query: 703 YFMSFIDDFSRYSFVYLLKSRDEVFEKFKEYRKLMQNKKGVKLACLRSNGGLEYKSTEFL 524 YF+SFIDD+SR +VY +K + +VF FK ++ ++ + K+ CLR++ G EY EFL Sbjct: 502 YFVSFIDDYSRRLWVYPIKKKSDVFPVFKAFKAQIELETRKKIKCLRTDNGGEYIDGEFL 561 Query: 523 AYYQNKGITREIIAPYSPQ*NGLAE*KI*TLMECVRCLLDNVGLSKKWWGDALLTANYTT 344 A+ + +GI R+ ++PQ NG+AE TL+E R +L G++K +W +A+ TA Y Sbjct: 562 AFCKQEGIVRQFTVAHTPQQNGVAERMNRTLLERTRAMLKTAGMAKSFWAEAVKTACYVI 621 Query: 343 NYWPNSNNDMKIPK*VWKKEKINIQHLKIFGDEYFCLVPKELRKKLDPKFRSGIFLGYPL 164 N P++ D+K P +WK + ++ L +FG + + + R KLDPK R IFLGY Sbjct: 622 NRSPSTAIDLKTPMEMWKGKPVDYSSLHVFGCPVYVMYNSQERTKLDPKSRKCIFLGYAD 681 Query: 163 GTKGYKLMDCDTKKIIVSRDISNKQNELDNLLEND----ESQILIPETKDIPDNSDTA 2 KGY+L D +K++VSRD+ +NEL + +ND E+ I+ E K +S A Sbjct: 682 
NVKGYRLWDPTARKVVVSRDVVFAENELQSEQKNDSTSKETAIVQMEEKSKESDSSEA 739 >ref|XP_005715938.1| unnamed protein product [Chondrus crispus] gi|507112437|emb|CDF36119.1| unnamed protein product [Chondrus crispus] Length = 753 Score = 202 bits (515), Expect(2) = 2e-56 Identities = 143/509 (28%), Positives = 253/509 (49%), Gaps = 7/509 (1%) Frame = -2 Query: 1549 HNPKNCWNDLTNPNAVELKERAKRRMRPARKEANITKVINPNTIALITEHINDSNKYTPE 1370 H CW N K RM ++ A +T+ +P+ + + +K + Sbjct: 229 HTATRCWGKDLNGRRPAPPSGYKPRMSQRKQSAFVTQKPDPDVVVNSVDFTCLMSKASRT 288 Query: 1369 LNICIDDNYSI*CLDSGCISHMTGNESILTNITWKEDNNSIKIAGTDMLTIRGFGDTEII 1190 ++ + ++ + DS C +H+T + S+ E + S+++ + G GD + Sbjct: 289 NDLEMSPSWLV---DSACTAHITYDRSLFATYEPLE-SASVQMGTKASAKVAGRGDVHLK 344 Query: 1189 LP-NFKLQ---LKNDLLVHRLRRNLISVKKLVMAGITIEGNLTKFHDDQLRLFYNKKLIS 1022 L N +++ L + L V +L+SV ++ G+ + F + + + +++ Sbjct: 345 LNVNGRIEPCKLTDVLHVPDFAFSLLSVSRMTELGLKVG-----FENGKCMIRRGSTVVA 399 Query: 1021 TSIGSS*LYLLKSDTLRESNLANQRNTWKQWNQNLGHVNDIYLNEIYKKVNGKNLPNTQE 842 T+ LY+L D + + A+ T + W++ H N K+N N E Sbjct: 400 TATLVGELYVL--DIVSDVGSAHAA-TLQTWHERFAHAN---------KINNTNNDCISE 447 Query: 841 ICESCAKGKMSRS--PFIQSNTKTSRILELIHSAICGPMPTQSYQGYRYFMSFIDDFSRY 668 C +C GK +RS P +S+ + L+L+HS +CGP+ QS G +YF++FIDD S + Sbjct: 448 KCSACVYGKATRSVIPKERSSRRAYFCLDLVHSDVCGPLEVQSIGGAKYFITFIDDHSNW 507 Query: 667 SFVYLLKSRDEVFEKFKEYRKLMQNKKGVKLACLRSNGGLEYKSTEFLAYYQNKGITREI 488 S VY + + E FE++K + +L Q G K+ LR++ G EY STEF ++ G ++ Sbjct: 508 SVVYPMHHKSEAFERYKTFAQLAQTHTGRKIKVLRTDRGGEYLSTEFKSFLIANGTQHQM 567 Query: 487 IAPYSPQ*NGLAE*KI*TLMECVRCLLDNVGLSKKWWGDALLTANYTTNYWPNSNNDMKI 308 Y+P+ NG+AE TL+ VR +L + ++K +W +AL A Y N + K+ Sbjct: 568 TTAYTPEQNGVAERLNRTLVNLVRSMLAHKKVTKGFWAEALANAVYVRNRVTSRAIPPKM 627 Query: 307 -PK*VWKKEKINIQHLKIFGDEYFCLVPKELRKKLDPKFRSGIFLGYPLGTKGYKLMDCD 131 P +W K N+ HL++FG + + +PK +KLD + + +FLGY TK YKL D D Sbjct: 628 TPYHLWMKSVPNVGHLRVFGSKCWYTLPKHNIQKLDARAKEAMFLGYAENTKAYKLWDGD 687 Query: 130 TKKIIVSRDISNKQNELDNLLENDESQIL 44 K+IVSRD++ ++ ++N +++L Sbjct: 688 
LHKVIVSRDVTFDESTGGKSVQNRATRVL 716 Score = 45.8 bits (107), Expect(2) = 2e-56 Identities = 42/160 (26%), Positives = 68/160 (42%), Gaps = 4/160 (2%) Frame = -3 Query: 2031 LLSLSVSEEIAPLIRKSQNAHDAWTRLNKHFGRKNPTKLRLLIAEIENLKMKEDENSAKL 1852 L+ LS+S+E +R +AH+ W + F R E +KM E Sbjct: 67 LIGLSLSDEHLEHVRDVDSAHEMWEAIVNVFERHTLLNKLAARREFYTVKMLSGEKVLAY 126 Query: 1851 IRKVLYLQQQIEDQGKNLLDIDLIHYTLKSLPLKFVDFIRKFD---NDDQDITYDVFCNK 1681 I +V L ++ N+ D ++ L LP +F I D N+++ + D ++ Sbjct: 127 INRVKQLAAILKSMSVNIDDKEMAMAVLNGLPARFEALIVALDALGNEEKIFSLDFVKSR 186 Query: 1680 LQIVETKLALRKKLPDQFDAMVAHRYPNKRRKPTY-CKHC 1564 L + E + A K Q A+V +R PN R Y C +C Sbjct: 187 L-LQEEQRANMKSSSSQTSALV-NRAPNNRDINDYKCTNC 224 >gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabidopsis thaliana] Length = 1352 Score = 224 bits (572), Expect = 8e-56 Identities = 137/417 (32%), Positives = 223/417 (53%), Gaps = 7/417 (1%) Frame = -2 Query: 1330 LDSGCISHMTGNESILTNITWKEDNNSIKIAGTDMLTIRGFGDTEIILPNFKLQ-LKNDL 1154 LDSG +HM G +S+ + N + + + ++G G+ I L N Q + N Sbjct: 337 LDSGASNHMCGRKSMFAELDESVRGN-VALGDESKMEVKGKGNILIRLKNGDHQFISNVY 395 Query: 1153 LVHRLRRNLISVKKLVMAG--ITIEGNLTKFHDDQLRLFYNKKLISTSIGSS*LYLLKSD 980 + ++ N++S+ +L+ G I ++ N D + L + S + +++D Sbjct: 396 YIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITK---VPMSKNRMFVLNIRND 452 Query: 979 TLRESNLANQRNTWKQWNQNLGHVNDIYLNEIYKKVNGKNLP---NTQEICESCAKGKMS 809 + + + +W W+ GH+N L + +K + LP + ++CE C GK Sbjct: 453 IAQCLKMCYKEESWL-WHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVCEGCLLGKQF 511 Query: 808 RSPFI-QSNTKTSRILELIHSAICGPMPTQSYQGYRYFMSFIDDFSRYSFVYLLKSRDEV 632 + F +S+++ + LELIH+ +CGP+ +S YF+ FIDDFSR ++VY LK + EV Sbjct: 512 KMSFPKESSSRAQKSLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFLKEKSEV 571 Query: 631 FEKFKEYRKLMQNKKGVKLACLRSNGGLEYKSTEFLAYYQNKGITREIIAPYSPQ*NGLA 452 FE FK+++ ++ + G+ + +RS+ G E+ S EFL Y ++ GI R++ P SPQ NG+A Sbjct: 572 FEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVA 631 Query: 451 E*KI*TLMECVRCLLDNVGLSKKWWGDALLTANYTTNYWPNSNNDMKIPK*VWKKEKINI 272 E K T++E R +L + 
L K+ W +A+ A Y N P + K P+ W K + Sbjct: 632 ERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSPTKSVSGKTPQEAWSGRKSGV 691 Query: 271 QHLKIFGDEYFCLVPKELRKKLDPKFRSGIFLGYPLGTKGYKLMDCDTKKIIVSRDI 101 HL++FG VP E R KLD K IF+GY +KGYKL + DTKK I+SR+I Sbjct: 692 SHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNI 748 >emb|CAA31653.1| polyprotein [Arabidopsis thaliana] Length = 1291 Score = 224 bits (570), Expect = 1e-55 Identities = 156/464 (33%), Positives = 239/464 (51%), Gaps = 20/464 (4%) Frame = -2 Query: 1432 NPNTIALITEHINDSNKYTPELNICIDDNYSI*CLDSGCISHMT------------GNES 1289 NP +ITE + S + ++ + D I LDSGC SHM+ G + Sbjct: 290 NPGEAGVITEKLVFSEALSVN-DLAVRD---IWVLDSGCTSHMSARRDWFCSFREDGGPT 345 Query: 1288 ILTNITWKEDNNSIKIAGTDMLTIRGFGDTEIILPNFKLQLKNDLLVHRLRRNLISVKKL 1109 IL D++S+K G + I G T I L N K V LRRNLIS L Sbjct: 346 ILLG-----DDHSVKSQGQGSIKIETHGGTIIGLENVKY-------VPELRRNLISTGTL 393 Query: 1108 VMAGITIEGNLTKFHDDQLRLFYNKKLISTSIGSS*LYLLKSDTLRESNLANQRNTWKQ- 932 G EG D ++R F N+K + LY+L +T+ + + K Sbjct: 394 DKRGYKHEGG-----DGKVRYFKNQKTALRGELVNGLYILDGNTVLSETCVAEGSKGKTE 448 Query: 931 -WNQNLGHVNDIYLNEIYKKVNGKNLPNTQEI-----CESCAKGKMSRSPFIQSNTKTSR 770 W+ LGH+ LN + K + GK L + +EI CE+C GK + F + Sbjct: 449 LWHSRLGHIG---LNNM-KVLAGKGLVSKEEIRVLDFCENCVMGKAKKVSFNVGKHNSED 504 Query: 769 ILELIHSAICGPMP-TQSYQGYRYFMSFIDDFSRYSFVYLLKSRDEVFEKFKEYRKLMQN 593 +L +H+ + G T S G +YF+S IDD +R ++Y L+S+DE F++F E+++L++N Sbjct: 505 VLRYVHADLWGSTNVTPSLSGNKYFLSIIDDKTRKVWLYFLRSKDETFDRFCEWKELVEN 564 Query: 592 KKGVKLACLRSNGGLEYKSTEFLAYYQNKGITREIIAPYSPQ*NGLAE*KI*TLMECVRC 413 ++ K+ CLR++ GLE+ + +F AY + GI R Y+PQ NG+AE T+ME VRC Sbjct: 565 QQNKKVKCLRTDNGLEFCNLKFDAYCKEHGIERHKTCTYTPQQNGVAERMNRTIMEKVRC 624 Query: 412 LLDNVGLSKKWWGDALLTANYTTNYWPNSNNDMKIPK*VWKKEKINIQHLKIFGDEYFCL 233 +L+ GL +++W +A TA Y N P S D +P+ +W +K +HL+ FG + Sbjct: 625 MLNESGLGEEFWAEAAATAAYLINRSPASAIDHNVPEELWLNKKPGYKHLRRFGSIAYVH 684 Query: 232 VPKELRKKLDPKFRSGIFLGYPLGTKGYKLMDCDTKKIIVSRDI 101 + + KL P+ GIF+GYP GTKGYK+ + K ++SR++ Sbjct: 685 
ID---QGKLKPRALKGIFIGYPAGTKGYKIWLLEEHKCVISRNV 725 >gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabidopsis thaliana] gi|12321387|gb|AAG50765.1|AC079131_10 copia-type polyprotein, putative [Arabidopsis thaliana] Length = 1320 Score = 224 bits (570), Expect = 1e-55 Identities = 137/417 (32%), Positives = 223/417 (53%), Gaps = 7/417 (1%) Frame = -2 Query: 1330 LDSGCISHMTGNESILTNITWKEDNNSIKIAGTDMLTIRGFGDTEIILPNFKLQ-LKNDL 1154 LDSG +HM G +S+ + N + + + ++G G+ I L N Q + N Sbjct: 337 LDSGASNHMCGRKSMFAELDESVRGN-VALGDESKMEVKGKGNILIRLKNGDHQFISNVY 395 Query: 1153 LVHRLRRNLISVKKLVMAG--ITIEGNLTKFHDDQLRLFYNKKLISTSIGSS*LYLLKSD 980 + ++ N++S+ +L+ G I ++ N D + L + S + +++D Sbjct: 396 YIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITK---VPMSKNRMFVLNIRND 452 Query: 979 TLRESNLANQRNTWKQWNQNLGHVNDIYLNEIYKKVNGKNLP---NTQEICESCAKGKMS 809 + + + +W W+ GH+N L + +K + LP + ++CE C GK Sbjct: 453 IAQCLKMCYKEESWL-WHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVCEGCLLGKQF 511 Query: 808 RSPFI-QSNTKTSRILELIHSAICGPMPTQSYQGYRYFMSFIDDFSRYSFVYLLKSRDEV 632 + F +S+++ + LELIH+ +CGP+ +S YF+ FIDDFSR ++VY LK + EV Sbjct: 512 KMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFLKEKSEV 571 Query: 631 FEKFKEYRKLMQNKKGVKLACLRSNGGLEYKSTEFLAYYQNKGITREIIAPYSPQ*NGLA 452 FE FK+++ ++ + G+ + +RS+ G E+ S EFL Y ++ GI R++ P SPQ NG+A Sbjct: 572 FEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVA 631 Query: 451 E*KI*TLMECVRCLLDNVGLSKKWWGDALLTANYTTNYWPNSNNDMKIPK*VWKKEKINI 272 E K T++E R +L + L K+ W +A+ A Y N P + K P+ W K + Sbjct: 632 ERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGV 691 Query: 271 QHLKIFGDEYFCLVPKELRKKLDPKFRSGIFLGYPLGTKGYKLMDCDTKKIIVSRDI 101 HL++FG VP E R KLD K IF+GY +KGYKL + DTKK I+SR+I Sbjct: 692 SHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNI 748 >pir||S23319 hypothetical protein 2 - Arabidopsis thaliana retrotransposon Ta1-2 (strain Landsberg) (fragment) gi|16384|emb|CAA37924.1| unnamed protein product [Arabidopsis thaliana] Length = 1084 Score = 223 bits (569), Expect = 2e-55 
Identities = 151/459 (32%), Positives = 239/459 (52%), Gaps = 15/459 (3%) Frame = -2 Query: 1432 NPNTIALITEHINDSNKYTPELNICIDDNYSI*CLDSGCISHMTGNESILTNITWKE--- 1262 NP +ITE + S + ++ + D I LDSGC SHM+ + N Sbjct: 190 NPGEAGVITEKLVFSEALSVN-DLAVRD---IWVLDSGCTSHMSARKDWFCNFRKDGGTT 245 Query: 1261 ----DNNSIKIAGTDMLTIRGFGDTEIILPNFKLQLKNDLLVHRLRRNLISVKKLVMAGI 1094 D++S+K G + I G T +L N K V LRRNLIS L G Sbjct: 246 ILLGDDHSVKSQGQGSIKIDTHGGTITVLENVKY-------VPELRRNLISTGTLDKRGY 298 Query: 1093 TIEGNLTKFHDDQLRLFYNKKLISTSIGSS*LYLLKSDTLRESNLANQRNTWKQ--WNQN 920 EG D ++R F N+K + LY+L +T+ + + K W+ Sbjct: 299 KHEGG-----DGKVRYFKNQKTALRGEIVNGLYILDGNTILSETCVAEGSKGKTELWHSR 353 Query: 919 LGHVNDIYLNEIYKKVNGKNLPNTQEI-----CESCAKGKMSRSPFIQSNTKTSRILELI 755 LGH+ LN + K + GK L + +EI CE+C GK + F + +L + Sbjct: 354 LGHMG---LNNM-KVLAGKGLVSKEEIRELDFCENCVMGKAKKVSFNMGKHNSEYVLSYV 409 Query: 754 HSAICGPMP-TQSYQGYRYFMSFIDDFSRYSFVYLLKSRDEVFEKFKEYRKLMQNKKGVK 578 H+ + G T S G +YF+S IDD +R ++Y L+S+DE F++F E ++L++N++ K Sbjct: 410 HADLWGSTNVTPSLSGNKYFLSIIDDKTRKVWLYFLRSKDETFDRFCERKELVENQQNKK 469 Query: 577 LACLRSNGGLEYKSTEFLAYYQNKGITREIIAPYSPQ*NGLAE*KI*TLMECVRCLLDNV 398 + CLR++ GLE+ + +F AY ++ GI R + Y+PQ NG+A+ T+ME VRC+L+ Sbjct: 470 VKCLRTDNGLEFCNLKFDAYCKDHGIERHMTCTYTPQQNGVADRMNRTIMEKVRCMLNES 529 Query: 397 GLSKKWWGDALLTANYTTNYWPNSNNDMKIPK*VWKKEKINIQHLKIFGDEYFCLVPKEL 218 GL +++W +A TA Y + P S D +P+ +W +K +HL+ FG + + Sbjct: 530 GLGEEFWAEAAATAAYLISRSPASAIDHNVPEELWLNKKPGYKHLRRFGSIAYVHID--- 586 Query: 217 RKKLDPKFRSGIFLGYPLGTKGYKLMDCDTKKIIVSRDI 101 + KL P+ GIF+GYP GTKGYK+ + +K ++SR++ Sbjct: 587 QGKLKPRALKGIFIGYPSGTKGYKIWLLEEQKCVISRNV 625 >ref|XP_002064813.1| GK15001 [Drosophila willistoni] gi|194160898|gb|EDW75799.1| GK15001 [Drosophila willistoni] Length = 1249 Score = 223 bits (569), Expect = 2e-55 Identities = 143/418 (34%), Positives = 219/418 (52%), Gaps = 7/418 (1%) Frame = -2 Query: 1333 CLDSGCISHMTGNESILTNITWKEDNNSIKIAGTDMLTIRGFGDTEIILPNFKLQLKNDL 1154 CLDSG SHM ++S+ ++ + ++ S+ AG L G G I KL + N L Sbjct: 270 
CLDSGATSHMCCDKSMFSDFSVHDEKISLADAG--YLRAEGKGKVTIRTGICKLTMNNVL 327 Query: 1153 LVHRLRRNLISVKKLVMAGITIEGNLTKFHDDQLRLFYNKKLISTSIGSS*LYLLKSDTL 974 V L N +SV +++ + F ++ N + I + L++ ++++ Sbjct: 328 YVPGLAGNFMSVARVIEYNSVVH-----FEKHMAKIIQNGECILKAKKIGNLFVFEAES- 381 Query: 973 RESNLANQRNTWKQWNQNLGHVNDIYLNEIYKK--VNGKNL----PNTQEICESCAKGKM 812 E+ A W++ GH+N L +I K V G ++ PNT C++C K+ Sbjct: 382 -ENLFAAVGEDVSLWHKRFGHLNYKSLTQIASKGLVRGLSVTNFAPNTP--CKTCMVSKI 438 Query: 811 SRSPFIQ-SNTKTSRILELIHSAICGPMPTQSYQGYRYFMSFIDDFSRYSFVYLLKSRDE 635 PF + + +++S +L+L+HS +CGP T+S G RYF++FIDD SR FVY LK +DE Sbjct: 439 HVQPFPKMTESRSSELLQLVHSDVCGPFGTKSLGGSRYFLTFIDDKSRRIFVYFLKGKDE 498 Query: 634 VFEKFKEYRKLMQNKKGVKLACLRSNGGLEYKSTEFLAYYQNKGITREIIAPYSPQ*NGL 455 VF KF E++ L++ + G KL C+RS+ G EY + F Y + GI R++ Y+PQ NG+ Sbjct: 499 VFGKFLEFKSLVERQTGKKLKCIRSDNGREYVNNAFDDYLKKNGILRQLTIAYTPQQNGV 558 Query: 454 AE*KI*TLMECVRCLLDNVGLSKKWWGDALLTANYTTNYWPNSNNDMKIPK*VWKKEKIN 275 AE TL+E RCLL GL + W +A+ TA Y N P S + P W +K Sbjct: 559 AERANRTLVEMSRCLLAQSGLCEALWAEAIFTAVYLRNRSPTSALTNQTPMEAWTGKKPC 618 Query: 274 IQHLKIFGDEYFCLVPKELRKKLDPKFRSGIFLGYPLGTKGYKLMDCDTKKIIVSRDI 101 I HLK+FG L K PK + +GY KGY+L D +T+K++ RD+ Sbjct: 619 INHLKVFGSVAVALSKGHQESKFRPKGKEYRMVGYSREAKGYRLYDGETRKVVERRDV 676 >gb|AAD17409.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1347 Score = 223 bits (569), Expect = 2e-55 Identities = 141/448 (31%), Positives = 236/448 (52%), Gaps = 7/448 (1%) Frame = -2 Query: 1330 LDSGCISHMTGNESILTNITWKEDNNSIKIAGTDMLTIRGFGDTEIILPNFKLQLKNDLL 1151 +DSGC +HMT E +NI K I++ D++ G GD ++ + K +KN L Sbjct: 328 VDSGCTNHMTKEERYFSNIN-KSIKVPIRVRNGDIVMTAGKGDITVMTRHGKRIIKNVFL 386 Query: 1150 VHRLRRNLISVKKLVMAGITIEGNLTKFHDDQLRLF-YNKKLISTSIGSS*LYLLKSDTL 974 V L +NL+SV +++ +G + +F D + + N K I + + +K ++ Sbjct: 387 VPGLEKNLLSVPQIISSGYWV-----RFQDKRCIIQDANGKEIMNIEMTDKSFKIKLSSV 441 Query: 973 RESNLANQRNTWKQWNQNLGHVNDIYLNEIYKKVNGKNLPN---TQEICESCAKGKMSRS 803 E + T + W++ LGHV++ L ++ K LP T+E C++C GK SR Sbjct: 442 
EEEAMTANVQTEETWHKRLGHVSNKRLQQMQDKELVNGLPRFKVTKETCKACNLGKQSRK 501 Query: 802 PFI-QSNTKTSRILELIHSAICGPMPTQSYQGYRYFMSFIDDFSRYSFVYLLKSRDEVFE 626 F +S TKT LE++H+ +CGPM QS G RY++ F+DD++ +VY LK + E F Sbjct: 502 SFPKESQTKTREKLEIVHTDVCGPMQHQSIDGSRYYVLFLDDYTHMCWVYFLKQKSETFA 561 Query: 625 KFKEYRKLMQNKKGVKLACLRSNGGLEYKSTEFLAYYQNKGITREIIAPYSPQ*NGLAE* 446 FK+++ L++ + + LR + +++GI R++ PYSPQ NG AE Sbjct: 562 TFKKFKALVEKQSNCSIKTLR----------PMEVFCEDEGINRQVTLPYSPQQNGAAER 611 Query: 445 KI*TLMECVRCLLDNVGLSKKWWGDALLTANYTTNYWPN-SNNDMKIPK*VWKKEKINIQ 269 K +L+E R +L L K W +A+ T+ Y N P+ + D P W K N+ Sbjct: 612 KNRSLVEMARSMLVEQDLPLKLWAEAVYTSAYLQNRLPSKAIEDDVTPMEKWCGHKPNVS 671 Query: 268 HLKIFGDEYFCLVPKELRKKLDPKFRSGIFLGYPLGTKGYKLMDCDTKKIIVSRDISNKQ 89 HL+IFG + +P + R+KLD K + GI +GY TKGY++ + +K+ VSRD+ ++ Sbjct: 672 HLRIFGSICYVHIPDQKRRKLDAKAKCGILIGYSNQTKGYRVFLLEDEKVEVSRDVVFQE 731 Query: 88 NELDNLLENDE-SQILIPETKDIPDNSD 8 ++ + + +E + + DI ++ D Sbjct: 732 DKKWDWDKQEEVKKTFVMSINDIQESRD 759 >gb|EOY11267.1| Uncharacterized protein TCM_026511 [Theobroma cacao] Length = 1318 Score = 222 bits (566), Expect = 4e-55 Identities = 145/437 (33%), Positives = 230/437 (52%), Gaps = 9/437 (2%) Frame = -2 Query: 1342 SI*CLDSGCISHMTGNESILTNITWKEDNNSIKIAGTDMLTIRGFGDTEIILPNFKLQLK 1163 SI +DS C +H+TG ++ K ++++I ++L I G G I + Sbjct: 327 SIWLIDSACSTHITGKIKNFLDLN-KAYKSTVEIGDGNLLKIAGRGTVGITTKKGMKTIA 385 Query: 1162 NDLLVHRLRRNLISVKKLVMAGITIEGNLTKFHDDQLRLFY-NKKLISTSIGSS*LYLLK 986 N + +NL+SV +LV E N F D+ +F + + I+T + + L Sbjct: 386 NVCFAPEVTQNLLSVGQLVK-----EKNSLLFKDELCTIFDPSGREIATVKMRNKCFPLD 440 Query: 985 SDTLRESNLANQRNTWKQWNQNLGHVNDIYLNEIYKKVNGKNLPNTQEI-------CESC 827 + N + W++ LGH+N ++ K + NL N I CE C Sbjct: 441 LNEAGHMAYKCVSNEARLWHRRLGHINYQFI----KNMGSLNLVNDMPIITEVEKTCEVC 496 Query: 826 AKGKMSRSPFI-QSNTKTSRILELIHSAICGPMPTQSYQGYRYFMSFIDDFSRYSFVYLL 650 +GK SR PF QS T+T+ L+LIH+ ICGP+ T S G +YF+ FIDDFSR+ +++ L Sbjct: 497 LQGKQSRHPFPKQSQTRTANRLQLIHTDICGPIGTLSLNGNKYFILFIDDFSRFCWIFFL 556 Query: 649 
KSRDEVFEKFKEYRKLMQNKKGVKLACLRSNGGLEYKSTEFLAYYQNKGITREIIAPYSP 470 K + E + F +++ L++ + K+ LRS+ G EY S EF A +GI + + PYSP Sbjct: 557 KQKSEAIQYFMKFKVLVEKQTDQKIKALRSDNGSEYTSNEFKALLTQEGIKQFLTVPYSP 616 Query: 469 Q*NGLAE*KI*TLMECVRCLLDNVGLSKKWWGDALLTANYTTNYWPNSNNDMKIPK*VWK 290 Q NG++E K T+ME +RCLL + K +W +A A N P + + P VW Sbjct: 617 QQNGVSERKNRTIMEMIRCLLFEQQMPKYFWAEAANFAVTLQNLIPTTALNSMTPFEVWH 676 Query: 289 KEKINIQHLKIFGDEYFCLVPKELRKKLDPKFRSGIFLGYPLGTKGYKLMDCDTKKIIVS 110 K +I ++K+FG + VP++ R KLD K + I LGY +KGY+L + +TKK+ +S Sbjct: 677 GYKPSISNVKVFGCIAYAQVPQQKRTKLDSKTQISINLGYSSVSKGYRLFNVETKKVFIS 736 Query: 109 RDISNKQNELDNLLEND 59 RD+ ++ N ++N+ Sbjct: 737 RDVVFNEDIHWNWMKNE 753 >emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana] Length = 1352 Score = 222 bits (566), Expect = 4e-55 Identities = 136/417 (32%), Positives = 222/417 (53%), Gaps = 7/417 (1%) Frame = -2 Query: 1330 LDSGCISHMTGNESILTNITWKEDNNSIKIAGTDMLTIRGFGDTEIILPNFKLQ-LKNDL 1154 LDSG +HM G +S+ + N + + + ++G G+ I L N Q + N Sbjct: 337 LDSGASNHMCGRKSMFAELDESVRGN-VALGDESKMEVKGKGNILIRLKNGDHQFISNVY 395 Query: 1153 LVHRLRRNLISVKKLVMAG--ITIEGNLTKFHDDQLRLFYNKKLISTSIGSS*LYLLKSD 980 + ++ N++S+ +L+ G I ++ N D + L + S + +++D Sbjct: 396 YIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITK---VPMSKNRMFVLNIRND 452 Query: 979 TLRESNLANQRNTWKQWNQNLGHVNDIYLNEIYKKVNGKNLP---NTQEICESCAKGKMS 809 + + + +W W+ GH+N L + +K + LP + ++CE C GK Sbjct: 453 IAQCLKMCYKEESWL-WHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVCEGCLLGKQF 511 Query: 808 RSPFI-QSNTKTSRILELIHSAICGPMPTQSYQGYRYFMSFIDDFSRYSFVYLLKSRDEV 632 + F +S+++ + LELIH+ +CGP+ +S YF+ FIDDFSR ++VY LK + EV Sbjct: 512 KMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFLKEKSEV 571 Query: 631 FEKFKEYRKLMQNKKGVKLACLRSNGGLEYKSTEFLAYYQNKGITREIIAPYSPQ*NGLA 452 FE FK+++ ++ + G+ + +RS+ G E+ S EFL Y ++ GI R++ P SPQ NG+ Sbjct: 572 FEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVV 631 Query: 451 E*KI*TLMECVRCLLDNVGLSKKWWGDALLTANYTTNYWPNSNNDMKIPK*VWKKEKINI 272 E K T++E R +L + L K+ W +A+ A Y N P + K P+ W K + 
Sbjct: 632 ERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGV 691 Query: 271 QHLKIFGDEYFCLVPKELRKKLDPKFRSGIFLGYPLGTKGYKLMDCDTKKIIVSRDI 101 HL++FG VP E R KLD K IF+GY +KGYKL + DTKK I+SR+I Sbjct: 692 SHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNI 748