BLASTX nr result
ID: Astragalus22_contig00034596
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00034596 (1033 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
          149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CBI37296.3| unnamed protein product, partial [Vitis vinifera]      76   4e-24
gb|PNX66220.1| copia-type polyprotein, partial [Trifolium pratense]    73   6e-24
gb|KYP50278.1| Retrovirus-related Pol polyprotein from transposo...    80   4e-23
gb|PNX99782.1| copia-type polyprotein [Trifolium pratense]             77   1e-22
dbj|GAU26746.1| hypothetical protein TSUD_317440 [Trifolium subt...   107   7e-22
gb|PRQ17740.1| putative RNA-directed DNA polymerase [Rosa chinen...   103   2e-20
gb|PRQ42077.1| putative RNA-directed DNA polymerase [Rosa chinen...   102   6e-20
gb|PRQ52345.1| putative RNA-directed DNA polymerase [Rosa chinen...    97   2e-18
gb|PNX93789.1| copia-type polyprotein [Trifolium pratense]             97   2e-18
gb|KYP44586.1| Retrovirus-related Pol polyprotein from transposo...    96   8e-18
ref|XP_017188308.1| PREDICTED: uncharacterized protein LOC108173...    92   6e-17
gb|PNY15642.1| copia-type polyprotein [Trifolium pratense]             92   9e-17
dbj|GAU41840.1| hypothetical protein TSUD_177510 [Trifolium subt...    92   2e-16
gb|PNX89974.1| copia-type polyprotein, partial [Trifolium pratense]    91   2e-16
gb|PNX95763.1| retrotransposon-related protein, partial [Trifoli...
87 6e-15 gb|PNX94698.1| copia-type polyprotein [Trifolium pratense] 87 9e-15 gb|PNX77752.1| copia-type polyprotein, partial [Trifolium pratense] 86 1e-14 gb|PNX91151.1| copia-type polyprotein, partial [Trifolium pratense] 85 2e-14 gb|PNX96091.1| retrotransposon-related protein [Trifolium pratense] 86 2e-14 gb|KYP37051.1| Retrovirus-related Pol polyprotein from transposo... 85 3e-14 >emb|CBI37296.3| unnamed protein product, partial [Vitis vinifera] Length = 3048 Score = 76.3 bits (186), Expect(2) = 4e-24 Identities = 38/71 (53%), Positives = 46/71 (64%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+NRTI+NMVR +LS KKLPK FW +AV W V V RSPT A+ AW Sbjct: 613 QNGVAERKNRTIMNMVRSMLSAKKLPKTFWPEAVNWTVHVLNRSPTFAVQNKTPEEAWGK 672 Query: 414 LKSSVHFFQNF 446 LK SV +F+ F Sbjct: 673 LKPSVDYFRVF 683 Score = 65.1 bits (157), Expect(2) = 4e-24 Identities = 35/80 (43%), Positives = 51/80 (63%), Gaps = 8/80 (10%) Frame = +1 Query: 433 FFRTFRCVAHLHVHGAQRKKLDNS--------VSEESKSYMLYDPIAKRILINRDVKFVR 588 +FR F C++H+HV ++R KLD+ VSEESK+Y LYDPI+++I+I+RDV F Sbjct: 679 YFRVFGCLSHVHVPDSKRTKLDDKSFSCVLLGVSEESKAYRLYDPISQKIIISRDVVFEE 738 Query: 589 MKHGSGEKKAQIAKFRLLIW 648 K+ +KK + A L W Sbjct: 739 DKNWDWDKKYEEAIVCDLEW 758 >gb|PNX66220.1| copia-type polyprotein, partial [Trifolium pratense] Length = 268 Score = 73.2 bits (178), Expect(2) = 6e-24 Identities = 32/71 (45%), Positives = 46/71 (64%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+NRT+LN+VR ++ + +PK FW +A+KWA +V RSPT ++ AW G Sbjct: 138 QNGVSERKNRTLLNIVRSMIHARSVPKRFWPEAIKWATYVMNRSPTLSVKDMTPEEAWSG 197 Query: 414 LKSSVHFFQNF 446 K SVH F+ F Sbjct: 198 RKPSVHHFKVF 208 Score = 67.4 bits (163), Expect(2) = 6e-24 Identities = 33/61 (54%), Positives = 42/61 (68%), Gaps = 8/61 (13%) Frame = +1 Query: 436 FRTFRCVAHLHVHGAQRKKLDNS--------VSEESKSYMLYDPIAKRILINRDVKFVRM 591 F+ F CVAH+H+H +QRKKLD+ VSEESK+Y LYDPI +I+I+RDV F Sbjct: 205 
FKVFGCVAHVHIHDSQRKKLDDKSKKCILLGVSEESKAYKLYDPIENKIIISRDVIFEES 264 Query: 592 K 594 K Sbjct: 265 K 265 >gb|KYP50278.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 180 Score = 79.7 bits (195), Expect(2) = 4e-23 Identities = 37/71 (52%), Positives = 47/71 (66%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+NRTI+N+VR +LSEKKLPK FW +AV W +V RSPT A+ AW G Sbjct: 39 QNGVAERKNRTIMNLVRTLLSEKKLPKSFWPEAVNWVAYVLNRSPTLAVKNQMPEEAWSG 98 Query: 414 LKSSVHFFQNF 446 +K SV F+ F Sbjct: 99 VKPSVEHFRVF 109 Score = 58.2 bits (139), Expect(2) = 4e-23 Identities = 29/57 (50%), Positives = 38/57 (66%), Gaps = 8/57 (14%) Frame = +1 Query: 436 FRTFRCVAHLHVHGAQRKKLDNS--------VSEESKSYMLYDPIAKRILINRDVKF 582 FR F CV H+HV A+R KL+N +SEESK Y LY+PI ++I+I+RDV F Sbjct: 106 FRVFGCVTHVHVPDARRTKLENKSCKCVLLGMSEESKGYRLYNPITRKIVISRDVVF 162 >gb|PNX99782.1| copia-type polyprotein [Trifolium pratense] Length = 912 Score = 76.6 bits (187), Expect(2) = 1e-22 Identities = 34/71 (47%), Positives = 47/71 (66%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+NRT+LNMVR ++S + +PK+FW +A KWA +V R PT A+ AW G Sbjct: 582 QNGVSERKNRTVLNMVRSMISARSVPKKFWPEAAKWATYVMNRCPTHAVKNVTPEEAWSG 641 Query: 414 LKSSVHFFQNF 446 +K SVH F+ F Sbjct: 642 IKPSVHHFRVF 652 Score = 59.3 bits (142), Expect(2) = 1e-22 Identities = 30/68 (44%), Positives = 42/68 (61%), Gaps = 8/68 (11%) Frame = +1 Query: 436 FRTFRCVAHLHVHGAQRKKLDNS--------VSEESKSYMLYDPIAKRILINRDVKFVRM 591 FR F C+AH H+ RKKLDN VSEESK+Y LY+PI ++I+++R V F + Sbjct: 649 FRVFGCLAHAHIPDVHRKKLDNKSIACVFLGVSEESKAYKLYNPIERKIIVSRVVVFEEL 708 Query: 592 KHGSGEKK 615 K + K+ Sbjct: 709 KGWNWNKQ 716 >dbj|GAU26746.1| hypothetical protein TSUD_317440 [Trifolium subterraneum] Length = 1608 Score = 107 bits (268), Expect = 7e-22 Identities = 93/315 (29%), Positives = 148/315 (46%), Gaps = 21/315 (6%) Frame = +3 Query: 150 
GGEQLWAKVE---RNQGVHLCHIFLLPYGGIQKRLVERENRTILNMVRCILSEKKLPKEF 320 GGE + + E ++QG+ C Y Q + ER+NRTI+N VR +L+E+++P+ F Sbjct: 566 GGEFISNEFEEFCKDQGI--CRQLTASYTPQQNGVAERKNRTIMNAVRAVLNERQVPRVF 623 Query: 321 WSDAVKWAVFVEKRSPTTALHXXXXXXAWCGLKSSVHFFQNF*VCSSSPRTWCTKKEARQ 500 W +AVKW V V+ RSPT+A+ AW G++ SV +F+ F C + K+ Sbjct: 624 WPEAVKWCVHVQNRSPTSAVDHITPEEAWTGVRPSVDYFRIF-GCVAHAHVPDQKRSKLD 682 Query: 501 *CQ*RVKIL--------YAL*SYC-KKDLDQQGCKICEDETWKWREEGSNSKIQVADLEE 653 R L Y L KK + + ED++W W K+ V D EE Sbjct: 683 DKSKRCVFLGVSDESKAYKLFDPIEKKVIVNRDVVFEEDKSWDWGRTEEECKVDVLDWEE 742 Query: 654 KEEEGSVVGTSAGNQTRNAGANVQEQNASANEN---TNEDSLVEQGRALVEGRIKRKPAY 824 EE+G GT+ + + N +S N+ TNE + VEQ R +R+P + Sbjct: 743 NEEDGEDHGTAQNEEENSGDINQGASPSSLNKTGSPTNETNDVEQEFLERAARSRRRPGW 802 Query: 825 L----QDMSLSK*SWMSSNT*KTLALLWSLKQKILSPLRML*EVKNGK--KAIDLEIEAI 986 L +D +LS+ + ++ + P K+ K +A+++E++AI Sbjct: 803 LVDFEEDPNLSE---------EESLMVMMTAENGSDPYLFEEAFKSAKWREAMNMEMKAI 853 Query: 987 EKLKRHMGTDNLPKG 1031 EK K + TD P+G Sbjct: 854 EKNKTWVLTD-APRG 867 >gb|PRQ17740.1| putative RNA-directed DNA polymerase [Rosa chinensis] Length = 1302 Score = 103 bits (257), Expect = 2e-20 Identities = 79/266 (29%), Positives = 119/266 (44%), Gaps = 13/266 (4%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+NRTI+NMVR +LSEK++PK FW +AV W++ + RSPT A+ AW G Sbjct: 590 QNGVAERKNRTIMNMVRSMLSEKQVPKVFWPEAVNWSIHILNRSPTLAVKDMTPEEAWSG 649 Query: 414 LKSSVHFFQNF*VCSSSPRTWCTKKEARQ*CQ*RVKILYAL*SYCKKDLDQQGCKIC--- 584 +K VH+F+ F + +K+ V + + S + D +I Sbjct: 650 IKPVVHYFKVFGCIAHVHIPEAKRKKLDNKSYKCVLLGVSEESKAYRLYDPISERIVVSR 709 Query: 585 -----EDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQN---AS 740 EDE+W+W ++ V D + EEE N A +E+N S Sbjct: 710 DVVFEEDESWEWGRTAEEVRLDVLDWSDGEEE------------ENESAQSEEENEYAQS 757 Query: 741 ANENTNEDSLVEQGRALVEGRIKRKPAYLQDMSLSK*SWMSSNT*KTLALLWSLKQKILS 920 EN N D E+ L EGR + +P ++QD + T + + + Sbjct: 758 EEENVNNDDAEEEEEILEEGRTRMQPVWMQDYVSGEGLSEEEETNNVV-----MFTSVTD 812 
Query: 921 PLRML*EVKNG--KKAIDLEIEAIEK 992 P K+ K A+D EIEAIE+ Sbjct: 813 PATFEEAFKSAKWKAAMDQEIEAIER 838 >gb|PRQ42077.1| putative RNA-directed DNA polymerase [Rosa chinensis] Length = 1044 Score = 102 bits (253), Expect = 6e-20 Identities = 77/263 (29%), Positives = 124/263 (47%), Gaps = 10/263 (3%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+NRTI+NMVR +LSEK++PK FW +AV W++ + RSPT A+ AW G Sbjct: 311 QNGVAERKNRTIMNMVRSMLSEKQVPKVFWPEAVNWSIHILNRSPTLAVKDMTPEEAWSG 370 Query: 414 LKSSVHFFQNF*VCSSSPRTWCTKKEARQ*CQ*RVKILYAL*SYCKKDLDQQGCKIC--- 584 +K +VH+F+ F + +K+ V + + S + D +I Sbjct: 371 IKPAVHYFRVFGCIAHVHIPEAKRKKLDDKSYKCVLLGVSKESKAYRLYDPISERIVVSR 430 Query: 585 -----EDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNASANE 749 EDE W W ++ V D + EEE + SA ++ N A +E+N + N+ Sbjct: 431 DVVFEEDENWDWGRTAEEVRLDVLDWSDGEEEEN---ESAQSEEENEFAQSEEENVN-ND 486 Query: 750 NTNEDSLVEQGRALVEGRIKRKPAYLQDMSLSK*SWMSSNT*KTLALLWSLKQKILSPLR 929 + E+ ++E EGR + +P ++QD + T + + + P Sbjct: 487 DAEEEEILE------EGRTRMQPVWMQDYVSGEGLSEEEETNNIVMFTY-----VTDPTT 535 Query: 930 ML*EVKNG--KKAIDLEIEAIEK 992 K+ K A+D EIEAIE+ Sbjct: 536 FEEAFKSAKWKAAMDQEIEAIER 558 >gb|PRQ52345.1| putative RNA-directed DNA polymerase [Rosa chinensis] Length = 1316 Score = 97.4 bits (241), Expect = 2e-18 Identities = 76/263 (28%), Positives = 121/263 (46%), Gaps = 10/263 (3%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+NRTI+NMVR +LSEK++PK FW +AV W++ + RSPT A+ AW G Sbjct: 592 QNGVAERKNRTIMNMVRSMLSEKQVPKVFWPEAVNWSIHILNRSPTLAVKDMTPEEAWSG 651 Query: 414 LKSSVHFFQNF*VCSSSPRTWCTKKEARQ*CQ*RVKILYAL*SYCKKDLDQQGCKIC--- 584 +K +VH+F+ F + +K+ V + + S + D +I Sbjct: 652 IKPAVHYFRVFGCIAHVHIPEAKRKKLDDKSYKCVLLGVSKESKAYRLYDPISERIVVSR 711 Query: 585 -----EDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNASANE 749 EDE+W W ++ V D + EEE N A +E+N + N+ Sbjct: 712 DVVFEEDESWDWGRTAEEERLDVLDWSDGEEE------------ENESAQSEEENVN-ND 758 Query: 750 
NTNEDSLVEQGRALVEGRIKRKPAYLQDMSLSK*SWMSSNT*KTLALLWSLKQKILSPLR 929 + E+ ++E EGR + +P ++QD +S + + S + P Sbjct: 759 DAEEEEILE------EGRTRLQPIWMQDY-VSGEGLSEEEETNNIVMFTS----VTDPTT 807 Query: 930 ML*EVKNG--KKAIDLEIEAIEK 992 K+ K A+D EIEAIE+ Sbjct: 808 FEEAFKSAKWKAAMDQEIEAIER 830 >gb|PNX93789.1| copia-type polyprotein [Trifolium pratense] Length = 1347 Score = 97.4 bits (241), Expect = 2e-18 Identities = 66/221 (29%), Positives = 104/221 (47%), Gaps = 21/221 (9%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+NRTI+NMVR +L EKK+PK FW +AVKW+V + R PT A+ AW G Sbjct: 593 QNGVAERKNRTIMNMVRSMLVEKKVPKMFWPEAVKWSVHILNRCPTLAVQNKTPEEAWSG 652 Query: 414 LKSSVHFFQNF*VCSSSPRTWCTKKEARQ*CQ*RVKIL--------YAL*S-YCKKDLDQ 566 +K ++++F+ F C + K+ + +L Y L KK + Sbjct: 653 IKPTINYFRVF-GCVAHAHIPDQKRSKLDDKSKKCVLLGVSDESKAYKLYDPVSKKIIIS 711 Query: 567 QGCKICEDETWKWREEGSNSKIQVA--------DLEEKEEEGSVVGTSAGNQTRN----A 710 + ED W W ++ V D+EE E VG + + N Sbjct: 712 KDVIFEEDVCWNWDNNKDERRVDVLEWKNDYENDIEEAIEGNEEVGNNGNEEEHNNGNEG 771 Query: 711 GANVQEQNASANENTNEDSLVEQGRALVEGRIKRKPAYLQD 833 G N N+ + +++ +S ++ +VEGR++R P+YL D Sbjct: 772 GNNDDTTNSIESNSSSSESHEDESPNMVEGRVRRPPSYLAD 812 >gb|KYP44586.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 783 Score = 95.5 bits (236), Expect = 8e-18 Identities = 62/210 (29%), Positives = 96/210 (45%), Gaps = 10/210 (4%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+N+TI+NMVR +L EK +PK FW +AV WAV V RSPT A+ AW G Sbjct: 36 QNGVAERKNQTIINMVRSVLGEKGVPKAFWPEAVMWAVHVLNRSPTLAVKDITPEEAWSG 95 Query: 414 LKSSVHFFQNF*VCS-----SSPRTWCTKKEARQ*C-----Q*RVKILYAL*SYCKKDLD 563 +K SV +F+ F + R+ K + C K KK L Sbjct: 96 IKPSVSYFRIFGCIGYAYVHNQQRSKLDDKSTK--CVLLGVSEESKAYKLYDPVKKKILI 153 Query: 564 QQGCKICEDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNASA 743 + K ED W W E ++S + +L EE V +++ + A ++ Sbjct: 154 SRDVKFQEDAAWDWSEAKNSSILDTGELNPLEESSQVAEEKTQDKSNDTAATSSATTTNS 213 Query: 
744 NENTNEDSLVEQGRALVEGRIKRKPAYLQD 833 + N + S+ + +GR +R P +++D Sbjct: 214 SSNVPDYSVPAASESHDQGRTRRPPTWMRD 243 >ref|XP_017188308.1| PREDICTED: uncharacterized protein LOC108173558 [Malus domestica] Length = 497 Score = 92.4 bits (228), Expect = 6e-17 Identities = 74/214 (34%), Positives = 107/214 (50%), Gaps = 14/214 (6%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+NRTI+NMVR +L EKK+PK FW +AV W V V RSPT A+ AW G Sbjct: 285 QNGVAERKNRTIMNMVRSMLIEKKIPKTFWPEAVNWTVHVLNRSPTIAVKSKTPEEAWQG 344 Query: 414 LKSSVHFFQNF*VCS--SSPRTWCTKKEARQ*CQ*RVKILY---AL*SYCKKDLDQQGCK 578 LK SV F+ F S P K EA+ +K ++ + S + D K Sbjct: 345 LKPSVEHFRVFGCISHVHIPDNKRVKLEAKS-----LKCIFLGVSDESKAYRLFDPISSK 399 Query: 579 IC--------EDETWKWREEGSNSKIQVADLE-EKEEEGSVVGTSAGNQTRNAGANVQEQ 731 I ED+ W W E + + +ADLE E EE S + N + A ++E Sbjct: 400 IIVSRDVVFEEDQEWSWDE--VHKQTILADLEWEVNEEASTEEENNENGSETA-EELEEH 456 Query: 732 NASANENTNEDSLVEQGRALVEGRIKRKPAYLQD 833 ++++ + ED+ +L EGR +R PA+++D Sbjct: 457 GSNSSGSFEEDT--SNVTSLPEGRTRRPPAWMRD 488 >gb|PNY15642.1| copia-type polyprotein [Trifolium pratense] Length = 822 Score = 92.4 bits (228), Expect = 9e-17 Identities = 89/282 (31%), Positives = 132/282 (46%), Gaps = 16/282 (5%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+NRTILNMVR +L+ +++PK FW +AVKWA +V RSPT A+ AW G Sbjct: 80 QNGVSERKNRTILNMVRSMLAAREVPKNFWPEAVKWATYVMNRSPTFAVQDMTPEEAWSG 139 Query: 414 LKSSVHFFQNF*VCS--SSPRTWCTKKEARQ*CQ*RVKIL----------YAL*SYCKKD 557 +K SVH F+ F + P K + + VK + Y L + +K Sbjct: 140 VKPSVHHFRVFGCLAHVHVPDVQRKKLDGKS-----VKCIHLGLSEESKAYKLYNPNEKK 194 Query: 558 LDQQGCKICEDET-WKWREEGSNSKI---QVADLEEKEEEGSVVGTSAGNQTRNAGANVQ 725 + I E++ W W+++ S I +D E +E V GNQ+ +A ++ Sbjct: 195 IIVSRDVIFEEQKGWNWKKKNYKSPIIHDTESDSEVAAQENHPVAPE-GNQSDDAEIDMD 253 Query: 726 EQNASANENTNEDSLVEQGRALVEGRIKRKPAYLQDMSLSK*SWMSSNT*KTLALLWSLK 905 Q A++ NEDS + G L R +R P YL+D + N + AL S + Sbjct: 254 
TQ---ASDTENEDSDNDNGNNL-PPRTRRPPGYLEDYDTTT-GEEQENMIQHFALFSSKE 308 Query: 906 QKILSPLRML*EVKNGKKAIDLEIEAIEKLKRHMGTDNLPKG 1031 ++ KKA++ EIE+I K T LPKG Sbjct: 309 DP--ESYEDAIKIDVWKKAMESEIESINKNDTWELT-TLPKG 347 >dbj|GAU41840.1| hypothetical protein TSUD_177510 [Trifolium subterraneum] Length = 936 Score = 91.7 bits (226), Expect = 2e-16 Identities = 85/292 (29%), Positives = 139/292 (47%), Gaps = 17/292 (5%) Frame = +3 Query: 207 IFLLP-YGGIQKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALH 383 ++ +P Y Q + ER+NRTIL+MVR ++S + +PK FW AV WA +V+ RSPT + Sbjct: 350 VYYIPAYTPQQNGVSERKNRTILDMVRSLISARNVPKRFWPGAVNWATYVKNRSPTHVVQ 409 Query: 384 XXXXXXAWCGLKSSVHFFQNF*VCS--SSPRTWCTKKEARQ*CQ*RVKILYAL*SYC--- 548 AW G+K SVH F+ F + P K + + + + +Y Sbjct: 410 DITLEEAWSGVKPSVHNFRIFGCVAHVHIPDVNRKKLDGKSIMCILLGVSEESKTYKLYN 469 Query: 549 ---KKDLDQQGCKICEDETWKW-REEGSNSKIQVADLEE---KEEEGSVVGTSAGNQTRN 707 KK + + E ++W W ++E S++K Q D+E+ ++ G + G+ T N Sbjct: 470 PSEKKIIISRDVVFEESKSWNWNKQETSSAKGQTIDIEDNDANDDTGQIEAKVTGSNTDN 529 Query: 708 AGANVQEQNASANENTNEDSLVEQG--RALVEGRIKRKPAYLQDMSLSK*SWMSSNT*KT 881 ++ ANE+ +D V ++ R +R P YL+D ++ + + Sbjct: 530 TESH---DGNEANEDIQQDGHVSDSSDSEVLTPRTRRPPNYLRDYVTNQEQENEVDVMQN 586 Query: 882 LALLWSLKQKILSPLRML*EVKNG--KKAIDLEIEAIEKLKRHMGTDNLPKG 1031 A L+S K+ P VK+ KKA++ EIE I+K TD LP+G Sbjct: 587 FA-LFSFKE---DPNSYEEAVKHDVWKKAMESEIEVIKKNDTWELTD-LPQG 633 >gb|PNX89974.1| copia-type polyprotein, partial [Trifolium pratense] Length = 415 Score = 90.5 bits (223), Expect = 2e-16 Identities = 82/276 (29%), Positives = 118/276 (42%), Gaps = 10/276 (3%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+N+TI+NMVRC+LSEK LPK W +AV WAV + RSPT A+ AW Sbjct: 47 QNGVAERKNQTIMNMVRCMLSEKHLPKFLWGEAVNWAVHILNRSPTLAVKDKTPEEAWSD 106 Query: 414 LKSSVHFFQNF*VCS--SSPRTWCTKKEARQ*CQ*RVKILYAL*SY------CKKDLDQQ 569 +K +VH+ + F + P K +A+ + I +Y K+ + + Sbjct: 107 IKPAVHYLKVFGCVAHVHIPEAKRKKLDAKSFRCVMLGISDESKAYRLFDPTTKRIVISK 166 Query: 570 
GCKICEDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNASANE 749 E+E W W K + D E EEE RN E E Sbjct: 167 DVIFEENECWDWERSPEQMKPDLLDWGESEEE------------RN------ENTEEVRE 208 Query: 750 NTNEDSLVEQGRALVEGRIKRKPAYLQDMSLSK*SWMSSNT*KTLALLWSLKQKILSPLR 929 S + A +EGR++R P ++QD + + S + L + P Sbjct: 209 GMGSSSSLSSEEAPIEGRVRRPPGWMQDYTSGE--EFSEEEIQNLVMFTVAS----DPTT 262 Query: 930 ML*EVKNGK--KAIDLEIEAIEKLKRHMGTDNLPKG 1031 VK+ K A++ E+EAIEK TD LP G Sbjct: 263 FEEAVKSEKWRNAMNNEMEAIEKNNTWELTD-LPTG 297 >gb|PNX95763.1| retrotransposon-related protein, partial [Trifolium pratense] Length = 1327 Score = 87.0 bits (214), Expect = 6e-15 Identities = 59/208 (28%), Positives = 92/208 (44%), Gaps = 8/208 (3%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+NRTI+NMVR +LSEK++PKEFW++A W++ + R PTTAL AW G Sbjct: 591 QNGVAERKNRTIMNMVRSMLSEKQMPKEFWAEAANWSIHILNRCPTTALENMTPQEAWTG 650 Query: 414 LKSSVHFFQNF*VCS--SSPRTWCTKKEARQ*CQ*RVKILYAL*SY------CKKDLDQQ 569 K V F+ F + P K + + + + +Y KK + Sbjct: 651 CKPRVDHFRIFGCLAHVHVPDQKRIKLDDKSKTHIFLGVSKESKAYKLFDPITKKITISR 710 Query: 570 GCKICEDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNASANE 749 K E+ WKW++ + V DLE+K + + + N +N Q + Sbjct: 711 DVKFEENACWKWKQSKGEVQSDVLDLEDKNSDANKELELEEDSDSNNTSNTISQTGGNSS 770 Query: 750 NTNEDSLVEQGRALVEGRIKRKPAYLQD 833 T+ GR +R P ++ D Sbjct: 771 TTSSGGSEPNSPT---GRFRRAPGWMSD 795 >gb|PNX94698.1| copia-type polyprotein [Trifolium pratense] Length = 1324 Score = 86.7 bits (213), Expect = 9e-15 Identities = 66/214 (30%), Positives = 100/214 (46%), Gaps = 14/214 (6%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+NRT+LNMVR +L+ + +PK+FW +AVKWA +V RSPT ++ AW G Sbjct: 589 QNGVSERKNRTLLNMVRSMLAGRNVPKKFWPEAVKWATYVLNRSPTLSVKDSTPEEAWSG 648 Query: 414 LKSSVHFFQNF*VCS--SSPRTWCTKKEARQ*CQ*RVKILYAL*SY------CKKDLDQQ 569 LK SVH F+ F + P TK +A+ + + +Y KK + + Sbjct: 649 LKPSVHHFKIFGCLAYVHVPDAKRTKLDAKSLKCVHLGVSEESKAYKLYDPVNKKIIVSR 708 Query: 570 
GCKICEDETWKWREE------GSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQ 731 E W W ++ SN+ ++D + EEE G + GN N ++ + Sbjct: 709 DVVFEEGTEWNWNDKKKAAASSSNNNDLISDETDIEEEAK-NGVNTGN---NESSSEYDS 764 Query: 732 NASANENTNEDSLVEQGRALVEGRIKRKPAYLQD 833 N+ E+ L R K+KP YL D Sbjct: 765 EQEGNDYETEEEL--------PPRPKKKPGYLND 790 >gb|PNX77752.1| copia-type polyprotein, partial [Trifolium pratense] Length = 736 Score = 86.3 bits (212), Expect = 1e-14 Identities = 63/211 (29%), Positives = 94/211 (44%), Gaps = 11/211 (5%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + E +NRTI+NMVRC+LSEK +PK FW +AV WA + RSPT A+ AW G Sbjct: 358 QNGVSESKNRTIVNMVRCMLSEKNVPKNFWPEAVNWAAHILNRSPTFAVKDITPEEAWSG 417 Query: 414 LKSSVHFFQNF*VCS--SSPRTWCTKKEARQ*CQ*RVKI-----LYAL*SYCKKDLD-QQ 569 +K SV F+ F + P K + + + I Y L KK + + Sbjct: 418 IKPSVSHFKVFGCIAYVHVPDNLRKKLDDKSTVCIHLGISEESKAYKLYDPIKKRIAVSK 477 Query: 570 GCKICEDETWKWRE---EGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNAS 740 K E + W W + E SN Q+ D ++ E + A N +N ++ Sbjct: 478 DVKFDESKQWNWNDKNTENSNKNKQIIDCDDIETPSTSNQNEASNDAEEQASNSHSEDMD 537 Query: 741 ANENTNEDSLVEQGRALVEGRIKRKPAYLQD 833 +E+ L + R+ ++P YL D Sbjct: 538 LVVTDSEEEDGNDENPLGK-RVSKRPDYLND 567 >gb|PNX91151.1| copia-type polyprotein, partial [Trifolium pratense] Length = 430 Score = 84.7 bits (208), Expect = 2e-14 Identities = 62/208 (29%), Positives = 94/208 (45%), Gaps = 8/208 (3%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+NRTI+NMVRC+LS+KK+PK+FW ++VKWAV+V RSPT + AW Sbjct: 193 QNGVSERKNRTIMNMVRCMLSDKKVPKKFWPESVKWAVYVLNRSPTLLVKDITPEEAWSN 252 Query: 414 LKSSVHFFQNF*VCSSSPRTWCTKKEARQ*CQ*RVKILYAL*SYCKKDLDQQGCKIC--- 584 +K SV F+ F + +K+ V + + S K + +I Sbjct: 253 MKPSVKHFKVFGCLAFVHVPDAQRKKLDDKSIKCVHLGVSEESKAYKLYNPADRRIIVSR 312 Query: 585 -----EDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNASANE 749 E + W W G NS+ Q + E E ++ AG NV+ + Sbjct: 313 DVVFDESKGWNW---GENSQAQATQYDNSENE-----VYETDEEPAAGENVEADPQNITV 364 Query: 750 
NTNEDSLVEQGRALVEGRIKRKPAYLQD 833 +E + + R+ R+P YL D Sbjct: 365 PDSESDEQYESEEELPPRVIRRPGYLND 392 >gb|PNX96091.1| retrotransposon-related protein [Trifolium pratense] Length = 1326 Score = 85.5 bits (210), Expect = 2e-14 Identities = 78/277 (28%), Positives = 130/277 (46%), Gaps = 11/277 (3%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+NRT++NMVR +L +K +PK FW +AV W ++V R PT A+ AW G Sbjct: 589 QNGVAERKNRTVMNMVRSLLFDKNIPKTFWPEAVNWTIYVLNRCPTLAVKDVTPEEAWSG 648 Query: 414 LKSSVHFFQNF*VCS--SSPRTWCTKKEARQ*CQ*RVKILYAL*SY------CKKDLDQQ 569 +K SV+ F+ F + P TK ++R + + Y KK + + Sbjct: 649 VKPSVNHFRVFGCIAHVHVPEAKRTKLDSRSITCVLLGVSEESKGYRFFDPVSKKIVVSR 708 Query: 570 GCKICEDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNASANE 749 ED+ W W E+ + + ++ D E +E ++ N G NV+++ E Sbjct: 709 DVIFEEDKQWDWEEKQTVADLEWNDGENEERVSENDNEERVSENDNQG-NVEKEREVIRE 767 Query: 750 NTNEDSLVEQGRALV-EGRIKRKPAYLQDMSLSK*SWMSSNT*KTLALLWSLKQKILSPL 926 ++ + +G +V E R +R P ++ D + S +AL+ S + PL Sbjct: 768 EEHDSN---EGEEIVKEYRERRPPGWMSDFESGE---GLSEDEAHMALMVS-----IDPL 816 Query: 927 RML*EVK--NGKKAIDLEIEAIEKLKRHMGTDNLPKG 1031 VK N + A++ EI++IEK + T+ LP G Sbjct: 817 CFEEAVKSENWRLAMEKEIKSIEKNQTWTLTE-LPAG 852 >gb|KYP37051.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1079 Score = 85.1 bits (209), Expect = 3e-14 Identities = 68/261 (26%), Positives = 114/261 (43%), Gaps = 8/261 (3%) Frame = +3 Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413 Q + ER+NRTI+N VR +L EK++ K FW + V+W V ++ RSPTTA+ W G Sbjct: 572 QNGVAERKNRTIMNAVRAVLHEKQVSKSFWPEVVRWCVHIQNRSPTTAIDHGTLEEVWSG 631 Query: 414 LKSSVHFFQNF*VCS--SSPRTWCTKKEARQ------*CQ*RVKILYAL*SYCKKDLDQQ 569 +K V +F+ F + P +K + + K KK + + Sbjct: 632 IKPRVDYFRTFGCVAHVHIPDQRRSKLDDKSHTCVLLGVSDEAKAYKLFDPISKKVIVSR 691 Query: 570 GCKICEDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNASANE 749 ED+ W W + + S D+E ++EEGS S G+N + Sbjct: 692 
DVVFEEDKGWNWHKGTTESTPLALDMEGQDEEGSDDVDSTPQLVATRGSNNNSEPPEPVS 751 Query: 750 NTNEDSLVEQGRALVEGRIKRKPAYLQDMSLSK*SWMSSNT*KTLALLWSLKQKILSPLR 929 N+ +GRA R +R+P ++ D + + + LA+L + + Sbjct: 752 NSESIVPPVEGRAT---RTQRQPLWMTDYETN----LFAEEESLLAML-TTNSEDPQTFE 803 Query: 930 ML*EVKNGKKAIDLEIEAIEK 992 + K+A+D E++AIE+ Sbjct: 804 EASTSQKWKEAMDTEMKAIER 824