BLASTX nr result
ID: Astragalus23_contig00022363
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00022363
         (717 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|GAU41450.1| hypothetical protein TSUD_98460 [Trifolium subte...  130   7e-38
gb|PNY17781.1| Ty3/gypsy retrotransposon protein [Trifolium prat...  135   1e-32
gb|KYP68937.1| Retrotransposon-derived protein PEG10 [Cajanus ca...  129   3e-31
gb|PNY07310.1| Ty3/gypsy retrotransposon protein [Trifolium prat...  130   7e-31
ref|XP_017423676.1| PREDICTED: uncharacterized protein LOC108332...  107   5e-29
ref|XP_014511429.1| uncharacterized protein LOC106770116 [Vigna ...  120   1e-27
gb|PNX79664.1| hypothetical protein L195_g035651 [Trifolium prat...  117   1e-26
ref|XP_017426291.1| PREDICTED: uncharacterized protein LOC108334...  116   6e-26
gb|KYP76287.1| Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan]   87   1e-25
ref|XP_006574291.1| PREDICTED: uncharacterized protein LOC102661...   79   2e-24
gb|PNX92353.1| Ty3/gypsy retrotransposon protein [Trifolium prat...  112   2e-24
gb|KYP42639.1| Retrovirus-related Pol polyprotein from transposo...  111   2e-24
gb|KYP61806.1| hypothetical protein KK1_016317 [Cajanus cajan]        96   2e-22
dbj|GAU26773.1| hypothetical protein TSUD_317710 [Trifolium subt...  105   2e-22
gb|PNX92970.1| Ty3/gypsy retrotransposon protein, partial [Trifo...  105   5e-22
gb|KHN07600.1| Retrovirus-related Pol polyprotein from transposo...  103   1e-21
gb|PNX98954.1| retrotransposon-related protein, partial [Trifoli...  103   1e-21
dbj|GAU45274.1| hypothetical protein TSUD_99960 [Trifolium subte...  102   3e-21
gb|PNX92469.1| hypothetical protein L195_g015607 [Trifolium prat...  102   3e-21
dbj|GAU40605.1| hypothetical protein TSUD_28110 [Trifolium subte...  102   4e-21

>dbj|GAU41450.1| hypothetical protein TSUD_98460 [Trifolium subterraneum] Length = 1385 Score = 130 bits (326), Expect(3) = 7e-38 Identities = 83/191 (43%), Positives = 100/191 (52%), Gaps = 19/191 (9%) Frame = +2 Query: 200 RFEVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQTPSRTES----STSKNA 367 R EVLAQQP LSQAAGLARL E+K+ DLLRL R K P++P T S T++ +T N+ Sbjct: 199 RREVLAQQPVDLSQAAGLARLHEEKIQDLLRLARPKQPFTPWNTSSSTKTFAAPTTKPNS 258 Query: 368 ----TSXXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXX 535 S RFRQL AAE+ADRREKGLCFN DERFS++HRCKAR Sbjct: 259 EITKNSPPPPLLPTPQPKTRFRQLYAAELADRREKGLCFNCDERFSRNHRCKARFLLLIA 318 Query: 536 XXXXXXXXXXXXXXXVWPIV-----------DPEAEPTQFALHTMTGAHTAHTFRVQGHI 682 + EA+ Q + H M+G TA T +V G I Sbjct: 319 VDNDEEEKGGPEAEIGESEIPTDSLLALLGTQEEAQLAQLSYHAMSGIQTAQTIKVLGKI 378 Query: 683 DDEPVHILVDG 715 VH+LVDG Sbjct: 379 AQHSVHVLVDG 389 Score = 39.3 bits (90), Expect(3) = 7e-38 Identities = 15/19 (78%), Positives = 16/19 (84%) Frame = +3 Query: 81 YQWMHSNGQITSWTQFLTA 137 YQW HSNG+I SWTQFL A Sbjct: 120 YQWKHSNGEIVSWTQFLRA 138 Score = 37.4 bits (85), Expect(3) = 7e-38 Identities = 18/23 (78%), Positives = 20/23 (86%) Frame = +1 Query: 136 HRIVGLSTHNLLSCFVSGLKPEI 204 +RIVGLS +LLSCFVSGLK EI Sbjct: 176 NRIVGLSPQDLLSCFVSGLKVEI 198
>gb|PNY17781.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] Length = 1478 Score = 135 bits (340), Expect = 1e-32 Identities = 103/296 (34%), Positives = 127/296 (42%), Gaps = 64/296 (21%) Frame = +2 Query: 20 VFKISQFFAYHRTPETERITVSV------------------DAQQWPN------------ 109 +FKISQFF YH TPE ERITV+ WP Sbjct: 56 IFKISQFFTYHNTPEEERITVASFYLDGPALAWYQWMYRNGQIVSWPQVLQALELRFAPT 115 Query: 110 ------------HLLDTVSHRIASWGSLRTIY*VALSRV-----------SNRRFEVLAQ 220 H TV+ ++ + SL V LS S R EVLAQ Sbjct: 116
AYDDPRGKLFKLHQTTTVASYLSDFESLANRI-VGLSPPDLLSCFISGLRSEIRREVLAQ 174 Query: 221 QPSSLSQAAGLARLQEDKVNDLLRLTRQKP--PWSPAQTPSRTESSTSKNATSXXXXXXX 394 QP+SL+QAA LARLQE+K+ DLLRL + + PWS + + SS N Sbjct: 175 QPTSLTQAAALARLQEEKIQDLLRLAKPRTTAPWS-----NPSSSSPRSNPAPTTASLLP 229 Query: 395 XXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXXXXXXXXX 574 R+RQLS EM +RREKGLCFN DERFS++HRCKAR Sbjct: 230 TPANHPRYRQLSPTEMNERREKGLCFNCDERFSRTHRCKARFLLFIADEDEELAGLDPGE 289 Query: 575 XXVWPIVDP---------EAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715 P DP E Q + H ++G +A T RV G + V +LVDG Sbjct: 290 TDPPPAADPLPDVSGLVEEFHSAQLSYHALSGVQSAQTIRVPGRVGAHSVRVLVDG 345 >gb|KYP68937.1| Retrotransposon-derived protein PEG10 [Cajanus cajan] Length = 507 Score = 129 bits (325), Expect = 3e-31 Identities = 100/295 (33%), Positives = 127/295 (43%), Gaps = 63/295 (21%) Frame = +2 Query: 20 VFKISQFFAYHRTPETERITVS---VDAQ-----QWP---------NHLLDTVSHRIASW 148 +FKI+QFF YH TPE ERITV+ +D QW HLL + R A Sbjct: 78 IFKITQFFDYHNTPEEERITVASFYLDGAALAWFQWMYRNGQIHSWQHLLQALETRFAPT 137 Query: 149 G--------------SLRTIY*VALSRVSNR---------------------RFEVLAQQ 223 + + + V+NR R EV+AQQ Sbjct: 138 AFDDPRGRLFKLTQTTTVSAFLTEFEAVANRVTGLSPQFLLSCFIFGLKPEIRREVIAQQ 197 Query: 224 PSSLSQAAGLARLQEDKVNDLLRLTRQKP--PWS--PAQTPSRTESSTSKNATSXXXXXX 391 P SL+ A GLARL E+K+ DL R+ R KP PWS P + A Sbjct: 198 PPSLTHAVGLARLHEEKLQDLSRIQRAKPGAPWSSPPFSRTFTPFAPPQTIAPKPLPPLL 257 Query: 392 XXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXXXXXXXX 571 RFRQL+ AEMADRREKGLCFN D+++S+SHRC AR Sbjct: 258 PSPPPKTRFRQLTEAEMADRREKGLCFNCDQKYSRSHRCPARFLLLIAEDDDPPSAPDLD 317 Query: 572 XXXVWP---IVDP----EAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715 P +VDP P Q +LH ++G T R+ G I P+ +LVDG Sbjct: 318 FPAADPDPSLVDPSTVTSVHPAQISLHALSGTGAPKTLRLTGQIAHHPIRVLVDG 372 >gb|PNY07310.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] gb|PNY07311.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] Length = 1494 Score = 130 bits (327), Expect = 7e-31 Identities = 102/292 (34%), Positives = 133/292 
(45%), Gaps = 55/292 (18%) Frame = +2 Query: 5 NAPALVFKISQFFAYHRTPETERITVS--------VDAQQWP---------NHLLDTVSH 133 +A +FKISQFF YH+TPE +RIT++ + QW N L + Sbjct: 69 DANGWIFKISQFFTYHQTPEEDRITIASFYLDGPALAWYQWMYRNSQIVSWNQFLRALET 128 Query: 134 RIASW------GSLRTI--------Y*VALSRVSNR---------------------RFE 208 R A G+L + Y ++NR R E Sbjct: 129 RFAPTAYDDPKGNLFKLTQSGSVNDYLTEFESLANRIVGLSPLDLLSCFISGLKVEIRRE 188 Query: 209 VLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQT-PSRTESSTSKNATSXXXX 385 VLAQQP+SLSQAAGLARLQEDK+ D ++ +R K SPA T PSR + Sbjct: 189 VLAQQPNSLSQAAGLARLQEDKIQDQIKASRSK--LSPAYTAPSRPNFNLPGRPAPGLLP 246 Query: 386 XXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXXXXXX 565 RFR LS E+A+RREKGLCFN D+++SK HRC R Sbjct: 247 APPSKP---RFRHLSEPELAERREKGLCFNCDQKWSKQHRCGGRTFLLLADEEDEEVDPS 303 Query: 566 XXXXXVWPIVDP--EAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715 + P + Q +LH + G+H TFRV+G I +PV+ILVDG Sbjct: 304 QLESTIDIDTSPPDDTPQAQLSLHALAGSHATDTFRVEGQILKQPVNILVDG 355 >ref|XP_017423676.1| PREDICTED: uncharacterized protein LOC108332889 [Vigna angularis] Length = 556 Score = 107 bits (268), Expect(3) = 5e-29 Identities = 69/182 (37%), Positives = 88/182 (48%), Gaps = 10/182 (5%) Frame = +2 Query: 200 RFEVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQ--TPSRTESSTSKNATS 373 R EVL+QQP +LSQA+GLARL E+K DL RL RQ+ P T S T Sbjct: 173 RREVLSQQPQTLSQASGLARLHEEKFQDLTRLIRQRSGPGPLSLLTRSPTTPLVPLVPLK 232 Query: 374 XXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKAR--------XXXX 529 RF+QL+ AEMADRRE+GLCFN D++FS++HRC AR Sbjct: 233 QLPPLLPAPPPRTRFKQLTEAEMADRRERGLCFNCDQKFSRNHRCPARYMLLVAEEDNDS 292 Query: 530 XXXXXXXXXXXXXXXXXVWPIVDPEAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILV 709 V P++ + Q L+ ++G T RV G ID V +LV Sbjct: 293 CKSSLHAPILEPNPPDPVSPVITDDPNQAQLCLNALSGFGAPETLRVSGQIDQFQVTVLV 352 Query: 710 DG 715 DG Sbjct: 353 DG 354 Score = 37.4 bits (85), Expect(3) = 5e-29 Identities = 17/23 (73%), Positives = 19/23 (82%) Frame = +1 Query: 136 HRIVGLSTHNLLSCFVSGLKPEI 204 +RIVGL LLSCF+SGLKPEI Sbjct: 150 NRIVGLQPQFLLSCFISGLKPEI 172 Score = 
31.6 bits (70), Expect(3) = 5e-29 Identities = 12/19 (63%), Positives = 14/19 (73%) Frame = +3 Query: 81 YQWMHSNGQITSWTQFLTA 137 +QWM+ NGQI SW Q L A Sbjct: 94 FQWMYRNGQIHSWPQLLQA 112 >ref|XP_014511429.1| uncharacterized protein LOC106770116 [Vigna radiata var. radiata] Length = 851 Score = 120 bits (302), Expect = 1e-27 Identities = 96/299 (32%), Positives = 124/299 (41%), Gaps = 67/299 (22%) Frame = +2 Query: 20 VFKISQFFAYHRTPETERITVS---VDAQ-----QWP---------NHLLDTVSHRIA-- 142 +FKI+QFF YH TPE ERI V+ +D QW HLL + R A Sbjct: 101 IFKINQFFDYHNTPEEERIIVASFYLDGAALAWFQWMYRNGQILSWTHLLQALETRFAPT 160 Query: 143 ------------SWGSLRTIY*VALSRVSNR---------------------RFEVLAQQ 223 S S + Y +NR R EV+AQQ Sbjct: 161 AFEDPRGKLFKLSQTSSVSAYLNEFEATANRVTGXSPPFLLSCFLSGLKSEXRREVVAQQ 220 Query: 224 PSSLSQAAGLARLQEDKVNDLLRLTRQKP--PWSPAQTPSRTESSTSKNATSXXXXXXXX 397 P +LS A GLARLQE+K+ DL R+ R KP W + + + Sbjct: 221 PQTLSLAVGLARLQEEKLWDLSRVQRVKPLSSWPTSSLTRTVPTPIQQPPPKPLPPILPS 280 Query: 398 XXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXXXXXXXXXX 577 R+RQL+ AEMADR EKGLCFN D+++S+SHRC AR Sbjct: 281 PSPKTRYRQLTEAEMADRHEKGLCFNCDQKYSRSHRCPARFLLLIAEEDDSTGGLASNPT 340 Query: 578 XVWPIVDPEAE-------------PTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715 P DP + P Q +LH ++G T R+ GHI P+ +LVDG Sbjct: 341 SFDP--DPRTDEQPPQPIDLLLDLPAQISLHALSGIGGPETLRLTGHIGQHPIRVLVDG 397 >gb|PNX79664.1| hypothetical protein L195_g035651 [Trifolium pratense] Length = 536 Score = 117 bits (293), Expect = 1e-26 Identities = 90/303 (29%), Positives = 129/303 (42%), Gaps = 65/303 (21%) Frame = +2 Query: 2 TNAPALVFKISQFFAYHRTPETERITVS---VDAQQWPNHLLDTVSHRIASWGSLR---- 160 T+ +FKISQFF YH+TPE ERIT++ +D + + +IASW Sbjct: 61 TDTHGWIFKISQFFDYHQTPEEERITIASFYLDGAALAWYQWMYRNRQIASWAQFLEKLE 120 Query: 161 ------------------------TIY*VALSRVSNR---------------------RF 205 + Y ++NR R Sbjct: 121 TRFAPTAFDDPRGNLFKLTQSTTVSAYLTEFEALANRLEGLSDVDLLSCFISGLKSDVRR 180 Query: 206 EVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQTPSRTE-----SSTSKNAT 370 EV+AQQP+S+SQAAGLARLQE+K+ D+ R 
+R W P + +S +KN + Sbjct: 181 EVVAQQPTSISQAAGLARLQEEKLQDIARASRPTSSWQPPSVARPIQKAPEVTSPAKNTS 240 Query: 371 SXXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXX 550 RFR LS E+ ++REKGLCFN D+++ K H+C AR Sbjct: 241 G----LLPTPPAKPRFRHLSGPELDEQREKGLCFNCDKKWPKQHKCGAR-VFVMLADNDD 295 Query: 551 XXXXXXXXXXVWPIVDP--------EAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHIL 706 + +DP E + Q +L ++G A T R+ G+I PV +L Sbjct: 296 SFSTTAAEELLTDSIDPSGVLTPQSEVQAAQLSLFALSGVPAADTIRILGYIGSHPVRVL 355 Query: 707 VDG 715 VDG Sbjct: 356 VDG 358 >ref|XP_017426291.1| PREDICTED: uncharacterized protein LOC108334872 [Vigna angularis] Length = 756 Score = 116 bits (290), Expect = 6e-26 Identities = 95/302 (31%), Positives = 130/302 (43%), Gaps = 70/302 (23%) Frame = +2 Query: 20 VFKISQFFAYHRTPETERITV--------SVDAQQWP---------NHLLDTVSHRIASW 148 +FKI+QFF YH TPE ERITV ++ QW LL + R A Sbjct: 81 IFKITQFFDYHNTPEEERITVASFYLDGAALALFQWMYRNGQLHSWQQLLQALETRFAPT 140 Query: 149 ------GSLRTI--------Y*VALSRVSNR---------------------RFEVLAQQ 223 G L + + ++NR R EV+AQQ Sbjct: 141 AFDDPKGKLFKLAQTTTVSDFLTEFESIANRVAGLPPSFLLSCFISGLKPEIRREVVAQQ 200 Query: 224 PSSLSQAAGLARLQEDKVNDLLRLTRQKP--PWSPAQT----PSRTESSTSKNAT----- 370 P +LS A GLARLQE+K+ DL R+ + K PW P S+T+S T+K+ Sbjct: 201 PPTLSHAVGLARLQEEKIWDLNRVPKPKSVSPWPPPSINRTITSQTQSQTTKHLPPLLTS 260 Query: 371 ---SXXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKAR----XXXX 529 R+RQL+ AEMADRREKGLCFN ++++S+SHRC AR Sbjct: 261 PIEKSLPPLLTPPPPKTRYRQLTEAEMADRREKGLCFNCEQKYSRSHRCPARFLFFIAEE 320 Query: 530 XXXXXXXXXXXXXXXXXVWPIVDPEAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILV 709 + P+ D + Q +LH ++G T R+ G I + +LV Sbjct: 321 ADSVGEGDLEMPTFDGVLEPMGDSPNQSAQISLHALSGTGAPETLRLMGQIGLHQISVLV 380 Query: 710 DG 715 DG Sbjct: 381 DG 382 >gb|KYP76287.1| Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan] Length = 712 Score = 86.7 bits (213), Expect(4) = 1e-25 Identities = 62/177 (35%), Positives = 82/177 (46%), Gaps = 7/177 (3%) Frame = +2 Query: 206 EVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQTPSRTESSTSKNATSXXXX 385 EV A +P+SL 
LA+LQEDK+ + R KP A TPS + + Sbjct: 131 EVQALRPASLDHTTQLAKLQEDKIEERRRAFFPKPQ---ALTPSSHTALPTPQPR----- 182 Query: 386 XXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXXXXXX 565 FR+LS +MA RREKGLC+N DE F+ SHRCK + Sbjct: 183 --------VNFRRLSPDDMAARREKGLCYNCDELFTPSHRCKGKFFLLTTDDPIVDDFTP 234 Query: 566 XXXXXVWPIVDP-------EAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715 ++DP + P+Q +LH TG + T R+QG I + PV ILVDG Sbjct: 235 DPTL----VLDPPPPEPVSDDTPSQVSLHAFTGGVGSSTIRLQGQIRNNPVSILVDG 287 Score = 36.2 bits (82), Expect(4) = 1e-25 Identities = 14/22 (63%), Positives = 19/22 (86%) Frame = +2 Query: 20 VFKISQFFAYHRTPETERITVS 85 +FKISQFF YH TP++ER+ V+ Sbjct: 17 IFKISQFFDYHNTPKSERLQVA 38 Score = 31.2 bits (69), Expect(4) = 1e-25 Identities = 13/21 (61%), Positives = 15/21 (71%) Frame = +3 Query: 75 SRYQWMHSNGQITSWTQFLTA 137 S YQWM+ NGQI +W FL A Sbjct: 48 SWYQWMYWNGQIQTWFGFLRA 68 Score = 31.2 bits (69), Expect(4) = 1e-25 Identities = 14/23 (60%), Positives = 17/23 (73%) Frame = +1 Query: 136 HRIVGLSTHNLLSCFVSGLKPEI 204 +RIVGL L+CF+SGL PEI Sbjct: 106 NRIVGLPAPFALNCFISGLTPEI 128 >ref|XP_006574291.1| PREDICTED: uncharacterized protein LOC102661730 [Glycine max] Length = 1588 Score = 79.3 bits (194), Expect(4) = 2e-24 Identities = 60/185 (32%), Positives = 82/185 (44%), Gaps = 13/185 (7%) Frame = +2 Query: 200 RFEVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQTPSRTESSTSKNATSXX 379 R EV A P S+ QAAGLARLQ +KV D R +PP +P P + S A + Sbjct: 319 RREVQAHHPLSMVQAAGLARLQAEKVLDQRPSPRSRPP-NPTPFPPQLGPPPSLPAPTLP 377 Query: 380 XXXXXXXXXXX--------RFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXX 535 +++S EMA RREKGLCFN DE++ + H+C +R Sbjct: 378 PLLNPPPPPRPPTTPMSTPTLKRVSPDEMALRREKGLCFNCDEKYHRGHKCSSRFFILIS 437 Query: 536 XXXXXXXXXXXXXXXV-WPIVDP----EAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVH 700 P DP + PTQ +L+++ G T R G + +P+ Sbjct: 438 DDLEPIPSHIPIPDLTHHPPPDPPDNLDLYPTQISLNSLAGHIAPETLRFVGQLSGQPML 497 Query: 701 ILVDG 715 ILVDG Sbjct: 498 ILVDG 502 Score = 35.8 bits (81), Expect(4) = 2e-24 Identities = 13/22 (59%), Positives = 19/22 (86%) 
Frame = +2 Query: 20 VFKISQFFAYHRTPETERITVS 85 +FKI+QFF YH TPE +++TV+ Sbjct: 207 IFKINQFFEYHSTPEQDKLTVA 228 Score = 34.7 bits (78), Expect(4) = 2e-24 Identities = 15/22 (68%), Positives = 18/22 (81%) Frame = +1 Query: 139 RIVGLSTHNLLSCFVSGLKPEI 204 R+VG+S LLSCF+SGL PEI Sbjct: 297 RVVGISPPLLLSCFISGLSPEI 318 Score = 31.2 bits (69), Expect(4) = 2e-24 Identities = 11/24 (45%), Positives = 16/24 (66%) Frame = +3 Query: 66 RSASRYQWMHSNGQITSWTQFLTA 137 R+ + YQWM +N TSW+ F+ A Sbjct: 235 RALAWYQWMKANNHFTSWSSFIQA 258 >gb|PNX92353.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] Length = 1502 Score = 112 bits (279), Expect = 2e-24 Identities = 91/293 (31%), Positives = 126/293 (43%), Gaps = 55/293 (18%) Frame = +2 Query: 2 TNAPALVFKISQFFAYHRTPETERITVSVDAQQWP-----------------NHLLDTVS 130 ++A +FKISQFF +H TPE +R+T++ + P + LL+ + Sbjct: 84 SDAMGWIFKISQFFEFHATPEADRLTIASFYMEGPALGWYQWMARNGQLTSWHGLLNAIE 143 Query: 131 HRIASW------GSLRTI--------Y*VALSRVSNR---------------------RF 205 R A GSL + Y A ++NR R Sbjct: 144 ARFAPSQYDDPKGSLFKLTQKGSVSEYLSAFETLANRIVGLQPPFLLSCFISGLIPEIRR 203 Query: 206 EVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQTPSRTESSTSKNATSXXXX 385 EV+A QP +L QAA LARLQE+K ND R R + +P SSTS+ T Sbjct: 204 EVMALQPLNLIQAASLARLQEEKFNDARRALRNRGILNPTPLQQIPPSSTSR--TPLALL 261 Query: 386 XXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKAR-XXXXXXXXXXXXXXX 562 F++LS EMA RREKGLCFN DE+F H+C +R Sbjct: 262 PPPPKPSPPTFKRLSPTEMAQRREKGLCFNCDEKFRPGHKCSSRFFILITDDDIDPDLTH 321 Query: 563 XXXXXXVWPIVDPEAEPT--QFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715 P +P+ EP+ Q + H ++G T R+ G I + VHIL+DG Sbjct: 322 IDPNSAAQPDPEPDLEPSQAQISFHALSGHLAPETLRLAGRIAHQRVHILMDG 374 >gb|KYP42639.1| Retrovirus-related Pol polyprotein from transposon 297 family [Cajanus cajan] Length = 894 Score = 111 bits (278), Expect = 2e-24 Identities = 98/302 (32%), Positives = 132/302 (43%), Gaps = 64/302 (21%) Frame = +2 Query: 2 TNAPALVFKISQFFAYHRTPETERITVS---VDAQ-----QWP---------NHLLDTVS 130 ++A +FKI+QFF YH TPE E ITV+ +D QW N +L + Sbjct: 63 
SDALGWIFKITQFFEYHNTPEEECITVASFYLDGSALAWFQWMYRNGQIHSWNQMLQALE 122 Query: 131 HRIASW------GSLR--------TIY*VALSRVSNR---------------------RF 205 +R A G L T Y ++NR R Sbjct: 123 NRFAPTAFDNPRGKLFKLTQSFSVTSYLTEFESLANRIVGLQPSFLLSCFISGLKPELRR 182 Query: 206 EVLAQQPSSLSQAAGLARLQEDKVNDLLRLTR--QKPPWSPAQTPSRTESSTSKNAT-SX 376 +V+A QPSSLSQA G ARL E+K+ D R R Q P WS A SRT S S + Sbjct: 183 DVIAHQPSSLSQAVGYARLHEEKLFDSSRTHRPSQSPRWS-APPVSRTFSPLSPSPPPKS 241 Query: 377 XXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKAR---------XXXX 529 RF+QL+ AEMAD+REKGLCFN D++FS++HRC AR Sbjct: 242 LPPLLPPPPPKTRFKQLTEAEMADKREKGLCFNCDQKFSRNHRCLARYFLLIVDEDESPP 301 Query: 530 XXXXXXXXXXXXXXXXXVWPIVDPEAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILV 709 V ++D + Q +L+ ++G+ T T R+ G + V +LV Sbjct: 302 PDSDGGSDLGAGSDPKLVEELLDLSPDSAQLSLNALSGSGTPETLRIVGLLAQYQVRVLV 361 Query: 710 DG 715 DG Sbjct: 362 DG 363 >gb|KYP61806.1| hypothetical protein KK1_016317 [Cajanus cajan] Length = 287 Score = 95.9 bits (237), Expect(3) = 2e-22 Identities = 56/112 (50%), Positives = 70/112 (62%), Gaps = 6/112 (5%) Frame = +2 Query: 200 RFEVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWS-PAQTPSRTESSTSKN---- 364 R EV+AQQP SL+ A GLARLQE++++DL R R +P S PA +RT +S Sbjct: 77 RREVIAQQPQSLATAVGLARLQEERLSDLSRFQRPRPASSWPAPLLTRTITSAPPPQQPT 136 Query: 365 -ATSXXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKAR 517 A RFRQL+ AEMADRREKGLCFN D+++S+SHRC AR Sbjct: 137 AAPPRAPPLLPTPAPKTRFRQLTEAEMADRREKGLCFNCDQKYSRSHRCPAR 188 Score = 33.9 bits (76), Expect(3) = 2e-22 Identities = 15/23 (65%), Positives = 19/23 (82%) Frame = +1 Query: 136 HRIVGLSTHNLLSCFVSGLKPEI 204 +R+ GLS LLSCF+SGL+PEI Sbjct: 54 NRVTGLSPPFLLSCFLSGLQPEI 76 Score = 25.0 bits (53), Expect(3) = 2e-22 Identities = 10/16 (62%), Positives = 11/16 (68%) Frame = +3 Query: 90 MHSNGQITSWTQFLTA 137 M+ NGQI SWT L A Sbjct: 1 MYRNGQILSWTHLLQA 16 >dbj|GAU26773.1| hypothetical protein TSUD_317710 [Trifolium subterraneum] Length = 1395 Score = 105 bits (263), Expect = 2e-22 Identities = 92/301 (30%), Positives = 123/301 (40%), Gaps = 
63/301 (20%) Frame = +2 Query: 2 TNAPALVFKISQFFAYHRTPETERITVSV-----DAQQWPNH------------LLDTVS 130 T+A +FKISQFF YH TPETER+TV+ A W + LL + Sbjct: 62 TDAMGWIFKISQFFDYHNTPETERLTVASFYMDGPALTWYQYMYRNGHINSWFGLLQALE 121 Query: 131 HRIA---------------SWGSLRTIY*VALSRVSNR---------------------R 202 R A GSL Y R++NR R Sbjct: 122 ARFAPSYYDDPSQALFKLTQRGSLNQ-YLTEFERLANRIIGLPQPFILNCFISGLAPEIR 180 Query: 203 FEVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQK---------PPWSPAQTPSRTESST 355 EV A QP++LS A LA+LQEDK++D R + K PP P S T+ Sbjct: 181 REVQALQPATLSLATALAKLQEDKIDDRRRNFKTKQHTSSSSTTPPLLPTPLSSTTQPPN 240 Query: 356 SKNATSXXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXX 535 + + +FR+LS+ +MA RREKGLC+N DE F H+CK R Sbjct: 241 NPSRV--------------QFRKLSSEDMASRREKGLCYNCDETFIPGHKCKGRLYLLVS 286 Query: 536 XXXXXXXXXXXXXXXV-WPIVDPEAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVD 712 + I P Q + H++ G+ T R+ G I + PV IL+D Sbjct: 287 DEPDPAESPPSQTPDLDHSIESPPDLEGQISFHSLAGSSATATLRIIGQIANHPVTILID 346 Query: 713 G 715 G Sbjct: 347 G 347 >gb|PNX92970.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense] Length = 1483 Score = 105 bits (261), Expect = 5e-22 Identities = 92/300 (30%), Positives = 123/300 (41%), Gaps = 62/300 (20%) Frame = +2 Query: 2 TNAPALVFKISQFFAYHRTPETERITVSVDAQQWPNHLLDTVSHR---IASW-GSLRTI- 166 ++A +FKISQFF YH+TPE ER+TV+ + P HR I +W G L+ + Sbjct: 67 SDAMGWIFKISQFFDYHQTPEEERLTVASFYMEGPALSWFQWMHRNGQITTWFGLLQALE 126 Query: 167 --------------------------Y*VALSRVSNR---------------------RF 205 Y R++NR R Sbjct: 127 TRFAPSYYDDPSSSLFKLTQRTTVNEYLAEFERLANRIVGLQPPFLLSCFISGLSPEIRR 186 Query: 206 EVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKP---PWSPAQTPSRTESSTSKNAT-S 373 EV A +P SL+QA LA+LQEDK+ D R + KP S + P ST T + Sbjct: 187 EVQALRPMSLTQATALAKLQEDKIADRRRFFKNKPNSQQISSSSNPFGPPPSTPPLPTPN 246 Query: 374 XXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXX 553 FR+LS EMA RREKGLC+N DE F+ H+C+ R Sbjct: 247 TLPLLPPPKPNRPNFRKLSPEEMASRREKGLCYNCDETFTPQHKCRGRFFLLVTEEPMES 306 Query: 554 
XXXXXXXXXVWPIVDPEAEPT------QFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715 P E PT Q +LH ++G A T R+ G I + PV +L+DG Sbjct: 307 PPDLIDFTE--PDPPNETTPTDAAIDAQISLHALSGCTVASTIRLMGCIANHPVTVLIDG 364 >gb|KHN07600.1| Retrovirus-related Pol polyprotein from transposon opus, partial [Glycine soja] Length = 466 Score = 103 bits (256), Expect = 1e-21 Identities = 87/264 (32%), Positives = 122/264 (46%), Gaps = 33/264 (12%) Frame = +2 Query: 20 VFKISQFFAYHRTPETERITVSV-----DAQQWPNHLLDTVSHRIASW-GSLRTIY*V-- 175 +FKISQ F Y TPE ERITV+ A W + + I SW G L+ + Sbjct: 60 IFKISQLFEYQNTPEEERITVAFFYLDGAALSWYQWMFR--NGFITSWSGFLQALESRFA 117 Query: 176 ---------ALSRVSNR----RFEVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPW 316 AL +++ R R EVLA QP SL QA LA+LQEDK+ D RQ PP Sbjct: 118 PSYYDDPKGALFKLTQRGTDIRREVLALQPISLPQAMALAKLQEDKIRD----RRQAPPR 173 Query: 317 SPAQTPSRTESSTSKNATSXXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSK 496 + TPS + + ++ S + Q + EMA RREKGLC+N +E++S Sbjct: 174 NH-NTPSASYAPPTRKPHST-------------YVQRTPDEMALRREKGLCYNCEEKWSS 219 Query: 497 SHRCKARXXXXXXXXXXXXXXXXXXXXXVWPIVDP------------EAEPTQFALHTMT 640 +HRCK R + P+ +P E P +LH ++ Sbjct: 220 THRCKGRVLLFIADNPSPTSDEPISEPPLLPLPEPTPACPPDLDSTSELTPPHVSLHALS 279 Query: 641 GAHTAHTFRVQGHIDDEPVHILVD 712 G ++ TFR+ G I+ P+ IL+D Sbjct: 280 GLPSSETFRLVGIINHSPLTILID 303 >gb|PNX98954.1| retrotransposon-related protein, partial [Trifolium pratense] Length = 957 Score = 103 bits (258), Expect = 1e-21 Identities = 88/302 (29%), Positives = 128/302 (42%), Gaps = 64/302 (21%) Frame = +2 Query: 2 TNAPALVFKISQFFAYHRTPETERITVS---VDAQ-----QWPNHLLDTVSHRIASW-GS 154 ++A +FKISQFF YH+TPE ER+TV+ ++ Q QW + ++++ +W G Sbjct: 65 SDAMGWIFKISQFFDYHQTPEEERLTVASFYMEGQALSWFQWMHR-----NNQLNTWFGF 119 Query: 155 LRTI---------------------------Y*VALSRVSNR------------------ 199 L+ + Y R++NR Sbjct: 120 LQALETRFAPSFYDEPSSALFKLVQRSSVNNYLTEFERLANRIVGLPQPFLLSCFISGLS 179 Query: 200 ---RFEVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQ-TPSRTESSTSKNA 367 R EV A +P +L QA LA+LQEDK++D RL + K S TP + S+ Sbjct: 180 
PEIRREVQALRPVTLCQATALAKLQEDKIDDRRRLFKSKNSTSTLNPTPIASSSAPPLLP 239 Query: 368 TSXXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXX 547 T FR+LS EMA RREKGLC+N DE F+ H+C+ R Sbjct: 240 TPKPNNKV-------NFRKLSPEEMATRREKGLCYNCDETFTPLHKCRGRFFLLVADDDC 292 Query: 548 XXXXXXXXXXXVWPIVDPEAEPT------QFALHTMTGAHTAHTFRVQGHIDDEPVHILV 709 + P P PT Q + H M+G+ T R+ G + + PV +L+ Sbjct: 293 DPDDIPDPPPDIDPTPPPPTLPTTEPSEAQISFHAMSGSADPATIRISGFLANHPVTVLI 352 Query: 710 DG 715 DG Sbjct: 353 DG 354 >dbj|GAU45274.1| hypothetical protein TSUD_99960 [Trifolium subterraneum] Length = 970 Score = 102 bits (255), Expect = 3e-21 Identities = 87/297 (29%), Positives = 126/297 (42%), Gaps = 59/297 (19%) Frame = +2 Query: 2 TNAPALVFKISQFFAYHRTPETERITVSVDAQQWPNHLLDTVSHR---IASW-GSLRTI- 166 T+A +FKISQFF +H+T E ER+TV+ + P H+ I +W G L+ + Sbjct: 63 TDAMGWIFKISQFFDFHQTTEEERLTVASFYMEGPALSWYQWMHKNNQINTWFGFLQALE 122 Query: 167 --------------------------Y*VALSRVSNR---------------------RF 205 Y + R++NR R Sbjct: 123 MRFAPSYYDEPSSALFKLVQKTTVNSYLIKFERLTNRIVGLPQPFLLSCFISGLSPEIRR 182 Query: 206 EVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQTPSRTESSTSKNATSXXXX 385 EV A +P SL+QA LA+LQEDK+ D RL + K + A T + +S N + Sbjct: 183 EVQALRPLSLTQATALAKLQEDKIEDRRRLFKTK---TSASTTTTNALPSSSNLPALLPN 239 Query: 386 XXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXXXXXX 565 FR+LS EMA+RREKGLC+N DE F+ H+CK R Sbjct: 240 PKPPNRV--NFRKLSPEEMANRREKGLCYNCDETFTPQHKCKGRFFLLIADDDFDSDEPP 297 Query: 566 XXXXXVW-------PIVDPEAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715 + PI+ +E Q + H M+G+ T R+ G + + PV +L+DG Sbjct: 298 IPPPTIESPPPESPPIITDPSE-AQISFHAMSGSTDQTTIRIPGRLANHPVTVLIDG 353 >gb|PNX92469.1| hypothetical protein L195_g015607 [Trifolium pratense] Length = 566 Score = 102 bits (254), Expect = 3e-21 Identities = 91/290 (31%), Positives = 119/290 (41%), Gaps = 52/290 (17%) Frame = +2 Query: 2 TNAPALVFKISQFFAYHRTPETERITVS--------VDAQQWP---------NHLLDTVS 130 T+A +FKI QFF YH TPE ERIT++ + QW N L + Sbjct: 61 
TDAHGWIFKICQFFTYHETPEEERITIASFYLDGPALSWYQWMYRNSQLVSWNQFLQALE 120 Query: 131 HRIASW------GSLRTI--------Y*VALSRVSNR---------------------RF 205 R A G+L + Y V ++NR R Sbjct: 121 TRFAPTAYDDPRGNLFKLTQSTTVAAYLVEFEALANRIVGLSSADLLSCFISGLKLDIRR 180 Query: 206 EVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQTPSRTESSTSKNATSXXXX 385 EVLA+QP+SL+QAAGLARLQEDK+ D R R P TP S+ + T Sbjct: 181 EVLARQPTSLTQAAGLARLQEDKLLDQQRANR------PKFTPPPPRYSSDSSITRPSPG 234 Query: 386 XXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXXXXXX 565 RFR LS E+A+RREKGLCF Y R ++ K Sbjct: 235 LLPTPPAKPRFRHLSEPELAERREKGLCFXYQNRSWRNAGKKDYVSTVIKSDQELEAIVT 294 Query: 566 XXXXXVWPIVDPEAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715 + ++ Q +LH ++G + TF+V G I V ILVDG Sbjct: 295 TPDSS----NNDDSPSAQLSLHALSGHQASDTFKVTGKIATHTVDILVDG 340 >dbj|GAU40605.1| hypothetical protein TSUD_28110 [Trifolium subterraneum] Length = 1208 Score = 102 bits (254), Expect = 4e-21 Identities = 89/298 (29%), Positives = 123/298 (41%), Gaps = 60/298 (20%) Frame = +2 Query: 2 TNAPALVFKISQFFAYHRTPETERITVS--------VDAQQWPNH---------LLDTVS 130 T+A +FKISQFF YH TPE ER+TV+ + QW LL + Sbjct: 63 TDAMGWIFKISQFFDYHNTPEEERLTVASFYMDGPALSWYQWMFRNGLITTWFALLQAIE 122 Query: 131 HRIA---------------SWGSLRTIY*VALSRVSNR---------------------R 202 R A G L Y RV+NR R Sbjct: 123 TRFAPSYYDDPSQALFKLTQRGPLNQ-YLTEFERVANRIVGLPQPFLLSCFISGLSPEIR 181 Query: 203 FEVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQTPSRTESSTSKNATSXXX 382 EV A QP+SLS A LA+LQEDK+ + R + P Q + + SS++ N+T Sbjct: 182 REVQALQPASLSLATALAKLQEDKIEERRR------NYKPRQNNTSSSSSSNTNSTPLLP 235 Query: 383 XXXXXXXXXX-RFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXXXX 559 +FR+LS+ EM+ RREKGLC+N D+ F+ H+CK R Sbjct: 236 SPTTPSNPPRVQFRKLSSEEMSSRREKGLCYNCDDTFTPGHKCKGRFYLLVSDDPESPPV 295 Query: 560 XXXXXXXVWPIVDPEAE------PTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715 P D E Q + H+++G+ T R+ G I + V +L+DG Sbjct: 296 EPLSIQS--PETDTENHLDTPDLDAQISFHSLSGSSATATLRIPGQIANHSVTVLIDG 351