BLASTX nr result
ID: Glycyrrhiza36_contig00018950
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza36_contig00018950 (1322 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value GAU15708.1 hypothetical protein TSUD_307180 [Trifolium subterran... 331 2e-98 GAU45704.1 hypothetical protein TSUD_86800 [Trifolium subterraneum] 320 5e-97 GAU17048.1 hypothetical protein TSUD_105470 [Trifolium subterran... 308 6e-94 GAU38852.1 hypothetical protein TSUD_154140 [Trifolium subterran... 314 4e-92 GAU31820.1 hypothetical protein TSUD_58210 [Trifolium subterraneum] 308 2e-90 KYP68633.1 Retrovirus-related Pol polyprotein from transposon TN... 301 1e-88 GAU27958.1 hypothetical protein TSUD_146760 [Trifolium subterran... 286 5e-88 GAU41868.1 hypothetical protein TSUD_366150 [Trifolium subterran... 290 7e-87 GAU31058.1 hypothetical protein TSUD_214940 [Trifolium subterran... 291 3e-86 GAU39523.1 hypothetical protein TSUD_222930 [Trifolium subterran... 291 2e-84 GAU46782.1 hypothetical protein TSUD_351810 [Trifolium subterran... 289 3e-83 GAU47513.1 hypothetical protein TSUD_138850 [Trifolium subterran... 285 4e-82 AJY78067.1 putative polyprotein [Glycine max] 276 9e-81 KYP42564.1 Retrovirus-related Pol polyprotein from transposon TN... 278 2e-79 GAU31769.1 hypothetical protein TSUD_22150 [Trifolium subterraneum] 275 1e-78 KYP41351.1 hypothetical protein KK1_037277 [Cajanus cajan] 263 3e-78 KYP40816.1 Retrovirus-related Pol polyprotein from transposon TN... 270 5e-77 XP_012570493.1 PREDICTED: uncharacterized protein LOC101511025 [... 266 8e-77 KYP68194.1 Retrovirus-related Pol polyprotein from transposon TN... 264 5e-76 KYP42296.1 Retrovirus-related Pol polyprotein from transposon TN... 261 2e-75 >GAU15708.1 hypothetical protein TSUD_307180 [Trifolium subterraneum] Length = 1433 Score = 331 bits (849), Expect = 2e-98 Identities = 178/425 (41%), Positives = 257/425 (60%), Gaps = 15/425 (3%) Frame = -1 Query: 1232 FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKGLNDNFAMVKS 1053 FFT+L G+WEEL+ +RP+P CTC RC CE +R AR R+ED V+ FL GLND++A+V+S Sbjct: 127 FFTKLRGIWEELDVFRPVPTCTCIARCQCEGIRNARKLRQEDLVLIFLTGLNDHYAVVRS 186 Query: 1052 QILLMKDLPSLNTVLSMVLEHERQNGLEPIQDASQSLVNAADGKKHY-GRGRGKW----- 891 QIL+M+ P +NT SM+++HE NGLE ++ A Q +N A G +H +G+ + Sbjct: 187 QILIMEPFPDINTAFSMIIQHESFNGLEAVE-APQVELNLAHGARHAPSKGKSAYHPPSN 245 Query: 890 SNKQCSHCGKAGHTIDVCYEIHGYPIGYKRNFKPSVNLAADAS-------DTKSEMGDKM 732 ++ C+ CG+ GH I +CY +GYP G+K + +A AS DTK Sbjct: 246 GDQSCTFCGRKGHDISICYRKNGYPPGFKYRDGSTPPKSAMASYVASANSDTKPAAAKST 305 Query: 731 ASSSISQEEFKQLMDLLKKVNVAQPSPVLKE--EHSVNQIYADAEGIISCRSNSICIFSP 558 S S EEF+ L LLK + SP L + S + D GI S N++ S Sbjct: 306 GSLGFSAEEFEALRSLLKN-HKPSASPHLHQFTTASSSSSAEDTRGITSF--NALSPNSL 362 Query: 557 WIIDSGATDHICNDLNSFELSHKIDPIGVRLPNGASIIAKLAGTVRITADLVLEDVLYLP 378 WIIDSGATDH C+ L+ F K+ PI VRLPNG+ + + G + IT + L +VLYLP Sbjct: 363 WIIDSGATDHACSSLSMFSHYTKVSPIPVRLPNGSIVNTDIIGDIHITDTIALTNVLYLP 422 Query: 377 EFNINLISIPKLTRGSHFSVRFVNENCIIQDPLQRKIGSGKLKHGLYHLDYGGGKVVPVV 198 F NL+S+ ++T + F C I + QR IGSGKL +GLY+L+ +V Sbjct: 423 HFTYNLLSVSRVTHQLACTFTFAFNMCTIHNSQQRMIGSGKLLNGLYYLEGTNASTHSLV 482 Query: 197 SHVNATVDSSYTIPNSAIWHFRFGHASLAKIDMLSKNFPAIHVNKSLVCDICHLARQRRL 18 V TV + ++IP SA+WHFRFGHAS ++++++ K++P+I +NK VCD+CHLA+Q++L Sbjct: 483 KPVTGTVCTVFSIPQSALWHFRFGHASNSRLEIMHKSYPSISINKDCVCDVCHLAKQKKL 542 Query: 17 PFVVS 3 + +S Sbjct: 543 SYSLS 547 >GAU45704.1 hypothetical protein TSUD_86800 [Trifolium subterraneum] Length = 902 Score = 320 bits (819), Expect = 5e-97 Identities = 180/439 (41%), Positives = 266/439 (60%), Gaps = 22/439 (5%) Frame = -1 Query: 1262 IEAGTHVCH*FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKG 1083 ++ G+ F+++L LWEE E Y +P+CTC RCTCEAMR+AR + Y +RFL G Sbjct: 144 LQQGSKTVTEFYSDLKILWEEFEIYMSVPQCTCRSRCTCEAMRSARQNHLVLYAVRFLTG 203 Query: 1082 LNDNFAMVKSQILLMKDLPSLNTVLSMVLEHERQNGLEPIQDASQSLVNAADGKKHYGRG 903 LN+NFAMVKSQILL+ LP +N + SMVL+HERQ + D S+ LVNAAD KK Y + Sbjct: 204 LNENFAMVKSQILLIDPLPPMNKIFSMVLQHERQGNFASV-DESKVLVNAADSKKPYYKN 262 Query: 902 --------RGKWSNKQCSHCGKAGHTIDVCYEIHGYPIGYKRNFKPSVNLAADASDTKSE 747 GK N+ C++C + GHT+D C++ HGYP +RNF N + + SD++S+ Sbjct: 263 SKPNFQSFNGK-GNRHCTYCDRQGHTVDGCFKKHGYPPHMQRNFGSVHNTSTEGSDSQSQ 321 Query: 746 MGDKMASS-----SISQEEFKQLMDLLKKVNVAQPSPVLKEEHSVNQIYA------DAEG 600 ++ SS S++Q++F QLM LL+ + Q S + H VN + D +G Sbjct: 322 QMERGESSNSSPASLTQDQFDQLMLLLQSSGMNQSSG-SQTSHQVNSSQSFGPSSNDIKG 380 Query: 599 IISCRSNSICIFS--PWIIDSGATDHICNDLNSFELSHKIDPIGVRLPNGASIIAKLAGT 426 +S S+ C S WIIDSGA+DHIC L+ F+ ++I PI +RLPNG +AK+AGT Sbjct: 381 SVSISSSFCCNISQGSWIIDSGASDHICGSLHWFDSYNQIKPINIRLPNGHLTVAKIAGT 440 Query: 425 VRITADLVLEDVLYLPEFNINLISIPKLTRGSHFSVRFVNENCIIQD-PLQRKIGSGKLK 249 ++ + L +E+VLY+ +F +NLIS+ KL +V F CIIQ+ +R IGS + Sbjct: 441 IKFSDSLTIENVLYVSDFTLNLISVSKLCHALGCTVLFNGSTCIIQERESKRMIGSAEQI 500 Query: 248 HGLYHLDYGGGKVVPVVSHVNATVDSSYTIPNSAIWHFRFGHASLAKIDMLSKNFPAIHV 69 LY+L V+A+ S+ ++ +SA+WHFR GH S +++ + FP ++V Sbjct: 501 EDLYYL-------ALQTKEVHASNVSTNSLLDSALWHFRLGHLSTSRMISMRSEFPFVNV 553 Query: 68 NKSLVCDICHLARQRRLPF 12 + ++VCD+C A+QR+L F Sbjct: 554 DNNVVCDVCKYAKQRKLVF 572 >GAU17048.1 hypothetical protein TSUD_105470 [Trifolium subterraneum] Length = 769 Score = 308 bits (790), Expect = 6e-94 Identities = 166/425 (39%), Positives = 250/425 (58%), Gaps = 15/425 (3%) Frame = -1 Query: 1232 FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKGLNDNFAMVKS 1053 FFT+L GLWEELE RP+P CTCT RC CEAMR A+ +EED V+ FL GLNDN+AMV+S Sbjct: 153 FFTKLKGLWEELELSRPVPNCTCTFRCVCEAMRNAKRFKEEDLVLLFLTGLNDNYAMVRS 212 Query: 1052 QILLMKDLPSLNTVLSMVLEHERQNGLEPIQDASQSLVNAAD-GKKHYGRG-RGKWSNKQ 879 Q+LLM+ P LN V +V++HE NGL+ + D +A + +K YG+ + K+ Sbjct: 213 QVLLMEPFPLLNAVFGLVIQHESLNGLDSVDDQLDQTTSAINFARKSYGKNYLPPKTEKK 272 Query: 878 CSHCGKAGHTIDVCYEIHGYPIGYK------------RNFKPSVNLAADASDTKSEMGDK 735 C++C K H +D C+ HG+P GY+ + + + + A+ ++ +S + + Sbjct: 273 CTYCHKTNHIVDNCFRKHGFPPGYRFKDGTVVGGSKNQGYSSANCIDAEDNEAQSSVDTR 332 Query: 734 MASSSISQEEFKQLMDLLKKVNVAQPSPVLKEEHSVNQIYADAEGIISCRSNSICIFSPW 555 M + S E+++ LM LLK A + ++V ++ A + + N W Sbjct: 333 M---TFSAEDYQALMALLKSTKNAGEG--TSQVNNVTKVIASSYS-NDKQGNVPNHLDTW 386 Query: 554 IIDSGATDHICNDLNSFELSHKIDPIGVRLPNGASIIAKLAGTVRITADLVLEDVLYLPE 375 I+DSGATDH+C L+ F K+ PI V+LPNG + + G + +T + L +VLY+P Sbjct: 387 ILDSGATDHVCASLSLFTAYKKVHPIPVKLPNGNIVTTDIIGDILVTPSITLRNVLYMPH 446 Query: 374 FNINLISIPKLTRGSHFSVRFVNENCIIQDPLQRKIGSGKLKHGLYHLDYGGGKVVPVVS 195 F+ NLIS+ ++++ F + CIIQ+ LQR IGSG++ +GLY+L+ G + P + Sbjct: 447 FSFNLISVSRVSKDLDCVFAFTDNLCIIQNSLQRMIGSGRMFNGLYYLE--GTQSQPNIQ 504 Query: 194 HVNATVDSSYTIPNSAIWHFRFGHASLAKIDMLSKNFPAIHVNK-SLVCDICHLARQRRL 18 N +S IP A+WHFR GH S ++D+L K +P I VNK CD+CH A+QR+L Sbjct: 505 TGNKC--NSIAIPRDALWHFRLGHTSKNRLDILHKLYPNIEVNKVDFCCDVCHFAKQRKL 562 Query: 17 PFVVS 3 P+ S Sbjct: 563 PYDTS 567 >GAU38852.1 hypothetical protein TSUD_154140 [Trifolium subterraneum] Length = 1494 Score = 314 bits (804), Expect = 4e-92 Identities = 174/428 (40%), Positives = 253/428 (59%), Gaps = 18/428 (4%) Frame = -1 Query: 1232 FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKGLNDNFAMVKS 1053 FFT+L GLWEELE RP+P CTCT RC CEAMR A+ +EED ++ FL GLND++AMV+S Sbjct: 155 FFTKLKGLWEELELSRPIPTCTCTFRCVCEAMRNAKKFKEEDLILLFLTGLNDHYAMVRS 214 Query: 1052 QILLMKDLPSLNTVLSMVLEHERQNGLEPIQDASQSLVNAAD-GKKHYGRGRG-KWSNKQ 879 QILLM+ P LN V +V++HE NGL+ D +A + +K YG+ S K+ Sbjct: 215 QILLMEPFPVLNAVFGLVIQHESINGLDITDDQLDPTASAINFARKSYGKANSTSQSQKK 274 Query: 878 CSHCGKAGHTIDVCYEIHGYPIGYKRNFKPSVNLAADASDTKS----EMGDKMASSSI-- 717 C++C K H +D C+ HG+P GY+ FK + + S D M SS+ Sbjct: 275 CTYCHKTNHVVDNCFRKHGFPPGYR--FKDGTVVGSKNQGQSSANCVNADDNMEQSSVDT 332 Query: 716 ----SQEEFKQLMDLLKKVNVAQPSPVLKEEHSVNQIYA-----DAEGIISCRSNSICIF 564 S E+++ LM LLK N + ++V++ A D +G + ++ Sbjct: 333 RMTFSAEDYQALMALLK--NSKSAGEGSSQVNNVSKFIASSFTNDKQGNVPNHLDT---- 386 Query: 563 SPWIIDSGATDHICNDLNSFELSHKIDPIGVRLPNGASIIAKLAGTVRITADLVLEDVLY 384 WIIDSGATDH+C L+ F K++PI V+LPNG+ + + G + IT + L+ VLY Sbjct: 387 --WIIDSGATDHVCASLSLFTEYRKVNPIPVKLPNGSIVTTDIIGNISITPTITLKHVLY 444 Query: 383 LPEFNINLISIPKLTRGSHFSVRFVNENCIIQDPLQRKIGSGKLKHGLYHLDYGGGKVVP 204 +P F+ NLIS+ ++++ F + C IQ+ LQR IGSG++ +GLY+L+ G P Sbjct: 445 MPHFSFNLISVSRVSKDLDCVFAFTDNLCFIQNSLQRMIGSGRMLNGLYYLE--GTHSQP 502 Query: 203 VVSHVNATVDSSYTIPNSAIWHFRFGHASLAKIDMLSKNFPAIHVNK-SLVCDICHLARQ 27 + + +S IPN+A+WHFRFGH S ++++L K +P I VNK CD+CHLA+Q Sbjct: 503 --NLLTGKQCNSLAIPNNALWHFRFGHTSQNRLEILQKLYPTIEVNKVDFCCDVCHLAKQ 560 Query: 26 RRLPFVVS 3 R+LP+V S Sbjct: 561 RKLPYVTS 568 >GAU31820.1 hypothetical protein TSUD_58210 [Trifolium subterraneum] Length = 1409 Score = 308 bits (790), Expect = 2e-90 Identities = 173/448 (38%), Positives = 259/448 (57%), Gaps = 31/448 (6%) Frame = -1 Query: 1262 IEAGTHVCH*FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKG 1083 ++ GT +F EL LWEEL ++ P+P C C C C AM+ R ED V++FL+G Sbjct: 135 LKQGTRSVLDYFIELKALWEELNSHHPIPVCICVHPCRCPAMQLVRNYGHEDQVLQFLQG 194 Query: 1082 LNDNFAMVKSQILLMKDLPSLNTVLSMVLEHERQNGLEPIQDASQSLVNAADGKKHYGRG 903 LNDNF++VK+Q+LL+ LP++N V S+V++ E + D S SL+NAA + G+G Sbjct: 195 LNDNFSIVKTQVLLLDPLPTINKVYSLVVQEESNHKSIASHDDSSSLINAAQRYEAKGKG 254 Query: 902 -----RGKWSNKQCSHCGKAGHTIDVCYEIHGYPIGYKRNFKPSVNLA------ADASDT 756 + K SN+QC+ C ++GHT+D CY+ HG+P +K+N + SVN A A AS + Sbjct: 255 IASSSQSKNSNRQCTFCHRSGHTVDFCYQKHGHP-SFKKN-RSSVNAANTQVVQAPASVS 312 Query: 755 KSEMGDKMASSS-ISQEEFKQLMDLLKKVNVAQPSPVLKEEHSVNQIYA----------- 612 +E+G +SS ISQE+F QLM LL + N+ S S NQ A Sbjct: 313 NTEVGSSSGTSSPISQEQFGQLMALLHQTNLLPASENSAPSSSTNQFSATPMTQGHPPHE 372 Query: 611 DAEGIISCRSNSICI-FSPWIIDSGATDHICNDLNSFELSHKIDPIGVRLPNGASIIAKL 435 + GI+ NS+ W++D GA DH+C+ + F + I P+ ++LPNG S+I + Sbjct: 373 SSSGILLTNVNSLTANHDYWLLDFGANDHVCSSYHHFSSFYPIKPVHIKLPNGHSVIVQY 432 Query: 434 AGTVRITADLVLEDVLYLPEFNINLISIPKLTRGSHFSVRFVNENCIIQDPLQRK-IGSG 258 AG ++ + L L DVLY PEF++NLIS+ KL + + S++F + C++QD + +K IG G Sbjct: 433 AGNIQFSESLYLTDVLYSPEFHLNLISVSKLCKNLNCSIQFFDHKCLLQDMITKKMIGLG 492 Query: 257 KLKHGLYHLDYG----GGKVVP--VVSHVNATVDSSYTIPNSAIWHFRFGHASLAKIDML 96 GLY L Y + +P +N +S IP SA+WHFR GH S ++ + Sbjct: 493 DQVDGLYRLQYNHTFLASQALPQSFPKSINNVAVNSVVIPVSALWHFRLGHVSNNRLLRM 552 Query: 95 SKNFPAIHVNKSLVCDICHLARQRRLPF 12 SK +P + ++ VCD+C AR+R+LPF Sbjct: 553 SKLYPFLSIDNKAVCDVCQFARKRKLPF 580 >KYP68633.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1108 Score = 301 bits (770), Expect = 1e-88 Identities = 171/421 (40%), Positives = 244/421 (57%), Gaps = 11/421 (2%) Frame = -1 Query: 1232 FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKGLNDNFAMVKS 1053 FFTEL LWEELE YR + +CTC VRC C+A +FL GLND+F +VKS Sbjct: 154 FFTELKRLWEELECYRTILDCTCPVRCNCQAQ------------FQFLNGLNDHFNVVKS 201 Query: 1052 QILLMKDLPSLNTVLSMVLEHERQNGLEPIQDASQSLVNAADGKKHYGRGRGK--W---S 888 QILLM LPSLN V SM+++HERQ G + + + D +K +G G+G W S Sbjct: 202 QILLMDPLPSLNKVFSMIIQHERQ-GNPSLGEEIKVFAGTTDNRKSFGIGKGNNSWGRGS 260 Query: 887 NKQCSHCGKAGHTIDVCYEIHGYPIGY-KRNFKPSVNLAADASDTKSEMGDKMASSS--- 720 K CS+CGK+GHT+DVCY+ HGYP+ + +N N+ + ++ + K SS+ Sbjct: 261 GKVCSYCGKSGHTVDVCYKKHGYPLNFGSKNNSTVQNIIQEETEENEDQSRKEDSSNSQQ 320 Query: 719 -ISQEEFKQLMDLLKKVNVAQPSPVLKEEHSVNQIYADAEGIISCRSNSICIFSPWIIDS 543 I+QE+++ L+ L+++ N+ S H+ NQ+ D + + S Sbjct: 321 VITQEQYRNLLALIQQSNLQASS-----SHTSNQVSTDPP-------------TQFHSSS 362 Query: 542 GATDHICNDLNSFELSHKIDPIGVRLPNGASIIAKLAGTVRITADLVLEDVLYLPEFNIN 363 GATDHIC L F+ H+I P+ V LPNG +AK +GTV + L L++VL+LPEFN N Sbjct: 363 GATDHICCSLTLFDTFHEIKPVSVTLPNGHQTLAKFSGTVMFSPALTLKNVLFLPEFNFN 422 Query: 362 LISIPKLTRGSHFSVRFVNENCIIQD-PLQRKIGSGKLKHGLYHLDYGGGKVVPVVSHVN 186 L+S+ KL R S+ F +NC +QD R IG K GLY L + + V N Sbjct: 423 LVSVSKLCRNSNCLASFSFKNCQLQDMKSTRMIGLAKQVGGLYLLKAKTQEKMAEVQVSN 482 Query: 185 ATVDSSYTIPNSAIWHFRFGHASLAKIDMLSKNFPAIHVNKSLVCDICHLARQRRLPFVV 6 T +S IP S++WHFR GH S +++ +S+ P I +NK VCDICHLA++++LP+++ Sbjct: 483 ITTES---IPESSLWHFRLGHLSHERLETMSRENPIIFINKDAVCDICHLAKKKKLPYLM 539 Query: 5 S 3 S Sbjct: 540 S 540 >GAU27958.1 hypothetical protein TSUD_146760 [Trifolium subterraneum] Length = 531 Score = 286 bits (733), Expect = 5e-88 Identities = 151/388 (38%), Positives = 231/388 (59%), Gaps = 11/388 (2%) Frame = -1 Query: 1232 FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKGLNDNFAMVKS 1053 FFT+L GLWEELE YRP+P CTCT +C C+AMR A+ REED V+ FL GL+D++ MV+S Sbjct: 153 FFTKLKGLWEELELYRPIPNCTCTFQCVCDAMRNAKKFREEDLVLLFLTGLSDHYGMVRS 212 Query: 1052 QILLMKDLPSLNTVLSMVLEHERQNGLEPIQDASQS-LVNAADGKKHYGRGRGKWSNKQC 876 QILLM+ P LN+ MV++HE NGL+ ++D + S +N YG+ K S+K C Sbjct: 213 QILLMEPFPQLNSPFGMVIQHESLNGLDQLEDQTISGSINYVRKPSSYGKYPPK-SDKLC 271 Query: 875 SHCGKAGHTIDVCYEIHGYPIGY----------KRNFKPSVNLAADASDTKSEMGDKMAS 726 ++C K H +D C++ HG+P G+ + N + S N ++ + + DK + Sbjct: 272 TYCHKTNHIVDNCFKKHGFPPGFRFRDGTVAGSRHNGQASKNYF--NAENQEQTSDKRVA 329 Query: 725 SSISQEEFKQLMDLLKKVNVAQPSPVLKEEHSVNQIYADAEGIISCRSNSICIFSPWIID 546 +S S EE++ LM LLK + + + +S ++ +A + + F WI+D Sbjct: 330 ASFSNEEYQALMALLKSTSNSAGESSSSQVNSFSKCFASS----ASNDKQGTGFIQWILD 385 Query: 545 SGATDHICNDLNSFELSHKIDPIGVRLPNGASIIAKLAGTVRITADLVLEDVLYLPEFNI 366 SGATDH+CN L+ F I P+ V+LPNG + ++ G +++T+ L+L VLY+P F+ Sbjct: 386 SGATDHVCNSLSFFTNHRSIPPLLVKLPNGNLVSTEIVGDIQVTSTLILHGVLYMPNFHY 445 Query: 365 NLISIPKLTRGSHFSVRFVNENCIIQDPLQRKIGSGKLKHGLYHLDYGGGKVVPVVSHVN 186 NLIS+ K+ + F ++ C+IQ +Q+ IGSG+L HGLY+L + + + Sbjct: 446 NLISLSKIVLDLDCRIVFTDDLCLIQTKMQKMIGSGRLIHGLYYLQETSSDNSSMCTGKS 505 Query: 185 ATVDSSYTIPNSAIWHFRFGHASLAKID 102 +S IP SA+WHFRFGH S +++D Sbjct: 506 C---NSIVIPKSALWHFRFGHTSQSRLD 530 >GAU41868.1 hypothetical protein TSUD_366150 [Trifolium subterraneum] Length = 792 Score = 290 bits (743), Expect = 7e-87 Identities = 161/416 (38%), Positives = 243/416 (58%), Gaps = 6/416 (1%) Frame = -1 Query: 1232 FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKGLNDNFAMVKS 1053 F+++L L EELE Y PMP C+C VRC+CEAMR+ARA+ ++RFL GLND+F++VKS Sbjct: 149 FYSDLKLLCEELEIYLPMPNCSCRVRCSCEAMRSARANHTLLNIVRFLTGLNDHFSVVKS 208 Query: 1052 QILLMKDLPSLNTVLSMVLEHERQNGLEPIQDASQSLVNAADGKKHYGRGRGKWSNKQCS 873 Q+LLM LP LN V SMVL+HERQ P D S++L+NAA+ K H+ K + + C+ Sbjct: 209 QVLLMDPLPPLNKVFSMVLQHERQGNFYP-SDESKALLNAANSKGHF---NPKSTVRICT 264 Query: 872 HCGKAGHTIDVCYEIHGYPIGYKRNFKPSVNLAADASDTKSEMGDKMASSSISQEEFKQL 693 CGK H ++ C++ +G P K+N + N A + + + I+Q++ QL Sbjct: 265 LCGKDNHIVENCFKKYGIPPHMKKN-STAHNAAIEGGSEEPIASESTLGPPITQDQALQL 323 Query: 692 MDLLKKVNVAQPSPVLKEEHSVNQIYADAEGIISCRSNSI---CIFSPWIIDSGATDHIC 522 + LL+ Q S + D + + + I C W++DSGA+ H+C Sbjct: 324 ISLLQSSFPGQSSGTASSNQVGSVDIIDHPSMTKGKQSHIFQACSLGNWLVDSGASHHMC 383 Query: 521 NDLNSFELSHKIDPIGVRLPNGASIIAKLAGTVRITADLVLEDVLYLPEFNINLISIPKL 342 N + F S +I PI ++ PNG +++AK +GTV+ + ++ DVLY+P+F++NLIS+ +L Sbjct: 384 NSIQWFHSSSEIIPIKIKSPNGNTVLAKHSGTVKFSPSFIITDVLYVPQFSVNLISVSQL 443 Query: 341 TRGSHFSVRFVNENCIIQDPL-QRKIGSGKLKHGLYHLDYGGGKVVPVVSHVNATVDSSY 165 + F + C IQD L +R IG ++ GLY+L+ V H T D S+ Sbjct: 444 CATQKYITDFNSVQCSIQDSLTKRMIGFVDMREGLYYLNLTNKDV-----HA-YTADGSH 497 Query: 164 T--IPNSAIWHFRFGHASLAKIDMLSKNFPAIHVNKSLVCDICHLARQRRLPFVVS 3 IP A+WHFR GH S +++ +L FP I V+ VCDICHLAR +++P+ +S Sbjct: 498 NTHIPEPALWHFRLGHMSFSRMQLLKSQFPFISVDNKSVCDICHLARHKKIPYNIS 553 >GAU31058.1 hypothetical protein TSUD_214940 [Trifolium subterraneum] Length = 927 Score = 291 bits (746), Expect = 3e-86 Identities = 173/441 (39%), Positives = 253/441 (57%), Gaps = 31/441 (7%) Frame = -1 Query: 1232 FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKGLNDNFAMVKS 1053 +FTE+ +WEEL+ +RP+P+CTC C C AMR+AR+ R ED +++FL GLN+ F ++ Sbjct: 153 YFTEMRAMWEELDQFRPIPQCTCPFMCVCVAMRSARSYRTEDKIIQFLMGLNEQFKNIRC 212 Query: 1052 QILLMKDLPSLNTVLSMVLEHERQN--GLEPIQDA----SQSLVNAAD---GKKHYGRGR 900 QILLM+ L ++N VLSMVL+ ERQ GL P DA S +L N+ D ++ YGRGR Sbjct: 213 QILLMEPLSTINKVLSMVLQEERQQSYGLTPQVDAKTESSDALANSVDRHGARRGYGRGR 272 Query: 899 GKWS-------------NKQCSHCGKAGHTIDVCYEIHGYP--IGYKRNFKPSVNLAADA 765 G +S K CS+CGK GHTID+CY+ HGYP GY R + + Sbjct: 273 GNFSYQGGRGRGNNSNTAKVCSYCGKNGHTIDICYKKHGYPPNWGYTRGNNGGNSSVNNV 332 Query: 764 SDTKSEMGDKMASSSISQEEFKQLMDLLKKVNVAQPSPVLKEEHSVNQIYADAEGIIS-- 591 + G ++ S++++++ L+ LL++ N+ P +HS N + ++ S Sbjct: 333 EVDHDDEGGN-SNVSLTKDQYNSLLALLERNNLDNP------QHSTNFVKGESSQCYSAG 385 Query: 590 -CRSNSICIFSPWIIDSGATDHICNDLNSF-ELSHKIDPIGVRLPNGASIIAKLAGTVRI 417 + S S WIID+GATDHICN ++ F + I PI V LPNG I+A + G V++ Sbjct: 386 VAGNTSTHCLSDWIIDTGATDHICNSMHWFMSYTAVIPPISVGLPNGNKILAHVIGKVKL 445 Query: 416 TADLVLEDVLYLPEFNINLISIPKLTRGSHFSVRFVNENCIIQD-PLQRKIGSGKLKHGL 240 DL+L+ VLYL +FN+NL+S+ +L +G++ + F N + IQ+ RKIG K GL Sbjct: 446 NDDLILDKVLYLLDFNVNLLSVSRLVKGNNCVLSFENASFTIQEKDNMRKIGLAKQSDGL 505 Query: 239 YHLDYGGGKVVPVVSHVNATVDSSYTIPNSA--IWHFRFGHASLAKIDMLSKNFPAIHVN 66 Y L P + +V S T N++ +WH R GH S +I L+K + I Sbjct: 506 YFLK-------PCQVSQSISVHSIDTSSNNSGLLWHLRLGHLSFERIKCLNKRYSYIPAL 558 Query: 65 KSLVCDICHLARQRRLPFVVS 3 CD+CH+A+Q+RLPF VS Sbjct: 559 DHNPCDVCHMAKQKRLPFPVS 579 >GAU39523.1 hypothetical protein TSUD_222930 [Trifolium subterraneum] Length = 1210 Score = 291 bits (744), Expect = 2e-84 Identities = 165/416 (39%), Positives = 242/416 (58%), Gaps = 8/416 (1%) Frame = -1 Query: 1232 FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKGLNDNFAMVKS 1053 F+++L LWEELE Y P+P C+C RCTCEAMR AR + Y++RFL GLN++FAMVKS Sbjct: 100 FYSDLKLLWEELEIYMPIPNCSCRQRCTCEAMRAARNNHNLLYIIRFLTGLNESFAMVKS 159 Query: 1052 QILLMKDLPSLNTVLSMVLEHERQNGLEPIQDASQSLVNAADGKKHYGRGRGKWS--NKQ 879 QILL+ LP +N V SMVL+HERQ I D S+ L+NAA K+ G + N+ Sbjct: 160 QILLIDPLPPMNNVFSMVLQHERQLSAH-ISDDSKILINAA---KYRGSSSSSFKPPNRV 215 Query: 878 CSHCGKAGHTIDVCYEIHGYPIGY-KRNFKPSVNLAADASDTKSEMGDKMASSSISQEEF 702 C+ CGK H ++ C++ HG P + K + + + DT + S++I+Q++ Sbjct: 216 CTLCGKDNHIVENCFKKHGLPPHFRKASSSNNAMVEGGFDDTAGATESTIGSNTITQDQA 275 Query: 701 KQLMDLLK----KVNVAQPSPVLKEEHSVNQIYADAEGIISCRSNSICIFSPWIIDSGAT 534 QL+ LL+ N S + + G +S SN+ C WI+DSGA+ Sbjct: 276 LQLISLLQSSFPNSNSDNASSSKAGSNEFTGHTSVNPGNVSGSSNA-CSLGNWILDSGAS 334 Query: 533 DHICNDLNSFELSHKIDPIGVRLPNGASIIAKLAGTVRITADLVLEDVLYLPEFNINLIS 354 HICN + F ++I PI V+LPNG + AK +G V+ ++ L+L DVL +P F++NL+S Sbjct: 335 HHICNTVQWFHSYNEITPIRVKLPNGNHVFAKQSGIVKFSSSLILTDVLCVPNFSVNLLS 394 Query: 353 IPKLTRGSHFSVRFVNENCIIQDPLQR-KIGSGKLKHGLYHLDYGGGKVVPVVSHVNATV 177 + +L + S + ++F + C IQD + IG GLY+L +V H++A Sbjct: 395 VSQLCKNSKYILQFNDNQCSIQDTATKMMIGFADGIEGLYYLVLQDDEV-----HIHAAE 449 Query: 176 DSSYTIPNSAIWHFRFGHASLAKIDMLSKNFPAIHVNKSLVCDICHLARQRRLPFV 9 S IPN A+WHFR GH SL+++ L FP I V+ VCD+CHLARQ+++P + Sbjct: 450 GSDNIIPNQALWHFRLGHPSLSRLHSLHSKFPYITVDDKGVCDVCHLARQKKIPMI 505 >GAU46782.1 hypothetical protein TSUD_351810 [Trifolium subterraneum] Length = 1512 Score = 289 bits (739), Expect = 3e-83 Identities = 166/433 (38%), Positives = 252/433 (58%), Gaps = 23/433 (5%) Frame = -1 Query: 1232 FFTELSGLWEELENYRPMPECTCTVRCTC-EAMRTARASREEDYVMRFLKGLNDNFAMVK 1056 FFT L LWEE E Y P P C C +C C + AR + +RFL GLNDNF MV+ Sbjct: 155 FFTALRILWEEFEIYLPAPVCNCPRKCVCVTGVSNARTQHDLLRTIRFLTGLNDNFDMVR 214 Query: 1055 SQILLMKDLPSLNTVLSMVLEHERQ-NGLEPIQDASQSLV--NAADGKKHYGRGRG---- 897 SQILLM LP +N V SMV++HERQ L+ + D S V NA+D ++ GRGR Sbjct: 215 SQILLMDPLPPINKVFSMVIQHERQFTPLQAVLDVEDSKVSVNASDSRRSQGRGRSGFNS 274 Query: 896 --------KWSNKQ--CSHCGKAGHTIDVCYEIHGYPIGYKRNFKPSVNLAADASDTKSE 747 +++NK+ C++CGK H ++ CY+ HG+P Y R + A + D Sbjct: 275 QYNSGFNPQYNNKKKVCTYCGKENHVVENCYKKHGFPPHYGRGSTANNANAGELMDNDDA 334 Query: 746 MGDKMASS-SISQEEFKQLMDLLK-KVNVAQPSPVLKEEHSVNQIYADAEGIISCRSNSI 573 + + S S ++ +++QL++LL+ + + P + A A S S + Sbjct: 335 RSTRGSDSFSFTKAQYEQLVNLLQTSASTSSAGPSTSINGASTSYLAKAGNTNSVFSCNH 394 Query: 572 CIFSPWIIDSGATDHICNDLNSFELSHKIDPIGVRLPNGASIIAKLAGTVRITADLVLED 393 + WIIDSGA+DHIC+ L+ H I+PI V++PNG AK AG+V++ + ++++ Sbjct: 395 FSYGAWIIDSGASDHICSSLSMLTDHHDINPIQVKMPNGTIAYAKQAGSVQLGPNFIIDN 454 Query: 392 VLYLPEFNINLISIPKLTRGSHFSVRFVNENCIIQDPLQRK-IGSGKLKHGLYHLDYGGG 216 VL +PEF++NL+S+P+LT S F V F N +C+IQ+ K IGSG+L GLY+L Sbjct: 455 VLLVPEFSLNLLSVPRLTHNSKFVVLFDNLDCLIQEKKSLKMIGSGELIEGLYYLT---N 511 Query: 215 KVVPVVSHVNATVD--SSYTIPNSAIWHFRFGHASLAKIDMLSKNFPAIHVNKSLVCDIC 42 K PV ++ + +++ S+ IP A+WHFR GH S A++ ++ +FP + +++ VCDIC Sbjct: 512 KPQPVSANSSISINPSSNIHIPKQALWHFRLGHLSHARLLLMQSSFPFVTIDEHAVCDIC 571 Query: 41 HLARQRRLPFVVS 3 HLAR ++L + +S Sbjct: 572 HLARHKKLTYKLS 584 >GAU47513.1 hypothetical protein TSUD_138850 [Trifolium subterraneum] Length = 1469 Score = 285 bits (730), Expect = 4e-82 Identities = 160/415 (38%), Positives = 241/415 (58%), Gaps = 5/415 (1%) Frame = -1 Query: 1232 FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKGLNDNFAMVKS 1053 F+++L +WEELE Y PMP C+C RCTCE+MR+ARA+ Y++RFL GLN+NFA+VKS Sbjct: 146 FYSDLKLIWEELEIYLPMPNCSCRNRCTCESMRSARANHSLLYIIRFLTGLNENFAVVKS 205 Query: 1052 QILLMKDLPSLNTVLSMVLEHERQNGLEPIQDASQSLVNAADGKKHYGRGRGKWSNKQCS 873 QILLM LP +N V S+VL+HERQ P D S L+NAA K + + + C+ Sbjct: 206 QILLMDPLPPMNKVFSLVLQHERQGKFSP-SDDSNVLLNAAKSKGSF----PSKTTRVCT 260 Query: 872 HCGKAGHTIDVCYEIHGYPIGYKRNFKPSVNLAADASDTKSEMGDKMASSSISQEEFKQL 693 CGK H ++ C++ +G P +++N + + +A+ E S+ I+Q++ QL Sbjct: 261 FCGKDNHIVENCFKKYGTPPHFRKNSQAN---SAEIEGGNDEQSTAANSNFITQDQALQL 317 Query: 692 MDLLKKVNVAQPSPVLKEEHSVNQIYADAEGIISCRSNSI---CIFSPWIIDSGATDHIC 522 + LL+ +Q S + + + + I C WI+DSGA+ H+C Sbjct: 318 ISLLQSSFPSQASSSAASNQVGSVEFTGHTSVNQGMHSKIFKTCSLGNWIVDSGASHHMC 377 Query: 521 NDLNSFELSHKIDPIGVRLPNGASIIAKLAGTVRITADLVLEDVLYLPEFNINLISIPKL 342 + + F +I PI VRLPNG ++AK +G V+ + LV+ +VL +P F++NLIS+ +L Sbjct: 378 SSIQCFHSYSEIIPIKVRLPNGNVVLAKHSGVVKFSDSLVITNVLCIPNFSVNLISVSQL 437 Query: 341 TRGSHFSVRFVNENCIIQDPL-QRKIGSGKLKHGLYHLDYGGGKVVPVVSHVNATVDSSY 165 + ++ V F + C IQD L +R IG L GLY+L +V H ++ + + Sbjct: 438 CKIQNYKVHFTDSKCTIQDQLTKRMIGFADLIEGLYYLTLTSKEV-----HAHSIDGTQH 492 Query: 164 T-IPNSAIWHFRFGHASLAKIDMLSKNFPAIHVNKSLVCDICHLARQRRLPFVVS 3 T IP+ A+WHFR GH S K+ +L FP I V+ VCDICHLA+ ++L + VS Sbjct: 493 TNIPDQALWHFRLGHTSNTKMSLLQSVFPFITVDNKGVCDICHLAKHKKLYYKVS 547 >AJY78067.1 putative polyprotein [Glycine max] Length = 886 Score = 276 bits (706), Expect = 9e-81 Identities = 165/439 (37%), Positives = 247/439 (56%), Gaps = 22/439 (5%) Frame = -1 Query: 1253 GTHVCH*FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKGLND 1074 GT F+++L LWEELE Y P+P CTC RC+C+AMR AR +VMRFL GLND Sbjct: 142 GTRSVTTFYSDLKALWEELEIYMPIPNCTCHHRCSCDAMRLARRHHHTLHVMRFLTGLND 201 Query: 1073 NFAMVKSQILLMKDLPSLNTVLSMVLEHERQNGLEPIQDASQSLVNAADGKKHYGRGRGK 894 F VKSQILL++ LPS+ + SMV++ ERQN + P D S++LVNA+ K G G+ Sbjct: 202 EFNAVKSQILLIEPLPSITKIFSMVIQFERQNCV-PNLDDSKALVNASTSKSQ-GSANGR 259 Query: 893 ---WSNKQCSHCGKAGHTIDVCYEIHGYPIGYKRNFKPSVNLAA-------DASDTKSEM 744 S + C++C K H ++ C++ HG P +N S + +A ++S Sbjct: 260 SNSGSKRYCTYCHKTNHFVENCFQKHGVPPHMMKNHSGSAHHSAVDGGERVESSTASQNT 319 Query: 743 GDKMASSSISQEEFKQLMDLLKKVNV----AQPSPVLKEEHSVNQIYADAEG--IISCRS 582 + S++QE+ +L+ L++ +V A S + ++ AD +G I Sbjct: 320 TSVTMTPSLTQEQLDKLLQLIQPPSVNHCNASTSKQVCSFNTAGPSSADTKGMKISHFFY 379 Query: 581 NSIC---IFSPWIIDSGATDHICNDLNSFELSHKIDPIGVRLPNGASIIAKLAGTVRITA 411 +SIC WIIDSGA+ HIC L+ F +I+P+ ++LPNG + K AGTV ++ Sbjct: 380 SSICNNIALDTWIIDSGASHHICASLHWFHSYSEINPMIIKLPNGNHVTTKYAGTVVFSS 439 Query: 410 DLVLEDVLYLPEFNINLISIPKLTRGSHFSVRFVNENCIIQDPLQRK-IGSGKLKHGLYH 234 + +VLY+P F +NLIS+ +L + +++ F + C IQ+ K IG G+ + GLY+ Sbjct: 440 SFSITNVLYVPTFTVNLISVSQLCHHTPYTLNFTDTICSIQEQKSLKMIGLGESRDGLYY 499 Query: 233 LDYGGGKVVPVVSHVNATVDSSYT--IPNSAIWHFRFGHASLAKIDMLSKNFPAIHVNKS 60 L + ++++ S+ IP +AIWHFR GH S ++I +L FP I + S Sbjct: 500 LTQTNKECASSNYNISSIFSSANNVHIPENAIWHFRLGHLSSSRIALLHSQFPFIVNDSS 559 Query: 59 LVCDICHLARQRRLPFVVS 3 VCDICH A+ R+LPFV S Sbjct: 560 SVCDICHFAKHRKLPFVHS 578 >KYP42564.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1427 Score = 278 bits (711), Expect = 2e-79 Identities = 160/445 (35%), Positives = 241/445 (54%), Gaps = 35/445 (7%) Frame = -1 Query: 1232 FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKGLNDNFAMVKS 1053 +FTEL LWEELE+ RP P CTC VRC+C+ + ++ +YV+ FLKGLND + VKS Sbjct: 99 YFTELKILWEELESLRPTPTCTCDVRCSCDLSTKVKDYKDTEYVICFLKGLNDQYNTVKS 158 Query: 1052 QILLMKDLPSLNTVLSMVLEHERQNGLEPIQDASQSLVN---------AADGKKHYGRGR 900 QIL+M LP +N V S+VL+ ERQ + D+ +N +++G+ RGR Sbjct: 159 QILIMDPLPVINKVFSLVLQQERQQNTPNLADSRIFSLNTQSGGNWRSSSNGRGGSSRGR 218 Query: 899 GKW--------SNKQCSHCGKAGHTIDVCYEIHGYPIGYKRNFK---PSVN-LAADASDT 756 G+ + K C+ CGK HTID CY HG+P +K K S+N +++ A T Sbjct: 219 GRTGGRGSFTNAGKVCTFCGKENHTIDSCYFKHGFPPNFKFKDKGNTTSINTISSKAPST 278 Query: 755 K----SEMGDKMASSSISQEEFKQLMDLLKKVNVAQPSPVLKEEHSVNQI--YADAEGII 594 + S +K ++S+ + E++ L+D+LK+ + P EHS+NQ+ E + Sbjct: 279 QASEISRKQNKESTSNFTHEDYDHLIDMLKRAKLQSP------EHSINQLVHQTTTESVS 332 Query: 593 SCRSNSICIFSP-----WIIDSGATDHICNDLNSFELSHKIDPIGVRLPNGASIIAKLAG 429 S +P WI+D+GATDH+CN L F H IDP+ V+LPNG + A+ +G Sbjct: 333 SSNLQQNQPGNPLEHTDWILDTGATDHVCNSLYFFTKYHPIDPVHVKLPNGNTSTAQFSG 392 Query: 428 TVRITADLVLEDVLYLPEFNINLISIPKLTRGSHFSVRFVNENCIIQDPLQRK-IGSGKL 252 T+ + L DVLY+P F++N+IS+ K+ + + F +CIIQD +K IG ++ Sbjct: 393 TIIFSEKFFLNDVLYIPNFHLNIISVQKIAASLDYELMFNKNSCIIQDLTSKKTIGLAEV 452 Query: 251 KHGLYHLDYGG--GKVVPVVSHVNATVDSSYTIPNSAIWHFRFGHASLAKIDMLSKNFPA 78 K+ LY L + S+ T + +WH+R GH S + + + FP Sbjct: 453 KNHLYILQRPSKDNSISACKSNSVLNAQPKGTTSSFDLWHYRLGHPSHVVLQTVKRLFPY 512 Query: 77 IHVNKSLVCDICHLARQRRLPFVVS 3 + NK++ CD CH +Q RLPF S Sbjct: 513 VTYNKNITCDYCHFGKQARLPFPTS 537 >GAU31769.1 hypothetical protein TSUD_22150 [Trifolium subterraneum] Length = 1372 Score = 275 bits (704), Expect = 1e-78 Identities = 159/416 (38%), Positives = 241/416 (57%), Gaps = 6/416 (1%) Frame = -1 Query: 1232 FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKGLNDNFAMVKS 1053 F+++L +WEELE Y PM C+C RCTCE+MR+ RA+ Y++RFL GLN+NFA+VKS Sbjct: 105 FYSDLKLIWEELEIYLPMSNCSCRNRCTCESMRSTRANHFLLYIIRFLTGLNENFAVVKS 164 Query: 1052 QILLMKDLPSLNTVLSMVLEHERQNGLEPIQDASQSLVNAADGKKHYGRGRGKWSNKQCS 873 QILLM LP +N V S+VL+HERQ P D S L+NAA K + + + C+ Sbjct: 165 QILLMDPLPPMNKVFSLVLQHERQGKFSP-SDDSNVLLNAAKSKGSF----PSKTTRVCT 219 Query: 872 HCGKAGHTIDVCYEIHGYPIGYKRNFKPSVNLAADASDTKSEMGDKMASSSISQEEFKQL 693 CGK H ++ C++ +G P +++N + + +A+ E S+ I+Q++ QL Sbjct: 220 LCGKDNHIVENCFKKYGTPPHFRKNSQAN---SAEIEGGNDEQSTAANSNFITQDQALQL 276 Query: 692 MDLLKKVNVAQPSPVLKEEHSVNQIYADAEGIISCRSNS----ICIFSPWIIDSGATDHI 525 + LL+ +Q S + V + + ++ +S C WI+DSGA+ H+ Sbjct: 277 ISLLQNSFPSQASS-SAASNQVGSVEFTSHTSVNQGMHSQFFKTCSLGNWIVDSGASHHM 335 Query: 524 CNDLNSFELSHKIDPIGVRLPNGASIIAKLAGTVRITADLVLEDVLYLPEFNINLISIPK 345 C+ + F +I PI VRLPNG ++AK +G V+ + LV+ +VL +P F++NLIS+ + Sbjct: 336 CSSIQCFHSYSEIIPIKVRLPNGNVVLAKHSGVVKFSDSLVITNVLCIPNFSVNLISVSQ 395 Query: 344 LTRGSHFSVRFVNENCIIQDPL-QRKIGSGKLKHGLYHLDYGGGKVVPVVSHVNATVDSS 168 L + ++ V F + C IQD L +R IGS L GLY+L +V H + + Sbjct: 396 LCKIQNYKVHFTDSKCTIQDQLTKRMIGSADLIEGLYYLTLTSEEV-----HAHNIDGTQ 450 Query: 167 YT-IPNSAIWHFRFGHASLAKIDMLSKNFPAIHVNKSLVCDICHLARQRRLPFVVS 3 +T IP+ A+W FR GH S K+ +L FP I V+ VCDICHLA+ ++L + VS Sbjct: 451 HTNIPDQALWLFRLGHTSNTKMSLLQSVFPFIAVDNKGVCDICHLAKHKKLYYKVS 506 >KYP41351.1 hypothetical protein KK1_037277 [Cajanus cajan] Length = 584 Score = 263 bits (671), Expect = 3e-78 Identities = 149/424 (35%), Positives = 237/424 (55%), Gaps = 14/424 (3%) Frame = -1 Query: 1232 FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKGLNDNFAMVKS 1053 F+T L LW+EL+ P+P CTC +C+C A++ R ++ V+RF +GLND ++ V+S Sbjct: 148 FYTALKILWDELDVLNPLPVCTCNPKCSCGAIKKIEEERNKNQVVRFFRGLNDQYSGVRS 207 Query: 1052 QILLMKDLPSLNTVLSMVLEHERQNGLEPIQDASQSLV---NAADGKKHYGRGRG---KW 891 Q++L+ + P++N V ++V + ERQ E +V +A+ K++ G G G ++ Sbjct: 208 QLMLLDNCPNVNRVFALVAQQERQFATENASMPKALIVASDDASHSKRNQGNGNGWNNRY 267 Query: 890 SNKQCSHCGKAGHTIDVCYEIHGYPIGYKRNFKPSVNLAA-DASDTKSEMGDKMASSSIS 714 ++K+CS C K GHTIDVCY HG+P +K FK S N+ A+ +E + S Sbjct: 268 TSKKCSWCEKMGHTIDVCYRKHGFPSSFK--FKNSKNIVPRSANMLMTEENETNCPEDSS 325 Query: 713 QEEFKQLMDLLKKVNVAQPSPVL---KEEHSVNQIYADAEGIISCR--SNSICIFSPWII 549 ++ ++ + N Q L +++ Q A + + SN + S WI+ Sbjct: 326 GKDTQESVRFGFTTNQYQNLLALLPQQDQRDSTQHTAHVKAFTNSNPTSNGNALISRWIL 385 Query: 548 DSGATDHICNDLNSFELSHKIDPIGVRLPNGASIIAKLAGTVRITADLVLEDVLYLPEFN 369 D+GATDHI N L+ F I PI V LPNG + A ++GT+ +++ VL DVL+LP F Sbjct: 386 DTGATDHISNSLSYFTAYKNIPPIKVSLPNGILVSATISGTIHLSSSFVLTDVLFLPSFK 445 Query: 368 INLISIPKLTRGSHFSVRFVNENCIIQDPLQRK-IGSGKLKHGLY-HLDYGGGKVVPVVS 195 NLIS+ KLT+ + + F+++ C+IQD K IG+ K + GLY V + Sbjct: 446 FNLISVTKLTQTLYCKLTFLDDICLIQDSNACKMIGTAKAERGLYIFTQTVQSSQSSVYN 505 Query: 194 HVNATVDSSYTIPNSAIWHFRFGHASLAKIDMLSKNFPAIHVNKSLVCDICHLARQRRLP 15 ++ + ++ S +WH+R GH + +S+ FP IH NK+ VCDICH+A+QR+L Sbjct: 506 TISKKISCNFPSSLSNLWHYRLGHPCFDRSQAVSEMFPFIHCNKNHVCDICHIAKQRKLS 565 Query: 14 FVVS 3 F +S Sbjct: 566 FPLS 569 >KYP40816.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1298 Score = 270 bits (691), Expect = 5e-77 Identities = 159/454 (35%), Positives = 245/454 (53%), Gaps = 34/454 (7%) Frame = -1 Query: 1262 IEAGTHVCH*FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKG 1083 I+ G + T+L LWEELE RP+P C+CTV+CTC+ ++T R +E +YV+ FLKG Sbjct: 143 IKQGERTISTYHTDLKTLWEELEILRPIPACSCTVKCTCDLVKTVRNYKETEYVICFLKG 202 Query: 1082 LNDNFAMVKSQILLMKDLPSLNTVLSMVLEHERQ---------NGLEPIQDASQSLVNAA 930 LND++ V+SQ+L+M LP++N V S+VL+ ERQ N + I + QS Sbjct: 203 LNDSYNTVRSQVLMMDPLPNINKVFSLVLQQERQVLGNLLQEVNLVASITNNKQSF-GGR 261 Query: 929 DGKKHYGRGRGKWS--------NKQCSHCGKAGHTIDVCYEIHGYPIGYKRNFK-----P 789 +G GRGRG+++ K C+ CGK HT++ CY HG+P + NFK Sbjct: 262 NGSSS-GRGRGRFNQRNFSNQPQKICTFCGKERHTVETCYFKHGFPPNF--NFKDKRQTS 318 Query: 788 SVNLAADASDTKSEMGD-----------KMASSSISQEEFKQLMDLLKKVNVAQPSPVLK 642 +VN S T + SS+I+ E++ QLM++LK NV+ Sbjct: 319 TVNSYTSDSSTSPNTVEGSDHKITGKEYTEVSSTITTEQYNQLMEILKGSNVS------G 372 Query: 641 EEHSVNQIYADAEGIISCRSNSICIFSPWIIDSGATDHICNDLNSFELSHKIDPIGVRLP 462 E H+V+ + I+S + + WI+DSGATDH+ N L+ ++ + I+PI ++LP Sbjct: 373 ETHAVDNLQQHPGNILSTQKEQKLPDNVWILDSGATDHVSNSLSCYDSYNTIEPIRIKLP 432 Query: 461 NGASIIAKLAGTVRITADLVLEDVLYLPEFNINLISIPKLTRGSHFSVRFVNENCIIQDP 282 NG +++GTV + L+DVLY+P+FN N+IS+ K+ + + + F C IQD Sbjct: 433 NGFITQTQISGTVAFSGRFFLQDVLYIPDFNYNIISVGKIVKNFNCKIVFDKLCCYIQDH 492 Query: 281 LQR-KIGSGKLKHGLYHLDYGGGKVVPVVSHVNATVDSSYTIPNSAIWHFRFGHASLAKI 105 + IG L+ LY L +V+ + D++ N +WH+R GH S + + Sbjct: 493 NNKLMIGPANLQCNLYILQRQNFSENKIVNSARTSSDNNMNNSNFDLWHYRLGHPSDSVL 552 Query: 104 DMLSKNFPAIHVNKSLVCDICHLARQRRLPFVVS 3 + FP + + LVCD CHLA+Q +L F +S Sbjct: 553 QQIKGQFPYVKYDHKLVCDYCHLAKQCKLSFPIS 586 >XP_012570493.1 PREDICTED: uncharacterized protein LOC101511025 [Cicer arietinum] Length = 948 Score = 266 bits (681), Expect = 8e-77 Identities = 161/434 (37%), Positives = 244/434 (56%), Gaps = 27/434 (6%) Frame = -1 Query: 1232 FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKGLNDNFAMVKS 1053 ++T L LW+ELEN+RP+P CTC ++C+C + R R+ DYV++FLKGLND ++ V+S Sbjct: 194 YYTRLKKLWQELENFRPLPSCTCAIKCSCALIPKIREYRDGDYVIQFLKGLNDQYSSVRS 253 Query: 1052 QILLMKDLPSLNTVLSMVLEHERQ---NGLEPIQDASQSLVNAADGKKHYGRGRGKWS-- 888 Q++LM LP++N V S++++ E Q N E A+ S N A+ + H G RG+ S Sbjct: 254 QVMLMDPLPNINKVFSLLVQQEHQIFPNSEEIPTIANVSNSNRANSRGH-GDTRGRSSGS 312 Query: 887 ----NKQCSHCGKAGHTIDVCYEIHGYPIGYKRNFKPSVNLA----ADASDTKSEMGDK- 735 ++ CSHC ++GHT+DVC+ HG+P +++N S N D D KS D Sbjct: 313 SGRPSRYCSHCHRSGHTVDVCFRKHGFPPHFRKNGNSSANNCVASDVDNDDQKSSCADDS 372 Query: 734 ---MASSSISQEEFKQLMDLLKKVNVAQPSPV---LKEEHSVNQIYADAEGIISCRSNSI 573 S + E+ K L+ LL + V H+ + + II+ +S Sbjct: 373 LGGFPHSGFTTEQKKALLALLHTSQTSSSHVVNHLSNSSHTGSFSFPHISNIINTQS--- 429 Query: 572 CIFSPWIIDSGATDHICNDLNSFELSHKIDPIGVRLPNGASIIAKLAGTVRITADLVLED 393 WI+D+GAT+H+C F+ +I P+ +RLPN +II +LAGTV ++ L D Sbjct: 430 -----WILDTGATNHVCYSDTLFQSLKRIKPVRIRLPNDCTIITELAGTVFFNSEFYLCD 484 Query: 392 VLYLPEFNINLISIPKLTRGSHFSVRFVNENCIIQDPL-QRKIGSGKLKHGLYHLDYGGG 216 VL++PEF+ NLISIP LT + ++ F NC IQD L Q++IG+ L HGLY L Sbjct: 485 VLFIPEFSCNLISIPCLTMSLNCNLIFNAHNCWIQDNLSQKRIGTADLIHGLYLL---SD 541 Query: 215 KVVPVVSHVNATVDSSYTI----PNSA-IWHFRFGHASLAKIDMLSKNFPAIHVNKSL-V 54 +P+V + A S P+ IWH GHAS ++ ++K+FP + ++KS+ Sbjct: 542 PCLPIVFTLVANCIFSCLTNVIGPHVCNIWHNLLGHASFDVLNHMNKSFPFVRISKSISP 601 Query: 53 CDICHLARQRRLPF 12 CD+C A+ +RLPF Sbjct: 602 CDVCFHAKHKRLPF 615 >KYP68194.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 914 Score = 264 bits (674), Expect = 5e-76 Identities = 150/435 (34%), Positives = 244/435 (56%), Gaps = 25/435 (5%) Frame = -1 Query: 1232 FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKGLNDNFAMVKS 1053 F+T L LW+EL+ P+P CTC +C+C A++ R ++ V+RFL+GLND ++ V+S Sbjct: 148 FYTALKTLWDELDVLNPLPVCTCNPKCSCGAIKKIEEERNKNQVVRFLRGLNDQYSGVRS 207 Query: 1052 QILLMKDLPSLNTVLSMVLEHERQNGLEPIQDASQSLV---NAADGKKHYGRGRG---KW 891 Q++L+ +LP++N V ++V + ERQ E +V +A+ K++ G G G ++ Sbjct: 208 QLMLLDNLPNVNRVFALVAQQERQFATENAYVPKALIVASDDASHSKRNQGNGNGWNNRY 267 Query: 890 SNKQCSHCGKAGHTIDVCYEIHGYPIGYKRNFKPSVNLAADAS--------------DTK 753 ++K+CS CGK GHTIDVCY HG+P G+K FK S N+ ++ D+ Sbjct: 268 TSKKCSWCGKMGHTIDVCYRKHGFPAGFK--FKNSKNIVPRSANMLMTEENETNCPEDSS 325 Query: 752 SEMGDKMASSSISQEEFKQLMDLLKKVNVAQPSPVLKEEHSVNQIYADAEGIISCRSNSI 573 + + + +++ L+ LL + + + + + + + +A + N Sbjct: 326 GKDTQESVRFGFTANQYQNLLALLPQQD--------QRDSTQHTAHVNAFTNSNPTRNGN 377 Query: 572 CIFSPWIIDSGATDHICNDLNSFELSHKIDPIGVRLPNGASIIAKLAGTVRITADLVLED 393 + S WI+D+GATDHI N L+ F I PI V LPNG + ++GT+ +++ VL D Sbjct: 378 VLISRWILDTGATDHISNSLSYFTAYKNIPPIKVSLPNGILVSDTISGTIHLSSSFVLTD 437 Query: 392 VLYLPEFNINLISIPKLTRGSHFSVRFVNENCIIQDPLQRK-IGSGKLKHGLY----HLD 228 VL+L NLIS+ KLT+ H + F+++ C+IQD K IG+ K + GLY ++ Sbjct: 438 VLFLLSLKFNLISVTKLTQTLHCKLTFLDDICLIQDSNACKMIGTAKAERGLYIFTQIVE 497 Query: 227 YGGGKVVPVVSHVNATVDSSYTIPNSAIWHFRFGHASLAKIDMLSKNFPAIHVNKSLVCD 48 V +S N + + ++PN +WH+R GH + LS+ FP I+ NK+ VCD Sbjct: 498 SSQSSVYSTISE-NFSCNFPSSLPN--LWHYRLGHPCFDRSQALSEMFPFIYCNKNHVCD 554 Query: 47 ICHLARQRRLPFVVS 3 ICH+A+QR+L F +S Sbjct: 555 ICHIAKQRKLSFPLS 569 >KYP42296.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 849 Score = 261 bits (667), Expect = 2e-75 Identities = 152/421 (36%), Positives = 233/421 (55%), Gaps = 14/421 (3%) Frame = -1 Query: 1232 FFTELSGLWEELENYRPMPECTCTVRCTCEAMRTARASREEDYVMRFLKGLNDNFAMVKS 1053 +FT+L +W+ELE+YRP P CTC V+C C A+ + +D +M+FL+GLND + V+S Sbjct: 65 YFTKLRMIWDELESYRPDPFCTCDVKCLCNALTEVMLRKMQDRIMQFLRGLNDQYHNVRS 124 Query: 1052 QILLMKDLPSLNTVLSMVLEHERQNGLEPIQDASQSLVNAADG---KKHYGRGRGKWSNK 882 IL+M LPS+ V S V++ ERQ I + +++NAA + +Y G S K Sbjct: 125 NILMMDPLPSIAKVFSYVVQQERQFTNSDIV-GNLNVINAASSLSSRTNYSWGTAGRSGK 183 Query: 881 QCSHCGKAGHTIDVCYEIHGYPIGYK--RNFKPSVNLAADASDTKSEMGDKMASSSISQE 708 C+HCG GHT++ CY+ HGYP G++ + +N + E DK+ + +Q+ Sbjct: 184 VCTHCGFNGHTVEECYKKHGYPPGHRLHKTSGAHINNTVRDENNPQEGNDKVKQENQNQD 243 Query: 707 ------EFKQLMDLLKKVNVAQPSPVLKEEHSVNQIYADAEGIISCRSNSICIFSPWIID 546 +++ LM LL++ N S + + I++ + SN WI+D Sbjct: 244 MRLTPQQYQTLMSLLQQHN-GGTSTNASHINQIGNIFSLTCNMNKNNSNF------WILD 296 Query: 545 SGATDHICNDLNSFELSHKIDPIGVRLPNGASIIAKLAGTVRITADLVLEDVLYLPEFNI 366 SGATDH+ + L+ + KI+P+ V LP G +IA +G V+ T LEDVL+LP F+ Sbjct: 297 SGATDHVTSSLHLYSTFKKINPVVVSLPTGQQVIATHSGVVKFTKYFYLEDVLFLPSFSF 356 Query: 365 NLISIPKLTRGSHFSVRFVNENCIIQDPL-QRKIGSGKLKHGLYHLDYGGGKVVPVVSHV 189 NLISI L F + F N++C+IQD QR IG+ + GLY L K ++V Sbjct: 357 NLISISNLVSSLKFQLIFSNDHCLIQDVTNQRMIGTVDVVDGLYKLKMPVAKSNTHSNNV 416 Query: 188 NATVDSSYTIPNSAI--WHFRFGHASLAKIDMLSKNFPAIHVNKSLVCDICHLARQRRLP 15 + V S ++ I WHFR GH S ++ +L + +P ++ +K+ VCD CH A+QR+L Sbjct: 417 SPNVKSVFSCNKIPIDLWHFRLGHPSHDRLQLLKQCYPTLYSDKNFVCDTCHKAKQRKLS 476 Query: 14 F 12 F Sbjct: 477 F 477