BLASTX nr result
ID: Astragalus23_contig00010648
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00010648
         (2435 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|PNX96991.1| retrotransposon-related protein [Trifolium pratense]  627   0.0
dbj|GAU22332.1| hypothetical protein TSUD_106600 [Trifolium subt...  617   0.0
dbj|GAU30142.1| hypothetical protein TSUD_360350 [Trifolium subt...  612   0.0
dbj|GAU33426.1| hypothetical protein TSUD_380620 [Trifolium subt...  600   0.0
dbj|GAU42392.1| hypothetical protein TSUD_296900 [Trifolium subt...  605   0.0
dbj|GAU44417.1| hypothetical protein TSUD_100640 [Trifolium subt...  595   0.0
gb|PNX95963.1| retrotransposon-related protein, partial [Trifoli...  592   0.0
dbj|GAU39052.1| hypothetical protein TSUD_396570 [Trifolium subt...  588   0.0
dbj|GAU34493.1| hypothetical protein TSUD_388050 [Trifolium subt...  586   0.0
gb|PNX98468.1| putative copia-type polyprotein, partial [Trifoli...  580   0.0
gb|PNY05002.1| putative copia-type polyprotein [Trifolium pratense]  562   0.0
dbj|GAU40816.1| hypothetical protein TSUD_398000 [Trifolium subt...  551   0.0
gb|PNX93875.1| copia-type polyprotein [Trifolium pratense]           568   0.0
gb|KYP31826.1| Retrovirus-related Pol polyprotein from transposo...  567   0.0
gb|KYP68287.1| Retrovirus-related Pol polyprotein from transposo...  565   0.0
dbj|GAU36721.1| hypothetical protein TSUD_318190 [Trifolium subt...  555   e-180
dbj|GAU36409.1| hypothetical protein TSUD_38770 [Trifolium subte...  555   e-178
gb|KYP48234.1| Retrovirus-related Pol polyprotein from transposo...
548 e-178 dbj|GAU26253.1| hypothetical protein TSUD_224440 [Trifolium subt... 555 e-177 gb|KYP46743.1| Retrovirus-related Pol polyprotein from transposo... 538 e-176 >gb|PNX96991.1| retrotransposon-related protein [Trifolium pratense] Length = 1333 Score = 627 bits (1616), Expect = 0.0 Identities = 315/599 (52%), Positives = 413/599 (68%), Gaps = 10/599 (1%) Frame = +1 Query: 667 MRVLFGAQDVLDLVNDGVTELAENATEVQRSAHRERRKKDQKALFHIHQCVDPKVFEKIA 846 M+V+F Q+V D VN + E NATE QR+ RE +KKD KALF IHQCVD KV EKIA Sbjct: 1 MKVIFTFQEVFDQVNAEIAEHPANATEEQRTTFREAKKKDNKALFLIHQCVDSKVLEKIA 60 Query: 847 ESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKYELLQMKPGEKVGDYVSRVVTVTNQMK 1026 ++ T K AWD L + YGGDTKVKKVKLQ+L+++YELL+MK EKV DY +R+VT+TNQMK Sbjct: 61 DAETSKAAWDILQKSYGGDTKVKKVKLQALKRQYELLEMKNDEKVADYFTRLVTLTNQMK 120 Query: 1027 VCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAKDTSAMNLDELQGNLEAHEMRISDRGP 1206 CG + EQ VEKVLR+LT +FDHIVV IE+ KD S + +++LQ LEAHEM+ +R Sbjct: 121 NCGNTLEEQEKVEKVLRTLTSKFDHIVVTIEETKDLSEVKIEDLQSTLEAHEMKHGERDH 180 Query: 1207 EKESEQAL-----KAQTNKKTYGDXXXXXXXXXXXXXXXXXEGGSTEKADTANKGGKKYS 1371 K+ EQ L K Q NKK + + +K +++N GG K Sbjct: 181 GKDDEQVLLAKFKKFQNNKKNW---------QKKKKEFKKDKDNDEDKPESSNGGGGKQK 231 Query: 1372 NQKGKEKFDKRKVQCFNCEKYGHFADECWSNKENQKQ----EANIAKS-DSDDDPVLLMV 1536 Q K+ DK +QC+NC K+GH+A++C ++K+N+ Q EAN+A++ DSDDD V MV Sbjct: 232 KQFKKKTTDKSHIQCYNCSKFGHYANQCTASKKNKTQQGDEEANVAENTDSDDDDVSFMV 291 Query: 1537 TTADDSVVVELWYMDTGCSNHMTGNRK*LTNFDSSKKTRIRLADDR*LQAEGMGNIIVEK 1716 T D+ WY DTGCSNHMTGNR LT+FD T+I+LAD + A+G+GN+++++ Sbjct: 292 TITDEIAGSMEWYFDTGCSNHMTGNRNILTDFDKCVNTKIKLADSNSIDAKGIGNVVIQR 351 Query: 1717 KDGKSVVIEEVWYVPEMKSNLMSVGQLIEKGFSVTMEDGLMQLYGKKKRQILKSHLEKNR 1896 K+GK VIE V YVP MK NLMSVGQL++KGF V +EDG +QL K+ ILKS KNR Sbjct: 352 KNGKKCVIENVLYVPSMKCNLMSVGQLLDKGFKVYLEDGALQLLDSKRNLILKSAQSKNR 411 Query: 1897 TFKVSIKAAETHCFAATEGERDIALWHKRYGHLNFRSLGHLHTKNLVVGVPAVAAVDKTC 2076 TFK ++A E C AT D LWHKRYGHLNF+SL L++KN+V+G+P+V A +TC Sbjct: 412 
TFKTQLRAIEYECLTATTKSNDSELWHKRYGHLNFKSLSKLNSKNMVLGLPSVIAPVETC 471 Query: 2077 EICMRSKQPKLPFTSELPPRATHALGVVHSDVCGPLEVPTLGGNKYFVSFVDEYTRMIWI 2256 C+ KQP+ F + LP R++ L +VHSD+CGP++V + GGNKYF++FVDE++RM W+ Sbjct: 472 TTCLLGKQPRDSFKNYLPMRSSDVLNIVHSDICGPIDVLSTGGNKYFITFVDEFSRMTWL 531 Query: 2257 YLIKAKSEVLDQFRKFKNYAEKQSGCQLKILRTDSGGEYNSTEFKRYCEDQGIEHEVTA 2433 Y IKAKS+ D F+KFK EKQSG +K+LRTD GGEY STEF+ +C +QGI HEVTA Sbjct: 532 YHIKAKSDAFDVFKKFKALVEKQSGKSIKVLRTDGGGEYTSTEFENFCTEQGIIHEVTA 590 >dbj|GAU22332.1| hypothetical protein TSUD_106600 [Trifolium subterraneum] Length = 1171 Score = 617 bits (1592), Expect = 0.0 Identities = 312/619 (50%), Positives = 417/619 (67%), Gaps = 7/619 (1%) Frame = +1 Query: 598 GNGGFNTHLHVFDGSNWNRWSIQMRVLFGAQDVLDLVNDGVTELAENATEVQRSAHRERR 777 G F+ +L + DG NW+ W QM+V+F Q+ + VN + L NATE QR+ RE + Sbjct: 3 GKSNFHANLPILDGKNWDTWVKQMKVIFIVQEADEQVNTILDPLPANATEQQRTTFREAQ 62 Query: 778 KKDQKALFHIHQCVDPKVFEKIAESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKYELL 957 KKD KALF IHQCVD KVFEKIA++ T K AWD L + YGGD KVKKVKLQ+L++++ELL Sbjct: 63 KKDSKALFLIHQCVDSKVFEKIADAITSKDAWDILQKSYGGDAKVKKVKLQALKRQFELL 122 Query: 958 QMKPGEKVGDYVSRVVTVTNQMKVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAKDTS 1137 +MK E V +Y +RV T+TNQMK CG ++E+ +VEKVLR+LT +FDHIVV IEQ +D S Sbjct: 123 EMKNDEAVAEYFTRVETLTNQMKNCGSTLSEEEMVEKVLRTLTHKFDHIVVTIEQTRDLS 182 Query: 1138 AMNLDELQGNLEAHEMRISDRGPEKESEQALKAQTNKKTYGDXXXXXXXXXXXXXXXXXE 1317 + +++LQ LEAHE++ +R +KE EQAL + K Y D + Sbjct: 183 EIKMEDLQSTLEAHELKHGERNHDKEDEQALFVKF--KRYQDEKKKWQNK---------K 231 Query: 1318 GGSTEKADTANK--GGKKYSNQKGKEKFDKRKVQCFNCEKYGHFADECWSNK----ENQK 1479 G K NK KK QK K+ DK +QC+NC KYGH+A EC + K +N + Sbjct: 232 GSKKGKESVENKPESSKKEGGQKTKK--DKSTIQCYNCNKYGHYASECKAPKKKKSQNTE 289 Query: 1480 QEANIAKSDS-DDDPVLLMVTTADDSVVVELWYMDTGCSNHMTGNRK*LTNFDSSKKTRI 1656 +EAN+A+ S +D V MVT D++ +WY DTGCSNHMTGN LT+F+ TRI Sbjct: 290 EEANVAQDGSTSEDDVSFMVTITDETTESMVWYFDTGCSNHMTGNTSILTDFNKCLNTRI 349 Query: 1657 RLADDR*LQAEGMGNIIVEKKDGKSVVIEEVWYVPEMKSNLMSVGQLIEKGFSVTMEDGL 1836 +LA+D + 
AE MGN+++++ +GK VIE+V YVP MK NLMSVGQL+EKGF E Sbjct: 350 KLANDNFIAAECMGNVVIQRSNGKKAVIEKVLYVPGMKCNLMSVGQLLEKGFKAVFEGET 409 Query: 1837 MQLYGKKKRQILKSHLEKNRTFKVSIKAAETHCFAATEGERDIALWHKRYGHLNFRSLGH 2016 ++L+ K+R ILK+ +NRTFK +K E C A + ++D LWH+RYGHLNF+SL Sbjct: 410 LKLFDSKQRLILKTAQSQNRTFKTQVKTIEVECLATSTEDKDGDLWHRRYGHLNFKSLSM 469 Query: 2017 LHTKNLVVGVPAVAAVDKTCEICMRSKQPKLPFTSELPPRATHALGVVHSDVCGPLEVPT 2196 L++KN+V+G+P+V A TC C+ K P+ PF S LP R++ L VVHSD+CGP++V + Sbjct: 470 LNSKNMVLGLPSVIAPVDTCTTCLLGKHPRSPFKSNLPMRSSEVLNVVHSDICGPIDVLS 529 Query: 2197 LGGNKYFVSFVDEYTRMIWIYLIKAKSEVLDQFRKFKNYAEKQSGCQLKILRTDSGGEYN 2376 GGNKYF++FVDEY+R+IW+Y IKAKSE + F++FK EKQS +K+LRTD GGEY Sbjct: 530 TGGNKYFITFVDEYSRLIWLYHIKAKSEAFEVFKRFKTLVEKQSDKSIKVLRTDGGGEYT 589 Query: 2377 STEFKRYCEDQGIEHEVTA 2433 S EF+ YC+DQGI HEVTA Sbjct: 590 SKEFENYCKDQGIIHEVTA 608 >dbj|GAU30142.1| hypothetical protein TSUD_360350 [Trifolium subterraneum] Length = 1242 Score = 612 bits (1578), Expect = 0.0 Identities = 311/616 (50%), Positives = 414/616 (67%), Gaps = 7/616 (1%) Frame = +1 Query: 598 GNGGFNTHLHVFDGSNWNRWSIQMRVLFGAQDVLDLVNDGVTELAENATEVQRSAHRERR 777 G F+ +L +FDG NW+ W QM+V+F Q+ VN V L NA E QR+ RE + Sbjct: 3 GKSNFHANLPIFDGKNWDTWVKQMKVIFIVQEADQQVNTVVDPLPANAIEQQRTTFREAQ 62 Query: 778 KKDQKALFHIHQCVDPKVFEKIAESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKYELL 957 KKD KALF IHQCVD +VFEKIA++TT K AWD L + YGGD KVKKVKLQ+L++++ELL Sbjct: 63 KKDSKALFLIHQCVDSQVFEKIADATTSKDAWDILQKSYGGDAKVKKVKLQALKRQFELL 122 Query: 958 QMKPGEKVGDYVSRVVTVTNQMKVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAKDTS 1137 +MK E V +Y +RV T+TNQMK CG ++E+ +VEKVLR+LT +FDHIVV IEQ KD S Sbjct: 123 EMKNDEAVAEYFTRVETLTNQMKNCGSTLSEKEMVEKVLRTLTHKFDHIVVTIEQTKDLS 182 Query: 1138 AMNLDELQGNLEAHEMRISDRGPEKESEQALKAQTNKKTYGDXXXXXXXXXXXXXXXXXE 1317 + +++LQ LEAHE++ +R KE EQAL + K Y D Sbjct: 183 EIKMEDLQSTLEAHELKHGERNHGKEDEQALFVKFKK--YQD------------------ 222 Query: 1318 GGSTEKADTANKGGKKYSNQKG--KEKFDKRKVQCFNCEKYGHFADECWSNK----ENQK 1479 EK KK+ N+KG K K DK +QC+NC KYGH+A EC + K +N + Sbjct: 223 
----EK--------KKWQNKKGGQKTKKDKSTIQCYNCNKYGHYASECKAPKKKKSQNTE 270 Query: 1480 QEANIAKSDS-DDDPVLLMVTTADDSVVVELWYMDTGCSNHMTGNRK*LTNFDSSKKTRI 1656 +EAN+A+ S +D V MVT D++ +WY DTGCSNHMTGN+ LT+F+ TRI Sbjct: 271 EEANVAQDGSTSEDDVSFMVTITDETAESMVWYFDTGCSNHMTGNKSILTDFNKCLNTRI 330 Query: 1657 RLADDR*LQAEGMGNIIVEKKDGKSVVIEEVWYVPEMKSNLMSVGQLIEKGFSVTMEDGL 1836 +LA+ + AEGMGN+++++++GK VIE+V YV MK NLMSVGQL+EKGF E Sbjct: 331 KLANGNFIAAEGMGNVVIQRRNGKKAVIEKVLYVSGMKCNLMSVGQLLEKGFKAVFEGET 390 Query: 1837 MQLYGKKKRQILKSHLEKNRTFKVSIKAAETHCFAATEGERDIALWHKRYGHLNFRSLGH 2016 ++L+ K+R ILK+ +NRTFK +K E C A + ++D LWHKRYGHLNF+SL Sbjct: 391 LKLFDSKQRLILKTAQSQNRTFKTQVKTIEVECLATSTEDKDSDLWHKRYGHLNFKSLSM 450 Query: 2017 LHTKNLVVGVPAVAAVDKTCEICMRSKQPKLPFTSELPPRATHALGVVHSDVCGPLEVPT 2196 L++KN+V+G+P+V A TC C+ K P+ F S LP R++ L VVHSD+CGP++V + Sbjct: 451 LNSKNMVLGLPSVIAPVDTCTTCLLGKHPRSSFKSNLPMRSSEVLNVVHSDICGPIDVLS 510 Query: 2197 LGGNKYFVSFVDEYTRMIWIYLIKAKSEVLDQFRKFKNYAEKQSGCQLKILRTDSGGEYN 2376 GGNKYF+++VDEY+RMIW+Y IKAKSE + F++FK EKQS Q+K+LRTD GGEY Sbjct: 511 TGGNKYFITYVDEYSRMIWLYHIKAKSEAFEVFKRFKTLVEKQSDKQIKVLRTDGGGEYT 570 Query: 2377 STEFKRYCEDQGIEHE 2424 S EF+ YC+DQGI HE Sbjct: 571 SKEFENYCKDQGIIHE 586 >dbj|GAU33426.1| hypothetical protein TSUD_380620 [Trifolium subterraneum] Length = 990 Score = 600 bits (1548), Expect = 0.0 Identities = 303/621 (48%), Positives = 409/621 (65%), Gaps = 10/621 (1%) Frame = +1 Query: 598 GNGGFNTHLHVFDGSNWNRWSIQMRVLFGAQDVLDLVNDGVTELAENATEVQRSAHRERR 777 G F+ +L + DG NW+ W QM+V+F Q+V + VN + L NATE QR+ +E + Sbjct: 3 GKSNFHANLPILDGKNWDTWVKQMKVIFIVQEVDEQVNTVLDPLPANATEQQRTTFKEAQ 62 Query: 778 KKDQKALFHIHQCVDPKVFEKIAESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKYELL 957 KKD K LF IHQC+D KVFEKIA++TT K AWD L + YGGD KVKKVKLQ+L++++ELL Sbjct: 63 KKDSKTLFLIHQCMDSKVFEKIADATTSKDAWDILQKIYGGDAKVKKVKLQALKRQFELL 122 Query: 958 QMKPGEKVGDYVSRVVTVTNQMKVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAKDTS 1137 +MK E V +Y +RV T+TNQMK CG ++E+ +VEKVLR+ T +FD+IVV IEQ KD S Sbjct: 123 
EMKNDEAVAEYFTRVETLTNQMKNCGSTLSEEEMVEKVLRTSTHKFDYIVVTIEQTKDLS 182 Query: 1138 AMNLDELQGNLEAHEMRISDRGPEKESEQAL-----KAQTNKKTYGDXXXXXXXXXXXXX 1302 + +++LQ LEAHE++ +R KE EQA K Q KK + + Sbjct: 183 EIKMEDLQSTLEAHELKHGERNHGKEDEQAQFVKFKKYQNEKKKWQNK------------ 230 Query: 1303 XXXXEGGSTEKADTANKGGKKYSNQKGKEKFDKRKVQCFNCEKYGHFADECWSNKENQKQ 1482 +G K +K K K DK +QC+NC KYGH+A EC + K+ + Q Sbjct: 231 ----KGSKKGKKSVEDKSESSKKEGGQKTKKDKSTIQCYNCNKYGHYASECKAPKKKKSQ 286 Query: 1483 ----EANIAKSDS-DDDPVLLMVTTADDSVVVELWYMDTGCSNHMTGNRK*LTNFDSSKK 1647 EAN+A+ S +D V MVT D++ +WY DTGCSNHMTGN+ LT+F+ Sbjct: 287 DTEEEANVAQDGSTSEDDVSFMVTITDETAESMVWYFDTGCSNHMTGNKSILTDFNKCLN 346 Query: 1648 TRIRLADDR*LQAEGMGNIIVEKKDGKSVVIEEVWYVPEMKSNLMSVGQLIEKGFSVTME 1827 TRI+LA+ + AEGMGN+++++ +GK VIE+V YVP MK NLMSVGQL+EKGF E Sbjct: 347 TRIKLANGNFIAAEGMGNVVIQRSNGKKTVIEKVLYVPGMKCNLMSVGQLLEKGFKAVFE 406 Query: 1828 DGLMQLYGKKKRQILKSHLEKNRTFKVSIKAAETHCFAATEGERDIALWHKRYGHLNFRS 2007 ++L+ K+R ILK+ +NRTFK +K E C A + ++D LWH RYGHLNF+S Sbjct: 407 GETLKLFDSKQRLILKTAQSQNRTFKTQVKTIEVECLATSTEDKDSDLWHIRYGHLNFKS 466 Query: 2008 LGHLHTKNLVVGVPAVAAVDKTCEICMRSKQPKLPFTSELPPRATHALGVVHSDVCGPLE 2187 L L++KN+V+G+ +V A TC C+ K P+ F S LP R++ L VVHSD+CGP++ Sbjct: 467 LSMLNSKNMVLGLSSVIAPVDTCTTCLLGKHPRSSFKSNLPMRSSEVLNVVHSDICGPID 526 Query: 2188 VPTLGGNKYFVSFVDEYTRMIWIYLIKAKSEVLDQFRKFKNYAEKQSGCQLKILRTDSGG 2367 V + GGNKYF++FV+EY+RMIW+Y IKAKSE + F++FK EKQS +K+LRTD GG Sbjct: 527 VLSTGGNKYFITFVNEYSRMIWLYHIKAKSEAFEVFKRFKTLVEKQSYKSIKVLRTDGGG 586 Query: 2368 EYNSTEFKRYCEDQGIEHEVT 2430 EY S EF+ YC+DQGI HEVT Sbjct: 587 EYTSKEFENYCKDQGIIHEVT 607 >dbj|GAU42392.1| hypothetical protein TSUD_296900 [Trifolium subterraneum] Length = 1224 Score = 605 bits (1559), Expect = 0.0 Identities = 305/622 (49%), Positives = 414/622 (66%), Gaps = 10/622 (1%) Frame = +1 Query: 598 GNGGFNTHLHVFDGSNWNRWSIQMRVLFGAQDVLDLVNDGVTELAENATEVQRSAHRERR 777 G F+ +L + DG NW+ W QM+V+F Q+ + VN + L NATE QR+ RE + Sbjct: 3 GKSNFHANLPILDGKNWDTWVKQMKVIFIVQEADEQVNTILDPLPANATEQQRTTFREAQ 62 Query: 
778 KKDQKALFHIHQCVDPKVFEKIAESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKYELL 957 KKD KALF IHQCVD KVFEKIA++TT K AWD L + YGGD KVKKVKLQ+L++++ELL Sbjct: 63 KKDSKALFLIHQCVDSKVFEKIADATTSKDAWDILQKRYGGDAKVKKVKLQALKRQFELL 122 Query: 958 QMKPGEKVGDYVSRVVTVTNQMKVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAKDTS 1137 +MK E V +Y +RV T+TNQMK CG ++E+ +VEKVLR+LT +FDHIVV IEQ +D S Sbjct: 123 EMKNDEAVAEYFTRVETLTNQMKNCGSTLSEEEMVEKVLRTLTHKFDHIVVTIEQTRDLS 182 Query: 1138 AMNLDELQGNLEAHEMRISDRGPEKESEQAL-----KAQTNKKTYGDXXXXXXXXXXXXX 1302 + +++LQ LEAH+++ +R KE EQAL K Q KK + + Sbjct: 183 EIKIEDLQNTLEAHKLKHGERNHGKEDEQALFVKFKKYQDEKKKWQNKKGSKK------- 235 Query: 1303 XXXXEGGSTEKADTANKGGKKYSNQKGKEKFDKRKVQCFNCEKYGHFADECWSNK----E 1470 E + + KK QK K+ DK +QC+NC KYGH+A EC + K + Sbjct: 236 -------GKESVEEKLESSKKEGGQKTKK--DKSTIQCYNCNKYGHYASECKAPKKKKSQ 286 Query: 1471 NQKQEANIAKSDS-DDDPVLLMVTTADDSVVVELWYMDTGCSNHMTGNRK*LTNFDSSKK 1647 N ++EAN+A+ S +D V M+T D++ +WY DTGCSNHMTGN+ LT+F++ Sbjct: 287 NTEEEANVAQDGSTSEDDVSFMITITDETAESMVWYFDTGCSNHMTGNKSILTDFNNCLN 346 Query: 1648 TRIRLADDR*LQAEGMGNIIVEKKDGKSVVIEEVWYVPEMKSNLMSVGQLIEKGFSVTME 1827 TRI+LA+ + AEGMGN+++++ +GK VIE+V YV MK NLMSVGQL+EKGF E Sbjct: 347 TRIKLANGNFIAAEGMGNVVIQRSNGKKAVIEKVLYVSGMKCNLMSVGQLLEKGFKAVFE 406 Query: 1828 DGLMQLYGKKKRQILKSHLEKNRTFKVSIKAAETHCFAATEGERDIALWHKRYGHLNFRS 2007 ++L+ K+R ILK +NRTFK +K E C A + ++D LW++RYGHLNF+S Sbjct: 407 GETLKLFDSKQRLILKIAQSQNRTFKTQVKTIEVECLATSTEDKDSDLWNRRYGHLNFKS 466 Query: 2008 LGHLHTKNLVVGVPAVAAVDKTCEICMRSKQPKLPFTSELPPRATHALGVVHSDVCGPLE 2187 L L++KN+V+G+P+V A TC C+ K P+ F S P R++ L VVHSD+CGP++ Sbjct: 467 LSMLNSKNMVLGLPSVIAPVDTCTTCLLGKHPRSSFKSNFPMRSSEVLNVVHSDICGPID 526 Query: 2188 VPTLGGNKYFVSFVDEYTRMIWIYLIKAKSEVLDQFRKFKNYAEKQSGCQLKILRTDSGG 2367 V + GGNKYF++FVDEY+ MIW+Y IKAKSE + F++FK EKQS +K+LRTD GG Sbjct: 527 VLSTGGNKYFITFVDEYSMMIWLYHIKAKSEAFEVFKRFKTLVEKQSDKSIKVLRTDGGG 586 Query: 2368 EYNSTEFKRYCEDQGIEHEVTA 2433 EY S EF+ YC+DQGI HEVTA Sbjct: 587 EYTSKEFENYCKDQGIIHEVTA 608 >dbj|GAU44417.1| hypothetical protein TSUD_100640 [Trifolium 
subterraneum] Length = 1318 Score = 595 bits (1535), Expect = 0.0 Identities = 296/614 (48%), Positives = 410/614 (66%), Gaps = 6/614 (0%) Frame = +1 Query: 610 FNTHLHVFDGSNWNRWSIQMRVLFGAQDVLDLVNDGVTELAENATEVQRSAHRERRKKDQ 789 F +L VF G N++RW QM+V+F QDVL+ V +GV ELA NA E R+ H E +KKD Sbjct: 11 FPANLPVFKGENYDRWCAQMKVIFRFQDVLETVINGVAELAANAEEAARTHHHELKKKDA 70 Query: 790 KALFHIHQCVDPKVFEKIAESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKYELLQMKP 969 KALF IHQCVDP +FEKI E T K AWDTL YGGD K+K +KLQ+LR++YE++QM Sbjct: 71 KALFIIHQCVDPNIFEKIIEEETSKGAWDTLKNTYGGDEKLKGIKLQALRRQYEMMQMNE 130 Query: 970 GEKVGDYVSRVVTVTNQMKVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAKDTSAMNL 1149 E + +Y++R++++TN MK CGE +++++ +EKVLR+LT +FDHIVVAIE++KD + M + Sbjct: 131 QETIAEYLARMLSLTNLMKACGEALSDRSKIEKVLRTLTEKFDHIVVAIEESKDLATMKI 190 Query: 1150 DELQGNLEAHEMRISDRGPEKESEQALKAQTNKKTYGDXXXXXXXXXXXXXXXXXEGGST 1329 +ELQ +LEAHE+R+ R K EQAL+A+ K Y E S Sbjct: 191 EELQASLEAHELRVKQRSSNKAVEQALQAKIQNKNY------KGKDKWKKKKEEPENSSK 244 Query: 1330 EKADTANKGGKKYSNQKG-KEKFDKRKVQCFNCEKYGHFADECWSNK-----ENQKQEAN 1491 A K N+K K+K DK+ +QC+NC+ YGH+A EC S K +++ Q AN Sbjct: 245 NSKTQAVGSIKGNQNKKNPKKKIDKKDIQCYNCQNYGHYARECNSKKVERGDKDEAQFAN 304 Query: 1492 IAKSDSDDDPVLLMVTTADDSVVVELWYMDTGCSNHMTGNRK*LTNFDSSKKTRIRLADD 1671 SDS+D LLM T + LWY+DTGCSNHMTGN+K D S + I+ AD+ Sbjct: 305 GGGSDSNDS--LLMAITNSEVDKSNLWYLDTGCSNHMTGNKKWFLKLDHSVRRSIKFADN 362 Query: 1672 R*LQAEGMGNIIVEKKDGKSVVIEEVWYVPEMKSNLMSVGQLIEKGFSVTMEDGLMQLYG 1851 + GMG ++V++KDG VI EV YVP M SNL+S+GQL+EK +++ +E+ +++Y Sbjct: 363 SQVIYAGMGTVLVKRKDGHESVINEVLYVPSMTSNLISLGQLLEKDYTMKLENRELKIYD 422 Query: 1852 KKKRQILKSHLEKNRTFKVSIKAAETHCFAATEGERDIALWHKRYGHLNFRSLGHLHTKN 2031 K R ILK+ L NRTFK+ I + HC A+ + +WH R+GHLNF+SL L++K+ Sbjct: 423 AKSRLILKAPLSNNRTFKIEINVIDHHCLASITNPEENWIWHHRFGHLNFKSLSMLNSKD 482 Query: 2032 LVVGVPAVAAVDKTCEICMRSKQPKLPFTSELPPRATHALGVVHSDVCGPLEVPTLGGNK 2211 +V G+P + + CE C +KQ + F +ELP ++TH L +V+SDVCGP EV ++GGN Sbjct: 483 MVHGLPQIKTPSEVCEDCCAAKQTRNSFKNELPMKSTHKLEMVYSDVCGPFEVKSIGGNN 542 Query: 2212 
YFVSFVDEYTRMIWIYLIKAKSEVLDQFRKFKNYAEKQSGCQLKILRTDSGGEYNSTEFK 2391 YF++F+D+YTR +W+YLI+ KSEV +F+KFK+ EKQSGC LK LRTD GGEY S EF Sbjct: 543 YFLTFIDDYTRHVWLYLIEKKSEVFTKFKKFKSLVEKQSGCDLKKLRTDGGGEYTSLEFA 602 Query: 2392 RYCEDQGIEHEVTA 2433 ++CED+GI HE+TA Sbjct: 603 KFCEDEGIVHEITA 616 >gb|PNX95963.1| retrotransposon-related protein, partial [Trifolium pratense] Length = 1290 Score = 592 bits (1525), Expect = 0.0 Identities = 299/617 (48%), Positives = 413/617 (66%), Gaps = 2/617 (0%) Frame = +1 Query: 589 MNGGNGGFNTHLHVFDGSNWNRWSIQMRVLFGAQDVLDLVNDGVTELAENATEVQRSAHR 768 MN F+ +L + DG NW++W ++M V+F QDV DL+ G LA NATE Q++A R Sbjct: 1 MNNTGNAFSANLPILDGKNWDQWFVKMNVIFSYQDVEDLITTGYEPLAANATEAQQTAFR 60 Query: 769 ERRKKDQKALFHIHQCVDPKVFEKIAESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKY 948 E +KKD KALF IHQCVD F+KIA + T K AWD L +GG KVKKVKLQSLR++Y Sbjct: 61 ETKKKDSKALFLIHQCVDSSNFDKIAGAKTAKAAWDILSNAHGGGEKVKKVKLQSLRRQY 120 Query: 949 ELLQMKPGEKVGDYVSRVVTVTNQMKVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAK 1128 EL+ M E +G+Y +R+ T+ N MK GE+V++Q ++EKVLR+L QFDHIVVAIE++K Sbjct: 121 ELVGMMDKESIGEYFTRLQTLVNSMKNYGEIVSDQQVIEKVLRTLNPQFDHIVVAIEESK 180 Query: 1129 DTSAMNLDELQGNLEAHEMRISDRGPEKESEQALKAQTNKKTYGDXXXXXXXXXXXXXXX 1308 D S M+++ELQ +LEAHE R+++R +K++ KA ++ Y Sbjct: 181 DLSTMSVNELQSSLEAHEQRLNERKEKKDN----KANQDQALY----------------- 219 Query: 1309 XXEGGSTEKADTANKGGKKYSNQKGKEKFDKRKVQCFNCEKYGHFADECWSN-KENQKQE 1485 A G + G +K DK K+QC+NCEK+GH+A EC S K+ Q E Sbjct: 220 ------------AKNGDSFIKGKNGGKKGDKSKIQCYNCEKWGHYASECRSKGKKKQDNE 267 Query: 1486 ANIAK-SDSDDDPVLLMVTTADDSVVVELWYMDTGCSNHMTGNRK*LTNFDSSKKTRIRL 1662 AN A+ +DSD D VL+MVT+ ++ +LWY+DTGCSNHMTG+R L FD + K++++ Sbjct: 268 ANHARHNDSDSDGVLMMVTSNSENDNSKLWYLDTGCSNHMTGHRDWLLGFDENFKSKVKF 327 Query: 1663 ADDR*LQAEGMGNIIVEKKDGKSVVIEEVWYVPEMKSNLMSVGQLIEKGFSVTMEDGLMQ 1842 ADD ++ EG ++V++K+G + +V YVP MK NL+S+GQL+EKGF+ + +D +Q Sbjct: 328 ADDSTIKVEGKDKVMVQRKNGNHTFVTDVLYVPSMKHNLLSLGQLLEKGFNYSTKDHCIQ 387 Query: 1843 LYGKKKRQILKSHLEKNRTFKVSIKAAETHCFAATEGERDIALWHKRYGHLNFRSLGHLH 2022 ++ + 
I+K+ L KNRTFKV+++A+ CF++ E + LWH RYGHLNF+SL HL Sbjct: 388 VFDPHNKLIMKAPLSKNRTFKVNLQASTFQCFSSLITEDEKWLWHYRYGHLNFKSLNHLC 447 Query: 2023 TKNLVVGVPAVAAVDKTCEICMRSKQPKLPFTSELPPRATHALGVVHSDVCGPLEVPTLG 2202 K +VVG+P + +K CE C SKQ + F S + R+ +L VVHSDVCGP+EVPTLG Sbjct: 448 NKKMVVGLPLIHTPEKLCEGCFESKQSRNSFKSLVYSRSKQSLDVVHSDVCGPIEVPTLG 507 Query: 2203 GNKYFVSFVDEYTRMIWIYLIKAKSEVLDQFRKFKNYAEKQSGCQLKILRTDSGGEYNST 2382 G++YF++FVDE+TR IWIYL+K KSEV F+ F AE+QS +LK+LRTD GGEYNS Sbjct: 508 GSRYFMTFVDEFTRKIWIYLLKEKSEVFAMFKNFCALAERQSEHKLKVLRTDGGGEYNSK 567 Query: 2383 EFKRYCEDQGIEHEVTA 2433 EF+ YC +GI HEVTA Sbjct: 568 EFQAYCTQRGIIHEVTA 584 >dbj|GAU39052.1| hypothetical protein TSUD_396570 [Trifolium subterraneum] Length = 1309 Score = 588 bits (1516), Expect = 0.0 Identities = 298/622 (47%), Positives = 413/622 (66%), Gaps = 7/622 (1%) Frame = +1 Query: 589 MNGGNGGFNTHLHVFDGSNWNRWSIQMRVLFGAQDVLDLVNDGVTELAENATEVQRSAHR 768 MN N ++L + D NW++W I+M V+FG QDV DLV +G LA +ATE Q++ R Sbjct: 1 MNSSNSSLPSNLPILDSKNWDQWCIRMNVIFGFQDVEDLVKNGYNALAADATEAQQTTFR 60 Query: 769 ERRKKDQKALFHIHQCVDPKVFEKIAESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKY 948 E +KKD KALF IHQCVD FEKI+ +T+ K AWDTL +GG KVKKV+LQSLR++Y Sbjct: 61 EVKKKDCKALFLIHQCVDSANFEKISSATSSKQAWDTLNNAHGGGDKVKKVRLQSLRRQY 120 Query: 949 ELLQMKPGEKVGDYVSRVVTVTNQMKVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAK 1128 ELL M E +GD+ +R+ T+ N MK G+ +++Q ++EKVLR+L QFDHIVVAIE++K Sbjct: 121 ELLGMMDKESIGDFFTRLQTLVNTMKNLGDQISDQQVIEKVLRTLNSQFDHIVVAIEESK 180 Query: 1129 DTSAMNLDELQGNLEAHEMRISDRGPEK-ESEQALKAQTNKKTYGDXXXXXXXXXXXXXX 1305 D S M+L+ELQ +LEAHE R+ +R K + EQAL A+ K Sbjct: 181 DLSTMSLNELQSSLEAHEQRLKERKESKNQQEQALYARNGSKN--------GKGKGKWKN 232 Query: 1306 XXXEGGSTEKADTANKGGKKYSNQ-----KGKEKFDKRKVQCFNCEKYGHFADECWSNKE 1470 +G S D + G + ++ K + K DKRK++CF+C K+GH+A EC + + Sbjct: 233 EKYKGKSESSYDQDHDNGDQSQSESSMKNKNQGKKDKRKIKCFSCNKWGHYASECQNKGK 292 Query: 1471 NQKQEANIA-KSDSDDDPVLLMVTTADDSVVVELWYMDTGCSNHMTGNRK*LTNFDSSKK 1647 +++EAN A ++D+D D VLLMVTT LWY+DTGC+NHMTG+R L D + K Sbjct: 293 
KKQEEANHAGQNDTDSDGVLLMVTTNTSEDQSTLWYLDTGCTNHMTGHRDWLLELDETFK 352 Query: 1648 TRIRLADDR*LQAEGMGNIIVEKKDGKSVVIEEVWYVPEMKSNLMSVGQLIEKGFSVTME 1827 + ++ AD+ + EG G ++V +K+G + +V YVP MK NL+S+GQLIEKGF+++ + Sbjct: 353 STVKFADNSTISVEGKGKVMVTRKNGNHTFVTDVLYVPTMKHNLLSLGQLIEKGFALSTK 412 Query: 1828 DGLMQLYGKKKRQILKSHLEKNRTFKVSIKAAETHCFAATEGERDIALWHKRYGHLNFRS 2007 D ++++ K+ +LK+ L KNRTFKV+++A+E CF++ E LWH RYGHLNF+S Sbjct: 413 DKFLEVHDPYKKLVLKAPLSKNRTFKVNLQASEVQCFSSLITEDKKRLWHYRYGHLNFKS 472 Query: 2008 LGHLHTKNLVVGVPAVAAVDKTCEICMRSKQPKLPFTSELPPRATHALGVVHSDVCGPLE 2187 L L +K++VVG+P + DK CE C SKQP+ F + + R T L VVHSD+CGP+E Sbjct: 473 LNQLSSKHMVVGLPLIHTPDKLCEGCFASKQPRNSFKNTVYYRLTQPLHVVHSDICGPIE 532 Query: 2188 VPTLGGNKYFVSFVDEYTRMIWIYLIKAKSEVLDQFRKFKNYAEKQSGCQLKILRTDSGG 2367 TLGGN+YF++ VDE+TR +WIYL+K KSE F+KF AE+Q LK LRTD GG Sbjct: 533 TATLGGNRYFMTCVDEFTRKVWIYLLKEKSEAFSAFKKFCATAERQCENHLKTLRTDGGG 592 Query: 2368 EYNSTEFKRYCEDQGIEHEVTA 2433 EYNS EFK +CE++GI HEVTA Sbjct: 593 EYNSNEFKTFCEEKGITHEVTA 614 >dbj|GAU34493.1| hypothetical protein TSUD_388050 [Trifolium subterraneum] Length = 1412 Score = 586 bits (1511), Expect = 0.0 Identities = 298/598 (49%), Positives = 403/598 (67%), Gaps = 10/598 (1%) Frame = +1 Query: 667 MRVLFGAQDVLDLVNDGVTELAENATEVQRSAHRERRKKDQKALFHIHQCVDPKVFEKIA 846 M+V+F Q+ + VN + L NATE QR+ RE +KKD KALF IHQCVD KVFEKIA Sbjct: 1 MKVIFIVQEADEQVNTFLDPLPANATEQQRTTFREVQKKDSKALFLIHQCVDSKVFEKIA 60 Query: 847 ESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKYELLQMKPGEKVGDYVSRVVTVTNQMK 1026 ++TT K WD L + YGGD KVKKVKLQ+L++++ELL+MK E V +Y +RV T+TNQMK Sbjct: 61 DATTSKDVWDILQKSYGGDAKVKKVKLQALKRQFELLEMKNDEAVAEYFTRVETLTNQMK 120 Query: 1027 VCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAKDTSAMNLDELQGNLEAHEMRISDRGP 1206 CG ++++ +VEKVLR+LT +FDHIV IEQ KD S + +++LQ LEAHE++ +R Sbjct: 121 NCGSTLSKEEMVEKVLRTLTHKFDHIVETIEQTKDLSEIKMEDLQSTLEAHELKHGERNH 180 Query: 1207 EKESEQAL-----KAQTNKKTYGDXXXXXXXXXXXXXXXXXEGGSTEKADTANKGGKKYS 1371 KE EQAL K Q KK + + + +K +++ K G K Sbjct: 181 
GKEDEQALFVKFKKYQDEKKKWQNKKGSKKG----------KESVEDKPESSKKEGGK-- 228 Query: 1372 NQKGKEKFDKRKVQCFNCEKYGHFADECWSNK----ENQKQEANIAKSDS-DDDPVLLMV 1536 K K DK +QC+NC KYGH+A EC + K +N ++EANIA+ S +D V +V Sbjct: 229 ----KTKKDKSTIQCYNCNKYGHYASECKAPKKKKSQNTEEEANIAQDGSTSEDDVSFIV 284 Query: 1537 TTADDSVVVELWYMDTGCSNHMTGNRK*LTNFDSSKKTRIRLADDR*LQAEGMGNIIVEK 1716 T D++ +WY DTGCSNHMTGN+ LT+F+ TRI+L + + AEGMGN+++++ Sbjct: 285 TITDETAESMVWYFDTGCSNHMTGNKSILTDFNKCLNTRIKLTNGNFIAAEGMGNVVIQR 344 Query: 1717 KDGKSVVIEEVWYVPEMKSNLMSVGQLIEKGFSVTMEDGLMQLYGKKKRQILKSHLEKNR 1896 +GK VIE+V YVP +K NLMSVGQL+EKGF E ++L+ K+R ILK+ +NR Sbjct: 345 SNGKKAVIEKVLYVPGIKCNLMSVGQLLEKGFKAVFEGETLKLFDSKQRLILKTAQSQNR 404 Query: 1897 TFKVSIKAAETHCFAATEGERDIALWHKRYGHLNFRSLGHLHTKNLVVGVPAVAAVDKTC 2076 TFK +K E C A + ++D LWH+RYGHLNF+SL L++KN+V+G+P+V A TC Sbjct: 405 TFKTQVKTIEVECLATSTEDKDSDLWHRRYGHLNFKSLSMLNSKNMVLGLPSVIASVDTC 464 Query: 2077 EICMRSKQPKLPFTSELPPRATHALGVVHSDVCGPLEVPTLGGNKYFVSFVDEYTRMIWI 2256 C+ K P+ F S LP R++ L VVHSD+CGP++V + GGNKYF++FVDEY+RMIW+ Sbjct: 465 TTCLLGKHPRSSFKSNLPMRSSEVLNVVHSDICGPIDVLSTGGNKYFITFVDEYSRMIWL 524 Query: 2257 YLIKAKSEVLDQFRKFKNYAEKQSGCQLKILRTDSGGEYNSTEFKRYCEDQGIEHEVT 2430 Y IKAKSE + F++FK EKQS Q+K+LRTD GGEY S EF+ YC+DQGI HEVT Sbjct: 525 YHIKAKSEAFEVFKRFKTLVEKQSDKQIKVLRTDGGGEYTSKEFENYCKDQGIIHEVT 582 >gb|PNX98468.1| putative copia-type polyprotein, partial [Trifolium pratense] Length = 1267 Score = 580 bits (1496), Expect = 0.0 Identities = 290/614 (47%), Positives = 403/614 (65%), Gaps = 6/614 (0%) Frame = +1 Query: 610 FNTHLHVFDGSNWNRWSIQMRVLFGAQDVLDLVNDGVTELAENATEVQRSAHRERRKKDQ 789 F +L VF G N++RW QMRV+ QD L++V DGV ELAE+A + R+ H+E ++KD Sbjct: 11 FPANLPVFKGENYDRWCAQMRVILRFQDCLEIVTDGVGELAEDADDEARTLHKETKRKDA 70 Query: 790 KALFHIHQCVDPKVFEKIAESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKYELLQMKP 969 K+LF IHQCVDP VFEKI E T K AWD L YGGD K+K +KLQ+LR++YE +QM+ Sbjct: 71 KSLFIIHQCVDPNVFEKIIEEETSKGAWDKLKDYYGGDEKLKGIKLQALRRQYETMQMEE 130 Query: 970 GEKVGDYVSRVVTVTNQMKVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAKDTSAMNL 
1149 E +G+Y+SR++++TN MK CGE + ++ ++KVLR+LT +FDHIVVAIE++KD S M + Sbjct: 131 KESIGEYMSRLLSLTNLMKSCGEALEVKSKIQKVLRTLTEKFDHIVVAIEESKDLSTMKI 190 Query: 1150 DELQGNLEAHEMRISDRGPEKESEQALKAQTNKKTYGDXXXXXXXXXXXXXXXXXEGGST 1329 +ELQ +LEAHE+R+ R K EQAL+A+ K + E S Sbjct: 191 EELQASLEAHELRVKQRSSSKAVEQALQAKVQNKNH------KGKDKCKKKKDDSESSSK 244 Query: 1330 EKADTANKGGKKYSNQKG-KEKFDKRKVQCFNCEKYGHFADECWSNK-----ENQKQEAN 1491 + A + K N+K K+K DK+ VQC+NC+K+GH+A EC S K +++ Q AN Sbjct: 245 NSKNQAGESSKGNQNKKNFKKKVDKKDVQCYNCQKHGHYARECHSKKVDRDDKDEAQFAN 304 Query: 1492 IAKSDSDDDPVLLMVTTADDSVVVELWYMDTGCSNHMTGNRK*LTNFDSSKKTRIRLADD 1671 S+SDD LLM T D+ LWY+DTGCSNHMTGN+K D S + I+ AD+ Sbjct: 305 GGGSESDDS--LLMAITNSDADKSNLWYLDTGCSNHMTGNKKWFLKLDDSVRRSIKFADN 362 Query: 1672 R*LQAEGMGNIIVEKKDGKSVVIEEVWYVPEMKSNLMSVGQLIEKGFSVTMEDGLMQLYG 1851 +++ GMG + V +KDG VI EV YVP M SNL+S+GQL+EKG+ ++M + +++Y Sbjct: 363 SQIESAGMGTVSVMRKDGHESVINEVLYVPSMTSNLISLGQLLEKGYEMSMANRELKIYD 422 Query: 1852 KKKRQILKSHLEKNRTFKVSIKAAETHCFAATEGERDIALWHKRYGHLNFRSLGHLHTKN 2031 K R ILK+ L NRTFK I + HC + + WH RYGHLNF+SL L +K+ Sbjct: 423 AKSRLILKAPLSNNRTFKAEINVIDHHCLSLITNSEENWKWHHRYGHLNFKSLSMLQSKD 482 Query: 2032 LVVGVPAVAAVDKTCEICMRSKQPKLPFTSELPPRATHALGVVHSDVCGPLEVPTLGGNK 2211 +V G+P + A + C C +KQ + F S+L ++ L +V+SDVCGP E ++GGN Sbjct: 483 MVHGLPQIKAPSEVCGECCAAKQARNSFKSDLHMKSAQKLEMVYSDVCGPFEEKSIGGNN 542 Query: 2212 YFVSFVDEYTRMIWIYLIKAKSEVLDQFRKFKNYAEKQSGCQLKILRTDSGGEYNSTEFK 2391 YF++F+D+YTR +W+YLI+ KS+V +F+KFK EK SGC +K LRTD GGEY S EF Sbjct: 543 YFLTFIDDYTRHVWLYLIEKKSDVFTKFKKFKTLVEKHSGCSVKKLRTDGGGEYTSHEFA 602 Query: 2392 RYCEDQGIEHEVTA 2433 ++CED+GI HE+TA Sbjct: 603 KFCEDEGIVHEITA 616 >gb|PNY05002.1| putative copia-type polyprotein [Trifolium pratense] Length = 762 Score = 562 bits (1448), Expect = 0.0 Identities = 280/624 (44%), Positives = 398/624 (63%), Gaps = 13/624 (2%) Frame = +1 Query: 601 NGGFNTHLHVFDGSNWNRWSIQMRVLFGAQDVLDLVNDGVTELAENATEVQRSAHRERRK 780 N GF HL +FDG N+++W +M+V+F QDV+++VNDGV L N + Q + H+E +K Sbjct: 6 
NNGFPAHLPIFDGKNYDQWIAKMKVIFRLQDVVEIVNDGVAALPRNPNDEQNAVHKESKK 65 Query: 781 KDQKALFHIHQCVDPKVFEKIAESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKYELLQ 960 KD KALF IHQC+D +FEKI + K AWDTL R YGGD K+KKV+LQSLR++YELLQ Sbjct: 66 KDGKALFIIHQCLDADIFEKILHCESAKEAWDTLARNYGGDEKLKKVRLQSLRRQYELLQ 125 Query: 961 MKPGEKVGDYVSRVVTVTNQMKVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAKDTSA 1140 M E V Y R+ ++TNQM GE +++ +EKVLR+LT +FDHIVVA E++K+ Sbjct: 126 MNESETVSQYFVRLTSLTNQMVRNGETISDLMKIEKVLRTLTPKFDHIVVAKEESKNLEE 185 Query: 1141 MNLDELQGNLEAHEMRISDRG------PEKESEQALKAQTNKKTYGDXXXXXXXXXXXXX 1302 + +ELQ +LEAHE+R+++R E ++QAL+AQ NKK Sbjct: 186 LKFEELQASLEAHELRLTERSKNNGKQSEDSNDQALQAQYNKK----------------- 228 Query: 1303 XXXXEGGSTEKADTANKGGKKYSNQKG-KEKFDKRKVQCFNCEKYGHFADECWSNK--EN 1473 + ++ + + N+ + N G K+KF+K+++QC+NC+K+GHFA EC S K Sbjct: 229 ---GKNQNSNEGNGKNQDSNQQENSNGQKKKFNKKEIQCYNCQKWGHFAAECKSKKVPRE 285 Query: 1474 QKQEANIAKSDSDDDP----VLLMVTTADDSVVVELWYMDTGCSNHMTGNRK*LTNFDSS 1641 + EA ++DP ++ ++ +D + WY+D GCSNHM+G R D + Sbjct: 286 KTDEAKFVYDKQEEDPESSMLMAVIKEEEDD---DKWYLDIGCSNHMSGKRTWFYELDET 342 Query: 1642 KKTRIRLADDR*LQAEGMGNIIVEKKDGKSVVIEEVWYVPEMKSNLMSVGQLIEKGFSVT 1821 RIR ADD ++AEG+G I + KDGK +I +V YVP MKSNL+S+GQL+EK + V Sbjct: 343 VNRRIRFADDSSVRAEGIGKIKIRSKDGKDALISDVLYVPTMKSNLISIGQLLEKNYVVK 402 Query: 1822 MEDGLMQLYGKKKRQILKSHLEKNRTFKVSIKAAETHCFAATEGERDIALWHKRYGHLNF 2001 MED +++++ K+R ILK+ + K RTFK+ + + +C AT D +WH+RYGHLNF Sbjct: 403 MEDKVLRVFDSKRRLILKAPMTKQRTFKIGLNVIDGNCLLATASNED-WIWHQRYGHLNF 461 Query: 2002 RSLGHLHTKNLVVGVPAVAAVDKTCEICMRSKQPKLPFTSELPPRATHALGVVHSDVCGP 2181 + L L KN+V G+P + D+ C+ C SKQ + + +E+P RAT L +HSDVCGP Sbjct: 462 KDLSILQRKNMVNGLPQIKVSDQVCDKCCISKQSRNSYNTEIPSRATRRLEAIHSDVCGP 521 Query: 2182 LEVPTLGGNKYFVSFVDEYTRMIWIYLIKAKSEVLDQFRKFKNYAEKQSGCQLKILRTDS 2361 E + GGN YFVSF+DE+TR +WIY+I KSEV D F+KF+ + +SG + LRTD Sbjct: 522 FEEKSTGGNSYFVSFIDEFTRKLWIYMIAKKSEVFDVFKKFRIIVQNESGEVISKLRTDG 581 Query: 2362 GGEYNSTEFKRYCEDQGIEHEVTA 2433 GGEY S EFK +C GI+HE+TA Sbjct: 582 GGEYTSNEFKSFCASNGIKHEITA 605 
>dbj|GAU40816.1| hypothetical protein TSUD_398000 [Trifolium subterraneum] Length = 637 Score = 551 bits (1420), Expect = 0.0 Identities = 284/567 (50%), Positives = 381/567 (67%), Gaps = 7/567 (1%) Frame = +1 Query: 664 QMRVLFGAQDVLDLVNDGVTELAENATEVQRSAHRERRKKDQKALFHIHQCVDPKVFEKI 843 QM+V+F Q+ + VN + L NATE+QR+ RE +KKD KALF IHQCVD KVFEKI Sbjct: 4 QMKVIFIVQEADEQVNTILDPLPANATELQRTTFREAQKKDSKALFLIHQCVDSKVFEKI 63 Query: 844 AESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKYELLQMKPGEKVGDYVSRVVTVTNQM 1023 A++TT K AWD L + YGGD KVKKVKLQ+L++++ELL+MK E V +Y +RV T+TNQM Sbjct: 64 ADATTSKDAWDILQKSYGGDAKVKKVKLQALKRQFELLEMKNDEAVAEYFTRVETLTNQM 123 Query: 1024 KVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAKDTSAMNLDELQGNLEAHEMRISDRG 1203 K CG ++E+ +VEKVLR+LT +FDHIVV IEQ KD S + +++LQ L+AHE++ +R Sbjct: 124 KNCGSTLSEEEMVEKVLRTLTHKFDHIVVTIEQTKDLSEIKMEDLQSTLKAHELKHGERN 183 Query: 1204 PEKESEQALKAQTNKKTYGDXXXXXXXXXXXXXXXXXEGGSTEKADTANK--GGKKYSNQ 1377 KE EQAL + K Y D EK NK KK Q Sbjct: 184 HGKEDEQALFVKF--KRYQD----------------------EKKKWQNKKVSSKKEGGQ 219 Query: 1378 KGKEKFDKRKVQCFNCEKYGHFADECWSNK----ENQKQEANIAKSDS-DDDPVLLMVTT 1542 K K+ DK +QC+NC KYGH+A EC + K +N ++EAN+A+ DS +D V MVT Sbjct: 220 KTKK--DKSIIQCYNCNKYGHYASECKAPKKKKSQNTEEEANVAQDDSTSEDDVSFMVTI 277 Query: 1543 ADDSVVVELWYMDTGCSNHMTGNRK*LTNFDSSKKTRIRLADDR*LQAEGMGNIIVEKKD 1722 D++ +WY DTGCSNHMTGN+ LT+F+ TRI+LA+ + AEGMGN+++++ + Sbjct: 278 TDETAESMVWYFDTGCSNHMTGNKSILTDFNKCLNTRIKLANGNFIAAEGMGNVVIQRSN 337 Query: 1723 GKSVVIEEVWYVPEMKSNLMSVGQLIEKGFSVTMEDGLMQLYGKKKRQILKSHLEKNRTF 1902 GK IE+V YVP MK NLMSVGQL+EKGF E ++L+ ++R ILK+ +NRTF Sbjct: 338 GKKADIEKVLYVPGMKCNLMSVGQLLEKGFKAVFEGETLKLFDSRQRLILKTAQSQNRTF 397 Query: 1903 KVSIKAAETHCFAATEGERDIALWHKRYGHLNFRSLGHLHTKNLVVGVPAVAAVDKTCEI 2082 K +K E C A + ++D LWHKRYGHLNF+SL L++KN+V+G+P+V A TC Sbjct: 398 KTQVKTIEVECLATSTEDKDSDLWHKRYGHLNFKSLSMLNSKNMVLGLPSVIAPVDTCTT 457 Query: 2083 CMRSKQPKLPFTSELPPRATHALGVVHSDVCGPLEVPTLGGNKYFVSFVDEYTRMIWIYL 2262 C+ K P+ PF S LP R++ L VVHSD+CGP++V + GGNKYF++FVDEY+RMIW+Y Sbjct: 458 
CLLGKHPRSPFKSNLPMRSSEVLNVVHSDICGPIDVLSTGGNKYFITFVDEYSRMIWLYH 517 Query: 2263 IKAKSEVLDQFRKFKNYAEKQSGCQLK 2343 IKAKSE + F++FK EKQS +K Sbjct: 518 IKAKSEAFEVFKRFKTLVEKQSDKSIK 544 >gb|PNX93875.1| copia-type polyprotein [Trifolium pratense] Length = 1350 Score = 568 bits (1463), Expect = 0.0 Identities = 276/621 (44%), Positives = 410/621 (66%), Gaps = 11/621 (1%) Frame = +1 Query: 598 GNGGFNTHLHVFDGSNWNRWSIQMRVLFGAQDVLDLVNDGVTELAENATEVQRSAHRERR 777 GNG F +L + D N++RW QM+V+FG QDV ++V++GV ++ E A++ ++ A++E + Sbjct: 5 GNGQFPANLPILDNKNYDRWCKQMKVVFGYQDVWEMVSNGVEQITETASDAEKVAYKELK 64 Query: 778 KKDQKALFHIHQCVDPKVFEKIAESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKYELL 957 K+D KALF IHQ V P +FEK+++ + K AWD L YGG KVKKV+LQ+LR++YELL Sbjct: 65 KRDYKALFIIHQSVSPDIFEKVSDCESSKQAWDILAVAYGGGEKVKKVRLQTLRRQYELL 124 Query: 958 QMKPGEKVGDYVSRVVTVTNQMKVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAKDTS 1137 QM+ E V D+ +R+ + N MK CGEV++ Q VEK+LRSL+ +FD+IVVAIE++KD+S Sbjct: 125 QMEEKETVSDFFTRISKLVNSMKSCGEVISSQNRVEKILRSLSPRFDNIVVAIEESKDSS 184 Query: 1138 AMNLDELQGNLEAHEMRISDRGPEKESEQALKAQTN----KKTYGDXXXXXXXXXXXXXX 1305 + +DELQG+LEAHE R+++R EK+ E AL Q N KK G Sbjct: 185 EITVDELQGSLEAHEQRMNERSSEKDKEVALNVQQNNNKDKKGKGKWNGNKGRGGYQNSN 244 Query: 1306 XXXE-----GGSTEKADTANKGGK--KYSNQKGKEKFDKRKVQCFNCEKYGHFADECWSN 1464 + GG+ + KGG+ + G KFDK+ VQC+NC+KYGHFADEC S Sbjct: 245 VKDKQDSTSGGNGGRGGRGYKGGRGGRGGRNNGNYKFDKKNVQCYNCQKYGHFADECRSK 304 Query: 1465 KENQKQEANIAKSDSDDDPVLLMVTTADDSVVVELWYMDTGCSNHMTGNRK*LTNFDSSK 1644 +E EA +A++D D+ V+LMVTT ++ + WY+DTGCS HM+G + + +++ Sbjct: 305 EETNDAEAKLARNDDDEGSVMLMVTTREEGECSDQWYLDTGCSTHMSGRKDWFISLKTTQ 364 Query: 1645 KTRIRLADDR*LQAEGMGNIIVEKKDGKSVVIEEVWYVPEMKSNLMSVGQLIEKGFSVTM 1824 +++ A+D + +G+G++ ++++DGK VI V Y+P MK NL+SVGQL+EK + + M Sbjct: 365 NNQVKFANDSTMNVKGIGDVSIKRRDGKHAVISSVLYIPGMKCNLLSVGQLLEKEYKIVM 424 Query: 1825 EDGLMQLYGKKKRQILKSHLEKNRTFKVSIKAAETHCFAATEGERDIALWHKRYGHLNFR 2004 E L+ + + +LK+ + KNRTFKV + + C T R+ LWH R GHLNF+ Sbjct: 425 EHKLLNVLDTRGNLLLKAPMSKNRTFKVQLNVMKHKCL-MTASSREEWLWHYRMGHLNFK 
483 Query: 2005 SLGHLHTKNLVVGVPAVAAVDKTCEICMRSKQPKLPFTSELPPRATHALGVVHSDVCGPL 2184 L + +N+V G+P + ++ CE C++SKQ + F+ + + L VV+SDVCGP+ Sbjct: 484 DLTVMQRQNMVTGLPKIEMPEEMCEDCIQSKQHRHSFSKDALSKTKSVLEVVYSDVCGPM 543 Query: 2185 EVPTLGGNKYFVSFVDEYTRMIWIYLIKAKSEVLDQFRKFKNYAEKQSGCQLKILRTDSG 2364 +V ++GGNKYFVSF+D+++R +W YLI KS+VLD F+KFK E+QSG +LK LRTD G Sbjct: 544 QVDSIGGNKYFVSFIDDFSRKMWTYLISKKSDVLDVFKKFKLTVERQSGYKLKTLRTDGG 603 Query: 2365 GEYNSTEFKRYCEDQGIEHEV 2427 GEY STEF ++C+ +GI H+V Sbjct: 604 GEYVSTEFAKFCDSEGIVHDV 624 >gb|KYP31826.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1340 Score = 567 bits (1461), Expect = 0.0 Identities = 283/631 (44%), Positives = 418/631 (66%), Gaps = 16/631 (2%) Frame = +1 Query: 589 MNGGNGGFNTHLHVFDGSNWNRWSIQMRVLFGAQDVLDLVNDGVTELAENATEVQRSAHR 768 M F +L V +G NW+RW +QM+ + G Q+V ++V +G L +++T+ Q++ +R Sbjct: 1 MGSTGTNFPANLPVLNGKNWDRWRVQMKAILGYQEVAEIVEEGYPTLTKDSTDAQKALYR 60 Query: 769 ERRKKDQKALFHIHQCVDPKVFEKIAESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKY 948 E +KKD KA F IHQCVD FEKIA + T + AW L +C G ++KKV+LQ++R++Y Sbjct: 61 ENKKKDCKATFLIHQCVDEAHFEKIAGAATSQEAWKILEKCSEGAEQLKKVRLQTMRRQY 120 Query: 949 ELLQMKPGEKVGDYVSRVVTVTNQMKVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAK 1128 EL+QM+ EK+ ++ +R++T TN MK CGE +T+QTIVEK+LR+L +FDHIVVAIE++K Sbjct: 121 ELMQMENNEKIAEFFNRIITHTNAMKNCGEKITDQTIVEKILRTLDPKFDHIVVAIEESK 180 Query: 1129 DTSAMNLDELQGNLEAHEMRISDRGPEKESE-QALKAQTNKKTYGDXXXXXXXXXXXXXX 1305 + ++ELQG+LEAHE R+ +RG K + QAL+AQT+KK Sbjct: 181 KLEELKVEELQGSLEAHEQRLIERGSVKSDDHQALQAQTSKK----------------GR 224 Query: 1306 XXXEGGSTEKADTANKGGKKYSNQKGKEK--FDKRKVQCFNCEKYGHFADECWS------ 1461 +G + +N+ G +SN +G +K D+++++CFNC + GHF+ EC + Sbjct: 225 YNSKGNFRGRGQNSNRRGS-FSNWRGGKKKVIDRKRIKCFNCNRIGHFSAECEAAPGRTD 283 Query: 1462 ---NKENQKQEANIAKSDSD----DDPVLLMVTTADDSVVVELWYMDTGCSNHMTGNRK* 1620 ++ + +A++AK D++ + P++LM+ T +S E WY+D+GCSNHMTG+R Sbjct: 284 QRGSQSHGDYQAHMAKEDNEANLEEQPLMLMMITNPESYNNEEWYIDSGCSNHMTGHRDW 343 Query: 1621 
LTNFDSSKKTRIRLADDR*LQAEGMGNIIVEKKDGKSVVIEEVWYVPEMKSNLMSVGQLI 1800 NFD KK+ ++ AD+R Q EG GN++V+++DG+ VI EV YVP M +NL+S+GQL+ Sbjct: 344 FVNFDPKKKSTVKFADNRATQVEGSGNVLVKREDGRQTVITEVLYVPGMTTNLISLGQLL 403 Query: 1801 EKGFSVTMEDGLMQLYGKKKRQILKSHLEKNRTFKVSIKAAETHCFAATEGERDIALWHK 1980 EKG SV G +++Y K KR ++K+ L KNRTFKVS+ E+ C +A D LWH+ Sbjct: 404 EKGCSVNSVKGFLEIYDKTKRLVMKAPLAKNRTFKVSLNTIESQCLSAAMLSDDSWLWHR 463 Query: 1981 RYGHLNFRSLGHLHTKNLVVGVPAVAAVDKTCEICMRSKQPKLPFTSELPPRATHALGVV 2160 R GHLNFR L L +K ++ G+P++ K C+ C+ SKQP+ F++ +A L VV Sbjct: 464 RLGHLNFRDLSLLKSKEMLTGLPSIKIPKKICDNCLISKQPRNSFSNFTASKANEVLHVV 523 Query: 2161 HSDVCGPLEVPTLGGNKYFVSFVDEYTRMIWIYLIKAKSEVLDQFRKFKNYAEKQSGCQL 2340 +SDVCGP++ P+LGGN+YFVSFVD+ +R W+YLIKAKS+V F+ FK EKQSG + Sbjct: 524 YSDVCGPIDTPSLGGNRYFVSFVDDLSRKAWLYLIKAKSDVFSIFKDFKALVEKQSGKCI 583 Query: 2341 KILRTDSGGEYNSTEFKRYCEDQGIEHEVTA 2433 KILRTD GGE+ S EF+ +C++ GI HEVTA Sbjct: 584 KILRTDGGGEFTSGEFEGFCKEHGIVHEVTA 614 >gb|KYP68287.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1340 Score = 565 bits (1457), Expect = 0.0 Identities = 283/631 (44%), Positives = 417/631 (66%), Gaps = 16/631 (2%) Frame = +1 Query: 589 MNGGNGGFNTHLHVFDGSNWNRWSIQMRVLFGAQDVLDLVNDGVTELAENATEVQRSAHR 768 M F +L V +G NW+RW +QM+ + G Q+V ++V +G L +++T+ Q++ +R Sbjct: 1 MGSTGTNFPANLPVLNGKNWDRWRVQMKAILGYQEVAEIVEEGYPTLTKDSTDAQKALYR 60 Query: 769 ERRKKDQKALFHIHQCVDPKVFEKIAESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKY 948 E +KKD KA F IHQCVD FEKIA + T + AW L +C G ++KKV+LQ++R++Y Sbjct: 61 ENKKKDCKATFLIHQCVDEAHFEKIAGAATSQEAWKILEKCSEGAEQLKKVRLQTMRRQY 120 Query: 949 ELLQMKPGEKVGDYVSRVVTVTNQMKVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAK 1128 EL+QM+ EK+ ++ +R++T TN MK CGE +T+QTIVEK+LR+L +FDHIVVAIE++K Sbjct: 121 ELMQMENNEKIAEFFNRIITHTNAMKNCGEKITDQTIVEKILRTLDPKFDHIVVAIEESK 180 Query: 1129 DTSAMNLDELQGNLEAHEMRISDRGPEKESE-QALKAQTNKKTYGDXXXXXXXXXXXXXX 1305 + ++ELQG+LEAHE R+ +RG K + QAL+AQT+KK Sbjct: 181 KLEELKVEELQGSLEAHEQRLIERGSVKSDDHQALQAQTSKK----------------GR 224 Query: 1306 
XXXEGGSTEKADTANKGGKKYSNQKGKEK--FDKRKVQCFNCEKYGHFADECWS------ 1461 +G + +N+ G +SN +G +K D+++++CFNC + GHF+ EC + Sbjct: 225 YNSKGNFRGRGQNSNRRGS-FSNWRGGKKKVIDRKRIKCFNCNRIGHFSAECEAAPGRTD 283 Query: 1462 ---NKENQKQEANIAKSDSD----DDPVLLMVTTADDSVVVELWYMDTGCSNHMTGNRK* 1620 ++ + +A++AK D++ + P++LM+ T +S E WY+D+GCSNHMTG+R Sbjct: 284 QRGSQSHGDYQAHMAKEDNEANLEEQPLMLMMITNPESYNNEEWYIDSGCSNHMTGHRDW 343 Query: 1621 LTNFDSSKKTRIRLADDR*LQAEGMGNIIVEKKDGKSVVIEEVWYVPEMKSNLMSVGQLI 1800 NFD KK+ ++ AD+R Q EG GN++V+++DG+ VI EV YVP M +NL+S+GQL+ Sbjct: 344 FVNFDPKKKSTVKFADNRATQVEGSGNVLVKREDGRQTVITEVLYVPGMTTNLISLGQLL 403 Query: 1801 EKGFSVTMEDGLMQLYGKKKRQILKSHLEKNRTFKVSIKAAETHCFAATEGERDIALWHK 1980 EKG SV G +++Y K KR ++K+ L KNRTFKVS+ E+ C +A D LWH Sbjct: 404 EKGCSVNSVKGFLEIYDKTKRLVMKAPLAKNRTFKVSLNTIESQCLSAAMLSDDSWLWHL 463 Query: 1981 RYGHLNFRSLGHLHTKNLVVGVPAVAAVDKTCEICMRSKQPKLPFTSELPPRATHALGVV 2160 R GHLNFR L L +K ++ G+P++ K C+ C+ SKQP+ F++ +A L VV Sbjct: 464 RLGHLNFRDLSLLKSKEMLTGLPSIKIPKKICDNCLISKQPRNSFSNFTASKANEVLHVV 523 Query: 2161 HSDVCGPLEVPTLGGNKYFVSFVDEYTRMIWIYLIKAKSEVLDQFRKFKNYAEKQSGCQL 2340 +SDVCGP++ P+LGGN+YFVSFVD+ +R W+YLIKAKS+V F+ FK EKQSG + Sbjct: 524 YSDVCGPIDTPSLGGNRYFVSFVDDLSRKAWLYLIKAKSDVFSIFKDFKALVEKQSGKCI 583 Query: 2341 KILRTDSGGEYNSTEFKRYCEDQGIEHEVTA 2433 KILRTD GGE+ S EF+ +C++ GI HEVTA Sbjct: 584 KILRTDGGGEFTSGEFEGFCKEHGIVHEVTA 614 >dbj|GAU36721.1| hypothetical protein TSUD_318190 [Trifolium subterraneum] Length = 1087 Score = 555 bits (1431), Expect = e-180 Identities = 291/621 (46%), Positives = 391/621 (62%), Gaps = 10/621 (1%) Frame = +1 Query: 598 GNGGFNTHLHVFDGSNWNRWSIQMRVLFGAQDVLDLVNDGVTELAENATEVQRSAHRERR 777 G F+ +L + DG NW+ W QM+V+F Q+ + VN + L NATE QR+ RE + Sbjct: 3 GKSNFHANLPILDGKNWDTWVKQMKVIFIVQEADEQVNTILDPLPANATEQQRTTFREAQ 62 Query: 778 KKDQKALFHIHQCVDPKVFEKIAESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKYELL 957 KKD K LF IHQCVD KVFEKIA++TT K AW L + YGGD KVKKVKLQ+L++++ELL Sbjct: 63 KKDSKTLFLIHQCVDSKVFEKIADATTSKDAWGILQKSYGGDAKVKKVKLQALKRQFELL 122 Query: 958 
QMKPGEKVGDYVSRVVTVTNQMKVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAKDTS 1137 +MK E V +Y +RV T+TNQMK CG ++E+ +VEKVLR+LT +FDHIVV IEQ +D S Sbjct: 123 EMKNDEAVAEYFTRVETLTNQMKNCGSTLSEEEMVEKVLRTLTHKFDHIVVTIEQTRDLS 182 Query: 1138 AMNLDELQGNLEAHEMRISDRGPEKESEQAL-----KAQTNKKTYGDXXXXXXXXXXXXX 1302 + +++LQ LEAHE++ +R KE EQAL K Q KK + + Sbjct: 183 EIKMEDLQNTLEAHELKHCERNHGKEDEQALFVKFKKYQDEKKKWQNKKGSKK------- 235 Query: 1303 XXXXEGGSTEKADTANKGGKKYSNQKGKEKFDKRKVQCFNCEKYGHFADECWSNK----E 1470 E + ++ KK QK K+ DK +QC+NC KYGH+A EC + K + Sbjct: 236 -------GKESVEDKSESSKKEGGQKTKK--DKSTIQCYNCNKYGHYASECKAPKKKKSQ 286 Query: 1471 NQKQEANIAKSDS-DDDPVLLMVTTADDSVVVELWYMDTGCSNHMTGNRK*LTNFDSSKK 1647 N ++EAN+A+ DS +D V MVT D++ +WY DTGCSNHMTGN+ LT+F+ Sbjct: 287 NTEEEANVAQDDSTSEDDVSFMVTITDETTESMVWYFDTGCSNHMTGNKSILTDFNKCLN 346 Query: 1648 TRIRLADDR*LQAEGMGNIIVEKKDGKSVVIEEVWYVPEMKSNLMSVGQLIEKGFSVTME 1827 TRI+LA+ + AEGMGN+++++ +GK VIE+V YVP MK N MSVGQL+EKGF E Sbjct: 347 TRIKLANGNFIAAEGMGNVVIQRSNGKKAVIEKVLYVPGMKYNPMSVGQLLEKGFKAVFE 406 Query: 1828 DGLMQLYGKKKRQILKSHLEKNRTFKVSIKAAETHCFAATEGERDIALWHKRYGHLNFRS 2007 ++L+ K+R ILK+ +NRTFK +K E C A + ++D Sbjct: 407 GERLKLFDLKQRLILKTAQSQNRTFKTQVKTIEIKCLATSTEDKD--------------- 451 Query: 2008 LGHLHTKNLVVGVPAVAAVDKTCEICMRSKQPKLPFTSELPPRATHALGVVHSDVCGPLE 2187 +P+V A TC C+ K P+ F S LP R + L VVHSD+CGP++ Sbjct: 452 -----------RLPSVIAPVDTCTTCLLGKHPRSSFKSNLPMRYSEVLNVVHSDICGPID 500 Query: 2188 VPTLGGNKYFVSFVDEYTRMIWIYLIKAKSEVLDQFRKFKNYAEKQSGCQLKILRTDSGG 2367 V + GGNKYF++FVDEY+RMIW+Y IKAKSE + F++FK EKQS +K+LRTD GG Sbjct: 501 VLSTGGNKYFITFVDEYSRMIWLYHIKAKSEACEVFKRFKTLVEKQSDKSIKVLRTDGGG 560 Query: 2368 EYNSTEFKRYCEDQGIEHEVT 2430 EY S EF+ YC+DQGI HEVT Sbjct: 561 EYTSKEFENYCKDQGIIHEVT 581 >dbj|GAU36409.1| hypothetical protein TSUD_38770 [Trifolium subterraneum] Length = 1259 Score = 555 bits (1430), Expect = e-178 Identities = 284/645 (44%), Positives = 410/645 (63%), Gaps = 39/645 (6%) Frame = +1 Query: 616 THLHVFDGSNWNRWSIQMRVLFGAQDVLDLVNDGVTELAENATEVQRSAHRERRKKDQKA 795 T L 
NW+RW+IQM+ +FG Q+VL+++ +G + + TE QR+ + +KKD K Sbjct: 8 TQLPKLTSENWDRWNIQMQAIFGFQEVLEVIQNGYAVVGDEGTEAQRTLYHANKKKDCKT 67 Query: 796 LFHIHQCVDPKVFEKIAESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKYELLQMKPGE 975 ++ HQ +D F+KI T K AWDTL RC+ GD K KKV LQ+LRK+YE +M+ E Sbjct: 68 IYLTHQSIDEVNFDKIFACATAKQAWDTLERCHTGDMKAKKVNLQALRKQYEHTEMEDDE 127 Query: 976 KVGDYVSRVVTVTNQMKVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAKDTSAMNLDE 1155 K+ D+ ++V +TN M + GE ++++ EK+LRSL +FD IV IE+ KD S M E Sbjct: 128 KIEDFFNKVKVITNSMALNGETISDEQFCEKILRSLPSRFDFIVCTIEETKDLSTMTPTE 187 Query: 1156 LQGNLEAHEMRISDRGPEKESEQALKAQTNKKTYGDXXXXXXXXXXXXXXXXXEGGSTEK 1335 L L+A E+R S+R +K+S+QAL A ++KK++G + S +K Sbjct: 188 LLSTLQARELRFSERNGDKKSDQALYA-SSKKSHGGGKKQWAKNKGKDNQDHHKTQSNDK 246 Query: 1336 ADTANKGGKKYSNQK-GKEKFDKRKVQCFNCEKYGHFADECWSNKENQ-----KQEANI- 1494 + ++KGG N K G +KF+K+ V+C+NC+K GHFADECW K+++ K +++ Sbjct: 247 TEPSSKGGGGGVNYKHGGQKFNKKNVKCYNCDKVGHFADECWFAKDHKWKGKKKHDSDAC 306 Query: 1495 -----AKSDSDDDPVLLMVTT----ADDSVVVELWYMDTGCSNHMTGNRK*LTNFDSSKK 1647 + SD D++ V LM+ T +DD + W++DTGCSNHMT +++ L + + SKK Sbjct: 307 AAQEESSSDGDENEVKLMMATLSEVSDDQSHTDYWFLDTGCSNHMTSHKEWLIDINPSKK 366 Query: 1648 TRIRLADDR*LQAEGMGNIIVEKKDGKSVVIEEVWYVPEMKSNLMSVGQLIEKGFSVTME 1827 ++IR ADDR LQ EGMG +++ + DGK+V++E+V YVP +KSNL+S+GQLI+KGF V M+ Sbjct: 367 SKIRFADDRTLQVEGMGKMVITRDDGKNVIMEDVLYVPGIKSNLLSIGQLIQKGFEVKMK 426 Query: 1828 DGLMQLYGKKKRQILKSHLEKNRTFKVSIKAAETHCFAATEGERDIALWHKRYGHLNFRS 2007 + + L+ + ILK+ L KNRTF++++ A+ C +A E E +WH R+GHLN +S Sbjct: 427 NNSLSLFDTNHKLILKTPLTKNRTFQINMSTAKLMCLSAVETEDINWIWHARFGHLNSKS 486 Query: 2008 LGHLHTKNLVVGVPAVAAVDKTCEICMRSKQPKLPFTSELPPRATHALGVVHSDVCGPLE 2187 L L T ++V G+P + +K C++CM KQ + F SE+P RA + L VVHSDVCGP E Sbjct: 487 LRELGTNHMVNGLPIIKVPEKVCKVCMIGKQTRNSFKSEIPSRARNQLEVVHSDVCGPFE 546 Query: 2188 VPTLGGNKYFVSFVDEYTRMIWIYLIKAKSEVLDQFRKFKNYAEKQSGCQLKILRTDSGG 2367 V TL GN+YF+SFVDE++RM+WIYLIKAKSE D F+KFK EK+S +KI RTD GG Sbjct: 547 VATLAGNRYFISFVDEFSRMMWIYLIKAKSESFDVFKKFKKKVEKESEKSIKIFRTDGGG 606 Query: 2368 
-----------------------EYNSTEFKRYCEDQGIEHEVTA 2433 E+ S EFK++ DQGIEHEVTA Sbjct: 607 EFEASDHVVASELFQSLIIQTSIEFTSNEFKQFLVDQGIEHEVTA 651 >gb|KYP48234.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1029 Score = 548 bits (1412), Expect = e-178 Identities = 280/591 (47%), Positives = 391/591 (66%), Gaps = 3/591 (0%) Frame = +1 Query: 667 MRVLFGAQDVLDLVNDGVTELAENATEVQRSAHRERRKKDQKALFHIHQCVDPKVFEKIA 846 MRV+FG QDVLD+V +GV E+ ATE Q++A++E +KKD KALF IHQCVD FEKI+ Sbjct: 1 MRVIFGFQDVLDIVKEGVQEMDSGATEAQKAANKEAKKKDCKALFIIHQCVDMGNFEKIS 60 Query: 847 ESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKYELLQMKPGEKVGDYVSRVVTVTNQMK 1026 TT K AW+ L R + G+ K+K V+LQ+LR++YELLQM+ E+VG Y +RV+++TN MK Sbjct: 61 NVTTTKEAWEILERAHAGNDKLK-VRLQTLRRQYELLQMEGNEEVGAYFTRVLSLTNLMK 119 Query: 1027 VCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAKDTSAMNLDELQGNLEAHEMRISDRGP 1206 GE ++ Q IVEKVLRSL +FDHIVVAIE++K+ + ++ELQ +LEAHE R+ +R Sbjct: 120 SYGERISSQMIVEKVLRSLLPRFDHIVVAIEESKNLEILKVEELQVSLEAHEQRMVERSA 179 Query: 1207 EKESEQALKAQTNKKTYGDXXXXXXXXXXXXXXXXXEGGSTEKADTANKGGKKYSNQKGK 1386 EK +EQAL+ + K EG +A +G K+ + +G+ Sbjct: 180 EKTAEQALQVHSVKH---------------------EG----EASRNKRGRGKWKHNRGR 214 Query: 1387 ---EKFDKRKVQCFNCEKYGHFADECWSNKENQKQEANIAKSDSDDDPVLLMVTTADDSV 1557 +K +KR +QCFNC+++GH+A+EC++ KE+ + + + DSD D VLLMVTT Sbjct: 215 GGRKKVNKRHIQCFNCQRWGHYAEECYNKKEHNNEAQLVKEDDSDSDQVLLMVTTKTVEN 274 Query: 1558 VVELWYMDTGCSNHMTGNRK*LTNFDSSKKTRIRLADDR*LQAEGMGNIIVEKKDGKSVV 1737 LWY+DTGCS+HMTG + + D S K++++ AD + L AEG+G + + K+ + + Sbjct: 275 SENLWYLDTGCSSHMTGRKDWFSKLDESVKSKVKFADHKTLDAEGIGEVAIRSKNRQRSL 334 Query: 1738 IEEVWYVPEMKSNLMSVGQLIEKGFSVTMEDGLMQLYGKKKRQILKSHLEKNRTFKVSIK 1917 I V +VP MKSNL+S GQL+E+GF + ME+ M +Y KKR IL++ L KNRTF+V I+ Sbjct: 335 ISNVLFVPHMKSNLLSFGQLLERGFKMVMENNGMTVYDNKKRLILRAPLSKNRTFRVEIQ 394 Query: 1918 AAETHCFAATEGERDIALWHKRYGHLNFRSLGHLHTKNLVVGVPAVAAVDKTCEICMRSK 2097 E C + + LWH R+GHLNFR L L N+V G+P + + C C+ K Sbjct: 395 VLEHQCLVSAVNSEE-WLWHYRFGHLNFRDLNLLSRYNMVTGLPRLHQPKEMCRECVECK 453 
Query: 2098 QPKLPFTSELPPRATHALGVVHSDVCGPLEVPTLGGNKYFVSFVDEYTRMIWIYLIKAKS 2277 QP+ F +P R+ L VV+SDVCGPL+ +LGGNKYFV+F+D+++R +W+YLIK K Sbjct: 454 QPRNTFKQHVPIRSRCKLEVVYSDVCGPLQTESLGGNKYFVTFIDDFSRKVWVYLIKNKG 513 Query: 2278 EVLDQFRKFKNYAEKQSGCQLKILRTDSGGEYNSTEFKRYCEDQGIEHEVT 2430 +V F+KFK AEKQ C +KILRTD GGEY S EF +CED+GI HEVT Sbjct: 514 DVFSTFKKFKCLAEKQCDCSVKILRTDGGGEYVSAEFTSFCEDEGIVHEVT 564 >dbj|GAU26253.1| hypothetical protein TSUD_224440 [Trifolium subterraneum] Length = 1312 Score = 555 bits (1431), Expect = e-177 Identities = 278/628 (44%), Positives = 401/628 (63%), Gaps = 18/628 (2%) Frame = +1 Query: 598 GNGGFNTHLHVFDGSNWNRWSIQMRVLFGAQDVLDLVNDGVTELAENATEVQRSAHRERR 777 GNG F +L + +G N++ W QM+V+FG QDV DLV GV + + +T+V+++ +E + Sbjct: 14 GNGAFPGNLLILNGKNYDTWCKQMKVVFGFQDVWDLVQSGVEPITDTSTDVEKATFKELK 73 Query: 778 KKDQKALFHIHQCVDPKVFEKIAESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRKKYELL 957 KKD KALF IHQCV P FE++ ++ + K AWD L + YGG TKVKKV+LQ+ ++++ELL Sbjct: 74 KKDYKALFIIHQCVSPDNFERVGDALSSKEAWDNLEKAYGGATKVKKVRLQTYKRQFELL 133 Query: 958 QMKPGEKVGDYVSRVVTVTNQMKVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQAKDTS 1137 Q++ E VGD+V+RV + N MK CGE + +Q++VEK+LRSLT +FD IVVAIE++KD S Sbjct: 134 QLEEKESVGDFVTRVTKLVNLMKGCGESMNDQSVVEKILRSLTPRFD-IVVAIEESKDLS 192 Query: 1138 AMNLDELQGNLEAHEMRISDRGPEKESEQALKAQTNKKTYGDXXXXXXXXXXXXXXXXXE 1317 + ++E+QG LEA E + ++R + ++E AL+A N+ G + Sbjct: 193 STTVEEIQGVLEASEQKSNERLEKSKNEVALQAHNNQAKKGKGKWSGNRGRGGYQGSNAK 252 Query: 1318 GGSTEKADTANKGGKKYSNQK------------------GKEKFDKRKVQCFNCEKYGHF 1443 N GG+ N G + FDK VQC+NC+KYGHF Sbjct: 253 DNQENGNPNQNNGGRGGFNGNHRGGRGGRGGRNGRGGFNGYKGFDKSNVQCYNCQKYGHF 312 Query: 1444 ADECWSNKENQKQEANIAKSDSDDDPVLLMVTTADDSVVVELWYMDTGCSNHMTGNRK*L 1623 ADEC S E+Q EA +AK D ++PV+LMVTT + E WY+D+GCS HMTG R Sbjct: 313 ADECRSKNESQDDEARVAKQDESENPVMLMVTTKEYQRCGEEWYLDSGCSTHMTGRRDWF 372 Query: 1624 TNFDSSKKTRIRLADDR*LQAEGMGNIIVEKKDGKSVVIEEVWYVPEMKSNLMSVGQLIE 1803 ++FD S + +++ A+D L AEG+G + + K+G I +V Y+P +K NL+SVGQLIE Sbjct: 373 
SSFDQSHRNKVKFANDSTLNAEGVGVVCIRSKNGDQAFINDVLYIPGIKCNLLSVGQLIE 432 Query: 1804 KGFSVTMEDGLMQLYGKKKRQILKSHLEKNRTFKVSIKAAETHCFAATEGERDIALWHKR 1983 K + + +ED +M+L ++ ILK+ + +NRTFK+ + C AT ERD WH R Sbjct: 433 KDYKIVIEDRMMKLMDSNRKLILKAPMSRNRTFKIELNVMNHMCL-ATAIERDDWTWHYR 491 Query: 1984 YGHLNFRSLGHLHTKNLVVGVPAVAAVDKTCEICMRSKQPKLPFTSELPPRATHALGVVH 2163 +GHLNFR L + K++V G+P + ++ CE C++SKQ + F ++ R L V++ Sbjct: 492 FGHLNFRDLNMMSNKSVVSGLPKIQIPNEVCEDCVQSKQHRDSFNKDVKSRTKSVLEVIY 551 Query: 2164 SDVCGPLEVPTLGGNKYFVSFVDEYTRMIWIYLIKAKSEVLDQFRKFKNYAEKQSGCQLK 2343 SDVCGP++V + GGN+YFVSFVD+++R +W YLIK KSEV D F+KFK A+KQS +LK Sbjct: 552 SDVCGPMQVDSNGGNRYFVSFVDDHSRKLWTYLIKRKSEVFDVFKKFKAMAKKQSDHKLK 611 Query: 2344 ILRTDSGGEYNSTEFKRYCEDQGIEHEV 2427 +L+TD GGEY S F +CE +GI HEV Sbjct: 612 VLKTDGGGEYVSKVFSEFCEAEGIVHEV 639 >gb|KYP46743.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 892 Score = 538 bits (1387), Expect = e-176 Identities = 268/622 (43%), Positives = 406/622 (65%), Gaps = 8/622 (1%) Frame = +1 Query: 592 NGGNGGFNTHLHVFDGSNWN---RWSIQMRVLFGAQDVLDLVNDGVTELAENATEVQRSA 762 NG G + ++ + D N N RW ++M+ +FG Q+V+++VN G ++ ENAT+ Q++A Sbjct: 3 NGNEGTLSGNMPMLDSKNLNSYDRWCVRMKAIFGFQEVMEIVNIGYSDPPENATDKQQAA 62 Query: 763 HRERRKKDQKALFHIHQCVDPKVFEKIAESTTVKHAWDTLVRCYGGDTKVKKVKLQSLRK 942 R+ ++KD KALF++HQCVD FEKIA + T K AWD L + Y G ++K V+LQ+LR+ Sbjct: 63 FRKAKRKDCKALFYLHQCVDEANFEKIALAKTAKEAWDILAQSYAGVERLKTVRLQTLRR 122 Query: 943 KYELLQMKPGEKVGDYVSRVVTVTNQMKVCGEVVTEQTIVEKVLRSLTVQFDHIVVAIEQ 1122 +YELLQM E + +Y S++ ++TN MK CGE + E+TIVEKVLR+L +FD I +AIE+ Sbjct: 123 QYELLQMGNQETIQEYFSQLQSLTNLMKSCGENIKERTIVEKVLRTLDTKFDMIAIAIEE 182 Query: 1123 AKDTSAMNLDELQGNLEAHEMRISDRGPEKE-SEQALKAQTNKKTYGDXXXXXXXXXXXX 1299 +K+ ++ L+ELQG+LEA+E R+ +R +K SEQAL A+ NKK + Sbjct: 183 SKNLDSLKLEELQGSLEAYEQRLRERNGDKSGSEQALLAKQNKKAESNRG---------- 232 Query: 1300 XXXXXEGGSTEKADTANKGGKKYSNQKGKEKFDKRKVQCFNCEKYGHFADECWSNK--EN 1473 K + +G G DK VQC+NC KYGH+A +CWS + + Sbjct: 233 
-----------KFNKRGRGRGFRGGYNGGSGSDKSHVQCYNCNKYGHYASDCWSKEGSNS 281 Query: 1474 QKQEANIAKSDSDDDPVLLMVTT--ADDSVVVELWYMDTGCSNHMTGNRK*LTNFDSSKK 1647 +++E N+A+ + +D VLLMVTT + + E WY+DTGCSNHM+ +K N + K Sbjct: 282 KEEEVNVAQKEESEDEVLLMVTTEKPEKKTLSESWYLDTGCSNHMSFQKKWFINLNEKIK 341 Query: 1648 TRIRLADDR*LQAEGMGNIIVEKKDGKSVVIEEVWYVPEMKSNLMSVGQLIEKGFSVTME 1827 ++++ AD+ ++ EG G I++ +KDGK+ VI +V YVP MK NL+S+GQL++KG+ + + Sbjct: 342 SKVKFADNSTVECEGKGKILIRRKDGKTTVISDVLYVPAMKHNLLSIGQLLQKGYLIDWK 401 Query: 1828 DGLMQLYGKKKRQILKSHLEKNRTFKVSIKAAETHCFAATEGERDIALWHKRYGHLNFRS 2007 D ++++ K ILK+ L NRTF+V I ++ CFAA + + LWH R+GHLNF S Sbjct: 402 DQMLRILDKNGSPILKAPLSNNRTFRVDIPVSDCMCFAAAVLDTN-WLWHLRFGHLNFGS 460 Query: 2008 LGHLHTKNLVVGVPAVAAVDKTCEICMRSKQPKLPFTSELPPRATHALGVVHSDVCGPLE 2187 L L K +VVG+P + + TCE CM KQ + PF + L R+ L V+++DVCGP E Sbjct: 461 LSQLAGKEMVVGLPHIQKSEMTCESCMLGKQARNPFKAHLKTRSKDVLEVIYTDVCGPFE 520 Query: 2188 VPTLGGNKYFVSFVDEYTRMIWIYLIKAKSEVLDQFRKFKNYAEKQSGCQLKILRTDSGG 2367 V +LGGNKYF++F+D++++ +W+YLI KS+V F +FK+ EKQSG +K++R+D GG Sbjct: 521 VSSLGGNKYFITFIDDFSKKMWLYLINRKSDVFKCFVEFKSLVEKQSGKVIKVIRSDGGG 580 Query: 2368 EYNSTEFKRYCEDQGIEHEVTA 2433 EY ++EF+ +C+ +G+ HEV A Sbjct: 581 EYTNSEFEIFCKKEGLIHEVVA 602