BLASTX nr result
ID: Astragalus24_contig00013007
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00013007
         (777 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

gb|AAO23078.1| polyprotein [Glycine max]                             244   4e-70
ref|XP_014634047.1| PREDICTED: uncharacterized protein LOC106799...  231   1e-68
dbj|GAU10615.1| hypothetical protein TSUD_418280, partial [Trifo...  222   4e-65
gb|PNX66565.1| hypothetical protein L195_g055159, partial [Trifo...  209   3e-64
gb|PNX61186.1| hypothetical protein L195_g052324, partial [Trifo...  207   4e-64
gb|PNX55196.1| hypothetical protein L195_g048823, partial [Trifo...  212   7e-64
gb|PNX88023.1| hypothetical protein L195_g044123, partial [Trifo...  208   9e-62
gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]  216   2e-60
dbj|GAU45812.1| hypothetical protein TSUD_115000 [Trifolium subt...  215   4e-60
ref|XP_020233863.1| uncharacterized protein LOC109813966 [Cajanu...  208   4e-60
gb|PNX93204.1| hypothetical protein L195_g016355 [Trifolium prat...  207   1e-59
dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subt...  213   2e-59
gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium prat...  212   4e-59
dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subte...  211   2e-58
gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifo...  211   2e-58
gb|KYP49650.1| hypothetical protein KK1_028622 [Cajanus cajan]       201   2e-58
dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subt...
210 2e-58 gb|KYP48558.1| hypothetical protein KK1_029709 [Cajanus cajan] 205 4e-58 gb|PNX99332.1| retrotransposon-related protein [Trifolium pratense] 208 1e-57 gb|KYP40400.1| hypothetical protein KK1_038268 [Cajanus cajan] 200 1e-57 >gb|AAO23078.1| polyprotein [Glycine max] Length = 1552 Score = 244 bits (622), Expect = 4e-70 Identities = 127/250 (50%), Positives = 166/250 (66%), Gaps = 7/250 (2%) Frame = -3 Query: 775 RFKELSAEVKRNAEETNAKFNDVETQILKLQ-------HMIEATDAKLDDRFGKLNDAIA 617 R KE+ AE+K+NA+ +D++ I +L+ IE + D +F +LN ++ Sbjct: 6 RMKEVYAELKKNADAITRVSDDLQNHIERLEATNHAQMEKIEVMQSTNDSQFSQLNAVMS 65 Query: 616 MILQRFXXXXXXXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIF 437 +LQR NSQ Q R +FQVR VKLDFPRFDG NV+ WIF Sbjct: 66 QVLQRLQNIPMSSHGAS----------NSQKEQQRSSFQVRSVKLDFPRFDGKNVMDWIF 115 Query: 436 KAE*FFVYHNTPYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPN 257 KAE FF Y+ TP +RL IASVHLD D+V W+QM+++ E F W+ FTRALE DFGPS Sbjct: 116 KAEQFFDYYATPDADRLIIASVHLDQDVVPWYQMLQKTEPFSSWQAFTRALELDFGPSAY 175 Query: 256 DCPRASLFKLVQTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPS 77 DCPRA+LFKL Q+ TV++YY+ FT+L N+ +GL+ EA+LDCFVSGLQE+I RDVK++EP Sbjct: 176 DCPRATLFKLNQSATVNEYYMQFTALVNRVDGLSAEAILDCFVSGLQEEISRDVKAMEPR 235 Query: 76 TLIRAVALAK 47 TL +AVALAK Sbjct: 236 TLTKAVALAK 245 >ref|XP_014634047.1| PREDICTED: uncharacterized protein LOC106799639 [Glycine max] Length = 600 Score = 231 bits (590), Expect = 1e-68 Identities = 122/247 (49%), Positives = 162/247 (65%), Gaps = 4/247 (1%) Frame = -3 Query: 775 RFKELSAEVKRNAEETNAKFNDVETQILKLQ----HMIEATDAKLDDRFGKLNDAIAMIL 608 R KELS+++KRNAE +ND +I +L+ EA + +F ++N+A+ M+L Sbjct: 6 RMKELSSDIKRNAESIEKMYNDFHEKIDRLEIANASRFEAMQTNTESKFSQINNALDMLL 65 Query: 607 QRFXXXXXXXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE 428 + NS ++ FQVR +KL+FPRFDG NV+ WIF+AE Sbjct: 66 NQ------------SPHKSSHGVGNS----SKQPFQVRNIKLEFPRFDGKNVLEWIFRAE 109 Query: 427 *FFVYHNTPYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCP 248 FF Y+ TP +RLTIASVHLD D+V WFQMM+R+ F W FTRALE 
DFGPS +CP Sbjct: 110 QFFDYYGTPDPDRLTIASVHLDKDVVPWFQMMQRSHPFHSWVEFTRALELDFGPSIYECP 169 Query: 247 RASLFKLVQTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLI 68 RA+LFKL QTGTV+DYYL FTSLANK GL+N+A++DCF+SGL +IRRDV P +++ Sbjct: 170 RATLFKLSQTGTVADYYLQFTSLANKVYGLSNDALIDCFISGLIPEIRRDVMIHTPISMV 229 Query: 67 RAVALAK 47 + V+LAK Sbjct: 230 KVVSLAK 236 >dbj|GAU10615.1| hypothetical protein TSUD_418280, partial [Trifolium subterraneum] Length = 602 Score = 222 bits (566), Expect = 4e-65 Identities = 121/248 (48%), Positives = 157/248 (63%), Gaps = 7/248 (2%) Frame = -3 Query: 769 KELSAEVKRNAEETNAKFNDVETQILKLQHMIEATDAKLDD-------RFGKLNDAIAMI 611 KE+ AE+K+N E + QI +L+ A ++ + +F KLN IA + Sbjct: 8 KEIYAELKKNTEAIETVSTTLTGQIDRLELGGNAQLLRMKEIQRSNETQFAKLNANIAQL 67 Query: 610 LQRFXXXXXXXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKA 431 LQRF NS Q + +FQVR VKLDFPRFDG NV+ WIFK Sbjct: 68 LQRFSPGQSSSHGGR----------NSGNDQPQSSFQVRYVKLDFPRFDGKNVMDWIFKD 117 Query: 430 E*FFVYHNTPYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDC 251 E FF Y+ TP ERL IA VHLDHD+V W+QM+++ F W TRALE DFGPS DC Sbjct: 118 EQFFDYYATPDSERLLIALVHLDHDVVPWYQMIQKTNPFLTWSALTRALELDFGPSAYDC 177 Query: 250 PRASLFKLVQTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTL 71 PRA+LFKL Q+G+V+DYY+ FTSL N+ +GL+ +A+LDCF+SGLQ++I RDVK++EP L Sbjct: 178 PRATLFKLQQSGSVNDYYMQFTSLVNRVDGLSLDAILDCFISGLQDEINRDVKAMEPRNL 237 Query: 70 IRAVALAK 47 +AV LAK Sbjct: 238 SKAVPLAK 245 >gb|PNX66565.1| hypothetical protein L195_g055159, partial [Trifolium pratense] Length = 225 Score = 209 bits (532), Expect = 3e-64 Identities = 112/239 (46%), Positives = 158/239 (66%) Frame = -3 Query: 763 LSAEVKRNAEETNAKFNDVETQILKLQHMIEATDAKLDDRFGKLNDAIAMILQRFXXXXX 584 +S E KRN +E ++++I + +E ++A+ ++F ++ A+ +++ + Sbjct: 9 ISPENKRNTDEMQI----LQSEIFE---RLERSEAENAEKFARIFTALDILIDQ------ 55 Query: 583 XXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNT 404 + + R FQVR VKL+FPRFDGSNV WIF+AE FF Y++T Sbjct: 56 ----------TPTKQNHGAGLPNRPPFQVRNVKLEFPRFDGSNVHEWIFRAEQFFDYYDT 105 
Query: 403 PYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLV 224 P +RLTIASVHLD D+V W+QM++R F W FTRALE DFGPS +CPRA+LFKL Sbjct: 106 PDPDRLTIASVHLDKDVVPWYQMVQRTHPFTSWIEFTRALELDFGPSIYECPRATLFKLT 165 Query: 223 QTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47 Q+GTV++YYL FTSLAN+ GL+N+A++DCFVSGL +IRRDV PS+L++AV+LAK Sbjct: 166 QSGTVAEYYLQFTSLANRVYGLSNDALIDCFVSGLNNEIRRDVLIHTPSSLVKAVSLAK 224 >gb|PNX61186.1| hypothetical protein L195_g052324, partial [Trifolium pratense] Length = 192 Score = 207 bits (528), Expect = 4e-64 Identities = 100/162 (61%), Positives = 128/162 (79%) Frame = -3 Query: 532 SQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIASVHLDHDI 353 +Q R FQVR VKL+FPRFDG+NV WIF+AE FF Y++TP +RLTI+SVHLD D+ Sbjct: 31 NQGANNRAPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFDYYDTPDIDRLTISSVHLDKDV 90 Query: 352 VSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYLAFTSLAN 173 V W+QM++R+ F W FTRALE DFGPS DCPRA+LFKL QTGTV++YYL FTSLAN Sbjct: 91 VPWYQMVQRSHPFTSWIEFTRALELDFGPSIYDCPRATLFKLTQTGTVAEYYLQFTSLAN 150 Query: 172 KSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47 + GL+N+A++DCFVSGL +IRRDV P+++++AV+LAK Sbjct: 151 RVYGLSNDALIDCFVSGLTTEIRRDVLIHTPTSIVKAVSLAK 192 >gb|PNX55196.1| hypothetical protein L195_g048823, partial [Trifolium pratense] Length = 348 Score = 212 bits (540), Expect = 7e-64 Identities = 112/239 (46%), Positives = 157/239 (65%), Gaps = 2/239 (0%) Frame = -3 Query: 757 AEVKRNAEETNAKFNDVETQILKLQHM--IEATDAKLDDRFGKLNDAIAMILQRFXXXXX 584 A+ K N +T K N E + + + + +E +A ++F K+ A+ +++++ Sbjct: 2 AQDKANMTQTENKRNKEEMMLFQAEILERMERLEAGTAEKFDKVYAALDVLIEQ------ 55 Query: 583 XXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNT 404 + + R FQVR VKL+FPRFDG+NV WIF+AE FF Y++T Sbjct: 56 ----------SPRKQNHGAGLNNRPPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFDYYDT 105 Query: 403 PYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLV 224 P +RLTI+SVHLD D+V W+QM++R+ F W FTRALE DFGPS DCPRA+LFKL Sbjct: 106 PDLDRLTISSVHLDKDVVPWYQMVQRSHPFTSWIEFTRALELDFGPSVYDCPRATLFKLT 165 Query: 223 
QTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47 QTGTV++YYL FTSLAN+ GL+N+A++DCFVSGL +IRRDV PS++++AV+LAK Sbjct: 166 QTGTVAEYYLKFTSLANRVYGLSNDALIDCFVSGLNNEIRRDVLIHTPSSIVKAVSLAK 224 >gb|PNX88023.1| hypothetical protein L195_g044123, partial [Trifolium pratense] Length = 400 Score = 208 bits (530), Expect = 9e-62 Identities = 100/157 (63%), Positives = 126/157 (80%) Frame = -3 Query: 517 TRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIASVHLDHDIVSWFQ 338 TR FQVR VKL+FPRF+G+NV WIF+AE FF Y++TP +RLTIASVHLD D+V W+Q Sbjct: 60 TRAPFQVRNVKLEFPRFEGTNVHEWIFRAEQFFEYYDTPDLDRLTIASVHLDKDVVPWYQ 119 Query: 337 MMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYLAFTSLANKSEGL 158 M++R FQ W FTRALE FGPS DCPRA+LFKL QTGTV++YYL FT+LAN+ GL Sbjct: 120 MVQRTHPFQSWIEFTRALELSFGPSVYDCPRATLFKLNQTGTVAEYYLKFTTLANRVYGL 179 Query: 157 TNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47 +N+A++DCFVSGL ++IRRDV PS+L++A +LAK Sbjct: 180 SNDALIDCFVSGLHDEIRRDVLIHTPSSLVKAFSLAK 216 >gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan] Length = 1210 Score = 216 bits (550), Expect = 2e-60 Identities = 113/250 (45%), Positives = 163/250 (65%), Gaps = 7/250 (2%) Frame = -3 Query: 775 RFKELSAEVKRNAEETNAKFNDVETQILKLQHMIEATDAKL-------DDRFGKLNDAIA 617 R K+L A+VK+ +++ ++D++ QI KL+ + + +L DD+F +++ A+ Sbjct: 6 RLKDLQADVKQTSDKLEQYYSDLQAQIAKLESVNSSRFERLENVIQANDDKFNQISVALE 65 Query: 616 MILQRFXXXXXXXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIF 437 +LQ + + FQVR VKLDFPRFDG++V+ WIF Sbjct: 66 TLLQH--------------NSSSQGSFHGSSNSFKPPFQVRNVKLDFPRFDGNHVLDWIF 111 Query: 436 KAE*FFVYHNTPYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPN 257 KAE FF Y+ T +RL+IASVHLD+D+V WFQMM+R+ F W FT ALE FGP+ Sbjct: 112 KAEQFFDYYATSEVDRLSIASVHLDNDVVPWFQMMQRSSPFHSWHAFTEALELAFGPTAY 171 Query: 256 DCPRASLFKLVQTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPS 77 +CPRASLFKL QT +V++YY AF +LAN+ G+ NEA+LDCF+SGL++++RRDV +L P Sbjct: 172 ECPRASLFKLNQTDSVAEYYKAFFALANRVSGIDNEALLDCFLSGLKDELRRDVVALSPP 231 Query: 76 TLIRAVALAK 47 +L++AVALAK Sbjct: 232 
SLVKAVALAK 241 >dbj|GAU45812.1| hypothetical protein TSUD_115000 [Trifolium subterraneum] Length = 1289 Score = 215 bits (548), Expect = 4e-60 Identities = 117/259 (45%), Positives = 162/259 (62%), Gaps = 10/259 (3%) Frame = -3 Query: 775 RFKELSAEVKRNAEETNAKFNDVETQILKLQHMIEATDAKL----------DDRFGKLND 626 R KELS EVK+N + ++ ETQI + +H +D + D+ ++N+ Sbjct: 6 RMKELSDEVKKNTADMKKLYD--ETQI-QFEHFATVSDNRFKTMEDRHSSTDENLDRMNE 62 Query: 625 AIAMILQRFXXXXXXXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIG 446 +++M+L++ ++ + FQVR VKLDFPRFDG NV+ Sbjct: 63 SLSMLLRK----------------TSQNSSHGATNSYKAPFQVRNVKLDFPRFDGKNVME 106 Query: 445 WIFKAE*FFVYHNTPYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGP 266 WIF+AE FF Y++TP ++RLTI +VHLD D+V WFQM++R F W FTRALE DFGP Sbjct: 107 WIFRAEQFFDYYDTPDKDRLTITAVHLDQDVVPWFQMIQRTNPFNSWVEFTRALELDFGP 166 Query: 265 SPNDCPRASLFKLVQTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSL 86 S DCPRASLFKL Q+GTVSDYY+ FT+LAN GL+ +A++DCF+SG+ +IRRDV Sbjct: 167 SIYDCPRASLFKLNQSGTVSDYYIQFTALANMVYGLSIDALVDCFISGINPEIRRDVMIH 226 Query: 85 EPSTLIRAVALAKFV*GKI 29 P T+++ V+LAK KI Sbjct: 227 TPITIVKDVSLAKVYEEKI 245 >ref|XP_020233863.1| uncharacterized protein LOC109813966 [Cajanus cajan] Length = 548 Score = 208 bits (529), Expect = 4e-60 Identities = 103/227 (45%), Positives = 155/227 (68%) Frame = -3 Query: 727 NAKFNDVETQILKLQHMIEATDAKLDDRFGKLNDAIAMILQRFXXXXXXXXXXXXXXXXX 548 N + +++ + K+ ++E D ++RF +L A+ + ++ Sbjct: 4 NTRLKELDANVKKILDLMEKRDLCYNERFAQLELALHDVSKQ-----------------Q 46 Query: 547 XXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIASVH 368 +NSQ T FQVR +KLDFP+FDG+NV+ WIFKAE FF Y++TP +RLTI ++H Sbjct: 47 PGGSNSQ---TNLPFQVRNIKLDFPKFDGTNVLQWIFKAEQFFGYYSTPELQRLTIVAIH 103 Query: 367 LDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYLAF 188 L+ D+V WFQMM++N FQ W+ FT+ALE +FGPSP +CPR++LFKL Q G+V DYY+ F Sbjct: 104 LEKDVVPWFQMMQKNNPFQSWEGFTKALELEFGPSPYECPRSALFKLSQLGSVHDYYVEF 163 Query: 187 TSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47 T+LAN+ GLT +A+LDCF+SGL+ +IRR+V + P+++++AV+LAK 
Sbjct: 164 TALANRVTGLTVDAILDCFLSGLKLEIRREVLAQSPNSVLKAVSLAK 210 >gb|PNX93204.1| hypothetical protein L195_g016355 [Trifolium pratense] Length = 576 Score = 207 bits (528), Expect = 1e-59 Identities = 111/244 (45%), Positives = 152/244 (62%), Gaps = 1/244 (0%) Frame = -3 Query: 775 RFKELSAEVKRNAEETNAKFND-VETQILKLQHMIEATDAKLDDRFGKLNDAIAMILQRF 599 R ++ E+ +N + KF D ++ + I++ + + R ++ A+ +LQR Sbjct: 6 RLTDIQDEITKNIQLQLKKFTDAMDLRDRDYAQRIDSLEMGNEGRLTRIETAVESLLQR- 64 Query: 598 XXXXXXXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FF 419 NS+ T F R VKL+FPRFDG++ I WIFKAE FF Sbjct: 65 ------------AVETERDEANSRKNPTPTPFHTRSVKLEFPRFDGTHAIEWIFKAEQFF 112 Query: 418 VYHNTPYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRAS 239 Y+NTP +RLTIA+VHLD +V W+QMM+R FQ W+ F RA+E DFGPS DCPRA+ Sbjct: 113 EYYNTPDVDRLTIAAVHLDQKVVPWYQMMQRTNPFQSWQLFARAIEVDFGPSCYDCPRAT 172 Query: 238 LFKLVQTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAV 59 LFKL Q TV++YY+ FTSLAN+ G++ EA+LDCFVSGLQ D++R+V + EPS + RAV Sbjct: 173 LFKLTQKSTVAEYYMEFTSLANRVYGVSTEALLDCFVSGLQPDLQREVIAQEPSCIQRAV 232 Query: 58 ALAK 47 ALAK Sbjct: 233 ALAK 236 >dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subterraneum] Length = 1531 Score = 213 bits (543), Expect = 2e-59 Identities = 102/156 (65%), Positives = 128/156 (82%) Frame = -3 Query: 514 RGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIASVHLDHDIVSWFQM 335 R FQVR VKL+FPRFDG+NV WIF+AE FF Y++TP +RLTIA+VHLD D+V W+QM Sbjct: 70 RAPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFEYYDTPDLDRLTIAAVHLDKDVVPWYQM 129 Query: 334 MKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYLAFTSLANKSEGLT 155 M+R+ FQ W FTRALE DFGPS DCPRA+LFKLVQTGTV++Y++ FTSLAN+ GL+ Sbjct: 130 MQRSHPFQSWIDFTRALELDFGPSIYDCPRATLFKLVQTGTVAEYFVQFTSLANRVYGLS 189 Query: 154 NEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47 N+A++DCF+SGL DIRRDV PS+L++AV+LAK Sbjct: 190 NDALVDCFISGLNPDIRRDVLIHTPSSLVKAVSLAK 225 >gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] Length = 1535 Score = 212 bits (540), Expect = 4e-59 Identities = 
117/239 (48%), Positives = 153/239 (64%), Gaps = 1/239 (0%) Frame = -3 Query: 760 SAEVKRNAEETNAKFNDVETQILKLQHMIEA-TDAKLDDRFGKLNDAIAMILQRFXXXXX 584 SAE+KRN E + ++L+ +E TD+K D + L+ I QR Sbjct: 10 SAEIKRNEE-----MQLFQAELLERMERLETNTDSKFDKVYAALDVLITQSQQR------ 58 Query: 583 XXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNT 404 + +R FQVR VKL+FPRFDG+NV WIF+AE FF Y++T Sbjct: 59 --------------PPHGAGSSSRPPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFEYYDT 104 Query: 403 PYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLV 224 P +RLTIASVHLD D+V W+QM++R F W FTRALE DFGPS DCPRA+LFKL Sbjct: 105 PDLDRLTIASVHLDKDVVPWYQMVQRTHPFTSWIEFTRALELDFGPSVYDCPRATLFKLK 164 Query: 223 QTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47 QTGTV++YYL FTSLAN+ GL+N+A++DCFVSGL ++IRRDV P +L++A++LAK Sbjct: 165 QTGTVAEYYLQFTSLANRVYGLSNDALIDCFVSGLNDEIRRDVLIHTPISLVKAMSLAK 223 >dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subterraneum] Length = 1500 Score = 211 bits (536), Expect = 2e-58 Identities = 111/229 (48%), Positives = 155/229 (67%), Gaps = 2/229 (0%) Frame = -3 Query: 727 NAKFNDVETQILKLQHM--IEATDAKLDDRFGKLNDAIAMILQRFXXXXXXXXXXXXXXX 554 N + ND E Q+ +L+ + +E + + +F K++ A+ +++ + Sbjct: 13 NKRIND-EMQLFQLEILERMERLEMNNESKFDKIHTALDLLISQ---------------- 55 Query: 553 XXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIAS 374 T+ +R FQVR VKL+FPRFDG+NV WIF+AE FF Y++TP +RLTI+S Sbjct: 56 SSPKQTHGHGSSSRPPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFEYYDTPDLDRLTISS 115 Query: 373 VHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYL 194 VHLD D+V W+QM++R F W FTRALE DFGPS DCPRA+LFKL QTGTV++YYL Sbjct: 116 VHLDKDVVPWYQMLQRTHPFTSWIEFTRALELDFGPSVYDCPRATLFKLAQTGTVAEYYL 175 Query: 193 AFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47 FTSLAN+ GL+N+A++DCFVSGL++DIRRDV P +L++A++LAK Sbjct: 176 QFTSLANRVYGLSNDALIDCFVSGLKDDIRRDVVLHTPISLVKAMSLAK 224 >gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense] Length = 1502 Score = 211 bits (536), Expect = 2e-58 Identities = 114/239 
(47%), Positives = 154/239 (64%) Frame = -3 Query: 763 LSAEVKRNAEETNAKFNDVETQILKLQHMIEATDAKLDDRFGKLNDAIAMILQRFXXXXX 584 + AE+KRNAE D+ ++ +E ++A ++F K+ AI +++ + Sbjct: 9 IEAELKRNAE-------DMRLYQAEMFERLERSEAANKEKFDKIFTAIDILIDQ------ 55 Query: 583 XXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNT 404 + + R FQVR VKL+FPRFDG+NV WIF+AE FF Y++T Sbjct: 56 ---------SPSKHHHGAGLNNNRPPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFDYYDT 106 Query: 403 PYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLV 224 P +RLTI+SVHLD D+V W+QM++R F W FTRALE DFGPS DCPRA+LFKL Sbjct: 107 PDSDRLTISSVHLDKDVVPWYQMVQRLRPFTSWVEFTRALELDFGPSVYDCPRATLFKLS 166 Query: 223 QTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47 QTGTV++YYL FTSLAN+ GL+N+A++DCFVSGL IRRDV P +L++AV+LAK Sbjct: 167 QTGTVAEYYLQFTSLANRVYGLSNDAMVDCFVSGLNNQIRRDVLIHTPPSLVKAVSLAK 225 >gb|KYP49650.1| hypothetical protein KK1_028622 [Cajanus cajan] Length = 444 Score = 201 bits (511), Expect = 2e-58 Identities = 91/157 (57%), Positives = 126/157 (80%) Frame = -3 Query: 517 TRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIASVHLDHDIVSWFQ 338 T+ FQVR VK+DFPRFDG+ V+ WIFKAE FF +++TP R+TIA+VHLD D+V WFQ Sbjct: 58 TKPPFQVRNVKIDFPRFDGTEVLSWIFKAEQFFDFYDTPDEHRMTIAAVHLDKDVVPWFQ 117 Query: 337 MMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYLAFTSLANKSEGL 158 M+ R + FQ WK FT+ALE +FGPSP +CPR++LFKL QT +V++YY+ F SLAN+ G+ Sbjct: 118 MITRMQPFQSWKQFTKALESEFGPSPFECPRSTLFKLFQTASVNEYYMEFISLANRVYGI 177 Query: 157 TNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47 + +A+LDCF+SGL+ +I+RDV + P +L++AV+LAK Sbjct: 178 SPDALLDCFISGLKPEIKRDVIAQSPLSLLKAVSLAK 214 >dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subterraneum] Length = 1512 Score = 210 bits (535), Expect = 2e-58 Identities = 112/239 (46%), Positives = 155/239 (64%), Gaps = 2/239 (0%) Frame = -3 Query: 757 AEVKRNAEETNAKFNDVETQILK--LQHMIEATDAKLDDRFGKLNDAIAMILQRFXXXXX 584 A V + +T K N E +L+ + +E ++A ++F K+ A+ +++ + Sbjct: 2 APVTASPSKTETKRNTDELTLLQGEIFERLERSEAANAEKFAKIFTALDILIDQTPSKQN 61 Query: 583 
XXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNT 404 + R FQVR VKL+FPRFDG+NV WIF+AE FF Y++T Sbjct: 62 HGIG----------------LNNRTPFQVRNVKLEFPRFDGNNVHEWIFRAEQFFDYYDT 105 Query: 403 PYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLV 224 P +RLTIASVHLD D+V W+QM++R FQ W FTRALE DFGPS +CPRA+LFKL Sbjct: 106 PDLDRLTIASVHLDKDVVPWYQMVQRTTPFQSWMDFTRALELDFGPSIYECPRATLFKLN 165 Query: 223 QTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47 QTGTV++YYL FTSLAN+ GL+N+A++DCF+SGL DIRRDV P+++++AV+LAK Sbjct: 166 QTGTVAEYYLQFTSLANRVYGLSNDALIDCFISGLSADIRRDVLIHTPNSIVKAVSLAK 224 >gb|KYP48558.1| hypothetical protein KK1_029709 [Cajanus cajan] Length = 681 Score = 205 bits (522), Expect = 4e-58 Identities = 93/158 (58%), Positives = 129/158 (81%) Frame = -3 Query: 520 QTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIASVHLDHDIVSWF 341 QT FQVR +KLDFP+FDG+NV+ WIFKAE FF Y++TP +RLTI ++HL+ D+V WF Sbjct: 33 QTNLPFQVRNIKLDFPKFDGTNVLQWIFKAEQFFGYYSTPELQRLTIVAIHLEKDVVPWF 92 Query: 340 QMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYLAFTSLANKSEG 161 QMM++N FQ W+ FT+ALE +FGPSP +CPR++LFKL Q G+V DYY+ FT+LAN+ G Sbjct: 93 QMMQKNNPFQSWEGFTKALELEFGPSPYECPRSALFKLSQLGSVHDYYVEFTALANRVTG 152 Query: 160 LTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47 LT +A+LDCF+SGL+ +IRR+V + P+++++AV+LAK Sbjct: 153 LTVDAILDCFLSGLKLEIRREVLAQSPNSVLKAVSLAK 190 >gb|PNX99332.1| retrotransposon-related protein [Trifolium pratense] Length = 1084 Score = 208 bits (529), Expect = 1e-57 Identities = 108/227 (47%), Positives = 152/227 (66%), Gaps = 2/227 (0%) Frame = -3 Query: 721 KFNDVETQILK--LQHMIEATDAKLDDRFGKLNDAIAMILQRFXXXXXXXXXXXXXXXXX 548 K N E +L+ + +E ++A DD+F ++ A+ +++ + Sbjct: 14 KHNTEEMTLLQGEIFERLERSEAANDDKFNRIFAALDILIDQ----------------TP 57 Query: 547 XXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIASVH 368 + + R FQVR VKL+FPRFDG+NV WIF+AE FF Y+ TP +RLTIASVH Sbjct: 58 SKQNHGAGLNNRLPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFDYYETPDPDRLTIASVH 117 Query: 367 LDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYLAF 
188 LD D+V W+QM++R FQ W FTRALE D+GPS +CPRA+LFKL QTGTV++YYL F Sbjct: 118 LDKDVVPWYQMVQRTTPFQSWIDFTRALELDYGPSIYECPRATLFKLTQTGTVAEYYLKF 177 Query: 187 TSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47 TSLAN+ GL+N+A++DCF+SGL ++IRRDV P+++++AV+LAK Sbjct: 178 TSLANRVYGLSNDAMVDCFISGLTDEIRRDVLIHTPTSIVKAVSLAK 224 >gb|KYP40400.1| hypothetical protein KK1_038268 [Cajanus cajan] Length = 505 Score = 200 bits (509), Expect = 1e-57 Identities = 91/157 (57%), Positives = 125/157 (79%) Frame = -3 Query: 517 TRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIASVHLDHDIVSWFQ 338 T+ FQVR VK+DFPRFDG V+ WIFKAE FF +++TP R+TIA+VHLD D+V WFQ Sbjct: 20 TKPPFQVRNVKIDFPRFDGMEVLSWIFKAEQFFDFYDTPDEHRMTIAAVHLDKDVVPWFQ 79 Query: 337 MMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYLAFTSLANKSEGL 158 M+ R + FQ WK FT+ALE +FGPSP +CPR++LFKL QT +V++YY+ F SLAN+ G+ Sbjct: 80 MITRMQPFQSWKQFTKALESEFGPSPFECPRSTLFKLFQTASVNEYYMEFISLANRVYGI 139 Query: 157 TNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47 + +A+LDCF+SGL+ +I+RDV + P +L++AV+LAK Sbjct: 140 SPDALLDCFISGLKPEIKRDVIAQSPLSLLKAVSLAK 176
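
The per-hit statistics in this report (Score, Expect, Identities, Positives, Gaps, Frame) follow a regular pattern, so they can be pulled out with a regular expression. The following is a minimal sketch in Python, not part of the BLAST output itself: the names HSP_RE and parse_hsp_stats are hypothetical, and the pattern assumes the field layout seen in this report (the Gaps field is absent for ungapped alignments, and legacy BLAST may print E-values such as "e-100" without a leading digit).

```python
import re

# Hypothetical pattern for one statistics line as it appears in this report.
# Whitespace between fields is matched loosely so that both the flattened
# text and the original multi-line layout are accepted.
HSP_RE = re.compile(
    r"Score\s*=\s*(?P<bits>\d+)\s+bits\s+\((?P<raw>\d+)\),\s+"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s+"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s+Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?\s+"
    r"Frame\s*=\s*(?P<frame>[+-]\d)"
)


def parse_hsp_stats(text):
    """Return one dict per statistics line found in text."""
    hits = []
    for m in HSP_RE.finditer(text):
        evalue = m["evalue"].rstrip(",")
        if evalue.startswith("e"):
            # Legacy BLAST may omit the leading digit, e.g. "e-100".
            evalue = "1" + evalue
        hits.append({
            "bits": int(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": float(evalue),
            "identities": int(m["ident"]),
            "align_len": int(m["length"]),
            "positives": int(m["pos"]),
            "gaps": int(m["gaps"] or 0),  # missing Gaps field means no gaps
            "frame": int(m["frame"]),
        })
    return hits


# Two statistics lines copied from the report above (first and fourth hit).
sample = (
    "Score = 244 bits (622), Expect = 4e-70 Identities = 127/250 (50%), "
    "Positives = 166/250 (66%), Gaps = 7/250 (2%) Frame = -3 "
    "Score = 209 bits (532), Expect = 3e-64 Identities = 112/239 (46%), "
    "Positives = 158/239 (66%) Frame = -3"
)

for hit in parse_hsp_stats(sample):
    print(hit)
```

Because the sketch reads only the statistics lines, it does not depend on the column alignment of the Query/Sbjct rows, which is lost in this flattened copy of the report.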