BLASTX nr result

ID: Astragalus24_contig00013007 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00013007
         (777 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAO23078.1| polyprotein [Glycine max]                              244   4e-70
ref|XP_014634047.1| PREDICTED: uncharacterized protein LOC106799...   231   1e-68
dbj|GAU10615.1| hypothetical protein TSUD_418280, partial [Trifo...   222   4e-65
gb|PNX66565.1| hypothetical protein L195_g055159, partial [Trifo...   209   3e-64
gb|PNX61186.1| hypothetical protein L195_g052324, partial [Trifo...   207   4e-64
gb|PNX55196.1| hypothetical protein L195_g048823, partial [Trifo...   212   7e-64
gb|PNX88023.1| hypothetical protein L195_g044123, partial [Trifo...   208   9e-62
gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]   216   2e-60
dbj|GAU45812.1| hypothetical protein TSUD_115000 [Trifolium subt...   215   4e-60
ref|XP_020233863.1| uncharacterized protein LOC109813966 [Cajanu...   208   4e-60
gb|PNX93204.1| hypothetical protein L195_g016355 [Trifolium prat...   207   1e-59
dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subt...   213   2e-59
gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   212   4e-59
dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subte...   211   2e-58
gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifo...   211   2e-58
gb|KYP49650.1| hypothetical protein KK1_028622 [Cajanus cajan]        201   2e-58
dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subt...   210   2e-58
gb|KYP48558.1| hypothetical protein KK1_029709 [Cajanus cajan]        205   4e-58
gb|PNX99332.1| retrotransposon-related protein [Trifolium pratense]   208   1e-57
gb|KYP40400.1| hypothetical protein KK1_038268 [Cajanus cajan]        200   1e-57

>gb|AAO23078.1| polyprotein [Glycine max]
          Length = 1552

 Score =  244 bits (622), Expect = 4e-70
 Identities = 127/250 (50%), Positives = 166/250 (66%), Gaps = 7/250 (2%)
 Frame = -3

Query: 775 RFKELSAEVKRNAEETNAKFNDVETQILKLQ-------HMIEATDAKLDDRFGKLNDAIA 617
           R KE+ AE+K+NA+      +D++  I +L+         IE   +  D +F +LN  ++
Sbjct: 6   RMKEVYAELKKNADAITRVSDDLQNHIERLEATNHAQMEKIEVMQSTNDSQFSQLNAVMS 65

Query: 616 MILQRFXXXXXXXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIF 437
            +LQR                      NSQ  Q R +FQVR VKLDFPRFDG NV+ WIF
Sbjct: 66  QVLQRLQNIPMSSHGAS----------NSQKEQQRSSFQVRSVKLDFPRFDGKNVMDWIF 115

Query: 436 KAE*FFVYHNTPYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPN 257
           KAE FF Y+ TP  +RL IASVHLD D+V W+QM+++ E F  W+ FTRALE DFGPS  
Sbjct: 116 KAEQFFDYYATPDADRLIIASVHLDQDVVPWYQMLQKTEPFSSWQAFTRALELDFGPSAY 175

Query: 256 DCPRASLFKLVQTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPS 77
           DCPRA+LFKL Q+ TV++YY+ FT+L N+ +GL+ EA+LDCFVSGLQE+I RDVK++EP 
Sbjct: 176 DCPRATLFKLNQSATVNEYYMQFTALVNRVDGLSAEAILDCFVSGLQEEISRDVKAMEPR 235

Query: 76  TLIRAVALAK 47
           TL +AVALAK
Sbjct: 236 TLTKAVALAK 245


>ref|XP_014634047.1| PREDICTED: uncharacterized protein LOC106799639 [Glycine max]
          Length = 600

 Score =  231 bits (590), Expect = 1e-68
 Identities = 122/247 (49%), Positives = 162/247 (65%), Gaps = 4/247 (1%)
 Frame = -3

Query: 775 RFKELSAEVKRNAEETNAKFNDVETQILKLQ----HMIEATDAKLDDRFGKLNDAIAMIL 608
           R KELS+++KRNAE     +ND   +I +L+       EA     + +F ++N+A+ M+L
Sbjct: 6   RMKELSSDIKRNAESIEKMYNDFHEKIDRLEIANASRFEAMQTNTESKFSQINNALDMLL 65

Query: 607 QRFXXXXXXXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE 428
            +                      NS    ++  FQVR +KL+FPRFDG NV+ WIF+AE
Sbjct: 66  NQ------------SPHKSSHGVGNS----SKQPFQVRNIKLEFPRFDGKNVLEWIFRAE 109

Query: 427 *FFVYHNTPYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCP 248
            FF Y+ TP  +RLTIASVHLD D+V WFQMM+R+  F  W  FTRALE DFGPS  +CP
Sbjct: 110 QFFDYYGTPDPDRLTIASVHLDKDVVPWFQMMQRSHPFHSWVEFTRALELDFGPSIYECP 169

Query: 247 RASLFKLVQTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLI 68
           RA+LFKL QTGTV+DYYL FTSLANK  GL+N+A++DCF+SGL  +IRRDV    P +++
Sbjct: 170 RATLFKLSQTGTVADYYLQFTSLANKVYGLSNDALIDCFISGLIPEIRRDVMIHTPISMV 229

Query: 67  RAVALAK 47
           + V+LAK
Sbjct: 230 KVVSLAK 236


>dbj|GAU10615.1| hypothetical protein TSUD_418280, partial [Trifolium subterraneum]
          Length = 602

 Score =  222 bits (566), Expect = 4e-65
 Identities = 121/248 (48%), Positives = 157/248 (63%), Gaps = 7/248 (2%)
 Frame = -3

Query: 769 KELSAEVKRNAEETNAKFNDVETQILKLQHMIEATDAKLDD-------RFGKLNDAIAMI 611
           KE+ AE+K+N E        +  QI +L+    A   ++ +       +F KLN  IA +
Sbjct: 8   KEIYAELKKNTEAIETVSTTLTGQIDRLELGGNAQLLRMKEIQRSNETQFAKLNANIAQL 67

Query: 610 LQRFXXXXXXXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKA 431
           LQRF                     NS   Q + +FQVR VKLDFPRFDG NV+ WIFK 
Sbjct: 68  LQRFSPGQSSSHGGR----------NSGNDQPQSSFQVRYVKLDFPRFDGKNVMDWIFKD 117

Query: 430 E*FFVYHNTPYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDC 251
           E FF Y+ TP  ERL IA VHLDHD+V W+QM+++   F  W   TRALE DFGPS  DC
Sbjct: 118 EQFFDYYATPDSERLLIALVHLDHDVVPWYQMIQKTNPFLTWSALTRALELDFGPSAYDC 177

Query: 250 PRASLFKLVQTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTL 71
           PRA+LFKL Q+G+V+DYY+ FTSL N+ +GL+ +A+LDCF+SGLQ++I RDVK++EP  L
Sbjct: 178 PRATLFKLQQSGSVNDYYMQFTSLVNRVDGLSLDAILDCFISGLQDEINRDVKAMEPRNL 237

Query: 70  IRAVALAK 47
            +AV LAK
Sbjct: 238 SKAVPLAK 245


>gb|PNX66565.1| hypothetical protein L195_g055159, partial [Trifolium pratense]
          Length = 225

 Score =  209 bits (532), Expect = 3e-64
 Identities = 112/239 (46%), Positives = 158/239 (66%)
 Frame = -3

Query: 763 LSAEVKRNAEETNAKFNDVETQILKLQHMIEATDAKLDDRFGKLNDAIAMILQRFXXXXX 584
           +S E KRN +E       ++++I +    +E ++A+  ++F ++  A+ +++ +      
Sbjct: 9   ISPENKRNTDEMQI----LQSEIFE---RLERSEAENAEKFARIFTALDILIDQ------ 55

Query: 583 XXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNT 404
                           +   +  R  FQVR VKL+FPRFDGSNV  WIF+AE FF Y++T
Sbjct: 56  ----------TPTKQNHGAGLPNRPPFQVRNVKLEFPRFDGSNVHEWIFRAEQFFDYYDT 105

Query: 403 PYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLV 224
           P  +RLTIASVHLD D+V W+QM++R   F  W  FTRALE DFGPS  +CPRA+LFKL 
Sbjct: 106 PDPDRLTIASVHLDKDVVPWYQMVQRTHPFTSWIEFTRALELDFGPSIYECPRATLFKLT 165

Query: 223 QTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47
           Q+GTV++YYL FTSLAN+  GL+N+A++DCFVSGL  +IRRDV    PS+L++AV+LAK
Sbjct: 166 QSGTVAEYYLQFTSLANRVYGLSNDALIDCFVSGLNNEIRRDVLIHTPSSLVKAVSLAK 224


>gb|PNX61186.1| hypothetical protein L195_g052324, partial [Trifolium pratense]
          Length = 192

 Score =  207 bits (528), Expect = 4e-64
 Identities = 100/162 (61%), Positives = 128/162 (79%)
 Frame = -3

Query: 532 SQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIASVHLDHDI 353
           +Q    R  FQVR VKL+FPRFDG+NV  WIF+AE FF Y++TP  +RLTI+SVHLD D+
Sbjct: 31  NQGANNRAPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFDYYDTPDIDRLTISSVHLDKDV 90

Query: 352 VSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYLAFTSLAN 173
           V W+QM++R+  F  W  FTRALE DFGPS  DCPRA+LFKL QTGTV++YYL FTSLAN
Sbjct: 91  VPWYQMVQRSHPFTSWIEFTRALELDFGPSIYDCPRATLFKLTQTGTVAEYYLQFTSLAN 150

Query: 172 KSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47
           +  GL+N+A++DCFVSGL  +IRRDV    P+++++AV+LAK
Sbjct: 151 RVYGLSNDALIDCFVSGLTTEIRRDVLIHTPTSIVKAVSLAK 192


>gb|PNX55196.1| hypothetical protein L195_g048823, partial [Trifolium pratense]
          Length = 348

 Score =  212 bits (540), Expect = 7e-64
 Identities = 112/239 (46%), Positives = 157/239 (65%), Gaps = 2/239 (0%)
 Frame = -3

Query: 757 AEVKRNAEETNAKFNDVETQILKLQHM--IEATDAKLDDRFGKLNDAIAMILQRFXXXXX 584
           A+ K N  +T  K N  E  + + + +  +E  +A   ++F K+  A+ +++++      
Sbjct: 2   AQDKANMTQTENKRNKEEMMLFQAEILERMERLEAGTAEKFDKVYAALDVLIEQ------ 55

Query: 583 XXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNT 404
                           +   +  R  FQVR VKL+FPRFDG+NV  WIF+AE FF Y++T
Sbjct: 56  ----------SPRKQNHGAGLNNRPPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFDYYDT 105

Query: 403 PYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLV 224
           P  +RLTI+SVHLD D+V W+QM++R+  F  W  FTRALE DFGPS  DCPRA+LFKL 
Sbjct: 106 PDLDRLTISSVHLDKDVVPWYQMVQRSHPFTSWIEFTRALELDFGPSVYDCPRATLFKLT 165

Query: 223 QTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47
           QTGTV++YYL FTSLAN+  GL+N+A++DCFVSGL  +IRRDV    PS++++AV+LAK
Sbjct: 166 QTGTVAEYYLKFTSLANRVYGLSNDALIDCFVSGLNNEIRRDVLIHTPSSIVKAVSLAK 224


>gb|PNX88023.1| hypothetical protein L195_g044123, partial [Trifolium pratense]
          Length = 400

 Score =  208 bits (530), Expect = 9e-62
 Identities = 100/157 (63%), Positives = 126/157 (80%)
 Frame = -3

Query: 517 TRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIASVHLDHDIVSWFQ 338
           TR  FQVR VKL+FPRF+G+NV  WIF+AE FF Y++TP  +RLTIASVHLD D+V W+Q
Sbjct: 60  TRAPFQVRNVKLEFPRFEGTNVHEWIFRAEQFFEYYDTPDLDRLTIASVHLDKDVVPWYQ 119

Query: 337 MMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYLAFTSLANKSEGL 158
           M++R   FQ W  FTRALE  FGPS  DCPRA+LFKL QTGTV++YYL FT+LAN+  GL
Sbjct: 120 MVQRTHPFQSWIEFTRALELSFGPSVYDCPRATLFKLNQTGTVAEYYLKFTTLANRVYGL 179

Query: 157 TNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47
           +N+A++DCFVSGL ++IRRDV    PS+L++A +LAK
Sbjct: 180 SNDALIDCFVSGLHDEIRRDVLIHTPSSLVKAFSLAK 216


>gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 1210

 Score =  216 bits (550), Expect = 2e-60
 Identities = 113/250 (45%), Positives = 163/250 (65%), Gaps = 7/250 (2%)
 Frame = -3

Query: 775 RFKELSAEVKRNAEETNAKFNDVETQILKLQHMIEATDAKL-------DDRFGKLNDAIA 617
           R K+L A+VK+ +++    ++D++ QI KL+ +  +   +L       DD+F +++ A+ 
Sbjct: 6   RLKDLQADVKQTSDKLEQYYSDLQAQIAKLESVNSSRFERLENVIQANDDKFNQISVALE 65

Query: 616 MILQRFXXXXXXXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIF 437
            +LQ                       +      +  FQVR VKLDFPRFDG++V+ WIF
Sbjct: 66  TLLQH--------------NSSSQGSFHGSSNSFKPPFQVRNVKLDFPRFDGNHVLDWIF 111

Query: 436 KAE*FFVYHNTPYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPN 257
           KAE FF Y+ T   +RL+IASVHLD+D+V WFQMM+R+  F  W  FT ALE  FGP+  
Sbjct: 112 KAEQFFDYYATSEVDRLSIASVHLDNDVVPWFQMMQRSSPFHSWHAFTEALELAFGPTAY 171

Query: 256 DCPRASLFKLVQTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPS 77
           +CPRASLFKL QT +V++YY AF +LAN+  G+ NEA+LDCF+SGL++++RRDV +L P 
Sbjct: 172 ECPRASLFKLNQTDSVAEYYKAFFALANRVSGIDNEALLDCFLSGLKDELRRDVVALSPP 231

Query: 76  TLIRAVALAK 47
           +L++AVALAK
Sbjct: 232 SLVKAVALAK 241


>dbj|GAU45812.1| hypothetical protein TSUD_115000 [Trifolium subterraneum]
          Length = 1289

 Score =  215 bits (548), Expect = 4e-60
 Identities = 117/259 (45%), Positives = 162/259 (62%), Gaps = 10/259 (3%)
 Frame = -3

Query: 775 RFKELSAEVKRNAEETNAKFNDVETQILKLQHMIEATDAKL----------DDRFGKLND 626
           R KELS EVK+N  +    ++  ETQI + +H    +D +           D+   ++N+
Sbjct: 6   RMKELSDEVKKNTADMKKLYD--ETQI-QFEHFATVSDNRFKTMEDRHSSTDENLDRMNE 62

Query: 625 AIAMILQRFXXXXXXXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIG 446
           +++M+L++                     ++      +  FQVR VKLDFPRFDG NV+ 
Sbjct: 63  SLSMLLRK----------------TSQNSSHGATNSYKAPFQVRNVKLDFPRFDGKNVME 106

Query: 445 WIFKAE*FFVYHNTPYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGP 266
           WIF+AE FF Y++TP ++RLTI +VHLD D+V WFQM++R   F  W  FTRALE DFGP
Sbjct: 107 WIFRAEQFFDYYDTPDKDRLTITAVHLDQDVVPWFQMIQRTNPFNSWVEFTRALELDFGP 166

Query: 265 SPNDCPRASLFKLVQTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSL 86
           S  DCPRASLFKL Q+GTVSDYY+ FT+LAN   GL+ +A++DCF+SG+  +IRRDV   
Sbjct: 167 SIYDCPRASLFKLNQSGTVSDYYIQFTALANMVYGLSIDALVDCFISGINPEIRRDVMIH 226

Query: 85  EPSTLIRAVALAKFV*GKI 29
            P T+++ V+LAK    KI
Sbjct: 227 TPITIVKDVSLAKVYEEKI 245


>ref|XP_020233863.1| uncharacterized protein LOC109813966 [Cajanus cajan]
          Length = 548

 Score =  208 bits (529), Expect = 4e-60
 Identities = 103/227 (45%), Positives = 155/227 (68%)
 Frame = -3

Query: 727 NAKFNDVETQILKLQHMIEATDAKLDDRFGKLNDAIAMILQRFXXXXXXXXXXXXXXXXX 548
           N +  +++  + K+  ++E  D   ++RF +L  A+  + ++                  
Sbjct: 4   NTRLKELDANVKKILDLMEKRDLCYNERFAQLELALHDVSKQ-----------------Q 46

Query: 547 XXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIASVH 368
              +NSQ   T   FQVR +KLDFP+FDG+NV+ WIFKAE FF Y++TP  +RLTI ++H
Sbjct: 47  PGGSNSQ---TNLPFQVRNIKLDFPKFDGTNVLQWIFKAEQFFGYYSTPELQRLTIVAIH 103

Query: 367 LDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYLAF 188
           L+ D+V WFQMM++N  FQ W+ FT+ALE +FGPSP +CPR++LFKL Q G+V DYY+ F
Sbjct: 104 LEKDVVPWFQMMQKNNPFQSWEGFTKALELEFGPSPYECPRSALFKLSQLGSVHDYYVEF 163

Query: 187 TSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47
           T+LAN+  GLT +A+LDCF+SGL+ +IRR+V +  P+++++AV+LAK
Sbjct: 164 TALANRVTGLTVDAILDCFLSGLKLEIRREVLAQSPNSVLKAVSLAK 210


>gb|PNX93204.1| hypothetical protein L195_g016355 [Trifolium pratense]
          Length = 576

 Score =  207 bits (528), Expect = 1e-59
 Identities = 111/244 (45%), Positives = 152/244 (62%), Gaps = 1/244 (0%)
 Frame = -3

Query: 775 RFKELSAEVKRNAEETNAKFND-VETQILKLQHMIEATDAKLDDRFGKLNDAIAMILQRF 599
           R  ++  E+ +N +    KF D ++ +       I++ +   + R  ++  A+  +LQR 
Sbjct: 6   RLTDIQDEITKNIQLQLKKFTDAMDLRDRDYAQRIDSLEMGNEGRLTRIETAVESLLQR- 64

Query: 598 XXXXXXXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FF 419
                                NS+   T   F  R VKL+FPRFDG++ I WIFKAE FF
Sbjct: 65  ------------AVETERDEANSRKNPTPTPFHTRSVKLEFPRFDGTHAIEWIFKAEQFF 112

Query: 418 VYHNTPYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRAS 239
            Y+NTP  +RLTIA+VHLD  +V W+QMM+R   FQ W+ F RA+E DFGPS  DCPRA+
Sbjct: 113 EYYNTPDVDRLTIAAVHLDQKVVPWYQMMQRTNPFQSWQLFARAIEVDFGPSCYDCPRAT 172

Query: 238 LFKLVQTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAV 59
           LFKL Q  TV++YY+ FTSLAN+  G++ EA+LDCFVSGLQ D++R+V + EPS + RAV
Sbjct: 173 LFKLTQKSTVAEYYMEFTSLANRVYGVSTEALLDCFVSGLQPDLQREVIAQEPSCIQRAV 232

Query: 58  ALAK 47
           ALAK
Sbjct: 233 ALAK 236


>dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subterraneum]
          Length = 1531

 Score =  213 bits (543), Expect = 2e-59
 Identities = 102/156 (65%), Positives = 128/156 (82%)
 Frame = -3

Query: 514 RGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIASVHLDHDIVSWFQM 335
           R  FQVR VKL+FPRFDG+NV  WIF+AE FF Y++TP  +RLTIA+VHLD D+V W+QM
Sbjct: 70  RAPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFEYYDTPDLDRLTIAAVHLDKDVVPWYQM 129

Query: 334 MKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYLAFTSLANKSEGLT 155
           M+R+  FQ W  FTRALE DFGPS  DCPRA+LFKLVQTGTV++Y++ FTSLAN+  GL+
Sbjct: 130 MQRSHPFQSWIDFTRALELDFGPSIYDCPRATLFKLVQTGTVAEYFVQFTSLANRVYGLS 189

Query: 154 NEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47
           N+A++DCF+SGL  DIRRDV    PS+L++AV+LAK
Sbjct: 190 NDALVDCFISGLNPDIRRDVLIHTPSSLVKAVSLAK 225


>gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1535

 Score =  212 bits (540), Expect = 4e-59
 Identities = 117/239 (48%), Positives = 153/239 (64%), Gaps = 1/239 (0%)
 Frame = -3

Query: 760 SAEVKRNAEETNAKFNDVETQILKLQHMIEA-TDAKLDDRFGKLNDAIAMILQRFXXXXX 584
           SAE+KRN E         + ++L+    +E  TD+K D  +  L+  I    QR      
Sbjct: 10  SAEIKRNEE-----MQLFQAELLERMERLETNTDSKFDKVYAALDVLITQSQQR------ 58

Query: 583 XXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNT 404
                           +     +R  FQVR VKL+FPRFDG+NV  WIF+AE FF Y++T
Sbjct: 59  --------------PPHGAGSSSRPPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFEYYDT 104

Query: 403 PYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLV 224
           P  +RLTIASVHLD D+V W+QM++R   F  W  FTRALE DFGPS  DCPRA+LFKL 
Sbjct: 105 PDLDRLTIASVHLDKDVVPWYQMVQRTHPFTSWIEFTRALELDFGPSVYDCPRATLFKLK 164

Query: 223 QTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47
           QTGTV++YYL FTSLAN+  GL+N+A++DCFVSGL ++IRRDV    P +L++A++LAK
Sbjct: 165 QTGTVAEYYLQFTSLANRVYGLSNDALIDCFVSGLNDEIRRDVLIHTPISLVKAMSLAK 223


>dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subterraneum]
          Length = 1500

 Score =  211 bits (536), Expect = 2e-58
 Identities = 111/229 (48%), Positives = 155/229 (67%), Gaps = 2/229 (0%)
 Frame = -3

Query: 727 NAKFNDVETQILKLQHM--IEATDAKLDDRFGKLNDAIAMILQRFXXXXXXXXXXXXXXX 554
           N + ND E Q+ +L+ +  +E  +   + +F K++ A+ +++ +                
Sbjct: 13  NKRIND-EMQLFQLEILERMERLEMNNESKFDKIHTALDLLISQ---------------- 55

Query: 553 XXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIAS 374
                T+     +R  FQVR VKL+FPRFDG+NV  WIF+AE FF Y++TP  +RLTI+S
Sbjct: 56  SSPKQTHGHGSSSRPPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFEYYDTPDLDRLTISS 115

Query: 373 VHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYL 194
           VHLD D+V W+QM++R   F  W  FTRALE DFGPS  DCPRA+LFKL QTGTV++YYL
Sbjct: 116 VHLDKDVVPWYQMLQRTHPFTSWIEFTRALELDFGPSVYDCPRATLFKLAQTGTVAEYYL 175

Query: 193 AFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47
            FTSLAN+  GL+N+A++DCFVSGL++DIRRDV    P +L++A++LAK
Sbjct: 176 QFTSLANRVYGLSNDALIDCFVSGLKDDIRRDVVLHTPISLVKAMSLAK 224


>gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 1502

 Score =  211 bits (536), Expect = 2e-58
 Identities = 114/239 (47%), Positives = 154/239 (64%)
 Frame = -3

Query: 763 LSAEVKRNAEETNAKFNDVETQILKLQHMIEATDAKLDDRFGKLNDAIAMILQRFXXXXX 584
           + AE+KRNAE       D+     ++   +E ++A   ++F K+  AI +++ +      
Sbjct: 9   IEAELKRNAE-------DMRLYQAEMFERLERSEAANKEKFDKIFTAIDILIDQ------ 55

Query: 583 XXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNT 404
                            + +   R  FQVR VKL+FPRFDG+NV  WIF+AE FF Y++T
Sbjct: 56  ---------SPSKHHHGAGLNNNRPPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFDYYDT 106

Query: 403 PYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLV 224
           P  +RLTI+SVHLD D+V W+QM++R   F  W  FTRALE DFGPS  DCPRA+LFKL 
Sbjct: 107 PDSDRLTISSVHLDKDVVPWYQMVQRLRPFTSWVEFTRALELDFGPSVYDCPRATLFKLS 166

Query: 223 QTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47
           QTGTV++YYL FTSLAN+  GL+N+A++DCFVSGL   IRRDV    P +L++AV+LAK
Sbjct: 167 QTGTVAEYYLQFTSLANRVYGLSNDAMVDCFVSGLNNQIRRDVLIHTPPSLVKAVSLAK 225


>gb|KYP49650.1| hypothetical protein KK1_028622 [Cajanus cajan]
          Length = 444

 Score =  201 bits (511), Expect = 2e-58
 Identities = 91/157 (57%), Positives = 126/157 (80%)
 Frame = -3

Query: 517 TRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIASVHLDHDIVSWFQ 338
           T+  FQVR VK+DFPRFDG+ V+ WIFKAE FF +++TP   R+TIA+VHLD D+V WFQ
Sbjct: 58  TKPPFQVRNVKIDFPRFDGTEVLSWIFKAEQFFDFYDTPDEHRMTIAAVHLDKDVVPWFQ 117

Query: 337 MMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYLAFTSLANKSEGL 158
           M+ R + FQ WK FT+ALE +FGPSP +CPR++LFKL QT +V++YY+ F SLAN+  G+
Sbjct: 118 MITRMQPFQSWKQFTKALESEFGPSPFECPRSTLFKLFQTASVNEYYMEFISLANRVYGI 177

Query: 157 TNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47
           + +A+LDCF+SGL+ +I+RDV +  P +L++AV+LAK
Sbjct: 178 SPDALLDCFISGLKPEIKRDVIAQSPLSLLKAVSLAK 214


>dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subterraneum]
          Length = 1512

 Score =  210 bits (535), Expect = 2e-58
 Identities = 112/239 (46%), Positives = 155/239 (64%), Gaps = 2/239 (0%)
 Frame = -3

Query: 757 AEVKRNAEETNAKFNDVETQILK--LQHMIEATDAKLDDRFGKLNDAIAMILQRFXXXXX 584
           A V  +  +T  K N  E  +L+  +   +E ++A   ++F K+  A+ +++ +      
Sbjct: 2   APVTASPSKTETKRNTDELTLLQGEIFERLERSEAANAEKFAKIFTALDILIDQTPSKQN 61

Query: 583 XXXXXXXXXXXXXXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNT 404
                               +  R  FQVR VKL+FPRFDG+NV  WIF+AE FF Y++T
Sbjct: 62  HGIG----------------LNNRTPFQVRNVKLEFPRFDGNNVHEWIFRAEQFFDYYDT 105

Query: 403 PYRERLTIASVHLDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLV 224
           P  +RLTIASVHLD D+V W+QM++R   FQ W  FTRALE DFGPS  +CPRA+LFKL 
Sbjct: 106 PDLDRLTIASVHLDKDVVPWYQMVQRTTPFQSWMDFTRALELDFGPSIYECPRATLFKLN 165

Query: 223 QTGTVSDYYLAFTSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47
           QTGTV++YYL FTSLAN+  GL+N+A++DCF+SGL  DIRRDV    P+++++AV+LAK
Sbjct: 166 QTGTVAEYYLQFTSLANRVYGLSNDALIDCFISGLSADIRRDVLIHTPNSIVKAVSLAK 224


>gb|KYP48558.1| hypothetical protein KK1_029709 [Cajanus cajan]
          Length = 681

 Score =  205 bits (522), Expect = 4e-58
 Identities = 93/158 (58%), Positives = 129/158 (81%)
 Frame = -3

Query: 520 QTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIASVHLDHDIVSWF 341
           QT   FQVR +KLDFP+FDG+NV+ WIFKAE FF Y++TP  +RLTI ++HL+ D+V WF
Sbjct: 33  QTNLPFQVRNIKLDFPKFDGTNVLQWIFKAEQFFGYYSTPELQRLTIVAIHLEKDVVPWF 92

Query: 340 QMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYLAFTSLANKSEG 161
           QMM++N  FQ W+ FT+ALE +FGPSP +CPR++LFKL Q G+V DYY+ FT+LAN+  G
Sbjct: 93  QMMQKNNPFQSWEGFTKALELEFGPSPYECPRSALFKLSQLGSVHDYYVEFTALANRVTG 152

Query: 160 LTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47
           LT +A+LDCF+SGL+ +IRR+V +  P+++++AV+LAK
Sbjct: 153 LTVDAILDCFLSGLKLEIRREVLAQSPNSVLKAVSLAK 190


>gb|PNX99332.1| retrotransposon-related protein [Trifolium pratense]
          Length = 1084

 Score =  208 bits (529), Expect = 1e-57
 Identities = 108/227 (47%), Positives = 152/227 (66%), Gaps = 2/227 (0%)
 Frame = -3

Query: 721 KFNDVETQILK--LQHMIEATDAKLDDRFGKLNDAIAMILQRFXXXXXXXXXXXXXXXXX 548
           K N  E  +L+  +   +E ++A  DD+F ++  A+ +++ +                  
Sbjct: 14  KHNTEEMTLLQGEIFERLERSEAANDDKFNRIFAALDILIDQ----------------TP 57

Query: 547 XXXTNSQVIQTRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIASVH 368
               +   +  R  FQVR VKL+FPRFDG+NV  WIF+AE FF Y+ TP  +RLTIASVH
Sbjct: 58  SKQNHGAGLNNRLPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFDYYETPDPDRLTIASVH 117

Query: 367 LDHDIVSWFQMMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYLAF 188
           LD D+V W+QM++R   FQ W  FTRALE D+GPS  +CPRA+LFKL QTGTV++YYL F
Sbjct: 118 LDKDVVPWYQMVQRTTPFQSWIDFTRALELDYGPSIYECPRATLFKLTQTGTVAEYYLKF 177

Query: 187 TSLANKSEGLTNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47
           TSLAN+  GL+N+A++DCF+SGL ++IRRDV    P+++++AV+LAK
Sbjct: 178 TSLANRVYGLSNDAMVDCFISGLTDEIRRDVLIHTPTSIVKAVSLAK 224


>gb|KYP40400.1| hypothetical protein KK1_038268 [Cajanus cajan]
          Length = 505

 Score =  200 bits (509), Expect = 1e-57
 Identities = 91/157 (57%), Positives = 125/157 (79%)
 Frame = -3

Query: 517 TRGAFQVRPVKLDFPRFDGSNVIGWIFKAE*FFVYHNTPYRERLTIASVHLDHDIVSWFQ 338
           T+  FQVR VK+DFPRFDG  V+ WIFKAE FF +++TP   R+TIA+VHLD D+V WFQ
Sbjct: 20  TKPPFQVRNVKIDFPRFDGMEVLSWIFKAEQFFDFYDTPDEHRMTIAAVHLDKDVVPWFQ 79

Query: 337 MMKRNEQFQDWKTFTRALEQDFGPSPNDCPRASLFKLVQTGTVSDYYLAFTSLANKSEGL 158
           M+ R + FQ WK FT+ALE +FGPSP +CPR++LFKL QT +V++YY+ F SLAN+  G+
Sbjct: 80  MITRMQPFQSWKQFTKALESEFGPSPFECPRSTLFKLFQTASVNEYYMEFISLANRVYGI 139

Query: 157 TNEAVLDCFVSGLQEDIRRDVKSLEPSTLIRAVALAK 47
           + +A+LDCF+SGL+ +I+RDV +  P +L++AV+LAK
Sbjct: 140 SPDALLDCFISGLKPEIKRDVIAQSPLSLLKAVSLAK 176