BLASTX nr result

ID: Astragalus24_contig00022878 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00022878
         (836 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_020208035.1| uncharacterized protein LOC109792988 [Cajanu...   244   1e-75
dbj|GAU37816.1| hypothetical protein TSUD_276340 [Trifolium subt...   241   2e-75
gb|PNX91511.1| hypothetical protein L195_g047642, partial [Trifo...   238   3e-73
dbj|GAU25735.1| hypothetical protein TSUD_216660 [Trifolium subt...   238   1e-67
gb|PNY17729.1| retrotransposon-related protein [Trifolium pratense]   238   1e-67
gb|PNX93486.1| retrotransposon-related protein, partial [Trifoli...   237   2e-67
dbj|GAU37335.1| hypothetical protein TSUD_395160 [Trifolium subt...   230   2e-67
gb|PNX92911.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   236   4e-67
gb|PNX92266.1| retrotransposon-related protein [Trifolium pratense]   236   5e-67
gb|PNY17651.1| retrotransposon-related protein [Trifolium pratense]   234   1e-66
dbj|GAU12723.1| hypothetical protein TSUD_122150 [Trifolium subt...   233   5e-66
gb|PNX92994.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   233   5e-66
gb|PNX92532.1| retrotransposon-related protein [Trifolium pratense]   232   1e-65
dbj|GAU10396.1| hypothetical protein TSUD_421780, partial [Trifo...   211   1e-65
dbj|GAU24592.1| hypothetical protein TSUD_289530 [Trifolium subt...   231   2e-65
dbj|GAU17344.1| hypothetical protein TSUD_232200 [Trifolium subt...   204   3e-62
dbj|GAU40456.1| hypothetical protein TSUD_141360 [Trifolium subt...   217   1e-60
gb|KYP32928.1| hypothetical protein KK1_046280 [Cajanus cajan]        202   6e-58
dbj|GAU41744.1| hypothetical protein TSUD_180930 [Trifolium subt...   187   2e-56
gb|PNX62764.1| hypothetical protein L195_g053153, partial [Trifo...   189   1e-55

>ref|XP_020208035.1| uncharacterized protein LOC109792988 [Cajanus cajan]
          Length = 388

 Score =  244 bits (624), Expect = 1e-75
 Identities = 127/243 (52%), Positives = 166/243 (68%)
 Frame = +3

Query: 36  DLDCKIEENRAESNARMTQLQESLENRIESRMEELRNLIVAQIRGPSQGSNVTXXXXXXX 215
           DL+ K+++  AE NAR+ Q+  S        MEE+R+L+  Q    SQ  N         
Sbjct: 13  DLEKKVDDKHAEINARLEQMSVS--------MEEIRSLLCTQA---SQNQNEA------- 54

Query: 216 XXENTVRPPPEATPARHDTHPVRNYSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLDETPP 395
              + ++     +     +H   +YSTR+SKV+FP+FDGK++K WLYKCDQFF+LD TP 
Sbjct: 55  --SSIIKNNNSFSENIKSSH--HSYSTRISKVEFPRFDGKRMKEWLYKCDQFFMLDGTPA 110

Query: 396 ESRVRLASIHLEGLALQWHLNYMCGRFDQYPLWDLYSADVKRRFDDLYEDPLAELIQVKH 575
           ES+VRLASIHL+G+ALQWHLNYM  +FD YP W  Y  DV  RF ++Y+DPL+ LIQVK 
Sbjct: 111 ESKVRLASIHLDGIALQWHLNYMRNKFDIYPPWQQYVTDVTMRFGEIYDDPLSSLIQVKQ 170

Query: 576 SSTAQNYID*FELALTQMSILPEHALRIFLAGLNLTTQAHVRMFNPKNIAHVANLAKLCE 755
           S T Q+Y+D FELALTQ+S+LPEH+L IFL  L  +TQ HVRMFNP +IAH ANLAKL E
Sbjct: 171 SGTIQDYVDEFELALTQVSLLPEHSLSIFLTSLEYSTQMHVRMFNPNSIAHAANLAKLYE 230

Query: 756 SAK 764
           +++
Sbjct: 231 ASR 233


>dbj|GAU37816.1| hypothetical protein TSUD_276340 [Trifolium subterraneum]
          Length = 296

 Score =  241 bits (614), Expect = 2e-75
 Identities = 130/242 (53%), Positives = 154/242 (63%), Gaps = 5/242 (2%)
 Frame = +3

Query: 57  ENRAESNARMTQLQESLENRIESRMEELRNLIVAQIRGPSQGSNVTXXXXXXXXXENTVR 236
           E R E      QLQ + +      M+ELR LI       SQ  N                
Sbjct: 13  ERRFEDRVNKMQLQRTAD------MDELRTLIHPPSENSSQDPN---------------- 50

Query: 237 PPPEATPARHDTHPVRN-----YSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLDETPPES 401
                  +RH +  VRN     Y+TR+SKV+FP+FDGK V+ WLYKCDQFFLLDETP  S
Sbjct: 51  -------SRHGSGGVRNGTQNVYATRISKVEFPRFDGKNVRDWLYKCDQFFLLDETPAAS 103

Query: 402 RVRLASIHLEGLALQWHLNYMCGRFDQYPLWDLYSADVKRRFDDLYEDPLAELIQVKHSS 581
           RVRLASIHL+GLALQWHLNYM  +FD YP W  Y  DV  RF D YEDPL+ L+++KHS 
Sbjct: 104 RVRLASIHLDGLALQWHLNYMRQKFDIYPSWQQYITDVTTRFGDAYEDPLSYLLEIKHSG 163

Query: 582 TAQNYID*FELALTQMSILPEHALRIFLAGLNLTTQAHVRMFNPKNIAHVANLAKLCESA 761
             Q YID FELALTQ++++PEH+L IFLAGL   TQ HVRMFNP +IAH  NLAKL ES+
Sbjct: 164 KIQEYIDRFELALTQVNLIPEHSLSIFLAGLEHNTQIHVRMFNPTSIAHATNLAKLHESS 223

Query: 762 KP 767
           +P
Sbjct: 224 QP 225


>gb|PNX91511.1| hypothetical protein L195_g047642, partial [Trifolium pratense]
          Length = 393

 Score =  238 bits (608), Expect = 3e-73
 Identities = 112/160 (70%), Positives = 131/160 (81%)
 Frame = +3

Query: 288 YSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLDETPPESRVRLASIHLEGLALQWHLNYMC 467
           Y+TR+SKV+FP+FDGK V+ W YKCDQFFLLDETPP SRVRLASIHL+GLALQWHLNYM 
Sbjct: 66  YATRISKVEFPRFDGKNVRDWFYKCDQFFLLDETPPTSRVRLASIHLDGLALQWHLNYMR 125

Query: 468 GRFDQYPLWDLYSADVKRRFDDLYEDPLAELIQVKHSSTAQNYID*FELALTQMSILPEH 647
            +FD YP W  Y  DV  RF D YEDPL+ L+QVKH+   Q+YID FELALTQ+S++ EH
Sbjct: 126 QKFDVYPSWQQYITDVTARFGDAYEDPLSSLLQVKHTGKVQDYIDQFELALTQVSLITEH 185

Query: 648 ALRIFLAGLNLTTQAHVRMFNPKNIAHVANLAKLCESAKP 767
           +L IFLAGL+  TQ HVRMFNP +IAH ANLAKL E+A+P
Sbjct: 186 SLSIFLAGLDYNTQMHVRMFNPSSIAHAANLAKLHEAAQP 225


>dbj|GAU25735.1| hypothetical protein TSUD_216660 [Trifolium subterraneum]
          Length = 1417

 Score =  238 bits (606), Expect = 1e-67
 Identities = 130/239 (54%), Positives = 159/239 (66%)
 Frame = +3

Query: 51  IEENRAESNARMTQLQESLENRIESRMEELRNLIVAQIRGPSQGSNVTXXXXXXXXXENT 230
           ++E     + R+TQ+Q  L+   +  MEELR L  A +   SQ S               
Sbjct: 8   LDEIERHFDDRLTQMQ--LQRNTD--MEELRTLFRANVDLSSQDSTGRN----------- 52

Query: 231 VRPPPEATPARHDTHPVRNYSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLDETPPESRVR 410
                 +  AR D      Y+TR+SKV+FP+FDGKKV+ WLYKCDQFFLLDETP  SRVR
Sbjct: 53  -----NSGGARSDNQNA--YATRISKVEFPRFDGKKVRDWLYKCDQFFLLDETPESSRVR 105

Query: 411 LASIHLEGLALQWHLNYMCGRFDQYPLWDLYSADVKRRFDDLYEDPLAELIQVKHSSTAQ 590
           LASIHL+GLALQWHLNYM  +FD YP W  Y +DV  RF + YEDPL+ L+Q+KH    Q
Sbjct: 106 LASIHLDGLALQWHLNYMRQKFDIYPSWQQYISDVTTRFGEAYEDPLSSLLQIKHVGKIQ 165

Query: 591 NYID*FELALTQMSILPEHALRIFLAGLNLTTQAHVRMFNPKNIAHVANLAKLCESAKP 767
           +YID FELALTQ++++PEH+L IFLAGL   TQ HVRMFNP +IAH ANLAKL ES+ P
Sbjct: 166 DYIDQFELALTQVNMIPEHSLSIFLAGLEHHTQMHVRMFNPTSIAHAANLAKLHESSNP 224


>gb|PNY17729.1| retrotransposon-related protein [Trifolium pratense]
          Length = 1479

 Score =  238 bits (606), Expect = 1e-67
 Identities = 130/245 (53%), Positives = 163/245 (66%), Gaps = 6/245 (2%)
 Frame = +3

Query: 51  IEENRAESNARMTQLQESLENRIESRMEELRNLIVAQIRGPSQGSNVTXXXXXXXXXENT 230
           ++E     + R+TQLQ  L+   +  MEE+R+L+ A                        
Sbjct: 9   LDELERRFDDRLTQLQ--LQRTTD--MEEIRSLLRAN----------------------- 41

Query: 231 VRPPPEATPARHDTHPVRN-----YSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLDETPP 395
             P   A+  RH +   RN     Y+TR+SKV+FP+FDGK V+ WLYKCDQFFLLDET P
Sbjct: 42  ADPGSPASTGRHGSPGARNSNQNMYATRISKVEFPRFDGKNVRDWLYKCDQFFLLDETSP 101

Query: 396 ESRVRLASIHLEGLALQWHLNYMCGRFDQYPLWDLYSADVKRRFDDLYEDPLAELIQVKH 575
            SRVRLASIHL+GLALQWHLNYM  +FD YP W  Y  DV  RF D YEDPL+ L+Q+KH
Sbjct: 102 TSRVRLASIHLDGLALQWHLNYMRQKFDIYPSWQQYITDVTARFGDAYEDPLSSLLQIKH 161

Query: 576 SS-TAQNYID*FELALTQMSILPEHALRIFLAGLNLTTQAHVRMFNPKNIAHVANLAKLC 752
           ++   Q+YID FELALTQ+S++PEH+L IFLAGL+  T+ HVRMFNP +IAH ANLAKL 
Sbjct: 162 TAGKVQDYIDQFELALTQVSLIPEHSLSIFLAGLDNNTKMHVRMFNPSSIAHAANLAKLH 221

Query: 753 ESAKP 767
           E+A+P
Sbjct: 222 EAAQP 226


>gb|PNX93486.1| retrotransposon-related protein, partial [Trifolium pratense]
          Length = 1414

 Score =  237 bits (605), Expect = 2e-67
 Identities = 127/238 (53%), Positives = 151/238 (63%)
 Frame = +3

Query: 57  ENRAESNARMTQLQESLENRIESRMEELRNLIVAQIRGPSQGSNVTXXXXXXXXXENTVR 236
           E R E   +  QLQ S +      M+E+R+L+ A+    S  SN            +   
Sbjct: 12  ERRFEDRLQQLQLQRSTD------MDEIRSLLRARDEQSSPASNGRQGSGGPRSGNSNF- 64

Query: 237 PPPEATPARHDTHPVRNYSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLDETPPESRVRLA 416
                            YSTR+SKV+FP+FDGK V+ WLYKCDQFFLLDETP  S VRLA
Sbjct: 65  -----------------YSTRISKVEFPRFDGKNVRDWLYKCDQFFLLDETPATSMVRLA 107

Query: 417 SIHLEGLALQWHLNYMCGRFDQYPLWDLYSADVKRRFDDLYEDPLAELIQVKHSSTAQNY 596
           SIHL+GLALQWHLNYM  +FD YP W  Y  DV  RF D YEDPL+ L+Q+KH    Q Y
Sbjct: 108 SIHLDGLALQWHLNYMRQKFDIYPSWQQYITDVTARFGDAYEDPLSSLLQIKHVGKIQEY 167

Query: 597 ID*FELALTQMSILPEHALRIFLAGLNLTTQAHVRMFNPKNIAHVANLAKLCESAKPL 770
           ID FELALTQ+S++PEH+L IFLAGL   TQ HVRMFNP NIAH ANLAKL ES++ +
Sbjct: 168 IDKFELALTQVSLIPEHSLSIFLAGLEYHTQMHVRMFNPTNIAHAANLAKLHESSRDI 225


>dbj|GAU37335.1| hypothetical protein TSUD_395160 [Trifolium subterraneum]
          Length = 653

 Score =  230 bits (587), Expect = 2e-67
 Identities = 120/253 (47%), Positives = 162/253 (64%), Gaps = 4/253 (1%)
 Frame = +3

Query: 24  QSSGDLDCKIEENRAESNARMTQLQESLENRIE---SRMEELRNLIVAQIRGPSQGSNVT 194
           QS  +L+ K+++  AE   ++      +  R +   S+M+ELR ++        + SN  
Sbjct: 5   QSVSELEKKVDDRHAELEKKVDDHHADVHVRFDQLASQMDELRTML-------GKSSN-- 55

Query: 195 XXXXXXXXXENTVRPPPEATPARHDT-HPVRNYSTRLSKVDFPKFDGKKVKYWLYKCDQF 371
                            + +  RH + H   +Y+TR+SKVDFP+F+GK ++ WLYKCDQF
Sbjct: 56  -------------HHDSDGSSGRHSSSHTPNSYATRISKVDFPRFNGKNIRDWLYKCDQF 102

Query: 372 FLLDETPPESRVRLASIHLEGLALQWHLNYMCGRFDQYPLWDLYSADVKRRFDDLYEDPL 551
           FLLD TP  S VRLASIHL+ LALQWHLNYM  +F+ YP+W  Y  D+  RF D +EDPL
Sbjct: 103 FLLDTTPATSMVRLASIHLDDLALQWHLNYMRQKFNIYPIWGQYVTDITARFGDAFEDPL 162

Query: 552 AELIQVKHSSTAQNYID*FELALTQMSILPEHALRIFLAGLNLTTQAHVRMFNPKNIAHV 731
           + L+QVKHS   Q+YID F+LALTQ++++PEH+L IFLAGL    Q HVRMFNP +IAH 
Sbjct: 163 SSLLQVKHSGKVQDYIDQFQLALTQVNLIPEHSLSIFLAGLEYHKQMHVRMFNPSSIAHA 222

Query: 732 ANLAKLCESAKPL 770
            NLAKL ES+K +
Sbjct: 223 VNLAKLHESSKEI 235


>gb|PNX92911.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1478

 Score =  236 bits (602), Expect = 4e-67
 Identities = 128/237 (54%), Positives = 152/237 (64%)
 Frame = +3

Query: 51  IEENRAESNARMTQLQESLENRIESRMEELRNLIVAQIRGPSQGSNVTXXXXXXXXXENT 230
           +EE     + R+ QLQ  L+  I+  MEELR L+ AQ+   S   N              
Sbjct: 9   LEELERRFDDRLNQLQ--LQRNID--MEELRTLLRAQLENKSPDPNSRQGSGGTRTGNQN 64

Query: 231 VRPPPEATPARHDTHPVRNYSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLDETPPESRVR 410
           +                  Y+TR+SKV+FPKFDGK V+ WLYKCDQFFLLDETP  SRVR
Sbjct: 65  I------------------YATRISKVEFPKFDGKNVRDWLYKCDQFFLLDETPAVSRVR 106

Query: 411 LASIHLEGLALQWHLNYMCGRFDQYPLWDLYSADVKRRFDDLYEDPLAELIQVKHSSTAQ 590
           LASIHLEGLALQWHLNYM  +FD YP W  Y  DV  RF D +EDPL+ L+Q+K     Q
Sbjct: 107 LASIHLEGLALQWHLNYMRQKFDIYPSWQQYITDVTARFGDAFEDPLSSLLQIKQVGKIQ 166

Query: 591 NYID*FELALTQMSILPEHALRIFLAGLNLTTQAHVRMFNPKNIAHVANLAKLCESA 761
           +Y+D FELALTQ++++PEH+L IFLAGL   TQ HVRMFNP  IAH ANLAKL ESA
Sbjct: 167 DYVDQFELALTQVNLIPEHSLSIFLAGLEHHTQMHVRMFNPSTIAHAANLAKLHESA 223


>gb|PNX92266.1| retrotransposon-related protein [Trifolium pratense]
          Length = 1479

 Score =  236 bits (601), Expect = 5e-67
 Identities = 125/246 (50%), Positives = 163/246 (66%)
 Frame = +3

Query: 27  SSGDLDCKIEENRAESNARMTQLQESLENRIESRMEELRNLIVAQIRGPSQGSNVTXXXX 206
           S  +L+ K+++      AR+    E L  ++ + MEELR+L+    +GPS  SN      
Sbjct: 7   SLAELEKKMDDRHDILQARL----EQLNIQVGTGMEELRSLL----QGPSAHSNNGSPGD 58

Query: 207 XXXXXENTVRPPPEATPARHDTHPVRNYSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLDE 386
                     PP +             Y+TR+SKV+FP+F+GK V+ WLYKCDQFF+LD 
Sbjct: 59  SRFGGSRRTPPPSQNL-----------YATRISKVEFPRFNGKNVRDWLYKCDQFFMLDG 107

Query: 387 TPPESRVRLASIHLEGLALQWHLNYMCGRFDQYPLWDLYSADVKRRFDDLYEDPLAELIQ 566
           TP    VRLASIHL+GLALQWHLNYM  +FD YP W  Y +DV  RF D YEDPL+ L+Q
Sbjct: 108 TPATEMVRLASIHLDGLALQWHLNYMRQKFDYYPTWQQYVSDVTTRFGDAYEDPLSALLQ 167

Query: 567 VKHSSTAQNYID*FELALTQMSILPEHALRIFLAGLNLTTQAHVRMFNPKNIAHVANLAK 746
           VKH++  Q+Y+D FELALTQ+++LPEH+L IFLAGL+  TQ HVRMF+P +IAH ANLAK
Sbjct: 168 VKHTAKVQDYVDQFELALTQVTLLPEHSLSIFLAGLDHGTQMHVRMFSPSSIAHAANLAK 227

Query: 747 LCESAK 764
           L E++K
Sbjct: 228 LHEASK 233


>gb|PNY17651.1| retrotransposon-related protein [Trifolium pratense]
          Length = 1478

 Score =  234 bits (598), Expect = 1e-66
 Identities = 124/245 (50%), Positives = 158/245 (64%)
 Frame = +3

Query: 24  QSSGDLDCKIEENRAESNARMTQLQESLENRIESRMEELRNLIVAQIRGPSQGSNVTXXX 203
           Q+  DL+ ++ E   + N + T           + M+E+R+L+ AQ+   +Q SN     
Sbjct: 6   QTLEDLEKRLNEQLQQINLQRT-----------TDMDEIRSLLRAQVEQIAQNSNGRHGA 54

Query: 204 XXXXXXENTVRPPPEATPARHDTHPVRNYSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLD 383
                  NT                   Y+TR+SKV+FP+FDGK V+ WLYKCDQFFLLD
Sbjct: 55  AARNGTSNT-------------------YATRISKVEFPRFDGKNVRDWLYKCDQFFLLD 95

Query: 384 ETPPESRVRLASIHLEGLALQWHLNYMCGRFDQYPLWDLYSADVKRRFDDLYEDPLAELI 563
           ETP  S VRLASIHL+GLALQWHLNYM  +FD YP W  Y  DV  RF D +EDPLA L+
Sbjct: 96  ETPATSMVRLASIHLDGLALQWHLNYMRQKFDIYPSWTQYVTDVTMRFGDAFEDPLATLL 155

Query: 564 QVKHSSTAQNYID*FELALTQMSILPEHALRIFLAGLNLTTQAHVRMFNPKNIAHVANLA 743
           Q++H+   ++YID FELALTQ++++PEH+L +FLAGLN  TQ HVRMFNP +IAH ANLA
Sbjct: 156 QIQHTGKVKDYIDQFELALTQVNLIPEHSLSMFLAGLNHNTQMHVRMFNPTSIAHAANLA 215

Query: 744 KLCES 758
           KL ES
Sbjct: 216 KLHES 220


>dbj|GAU12723.1| hypothetical protein TSUD_122150 [Trifolium subterraneum]
          Length = 1492

 Score =  233 bits (594), Expect = 5e-66
 Identities = 125/248 (50%), Positives = 163/248 (65%), Gaps = 1/248 (0%)
 Frame = +3

Query: 24  QSSGDLDCKIEENRAESNARMTQLQESLENRIESRMEELRNLIVAQIRGPSQGSNVTXXX 203
           +S  DL+  +++  A + +R  +L   +   ++S    LRN+++ Q      G +     
Sbjct: 8   RSISDLEKIMDDRYAYTQSRFDELSTQMAQGLDS----LRNMMLNQSNHGDNGYSGG--- 60

Query: 204 XXXXXXENTVRPPPEATPARHDTHPVRN-YSTRLSKVDFPKFDGKKVKYWLYKCDQFFLL 380
                     R    +T   H     RN Y+TR+SKV+FP+FDGK V+ WLYKCDQFFLL
Sbjct: 61  ----------RNQGTSTSEHH-----RNAYATRISKVEFPRFDGKNVRDWLYKCDQFFLL 105

Query: 381 DETPPESRVRLASIHLEGLALQWHLNYMCGRFDQYPLWDLYSADVKRRFDDLYEDPLAEL 560
           D TPP S VRLASIHL+GLALQWHLNYM  +FD YP W+ Y ADV  RF + YEDPL+ L
Sbjct: 106 DGTPPASMVRLASIHLDGLALQWHLNYMRQKFDMYPTWNQYVADVTTRFGEAYEDPLSSL 165

Query: 561 IQVKHSSTAQNYID*FELALTQMSILPEHALRIFLAGLNLTTQAHVRMFNPKNIAHVANL 740
           +Q+KH+   Q YID +ELALTQ++++PEH+L IFLAGL   TQ HVRMFNP +IAH ANL
Sbjct: 166 LQIKHAGKVQEYIDKYELALTQVNLIPEHSLSIFLAGLEHHTQMHVRMFNPTSIAHAANL 225

Query: 741 AKLCESAK 764
           AKL E++K
Sbjct: 226 AKLHEASK 233


>gb|PNX92994.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1476

 Score =  233 bits (594), Expect = 5e-66
 Identities = 125/246 (50%), Positives = 161/246 (65%)
 Frame = +3

Query: 27  SSGDLDCKIEENRAESNARMTQLQESLENRIESRMEELRNLIVAQIRGPSQGSNVTXXXX 206
           S  DL+ +I+ N AE N R+ QL  S+     + +EE+RNL+  + R  +  SN      
Sbjct: 7   SIADLEKRIDGNHAEVNTRIEQLNVSMN----AGLEEIRNLL--RDRNDAASSNHGGGHR 60

Query: 207 XXXXXENTVRPPPEATPARHDTHPVRNYSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLDE 386
                ++             + HP   YSTR+SKVDFP+FDGKK+K WLYKC+QFF LD+
Sbjct: 61  YLRNEDSN-----------RNHHP---YSTRISKVDFPRFDGKKLKEWLYKCNQFFSLDD 106

Query: 387 TPPESRVRLASIHLEGLALQWHLNYMCGRFDQYPLWDLYSADVKRRFDDLYEDPLAELIQ 566
           TP +S+VRL SIHLEG ALQWHLNYM  RFD YP W  Y  +V +RF D++EDPL+ LIQ
Sbjct: 107 TPDDSKVRLVSIHLEGPALQWHLNYMRSRFDVYPSWTEYIVEVTQRFGDVFEDPLSALIQ 166

Query: 567 VKHSSTAQNYID*FELALTQMSILPEHALRIFLAGLNLTTQAHVRMFNPKNIAHVANLAK 746
           VK + T Q YID FELALTQ+S+ PE  L IFLAGL ++TQ HVRMF+P ++ H   LAK
Sbjct: 167 VKQTGTVQEYIDAFELALTQVSLFPEQTLSIFLAGLEISTQMHVRMFHPTSVHHAGRLAK 226

Query: 747 LCESAK 764
             E++K
Sbjct: 227 FHEASK 232


>gb|PNX92532.1| retrotransposon-related protein [Trifolium pratense]
          Length = 1472

 Score =  232 bits (591), Expect = 1e-65
 Identities = 125/240 (52%), Positives = 159/240 (66%)
 Frame = +3

Query: 51  IEENRAESNARMTQLQESLENRIESRMEELRNLIVAQIRGPSQGSNVTXXXXXXXXXENT 230
           ++E     + R+TQ+Q  L+   +  M+E+R+L+ AQ      GS  +            
Sbjct: 9   LDEVERRFDERLTQMQ--LQRNTD--MDEIRSLLRAQA---DHGSPTSAGRHG------- 54

Query: 231 VRPPPEATPARHDTHPVRNYSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLDETPPESRVR 410
                 +   RH T  V  Y+TR+SKVDFP+FDGK V+ WLYKCDQFF +DETP  S VR
Sbjct: 55  ------SNGNRHGTQNV--YATRISKVDFPRFDGKNVRDWLYKCDQFFSIDETPATSMVR 106

Query: 411 LASIHLEGLALQWHLNYMCGRFDQYPLWDLYSADVKRRFDDLYEDPLAELIQVKHSSTAQ 590
           LASIHL+GLALQWHLNYM  +FD YP W  Y  DV  RF D YEDPL+ L+Q+KH+   Q
Sbjct: 107 LASIHLDGLALQWHLNYMRQKFDVYPSWQQYITDVTARFGDAYEDPLSSLLQIKHTGKIQ 166

Query: 591 NYID*FELALTQMSILPEHALRIFLAGLNLTTQAHVRMFNPKNIAHVANLAKLCESAKPL 770
           +YID FELALTQ++++PEH+L IFLAGL   TQ HVRMFNP +I+H ANLAKL E++  L
Sbjct: 167 DYIDQFELALTQVNLIPEHSLSIFLAGLEQNTQMHVRMFNPSSISHAANLAKLHEASTSL 226


>dbj|GAU10396.1| hypothetical protein TSUD_421780, partial [Trifolium subterraneum]
          Length = 154

 Score =  211 bits (536), Expect = 1e-65
 Identities = 96/151 (63%), Positives = 122/151 (80%)
 Frame = +3

Query: 276 PVRNYSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLDETPPESRVRLASIHLEGLALQWHL 455
           P ++Y+TR+SK+DFP+FDGKK+K WLYKCDQFF LD TP +SRVRLASIHLEG ALQWH+
Sbjct: 2   PQQHYATRISKIDFPRFDGKKMKEWLYKCDQFFALDATPDDSRVRLASIHLEGPALQWHV 61

Query: 456 NYMCGRFDQYPLWDLYSADVKRRFDDLYEDPLAELIQVKHSSTAQNYID*FELALTQMSI 635
           NYM  +F+ YP W  Y  DV +RF +++EDPLAELI +K + T Q+YID FELA TQ+++
Sbjct: 62  NYMKSKFNVYPSWTEYVIDVTQRFGEVFEDPLAELINIKQTGTVQDYIDAFELASTQVNL 121

Query: 636 LPEHALRIFLAGLNLTTQAHVRMFNPKNIAH 728
            PE +L IFLAGL  TTQ HVRMF+P +++H
Sbjct: 122 FPEQSLSIFLAGLENTTQMHVRMFHPTSVSH 152


>dbj|GAU24592.1| hypothetical protein TSUD_289530 [Trifolium subterraneum]
          Length = 1330

 Score =  231 bits (589), Expect = 2e-65
 Identities = 125/233 (53%), Positives = 153/233 (65%), Gaps = 4/233 (1%)
 Frame = +3

Query: 81  RMTQLQESLENRIESRMEELRNLIVAQIRGPSQGSNVTXXXXXXXXXENTVRPPPEATPA 260
           R+TQLQ+       + MEE+R+L+  Q+    QGS                     A+  
Sbjct: 19  RLTQLQQQRN----TDMEEIRSLLRVQLE---QGS--------------------PASVG 51

Query: 261 RHDTHP----VRNYSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLDETPPESRVRLASIHL 428
           R  +H     V+   TR+SKVDFP+FDGK V+ WLYKCDQFFL DETPP S VRLASIHL
Sbjct: 52  RQGSHGPRRVVQTNQTRISKVDFPRFDGKNVREWLYKCDQFFLFDETPPTSMVRLASIHL 111

Query: 429 EGLALQWHLNYMCGRFDQYPLWDLYSADVKRRFDDLYEDPLAELIQVKHSSTAQNYID*F 608
           +GL LQWHL YM  +FD YP W  Y +DV  RF D YED L+ L+ +KH+ T Q+YID F
Sbjct: 112 DGLTLQWHLTYMRQKFDIYPYWQQYVSDVTARFGDAYEDALSSLLLIKHTGTIQDYIDQF 171

Query: 609 ELALTQMSILPEHALRIFLAGLNLTTQAHVRMFNPKNIAHVANLAKLCESAKP 767
           ELALTQ+++LPEH+L IFLAGL   TQ HVRMFNP +IAH ANLAKL E++ P
Sbjct: 172 ELALTQVTLLPEHSLSIFLAGLEKHTQMHVRMFNPTSIAHAANLAKLHEASLP 224


>dbj|GAU17344.1| hypothetical protein TSUD_232200 [Trifolium subterraneum]
          Length = 222

 Score =  204 bits (520), Expect = 3e-62
 Identities = 108/227 (47%), Positives = 146/227 (64%)
 Frame = +3

Query: 33  GDLDCKIEENRAESNARMTQLQESLENRIESRMEELRNLIVAQIRGPSQGSNVTXXXXXX 212
           G+L+ K++E   E +AR+ +L           M+E+R  ++++                 
Sbjct: 9   GELNKKVDERHEEVSARLEELSVG--------MDEIRKFLLSK---------------KS 45

Query: 213 XXXENTVRPPPEATPARHDTHPVRNYSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLDETP 392
              EN+      +  A  D  P ++Y+TR+SK+DFP+FDGKK+K WLYKCDQFF LD TP
Sbjct: 46  DEDENSSHRNKSSKGACRDG-PQQHYATRISKIDFPRFDGKKMKEWLYKCDQFFALDATP 104

Query: 393 PESRVRLASIHLEGLALQWHLNYMCGRFDQYPLWDLYSADVKRRFDDLYEDPLAELIQVK 572
            +SRVRLASIHLE  ALQWH+NYM  +F+ YP W  Y  D  +RF +++EDPLAELI +K
Sbjct: 105 DDSRVRLASIHLECPALQWHVNYMKSKFNVYPSWTEYVIDDTQRFGEVFEDPLAELINIK 164

Query: 573 HSSTAQNYID*FELALTQMSILPEHALRIFLAGLNLTTQAHVRMFNP 713
            + T Q+YID FELA TQ+++ PE +L IFLAGL  TTQ HVRMF+P
Sbjct: 165 QTGTVQDYIDAFELASTQVNLFPEQSLSIFLAGLENTTQMHVRMFHP 211


>dbj|GAU40456.1| hypothetical protein TSUD_141360 [Trifolium subterraneum]
          Length = 1130

 Score =  217 bits (553), Expect = 1e-60
 Identities = 108/176 (61%), Positives = 129/176 (73%), Gaps = 1/176 (0%)
 Frame = +3

Query: 246 EATPARHDTHPVRN-YSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLDETPPESRVRLASI 422
           E   + H T   RN Y+TR+SKVDFP+F+GK +   LYKCDQFFLLD TP  S VRL SI
Sbjct: 26  ERHSSSHTTENNRNSYATRISKVDFPRFNGKNICDSLYKCDQFFLLDATPATSMVRLTSI 85

Query: 423 HLEGLALQWHLNYMCGRFDQYPLWDLYSADVKRRFDDLYEDPLAELIQVKHSSTAQNYID 602
           HL+ LALQWHLNYM  +F+ YP W  Y  D+  RF D +EDPL+   QVKHS   Q+YID
Sbjct: 86  HLDDLALQWHLNYMRQKFNIYPTWGQYVTDITARFGDAFEDPLSSFFQVKHSRKVQDYID 145

Query: 603 *FELALTQMSILPEHALRIFLAGLNLTTQAHVRMFNPKNIAHVANLAKLCESAKPL 770
            FELALTQ++++PEH+L IFLAGL   TQ HVRMFNP +IAH ANLAKL ES+K +
Sbjct: 146 QFELALTQLNLIPEHSLSIFLAGLEYHTQMHVRMFNPSSIAHAANLAKLHESSKEI 201


>gb|KYP32928.1| hypothetical protein KK1_046280 [Cajanus cajan]
          Length = 493

 Score =  202 bits (513), Expect = 6e-58
 Identities = 94/142 (66%), Positives = 114/142 (80%)
 Frame = +3

Query: 339 VKYWLYKCDQFFLLDETPPESRVRLASIHLEGLALQWHLNYMCGRFDQYPLWDLYSADVK 518
           +K WLYKCDQFF+LD TP ES+VRLASIHL+G+ALQWHLNYM  +FD YP W  Y  DV 
Sbjct: 1   MKEWLYKCDQFFMLDGTPAESKVRLASIHLDGIALQWHLNYMRNKFDIYPPWQQYVTDVT 60

Query: 519 RRFDDLYEDPLAELIQVKHSSTAQNYID*FELALTQMSILPEHALRIFLAGLNLTTQAHV 698
            RF ++Y+DPL+ LIQVK S T Q+Y+D FELALTQ+S+LPEH+L IFL  L  +TQ HV
Sbjct: 61  MRFGEIYDDPLSSLIQVKQSGTIQDYVDEFELALTQVSLLPEHSLSIFLTSLEYSTQMHV 120

Query: 699 RMFNPKNIAHVANLAKLCESAK 764
           RMFNP +IAH ANLAKL E+++
Sbjct: 121 RMFNPNSIAHAANLAKLYEASR 142


>dbj|GAU41744.1| hypothetical protein TSUD_180930 [Trifolium subterraneum]
          Length = 149

 Score =  187 bits (475), Expect = 2e-56
 Identities = 90/138 (65%), Positives = 105/138 (76%)
 Frame = +3

Query: 288 YSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLDETPPESRVRLASIHLEGLALQWHLNYMC 467
           Y+TR+SKVDFP+FDGK V  WL KCDQFF LDET   S VRLASIHL+GLALQWHLNYM 
Sbjct: 10  YATRISKVDFPRFDGKNVCDWLNKCDQFFSLDETLATSMVRLASIHLDGLALQWHLNYMR 69

Query: 468 GRFDQYPLWDLYSADVKRRFDDLYEDPLAELIQVKHSSTAQNYID*FELALTQMSILPEH 647
             FD YP W  Y ADV   F D YED ++ L+Q+KH+   Q+YID FELALTQ++++PEH
Sbjct: 70  QTFDVYPSWQQYIADVTAHFGDAYEDHMSSLLQIKHTGKIQDYIDQFELALTQVTLIPEH 129

Query: 648 ALRIFLAGLNLTTQAHVR 701
           +L IFLAGL   TQ HVR
Sbjct: 130 SLNIFLAGLEHNTQMHVR 147


>gb|PNX62764.1| hypothetical protein L195_g053153, partial [Trifolium pratense]
          Length = 245

 Score =  189 bits (479), Expect = 1e-55
 Identities = 93/174 (53%), Positives = 118/174 (67%)
 Frame = +3

Query: 243 PEATPARHDTHPVRNYSTRLSKVDFPKFDGKKVKYWLYKCDQFFLLDETPPESRVRLASI 422
           P  TP+  ++    +YSTR+SKV+FP+FDGK V+ WLYKC+QFFLLD TPP S VRLASI
Sbjct: 28  PPGTPSPENSQ--HSYSTRISKVEFPRFDGKNVRDWLYKCEQFFLLDGTPPTSMVRLASI 85

Query: 423 HLEGLALQWHLNYMCGRFDQYPLWDLYSADVKRRFDDLYEDPLAELIQVKHSSTAQNYID 602
           HL+GLALQWHLNYM  +FD YP W+ Y  DV  RF D +EDPL+ L+Q+KH    Q Y+D
Sbjct: 86  HLDGLALQWHLNYMRQKFDIYPTWNQYVTDVTTRFGDAFEDPLSSLVQIKHVGKVQEYVD 145

Query: 603 *FELALTQMSILPEHALRIFLAGLNLTTQAHVRMFNPKNIAHVANLAKLCESAK 764
            FELALT++++                    ++MFNP  IAH  NLAKL E++K
Sbjct: 146 QFELALTRVNL--------------------IQMFNPPTIAHAVNLAKLHEASK 179


Top