BLASTX nr result

ID: Astragalus22_contig00016555 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00016555
         (814 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX77860.1| retrovirus-related Pol polyprotein from transposo...   288   2e-90
dbj|GAU22921.1| hypothetical protein TSUD_326940 [Trifolium subt...   278   2e-82
gb|PNX71411.1| retrovirus-related Pol polyprotein from transposo...   267   6e-82
dbj|GAU46782.1| hypothetical protein TSUD_351810 [Trifolium subt...   276   3e-81
gb|PNX76620.1| hypothetical protein L195_g032574 [Trifolium prat...   258   6e-81
gb|KYP51705.1| hypothetical protein KK1_026473 [Cajanus cajan]        254   7e-81
gb|PNX72611.1| peptide transporter PTR2 [Trifolium pratense]          269   7e-81
gb|KYP35344.1| hypothetical protein KK1_043625 [Cajanus cajan]        249   6e-79
dbj|GAU41109.1| hypothetical protein TSUD_139780 [Trifolium subt...   251   1e-78
gb|PNX79728.1| hypothetical protein L195_g035716 [Trifolium prat...   246   9e-78
dbj|GAU43894.1| hypothetical protein TSUD_399420 [Trifolium subt...   263   3e-77
dbj|GAU45259.1| hypothetical protein TSUD_291430 [Trifolium subt...   243   5e-75
gb|PNX59756.1| retrovirus-related Pol polyprotein from transposo...   239   7e-75
gb|PNY05212.1| flavonol sulfotransferase-like protein [Trifolium...   241   4e-74
dbj|GAU49830.1| hypothetical protein TSUD_293850 [Trifolium subt...   241   5e-74
gb|PNY13856.1| hypothetical protein L195_g010524 [Trifolium prat...   241   1e-73
gb|PNX80244.1| hypothetical protein L195_g036241 [Trifolium prat...   238   2e-73
gb|PNX93130.1| retrovirus-related Pol polyprotein from transposo...   238   2e-73
gb|PNX62201.1| retrovirus-related Pol polyprotein from transposo...   234   3e-73
dbj|GAU47169.1| hypothetical protein TSUD_28920 [Trifolium subte...   252   3e-73

>gb|PNX77860.1| retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Trifolium pratense]
          Length = 581

 Score =  288 bits (737), Expect = 2e-90
 Identities = 140/259 (54%), Positives = 191/259 (73%), Gaps = 8/259 (3%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWIMNSV +SI QSIV+LENAIDVWNELKER S+GD +RISEL+ EIY+ KQG  SV+E
Sbjct: 95  HSWIMNSVEDSIAQSIVYLENAIDVWNELKERFSRGDFIRISELQVEIYSLKQGSRSVSE 154

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +F+ LKVLWEELEAY P P C CPRKCVC TGI  ++  ++L + IRFLTGLND + +++
Sbjct: 155 FFTALKVLWEELEAYLPVPVCNCPRKCVCVTGIGNARSQHDLLRAIRFLTGLNDTYDLVR 214

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQF------PQSDESKILAGNVDARNGRTKPRGGYTS 524
           SQIL+M+PLP +NKIFS+VIQ+ERQF         ++SK+L    DAR G+ + +G Y +
Sbjct: 215 SQILLMDPLPAINKIFSMVIQYERQFAPVNIGSDLEDSKVLVNASDARRGQGRGKGSYGN 274

Query: 525 GYNSRNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSNAASADTNEAPSDKEE-EKTV 701
           GY S+   +VC++CGK  H VD CYKKH  PP + RN++  +    + AP++ E+   T 
Sbjct: 275 GYGSK--KRVCTYCGKDNHIVDNCYKKHGFPPGFGRNNATNSVNTEDSAPANNEDVGNTK 332

Query: 702 QMQT-GITQEKYDKLINML 755
            +++ G+T+ +Y+KL+N+L
Sbjct: 333 DIESFGLTKAQYEKLVNLL 351


>dbj|GAU22921.1| hypothetical protein TSUD_326940 [Trifolium subterraneum]
          Length = 1122

 Score =  278 bits (710), Expect = 2e-82
 Identities = 142/269 (52%), Positives = 189/269 (70%), Gaps = 18/269 (6%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWIMNSV ESI +SIV+L+NAIDVWNELKER S+GD +RISEL+ EIY+ KQG  +V+E
Sbjct: 96  HSWIMNSVEESIAKSIVYLDNAIDVWNELKERFSRGDFIRISELQVEIYSLKQGSRTVSE 155

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +F+ LK+LWEELEAY P P C CP KC+C TG+  ++  + L  VIRFLTGLND F +++
Sbjct: 156 FFTALKILWEELEAYLPVPVCNCPHKCMCATGVGNARHQHSLLHVIRFLTGLNDTFDLVR 215

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQFPQ------SDESKILAGNVDARNGRTKPRG---- 512
           SQIL+M+PLP +NKIFS+VIQHERQF         DESK +    D+R    + RG    
Sbjct: 216 SQILLMDPLPSINKIFSMVIQHERQFVAINGDLLVDESKAIVNASDSRRSYGRGRGYSSS 275

Query: 513 -GYTSGYNSRNGSK--VCSFCGKTGHTVDTCYKKHEVPPHWQRNS--SNAAS---ADTNE 668
            G  SG+++ +GSK  +C+FCGK  H VD CY+K+  PPH+ RN+  SN      A+ N+
Sbjct: 276 HGRGSGFSTNSGSKKRICTFCGKDNHIVDNCYRKYGFPPHYGRNAEVSNVDCEDIAENND 335

Query: 669 APSDKEEEKTVQMQTGITQEKYDKLINML 755
           A S K  EK  +   G+T+ +Y++L+N+L
Sbjct: 336 AHSLKSTEKGTE-SFGLTKAQYERLVNLL 363


>gb|PNX71411.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Trifolium pratense]
          Length = 629

 Score =  267 bits (683), Expect = 6e-82
 Identities = 133/258 (51%), Positives = 178/258 (68%), Gaps = 8/258 (3%)
 Frame = +3

Query: 6   SWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTEY 185
           SWIMNSV ESI QSIV+L+NAIDVWNELKER S+GD +RISEL+ EI   KQ   SV+E+
Sbjct: 40  SWIMNSVEESIAQSIVYLDNAIDVWNELKERFSRGDFIRISELQVEINGLKQDSRSVSEF 99

Query: 186 FSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIKS 365
           F+ LKVLWEELEAY P P C CPRKCVC TG+  ++  ++L + IRFLTGLND F +++S
Sbjct: 100 FTALKVLWEELEAYLPVPVCNCPRKCVCVTGVGNARSQHDLLRAIRFLTGLNDTFDLVRS 159

Query: 366 QILIMNPLPKLNKIFSLVIQHERQF------PQSDESKILAGNVDARNGRTKPRGGYTSG 527
           QI +M+PLP +NKIFS+VIQ+ERQF         D+SK+L    D R  + + +G Y +G
Sbjct: 160 QISLMDPLPAINKIFSMVIQYERQFAPVNIGSDLDDSKVLVNASDTRRSQGRGKGSYGNG 219

Query: 528 YNSRNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSNAASADTNEAPSDKEEEKTVQ- 704
           Y S+   +VC++CGK  H VD  YKKH  PP + RN+S       + AP + E+    + 
Sbjct: 220 YGSK--KRVCTYCGKDNHIVDNYYKKHGFPPSYGRNNSTNNVNTEDSAPVNNEDIGNTKD 277

Query: 705 -MQTGITQEKYDKLINML 755
               G+T+ +++KL+N+L
Sbjct: 278 NESFGLTKAQHEKLVNLL 295


>dbj|GAU46782.1| hypothetical protein TSUD_351810 [Trifolium subterraneum]
          Length = 1512

 Score =  276 bits (706), Expect = 3e-81
 Identities = 136/264 (51%), Positives = 181/264 (68%), Gaps = 13/264 (4%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWIMNSV ESI QSIVFL+NA+DVW ELKER S GD +RISEL+ EIY  KQG  SV+E
Sbjct: 95  HSWIMNSVEESIAQSIVFLDNALDVWIELKERFSHGDFIRISELQVEIYGLKQGNRSVSE 154

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +F+ L++LWEE E Y P P C CPRKCVC TG+S ++  ++L + IRFLTGLNDNF M++
Sbjct: 155 FFTALRILWEEFEIYLPAPVCNCPRKCVCVTGVSNARTQHDLLRTIRFLTGLNDNFDMVR 214

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQFP------QSDESKILAGNVDARNGRTKPRGGYTS 524
           SQIL+M+PLP +NK+FS+VIQHERQF         ++SK+     D+R  + + R G+ S
Sbjct: 215 SQILLMDPLPPINKVFSMVIQHERQFTPLQAVLDVEDSKVSVNASDSRRSQGRGRSGFNS 274

Query: 525 GYNS------RNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSNAASADTNEAPSDKE 686
            YNS       N  KVC++CGK  H V+ CYKKH  PPH+ R S+ A +A+  E   + +
Sbjct: 275 QYNSGFNPQYNNKKKVCTYCGKENHVVENCYKKHGFPPHYGRGST-ANNANAGELMDNDD 333

Query: 687 EEKTVQMQT-GITQEKYDKLINML 755
              T    +   T+ +Y++L+N+L
Sbjct: 334 ARSTRGSDSFSFTKAQYEQLVNLL 357


>gb|PNX76620.1| hypothetical protein L195_g032574 [Trifolium pratense]
          Length = 398

 Score =  258 bits (659), Expect = 6e-81
 Identities = 138/262 (52%), Positives = 186/262 (70%), Gaps = 11/262 (4%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWI+NSV+ESI QSIVF+ENAIDVWN+LKER SQGDLVRI+EL+ EIY+ +Q   SVTE
Sbjct: 89  HSWILNSVSESIAQSIVFMENAIDVWNDLKERFSQGDLVRIAELQQEIYSLRQDSRSVTE 148

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +FS LK+LWEELE Y P P C C  KC C   +  ++ +++L  VIRFLTGLND+F M+K
Sbjct: 149 FFSALKILWEELELYLPIPTCTCRVKCNC-DAMRRARANHQLMYVIRFLTGLNDHFDMVK 207

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNS 536
           SQIL+++PLP LNKIFS+VIQHERQ  F  S+ SK L   ++A N R  P G  +S  NS
Sbjct: 208 SQILLLDPLPSLNKIFSMVIQHERQGNFTPSEHSKAL---INAANFR--PPGSTSSSKNS 262

Query: 537 RN----GSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNS-SNAASADTNEAPSD----KEE 689
           R+    G +VC+FCGK  H +D CY+KH +PPH Q+ S ++ A+A+ N+  S+     E 
Sbjct: 263 RSNSSTGKRVCTFCGKDNHIIDNCYQKHGLPPHLQKKSQAHNAAAEGNDCDSNSIAASEP 322

Query: 690 EKTVQMQTGITQEKYDKLINML 755
           +        +TQ+++++LI ++
Sbjct: 323 QAASSSSAPMTQDQWERLIALI 344


>gb|KYP51705.1| hypothetical protein KK1_026473 [Cajanus cajan]
          Length = 278

 Score =  254 bits (648), Expect = 7e-81
 Identities = 126/232 (54%), Positives = 171/232 (73%), Gaps = 3/232 (1%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWI+NSV +SIGQSI+FLEN +DVWN+LKER SQGDL+RISEL+ EIY  KQG L VTE
Sbjct: 52  HSWIVNSVVKSIGQSIIFLENVVDVWNDLKERFSQGDLIRISELQQEIYGIKQGSLFVTE 111

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           ++SELK+LWEELE Y P P C CP KC C   +  ++Q + L+  IRFLTGLN+NF+++K
Sbjct: 112 FYSELKILWEELETYMPIPCCACPVKCTC-VAMRNARQFHTLNHFIRFLTGLNENFSVVK 170

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNS 536
           SQIL+M+ +P +N+IF +VIQHERQ  F  +DESK L   +D +  R++ RG    G+  
Sbjct: 171 SQILLMDLVPSMNQIFYMVIQHERQGNFIVNDESKALINAIDYK--RSQGRG---KGFAQ 225

Query: 537 RNG-SKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSNAASADTNEAPSDKEE 689
            +G  K+C++ GKTGHT++TCY+KH  PPH+Q+ +S+  +   +E    KE+
Sbjct: 226 NSGPKKICTYYGKTGHTIETCYRKHGFPPHFQKGNSSMVNNACSETTDLKED 277


>gb|PNX72611.1| peptide transporter PTR2 [Trifolium pratense]
          Length = 845

 Score =  269 bits (688), Expect = 7e-81
 Identities = 132/259 (50%), Positives = 186/259 (71%), Gaps = 8/259 (3%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWIMN V ESI QSI++LENAIDVWNELKER S GD +RISEL+ EI+A KQG  SV+E
Sbjct: 4   HSWIMNFVEESIAQSIIYLENAIDVWNELKERFSHGDFIRISELQIEIHALKQGNRSVSE 63

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +F+ LK+LWEELEAY P P C CPRKCVC TGIS  +  ++L + IRFLTGLNDNF M++
Sbjct: 64  FFTALKILWEELEAYLPTPVCNCPRKCVCATGISNVKTQHDLLRKIRFLTGLNDNFDMVR 123

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQF-PQS-----DESKILAGNVDARNGRTKPRGGYTS 524
           SQIL+M+PLP +NK+FS ++QHERQF P +     ++SK+L    D R  + + +GG+ +
Sbjct: 124 SQILLMDPLPPINKVFSSILQHERQFVPHNAGLDVEDSKVLVNASDNRRSQGRGKGGF-N 182

Query: 525 GYNSRNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSNAASADTNEAPSDKEEEKTVQ 704
           G +     K C++CGK  H ++ C+KKH  PP++ RN+++A    T+++  D ++ K+++
Sbjct: 183 GQSGPFKKKYCTYCGKDNHVIENCFKKHGFPPNFGRNNASANHFGTDDS-MDNDDIKSLK 241

Query: 705 MQT--GITQEKYDKLINML 755
                  T+ +Y+ L+N+L
Sbjct: 242 ASEPFTFTKSQYEHLVNLL 260


>gb|KYP35344.1| hypothetical protein KK1_043625 [Cajanus cajan]
          Length = 287

 Score =  249 bits (636), Expect = 6e-79
 Identities = 125/201 (62%), Positives = 157/201 (78%), Gaps = 3/201 (1%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWI+NSV ESIGQSI+FLENA+DVWN+LKER SQGDL RISEL+ EIY  KQG LSVTE
Sbjct: 93  HSWIVNSVVESIGQSIIFLENAVDVWNDLKERFSQGDLTRISELQQEIYGLKQGSLSVTE 152

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           ++SELK+LWEELE Y P P C CP KC C   +  ++Q + L+ VIRFLTGLN+NF+++K
Sbjct: 153 FYSELKILWEELETYMPIPSCACPVKCTC-AAMRNARQFHTLNHVIRFLTGLNENFSVVK 211

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNS 536
           SQIL+M+PLP +N+IFS+VIQHERQ  F  +DESK L   VD +  R++ RG    G+  
Sbjct: 212 SQILLMDPLPSMNRIFSMVIQHERQGNFIFNDESKALINAVDYK--RSQGRG---KGFAQ 266

Query: 537 RNG-SKVCSFCGKTGHTVDTC 596
            +G  K+C++CGKTGHTV+TC
Sbjct: 267 NSGPKKICTYCGKTGHTVETC 287


>dbj|GAU41109.1| hypothetical protein TSUD_139780 [Trifolium subterraneum]
          Length = 356

 Score =  251 bits (640), Expect = 1e-78
 Identities = 131/256 (51%), Positives = 175/256 (68%), Gaps = 5/256 (1%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWI+NSV+ESI QSIVF+E+A+D WN+LK+R SQGDLVRISEL  EIYA KQ    VTE
Sbjct: 89  HSWILNSVSESIAQSIVFIEHAVDAWNDLKDRFSQGDLVRISELMQEIYAFKQDSKFVTE 148

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +FSE KVLWEELE Y P P C C  +C C + +     H  L   IRFLTGLN+NF M+K
Sbjct: 149 FFSEFKVLWEELEIYMPIPNCVCRSRCSCDSMLKARSNH-ALLHAIRFLTGLNENFGMVK 207

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNS 536
           SQIL+++PLP ++KIFS+V+Q ERQ  F   DESK+L   VD++        G++    S
Sbjct: 208 SQILLLDPLPPMSKIFSMVLQFERQSGFGLHDESKVLVNVVDSKKPSYFASKGHSQPSTS 267

Query: 537 RNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQ---RNSSNAASADTNEAPSDKEEEKTVQM 707
           + G++ C++C KT HTV+ C+KKH  PPH Q   R +S+ A +D     S++ E  +   
Sbjct: 268 K-GNRFCTYCHKTNHTVNECFKKHGFPPHMQKSNRTNSSQAGSDNVHNASERGESSSANS 326

Query: 708 QTGITQEKYDKLINML 755
           Q+ ITQ++Y++L+ ML
Sbjct: 327 QS-ITQDQYEQLMTML 341


>gb|PNX79728.1| hypothetical protein L195_g035716 [Trifolium pratense]
          Length = 272

 Score =  246 bits (627), Expect = 9e-78
 Identities = 126/255 (49%), Positives = 173/255 (67%), Gaps = 8/255 (3%)
 Frame = +3

Query: 15  MNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTEYFSE 194
           MNSV ES+ QSIVFL+NA+DVW ELKER S  D +RISEL+ EIY+ KQG  SV E+F+ 
Sbjct: 1   MNSVEESVAQSIVFLDNALDVWTELKERFSYCDFIRISELQVEIYSLKQGNPSVYEFFTA 60

Query: 195 LKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIKSQIL 374
           LKVLW+ELEAY P P C CPRKC+C TG+  ++  ++L + IRFLTGLNDNF  ++SQ+L
Sbjct: 61  LKVLWKELEAYLPAPVCNCPRKCMCVTGVRKARIQHDLLETIRFLTGLNDNFDTVRSQVL 120

Query: 375 IMNPLPKLNKIFSLVIQHERQFPQS------DESKILAGNVDARNGRTKPRGGYTSGYNS 536
           +M PLP +NK+FS+VIQ+ERQF  +      ++SK+     D+R    +P G   S +N 
Sbjct: 121 LMGPLPPINKVFSMVIQYERQFVATHAGLDIEDSKVSINASDSR----RPLGCGRSSFNP 176

Query: 537 R-NGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRN-SSNAASADTNEAPSDKEEEKTVQMQ 710
           + N  K C++CGK  H V+ CYKKH  PP++ RN ++N  +A+ +    D    K     
Sbjct: 177 QFNKKKYCTYCGKDNHVVENCYKKHGFPPNFGRNINANNVNAEDSMDNDDARSTKGTDSF 236

Query: 711 TGITQEKYDKLINML 755
           T  T+ +Y+KL+N+L
Sbjct: 237 T-FTKSQYEKLVNLL 250


>dbj|GAU43894.1| hypothetical protein TSUD_399420 [Trifolium subterraneum]
          Length = 1098

 Score =  263 bits (672), Expect = 3e-77
 Identities = 128/268 (47%), Positives = 188/268 (70%), Gaps = 17/268 (6%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HS IMNSV ESI QSI FL+N +DVWNELKER SQGD +RISEL+CEI+  KQ   SV+E
Sbjct: 119 HSGIMNSVDESIAQSIAFLDNVVDVWNELKERFSQGDYIRISELQCEIFGMKQESRSVSE 178

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +F+ LK+LWEEL++Y P P C C  +C+C TG+S ++  +++ + IRFLTGLN+NF  ++
Sbjct: 179 FFTALKILWEELDSYLPAPVCSCLMRCICNTGVSNAKHQHKIMRSIRFLTGLNENFDPVR 238

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQFPQS--DESKILAGNVDAR------NGRTKPRGGY 518
           +QIL+MNPLP +N+IFS+V+QHERQ+  +  D+SK+L  + DAR      +G +  +G  
Sbjct: 239 AQILLMNPLPTINRIFSMVLQHERQYNSTHFDDSKVLVNSHDARKPKGRCHGSSSSQGNR 298

Query: 519 TSGYNSRN---GSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNS-SNAASADTNEAPSDKE 686
           ++ Y + N    +K CS+CGKT H V+ CY+KH  PPH+ RNS +N AS +  +   + +
Sbjct: 299 SNSYGANNYGAKNKECSYCGKTNHIVENCYRKHGFPPHYGRNSHANNASLEHVDERENMD 358

Query: 687 EEKTVQ-----MQTGITQEKYDKLINML 755
           + K+V+        G T+E+Y++L+ ++
Sbjct: 359 DNKSVRGNNNNTDFGFTKEQYNQLMTLI 386


>dbj|GAU45259.1| hypothetical protein TSUD_291430 [Trifolium subterraneum]
          Length = 387

 Score =  243 bits (619), Expect = 5e-75
 Identities = 134/261 (51%), Positives = 173/261 (66%), Gaps = 10/261 (3%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HS I+NSV+ESI QSIVF+EN IDVWN+LKE+ SQGDLVRI+EL+ EIY+ +Q   SVTE
Sbjct: 89  HSLILNSVSESIAQSIVFMENVIDVWNDLKEQFSQGDLVRIAELQQEIYSLRQESRSVTE 148

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +FS LK+LWEELE Y P P C C  KC C    S    H  L  VIRFLTGLN++F ++K
Sbjct: 149 FFSALKILWEELELYLPIPMCTCRVKCNCEAMRSARNNH-NLMYVIRFLTGLNEHFDVVK 207

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYT--SGY 530
           SQIL+M+PLP LNKIFS+VIQHERQ  F  S++S+ L   ++A N  +K  G     S Y
Sbjct: 208 SQILLMDPLPTLNKIFSMVIQHERQGNFTPSEDSQAL---INAANSNSKGYGSKNPKSSY 264

Query: 531 NSRNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQR----NSSNAA--SADTNEAPSDKEEE 692
            S +  +VC+FCGK  H VD CYKKH +PPH Q+     + NAA      N  P      
Sbjct: 265 ASSSVKRVCTFCGKDNHIVDNCYKKHGLPPHLQKRVQSQAHNAAIDGGKCNTDPIPASNS 324

Query: 693 KTVQMQTGITQEKYDKLINML 755
           ++    T +TQ ++++LI ++
Sbjct: 325 QSASGSTPMTQAQWERLIALV 345


>gb|PNX59756.1| retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Trifolium pratense]
          Length = 309

 Score =  239 bits (611), Expect = 7e-75
 Identities = 123/252 (48%), Positives = 167/252 (66%), Gaps = 2/252 (0%)
 Frame = +3

Query: 6   SWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTEY 185
           SWI+NS++ SI QS+VF+ENAID+WN+L+ER SQGDL+RISEL+ EIY+ KQ   SVT++
Sbjct: 41  SWILNSISPSIAQSVVFMENAIDIWNDLRERFSQGDLIRISELQQEIYSLKQDNRSVTDF 100

Query: 186 FSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIKS 365
           FSELK LWEELE Y P P C C ++C C    S  + H  L   +RFLTGLN+NF+ ++S
Sbjct: 101 FSELKTLWEELELYLPIPSCTCRQRCACEAMRSARKNHL-LLHTVRFLTGLNENFSTVRS 159

Query: 366 QILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNSR 539
           QILIM PLP +NK+FSLVIQHERQ  F + D+SKIL     +     KP        +S+
Sbjct: 160 QILIMEPLPPINKVFSLVIQHERQGNFAEVDDSKILVNAAKS----AKPSS------SSK 209

Query: 540 NGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSNAASADTNEAPSDKEEEKTVQMQTGI 719
           + ++ CS+CGK  H V+ C+KK+ VPPH ++ S    SA    A     +         +
Sbjct: 210 SSTRNCSYCGKDNHVVENCFKKNGVPPHMKKFS----SAHNVAAEGGSVDSNVASTPPSL 265

Query: 720 TQEKYDKLINML 755
           +Q++YDKL+ +L
Sbjct: 266 SQDQYDKLMTLL 277


>gb|PNY05212.1| flavonol sulfotransferase-like protein [Trifolium pratense]
          Length = 417

 Score =  241 bits (615), Expect = 4e-74
 Identities = 131/253 (51%), Positives = 167/253 (66%), Gaps = 2/253 (0%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWI+NSV+ESI QSI+F+ENAIDVWN+LK R SQGDLVRISEL+ EIY+ +Q   SVTE
Sbjct: 161 HSWILNSVSESIAQSIMFMENAIDVWNDLKGRFSQGDLVRISELQQEIYSLRQESRSVTE 220

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           +FS LKVLWEE E Y P P C C  KC C    S    H  L  VIRFLTGLND+F ++K
Sbjct: 221 FFSALKVLWEEFEIYLPIPMCTCRVKCSCEAMRSAHNNH-NLMYVIRFLTGLNDHFDVVK 279

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNS 536
           SQILIM+PLP L KIFS++IQHERQ  F  S++SK L   ++A N +T     + S Y S
Sbjct: 280 SQILIMDPLPPLYKIFSMLIQHERQGNFAPSEDSKAL---INAANSKTSGSKNFKSSYGS 336

Query: 537 RNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSNAASADTNEAPSDKEEEKTVQMQTG 716
            +  +VC+FCGK  H +D CYKKH    +W R+    + A          E KT      
Sbjct: 337 SSVKRVCTFCGKDNHIIDNCYKKHGYSCNWGRDCDGDSVA--------ASEPKTAG-SAP 387

Query: 717 ITQEKYDKLINML 755
           +TQ+++++LI ++
Sbjct: 388 MTQDQWERLIALI 400


>dbj|GAU49830.1| hypothetical protein TSUD_293850 [Trifolium subterraneum]
          Length = 410

 Score =  241 bits (614), Expect = 5e-74
 Identities = 125/253 (49%), Positives = 170/253 (67%), Gaps = 2/253 (0%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWIMNSV+ESI QSIVF+ENAIDVWN+LKER SQ DL+RI+EL+ E++A +Q   SVTE
Sbjct: 86  HSWIMNSVSESIAQSIVFMENAIDVWNDLKERFSQADLIRIAELQQELHALQQDSRSVTE 145

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           ++S+LK++WEELE Y P P C C  +C C    S    H  L  +IRFLTGLN++FA++K
Sbjct: 146 FYSDLKLIWEELEIYLPMPNCSCRNRCTCEAMRSARANH-ALLYIIRFLTGLNEHFAVVK 204

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNS 536
           SQIL+M+PLP +NK+FSLV+QH+RQ  F  S++SK L           K +G +     S
Sbjct: 205 SQILLMDPLPPMNKVFSLVLQHQRQSNFSPSEDSKALL-------NAAKSKGSFP----S 253

Query: 537 RNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSNAASADTNEAPSDKEEEKTVQMQTG 716
           +N  ++C+FCGK  H V  C+KK+ +PPH+++NS     A+  E     EE+        
Sbjct: 254 KNPVRICTFCGKDNHIVANCFKKYGLPPHFRKNS----QANNAEIEGGNEEQIAADNSNI 309

Query: 717 ITQEKYDKLINML 755
           ITQE+  +LI +L
Sbjct: 310 ITQEQALQLITLL 322


>gb|PNY13856.1| hypothetical protein L195_g010524 [Trifolium pratense]
          Length = 448

 Score =  241 bits (614), Expect = 1e-73
 Identities = 125/256 (48%), Positives = 174/256 (67%), Gaps = 5/256 (1%)
 Frame = +3

Query: 3   HSWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTE 182
           HSWIMNSV+ESI QSIVF+ENAIDVWN+LKER SQ DL+RI+EL+ E++A KQ   +V E
Sbjct: 86  HSWIMNSVSESIAQSIVFIENAIDVWNDLKERFSQADLIRIAELQQELHALKQDSHTVNE 145

Query: 183 YFSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIK 362
           ++S+LK++WEELE Y P P C C   C C    S    H  L  VI FLTGLN++F+++K
Sbjct: 146 FYSDLKLIWEELEIYLPMPNCSCRNCCTCEAMRSARANH-TLLYVICFLTGLNEHFSVVK 204

Query: 363 SQILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNS 536
           SQIL+M+PLP + K+ SLV+QHERQ  F  SD+S++L     +R              +S
Sbjct: 205 SQILLMDPLPPMTKVVSLVLQHERQSHFSTSDDSRVLLNAAKSRGSS-----------SS 253

Query: 537 RNGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSS-NAASADTNEAPSDKEEEKTVQMQT 713
           R+G++VC+FCGK  H VD C+KKH +PPH+++NS  N A+ +      +  E    ++Q+
Sbjct: 254 RSGNRVCTFCGKDNHIVDNCFKKHGLPPHFRKNSQVNNAAIEGGIEDHNASEVTNAELQS 313

Query: 714 G--ITQEKYDKLINML 755
           G  ITQ++  +LI++L
Sbjct: 314 GPPITQDQALQLISLL 329


>gb|PNX80244.1| hypothetical protein L195_g036241 [Trifolium pratense]
          Length = 362

 Score =  238 bits (606), Expect = 2e-73
 Identities = 126/260 (48%), Positives = 173/260 (66%), Gaps = 10/260 (3%)
 Frame = +3

Query: 6   SWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTEY 185
           +WI++SV+ SI QS+VF+ENAID+WN+L+ER SQGDL+RISEL+ E YA KQ   SVT++
Sbjct: 86  AWILSSVSPSIAQSVVFMENAIDIWNDLRERFSQGDLIRISELQQEAYALKQDSKSVTDF 145

Query: 186 FSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIKS 365
           +++LKV+WEELE Y P P C CPR+C C    S  + H  L   IRFLTGLN NF+ +KS
Sbjct: 146 YTDLKVIWEELELYLPIPSCTCPRRCTCEAMRSARRNH-SLLHTIRFLTGLNANFSTVKS 204

Query: 366 QILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNSR 539
           QILIM+PLP +NK+FSLV+QHERQ    +SD+S IL     +    T    GY     S 
Sbjct: 205 QILIMDPLPPINKVFSLVLQHERQGISHESDDSTILVNAARS----TPSSSGYKQSTQSS 260

Query: 540 NGSK---VCSFCGKTGHTVDTCYKKHEVPPHWQR--NSSNAASAD---TNEAPSDKEEEK 695
           +GSK    C++CG   H V+ C+KK+ VPPH ++  +++NAAS +    N A +      
Sbjct: 261 SGSKPPRKCTYCGMNNHFVENCFKKNGVPPHMKKFASANNAASEEGITNNNAATSSTNSP 320

Query: 696 TVQMQTGITQEKYDKLINML 755
                  I+Q++YDKL+++L
Sbjct: 321 AA--SPSISQDQYDKLMSLL 338


>gb|PNX93130.1| retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Trifolium pratense]
          Length = 369

 Score =  238 bits (606), Expect = 2e-73
 Identities = 126/260 (48%), Positives = 173/260 (66%), Gaps = 10/260 (3%)
 Frame = +3

Query: 6   SWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTEY 185
           +WI++SV+ SI QS+VF+ENAID+WN+L+ER SQGDL+RISEL+ E YA KQ   SVT++
Sbjct: 86  AWILSSVSPSIAQSVVFMENAIDIWNDLRERFSQGDLIRISELQQEAYALKQDSKSVTDF 145

Query: 186 FSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIKS 365
           +++LKV+WEELE Y P P C CPR+C C    S  + H  L   IRFLTGLN NF+ +KS
Sbjct: 146 YTDLKVIWEELELYLPIPSCTCPRRCTCEAMRSARRNH-SLLHTIRFLTGLNANFSTVKS 204

Query: 366 QILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNSR 539
           QILIM+PLP +NK+FSLV+QHERQ    +SD+S IL     +    T    GY     S 
Sbjct: 205 QILIMDPLPPINKVFSLVLQHERQGISHESDDSTILVNAARS----TPSSSGYKQSTQSS 260

Query: 540 NGSK---VCSFCGKTGHTVDTCYKKHEVPPHWQR--NSSNAASAD---TNEAPSDKEEEK 695
           +GSK    C++CG   H V+ C+KK+ VPPH ++  +++NAAS +    N A +      
Sbjct: 261 SGSKPPRKCTYCGMNNHFVENCFKKNGVPPHMKKFASANNAASEEGITNNNAATSSTNSP 320

Query: 696 TVQMQTGITQEKYDKLINML 755
                  I+Q++YDKL+++L
Sbjct: 321 AA--SPSISQDQYDKLMSLL 338


>gb|PNX62201.1| retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Trifolium pratense]
          Length = 268

 Score =  234 bits (597), Expect = 3e-73
 Identities = 120/236 (50%), Positives = 162/236 (68%), Gaps = 7/236 (2%)
 Frame = +3

Query: 63  NAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTEYFSELKVLWEELEAYWPEPK 242
           NAIDVWNELKER S GD +RISEL+ EI+  KQG  SV+E+F+ LK LWEELEAY P P 
Sbjct: 35  NAIDVWNELKERFSHGDFIRISELQIEIHRLKQGNRSVSEFFTVLKTLWEELEAYLPTPV 94

Query: 243 CGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIKSQILIMNPLPKLNKIFSLVI 422
           C CPRKCVC TGI  ++  ++L + IRFLTGLND+F M++SQIL+M+PLP +NK+FS+VI
Sbjct: 95  CNCPRKCVCATGIINARSQHDLLRKIRFLTGLNDSFDMVRSQILLMDPLPLMNKVFSMVI 154

Query: 423 QHERQFP------QSDESKILAGNVDARNGRTKPRGGYTSGYNSRNGSKVCSFCGKTGHT 584
           QHERQF        +++SKI     D+R  + + RGG+  G  S +  K C+FCGK  H 
Sbjct: 155 QHERQFVPHITGLDTEDSKISINASDSRRSQGRGRGGF-HGQFSSSKKKYCTFCGKDSHV 213

Query: 585 VDTCYKKHEVPPHWQRNSS-NAASADTNEAPSDKEEEKTVQMQTGITQEKYDKLIN 749
           V+  YKKH  PP++ RN+S N A+A+ +    D +  K  +  T  T+ +YD +++
Sbjct: 214 VENFYKKHGFPPNYGRNTSGNNANAEDSLDTDDSKSTKGNEAFT-FTKSRYDNILS 268


>dbj|GAU47169.1| hypothetical protein TSUD_28920 [Trifolium subterraneum]
          Length = 1086

 Score =  252 bits (643), Expect = 3e-73
 Identities = 134/258 (51%), Positives = 176/258 (68%), Gaps = 8/258 (3%)
 Frame = +3

Query: 6   SWIMNSVAESIGQSIVFLENAIDVWNELKERLSQGDLVRISELECEIYASKQGPLSVTEY 185
           SWIMN V+ESI QSIVF+ENA+D WN+LK+R SQGDLVRISEL  EIYA +Q   SVTE+
Sbjct: 98  SWIMNFVSESIAQSIVFMENAMDAWNDLKDRFSQGDLVRISELMQEIYALQQDSKSVTEF 157

Query: 186 FSELKVLWEELEAYWPEPKCGCPRKCVCTTGISLSQQHYELSKVIRFLTGLNDNFAMIKS 365
           +S+LK+LWEELE Y P P C C  +C C   IS    H  L   IRFLTGLNDNFAM+KS
Sbjct: 158 YSDLKILWEELEIYMPIPNCTCRSRCNCEAMISARSNH-TLLYAIRFLTGLNDNFAMVKS 216

Query: 366 QILIMNPLPKLNKIFSLVIQHERQ--FPQSDESKILAGNVDARNGRTKPRGGYTSGYNSR 539
           QIL+++PLP + K+FS+V+Q ERQ  F  S+ESK+L   VD++   + P    +S   + 
Sbjct: 217 QILLLDPLPSMTKMFSMVLQFERQRNFGTSEESKVLVNAVDSKK-PSYPNSRGSSQPATS 275

Query: 540 NGSKVCSFCGKTGHTVDTCYKKHEVPPHWQRNSSN----AASADTNEAPSDKEE--EKTV 701
            GSK C++C +T HTV+ C+KKH  PPH QRN SN     AS ++NEA S   +  + + 
Sbjct: 276 KGSKFCTYCHRTNHTVNDCFKKHGYPPHMQRNHSNRAAYMASGESNEANSAASDHGQSSQ 335

Query: 702 QMQTGITQEKYDKLINML 755
                IT ++Y +L+++L
Sbjct: 336 AATPSITPDQYQQLMSLL 353
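

Note on reading the hit summary: each line in the "Sequences producing
significant alignments" table above carries a database tag, an accession,
a truncated description, a bit score and an E-value. The sketch below is
one possible way to pull those fields out of such a line. It is not part
of BLAST; the names Hit and parse_hit_line are hypothetical, and the
regular expression assumes the "db|accession| description  bits  evalue"
layout seen in this report.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the hit summary table (hypothetical container)."""
    accession: str
    description: str
    bits: float
    evalue: float


# Assumed layout: "<db>|<accession>| <description>   <bits>   <evalue>"
_HIT_LINE = re.compile(
    r"^(?P<db>[a-z]+)\|(?P<acc>[^|\s]+)\|\s*"
    r"(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>\d*\.?\d*e[-+]?\d+|\d+(?:\.\d+)?)\s*$"
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for a summary-table line, or None for any other line."""
    match = _HIT_LINE.match(line.strip())
    if match is None:
        return None
    evalue_text = match.group("evalue")
    # Legacy reports may print very small E-values without a mantissa
    # (for example "e-100"); prefix "1" so float() accepts the value.
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    return Hit(
        accession=match.group("acc"),
        description=match.group("desc"),
        bits=float(match.group("bits")),
        evalue=float(evalue_text),
    )
```

Calling parse_hit_line on every line of the report and keeping the
non-None results would give the 20 hits listed in the table; alignment
lines (Query:/Sbjct:) and the ">" definition lines do not match the
pattern and are skipped.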

