BLASTX nr result

ID: Astragalus23_contig00016220 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00016220
         (843 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX99332.1| retrotransposon-related protein [Trifolium pratense]   276   6e-82
dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subt...   267   8e-78
gb|PNY16671.1| retrotransposon-related protein, partial [Trifoli...   264   6e-77
ref|XP_014634047.1| PREDICTED: uncharacterized protein LOC106799...   254   7e-77
dbj|GAU11620.1| hypothetical protein TSUD_346120 [Trifolium subt...   263   2e-76
gb|KYP53387.1| Retrotransposon-derived protein PEG10 [Cajanus ca...   247   4e-76
gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifo...   259   3e-75
gb|KOM58233.1| hypothetical protein LR48_Vigan11g126700 [Vigna a...   245   9e-75
gb|PNX94483.1| retrotransposon-related protein, partial [Trifoli...   258   1e-74
gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   258   1e-74
dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subt...   257   2e-74
dbj|GAU25507.1| hypothetical protein TSUD_279910 [Trifolium subt...   256   3e-74
dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subte...   251   2e-72
gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]   251   2e-72
gb|PNX92889.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   249   1e-71
dbj|GAU37387.1| hypothetical protein TSUD_22610 [Trifolium subte...   249   1e-71
gb|PNX86812.1| hypothetical protein L195_g042894, partial [Trifo...   233   4e-70
dbj|BAT97165.1| hypothetical protein VIGAN_09053500 [Vigna angul...   236   9e-70
ref|XP_019465366.1| PREDICTED: uncharacterized protein LOC109363...   229   7e-67
gb|KYP61911.1| Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan]   230   1e-65

>gb|PNX99332.1| retrotransposon-related protein [Trifolium pratense]
          Length = 1084

 Score =  276 bits (706), Expect = 6e-82
 Identities = 143/257 (55%), Positives = 184/257 (71%), Gaps = 4/257 (1%)
 Frame = +1

Query: 31  HTPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNPTKSQFSEKQTPPI 210
           HTP +IVKAV+L KVYEEK++                 ++        K + S + + PI
Sbjct: 211 HTPTSIVKAVSLAKVYEEKYTTNTKLPQTYQNNQITNKTYA------AKPENSTRNSAPI 264

Query: 211 LNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLLQVDD 390
           L+TP TRPM+ NQ+NPNI+RISPAE Q+R++KGLCY+CD+KFSFTHKCPN+ LMLLQ DD
Sbjct: 265 LHTPPTRPMHPNQRNPNIKRISPAERQIRRDKGLCYWCDEKFSFTHKCPNRQLMLLQYDD 324

Query: 391 GDTVETIE-PNPPDIPQT*TNLDT---EYHLSLNAMKGAGGFGIIRFQGSIGSISASVLL 558
           GDT    E P+PPD+  T  +LDT   E HLS+NAMKG    G++RF GSIG I   +L+
Sbjct: 325 GDTQLFDESPDPPDL--TTNSLDTNLPELHLSMNAMKGTNNMGVMRFAGSIGHIDVQILI 382

Query: 559 DGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKVSAY 738
           DGGS+DNF+QPRI   LK+P+E AP ++VLVGNG+ M AEG++K L ++I  H+++VSAY
Sbjct: 383 DGGSSDNFVQPRIAKFLKLPVEPAPIFKVLVGNGEIMTAEGVIKQLPINIQSHKLEVSAY 442

Query: 739 LLLVVGADVILGAPWLA 789
           LL V GADVILGA WLA
Sbjct: 443 LLPVAGADVILGASWLA 459


>dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subterraneum]
          Length = 1512

 Score =  267 bits (682), Expect = 8e-78
 Identities = 140/259 (54%), Positives = 181/259 (69%), Gaps = 6/259 (2%)
 Frame = +1

Query: 31  HTPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNP--TKSQFSEKQTP 204
           HTP +IVKAV+L KVYEEK++                  +  ++  P  TK + S + + 
Sbjct: 211 HTPNSIVKAVSLAKVYEEKYTTT--------LKPQKTYQNNYSNIKPLTTKPENSTRNSA 262

Query: 205 PILNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLLQV 384
           PILNTP TRPM+  QKNPNI+RISPAEMQ+R++KGLCY+CD+KFSFTHKCPN+ LMLLQ 
Sbjct: 263 PILNTPPTRPMSQFQKNPNIKRISPAEMQIRRDKGLCYWCDEKFSFTHKCPNRQLMLLQY 322

Query: 385 DDGDT-VETIEPNPPDIPQT*TNLDT---EYHLSLNAMKGAGGFGIIRFQGSIGSISASV 552
           DD +T +    P PPD P    +LDT   ++HLS+NAMKG    G+IRF GSI  I   +
Sbjct: 323 DDNETQLFDGSPEPPDSPTN--SLDTNIPDHHLSMNAMKGTSNMGVIRFVGSIEHIEVQI 380

Query: 553 LLDGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKVS 732
           L+DGGS+DNF+QPRI   LK+PIE AP ++VLVGNG+ M AEG++K L + I  H+++V 
Sbjct: 381 LIDGGSSDNFVQPRIAKFLKLPIEPAPVFKVLVGNGEIMNAEGVIKQLPIDIQGHKLEVP 440

Query: 733 AYLLLVVGADVILGAPWLA 789
           A+LL V G DV+LGA WLA
Sbjct: 441 AFLLPVAGVDVVLGASWLA 459


>gb|PNY16671.1| retrotransposon-related protein, partial [Trifolium pratense]
          Length = 1284

 Score =  264 bits (674), Expect = 6e-77
 Identities = 138/257 (53%), Positives = 173/257 (67%), Gaps = 4/257 (1%)
 Frame = +1

Query: 31  HTPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNPTKSQFSEKQTPPI 210
           HTPP++VKA +L KVYEEK+++                  T   FN  K +   + + PI
Sbjct: 200 HTPPSLVKAFSLAKVYEEKYTSNTNQKKFNTTNYA-----TNKPFN--KPEILTRDSAPI 252

Query: 211 LNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLLQVDD 390
           LNTP TRPM+  QKNPNIRRISPAE Q+R EKGLCY+CD+KFSFTHKCPN+ LML+Q DD
Sbjct: 253 LNTPPTRPMSQFQKNPNIRRISPAERQMRSEKGLCYWCDEKFSFTHKCPNRQLMLIQCDD 312

Query: 391 GDTVETIEP----NPPDIPQT*TNLDTEYHLSLNAMKGAGGFGIIRFQGSIGSISASVLL 558
            D  +  EP        I  + TN  TE+HLSLNAMKG    G++RF GSI  I   VL+
Sbjct: 313 SDADQMFEPMTQPEESTINSSITN-QTEHHLSLNAMKGTSNMGVLRFTGSIEQIKVQVLI 371

Query: 559 DGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKVSAY 738
           DGGS+DNF+QPRI   LK+PIE+ P++ VLVGNG+ M AEG+++ L L I  H++ V  +
Sbjct: 372 DGGSSDNFLQPRIAKFLKLPIESGPQFNVLVGNGETMTAEGIIQKLPLEIQGHKLDVPVF 431

Query: 739 LLLVVGADVILGAPWLA 789
           LL + GADVILGA WLA
Sbjct: 432 LLPIAGADVILGASWLA 448


>ref|XP_014634047.1| PREDICTED: uncharacterized protein LOC106799639 [Glycine max]
          Length = 600

 Score =  254 bits (648), Expect = 7e-77
 Identities = 132/257 (51%), Positives = 173/257 (67%), Gaps = 4/257 (1%)
 Frame = +1

Query: 31  HTPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNPTKSQFSEKQT-PP 207
           HTP ++VK V+L KVYEEK+++                 + RA FN  K + ++K    P
Sbjct: 223 HTPISMVKVVSLAKVYEEKYTSTSK----PHKSTPSNSYNHRAPFNSNKPENTQKANHTP 278

Query: 208 ILNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLLQVD 387
           +L T  TRPMN NQ+NPNI+RISPAEMQLR+EKGLCY+CDD+FS THKCPN+ +M+LQ D
Sbjct: 279 LLQTLPTRPMNPNQRNPNIKRISPAEMQLRREKGLCYWCDDQFSLTHKCPNRQVMMLQFD 338

Query: 388 DGDTVETIEPNPPDIPQT*TNLD---TEYHLSLNAMKGAGGFGIIRFQGSIGSISASVLL 558
           D +     EP    +  T    D    ++HLSLNAMKG    GI+RF G IG IS  VL+
Sbjct: 339 DSEKHIEPEPEKAQLDMTCNEPDPTTNDHHLSLNAMKGTNSMGILRFTGQIGQISVQVLI 398

Query: 559 DGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKVSAY 738
           DGGS+DNF+QPRI   LK+P+E  P ++VLVGN Q M AEG+V +L++++  HE+ V  +
Sbjct: 399 DGGSSDNFLQPRIAEFLKLPVEPGPCFKVLVGNVQTMTAEGVVPNLSITLQGHELIVPVF 458

Query: 739 LLLVVGADVILGAPWLA 789
           LL V GAD+ILG+ WLA
Sbjct: 459 LLPVAGADIILGSSWLA 475


>dbj|GAU11620.1| hypothetical protein TSUD_346120 [Trifolium subterraneum]
          Length = 1479

 Score =  263 bits (671), Expect = 2e-76
 Identities = 134/257 (52%), Positives = 179/257 (69%), Gaps = 4/257 (1%)
 Frame = +1

Query: 31  HTPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNPTKSQFSEKQTPPI 210
           HTP ++VKAV+L KVYEEK++    +            S  +  +N  KS+ + + + PI
Sbjct: 210 HTPSSLVKAVSLAKVYEEKYAMNSKS----QTRNYSNYSTNKPLYN--KSEIATRNSAPI 263

Query: 211 LNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLLQVDD 390
           LNTP TRPM+  QKNPNI+RISPAEMQ+R++KGLCY+CD+KFSFTHKCPN+ LMLL  DD
Sbjct: 264 LNTPPTRPMSQYQKNPNIKRISPAEMQVRRDKGLCYWCDEKFSFTHKCPNRQLMLLHYDD 323

Query: 391 GDTVETIEPN----PPDIPQT*TNLDTEYHLSLNAMKGAGGFGIIRFQGSIGSISASVLL 558
            D  + +EP+    P  I  + TN   ++HLSLNAMKG    G++RF G+I      VL+
Sbjct: 324 SDEEQLVEPSITLEPKTIDSSITNTP-DHHLSLNAMKGNNTMGVLRFTGAIEQFKVQVLI 382

Query: 559 DGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKVSAY 738
           DGGS+DNF+QPRI   LK+PIE  P ++VLVGNG+ M AEG++++L L I  H+I +  +
Sbjct: 383 DGGSSDNFLQPRIAKFLKLPIEPGPTFRVLVGNGEIMTAEGVIQELPLDIQGHKIHIPVF 442

Query: 739 LLLVVGADVILGAPWLA 789
           LL VVGAD++LGA WLA
Sbjct: 443 LLPVVGADIVLGASWLA 459


>gb|KYP53387.1| Retrotransposon-derived protein PEG10 [Cajanus cajan]
          Length = 431

 Score =  247 bits (631), Expect = 4e-76
 Identities = 125/258 (48%), Positives = 168/258 (65%), Gaps = 4/258 (1%)
 Frame = +1

Query: 28  AHTPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNPTKS---QFSEKQ 198
           A +PP++VK VAL K++EEK+    A                R   NP         +  
Sbjct: 83  ALSPPSLVKVVALAKLFEEKYILSSAPKNPSYQPRATTFYPNRHSSNPKPDIPHSLPKSN 142

Query: 199 TPPILNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLL 378
             P+L  P T+P     KN  +++ISPAEMQ+R+EKGLCYFCD+KF FTHKCPN+ +M+L
Sbjct: 143 LSPLLPNPSTKPFPQTHKN-QVKKISPAEMQIRREKGLCYFCDEKFPFTHKCPNRQMMML 201

Query: 379 QVDDGDTVETIEPNPPDIPQT*TNLDT-EYHLSLNAMKGAGGFGIIRFQGSIGSISASVL 555
           Q+ D + +++ EP+PPD+PQ  T +   E+HLSLNAMKG GG G I F G I  IS  VL
Sbjct: 202 QLIDDELLDSREPDPPDLPQPDTEVSNPEHHLSLNAMKGVGGVGTIEFTGHIEPISIKVL 261

Query: 556 LDGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKVSA 735
           +DGGS+D+F+QPRI H LK+PIE  P + V VGNGQ M  EG+++ LA++I  H++ V  
Sbjct: 262 VDGGSSDSFLQPRIAHFLKLPIELVPGFPVFVGNGQSMTTEGVIQQLAMTIQGHQLVVPV 321

Query: 736 YLLLVVGADVILGAPWLA 789
           YLL V GAD++LG+ WLA
Sbjct: 322 YLLSVFGADLVLGSSWLA 339


>gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 1502

 Score =  259 bits (663), Expect = 3e-75
 Identities = 137/262 (52%), Positives = 177/262 (67%), Gaps = 9/262 (3%)
 Frame = +1

Query: 31  HTPPNIVKAVALDKVYEEKHS----AQKANLXXXXXXXXXXXSHTRAHFNPTKSQFSEKQ 198
           HTPP++VKAV+L KVYEEK++     QKA +           +H+       K + + + 
Sbjct: 212 HTPPSLVKAVSLAKVYEEKYADAMNTQKATIN----------NHSTNKPFINKPEIATRN 261

Query: 199 TPPILNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLL 378
           T PILNTP TRPM+  QKNPNI+R+SPAE Q+R++KGLCY+CD+KFSFTHKCPN+ ++LL
Sbjct: 262 TAPILNTPPTRPMSQFQKNPNIKRMSPAERQVRRDKGLCYWCDEKFSFTHKCPNRQMLLL 321

Query: 379 QVDDGDT-VETIEPNPPDIPQT*TNLDT----EYHLSLNAMKGAGGFGIIRFQGSIGSIS 543
           Q DD D   + +        Q  TN  T    E+HLSLNA+KG    G+IRF GSI  I 
Sbjct: 322 QYDDDDNDADQVFDTLTQTEQVTTNGQTTNLPEHHLSLNALKGTSNMGVIRFAGSIEHIG 381

Query: 544 ASVLLDGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEI 723
             +L+DGGS+DNF+QPRI   LK+PIE  P++ VLVGNG+ M+AEGM++ L L I  H I
Sbjct: 382 VQILIDGGSSDNFMQPRIAKFLKLPIEPGPQFNVLVGNGEVMSAEGMIQKLPLHIQGHVI 441

Query: 724 KVSAYLLLVVGADVILGAPWLA 789
           +V  YLL + GADVILGA WLA
Sbjct: 442 EVPVYLLPIAGADVILGASWLA 463


>gb|KOM58233.1| hypothetical protein LR48_Vigan11g126700 [Vigna angularis]
          Length = 472

 Score =  245 bits (625), Expect = 9e-75
 Identities = 125/254 (49%), Positives = 170/254 (66%), Gaps = 4/254 (1%)
 Frame = +1

Query: 28  AHTPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNP---TKSQFSEKQ 198
           A +PP++VKAVAL K++EEK++   A                R  +N    T S   +  
Sbjct: 218 ALSPPSLVKAVALAKLFEEKYNPPNAAKNPVYLPRSSTIVPNRTSYNTKTDTSSSLPKST 277

Query: 199 TPPILNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLL 378
            PP+L  P  +P++   K+  IR++SPAEMQLR+EK LCYFCD+KFSF+HKCPN+ +MLL
Sbjct: 278 LPPLLPNPNIKPLSQTNKHHQIRKLSPAEMQLRREKCLCYFCDEKFSFSHKCPNRQMMLL 337

Query: 379 QVDDGDTVETIEPNPPDIPQT*TNL-DTEYHLSLNAMKGAGGFGIIRFQGSIGSISASVL 555
           Q+ D D  +T EP+PPD+ QT + L + E+HLSLNAMKG GG G I F G IG ++  +L
Sbjct: 338 QLIDDDLGDTREPDPPDLIQTDSELCNPEHHLSLNAMKGVGGVGTIGFTGHIGPLAVKIL 397

Query: 556 LDGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKVSA 735
           +DGGS+DNFIQPRI   LK+PIE    +QV VGNGQ M  EG+++ LA++I  H++ V  
Sbjct: 398 VDGGSSDNFIQPRIAQFLKLPIEYVTGFQVFVGNGQSMTTEGVIQQLAVTIQGHQLIVPV 457

Query: 736 YLLLVVGADVILGA 777
           YL  V GAD++LG+
Sbjct: 458 YLFPVSGADLVLGS 471


>gb|PNX94483.1| retrotransposon-related protein, partial [Trifolium pratense]
          Length = 1287

 Score =  258 bits (658), Expect = 1e-74
 Identities = 137/263 (52%), Positives = 177/263 (67%), Gaps = 10/263 (3%)
 Frame = +1

Query: 31  HTPPNIVKAVALDKVYEEKHSA----QKANLXXXXXXXXXXXSHTRAHFNP---TKSQFS 189
           HTPP++VKAV+L KVYEEK+++    QK+N             HT    N     K +  
Sbjct: 212 HTPPSLVKAVSLAKVYEEKYASNLKSQKSN-------------HTNYSTNQPFTNKPETI 258

Query: 190 EKQTPPILNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHL 369
            + + PILNTP TRPM+  QKNPNI+RISPAE Q+R++KGLCY+CDDKFS+THKCPN+ L
Sbjct: 259 TRNSAPILNTPPTRPMSQFQKNPNIKRISPAERQVRRDKGLCYWCDDKFSYTHKCPNRQL 318

Query: 370 MLLQVDDGDTVETIEPNPPDIPQT*TNLDT---EYHLSLNAMKGAGGFGIIRFQGSIGSI 540
           MLLQ DD +    +E            L+T   E+HLS NAMKG    GI+RF G+I  I
Sbjct: 319 MLLQYDDNEEENVVEIPSDSSELAINTLETTQPEHHLSFNAMKGNSSMGILRFSGTIEHI 378

Query: 541 SASVLLDGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHE 720
              +L+DGGS+DNF+QPRI   LK+PIE  P ++VLVGNG+ M AEG++++LAL+I   E
Sbjct: 379 QVQILIDGGSSDNFLQPRIARFLKLPIEPGPVFKVLVGNGEIMTAEGVIQNLALNIQGTE 438

Query: 721 IKVSAYLLLVVGADVILGAPWLA 789
           ++V  +LL V GADVILGA WLA
Sbjct: 439 LQVPVFLLPVAGADVILGASWLA 461


>gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1535

 Score =  258 bits (658), Expect = 1e-74
 Identities = 134/257 (52%), Positives = 175/257 (68%), Gaps = 4/257 (1%)
 Frame = +1

Query: 31  HTPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNPTKSQFSEKQTPPI 210
           HTP ++VKA++L KVYEEK+S    N            S++    N  K  ++ + T PI
Sbjct: 210 HTPISLVKAMSLAKVYEEKYSYNNKN------QKNYSNSYSTNKPNTNKPDYTTRNTAPI 263

Query: 211 LNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLLQVDD 390
           LNTP TRPM+  Q NPNI+R+S AE QLR++KGLCY+CDDKFSFTHKCPN+ LML+Q DD
Sbjct: 264 LNTPPTRPMSQFQNNPNIKRMSQAERQLRRDKGLCYWCDDKFSFTHKCPNRQLMLIQNDD 323

Query: 391 G-DTVETIEPNPPDIPQT*TNLDT---EYHLSLNAMKGAGGFGIIRFQGSIGSISASVLL 558
             D  + ++        T  +LDT   E+HLSLNAMKG    G++RF GSI  I   +L+
Sbjct: 324 DLDADQVLDQLTQTTETTIKSLDTNQPEHHLSLNAMKGTSNMGVLRFAGSIEHIGVQILI 383

Query: 559 DGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKVSAY 738
           DGGS+DNF+QPRI   LK+PIE  P++ VLVGNG+ M AEG++++L L I  H+++V  +
Sbjct: 384 DGGSSDNFLQPRIAKFLKLPIEPGPQFNVLVGNGEIMTAEGVIQNLPLEIQGHKLEVPVF 443

Query: 739 LLLVVGADVILGAPWLA 789
           LL V GADVILGA WLA
Sbjct: 444 LLPVAGADVILGASWLA 460


>dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subterraneum]
          Length = 1531

 Score =  257 bits (657), Expect = 2e-74
 Identities = 137/256 (53%), Positives = 174/256 (67%), Gaps = 3/256 (1%)
 Frame = +1

Query: 31  HTPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNPTKSQFSEKQTPPI 210
           HTP ++VKAV+L KVYEEK++                   T   +N  K + S + T PI
Sbjct: 212 HTPSSLVKAVSLAKVYEEKYTTTMK-----PQKPYTQTYSTNKPYN-NKPENSTRNTAPI 265

Query: 211 LNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLLQVDD 390
           LNTP TRPM+  QKNPN++RISPAEMQLR++KGLCY+CDDKFSFTHKCPN+ LMLLQ +D
Sbjct: 266 LNTPPTRPMSQFQKNPNVKRISPAEMQLRRDKGLCYWCDDKFSFTHKCPNRQLMLLQYED 325

Query: 391 GDTVETIEPNPPDIPQT*---TNLDTEYHLSLNAMKGAGGFGIIRFQGSIGSISASVLLD 561
            +     E   P  P T    TNL  + HLS++AMKG+   G++RF G+I  I   +L+D
Sbjct: 326 SEDQVLDEITDPPDPTTNGLTTNLP-KLHLSMSAMKGSSHMGVLRFTGAIEHIQVQILID 384

Query: 562 GGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKVSAYL 741
           GGS+DNF+QPRI   LK+PIE AP ++VLVGNG+ M AEG+VK L L +  H ++V  YL
Sbjct: 385 GGSSDNFVQPRIAKFLKLPIEPAPIFKVLVGNGEVMTAEGIVKQLPLDVQGHRLQVPVYL 444

Query: 742 LLVVGADVILGAPWLA 789
           L V GADVILGA WL+
Sbjct: 445 LPVAGADVILGASWLS 460


>dbj|GAU25507.1| hypothetical protein TSUD_279910 [Trifolium subterraneum]
          Length = 1389

 Score =  256 bits (655), Expect = 3e-74
 Identities = 137/258 (53%), Positives = 176/258 (68%), Gaps = 5/258 (1%)
 Frame = +1

Query: 31  HTPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNPTKSQFSEKQTPPI 210
           HTP ++VKAV+L KVYEEK+++                S  +   N T+ Q   +   PI
Sbjct: 205 HTPNSLVKAVSLAKVYEEKYTSSNK----PQRINTNNYSTNKPFMNRTEIQ--TRNATPI 258

Query: 211 LNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLLQVD- 387
           LNTP TRPM+  QKNPNI+RISPAEMQ+R+ KGLCY+CDDKFSFTHKCPN+ LMLL  D 
Sbjct: 259 LNTPPTRPMSQFQKNPNIKRISPAEMQIRRNKGLCYWCDDKFSFTHKCPNRQLMLLHYDE 318

Query: 388 DGDTVETIEPNPPDIPQT*TN-LDT---EYHLSLNAMKGAGGFGIIRFQGSIGSISASVL 555
           D D  + +        +  TN LDT   E+HLSLNAMKG    G++RF GSI +I   +L
Sbjct: 319 DSDNEDKVLDTMTQSTEITTNSLDTNQPEHHLSLNAMKGTNNMGVLRFAGSINNIGVQIL 378

Query: 556 LDGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKVSA 735
           +DGGS+DNF+QPRI   LK+PIE+ P+++VLVGNG+ M AEG+V ++ L I  H+++V  
Sbjct: 379 IDGGSSDNFLQPRIAKFLKLPIESGPQFKVLVGNGEIMTAEGVVHNVPLEIQGHKLEVPV 438

Query: 736 YLLLVVGADVILGAPWLA 789
           +LL V GADVILGA WLA
Sbjct: 439 FLLPVAGADVILGASWLA 456


>dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subterraneum]
          Length = 1500

 Score =  251 bits (642), Expect = 2e-72
 Identities = 127/256 (49%), Positives = 173/256 (67%), Gaps = 3/256 (1%)
 Frame = +1

Query: 31  HTPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNPTKSQFSEKQTPPI 210
           HTP ++VKA++L KVYEEK+S+   +               + +FN  K+  + +   P+
Sbjct: 211 HTPISLVKAMSLAKVYEEKYSSCLKSQKNYSNSQLT----NKPNFN--KNDTTTRNAAPV 264

Query: 211 LNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLLQVDD 390
           LNTP TRPM+  QKNPNI+RISPAEMQLR++KGLCY+CD+KFSFTHKCPN+ LMLL  DD
Sbjct: 265 LNTPPTRPMSQYQKNPNIKRISPAEMQLRRDKGLCYWCDEKFSFTHKCPNRQLMLLHYDD 324

Query: 391 GD---TVETIEPNPPDIPQT*TNLDTEYHLSLNAMKGAGGFGIIRFQGSIGSISASVLLD 561
            D    ++T+         + T    E+HLS NA+KG    G+IRF GSIG +   +L+D
Sbjct: 325 NDEDQVLDTLTQQDEITTDSPTTNLPEHHLSFNALKGNSNMGVIRFAGSIGKLGVQILID 384

Query: 562 GGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKVSAYL 741
           GGS+DNF+QPR+   LK+P+E  P++ VLVGNG+ M+AEG ++ L + I  H I++  +L
Sbjct: 385 GGSSDNFLQPRVAKFLKLPVEPGPQFNVLVGNGEIMSAEGTIQKLPVEIQGHMIEIPVFL 444

Query: 742 LLVVGADVILGAPWLA 789
           L + GADVILGA WLA
Sbjct: 445 LPIAGADVILGASWLA 460


>gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 1210

 Score =  251 bits (640), Expect = 2e-72
 Identities = 126/258 (48%), Positives = 173/258 (67%), Gaps = 4/258 (1%)
 Frame = +1

Query: 28  AHTPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNPTKS---QFSEKQ 198
           A +PP++VKAVAL K++E K++   A                R   NP         +  
Sbjct: 227 ALSPPSLVKAVALAKLFEAKYTPSSAPRNPSYQPRAPTFYPNRHSSNPKPDIPHSLPKSN 286

Query: 199 TPPILNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLL 378
            PP+L  P T+P +   +N  +++ISPAEMQ+R+EKGLCYFCD+KFSF HKCPN+H+M+L
Sbjct: 287 LPPLLPNPSTKPFSQTYQN-QVKKISPAEMQIRREKGLCYFCDEKFSFNHKCPNRHMMML 345

Query: 379 QVDDGDTVETIEPNPPDIPQT*TNL-DTEYHLSLNAMKGAGGFGIIRFQGSIGSISASVL 555
           Q+ D + V++ EP+PPD+PQ    + + E+HLSLNAMKG GG G I F G IG I+  VL
Sbjct: 346 QLIDDELVDSREPDPPDLPQPDIEVGNPEHHLSLNAMKGVGGVGTIGFTGHIGPIAIKVL 405

Query: 556 LDGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKVSA 735
           +DGGS+D+F+QPRI H LK+PIE    +QV VGNGQ M  EG+++ LA++I  H++ V  
Sbjct: 406 VDGGSSDSFLQPRIAHFLKLPIELVRGFQVFVGNGQSMTTEGVIQQLAVTIQGHQLVVPV 465

Query: 736 YLLLVVGADVILGAPWLA 789
           YLL V GAD++LG+ WLA
Sbjct: 466 YLLPVSGADLVLGSSWLA 483


>gb|PNX92889.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1302

 Score =  249 bits (636), Expect = 1e-71
 Identities = 134/257 (52%), Positives = 173/257 (67%), Gaps = 4/257 (1%)
 Frame = +1

Query: 31  HTPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNPTKSQFSEKQTPPI 210
           HTP ++VKA +L KVYEEK+++                  T    N  K +   + + PI
Sbjct: 188 HTPSSLVKAFSLAKVYEEKYTSTTNQKRLNTTNYS-----TNKPLN--KPEILTRDSAPI 240

Query: 211 LNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLLQVDD 390
           LNTP TRPM+  QKNPNI+RISPAE Q+R++KGLCY+CD+KFSFTHKCPN+ LML+Q DD
Sbjct: 241 LNTPPTRPMSQFQKNPNIKRISPAERQVRRDKGLCYWCDEKFSFTHKCPNRQLMLVQYDD 300

Query: 391 G-DTVETIEPNPPDIPQT*TNLDT---EYHLSLNAMKGAGGFGIIRFQGSIGSISASVLL 558
             D +      PPDI  T  + DT   E+HLSLNAMKG    G++RF+GSI  I   +L+
Sbjct: 301 DEDKLFDEMTQPPDI--TTNSHDTNPPEHHLSLNAMKGTSNMGVLRFEGSIEHIRVQILI 358

Query: 559 DGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKVSAY 738
           DGGS+DNF+QPRI   L++PIE  P++ VLVGNG+ M AEG+++ L L I  H ++V  +
Sbjct: 359 DGGSSDNFLQPRIAKFLRLPIEPGPQFNVLVGNGEVMTAEGVIQKLPLEIQGHMLEVPVF 418

Query: 739 LLLVVGADVILGAPWLA 789
           LL V GADVILGA WLA
Sbjct: 419 LLPVAGADVILGASWLA 435


>dbj|GAU37387.1| hypothetical protein TSUD_22610 [Trifolium subterraneum]
          Length = 1418

 Score =  249 bits (636), Expect = 1e-71
 Identities = 133/256 (51%), Positives = 172/256 (67%), Gaps = 3/256 (1%)
 Frame = +1

Query: 31  HTPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNPTKSQFSEKQTPPI 210
           HTP +IVK V+L KVYEEK+++ +              S  +  +N  KS+ + +   PI
Sbjct: 186 HTPISIVKVVSLAKVYEEKYASNQK----LQKNNTTNYSTNKPLYN--KSENTTRNAAPI 239

Query: 211 LNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLLQVDD 390
           LNT  TRPM+  QKNPNI+RISPAE+Q+R++KGLCY+CD+KFSFTHKCPN+ LMLLQ DD
Sbjct: 240 LNTSPTRPMSQFQKNPNIKRISPAEIQIRRDKGLCYWCDEKFSFTHKCPNRQLMLLQYDD 299

Query: 391 GDTVETIEPNPPDIPQT*TNLDT---EYHLSLNAMKGAGGFGIIRFQGSIGSISASVLLD 561
            D    +E      P T  + DT   E+HLSLNAMKG    G++RF GSI  I   VL+D
Sbjct: 300 KDEDPVLETLTQTTPITTNSPDTNQPEHHLSLNAMKGTRNMGVLRFAGSIEHIEVQVLID 359

Query: 562 GGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKVSAYL 741
           GGS++NF+QPRI   LK+PIE  P+++VLVGNG+ M AE ++  L L I  H++ V  +L
Sbjct: 360 GGSSNNFLQPRIAKFLKLPIEPRPQFKVLVGNGEIMTAERVINKLPLEIQGHKLDVPVFL 419

Query: 742 LLVVGADVILGAPWLA 789
           L V GADVILGA W A
Sbjct: 420 LPVAGADVILGASWFA 435


>gb|PNX86812.1| hypothetical protein L195_g042894, partial [Trifolium pratense]
          Length = 487

 Score =  233 bits (595), Expect = 4e-70
 Identities = 127/262 (48%), Positives = 171/262 (65%), Gaps = 8/262 (3%)
 Frame = +1

Query: 28  AHTPPNIVKAVALDKVYEEKHSAQ---KANLXXXXXXXXXXXSHTRAHFNP----TKSQF 186
           A TP N+ KA AL K++EEK++ Q   K N            +    + N     T+   
Sbjct: 40  ALTPANLPKAFALAKLFEEKYTTQTKPKTNPYKSSYTPNSYQNKISPNTNKPHPITQQNP 99

Query: 187 SEKQTPPILNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKH 366
              Q PP+L TP       NQK  +I+ +S AE+QLR++KGLCYFCDDKFS TH+CPN+ 
Sbjct: 100 QRAQLPPLLPTP-------NQKPMSIKNMSSAEIQLRRDKGLCYFCDDKFSHTHRCPNRR 152

Query: 367 LMLLQVDDGDTVETIEPNPPDIP-QT*TNLDTEYHLSLNAMKGAGGFGIIRFQGSIGSIS 543
           +M+LQ+ + D  E +EP+PP+    + T+ D ++HLSLNAMKG  G GIIRF G IG+I 
Sbjct: 153 VMMLQLREEDDKE-LEPDPPEESLNSHTSDDNQHHLSLNAMKGISGRGIIRFTGMIGNIE 211

Query: 544 ASVLLDGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEI 723
             VL+DGGS+D ++QPRI   LK+PIE +PK+QVLVGNGQ +  EGMV+ L + +  HE+
Sbjct: 212 VQVLVDGGSSDTYLQPRIAQFLKVPIETSPKFQVLVGNGQSLIVEGMVRQLHVQVQGHEL 271

Query: 724 KVSAYLLLVVGADVILGAPWLA 789
            + AYLL V GAD+ILG+ WLA
Sbjct: 272 TIPAYLLPVAGADLILGSSWLA 293


>dbj|BAT97165.1| hypothetical protein VIGAN_09053500 [Vigna angularis var.
           angularis]
          Length = 651

 Score =  236 bits (603), Expect = 9e-70
 Identities = 122/255 (47%), Positives = 166/255 (65%), Gaps = 4/255 (1%)
 Frame = +1

Query: 28  AHTPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNP---TKSQFSEKQ 198
           A +PP++VKAVAL K++EEK++   A                R  +N    T S   +  
Sbjct: 218 ALSPPSLVKAVALAKLFEEKYNPPNAAKNPVYLPRSSTIVPNRTSYNTKTDTSSSLPKST 277

Query: 199 TPPILNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLL 378
            PP+L  P  +P++   K+  IR++SPAEMQLR+EK LCYFCD+KFSF+HKCPN+ +MLL
Sbjct: 278 LPPLLPNPNIKPLSQTNKHHQIRKLSPAEMQLRREKCLCYFCDEKFSFSHKCPNRQMMLL 337

Query: 379 QVDDGDTVETIEPNPPDIPQT*TNL-DTEYHLSLNAMKGAGGFGIIRFQGSIGSISASVL 555
           Q+ D D  +T EP+PPD+ QT + L + E+HLSLNAMKG GG G I F G IG ++  +L
Sbjct: 338 QLIDDDLGDTREPDPPDLIQTDSELCNPEHHLSLNAMKGVGGVGTIGFTGHIGPLAVKIL 397

Query: 556 LDGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKVSA 735
           +DGGS+DNFIQPRI   LK+PIE    +QV VGNGQ M  EG+++ LA++I  H++ V  
Sbjct: 398 VDGGSSDNFIQPRIAQFLKLPIEYVTGFQVFVGNGQSMTTEGVIQQLAVTIQGHQLIVPV 457

Query: 736 YLLLVVGADVILGAP 780
           YL  V G  ++   P
Sbjct: 458 YLFPVSGEHMLQPQP 472


>ref|XP_019465366.1| PREDICTED: uncharacterized protein LOC109363560 [Lupinus
           angustifolius]
          Length = 661

 Score =  229 bits (584), Expect = 7e-67
 Identities = 122/258 (47%), Positives = 166/258 (64%), Gaps = 6/258 (2%)
 Frame = +1

Query: 34  TPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNPTKSQFSEKQTPPIL 213
           +P N++K VAL K++EEK+                  + +      T+    +   PP+L
Sbjct: 240 SPINLLKVVALAKLFEEKYQTTPTKYPYSTNNHKAIPNSSSYQ---TRFPGPKPSLPPLL 296

Query: 214 NTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLMLLQVDDG 393
            TP  RP +L QK  NI+RI+P EMQ+R++KGLCY+CD+KFSF+HKCPNKHL+LLQVDD 
Sbjct: 297 PTPNIRPHDLTQKPTNIKRITPVEMQVRRDKGLCYYCDEKFSFSHKCPNKHLLLLQVDD- 355

Query: 394 DTVETIEPN-----PPDIPQT*TNLD-TEYHLSLNAMKGAGGFGIIRFQGSIGSISASVL 555
                I PN     PPDIPQ+  +    E HLSLN M GA G G I+F G IG +   +L
Sbjct: 356 -----ISPNDPHTDPPDIPQSPDDPSRMELHLSLNTMTGANGVGTIKFTGLIGEL--QIL 408

Query: 556 LDGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKVSA 735
           LDGG +DNF+Q R+ H L +P+E AP +++LVGNG  ++AE M+ +LA+ +  HE+ +  
Sbjct: 409 LDGGISDNFLQIRLAHFLNLPVEPAPCFKLLVGNGNTLSAEAMINNLAVKVQGHELCLPV 468

Query: 736 YLLLVVGADVILGAPWLA 789
           Y+L VVGAD+ILGA WLA
Sbjct: 469 YMLPVVGADLILGAAWLA 486


>gb|KYP61911.1| Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan]
          Length = 963

 Score =  230 bits (587), Expect = 1e-65
 Identities = 122/260 (46%), Positives = 170/260 (65%), Gaps = 6/260 (2%)
 Frame = +1

Query: 28  AHTPPNIVKAVALDKVYEEKHSAQKANLXXXXXXXXXXXSHTRAHFNPTKSQF--SEKQT 201
           A +P +++KAV+L K+YEEK+S                 +++R H N T S +  + +QT
Sbjct: 146 AQSPHSLLKAVSLAKLYEEKYSTSTK--------PAYTTTYSR-HLNTTPSPYLNTNQQT 196

Query: 202 PPI---LNTPLTRPMNLNQKNPNIRRISPAEMQLRKEKGLCYFCDDKFSFTHKCPNKHLM 372
           P I   L  P  +P     K+PNI++ISPAEMQ+R+EKGLCY CDDKFS TH+CPNK  +
Sbjct: 197 PSIPAILPNPSQKPFTHLPKSPNIKKISPAEMQIRREKGLCYTCDDKFSPTHRCPNKQYL 256

Query: 373 LLQVDDGDTVETIEPNPPD-IPQT*TNLDTEYHLSLNAMKGAGGFGIIRFQGSIGSISAS 549
           LL ++D D    I+  PPD I     +++ E+H+S NA+ G+ G G +RF GSI  ++  
Sbjct: 257 LLHIEDDDD-PPIDLAPPDPISSPCPDVNREHHVSFNALNGSSGLGTMRFHGSINGVNVK 315

Query: 550 VLLDGGSTDNFIQPRIVHCLKMPIEAAPKWQVLVGNGQKMAAEGMVKDLALSIDEHEIKV 729
           +LLD GS+DNF+QPR+ H LK+PIE    +QVLVGNG  +  EG+VKD+ ++I  H IK+
Sbjct: 316 ILLDSGSSDNFLQPRLAHYLKLPIEPISSFQVLVGNGNSLTVEGLVKDVTVTIQGHTIKL 375

Query: 730 SAYLLLVVGADVILGAPWLA 789
             YLL V GADV+LGA WL+
Sbjct: 376 PVYLLPVSGADVVLGASWLS 395


Top