BLASTX nr result

ID: Astragalus23_contig00017226 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00017226
         (797 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABN08405.1| Peptidase aspartic, active site [Medicago truncat...   367   e-123
gb|ABN08407.1| Peptidase aspartic, active site [Medicago truncat...   367   e-123
gb|PNY03339.1| retrotransposon-related protein, partial [Trifoli...   374   e-119
dbj|GAU18768.1| hypothetical protein TSUD_80570 [Trifolium subte...   293   1e-89
dbj|GAU28992.1| hypothetical protein TSUD_391930 [Trifolium subt...   248   1e-71
dbj|GAU46429.1| hypothetical protein TSUD_402070 [Trifolium subt...   215   4e-60
dbj|GAU28744.1| hypothetical protein TSUD_372530 [Trifolium subt...   214   9e-60
dbj|GAU12466.1| hypothetical protein TSUD_229990, partial [Trifo...   211   1e-58
dbj|GAU48361.1| hypothetical protein TSUD_282420 [Trifolium subt...   211   1e-58
dbj|GAU37691.1| hypothetical protein TSUD_164940 [Trifolium subt...   205   2e-56
gb|PNX57013.1| hypothetical protein L195_g050182, partial [Trifo...   192   3e-56
dbj|GAU17298.1| hypothetical protein TSUD_110150 [Trifolium subt...   196   2e-53
dbj|GAU31427.1| hypothetical protein TSUD_221980 [Trifolium subt...   196   3e-53
gb|PNX93254.1| Ty3/gypsy retrotransposon protein, partial [Trifo...   194   1e-52
gb|PNX97977.1| hypothetical protein L195_g021217, partial [Trifo...   194   2e-52
dbj|GAU42300.1| hypothetical protein TSUD_136860 [Trifolium subt...   194   2e-52
gb|ABN06064.1| RNA-directed DNA polymerase (Reverse transcriptas...   193   3e-52
ref|XP_014624207.1| PREDICTED: uncharacterized protein LOC106796...   192   4e-52
gb|PNX92072.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   191   1e-51
ref|XP_017431852.1| PREDICTED: uncharacterized protein LOC108339...   188   6e-51

>gb|ABN08405.1| Peptidase aspartic, active site [Medicago truncatula]
          Length = 435

 Score =  367 bits (942), Expect = e-123
 Identities = 175/260 (67%), Positives = 208/260 (80%), Gaps = 6/260 (2%)
 Frame = +2

Query: 35  TGDEELQLNVMSFNGLM------ENANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRR 196
           TG EELQLNV++F   +      E   DR   I+ QG +  +PVLMLVDSGA +NFMSRR
Sbjct: 107 TGAEELQLNVLTFENALTFDRQTEYYQDRFQCIRFQGKVREIPVLMLVDSGANKNFMSRR 166

Query: 197 LALAMGLHITEAPPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLG 376
           LALA+GL ITE P R I+LGDG    T GECH VII VQG+ WEI+ MLF+L G DLVLG
Sbjct: 167 LALALGLRITETPVRRIRLGDGHVVPTLGECHGVIISVQGVEWEIDVMLFELRGYDLVLG 226

Query: 377 MAWLTQIGATWIDWIQKKMRFDYQSEWVEIQGVRSKECEPLQQYVDDNHFAQLKCVSQEA 556
           MAWLTQIG T IDW++KKMRFDYQ EW+EI+G+R++EC PLQ YVD+NHF QL C  Q  
Sbjct: 227 MAWLTQIGCTCIDWVEKKMRFDYQGEWIEIRGIRTRECTPLQNYVDENHFGQLHCDVQPG 286

Query: 557 VVTKSQQIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYPHHHKSE 736
           +VT +QQ+EM+S+LD F+++FKEPQGLPP RQQEHAIHL+ GQGP+NV PYRYPHHHK+E
Sbjct: 287 MVTPNQQLEMKSLLDNFDNIFKEPQGLPPGRQQEHAIHLLHGQGPVNVRPYRYPHHHKTE 346

Query: 737 IEKQVQELLLTGVIRPSQSA 796
           IEKQV+ELLL+GVIRPSQSA
Sbjct: 347 IEKQVKELLLSGVIRPSQSA 366


>gb|ABN08407.1| Peptidase aspartic, active site [Medicago truncatula]
          Length = 435

 Score =  367 bits (942), Expect = e-123
 Identities = 175/260 (67%), Positives = 208/260 (80%), Gaps = 6/260 (2%)
 Frame = +2

Query: 35  TGDEELQLNVMSFNGLM------ENANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRR 196
           TG EELQLNV++F   +      E   DR   I+ QG +  +PVLMLVDSGA +NFMSRR
Sbjct: 107 TGAEELQLNVLTFENALTFDRQTEYYQDRFQCIRFQGKVREIPVLMLVDSGANKNFMSRR 166

Query: 197 LALAMGLHITEAPPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLG 376
           LALA+GL ITE P R I+LGDG    T GECH VII VQG+ WEI+ MLF+L G DLVLG
Sbjct: 167 LALALGLRITETPVRRIRLGDGHVVPTLGECHGVIISVQGVEWEIDVMLFELRGYDLVLG 226

Query: 377 MAWLTQIGATWIDWIQKKMRFDYQSEWVEIQGVRSKECEPLQQYVDDNHFAQLKCVSQEA 556
           MAWLTQIG T IDW++KKMRFDYQ EW+EI+G+R++EC PLQ YVD+NHF QL C  Q  
Sbjct: 227 MAWLTQIGCTCIDWVEKKMRFDYQGEWIEIRGIRTRECTPLQNYVDENHFGQLHCDVQPG 286

Query: 557 VVTKSQQIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYPHHHKSE 736
           +VT +QQ+EM+S+LD F+++FKEPQGLPP RQQEHAIHL+ GQGP+NV PYRYPHHHK+E
Sbjct: 287 MVTPNQQLEMKSLLDNFDNIFKEPQGLPPGRQQEHAIHLLHGQGPVNVRPYRYPHHHKTE 346

Query: 737 IEKQVQELLLTGVIRPSQSA 796
           IEKQV+ELLL+GVIRPSQSA
Sbjct: 347 IEKQVKELLLSGVIRPSQSA 366


>gb|PNY03339.1| retrotransposon-related protein, partial [Trifolium pratense]
          Length = 1048

 Score =  374 bits (959), Expect = e-119
 Identities = 178/260 (68%), Positives = 212/260 (81%), Gaps = 6/260 (2%)
 Frame = +2

Query: 35   TGDEELQLNVMSFNGLM------ENANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRR 196
            TG EELQLNV++F  ++      E   D L  I+LQGT+G +PVLMLVDSGA +NFMSR 
Sbjct: 357  TGAEELQLNVLTFEHVLTFDKQTEYYQDMLQCIRLQGTVGTIPVLMLVDSGANKNFMSRH 416

Query: 197  LALAMGLHITEAPPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLG 376
            LALA+GL ITE P R I+LGDG  A T GECH VII VQG+ WEI+ +LFDL G DLVLG
Sbjct: 417  LALALGLRITETPARDIRLGDGHVAPTLGECHGVIIFVQGVKWEIDVVLFDLGGYDLVLG 476

Query: 377  MAWLTQIGATWIDWIQKKMRFDYQSEWVEIQGVRSKECEPLQQYVDDNHFAQLKCVSQEA 556
            MAWLTQIG T+IDW +KKMRFDYQ EW+EI+G+R++EC PLQ YVD+NHF QL C  Q+ 
Sbjct: 477  MAWLTQIGCTYIDWTEKKMRFDYQGEWIEIRGIRTRECTPLQNYVDENHFDQLHCDVQQG 536

Query: 557  VVTKSQQIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYPHHHKSE 736
            +VT +QQ EM+SVLD F+++FKEPQGLPP RQQEHAIHL+ GQGP+NV PYRYPHHHK+E
Sbjct: 537  MVTPNQQSEMKSVLDNFDTIFKEPQGLPPGRQQEHAIHLLNGQGPVNVRPYRYPHHHKTE 596

Query: 737  IEKQVQELLLTGVIRPSQSA 796
            IEKQV+E+LL+GVIRPSQSA
Sbjct: 597  IEKQVKEMLLSGVIRPSQSA 616


>dbj|GAU18768.1| hypothetical protein TSUD_80570 [Trifolium subterraneum]
          Length = 895

 Score =  293 bits (751), Expect = 1e-89
 Identities = 151/260 (58%), Positives = 181/260 (69%), Gaps = 6/260 (2%)
 Frame = +2

Query: 35   TGDEELQLNVMSFNGLM------ENANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRR 196
            TG EELQLNV++F  ++      E   D L  I+LQGT+G +PVLMLVDSGA +NFMSR 
Sbjct: 350  TGAEELQLNVLTFEHVLTFDKQIEYYQDMLQCIRLQGTVGTIPVLMLVDSGANKNFMSRH 409

Query: 197  LALAMGLHITEAPPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLG 376
            LALA+GL ITE P R I+LGDG  A T GECH VII VQG+ WEI+ +LF+L G DLVLG
Sbjct: 410  LALALGLRITETPARHIRLGDGHMAPTLGECHGVIISVQGVKWEIDVVLFELGGYDLVLG 469

Query: 377  MAWLTQIGATWIDWIQKKMRFDYQSEWVEIQGVRSKECEPLQQYVDDNHFAQLKCVSQEA 556
                                            +R++EC PLQ YVD+NHF QL C  Q  
Sbjct: 470  --------------------------------IRTRECTPLQNYVDENHFVQLHCEVQPG 497

Query: 557  VVTKSQQIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYPHHHKSE 736
            +VT +QQ+EM+ VLD F++VFKEP GLPP RQQEHAIHL+ GQGP+NV PYRYPHHHK+E
Sbjct: 498  MVTPNQQLEMKLVLDNFDNVFKEPHGLPPGRQQEHAIHLLNGQGPVNVRPYRYPHHHKTE 557

Query: 737  IEKQVQELLLTGVIRPSQSA 796
            IEKQV+E+LL+GVIRPSQSA
Sbjct: 558  IEKQVKEMLLSGVIRPSQSA 577


>dbj|GAU28992.1| hypothetical protein TSUD_391930 [Trifolium subterraneum]
          Length = 1407

 Score =  248 bits (634), Expect = 1e-71
 Identities = 128/232 (55%), Positives = 160/232 (68%), Gaps = 15/232 (6%)
 Frame = +2

Query: 41   DEELQLNVMSFNGLMENANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRRLALAMGLH 220
            ++E +LN MSFNGL E+    L+S+K++GTI GVP++MLVDSG T NF+SRRL  A+GL 
Sbjct: 411  EDEGELNTMSFNGLTESRRATLDSMKVRGTIRGVPLVMLVDSGTTHNFISRRLVNALGLT 470

Query: 221  ITEAPPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLGMAWLTQIG 400
            +T+ PP  IKLGDG      GEC DVII +QG+S+ IN MLFDL+G+DLVLGMAWLT+IG
Sbjct: 471  VTDTPPMKIKLGDGHATTIQGECRDVIIRIQGLSFVINAMLFDLNGVDLVLGMAWLTKIG 530

Query: 401  ATWIDWIQKKMRFDYQSEWVEIQGVRSKECEPLQQYVDDNH--FAQLKCV---------- 544
              W DW QK MRF++  EWVEI+G+R      LQ+YV DN   FA L             
Sbjct: 531  CIWFDWNQKLMRFEHNGEWVEIKGMRLVLFRSLQEYVSDNRYSFADLLLTQHMHEEKDHR 590

Query: 545  SQEAVVT---KSQQIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGP 691
            ++E   T    +Q  ++Q +L  F  VFKEPQGLPPERQQEHAIHL+EG GP
Sbjct: 591  NEEVAATHLEATQSTDIQRLLAAFAEVFKEPQGLPPERQQEHAIHLLEGAGP 642


>dbj|GAU46429.1| hypothetical protein TSUD_402070 [Trifolium subterraneum]
          Length = 1026

 Score =  215 bits (547), Expect = 4e-60
 Identities = 110/254 (43%), Positives = 163/254 (64%), Gaps = 2/254 (0%)
 Frame = +2

Query: 41  DEELQLNVMSFNGLMENANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRRLALAMGLH 220
           + E +++V+SF  L +N   +  SIKL+GTI GVPVL+L+DSGAT NF+S  L   M   
Sbjct: 200 ETEGEISVLSFQQLAQNTL-KPQSIKLKGTIQGVPVLILIDSGATHNFISYPLVHKMNWE 258

Query: 221 ITEAPPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLGMAWLTQIG 400
           I E PP  IKLGDG  +KT G C ++ + ++ I + +N  LF+L  +D+VLGM WL  +G
Sbjct: 259 IEETPPMNIKLGDGSCSKTKGSCVNLGVSIEDIPFRLNAQLFELGVVDMVLGMEWLQTLG 318

Query: 401 ATWIDWIQKKMRFDYQSEWVEIQGVRSKE--CEPLQQYVDDNHFAQLKCVSQEAVVTKSQ 574
              ++W +  M F Y  +WV ++G+  +      LQ  V       +K       +  +Q
Sbjct: 319 DMIVNWNKHTMSFWYHKQWVTLKGMEDQHGLMHSLQSIVCSKGMNCMKGGGSTQTLGVNQ 378

Query: 575 QIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYPHHHKSEIEKQVQ 754
             E++++L+R+  VF+EP+GLPP+R++EH I L EG+G +NV PYRYPHHHK+EIE+QV+
Sbjct: 379 SRELENLLNRYAEVFQEPKGLPPKREKEHVITLKEGEGAVNVRPYRYPHHHKNEIERQVK 438

Query: 755 ELLLTGVIRPSQSA 796
           E++  G+IR S SA
Sbjct: 439 EMVEAGIIRHSTSA 452


>dbj|GAU28744.1| hypothetical protein TSUD_372530 [Trifolium subterraneum]
          Length = 1462

 Score =  214 bits (546), Expect = 9e-60
 Identities = 113/265 (42%), Positives = 169/265 (63%), Gaps = 13/265 (4%)
 Frame = +2

Query: 41   DEEL--QLNVMSFNGLMENANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRRLALAMG 214
            DEE   +++VMS + L     +++ ++KL+ TI GVPV++LVDSGAT NF+++ +   +G
Sbjct: 323  DEEQGGEMSVMSISELEGFQREKIQTLKLRATINGVPVVVLVDSGATHNFIAKSMVQKLG 382

Query: 215  LHITEAPPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLGMAWLTQ 394
              +   P   IKLGDGF   T G+C  ++     +SWEI   LFDLDG+D+V+GMAWL  
Sbjct: 383  WQVENTPDFRIKLGDGFQTITRGKCPQILFKTGEVSWEIEAYLFDLDGVDVVVGMAWLKS 442

Query: 395  IGATWIDWIQKKMRFDYQSEWVEIQGVR-SKECEPLQQYVDDNHFAQL--KCVSQEAVVT 565
            +G   ++W ++ M F ++  WV ++G+  + E  P  Q V          K  S EA + 
Sbjct: 443  LGDMIVNWKKQTMEFWHEGNWVMLKGIEGTAEAIPALQSVVGRASKGYGKKWWSLEADLN 502

Query: 566  KS--------QQIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYPH 721
             +        +  E++S+L+ FE+VF+EP+GLPP R ++HAI+LV GQGP+NV PYRYPH
Sbjct: 503  NTEGKLEPIIEHPELKSILESFENVFQEPKGLPPCRSRDHAINLVSGQGPVNVRPYRYPH 562

Query: 722  HHKSEIEKQVQELLLTGVIRPSQSA 796
            H K+EIE+QV+E+L  G+I+ S SA
Sbjct: 563  HQKNEIERQVKEMLEGGIIQHSGSA 587


>dbj|GAU12466.1| hypothetical protein TSUD_229990, partial [Trifolium subterraneum]
          Length = 1303

 Score =  211 bits (537), Expect = 1e-58
 Identities = 115/265 (43%), Positives = 157/265 (59%), Gaps = 13/265 (4%)
 Frame = +2

Query: 41  DEELQLNVMSFNGLMENANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRRLALAMGLH 220
           +EE +   M    L   A +   ++K QG I GVPVL++VDSGAT NF+S+RL   M   
Sbjct: 129 NEEEEGGEMCILNLNHIAFENHQTVKFQGQIQGVPVLVMVDSGATHNFISQRLVHKMEWP 188

Query: 221 ITEAPPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLGMAWLTQIG 400
           + E P   IKLGDG    T G C  + + ++  +      LF+L GID+VLGM WL  +G
Sbjct: 189 VEETPMMNIKLGDGCHKSTRGVCGGLELQIRNFTISPKLHLFELGGIDIVLGMEWLKTLG 248

Query: 401 ATWIDWIQKKMRFDYQSEWVEIQGVRSKECEPLQQYVDDNHFAQLKCVSQEAV------- 559
              ++W ++ M F  +  WV +QG+  +E   +      +  ++ K   QE +       
Sbjct: 249 DMIVNWRKQTMSFWSEKRWVTLQGISGQEKSSVAL---QSILSKPKLTDQEVLWGLDIQE 305

Query: 560 ------VTKSQQIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYPH 721
                 +TK QQ+E+  VL +FE VFKEP GLPP R +EHAI+LVEG G +NV PYRYPH
Sbjct: 306 KKELHGLTKQQQLELNKVLVQFEGVFKEPTGLPPRRDKEHAINLVEGHGTVNVRPYRYPH 365

Query: 722 HHKSEIEKQVQELLLTGVIRPSQSA 796
           HHK+EIEKQVQE+L  G+IRPS S+
Sbjct: 366 HHKNEIEKQVQEMLSAGIIRPSTSS 390


>dbj|GAU48361.1| hypothetical protein TSUD_282420 [Trifolium subterraneum]
          Length = 1352

 Score =  211 bits (537), Expect = 1e-58
 Identities = 113/255 (44%), Positives = 162/255 (63%), Gaps = 7/255 (2%)
 Frame = +2

Query: 53   QLNVMSFNGLMENANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRRLALAMGLHITEA 232
            +LN+MS   L + +  +  SI+L+G IGGVPV +LVDSGAT NF+ +RL   M   + ++
Sbjct: 379  ELNIMSLLQLGQLSASKPQSIQLKGAIGGVPVAILVDSGATHNFIDKRLVQKMNWAVDDS 438

Query: 233  PPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLGMAWLTQIGATWI 412
                IKLGDG  A + G C D+ I V G+   I   LF+L G+D+VLG+ WL  +G   +
Sbjct: 439  TSMCIKLGDGSRAHSIGVCPDLKIDVDGVQLAIQAHLFELGGVDIVLGVDWLRTLGDIIM 498

Query: 413  DWIQKKMRFDYQSEWVEIQGVRSKECEPLQQYVDDNH---FAQLKCVSQ----EAVVTKS 571
            +W +  M F Y+ +WV  QG+ +++ E L   V  ++      ++ V Q    E  +   
Sbjct: 499  NWTKHTMSFWYKQKWVTFQGL-NEDMEALNSIVSCSNRRGKGWMRSVEQKRGNENDLNIG 557

Query: 572  QQIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYPHHHKSEIEKQV 751
            QQ+E++ +LD++E +FKEP+GLPP R++EH I+L EG   +NV PYRYPHHHK+EIE QV
Sbjct: 558  QQLELEGLLDKYEDIFKEPRGLPPRREKEHVINLKEGHDAVNVRPYRYPHHHKNEIETQV 617

Query: 752  QELLLTGVIRPSQSA 796
            QELL  GVIR S S+
Sbjct: 618  QELLTAGVIRHSTSS 632


>dbj|GAU37691.1| hypothetical protein TSUD_164940 [Trifolium subterraneum]
          Length = 1542

 Score =  205 bits (521), Expect = 2e-56
 Identities = 110/254 (43%), Positives = 156/254 (61%), Gaps = 19/254 (7%)
 Frame = +2

Query: 92   ANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRRLALAMGLHITEAPPRPIKLGDGFTA 271
            A+D  +++K QG IGGV VL+LVDSGAT NF+S++L   M   I + P   +KLGDGF  
Sbjct: 407  AHDTHHTVKFQGYIGGVEVLILVDSGATHNFISQKLVHQMEWPIEDTPEMKVKLGDGFQT 466

Query: 272  KTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLGMAWLTQIGATWIDWIQKKMRFDYQS 451
             T G C  + + +       N  LF+L GID+VLG+ WL  +G T ++W ++ M F ++ 
Sbjct: 467  ATKGVCKGLGMFIGDFQLSPNMHLFELGGIDVVLGIEWLKTLGDTIMNWKKQTMSFWWEG 526

Query: 452  EWVEIQGVRS--KECEPLQQYV-----------------DDNHFAQLKCVSQEAVVTKSQ 574
             WV ++G     K+   LQ  +                 + N   +   +SQ+  +T+ Q
Sbjct: 527  RWVTLRGKEGCQKQIVALQSILNRPKPNLQGVLWELEKGEPNTMKKQLIISQQ--LTRQQ 584

Query: 575  QIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYPHHHKSEIEKQVQ 754
            Q E+++VL ++ESVF EP GLPP+R  EHAI LVEGQ  ++V PYRYPHHHK+EIEKQ++
Sbjct: 585  QTELEAVLKKYESVFNEPSGLPPKRAMEHAIRLVEGQDAVSVRPYRYPHHHKNEIEKQIK 644

Query: 755  ELLLTGVIRPSQSA 796
            ++L TGVIR S SA
Sbjct: 645  DMLATGVIRHSTSA 658


>gb|PNX57013.1| hypothetical protein L195_g050182, partial [Trifolium pratense]
          Length = 313

 Score =  192 bits (487), Expect = 3e-56
 Identities = 108/263 (41%), Positives = 157/263 (59%), Gaps = 12/263 (4%)
 Frame = +2

Query: 44  EELQLNVMSFNGLMENANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRRLALAMGLHI 223
           E+ +   MS   L   A++  +++K QGTI GV VL+LVDSGAT NF+S++L   M   +
Sbjct: 35  EDEEKGEMSILNLHHIAHETHHTMKFQGTIHGVEVLILVDSGATHNFISQKLVHHMDWPV 94

Query: 224 TEAPPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLGMAWLTQIGA 403
                  +KLG+G    T G C  V + +     +    LF+L GID+VLG+ WL  +G 
Sbjct: 95  ETTTQMNVKLGNGLQVATQGVCRKVEMCIGDFKLKPTMHLFELGGIDVVLGIEWLKTLGD 154

Query: 404 TWIDWIQKKMRFDYQSEWVEIQGVRS--KECEPLQQYVD------DNHFAQLKCVSQ--- 550
           T I+W Q+ M F    +W+ +QG     +    LQ  +        N   +L  V     
Sbjct: 155 TIINWKQQTMSFWQDKKWMTLQGTGGCRQSTVSLQSILSKARPNTQNMMWELNEVKTKGG 214

Query: 551 -EAVVTKSQQIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYPHHH 727
            E+ ++  QQ E+ ++L +++SVF+ P GLPP+R ++HAI+L+EGQG +NV PYRYPHHH
Sbjct: 215 GESELSVQQQKEIDALLLKYDSVFQTPSGLPPKRSKDHAINLIEGQGAVNVRPYRYPHHH 274

Query: 728 KSEIEKQVQELLLTGVIRPSQSA 796
           K+EIEKQ++E+L TGVIR S SA
Sbjct: 275 KNEIEKQIKEMLATGVIRHSTSA 297


>dbj|GAU17298.1| hypothetical protein TSUD_110150 [Trifolium subterraneum]
          Length = 1558

 Score =  196 bits (499), Expect = 2e-53
 Identities = 106/260 (40%), Positives = 162/260 (62%), Gaps = 6/260 (2%)
 Frame = +2

Query: 35   TGDE-ELQLNVMSFNGLMENANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRRLALAM 211
            +GDE + +++ M+   L E   +RL ++KL   I GVPV++LVD GAT NF++R L   M
Sbjct: 387  SGDELDGEISAMNLYELGEVQRERLQTLKLAAMINGVPVVVLVDCGATHNFIARPLVEKM 446

Query: 212  GLHITEAPPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLGMAWLT 391
            G  +   P   IKLGDGF   T G+C+ +++    +S  I+  LF+L G+D+VLGMAWL 
Sbjct: 447  GWKVEATPAFNIKLGDGFQTVTRGKCNQILLTTGEVSCNIDAYLFELKGVDVVLGMAWLK 506

Query: 392  QIGATWIDWIQKKMRFDYQSEWVEIQGVRS--KECEPLQQYV---DDNHFAQLKCVSQEA 556
             +G   ++W ++ M F +  +WV ++G+    +    LQ  +    + + ++   + +E 
Sbjct: 507  TLGDMVVNWKKQTMEFWHDKKWVTLKGMEGTPEAISALQNVIGKASNGYESKGWSLDREG 566

Query: 557  VVTKSQQIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYPHHHKSE 736
               KS    +  V+  FE VF EP+GLPP+R ++HAI L+ GQGP++V PYRYP+H K+E
Sbjct: 567  RGDKS----LDQVIQAFEDVFCEPKGLPPQRARDHAITLLPGQGPVSVRPYRYPYHQKNE 622

Query: 737  IEKQVQELLLTGVIRPSQSA 796
            IEKQV+EL+ T VI+ S SA
Sbjct: 623  IEKQVKELMSTRVIQQSNSA 642


>dbj|GAU31427.1| hypothetical protein TSUD_221980 [Trifolium subterraneum]
          Length = 1344

 Score =  196 bits (497), Expect = 3e-53
 Identities = 99/228 (43%), Positives = 139/228 (60%)
 Frame = +2

Query: 113  IKLQGTIGGVPVLMLVDSGATQNFMSRRLALAMGLHITEAPPRPIKLGDGFTAKTHGECH 292
            +  QG + GVPVL+L+DSGAT NF+S++L   MG  + E P   IKLGDGF + T G C 
Sbjct: 389  LAFQGEVCGVPVLILIDSGATHNFISQKLVKKMGWEVEETPLMNIKLGDGFQSNTKGVCR 448

Query: 293  DVIIMVQGISWEINTMLFDLDGIDLVLGMAWLTQIGATWIDWIQKKMRFDYQSEWVEIQG 472
             + + +          LF+L GID+VLG+ WL  +G   I+W Q+ M F     WV ++G
Sbjct: 449  SLEMKIGDFPLTPTMHLFELGGIDVVLGIEWLKTLGDMIINWRQQTMSFWSNKRWVTLKG 508

Query: 473  VRSKECEPLQQYVDDNHFAQLKCVSQEAVVTKSQQIEMQSVLDRFESVFKEPQGLPPERQ 652
            +            D+ H        +E  +T+ QQ E++ +L R+++VF+EP GLPP R 
Sbjct: 509  IDG----------DNEH-------EEEEELTEGQQKELEELLHRYQNVFREPTGLPPRRN 551

Query: 653  QEHAIHLVEGQGPINVSPYRYPHHHKSEIEKQVQELLLTGVIRPSQSA 796
            +EH I+LVE    +NV PYRYPHHHK+EIE+Q+QE+L  G+IR S SA
Sbjct: 552  KEHIINLVENHSAVNVRPYRYPHHHKNEIERQIQEMLTVGIIRHSTSA 599


>gb|PNX93254.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 1534

 Score =  194 bits (493), Expect = 1e-52
 Identities = 107/266 (40%), Positives = 159/266 (59%), Gaps = 14/266 (5%)
 Frame = +2

Query: 41   DEELQ--LNVMSFNGLMENANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRRLALAMG 214
            +EE+Q  +++M+F  L          IKLQGTI  VPV++LVDSGA+ NF+S+ L   M 
Sbjct: 375  EEEVQGEMSLMNFCQLSNTGRSMPQVIKLQGTIQEVPVVILVDSGASHNFISQNLVHKMN 434

Query: 215  LHITEAPPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLGMAWLTQ 394
            L + +     IKLGDGF +KT G C ++ I ++G+   ++  LF+L  +D++LG+ WL  
Sbjct: 435  LTVNDDAALNIKLGDGFCSKTKGTCSNLEIDIKGLKVTVDVQLFELGCVDVILGIEWLRT 494

Query: 395  IGATWIDWIQKKMRFDYQSEWVEIQGVRSK----------ECEPLQQYVDDNHFAQLK-- 538
            +G   ++W +  M F    EWV ++G+ S            C+P  +       A++K  
Sbjct: 495  LGDMIVNWKKHTMSFWLNKEWVTLKGMESSLNMMDTLHSVLCKPKLKRSTGGEEAKVKVS 554

Query: 539  CVSQEAVVTKSQQIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYP 718
            C    ++  + Q  E++ +L  +  VF++P+GLPP+R +EH I L EG G INV PYRYP
Sbjct: 555  CGVLHSLEVE-QSRELEHLLSLYADVFQDPKGLPPKRNKEHVITLREGAGAINVRPYRYP 613

Query: 719  HHHKSEIEKQVQELLLTGVIRPSQSA 796
            HHHK EIEKQV E+L  G++RPS SA
Sbjct: 614  HHHKDEIEKQVGEMLQAGIVRPSTSA 639


>gb|PNX97977.1| hypothetical protein L195_g021217, partial [Trifolium pratense]
          Length = 1299

 Score =  194 bits (492), Expect = 2e-52
 Identities = 104/265 (39%), Positives = 162/265 (61%), Gaps = 13/265 (4%)
 Frame = +2

Query: 41  DEEL--QLNVMSFNGLMENANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRRLALAMG 214
           DEE+  ++++MSF  L ++   +  SI+L+GTI  VPV +L+DSGAT NF+S  L   M 
Sbjct: 162 DEEVDGEMSMMSFQQLGQHDYIKPQSIRLKGTIHEVPVSILIDSGATHNFISHHLVHKMN 221

Query: 215 LHITEAPPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLGMAWLTQ 394
             +   P   IKLGDG  +KT G C ++ + V+G+   ++  LF+L  +D+VLG+ WL  
Sbjct: 222 WSVDNTPSMRIKLGDGSCSKTTGRCVNLEVDVEGVPIVVDVQLFELGDVDMVLGIEWLRT 281

Query: 395 IGATWIDWIQKKMRFDYQSEWVEIQGVRSK--ECEPLQQYVDDNHFA---------QLKC 541
           +G   ++W +  M F Y  +WV ++G+  +    + LQ  V  +  +         ++K 
Sbjct: 282 LGDMIVNWEKHTMSFWYHKKWVTLRGIEGRWDVRDTLQSIVCKSQRSCVGWWKDREKMKE 341

Query: 542 VSQEAVVTKSQQIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYPH 721
                 +   Q  ++ ++L+ +  VF+EP GLPP+R++EH I L EG+G +NV PYRYPH
Sbjct: 342 EGSFLTLEVGQARDLDNLLNVYVGVFQEPTGLPPKRKKEHVITLKEGEGAVNVRPYRYPH 401

Query: 722 HHKSEIEKQVQELLLTGVIRPSQSA 796
           HHK+EIEKQVQE++ TG+IR S S+
Sbjct: 402 HHKNEIEKQVQEMMKTGIIRHSTSS 426


>dbj|GAU42300.1| hypothetical protein TSUD_136860 [Trifolium subterraneum]
          Length = 1523

 Score =  194 bits (492), Expect = 2e-52
 Identities = 107/264 (40%), Positives = 160/264 (60%), Gaps = 12/264 (4%)
 Frame = +2

Query: 41   DEELQLNVMSFNGLMENANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRRLALAMGLH 220
            +EE     MS   L    ++  +++K QGTI GV VL+LVDSGAT NF+S++L   M   
Sbjct: 375  EEEEGKGEMSILNLHHIVHETHHTMKFQGTIHGVEVLILVDSGATHNFISQKLVHQMDWL 434

Query: 221  ITEAPPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLGMAWLTQIG 400
            +   P   +KLG+G    T G C D+ + ++    +    LF+L GID+VLG+ WL  +G
Sbjct: 435  VDATPHLNVKLGNGVQVATQGVCRDLEVCIEEFKLKPELHLFELGGIDVVLGIEWLKTLG 494

Query: 401  ATWIDWIQKKMRFDYQSEWVEIQG----------VRSKECEPLQQYVDDNHF--AQLKCV 544
             T  +W ++ M F +  +W+ +QG          ++S    P      D  +  ++ K  
Sbjct: 495  DTITNWKKQIMSFWWDKKWITLQGQGGCRRSAVALQSILSRPKPSTEQDFFWEASKAKKK 554

Query: 545  SQEAVVTKSQQIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYPHH 724
            S EA +T  QQ E++++L + ESVF+ P+GLPP+R ++HAI+L+EGQ  +NV PYRYPHH
Sbjct: 555  SSEAHLTVHQQQELEALLGKHESVFQSPKGLPPKRIKDHAINLIEGQTAVNVRPYRYPHH 614

Query: 725  HKSEIEKQVQELLLTGVIRPSQSA 796
            HK+EIE+QV+E+L  G+IR S SA
Sbjct: 615  HKNEIERQVKEMLSAGIIRHSTSA 638


>gb|ABN06064.1| RNA-directed DNA polymerase (Reverse transcriptase); Chromo; Zinc
           finger, CCHC-type; Peptidase aspartic, active site;
           Polynucleotidyl transferase, Ribonuclease H fold
           [Medicago truncatula]
          Length = 1297

 Score =  193 bits (490), Expect = 3e-52
 Identities = 106/263 (40%), Positives = 154/263 (58%), Gaps = 11/263 (4%)
 Frame = +2

Query: 41  DEELQLNVMSFNGLMENANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRRLALAMGLH 220
           DEE     M         + R  SIKL G I  VPV++LVDSGAT NF+S++L   M   
Sbjct: 155 DEEEGDGEMCMMEFFHLGHSRPQSIKLMGVIKEVPVVVLVDSGATHNFISQQLVHKMNWA 214

Query: 221 ITEAPPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLGMAWLTQIG 400
           + + P   IKLGDG  +KT G C  + + V  +  EI+  LFDL G+D+VLG+ WL  +G
Sbjct: 215 VVDTPCMSIKLGDGSYSKTKGTCEGLEVDVGDVHLEIDAQLFDLGGVDMVLGIEWLRTLG 274

Query: 401 ATWIDWIQKKMRFDYQSEWVEIQGVRSK--ECEPLQQYVDDNHFAQL-------KCVSQE 553
              ++W ++ M F +  +WV ++G+ ++      LQ  +  +            KC    
Sbjct: 275 DMIVNWNKQTMSFWHNKKWVTVKGMDTQGGAIATLQSIICKSRRRSTGWWTYEDKCKEDG 334

Query: 554 AVVT--KSQQIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYPHHH 727
           ++ T    Q  E++ +L+ +  VF+EP GLPP+R++EH I L EG+G +NV PYRYPHHH
Sbjct: 335 SIHTLASEQSRELELLLENYGGVFQEPTGLPPKRKKEHVITLKEGEGAVNVRPYRYPHHH 394

Query: 728 KSEIEKQVQELLLTGVIRPSQSA 796
           K+EIEKQV+E+L  G+IR S S+
Sbjct: 395 KNEIEKQVREMLQAGIIRHSTSS 417


>ref|XP_014624207.1| PREDICTED: uncharacterized protein LOC106796443 [Glycine max]
          Length = 1152

 Score =  192 bits (489), Expect = 4e-52
 Identities = 104/246 (42%), Positives = 150/246 (60%), Gaps = 11/246 (4%)
 Frame = +2

Query: 92   ANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRRLALAMGLHITEAPPRPIKLGDGFTA 271
            A +  +++K QG + GVPVL+LVDSGAT NF+S++L   M   + + P   IKLGDG+  
Sbjct: 527  AQENHHTVKFQGIVRGVPVLILVDSGATHNFISQKLVYKMDWPVDDTPEMRIKLGDGYQT 586

Query: 272  KTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLGMAWLTQIGATWIDWIQKKMRFDYQS 451
             T G C  + + +   +   +  LF+L GID+VLGM WL  +G T I+W ++ M F    
Sbjct: 587  ITKGICKKLEMSIGDFTLSPDLHLFELGGIDVVLGMEWLKTLGDTIINWRKQTMSFWMDK 646

Query: 452  EWVEIQGVRS--KECEPLQ-------QYVDDNHFAQLKCV--SQEAVVTKSQQIEMQSVL 598
             WV +QG+ +  +    LQ       Q V    +   K     +  ++T  QQ E++ +L
Sbjct: 647  HWVTLQGLGNCRESMVALQSILRKSKQEVHGGFWGMEKHEQRKENQILTPGQQEELERLL 706

Query: 599  DRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYPHHHKSEIEKQVQELLLTGVI 778
             +F  VF+EP GLPP R +EHAI+L+EGQ  +NV PYRYPHHHK+EIE+QV+E+L  G+I
Sbjct: 707  HKFSQVFQEPTGLPPIRGKEHAINLMEGQNAVNVRPYRYPHHHKNEIERQVKEMLAAGII 766

Query: 779  RPSQSA 796
            R S S+
Sbjct: 767  RHSTSS 772


>gb|PNX92072.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1498

 Score =  191 bits (485), Expect = 1e-51
 Identities = 110/268 (41%), Positives = 158/268 (58%), Gaps = 16/268 (5%)
 Frame = +2

Query: 41   DEELQLNVMSFNGLME---NANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRRLALAM 211
            +EE+   VMS   L +   +   +   IKL+GTI  VPV++L+DSGA+ NF+S+ L   M
Sbjct: 359  EEEVTQGVMSMMNLHQLDRHGQSKPQVIKLKGTIHEVPVVILIDSGASHNFISQGLVRKM 418

Query: 212  GLHITEAPPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLGMAWLT 391
            G  I ++ P  IKLGDG  + T G C  + I V+G+  E++  LF+L  +D+VLG+ WL 
Sbjct: 419  GWDIEDSCPMSIKLGDGSCSNTKGTCRGLEINVEGMKVEVDVQLFELGCVDVVLGIEWLR 478

Query: 392  QIGATWIDWIQKKMRFDYQSEWVEIQGVR----------SKECEPLQQYVDDNHFAQLKC 541
             +G   ++W +  M F Y  +WV ++G+           S  C+P +   +     + K 
Sbjct: 479  TLGDMIVNWQKHTMSFWYNKQWVTMRGIEGHLNLMDTLYSVICKPKRHRANRRKEEEEKT 538

Query: 542  ---VSQEAVVTKSQQIEMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYR 712
               V Q   V +S+ +E   +L     VF++P GLPP+R++EHAI L EG   +NV PYR
Sbjct: 539  SCGVFQTLKVDQSEALE--HLLSLCADVFQDPVGLPPKRKKEHAIVLKEGAEAVNVRPYR 596

Query: 713  YPHHHKSEIEKQVQELLLTGVIRPSQSA 796
            YPHHHK EIEKQV+E+L  GVIRPS SA
Sbjct: 597  YPHHHKDEIEKQVKEMLSAGVIRPSTSA 624


>ref|XP_017431852.1| PREDICTED: uncharacterized protein LOC108339224 [Vigna angularis]
          Length = 835

 Score =  188 bits (477), Expect = 6e-51
 Identities = 96/251 (38%), Positives = 153/251 (60%), Gaps = 2/251 (0%)
 Frame = +2

Query: 47   ELQLNVMSFNGLMENANDRLNSIKLQGTIGGVPVLMLVDSGATQNFMSRRLALAMGLHIT 226
            E++   M  +GL      + N++KLQG + G  VL+L+DSGAT +F+S RL   +GL  T
Sbjct: 349  EVEQKAMELSGLSAGGLSQSNTMKLQGWVQGKRVLVLIDSGATHSFISNRLVEELGLECT 408

Query: 227  EAPPRPIKLGDGFTAKTHGECHDVIIMVQGISWEINTMLFDLDGIDLVLGMAWLTQIGAT 406
            +  P  + LGDG   +T G C  V ++++ +       LF+L G+D++LGM WL+ +G  
Sbjct: 409  DTRPYKVCLGDGQRKETSGNCTGVSVLLENLEVRDKLYLFELGGVDIILGMTWLSSLGEI 468

Query: 407  WIDWIQKKMRFDYQSEWVEIQGVRSKECEPL--QQYVDDNHFAQLKCVSQEAVVTKSQQI 580
             +DW Q  M+  +    VE++G  S     +  +  + +     ++ +S E+ + + +++
Sbjct: 469  KVDWGQLIMKVAHGGREVEVKGDPSLTHRVVTPEALIKEK---GIEMLSLESGLMQEEEV 525

Query: 581  EMQSVLDRFESVFKEPQGLPPERQQEHAIHLVEGQGPINVSPYRYPHHHKSEIEKQVQEL 760
            E++ +L  FE VF++PQGLPPER+ +H I L EG   +NV PYRYPH  K+EIE+QV+E+
Sbjct: 526  ELEQILSAFEGVFRDPQGLPPERRVDHRIPLKEGSEAVNVRPYRYPHGMKAEIERQVEEM 585

Query: 761  LLTGVIRPSQS 793
            L  G+IRPS S
Sbjct: 586  LNLGIIRPSNS 596

