BLASTX nr result

ID: Astragalus23_contig00015240 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00015240
         (1160 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KYP74405.1| Transposon Ty3-I Gag-Pol polyprotein, partial [Ca...   644   0.0  
gb|KYP73703.1| Retrotransposable element Tf2, partial [Cajanus c...   620   0.0  
gb|KYP70257.1| Retrovirus-related Pol polyprotein from transposo...   612   0.0  
gb|PNY04892.1| hypothetical protein L195_g001324 [Trifolium prat...   627   0.0  
gb|KYP47823.1| Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan]   594   0.0  
gb|KYP33590.1| Retrotransposable element Tf2 [Cajanus cajan]          595   0.0  
dbj|GAU25155.1| hypothetical protein TSUD_150540 [Trifolium subt...   631   0.0  
gb|PNY12616.1| hypothetical protein L195_g009250 [Trifolium prat...   630   0.0  
gb|KYP31777.1| Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan]   591   0.0  
ref|XP_014629745.1| PREDICTED: uncharacterized protein LOC106798...   628   0.0  
ref|XP_014621692.1| PREDICTED: uncharacterized protein LOC106795...   628   0.0  
gb|PNY12480.1| hypothetical protein L195_g009111 [Trifolium prat...   627   0.0  
ref|XP_014622146.1| PREDICTED: uncharacterized protein LOC106795...   626   0.0  
gb|KYP51742.1| Retrovirus-related Pol polyprotein from transposo...   583   0.0  
ref|XP_014617324.1| PREDICTED: uncharacterized protein LOC106794...   625   0.0  
gb|PNY03357.1| hypothetical protein L195_g026684, partial [Trifo...   585   0.0  
dbj|GAU21695.1| hypothetical protein TSUD_242670 [Trifolium subt...   625   0.0  
ref|XP_014629757.1| PREDICTED: uncharacterized protein LOC106798...   628   0.0  
gb|PNX77934.1| hypothetical protein L195_g033907 [Trifolium prat...   585   0.0  
dbj|GAU45320.1| hypothetical protein TSUD_84370 [Trifolium subte...   616   0.0  

>gb|KYP74405.1| Transposon Ty3-I Gag-Pol polyprotein, partial [Cajanus cajan]
          Length = 743

 Score =  644 bits (1660), Expect = 0.0
 Identities = 290/386 (75%), Positives = 341/386 (88%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            EKL ++ ERPWFAD+AN+KAAG +PED  WQQ+KKF  D+ +Y+WDDPYL+K+GADGLLR
Sbjct: 307  EKLFSVQERPWFADMANFKAAGVIPEDLNWQQRKKFFKDSTYYVWDDPYLFKIGADGLLR 366

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCVSG E + ILWHCH+SP GGHYNG+RTAAK+LQSGF+WP+LFKD+H++ ++CDKCQRT
Sbjct: 367  RCVSGVEIRDILWHCHNSPYGGHYNGDRTAAKILQSGFYWPTLFKDSHEHCKNCDKCQRT 426

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            G IS++ E+PL  ILEVEVFDCWGIDF+GP P S++ EYILVAVDYVSKWVEA AT K D
Sbjct: 427  GGISKRHELPLQNILEVEVFDCWGIDFVGPLPSSYSNEYILVAVDYVSKWVEAVATQKVD 486

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
            ++TV+KF+KKNIF+RFG PRVLISDGGSHFCN QL  VL  YGV HKV TPYHPQ+NGQA
Sbjct: 487  ARTVIKFLKKNIFTRFGTPRVLISDGGSHFCNTQLKKVLEHYGVRHKVATPYHPQTNGQA 546

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            EVSNRE+KRILEKTV SSRKDW+LK+DD LWAYRTAYKTP+G+SP+QLVYGKACHLPVEL
Sbjct: 547  EVSNRELKRILEKTVASSRKDWALKLDDTLWAYRTAYKTPIGLSPFQLVYGKACHLPVEL 606

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
            EHKAYWALK LNF   A GEKR LQLH L+E RLQAYE+++IYK K+K+YHD+ I +++F
Sbjct: 607  EHKAYWALKALNFDLKAAGEKRKLQLHELEETRLQAYESSRIYKSKVKSYHDRKIVQRDF 666

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
             PGQQVLLFNSRL+LFPGKLKSKWSG
Sbjct: 667  QPGQQVLLFNSRLRLFPGKLKSKWSG 692


>gb|KYP73703.1| Retrotransposable element Tf2, partial [Cajanus cajan]
          Length = 646

 Score =  620 bits (1600), Expect = 0.0
 Identities = 285/386 (73%), Positives = 329/386 (85%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            EKLL + ERPWFADIAN+KAAG +PE+  W QKKKF HDAK Y+WDDP+L+K+G D LLR
Sbjct: 210  EKLLLVQERPWFADIANFKAAGVIPENLNWHQKKKFFHDAKQYIWDDPHLFKIGVDSLLR 269

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCVS  E+KSILWHCH+SP GGH+NGERT AKVLQSGF+WPSLFKDA  +V+  + CQRT
Sbjct: 270  RCVSNEEAKSILWHCHNSPYGGHFNGERTVAKVLQSGFYWPSLFKDAQLHVQHYNNCQRT 329

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            G ISR+ EMPLN ILE+EVFDCWGIDF+GP P SF+ EYILVAVDYVSKWVEA AT+K+D
Sbjct: 330  GGISRRNEMPLNNILEIEVFDCWGIDFVGPLPSSFSNEYILVAVDYVSKWVEAIATSKAD 389

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
            ++TV+KF+K+NIF RFG PRVLISDGGSHFCN QL   L  YGV HKV +PYHPQ+NGQA
Sbjct: 390  AKTVIKFLKRNIFCRFGTPRVLISDGGSHFCNKQLQKALKHYGVRHKVASPYHPQTNGQA 449

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            EVSNREIKRILEKTV  SRKDWS+K+DDALWAYRTAYK P G+SP+QLVYGKACHLPVEL
Sbjct: 450  EVSNREIKRILEKTVSFSRKDWSMKLDDALWAYRTAYKAPTGLSPFQLVYGKACHLPVEL 509

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
            E KA+WALKF N   +  G KR +QL+ L+EMR  AYE++KIYK+K KAYHDK I +K+F
Sbjct: 510  EQKAHWALKFFNLDPNVAGNKRKIQLNELEEMRRNAYESSKIYKEKAKAYHDKKILKKDF 569

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
             PGQ VLLFNS+L+LFPGKLKSKWSG
Sbjct: 570  QPGQSVLLFNSKLRLFPGKLKSKWSG 595


>gb|KYP70257.1| Retrovirus-related Pol polyprotein from transposon 412 family
            [Cajanus cajan]
          Length = 702

 Score =  612 bits (1578), Expect = 0.0
 Identities = 280/372 (75%), Positives = 322/372 (86%)
 Frame = +2

Query: 44   IANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLRRCVSGAESKSILWH 223
            +ANYKAAG +PE++ WQQKKKF  D+ +Y+WDDPYL+K+GADGLLRRCV G E + ILWH
Sbjct: 1    MANYKAAGVIPEEYNWQQKKKFFRDSHYYVWDDPYLFKIGADGLLRRCVFGVEIRDILWH 60

Query: 224  CHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRTGNISRKGEMPLNPI 403
            CH+SP GGHYNGERTAAKVLQSGF+WP+LF+DA+++ +SCDKCQRTG +S++ E+PL  I
Sbjct: 61   CHNSPYGGHYNGERTAAKVLQSGFYWPTLFRDAYEHCKSCDKCQRTGTVSKRHELPLQNI 120

Query: 404  LEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSDSQTVVKFIKKNIFS 583
            LEVEVFDCWGIDF+GP P SF  EYILVAVDYVSKWVEASA  K+D++TV+KF+KKNIF 
Sbjct: 121  LEVEVFDCWGIDFIGPLPSSFGNEYILVAVDYVSKWVEASAVQKADARTVIKFLKKNIFC 180

Query: 584  RFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQAEVSNREIKRILEKT 763
            RFG PRVLISDGGSHFCNAQL  VL  Y V HKV TPYHPQ+NGQAEVSNRE+KRILEKT
Sbjct: 181  RFGSPRVLISDGGSHFCNAQLQKVLEHYKVRHKVATPYHPQTNGQAEVSNRELKRILEKT 240

Query: 764  VGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVELEHKAYWALKFLNFT 943
            V SSRKDW+LK+DD LWAYRTA+K+P+G SP+QLVYGKACHLPVELEHKA+WALK LN+ 
Sbjct: 241  VASSRKDWALKLDDTLWAYRTAFKSPIGFSPFQLVYGKACHLPVELEHKAFWALKALNYD 300

Query: 944  TSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEFHPGQQVLLFNSRLK 1123
              A GEK  LQLH L+EMRLQAYE++K YK K K YHDK I  + F PGQQVLLFNSRLK
Sbjct: 301  LKAAGEKMKLQLHELEEMRLQAYESSKSYKHKAKIYHDKKILNRNFQPGQQVLLFNSRLK 360

Query: 1124 LFPGKLKSKWSG 1159
            LFPGKLKSKWSG
Sbjct: 361  LFPGKLKSKWSG 372


>gb|PNY04892.1| hypothetical protein L195_g001324 [Trifolium pratense]
          Length = 1381

 Score =  627 bits (1618), Expect = 0.0
 Identities = 280/386 (72%), Positives = 339/386 (87%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            EKLL + ERPWFAD+ANYKA+G +P+DF W QKK+FL  A  ++WDDPYL+K+GAD LLR
Sbjct: 945  EKLLMVQERPWFADMANYKASGLIPDDFNWHQKKRFLRIANQFVWDDPYLFKLGADNLLR 1004

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCV+  E+ SILWHCH+SP GGHYNGERTAAK+LQ+GF WP++FKD+++YV+SCD CQRT
Sbjct: 1005 RCVTKEEATSILWHCHNSPYGGHYNGERTAAKILQAGFFWPTVFKDSYEYVQSCDNCQRT 1064

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            G ISR+ EMPL  ILEVEVFDCWGIDF+GPFP S + EYILVAVDYVSKWVEA A+ K+D
Sbjct: 1065 GGISRRNEMPLQSILEVEVFDCWGIDFVGPFPSSLSNEYILVAVDYVSKWVEAIASPKAD 1124

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
             +TV+KF+K+NIF+RFG PRVLISDGGSHFCN+QL   L  YGV HK+ +PYHPQ+NGQA
Sbjct: 1125 GKTVIKFLKRNIFTRFGTPRVLISDGGSHFCNSQLAKALEHYGVKHKIASPYHPQTNGQA 1184

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            EVSNREIK+ILEKTV +SRKDWSLK+D+ALWAYRTA+K+P+G++P+Q++YGKACHLPVEL
Sbjct: 1185 EVSNREIKKILEKTVSTSRKDWSLKLDEALWAYRTAFKSPIGLTPFQMIYGKACHLPVEL 1244

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
            EHKA+WALKFLNF  +  GEKR  QLH L+EMR  AYE++K+YKQK+K+YHDK I +++F
Sbjct: 1245 EHKAFWALKFLNFDENQAGEKRKFQLHELEEMRFHAYESSKLYKQKVKSYHDKQIVKRDF 1304

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
             PGQ+VLLFNSRLKLFPGKLKSKWSG
Sbjct: 1305 QPGQKVLLFNSRLKLFPGKLKSKWSG 1330


>gb|KYP47823.1| Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan]
          Length = 548

 Score =  594 bits (1532), Expect = 0.0
 Identities = 271/365 (74%), Positives = 316/365 (86%)
 Frame = +2

Query: 65   GKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLRRCVSGAESKSILWHCHDSPCG 244
            G +PEDFTW QKKKFL DA +Y+WDDP+L+K G+DGLLRRCVS  E+KSILWHCH+SP G
Sbjct: 133  GVIPEDFTWHQKKKFLRDATYYVWDDPHLFKSGSDGLLRRCVSREEAKSILWHCHNSPYG 192

Query: 245  GHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRTGNISRKGEMPLNPILEVEVFD 424
            GH+NGERTAAKVLQ+GF+WP+LFKDAH + + CD CQR G I+RK EMPL  + EVEVFD
Sbjct: 193  GHFNGERTAAKVLQAGFYWPTLFKDAHSHAKQCDSCQRAGGITRKHEMPLQNMQEVEVFD 252

Query: 425  CWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSDSQTVVKFIKKNIFSRFGVPRV 604
            CWGIDF+GP P SF  EYILVAVDYVSKWVEA AT K+D +TVVKF+KK+IFSRFG PRV
Sbjct: 253  CWGIDFVGPLPSSFTNEYILVAVDYVSKWVEAIATPKADGKTVVKFLKKHIFSRFGTPRV 312

Query: 605  LISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQAEVSNREIKRILEKTVGSSRKD 784
            LISDGGSHFCNAQL   L  YGV HKV T YHPQ+NGQAEVSNREIKRILEKTV +SRKD
Sbjct: 313  LISDGGSHFCNAQLERALEHYGVHHKVATAYHPQTNGQAEVSNREIKRILEKTVSASRKD 372

Query: 785  WSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVELEHKAYWALKFLNFTTSACGEK 964
            WS K+D+ALWAYRT +K+P+G++P+QLVYGKACHLPVELEHKAYWA+K LNF   + GEK
Sbjct: 373  WSTKLDEALWAYRTTFKSPIGLTPFQLVYGKACHLPVELEHKAYWAVKLLNFDPLSTGEK 432

Query: 965  RILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEFHPGQQVLLFNSRLKLFPGKLK 1144
            R L+LH LDE+RLQAYE++K+YK+K+K+YHD+ I ++EF  GQ VLLFNSRLKLFPGKL+
Sbjct: 433  RKLELHALDELRLQAYESSKLYKEKVKSYHDRKILKREFRAGQSVLLFNSRLKLFPGKLR 492

Query: 1145 SKWSG 1159
            SKWSG
Sbjct: 493  SKWSG 497


>gb|KYP33590.1| Retrotransposable element Tf2 [Cajanus cajan]
          Length = 572

 Score =  595 bits (1533), Expect = 0.0
 Identities = 273/386 (70%), Positives = 326/386 (84%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            EKLLAI  RPWFAD+AN+KAAG +P+D  W Q+KKF  DAK+Y+WDDP+L+ +GAD LLR
Sbjct: 136  EKLLAIQARPWFADMANFKAAGVIPKDLNWHQRKKFFRDAKYYVWDDPHLFMIGADNLLR 195

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCV+  E+ +ILWHCH+SP GGH+NGERTA KVLQSGF WP+LF+DAH +V+ CD C+RT
Sbjct: 196  RCVTKEEAGNILWHCHNSPYGGHFNGERTAIKVLQSGFFWPTLFRDAHKHVQRCDNCERT 255

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            G+IS++ E PL  I EVEVFDCWG DF+GP P SF+ EYIL+AV+YVS+WVEA  T K+D
Sbjct: 256  GSISKRHERPLQNIQEVEVFDCWGNDFIGPLPLSFSNEYILLAVEYVSRWVEAIPTQKAD 315

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
            ++TVVKF+KKNIF+ FG PRVLISDGGSHFCN QL  VL  YGV HKV   YHPQ+NGQA
Sbjct: 316  AKTVVKFLKKNIFTWFGTPRVLISDGGSHFCNLQLQKVLEHYGVRHKVAIAYHPQTNGQA 375

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            EVSNREI +ILEKTV SSRKDWS+K++DALWAYRTA+KTP+G+SP+QL+YGKACHLPVEL
Sbjct: 376  EVSNREITKILEKTVTSSRKDWSIKLEDALWAYRTAFKTPIGLSPFQLIYGKACHLPVEL 435

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
            EHKA+WALKFLN    A GE R LQL  L+EMRL AYE +K YK+++KAYHDKNI ++ F
Sbjct: 436  EHKAFWALKFLNLDAKAAGEHRKLQLLELEEMRLNAYEFSKQYKERIKAYHDKNILKRNF 495

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
             PGQ VLLFNSR++LFPGKLKSKWSG
Sbjct: 496  KPGQSVLLFNSRMRLFPGKLKSKWSG 521


>dbj|GAU25155.1| hypothetical protein TSUD_150540 [Trifolium subterraneum]
          Length = 1788

 Score =  631 bits (1627), Expect = 0.0
 Identities = 284/386 (73%), Positives = 335/386 (86%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            EKLL + ERPWFAD+ANYKA+G +P+DF W QKK+FL  A  Y+WDDPYL+K+GAD LLR
Sbjct: 961  EKLLMVQERPWFADMANYKASGLIPDDFNWHQKKRFLRIANQYVWDDPYLFKIGADNLLR 1020

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCV+  E+ SILWHCH+SP GGHYNGERTAAK+LQ+GF WP+LFKD+++YV+SCD CQ+T
Sbjct: 1021 RCVTKEEATSILWHCHNSPYGGHYNGERTAAKILQAGFFWPTLFKDSYEYVQSCDNCQKT 1080

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            G ISR+ EMPL  ILEVEVFDCWGIDF+GPFP S + EYILVAVDYVSKWVEA A+ K+D
Sbjct: 1081 GGISRRNEMPLQSILEVEVFDCWGIDFVGPFPSSLSNEYILVAVDYVSKWVEAIASPKAD 1140

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
             +TV+KF+KKNIF+RFG PRVLISDGGSHFCN+QL   L  YGV HKV +PYHPQ+NGQA
Sbjct: 1141 GKTVIKFLKKNIFTRFGTPRVLISDGGSHFCNSQLAKALEHYGVKHKVASPYHPQTNGQA 1200

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            EVSNRE+K+ILEKTV +SRKDWSLK+D+ALWAYRTA+K+P+G++P+Q+VYGKACH PVEL
Sbjct: 1201 EVSNRELKKILEKTVSTSRKDWSLKLDEALWAYRTAFKSPIGLTPFQMVYGKACHFPVEL 1260

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
            EHKAYWALKFLNF  S  GEKR  QLH +DEMR  AYE++K+YK+K+K YHDK I  K F
Sbjct: 1261 EHKAYWALKFLNFDQSLAGEKRKTQLHEVDEMRFHAYESSKLYKEKVKEYHDKRIINKNF 1320

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
            HPGQ VLLFNSRLK+FPGKLKSKWSG
Sbjct: 1321 HPGQLVLLFNSRLKIFPGKLKSKWSG 1346


>gb|PNY12616.1| hypothetical protein L195_g009250 [Trifolium pratense]
          Length = 1814

 Score =  630 bits (1626), Expect = 0.0
 Identities = 289/386 (74%), Positives = 334/386 (86%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            EKLL + ERPWFAD+ANYKA+G +PED  W QKKKFL +A  Y+WDDPYL+K+GAD LLR
Sbjct: 1378 EKLLMMQERPWFADMANYKASGLIPEDLNWHQKKKFLRNANQYVWDDPYLFKIGADNLLR 1437

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCV+  E+ SILWHCH+SP GGHYNGERTAAKVLQSGF WP+LFKDA+ + + CDKCQ T
Sbjct: 1438 RCVTTEEATSILWHCHNSPYGGHYNGERTAAKVLQSGFFWPTLFKDAYQHAQKCDKCQMT 1497

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            G IS++ EMPL  IL VEVFDCWGIDF+GPFP SF+ EYILVAVDYVSKWVEA A+ K+D
Sbjct: 1498 GGISKRNEMPLQNILVVEVFDCWGIDFVGPFPSSFSNEYILVAVDYVSKWVEAIASPKAD 1557

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
             +TV+KF+KKNIF+RFG PRVLISDGGSHFCN+QL   L  YGV HKV +PYHPQ+NGQA
Sbjct: 1558 GKTVIKFLKKNIFTRFGTPRVLISDGGSHFCNSQLEKALEHYGVRHKVASPYHPQTNGQA 1617

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            EVSNREIKRILEKTV +SRKDWS K+DDALWAYRTA+K+P+G++P+Q+VYGKACHLPVEL
Sbjct: 1618 EVSNREIKRILEKTVSTSRKDWSSKLDDALWAYRTAFKSPIGLTPFQMVYGKACHLPVEL 1677

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
            EHKAYWALKFLNF     G+KR LQLH L+EMR QAYE++K+YK+K+K+YHDK I  KEF
Sbjct: 1678 EHKAYWALKFLNFDPCFSGDKRKLQLHELEEMRAQAYESSKLYKEKVKSYHDKKILSKEF 1737

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
             PGQ VLLFNSRLKLFPGKLKSKWSG
Sbjct: 1738 KPGQMVLLFNSRLKLFPGKLKSKWSG 1763


>gb|KYP31777.1| Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan]
          Length = 541

 Score =  591 bits (1523), Expect = 0.0
 Identities = 272/386 (70%), Positives = 327/386 (84%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            EKLLAI  RPWFAD+AN+KAAG +P+D  W Q+KKF  DAK+Y+WDDP+L+K+GAD LLR
Sbjct: 136  EKLLAIQARPWFADMANFKAAGVIPKDLNWHQRKKFFRDAKYYVWDDPHLFKIGADNLLR 195

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCV+  E+ +ILWHCH+SP GGH+NGERTA KVLQSGF WP+LF+DAH++V+ CD CQR 
Sbjct: 196  RCVTKEEAGNILWHCHNSPYGGHFNGERTAIKVLQSGFFWPTLFRDAHEHVQRCDNCQRI 255

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            G+IS+  EMPL  I EVEVF+CWGIDF+GP P SF+ EYIL+AV+YVS+WVEA  T K+D
Sbjct: 256  GSISKIHEMPLQNIQEVEVFECWGIDFIGPLPLSFSNEYILLAVEYVSRWVEAVPTQKAD 315

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
            ++ VVKF+K NIF+RFG PRVLISD GSHFCN QL  VL  YGV HKV T YHPQ+NGQA
Sbjct: 316  AKIVVKFLK-NIFTRFGTPRVLISDEGSHFCNVQLQKVLEHYGVRHKVATTYHPQTNGQA 374

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            EVSNREIK+ILEKTV SSRKDWS+K++DALWAYRT +KTP+G+SP+QL+YGKACHLPVEL
Sbjct: 375  EVSNREIKKILEKTVTSSRKDWSIKLEDALWAYRTTFKTPIGLSPFQLIYGKACHLPVEL 434

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
            EHKA+WALKFLN    A GE R LQL  L+EMRL AYE++K YK+++KAYHDK I ++ F
Sbjct: 435  EHKAFWALKFLNLDAKAAGEHRKLQLLELEEMRLNAYESSKQYKERIKAYHDKKILKRNF 494

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
             PG+ VLLFNSR++LFPGKLKSKWSG
Sbjct: 495  KPGKSVLLFNSRMRLFPGKLKSKWSG 520


>ref|XP_014629745.1| PREDICTED: uncharacterized protein LOC106798267 [Glycine max]
          Length = 1815

 Score =  628 bits (1620), Expect = 0.0
 Identities = 283/386 (73%), Positives = 334/386 (86%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            E LL +  RPWFAD+ANYKA G +PE++TW Q+KKFLHDA+ Y+WDDP+L+K GAD +LR
Sbjct: 1376 EFLLQVTTRPWFADMANYKATGVIPEEYTWNQRKKFLHDARFYVWDDPHLFKAGADNVLR 1435

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCV+  E++SILWHCH S  GGH++G+RTAAKVLQSGF WPSLFKDA+++VR CD+CQRT
Sbjct: 1436 RCVTKEEARSILWHCHSSSYGGHHSGDRTAAKVLQSGFFWPSLFKDAYEFVRCCDRCQRT 1495

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            G ISR+ EMPL  I+EVE+FDCWGIDFMGP P S+   YILVAVDYVSKWVEA AT K D
Sbjct: 1496 GGISRRNEMPLQNIMEVEIFDCWGIDFMGPLPSSYGNIYILVAVDYVSKWVEAIATPKDD 1555

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
            ++ V+KF+KKNIFSRFGVPR LISDGG+HFCN QL  VL  Y V HKV TPYHPQ+NGQA
Sbjct: 1556 ARVVIKFLKKNIFSRFGVPRALISDGGTHFCNNQLKKVLEHYNVRHKVATPYHPQTNGQA 1615

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            E+SNRE+KRILEKTV SSRKDW+LK+DD LWAYRTA+KTP+G+SP+QLVYGK+CHLPVEL
Sbjct: 1616 EISNRELKRILEKTVASSRKDWALKLDDTLWAYRTAFKTPIGLSPFQLVYGKSCHLPVEL 1675

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
            EHKAYWAL+ LNF  +ACGEKR LQL  L+EMRL AYE+++IYK++ KAYHDK + R+EF
Sbjct: 1676 EHKAYWALRLLNFDNNACGEKRKLQLQELEEMRLNAYESSRIYKERTKAYHDKKLQRREF 1735

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
             PGQQVLLFNSRL+LFPGKLKSKWSG
Sbjct: 1736 QPGQQVLLFNSRLRLFPGKLKSKWSG 1761


>ref|XP_014621692.1| PREDICTED: uncharacterized protein LOC106795614 [Glycine max]
          Length = 1816

 Score =  628 bits (1620), Expect = 0.0
 Identities = 283/386 (73%), Positives = 334/386 (86%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            E LL +  RPWFAD+ANYKA G +PE++TW Q+KKFLHDA+ Y+WDDP+L+K GAD +LR
Sbjct: 1377 EFLLQVTTRPWFADMANYKATGVIPEEYTWNQRKKFLHDARFYVWDDPHLFKAGADNVLR 1436

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCV+  E++SILWHCH S  GGH++G+RTAAKVLQSGF WPSLFKDA+++VR CD+CQRT
Sbjct: 1437 RCVTKEEARSILWHCHSSSYGGHHSGDRTAAKVLQSGFFWPSLFKDAYEFVRCCDRCQRT 1496

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            G ISR+ EMPL  I+EVE+FDCWGIDFMGP P S+   YILVAVDYVSKWVEA AT K D
Sbjct: 1497 GGISRRNEMPLQNIMEVEIFDCWGIDFMGPLPSSYGNIYILVAVDYVSKWVEAIATPKDD 1556

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
            ++ V+KF+KKNIFSRFGVPR LISDGG+HFCN QL  VL  Y V HKV TPYHPQ+NGQA
Sbjct: 1557 ARVVIKFLKKNIFSRFGVPRALISDGGTHFCNNQLKKVLEHYNVRHKVATPYHPQTNGQA 1616

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            E+SNRE+KRILEKTV SSRKDW+LK+DD LWAYRTA+KTP+G+SP+QLVYGK+CHLPVEL
Sbjct: 1617 EISNRELKRILEKTVASSRKDWALKLDDTLWAYRTAFKTPIGLSPFQLVYGKSCHLPVEL 1676

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
            EHKAYWAL+ LNF  +ACGEKR LQL  L+EMRL AYE+++IYK++ KAYHDK + R+EF
Sbjct: 1677 EHKAYWALRLLNFDNNACGEKRKLQLQELEEMRLNAYESSRIYKERTKAYHDKKLQRREF 1736

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
             PGQQVLLFNSRL+LFPGKLKSKWSG
Sbjct: 1737 QPGQQVLLFNSRLRLFPGKLKSKWSG 1762


>gb|PNY12480.1| hypothetical protein L195_g009111 [Trifolium pratense]
          Length = 1859

 Score =  627 bits (1617), Expect = 0.0
 Identities = 282/386 (73%), Positives = 335/386 (86%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            EKLL + ERPWFAD+ANYK +G +PEDF W QKK+FL +A  ++WDDPYL+K+GAD LLR
Sbjct: 1423 EKLLMVQERPWFADMANYKVSGLIPEDFNWHQKKRFLREANQFVWDDPYLFKIGADNLLR 1482

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCV+  E+ SILWHCH SP GGHYNGERTAAKVLQSGF+WP+LFKDAH+Y + CD CQ+T
Sbjct: 1483 RCVTREEATSILWHCHSSPYGGHYNGERTAAKVLQSGFYWPTLFKDAHEYSQRCDNCQKT 1542

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            G ISR+ EMPL  ILEVEVFDCWGIDF+GPFP SF+ EYILVAVDYVSKWVEA A+ K+D
Sbjct: 1543 GGISRRNEMPLQNILEVEVFDCWGIDFVGPFPSSFSYEYILVAVDYVSKWVEAVASPKAD 1602

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
             +TV+KF+KKNIF+RFG PRVLISDGGSHFCN+QL   L  YGV HKV +PYHPQ+NGQA
Sbjct: 1603 GKTVIKFLKKNIFTRFGTPRVLISDGGSHFCNSQLAKALEHYGVKHKVASPYHPQTNGQA 1662

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            EVSNRE+K+ILEKTV +SRKDWSLK+D+ALWAYRTA+K+P+G++P+Q+VYGK+CHLPVEL
Sbjct: 1663 EVSNREVKKILEKTVSTSRKDWSLKLDEALWAYRTAFKSPIGLTPFQMVYGKSCHLPVEL 1722

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
            EHKAYWALKFLNF  S  GEKR +Q+  L+EMR  AYE++K+YK+K+K+YHDK I  + F
Sbjct: 1723 EHKAYWALKFLNFDPSLAGEKRKMQMQELEEMRFHAYESSKLYKEKVKSYHDKRIIDRNF 1782

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
             PGQ+VLLFNSRLKLF GKLKSKWSG
Sbjct: 1783 QPGQKVLLFNSRLKLFSGKLKSKWSG 1808


>ref|XP_014622146.1| PREDICTED: uncharacterized protein LOC106795762 [Glycine max]
          Length = 1821

 Score =  626 bits (1614), Expect = 0.0
 Identities = 282/386 (73%), Positives = 333/386 (86%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            E LL +  RPWFAD+ANYKA G +PE++TW Q+KKFLHDA+ Y+WDDP+L+K GAD +LR
Sbjct: 1382 EFLLQVTTRPWFADMANYKATGVIPEEYTWNQRKKFLHDARFYVWDDPHLFKAGADNVLR 1441

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCV+  E++SILWHCH S  GGH++G+RTAAKVLQSGF WPSLFKDA+++VR CD+CQRT
Sbjct: 1442 RCVTKEEARSILWHCHSSSYGGHHSGDRTAAKVLQSGFFWPSLFKDAYEFVRCCDRCQRT 1501

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            G ISR+ EMPL  I+EVE+FDCWGIDFMGP P S+   YILVAVDYVSKWVEA AT K D
Sbjct: 1502 GGISRRNEMPLQNIMEVEIFDCWGIDFMGPLPSSYGNIYILVAVDYVSKWVEAIATPKDD 1561

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
            ++ V+KF+KKNIFSRFGVPR LISDGG+HFCN QL  VL  Y V HKV TPYHPQ+NGQA
Sbjct: 1562 ARVVIKFLKKNIFSRFGVPRALISDGGTHFCNNQLKKVLEHYNVRHKVATPYHPQTNGQA 1621

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            E+SNRE+KRILEKTV SS KDW+LK+DD LWAYRTA+KTP+G+SP+QLVYGK+CHLPVEL
Sbjct: 1622 EISNRELKRILEKTVASSMKDWALKLDDTLWAYRTAFKTPIGLSPFQLVYGKSCHLPVEL 1681

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
            EHKAYWAL+ LNF  +ACGEKR LQL  L+EMRL AYE+++IYK++ KAYHDK + R+EF
Sbjct: 1682 EHKAYWALRLLNFDNNACGEKRKLQLQELEEMRLNAYESSRIYKERTKAYHDKKLQRREF 1741

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
             PGQQVLLFNSRL+LFPGKLKSKWSG
Sbjct: 1742 QPGQQVLLFNSRLRLFPGKLKSKWSG 1767


>gb|KYP51742.1| Retrovirus-related Pol polyprotein from transposon 412 family
            [Cajanus cajan]
          Length = 423

 Score =  583 bits (1503), Expect = 0.0
 Identities = 264/372 (70%), Positives = 316/372 (84%)
 Frame = +2

Query: 44   IANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLRRCVSGAESKSILWH 223
            +AN+KAAG +PED  W Q+KK ++DAK Y+WDDP+L+K+GA+ LLRRCV+  E+K ILWH
Sbjct: 1    MANFKAAGVIPEDLNWHQRKKMINDAKLYVWDDPHLFKIGAENLLRRCVTKEEAKDILWH 60

Query: 224  CHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRTGNISRKGEMPLNPI 403
            CH+ P GGH+NGERTAAKVLQSGF WP+LFKDAH YV+ CD CQR+G IS++ EMPL  I
Sbjct: 61   CHNLPYGGHFNGERTAAKVLQSGFFWPTLFKDAHGYVQRCDSCQRSGTISKRHEMPLQNI 120

Query: 404  LEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSDSQTVVKFIKKNIFS 583
             EVEVFDCWGIDF+GP P SF+ EYIL+ V+YVS+WVEA  T K+D++TV+KF+KKNIF 
Sbjct: 121  QEVEVFDCWGIDFIGPLPTSFSNEYILLIVEYVSRWVEAIPTQKADAKTVIKFLKKNIFY 180

Query: 584  RFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQAEVSNREIKRILEKT 763
            RFG PRVLISDGGSHFCN QL   L  YGV HKV T YHPQ+NGQAEVSNREIKRILEKT
Sbjct: 181  RFGTPRVLISDGGSHFCNVQLKKALEHYGVRHKVATAYHPQTNGQAEVSNREIKRILEKT 240

Query: 764  VGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVELEHKAYWALKFLNFT 943
            V +SRKDW+ K+D+ALWAYRTA+KTP  +SP+QLVYGKACHLPVELEHKA+WALK LNF 
Sbjct: 241  VATSRKDWASKLDEALWAYRTAFKTPTSLSPFQLVYGKACHLPVELEHKAFWALKLLNFD 300

Query: 944  TSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEFHPGQQVLLFNSRLK 1123
              +CGEKR  QLH L+EMRL AY++++ YK+++KAYHDK I +++F PGQ VLLFNSRLK
Sbjct: 301  PKSCGEKRKSQLHELEEMRLNAYQSSRQYKERVKAYHDKKILKRDFRPGQSVLLFNSRLK 360

Query: 1124 LFPGKLKSKWSG 1159
            LFPGKL+SKWSG
Sbjct: 361  LFPGKLRSKWSG 372


>ref|XP_014617324.1| PREDICTED: uncharacterized protein LOC106794530 [Glycine max]
          Length = 1819

 Score =  625 bits (1613), Expect = 0.0
 Identities = 282/386 (73%), Positives = 333/386 (86%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            E LL +  RPWFAD+ANYKA G +PE++TW Q+KKFLHDA+ Y+WDDP+L+K GAD +LR
Sbjct: 1380 EFLLQVTTRPWFADMANYKATGVIPEEYTWNQRKKFLHDARFYVWDDPHLFKAGADNVLR 1439

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCV+  E++SILWHCH S  GGH++G+RTAAKVLQSGF WPSLFKDA+++VR CD+CQRT
Sbjct: 1440 RCVTKEEARSILWHCHSSSYGGHHSGDRTAAKVLQSGFFWPSLFKDAYEFVRCCDRCQRT 1499

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            G ISR+ EMPL  I+EVE+FDCWGIDFMGP P S+   YILVAVDYVSKWVEA AT K D
Sbjct: 1500 GGISRRNEMPLQNIMEVEIFDCWGIDFMGPLPSSYGNIYILVAVDYVSKWVEAIATPKDD 1559

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
            ++ V+KF+KKNIFSRFGVPR LISDGG+HFCN QL  VL  Y V HKV TPYHPQ+NGQA
Sbjct: 1560 ARVVIKFLKKNIFSRFGVPRALISDGGTHFCNNQLKKVLEHYNVRHKVATPYHPQTNGQA 1619

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            E+SNRE+KRILEKTV SSRKDW+LK+DD LWAYRTA+KTP+G+SP+QLVYGK+CHLPVEL
Sbjct: 1620 EISNRELKRILEKTVASSRKDWALKLDDTLWAYRTAFKTPIGLSPFQLVYGKSCHLPVEL 1679

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
             HKAYWAL+ LNF  +ACGEKR LQL  L+EMRL AYE+++IYK++ KAYHDK + R+EF
Sbjct: 1680 VHKAYWALRLLNFDNNACGEKRKLQLQELEEMRLNAYESSRIYKERTKAYHDKKLQRREF 1739

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
             PGQQVLLFNSRL+LFPGKLKSKWSG
Sbjct: 1740 QPGQQVLLFNSRLRLFPGKLKSKWSG 1765


>gb|PNY03357.1| hypothetical protein L195_g026684, partial [Trifolium pratense]
          Length = 498

 Score =  585 bits (1508), Expect = 0.0
 Identities = 269/386 (69%), Positives = 315/386 (81%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            E +LA+   PWFAD ANY     +P DFT QQ+KKFLHD K Y+WD+P+LYK G DGLLR
Sbjct: 62   EHILAVTVAPWFADFANYMVGKTIPSDFTSQQRKKFLHDCKFYVWDEPFLYKRGVDGLLR 121

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCV   E + +LWHCHDS  GGH++G+RTAAKVLQSG  WP+LFKDA  YV+ CD+CQRT
Sbjct: 122  RCVPEDEQEKVLWHCHDSSYGGHFSGDRTAAKVLQSGLFWPTLFKDAFTYVKRCDRCQRT 181

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            GNIS++ EMP NP+LEVE+FD WGIDFMGPFP S++  YILVAVDYVSKWVEA AT  +D
Sbjct: 182  GNISKRNEMPQNPVLEVEIFDVWGIDFMGPFPSSYSKTYILVAVDYVSKWVEAIATQTND 241

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
            +Q VV F+KKNIFSRFGVPR LISD G+HF N ++  +L KY V H++ TPYHPQ++GQ 
Sbjct: 242  AQVVVSFLKKNIFSRFGVPRALISDEGTHFLNRKMEALLRKYNVHHRIATPYHPQTSGQV 301

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            EVSNR+IK+ILEKTV SSRKDWSLK+DDALWAYRTA+KTP+GMSP+Q+VYGK+CHLP+EL
Sbjct: 302  EVSNRQIKQILEKTVNSSRKDWSLKLDDALWAYRTAFKTPIGMSPFQIVYGKSCHLPLEL 361

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
            EHKA WA KFLNF  S  GE RILQLH LDE R  AYEN KI+K+K K +HDK I  +EF
Sbjct: 362  EHKALWATKFLNFDLSKAGESRILQLHELDEFRNFAYENAKIFKEKTKKWHDKKIQNREF 421

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
              GQ VLLFNSRLKLFPGKLKS+WSG
Sbjct: 422  REGQLVLLFNSRLKLFPGKLKSRWSG 447


>dbj|GAU21695.1| hypothetical protein TSUD_242670 [Trifolium subterraneum]
          Length = 1897

 Score =  625 bits (1611), Expect = 0.0
 Identities = 286/386 (74%), Positives = 332/386 (86%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            EKLL + ERPWFAD+ANYKA+G +PEDF W +KKKFL +A  Y+WDDPYL+K+GAD LLR
Sbjct: 1134 EKLLMMQERPWFADMANYKASGLIPEDFNWHKKKKFLREANQYVWDDPYLFKIGADNLLR 1193

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCV+  E+ +ILWHCH+SP GGHYNGERTAAKVLQSGF WP+LF+DAH + + C+ CQ T
Sbjct: 1194 RCVTTEEATNILWHCHNSPYGGHYNGERTAAKVLQSGFFWPTLFRDAHQHAQRCNNCQMT 1253

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            G ISR+ EMPL  IL VEVFDCWGIDF+GPFP SF+ EYILVAVDYVSKWVEA A+ ++D
Sbjct: 1254 GGISRRNEMPLQNILVVEVFDCWGIDFVGPFPSSFSNEYILVAVDYVSKWVEAIASPRAD 1313

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
             +TV+KF+KKNIF+RFG PRVLISDGGSHFCN+QL   L  YGV HKV +PYHPQ+NGQA
Sbjct: 1314 GKTVIKFLKKNIFTRFGTPRVLISDGGSHFCNSQLEKALEHYGVRHKVASPYHPQTNGQA 1373

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            EVSNREIKRILEKTV +SRKDWS K+D+ALWAYRTA+K+P+G++P+QLVYGKACHL VEL
Sbjct: 1374 EVSNREIKRILEKTVSTSRKDWSAKLDEALWAYRTAFKSPIGLTPFQLVYGKACHLLVEL 1433

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
            EHKAYWALKFLNF  S  GEKR LQLH L+EMR QAYE++K+YK+K+K YHDK I  K F
Sbjct: 1434 EHKAYWALKFLNFDPSVSGEKRKLQLHELEEMRAQAYESSKLYKEKVKGYHDKRIFNKAF 1493

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
             PGQ VLLFNSRLKLFPGKLKSKWSG
Sbjct: 1494 KPGQMVLLFNSRLKLFPGKLKSKWSG 1519


>ref|XP_014629757.1| PREDICTED: uncharacterized protein LOC106798275 [Glycine max]
          Length = 2256

 Score =  628 bits (1620), Expect = 0.0
 Identities = 283/386 (73%), Positives = 334/386 (86%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            E LL +  RPWFAD+ANYKA G +PE++TW Q+KKFLHDA+ Y+WDDP+L+K GAD +LR
Sbjct: 1382 EFLLQVTTRPWFADMANYKATGVIPEEYTWNQRKKFLHDARFYVWDDPHLFKAGADNVLR 1441

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCV+  E++SILWHCH S  GGH++G+RTAAKVLQSGF WPSLFKDA+++VR CD+CQRT
Sbjct: 1442 RCVTKEEARSILWHCHSSSYGGHHSGDRTAAKVLQSGFFWPSLFKDAYEFVRCCDRCQRT 1501

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            G ISR+ EMPL  I+EVE+FDCWGIDFMGP P S+   YILVAVDYVSKWVEA AT K D
Sbjct: 1502 GGISRRNEMPLQNIMEVEIFDCWGIDFMGPLPSSYGNIYILVAVDYVSKWVEAIATPKDD 1561

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
            ++ V+KF+KKNIFSRFGVPR LISDGG+HFCN QL  VL  Y V HKV TPYHPQ+NGQA
Sbjct: 1562 ARVVIKFLKKNIFSRFGVPRALISDGGTHFCNNQLKKVLEHYNVRHKVATPYHPQTNGQA 1621

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            E+SNRE+KRILEKTV SSRKDW+LK+DD LWAYRTA+KTP+G+SP+QLVYGK+CHLPVEL
Sbjct: 1622 EISNRELKRILEKTVASSRKDWALKLDDTLWAYRTAFKTPIGLSPFQLVYGKSCHLPVEL 1681

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
            EHKAYWAL+ LNF  +ACGEKR LQL  L+EMRL AYE+++IYK++ KAYHDK + R+EF
Sbjct: 1682 EHKAYWALRLLNFDNNACGEKRKLQLQELEEMRLNAYESSRIYKERTKAYHDKKLQRREF 1741

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
             PGQQVLLFNSRL+LFPGKLKSKWSG
Sbjct: 1742 QPGQQVLLFNSRLRLFPGKLKSKWSG 1767


>gb|PNX77934.1| hypothetical protein L195_g033907 [Trifolium pratense]
          Length = 595

 Score =  585 bits (1507), Expect = 0.0
 Identities = 268/386 (69%), Positives = 316/386 (81%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            E +LA+   PWFAD ANY     +P DFT QQ+KKFLHD K Y+WD+P+LYK G DGLLR
Sbjct: 159  EHILAVTVAPWFADFANYMVGRTIPSDFTPQQRKKFLHDCKFYVWDEPFLYKRGVDGLLR 218

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCV   E + +LWHCHDS  GGH++G+RTAAKVLQSG  WP+LFKDA  YV+ CD+CQRT
Sbjct: 219  RCVPEGEQEKVLWHCHDSSYGGHFSGDRTAAKVLQSGLFWPTLFKDAFTYVKRCDRCQRT 278

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            GNIS++ EMP NPILEVE+FD WGIDFMGPFP S++  YILVAVDYVSKWVEA AT  +D
Sbjct: 279  GNISKRNEMPQNPILEVEIFDVWGIDFMGPFPSSYSKTYILVAVDYVSKWVEAIATHTND 338

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
            +Q VV F+K+NIFSRFGVPR LISD G+HF N ++  +L KY V H++ TPYHPQ++GQ 
Sbjct: 339  AQVVVAFLKRNIFSRFGVPRALISDEGTHFLNRKMEALLKKYNVHHRIATPYHPQTSGQV 398

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            EVSNR+IK+ILEKTV SSRKDWS+K+DDALWAYRTA+KTP+GMSP+Q+VYGKACHLP+EL
Sbjct: 399  EVSNRQIKQILEKTVNSSRKDWSVKLDDALWAYRTAFKTPIGMSPFQIVYGKACHLPLEL 458

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
            EHKA WA KFLNF  S  GE RILQLH LDE R  AYEN KI+K+K K +HD+ I +KEF
Sbjct: 459  EHKALWATKFLNFDLSKAGESRILQLHELDEFRNYAYENAKIFKEKTKKWHDRKIQKKEF 518

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
              GQ VLLFNSRL+LFPGKLKS+WSG
Sbjct: 519  REGQLVLLFNSRLRLFPGKLKSRWSG 544


>dbj|GAU45320.1| hypothetical protein TSUD_84370 [Trifolium subterraneum]
          Length = 1748

 Score =  616 bits (1588), Expect = 0.0
 Identities = 277/386 (71%), Positives = 331/386 (85%)
 Frame = +2

Query: 2    EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
            EKLL + ERPWFAD+AN+KA+G +P+DF W QKKKFL +A  Y+WDDPYL+ +G D LLR
Sbjct: 1312 EKLLMVQERPWFADMANFKASGLIPDDFNWHQKKKFLREANQYVWDDPYLFMIGEDNLLR 1371

Query: 182  RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
            RCV+  E+ SI+WHCH+SP GGHYNGERTAAKVLQSGF WP+LFKD +++ R CD CQR 
Sbjct: 1372 RCVTKEEATSIMWHCHNSPYGGHYNGERTAAKVLQSGFFWPTLFKDTYEHARKCDSCQRI 1431

Query: 362  GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
            G ISR+ EMPL  +  VEVFDCWGIDF+GPFP SF+ EYILVAVDYVSKWVEA A+ K+D
Sbjct: 1432 GGISRRNEMPLQNMHVVEVFDCWGIDFVGPFPSSFSNEYILVAVDYVSKWVEAIASPKAD 1491

Query: 542  SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
             +TV+KF+K+NIF+RFG PRVLISDGGSHFCN+QL   L  YGV HKV +PYHPQ+NGQA
Sbjct: 1492 GKTVIKFLKRNIFTRFGTPRVLISDGGSHFCNSQLARALEHYGVKHKVASPYHPQTNGQA 1551

Query: 722  EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
            EVSNREIK+ILEKTV +SRKDWSLK+D+ALWAYRTA+K P+G++P+Q+VYGK+CHLPVEL
Sbjct: 1552 EVSNREIKKILEKTVSASRKDWSLKLDEALWAYRTAFKAPIGLTPFQMVYGKSCHLPVEL 1611

Query: 902  EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
            EHKAYWALKFLNF  +  GEKR +QL  LDEM+ QAYE++K+YK+K+K+YHDK +  K+F
Sbjct: 1612 EHKAYWALKFLNFDENQAGEKRKVQLQQLDEMQCQAYESSKLYKEKVKSYHDKRLVNKDF 1671

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
             PGQ VLLFNSRLKLFPGKLKSKWSG
Sbjct: 1672 RPGQMVLLFNSRLKLFPGKLKSKWSG 1697

