BLASTX nr result
ID: Astragalus23_contig00015240
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00015240
         (1160 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits)  Value

gb|KYP74405.1| Transposon Ty3-I Gag-Pol polyprotein, partial [Ca...   644   0.0
gb|KYP73703.1| Retrotransposable element Tf2, partial [Cajanus c...   620   0.0
gb|KYP70257.1| Retrovirus-related Pol polyprotein from transposo...   612   0.0
gb|PNY04892.1| hypothetical protein L195_g001324 [Trifolium prat...   627   0.0
gb|KYP47823.1| Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan]   594   0.0
gb|KYP33590.1| Retrotransposable element Tf2 [Cajanus cajan]          595   0.0
dbj|GAU25155.1| hypothetical protein TSUD_150540 [Trifolium subt...   631   0.0
gb|PNY12616.1| hypothetical protein L195_g009250 [Trifolium prat...   630   0.0
gb|KYP31777.1| Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan]   591   0.0
ref|XP_014629745.1| PREDICTED: uncharacterized protein LOC106798...   628   0.0
ref|XP_014621692.1| PREDICTED: uncharacterized protein LOC106795...   628   0.0
gb|PNY12480.1| hypothetical protein L195_g009111 [Trifolium prat...   627   0.0
ref|XP_014622146.1| PREDICTED: uncharacterized protein LOC106795...   626   0.0
gb|KYP51742.1| Retrovirus-related Pol polyprotein from transposo...   583   0.0
ref|XP_014617324.1| PREDICTED: uncharacterized protein LOC106794...   625   0.0
gb|PNY03357.1| hypothetical protein L195_g026684, partial [Trifo...   585   0.0
dbj|GAU21695.1| hypothetical protein TSUD_242670 [Trifolium subt...   625   0.0
ref|XP_014629757.1| PREDICTED: uncharacterized protein LOC106798...   628   0.0
gb|PNX77934.1| hypothetical protein L195_g033907 [Trifolium prat...   585   0.0
dbj|GAU45320.1| hypothetical protein TSUD_84370 [Trifolium subte...   616   0.0

>gb|KYP74405.1| Transposon Ty3-I Gag-Pol polyprotein, partial [Cajanus cajan]
Length = 743

Score = 644 bits (1660), Expect = 0.0
Identities = 290/386 (75%), Positives = 341/386 (88%)
Frame = +2

Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
EKL ++ ERPWFAD+AN+KAAG +PED WQQ+KKF D+ +Y+WDDPYL+K+GADGLLR
307 EKLFSVQERPWFADMANFKAAGVIPEDLNWQQRKKFFKDSTYYVWDDPYLFKIGADGLLR 366

Query: 182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361
RCVSG E + ILWHCH+SP GGHYNG+RTAAK+LQSGF+WP+LFKD+H++ ++CDKCQRT
367 RCVSGVEIRDILWHCHNSPYGGHYNGDRTAAKILQSGFYWPTLFKDSHEHCKNCDKCQRT 426

Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541
G IS++ E+PL ILEVEVFDCWGIDF+GP P S++ EYILVAVDYVSKWVEA AT K D
427 GGISKRHELPLQNILEVEVFDCWGIDFVGPLPSSYSNEYILVAVDYVSKWVEAVATQKVD 486

Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721
++TV+KF+KKNIF+RFG PRVLISDGGSHFCN QL VL YGV HKV TPYHPQ+NGQA
487 ARTVIKFLKKNIFTRFGTPRVLISDGGSHFCNTQLKKVLEHYGVRHKVATPYHPQTNGQA 546

Query: 722 EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901
EVSNRE+KRILEKTV SSRKDW+LK+DD LWAYRTAYKTP+G+SP+QLVYGKACHLPVEL
547 EVSNRELKRILEKTVASSRKDWALKLDDTLWAYRTAYKTPIGLSPFQLVYGKACHLPVEL 606

Query: 902 EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081
EHKAYWALK LNF A GEKR LQLH L+E RLQAYE+++IYK K+K+YHD+ I +++F
607 EHKAYWALKALNFDLKAAGEKRKLQLHELEETRLQAYESSRIYKSKVKSYHDRKIVQRDF 666

Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159
PGQQVLLFNSRL+LFPGKLKSKWSG
667 QPGQQVLLFNSRLRLFPGKLKSKWSG 692

>gb|KYP73703.1| Retrotransposable element Tf2, partial [Cajanus cajan]
Length = 646

Score = 620 bits (1600), Expect = 0.0
Identities = 285/386 (73%), Positives = 329/386 (85%)
Frame = +2

Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181
EKLL + ERPWFADIAN+KAAG +PE+ W QKKKF HDAK Y+WDDP+L+K+G D LLR Sbjct: 210 EKLLLVQERPWFADIANFKAAGVIPENLNWHQKKKFFHDAKQYIWDDPHLFKIGVDSLLR 269 Query: 182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361 RCVS E+KSILWHCH+SP GGH+NGERT AKVLQSGF+WPSLFKDA +V+ + CQRT Sbjct: 270 RCVSNEEAKSILWHCHNSPYGGHFNGERTVAKVLQSGFYWPSLFKDAQLHVQHYNNCQRT 329 Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541 G ISR+ EMPLN ILE+EVFDCWGIDF+GP P SF+ EYILVAVDYVSKWVEA AT+K+D Sbjct: 330 GGISRRNEMPLNNILEIEVFDCWGIDFVGPLPSSFSNEYILVAVDYVSKWVEAIATSKAD 389 Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721 ++TV+KF+K+NIF RFG PRVLISDGGSHFCN QL L YGV HKV +PYHPQ+NGQA Sbjct: 390 AKTVIKFLKRNIFCRFGTPRVLISDGGSHFCNKQLQKALKHYGVRHKVASPYHPQTNGQA 449 Query: 722 EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901 EVSNREIKRILEKTV SRKDWS+K+DDALWAYRTAYK P G+SP+QLVYGKACHLPVEL Sbjct: 450 EVSNREIKRILEKTVSFSRKDWSMKLDDALWAYRTAYKAPTGLSPFQLVYGKACHLPVEL 509 Query: 902 EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081 E KA+WALKF N + G KR +QL+ L+EMR AYE++KIYK+K KAYHDK I +K+F Sbjct: 510 EQKAHWALKFFNLDPNVAGNKRKIQLNELEEMRRNAYESSKIYKEKAKAYHDKKILKKDF 569 Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159 PGQ VLLFNS+L+LFPGKLKSKWSG Sbjct: 570 QPGQSVLLFNSKLRLFPGKLKSKWSG 595 >gb|KYP70257.1| Retrovirus-related Pol polyprotein from transposon 412 family [Cajanus cajan] Length = 702 Score = 612 bits (1578), Expect = 0.0 Identities = 280/372 (75%), Positives = 322/372 (86%) Frame = +2 Query: 44 IANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLRRCVSGAESKSILWH 223 +ANYKAAG +PE++ WQQKKKF D+ +Y+WDDPYL+K+GADGLLRRCV G E + ILWH Sbjct: 1 MANYKAAGVIPEEYNWQQKKKFFRDSHYYVWDDPYLFKIGADGLLRRCVFGVEIRDILWH 60 Query: 224 CHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRTGNISRKGEMPLNPI 403 CH+SP GGHYNGERTAAKVLQSGF+WP+LF+DA+++ +SCDKCQRTG +S++ E+PL I Sbjct: 61 CHNSPYGGHYNGERTAAKVLQSGFYWPTLFRDAYEHCKSCDKCQRTGTVSKRHELPLQNI 120 Query: 404 
LEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSDSQTVVKFIKKNIFS 583 LEVEVFDCWGIDF+GP P SF EYILVAVDYVSKWVEASA K+D++TV+KF+KKNIF Sbjct: 121 LEVEVFDCWGIDFIGPLPSSFGNEYILVAVDYVSKWVEASAVQKADARTVIKFLKKNIFC 180 Query: 584 RFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQAEVSNREIKRILEKT 763 RFG PRVLISDGGSHFCNAQL VL Y V HKV TPYHPQ+NGQAEVSNRE+KRILEKT Sbjct: 181 RFGSPRVLISDGGSHFCNAQLQKVLEHYKVRHKVATPYHPQTNGQAEVSNRELKRILEKT 240 Query: 764 VGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVELEHKAYWALKFLNFT 943 V SSRKDW+LK+DD LWAYRTA+K+P+G SP+QLVYGKACHLPVELEHKA+WALK LN+ Sbjct: 241 VASSRKDWALKLDDTLWAYRTAFKSPIGFSPFQLVYGKACHLPVELEHKAFWALKALNYD 300 Query: 944 TSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEFHPGQQVLLFNSRLK 1123 A GEK LQLH L+EMRLQAYE++K YK K K YHDK I + F PGQQVLLFNSRLK Sbjct: 301 LKAAGEKMKLQLHELEEMRLQAYESSKSYKHKAKIYHDKKILNRNFQPGQQVLLFNSRLK 360 Query: 1124 LFPGKLKSKWSG 1159 LFPGKLKSKWSG Sbjct: 361 LFPGKLKSKWSG 372 >gb|PNY04892.1| hypothetical protein L195_g001324 [Trifolium pratense] Length = 1381 Score = 627 bits (1618), Expect = 0.0 Identities = 280/386 (72%), Positives = 339/386 (87%) Frame = +2 Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181 EKLL + ERPWFAD+ANYKA+G +P+DF W QKK+FL A ++WDDPYL+K+GAD LLR Sbjct: 945 EKLLMVQERPWFADMANYKASGLIPDDFNWHQKKRFLRIANQFVWDDPYLFKLGADNLLR 1004 Query: 182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361 RCV+ E+ SILWHCH+SP GGHYNGERTAAK+LQ+GF WP++FKD+++YV+SCD CQRT Sbjct: 1005 RCVTKEEATSILWHCHNSPYGGHYNGERTAAKILQAGFFWPTVFKDSYEYVQSCDNCQRT 1064 Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541 G ISR+ EMPL ILEVEVFDCWGIDF+GPFP S + EYILVAVDYVSKWVEA A+ K+D Sbjct: 1065 GGISRRNEMPLQSILEVEVFDCWGIDFVGPFPSSLSNEYILVAVDYVSKWVEAIASPKAD 1124 Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721 +TV+KF+K+NIF+RFG PRVLISDGGSHFCN+QL L YGV HK+ +PYHPQ+NGQA Sbjct: 1125 GKTVIKFLKRNIFTRFGTPRVLISDGGSHFCNSQLAKALEHYGVKHKIASPYHPQTNGQA 1184 Query: 722 
EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901 EVSNREIK+ILEKTV +SRKDWSLK+D+ALWAYRTA+K+P+G++P+Q++YGKACHLPVEL Sbjct: 1185 EVSNREIKKILEKTVSTSRKDWSLKLDEALWAYRTAFKSPIGLTPFQMIYGKACHLPVEL 1244 Query: 902 EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081 EHKA+WALKFLNF + GEKR QLH L+EMR AYE++K+YKQK+K+YHDK I +++F Sbjct: 1245 EHKAFWALKFLNFDENQAGEKRKFQLHELEEMRFHAYESSKLYKQKVKSYHDKQIVKRDF 1304 Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159 PGQ+VLLFNSRLKLFPGKLKSKWSG Sbjct: 1305 QPGQKVLLFNSRLKLFPGKLKSKWSG 1330 >gb|KYP47823.1| Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan] Length = 548 Score = 594 bits (1532), Expect = 0.0 Identities = 271/365 (74%), Positives = 316/365 (86%) Frame = +2 Query: 65 GKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLRRCVSGAESKSILWHCHDSPCG 244 G +PEDFTW QKKKFL DA +Y+WDDP+L+K G+DGLLRRCVS E+KSILWHCH+SP G Sbjct: 133 GVIPEDFTWHQKKKFLRDATYYVWDDPHLFKSGSDGLLRRCVSREEAKSILWHCHNSPYG 192 Query: 245 GHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRTGNISRKGEMPLNPILEVEVFD 424 GH+NGERTAAKVLQ+GF+WP+LFKDAH + + CD CQR G I+RK EMPL + EVEVFD Sbjct: 193 GHFNGERTAAKVLQAGFYWPTLFKDAHSHAKQCDSCQRAGGITRKHEMPLQNMQEVEVFD 252 Query: 425 CWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSDSQTVVKFIKKNIFSRFGVPRV 604 CWGIDF+GP P SF EYILVAVDYVSKWVEA AT K+D +TVVKF+KK+IFSRFG PRV Sbjct: 253 CWGIDFVGPLPSSFTNEYILVAVDYVSKWVEAIATPKADGKTVVKFLKKHIFSRFGTPRV 312 Query: 605 LISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQAEVSNREIKRILEKTVGSSRKD 784 LISDGGSHFCNAQL L YGV HKV T YHPQ+NGQAEVSNREIKRILEKTV +SRKD Sbjct: 313 LISDGGSHFCNAQLERALEHYGVHHKVATAYHPQTNGQAEVSNREIKRILEKTVSASRKD 372 Query: 785 WSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVELEHKAYWALKFLNFTTSACGEK 964 WS K+D+ALWAYRT +K+P+G++P+QLVYGKACHLPVELEHKAYWA+K LNF + GEK Sbjct: 373 WSTKLDEALWAYRTTFKSPIGLTPFQLVYGKACHLPVELEHKAYWAVKLLNFDPLSTGEK 432 Query: 965 RILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEFHPGQQVLLFNSRLKLFPGKLK 1144 R L+LH LDE+RLQAYE++K+YK+K+K+YHD+ I ++EF GQ VLLFNSRLKLFPGKL+ Sbjct: 433 RKLELHALDELRLQAYESSKLYKEKVKSYHDRKILKREFRAGQSVLLFNSRLKLFPGKLR 492 Query: 1145 
SKWSG 1159 SKWSG Sbjct: 493 SKWSG 497 >gb|KYP33590.1| Retrotransposable element Tf2 [Cajanus cajan] Length = 572 Score = 595 bits (1533), Expect = 0.0 Identities = 273/386 (70%), Positives = 326/386 (84%) Frame = +2 Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181 EKLLAI RPWFAD+AN+KAAG +P+D W Q+KKF DAK+Y+WDDP+L+ +GAD LLR Sbjct: 136 EKLLAIQARPWFADMANFKAAGVIPKDLNWHQRKKFFRDAKYYVWDDPHLFMIGADNLLR 195 Query: 182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361 RCV+ E+ +ILWHCH+SP GGH+NGERTA KVLQSGF WP+LF+DAH +V+ CD C+RT Sbjct: 196 RCVTKEEAGNILWHCHNSPYGGHFNGERTAIKVLQSGFFWPTLFRDAHKHVQRCDNCERT 255 Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541 G+IS++ E PL I EVEVFDCWG DF+GP P SF+ EYIL+AV+YVS+WVEA T K+D Sbjct: 256 GSISKRHERPLQNIQEVEVFDCWGNDFIGPLPLSFSNEYILLAVEYVSRWVEAIPTQKAD 315 Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721 ++TVVKF+KKNIF+ FG PRVLISDGGSHFCN QL VL YGV HKV YHPQ+NGQA Sbjct: 316 AKTVVKFLKKNIFTWFGTPRVLISDGGSHFCNLQLQKVLEHYGVRHKVAIAYHPQTNGQA 375 Query: 722 EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901 EVSNREI +ILEKTV SSRKDWS+K++DALWAYRTA+KTP+G+SP+QL+YGKACHLPVEL Sbjct: 376 EVSNREITKILEKTVTSSRKDWSIKLEDALWAYRTAFKTPIGLSPFQLIYGKACHLPVEL 435 Query: 902 EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081 EHKA+WALKFLN A GE R LQL L+EMRL AYE +K YK+++KAYHDKNI ++ F Sbjct: 436 EHKAFWALKFLNLDAKAAGEHRKLQLLELEEMRLNAYEFSKQYKERIKAYHDKNILKRNF 495 Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159 PGQ VLLFNSR++LFPGKLKSKWSG Sbjct: 496 KPGQSVLLFNSRMRLFPGKLKSKWSG 521 >dbj|GAU25155.1| hypothetical protein TSUD_150540 [Trifolium subterraneum] Length = 1788 Score = 631 bits (1627), Expect = 0.0 Identities = 284/386 (73%), Positives = 335/386 (86%) Frame = +2 Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181 EKLL + ERPWFAD+ANYKA+G +P+DF W QKK+FL A Y+WDDPYL+K+GAD LLR Sbjct: 961 EKLLMVQERPWFADMANYKASGLIPDDFNWHQKKRFLRIANQYVWDDPYLFKIGADNLLR 1020 Query: 
182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361 RCV+ E+ SILWHCH+SP GGHYNGERTAAK+LQ+GF WP+LFKD+++YV+SCD CQ+T Sbjct: 1021 RCVTKEEATSILWHCHNSPYGGHYNGERTAAKILQAGFFWPTLFKDSYEYVQSCDNCQKT 1080 Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541 G ISR+ EMPL ILEVEVFDCWGIDF+GPFP S + EYILVAVDYVSKWVEA A+ K+D Sbjct: 1081 GGISRRNEMPLQSILEVEVFDCWGIDFVGPFPSSLSNEYILVAVDYVSKWVEAIASPKAD 1140 Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721 +TV+KF+KKNIF+RFG PRVLISDGGSHFCN+QL L YGV HKV +PYHPQ+NGQA Sbjct: 1141 GKTVIKFLKKNIFTRFGTPRVLISDGGSHFCNSQLAKALEHYGVKHKVASPYHPQTNGQA 1200 Query: 722 EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901 EVSNRE+K+ILEKTV +SRKDWSLK+D+ALWAYRTA+K+P+G++P+Q+VYGKACH PVEL Sbjct: 1201 EVSNRELKKILEKTVSTSRKDWSLKLDEALWAYRTAFKSPIGLTPFQMVYGKACHFPVEL 1260 Query: 902 EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081 EHKAYWALKFLNF S GEKR QLH +DEMR AYE++K+YK+K+K YHDK I K F Sbjct: 1261 EHKAYWALKFLNFDQSLAGEKRKTQLHEVDEMRFHAYESSKLYKEKVKEYHDKRIINKNF 1320 Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159 HPGQ VLLFNSRLK+FPGKLKSKWSG Sbjct: 1321 HPGQLVLLFNSRLKIFPGKLKSKWSG 1346 >gb|PNY12616.1| hypothetical protein L195_g009250 [Trifolium pratense] Length = 1814 Score = 630 bits (1626), Expect = 0.0 Identities = 289/386 (74%), Positives = 334/386 (86%) Frame = +2 Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181 EKLL + ERPWFAD+ANYKA+G +PED W QKKKFL +A Y+WDDPYL+K+GAD LLR Sbjct: 1378 EKLLMMQERPWFADMANYKASGLIPEDLNWHQKKKFLRNANQYVWDDPYLFKIGADNLLR 1437 Query: 182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361 RCV+ E+ SILWHCH+SP GGHYNGERTAAKVLQSGF WP+LFKDA+ + + CDKCQ T Sbjct: 1438 RCVTTEEATSILWHCHNSPYGGHYNGERTAAKVLQSGFFWPTLFKDAYQHAQKCDKCQMT 1497 Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541 G IS++ EMPL IL VEVFDCWGIDF+GPFP SF+ EYILVAVDYVSKWVEA A+ K+D Sbjct: 1498 GGISKRNEMPLQNILVVEVFDCWGIDFVGPFPSSFSNEYILVAVDYVSKWVEAIASPKAD 1557 
Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721 +TV+KF+KKNIF+RFG PRVLISDGGSHFCN+QL L YGV HKV +PYHPQ+NGQA Sbjct: 1558 GKTVIKFLKKNIFTRFGTPRVLISDGGSHFCNSQLEKALEHYGVRHKVASPYHPQTNGQA 1617 Query: 722 EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901 EVSNREIKRILEKTV +SRKDWS K+DDALWAYRTA+K+P+G++P+Q+VYGKACHLPVEL Sbjct: 1618 EVSNREIKRILEKTVSTSRKDWSSKLDDALWAYRTAFKSPIGLTPFQMVYGKACHLPVEL 1677 Query: 902 EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081 EHKAYWALKFLNF G+KR LQLH L+EMR QAYE++K+YK+K+K+YHDK I KEF Sbjct: 1678 EHKAYWALKFLNFDPCFSGDKRKLQLHELEEMRAQAYESSKLYKEKVKSYHDKKILSKEF 1737 Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159 PGQ VLLFNSRLKLFPGKLKSKWSG Sbjct: 1738 KPGQMVLLFNSRLKLFPGKLKSKWSG 1763 >gb|KYP31777.1| Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan] Length = 541 Score = 591 bits (1523), Expect = 0.0 Identities = 272/386 (70%), Positives = 327/386 (84%) Frame = +2 Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181 EKLLAI RPWFAD+AN+KAAG +P+D W Q+KKF DAK+Y+WDDP+L+K+GAD LLR Sbjct: 136 EKLLAIQARPWFADMANFKAAGVIPKDLNWHQRKKFFRDAKYYVWDDPHLFKIGADNLLR 195 Query: 182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361 RCV+ E+ +ILWHCH+SP GGH+NGERTA KVLQSGF WP+LF+DAH++V+ CD CQR Sbjct: 196 RCVTKEEAGNILWHCHNSPYGGHFNGERTAIKVLQSGFFWPTLFRDAHEHVQRCDNCQRI 255 Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541 G+IS+ EMPL I EVEVF+CWGIDF+GP P SF+ EYIL+AV+YVS+WVEA T K+D Sbjct: 256 GSISKIHEMPLQNIQEVEVFECWGIDFIGPLPLSFSNEYILLAVEYVSRWVEAVPTQKAD 315 Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721 ++ VVKF+K NIF+RFG PRVLISD GSHFCN QL VL YGV HKV T YHPQ+NGQA Sbjct: 316 AKIVVKFLK-NIFTRFGTPRVLISDEGSHFCNVQLQKVLEHYGVRHKVATTYHPQTNGQA 374 Query: 722 EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901 EVSNREIK+ILEKTV SSRKDWS+K++DALWAYRT +KTP+G+SP+QL+YGKACHLPVEL Sbjct: 375 EVSNREIKKILEKTVTSSRKDWSIKLEDALWAYRTTFKTPIGLSPFQLIYGKACHLPVEL 434 Query: 902 
EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081 EHKA+WALKFLN A GE R LQL L+EMRL AYE++K YK+++KAYHDK I ++ F Sbjct: 435 EHKAFWALKFLNLDAKAAGEHRKLQLLELEEMRLNAYESSKQYKERIKAYHDKKILKRNF 494 Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159 PG+ VLLFNSR++LFPGKLKSKWSG Sbjct: 495 KPGKSVLLFNSRMRLFPGKLKSKWSG 520 >ref|XP_014629745.1| PREDICTED: uncharacterized protein LOC106798267 [Glycine max] Length = 1815 Score = 628 bits (1620), Expect = 0.0 Identities = 283/386 (73%), Positives = 334/386 (86%) Frame = +2 Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181 E LL + RPWFAD+ANYKA G +PE++TW Q+KKFLHDA+ Y+WDDP+L+K GAD +LR Sbjct: 1376 EFLLQVTTRPWFADMANYKATGVIPEEYTWNQRKKFLHDARFYVWDDPHLFKAGADNVLR 1435 Query: 182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361 RCV+ E++SILWHCH S GGH++G+RTAAKVLQSGF WPSLFKDA+++VR CD+CQRT Sbjct: 1436 RCVTKEEARSILWHCHSSSYGGHHSGDRTAAKVLQSGFFWPSLFKDAYEFVRCCDRCQRT 1495 Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541 G ISR+ EMPL I+EVE+FDCWGIDFMGP P S+ YILVAVDYVSKWVEA AT K D Sbjct: 1496 GGISRRNEMPLQNIMEVEIFDCWGIDFMGPLPSSYGNIYILVAVDYVSKWVEAIATPKDD 1555 Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721 ++ V+KF+KKNIFSRFGVPR LISDGG+HFCN QL VL Y V HKV TPYHPQ+NGQA Sbjct: 1556 ARVVIKFLKKNIFSRFGVPRALISDGGTHFCNNQLKKVLEHYNVRHKVATPYHPQTNGQA 1615 Query: 722 EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901 E+SNRE+KRILEKTV SSRKDW+LK+DD LWAYRTA+KTP+G+SP+QLVYGK+CHLPVEL Sbjct: 1616 EISNRELKRILEKTVASSRKDWALKLDDTLWAYRTAFKTPIGLSPFQLVYGKSCHLPVEL 1675 Query: 902 EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081 EHKAYWAL+ LNF +ACGEKR LQL L+EMRL AYE+++IYK++ KAYHDK + R+EF Sbjct: 1676 EHKAYWALRLLNFDNNACGEKRKLQLQELEEMRLNAYESSRIYKERTKAYHDKKLQRREF 1735 Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159 PGQQVLLFNSRL+LFPGKLKSKWSG Sbjct: 1736 QPGQQVLLFNSRLRLFPGKLKSKWSG 1761 >ref|XP_014621692.1| PREDICTED: uncharacterized protein LOC106795614 [Glycine max] Length = 
1816 Score = 628 bits (1620), Expect = 0.0 Identities = 283/386 (73%), Positives = 334/386 (86%) Frame = +2 Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181 E LL + RPWFAD+ANYKA G +PE++TW Q+KKFLHDA+ Y+WDDP+L+K GAD +LR Sbjct: 1377 EFLLQVTTRPWFADMANYKATGVIPEEYTWNQRKKFLHDARFYVWDDPHLFKAGADNVLR 1436 Query: 182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361 RCV+ E++SILWHCH S GGH++G+RTAAKVLQSGF WPSLFKDA+++VR CD+CQRT Sbjct: 1437 RCVTKEEARSILWHCHSSSYGGHHSGDRTAAKVLQSGFFWPSLFKDAYEFVRCCDRCQRT 1496 Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541 G ISR+ EMPL I+EVE+FDCWGIDFMGP P S+ YILVAVDYVSKWVEA AT K D Sbjct: 1497 GGISRRNEMPLQNIMEVEIFDCWGIDFMGPLPSSYGNIYILVAVDYVSKWVEAIATPKDD 1556 Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721 ++ V+KF+KKNIFSRFGVPR LISDGG+HFCN QL VL Y V HKV TPYHPQ+NGQA Sbjct: 1557 ARVVIKFLKKNIFSRFGVPRALISDGGTHFCNNQLKKVLEHYNVRHKVATPYHPQTNGQA 1616 Query: 722 EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901 E+SNRE+KRILEKTV SSRKDW+LK+DD LWAYRTA+KTP+G+SP+QLVYGK+CHLPVEL Sbjct: 1617 EISNRELKRILEKTVASSRKDWALKLDDTLWAYRTAFKTPIGLSPFQLVYGKSCHLPVEL 1676 Query: 902 EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081 EHKAYWAL+ LNF +ACGEKR LQL L+EMRL AYE+++IYK++ KAYHDK + R+EF Sbjct: 1677 EHKAYWALRLLNFDNNACGEKRKLQLQELEEMRLNAYESSRIYKERTKAYHDKKLQRREF 1736 Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159 PGQQVLLFNSRL+LFPGKLKSKWSG Sbjct: 1737 QPGQQVLLFNSRLRLFPGKLKSKWSG 1762 >gb|PNY12480.1| hypothetical protein L195_g009111 [Trifolium pratense] Length = 1859 Score = 627 bits (1617), Expect = 0.0 Identities = 282/386 (73%), Positives = 335/386 (86%) Frame = +2 Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181 EKLL + ERPWFAD+ANYK +G +PEDF W QKK+FL +A ++WDDPYL+K+GAD LLR Sbjct: 1423 EKLLMVQERPWFADMANYKVSGLIPEDFNWHQKKRFLREANQFVWDDPYLFKIGADNLLR 1482 Query: 182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361 RCV+ E+ SILWHCH SP 
GGHYNGERTAAKVLQSGF+WP+LFKDAH+Y + CD CQ+T Sbjct: 1483 RCVTREEATSILWHCHSSPYGGHYNGERTAAKVLQSGFYWPTLFKDAHEYSQRCDNCQKT 1542 Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541 G ISR+ EMPL ILEVEVFDCWGIDF+GPFP SF+ EYILVAVDYVSKWVEA A+ K+D Sbjct: 1543 GGISRRNEMPLQNILEVEVFDCWGIDFVGPFPSSFSYEYILVAVDYVSKWVEAVASPKAD 1602 Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721 +TV+KF+KKNIF+RFG PRVLISDGGSHFCN+QL L YGV HKV +PYHPQ+NGQA Sbjct: 1603 GKTVIKFLKKNIFTRFGTPRVLISDGGSHFCNSQLAKALEHYGVKHKVASPYHPQTNGQA 1662 Query: 722 EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901 EVSNRE+K+ILEKTV +SRKDWSLK+D+ALWAYRTA+K+P+G++P+Q+VYGK+CHLPVEL Sbjct: 1663 EVSNREVKKILEKTVSTSRKDWSLKLDEALWAYRTAFKSPIGLTPFQMVYGKSCHLPVEL 1722 Query: 902 EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081 EHKAYWALKFLNF S GEKR +Q+ L+EMR AYE++K+YK+K+K+YHDK I + F Sbjct: 1723 EHKAYWALKFLNFDPSLAGEKRKMQMQELEEMRFHAYESSKLYKEKVKSYHDKRIIDRNF 1782 Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159 PGQ+VLLFNSRLKLF GKLKSKWSG Sbjct: 1783 QPGQKVLLFNSRLKLFSGKLKSKWSG 1808 >ref|XP_014622146.1| PREDICTED: uncharacterized protein LOC106795762 [Glycine max] Length = 1821 Score = 626 bits (1614), Expect = 0.0 Identities = 282/386 (73%), Positives = 333/386 (86%) Frame = +2 Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181 E LL + RPWFAD+ANYKA G +PE++TW Q+KKFLHDA+ Y+WDDP+L+K GAD +LR Sbjct: 1382 EFLLQVTTRPWFADMANYKATGVIPEEYTWNQRKKFLHDARFYVWDDPHLFKAGADNVLR 1441 Query: 182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361 RCV+ E++SILWHCH S GGH++G+RTAAKVLQSGF WPSLFKDA+++VR CD+CQRT Sbjct: 1442 RCVTKEEARSILWHCHSSSYGGHHSGDRTAAKVLQSGFFWPSLFKDAYEFVRCCDRCQRT 1501 Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541 G ISR+ EMPL I+EVE+FDCWGIDFMGP P S+ YILVAVDYVSKWVEA AT K D Sbjct: 1502 GGISRRNEMPLQNIMEVEIFDCWGIDFMGPLPSSYGNIYILVAVDYVSKWVEAIATPKDD 1561 Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721 ++ 
V+KF+KKNIFSRFGVPR LISDGG+HFCN QL VL Y V HKV TPYHPQ+NGQA Sbjct: 1562 ARVVIKFLKKNIFSRFGVPRALISDGGTHFCNNQLKKVLEHYNVRHKVATPYHPQTNGQA 1621 Query: 722 EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901 E+SNRE+KRILEKTV SS KDW+LK+DD LWAYRTA+KTP+G+SP+QLVYGK+CHLPVEL Sbjct: 1622 EISNRELKRILEKTVASSMKDWALKLDDTLWAYRTAFKTPIGLSPFQLVYGKSCHLPVEL 1681 Query: 902 EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081 EHKAYWAL+ LNF +ACGEKR LQL L+EMRL AYE+++IYK++ KAYHDK + R+EF Sbjct: 1682 EHKAYWALRLLNFDNNACGEKRKLQLQELEEMRLNAYESSRIYKERTKAYHDKKLQRREF 1741 Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159 PGQQVLLFNSRL+LFPGKLKSKWSG Sbjct: 1742 QPGQQVLLFNSRLRLFPGKLKSKWSG 1767 >gb|KYP51742.1| Retrovirus-related Pol polyprotein from transposon 412 family [Cajanus cajan] Length = 423 Score = 583 bits (1503), Expect = 0.0 Identities = 264/372 (70%), Positives = 316/372 (84%) Frame = +2 Query: 44 IANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLRRCVSGAESKSILWH 223 +AN+KAAG +PED W Q+KK ++DAK Y+WDDP+L+K+GA+ LLRRCV+ E+K ILWH Sbjct: 1 MANFKAAGVIPEDLNWHQRKKMINDAKLYVWDDPHLFKIGAENLLRRCVTKEEAKDILWH 60 Query: 224 CHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRTGNISRKGEMPLNPI 403 CH+ P GGH+NGERTAAKVLQSGF WP+LFKDAH YV+ CD CQR+G IS++ EMPL I Sbjct: 61 CHNLPYGGHFNGERTAAKVLQSGFFWPTLFKDAHGYVQRCDSCQRSGTISKRHEMPLQNI 120 Query: 404 LEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSDSQTVVKFIKKNIFS 583 EVEVFDCWGIDF+GP P SF+ EYIL+ V+YVS+WVEA T K+D++TV+KF+KKNIF Sbjct: 121 QEVEVFDCWGIDFIGPLPTSFSNEYILLIVEYVSRWVEAIPTQKADAKTVIKFLKKNIFY 180 Query: 584 RFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQAEVSNREIKRILEKT 763 RFG PRVLISDGGSHFCN QL L YGV HKV T YHPQ+NGQAEVSNREIKRILEKT Sbjct: 181 RFGTPRVLISDGGSHFCNVQLKKALEHYGVRHKVATAYHPQTNGQAEVSNREIKRILEKT 240 Query: 764 VGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVELEHKAYWALKFLNFT 943 V +SRKDW+ K+D+ALWAYRTA+KTP +SP+QLVYGKACHLPVELEHKA+WALK LNF Sbjct: 241 VATSRKDWASKLDEALWAYRTAFKTPTSLSPFQLVYGKACHLPVELEHKAFWALKLLNFD 300 Query: 944 
TSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEFHPGQQVLLFNSRLK 1123 +CGEKR QLH L+EMRL AY++++ YK+++KAYHDK I +++F PGQ VLLFNSRLK Sbjct: 301 PKSCGEKRKSQLHELEEMRLNAYQSSRQYKERVKAYHDKKILKRDFRPGQSVLLFNSRLK 360 Query: 1124 LFPGKLKSKWSG 1159 LFPGKL+SKWSG Sbjct: 361 LFPGKLRSKWSG 372 >ref|XP_014617324.1| PREDICTED: uncharacterized protein LOC106794530 [Glycine max] Length = 1819 Score = 625 bits (1613), Expect = 0.0 Identities = 282/386 (73%), Positives = 333/386 (86%) Frame = +2 Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181 E LL + RPWFAD+ANYKA G +PE++TW Q+KKFLHDA+ Y+WDDP+L+K GAD +LR Sbjct: 1380 EFLLQVTTRPWFADMANYKATGVIPEEYTWNQRKKFLHDARFYVWDDPHLFKAGADNVLR 1439 Query: 182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361 RCV+ E++SILWHCH S GGH++G+RTAAKVLQSGF WPSLFKDA+++VR CD+CQRT Sbjct: 1440 RCVTKEEARSILWHCHSSSYGGHHSGDRTAAKVLQSGFFWPSLFKDAYEFVRCCDRCQRT 1499 Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541 G ISR+ EMPL I+EVE+FDCWGIDFMGP P S+ YILVAVDYVSKWVEA AT K D Sbjct: 1500 GGISRRNEMPLQNIMEVEIFDCWGIDFMGPLPSSYGNIYILVAVDYVSKWVEAIATPKDD 1559 Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721 ++ V+KF+KKNIFSRFGVPR LISDGG+HFCN QL VL Y V HKV TPYHPQ+NGQA Sbjct: 1560 ARVVIKFLKKNIFSRFGVPRALISDGGTHFCNNQLKKVLEHYNVRHKVATPYHPQTNGQA 1619 Query: 722 EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901 E+SNRE+KRILEKTV SSRKDW+LK+DD LWAYRTA+KTP+G+SP+QLVYGK+CHLPVEL Sbjct: 1620 EISNRELKRILEKTVASSRKDWALKLDDTLWAYRTAFKTPIGLSPFQLVYGKSCHLPVEL 1679 Query: 902 EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081 HKAYWAL+ LNF +ACGEKR LQL L+EMRL AYE+++IYK++ KAYHDK + R+EF Sbjct: 1680 VHKAYWALRLLNFDNNACGEKRKLQLQELEEMRLNAYESSRIYKERTKAYHDKKLQRREF 1739 Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159 PGQQVLLFNSRL+LFPGKLKSKWSG Sbjct: 1740 QPGQQVLLFNSRLRLFPGKLKSKWSG 1765 >gb|PNY03357.1| hypothetical protein L195_g026684, partial [Trifolium pratense] Length = 498 Score = 585 bits (1508), Expect = 0.0 
Identities = 269/386 (69%), Positives = 315/386 (81%) Frame = +2 Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181 E +LA+ PWFAD ANY +P DFT QQ+KKFLHD K Y+WD+P+LYK G DGLLR Sbjct: 62 EHILAVTVAPWFADFANYMVGKTIPSDFTSQQRKKFLHDCKFYVWDEPFLYKRGVDGLLR 121 Query: 182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361 RCV E + +LWHCHDS GGH++G+RTAAKVLQSG WP+LFKDA YV+ CD+CQRT Sbjct: 122 RCVPEDEQEKVLWHCHDSSYGGHFSGDRTAAKVLQSGLFWPTLFKDAFTYVKRCDRCQRT 181 Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541 GNIS++ EMP NP+LEVE+FD WGIDFMGPFP S++ YILVAVDYVSKWVEA AT +D Sbjct: 182 GNISKRNEMPQNPVLEVEIFDVWGIDFMGPFPSSYSKTYILVAVDYVSKWVEAIATQTND 241 Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721 +Q VV F+KKNIFSRFGVPR LISD G+HF N ++ +L KY V H++ TPYHPQ++GQ Sbjct: 242 AQVVVSFLKKNIFSRFGVPRALISDEGTHFLNRKMEALLRKYNVHHRIATPYHPQTSGQV 301 Query: 722 EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901 EVSNR+IK+ILEKTV SSRKDWSLK+DDALWAYRTA+KTP+GMSP+Q+VYGK+CHLP+EL Sbjct: 302 EVSNRQIKQILEKTVNSSRKDWSLKLDDALWAYRTAFKTPIGMSPFQIVYGKSCHLPLEL 361 Query: 902 EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081 EHKA WA KFLNF S GE RILQLH LDE R AYEN KI+K+K K +HDK I +EF Sbjct: 362 EHKALWATKFLNFDLSKAGESRILQLHELDEFRNFAYENAKIFKEKTKKWHDKKIQNREF 421 Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159 GQ VLLFNSRLKLFPGKLKS+WSG Sbjct: 422 REGQLVLLFNSRLKLFPGKLKSRWSG 447 >dbj|GAU21695.1| hypothetical protein TSUD_242670 [Trifolium subterraneum] Length = 1897 Score = 625 bits (1611), Expect = 0.0 Identities = 286/386 (74%), Positives = 332/386 (86%) Frame = +2 Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181 EKLL + ERPWFAD+ANYKA+G +PEDF W +KKKFL +A Y+WDDPYL+K+GAD LLR Sbjct: 1134 EKLLMMQERPWFADMANYKASGLIPEDFNWHKKKKFLREANQYVWDDPYLFKIGADNLLR 1193 Query: 182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361 RCV+ E+ +ILWHCH+SP GGHYNGERTAAKVLQSGF WP+LF+DAH + + C+ CQ T Sbjct: 1194 
RCVTTEEATNILWHCHNSPYGGHYNGERTAAKVLQSGFFWPTLFRDAHQHAQRCNNCQMT 1253 Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541 G ISR+ EMPL IL VEVFDCWGIDF+GPFP SF+ EYILVAVDYVSKWVEA A+ ++D Sbjct: 1254 GGISRRNEMPLQNILVVEVFDCWGIDFVGPFPSSFSNEYILVAVDYVSKWVEAIASPRAD 1313 Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721 +TV+KF+KKNIF+RFG PRVLISDGGSHFCN+QL L YGV HKV +PYHPQ+NGQA Sbjct: 1314 GKTVIKFLKKNIFTRFGTPRVLISDGGSHFCNSQLEKALEHYGVRHKVASPYHPQTNGQA 1373 Query: 722 EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901 EVSNREIKRILEKTV +SRKDWS K+D+ALWAYRTA+K+P+G++P+QLVYGKACHL VEL Sbjct: 1374 EVSNREIKRILEKTVSTSRKDWSAKLDEALWAYRTAFKSPIGLTPFQLVYGKACHLLVEL 1433 Query: 902 EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081 EHKAYWALKFLNF S GEKR LQLH L+EMR QAYE++K+YK+K+K YHDK I K F Sbjct: 1434 EHKAYWALKFLNFDPSVSGEKRKLQLHELEEMRAQAYESSKLYKEKVKGYHDKRIFNKAF 1493 Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159 PGQ VLLFNSRLKLFPGKLKSKWSG Sbjct: 1494 KPGQMVLLFNSRLKLFPGKLKSKWSG 1519 >ref|XP_014629757.1| PREDICTED: uncharacterized protein LOC106798275 [Glycine max] Length = 2256 Score = 628 bits (1620), Expect = 0.0 Identities = 283/386 (73%), Positives = 334/386 (86%) Frame = +2 Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181 E LL + RPWFAD+ANYKA G +PE++TW Q+KKFLHDA+ Y+WDDP+L+K GAD +LR Sbjct: 1382 EFLLQVTTRPWFADMANYKATGVIPEEYTWNQRKKFLHDARFYVWDDPHLFKAGADNVLR 1441 Query: 182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361 RCV+ E++SILWHCH S GGH++G+RTAAKVLQSGF WPSLFKDA+++VR CD+CQRT Sbjct: 1442 RCVTKEEARSILWHCHSSSYGGHHSGDRTAAKVLQSGFFWPSLFKDAYEFVRCCDRCQRT 1501 Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541 G ISR+ EMPL I+EVE+FDCWGIDFMGP P S+ YILVAVDYVSKWVEA AT K D Sbjct: 1502 GGISRRNEMPLQNIMEVEIFDCWGIDFMGPLPSSYGNIYILVAVDYVSKWVEAIATPKDD 1561 Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721 ++ V+KF+KKNIFSRFGVPR LISDGG+HFCN QL VL Y V HKV 
TPYHPQ+NGQA Sbjct: 1562 ARVVIKFLKKNIFSRFGVPRALISDGGTHFCNNQLKKVLEHYNVRHKVATPYHPQTNGQA 1621 Query: 722 EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901 E+SNRE+KRILEKTV SSRKDW+LK+DD LWAYRTA+KTP+G+SP+QLVYGK+CHLPVEL Sbjct: 1622 EISNRELKRILEKTVASSRKDWALKLDDTLWAYRTAFKTPIGLSPFQLVYGKSCHLPVEL 1681 Query: 902 EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081 EHKAYWAL+ LNF +ACGEKR LQL L+EMRL AYE+++IYK++ KAYHDK + R+EF Sbjct: 1682 EHKAYWALRLLNFDNNACGEKRKLQLQELEEMRLNAYESSRIYKERTKAYHDKKLQRREF 1741 Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159 PGQQVLLFNSRL+LFPGKLKSKWSG Sbjct: 1742 QPGQQVLLFNSRLRLFPGKLKSKWSG 1767 >gb|PNX77934.1| hypothetical protein L195_g033907 [Trifolium pratense] Length = 595 Score = 585 bits (1507), Expect = 0.0 Identities = 268/386 (69%), Positives = 316/386 (81%) Frame = +2 Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181 E +LA+ PWFAD ANY +P DFT QQ+KKFLHD K Y+WD+P+LYK G DGLLR Sbjct: 159 EHILAVTVAPWFADFANYMVGRTIPSDFTPQQRKKFLHDCKFYVWDEPFLYKRGVDGLLR 218 Query: 182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361 RCV E + +LWHCHDS GGH++G+RTAAKVLQSG WP+LFKDA YV+ CD+CQRT Sbjct: 219 RCVPEGEQEKVLWHCHDSSYGGHFSGDRTAAKVLQSGLFWPTLFKDAFTYVKRCDRCQRT 278 Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541 GNIS++ EMP NPILEVE+FD WGIDFMGPFP S++ YILVAVDYVSKWVEA AT +D Sbjct: 279 GNISKRNEMPQNPILEVEIFDVWGIDFMGPFPSSYSKTYILVAVDYVSKWVEAIATHTND 338 Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721 +Q VV F+K+NIFSRFGVPR LISD G+HF N ++ +L KY V H++ TPYHPQ++GQ Sbjct: 339 AQVVVAFLKRNIFSRFGVPRALISDEGTHFLNRKMEALLKKYNVHHRIATPYHPQTSGQV 398 Query: 722 EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901 EVSNR+IK+ILEKTV SSRKDWS+K+DDALWAYRTA+KTP+GMSP+Q+VYGKACHLP+EL Sbjct: 399 EVSNRQIKQILEKTVNSSRKDWSVKLDDALWAYRTAFKTPIGMSPFQIVYGKACHLPLEL 458 Query: 902 EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081 EHKA WA KFLNF S GE RILQLH LDE R AYEN KI+K+K K +HD+ I 
+KEF Sbjct: 459 EHKALWATKFLNFDLSKAGESRILQLHELDEFRNYAYENAKIFKEKTKKWHDRKIQKKEF 518 Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159 GQ VLLFNSRL+LFPGKLKS+WSG Sbjct: 519 REGQLVLLFNSRLRLFPGKLKSRWSG 544 >dbj|GAU45320.1| hypothetical protein TSUD_84370 [Trifolium subterraneum] Length = 1748 Score = 616 bits (1588), Expect = 0.0 Identities = 277/386 (71%), Positives = 331/386 (85%) Frame = +2 Query: 2 EKLLAIVERPWFADIANYKAAGKVPEDFTWQQKKKFLHDAKHYLWDDPYLYKVGADGLLR 181 EKLL + ERPWFAD+AN+KA+G +P+DF W QKKKFL +A Y+WDDPYL+ +G D LLR Sbjct: 1312 EKLLMVQERPWFADMANFKASGLIPDDFNWHQKKKFLREANQYVWDDPYLFMIGEDNLLR 1371 Query: 182 RCVSGAESKSILWHCHDSPCGGHYNGERTAAKVLQSGFHWPSLFKDAHDYVRSCDKCQRT 361 RCV+ E+ SI+WHCH+SP GGHYNGERTAAKVLQSGF WP+LFKD +++ R CD CQR Sbjct: 1372 RCVTKEEATSIMWHCHNSPYGGHYNGERTAAKVLQSGFFWPTLFKDTYEHARKCDSCQRI 1431 Query: 362 GNISRKGEMPLNPILEVEVFDCWGIDFMGPFPPSFNCEYILVAVDYVSKWVEASATAKSD 541 G ISR+ EMPL + VEVFDCWGIDF+GPFP SF+ EYILVAVDYVSKWVEA A+ K+D Sbjct: 1432 GGISRRNEMPLQNMHVVEVFDCWGIDFVGPFPSSFSNEYILVAVDYVSKWVEAIASPKAD 1491 Query: 542 SQTVVKFIKKNIFSRFGVPRVLISDGGSHFCNAQLNNVLSKYGVTHKVTTPYHPQSNGQA 721 +TV+KF+K+NIF+RFG PRVLISDGGSHFCN+QL L YGV HKV +PYHPQ+NGQA Sbjct: 1492 GKTVIKFLKRNIFTRFGTPRVLISDGGSHFCNSQLARALEHYGVKHKVASPYHPQTNGQA 1551 Query: 722 EVSNREIKRILEKTVGSSRKDWSLKVDDALWAYRTAYKTPLGMSPYQLVYGKACHLPVEL 901 EVSNREIK+ILEKTV +SRKDWSLK+D+ALWAYRTA+K P+G++P+Q+VYGK+CHLPVEL Sbjct: 1552 EVSNREIKKILEKTVSASRKDWSLKLDEALWAYRTAFKAPIGLTPFQMVYGKSCHLPVEL 1611 Query: 902 EHKAYWALKFLNFTTSACGEKRILQLHGLDEMRLQAYENNKIYKQKMKAYHDKNISRKEF 1081 EHKAYWALKFLNF + GEKR +QL LDEM+ QAYE++K+YK+K+K+YHDK + K+F Sbjct: 1612 EHKAYWALKFLNFDENQAGEKRKVQLQQLDEMQCQAYESSKLYKEKVKSYHDKRLVNKDF 1671 Query: 1082 HPGQQVLLFNSRLKLFPGKLKSKWSG 1159 PGQ VLLFNSRLKLFPGKLKSKWSG Sbjct: 1672 RPGQMVLLFNSRLKLFPGKLKSKWSG 1697