BLASTX nr result
ID: Astragalus22_contig00029563
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00029563 (1589 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

gb|PNX82129.1| hypothetical protein L195_g038157 [Trifolium prat...  458  e-155
gb|PNX71626.1| flavonol sulfotransferase-like protein, partial [...  461  e-154
dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subt...  448  e-142
gb|PNY08535.1| retrovirus-related Pol polyprotein from transposo...  443  e-140
gb|PNX55412.1| hypothetical protein L195_g049041, partial [Trifo...  384  e-127
ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798...  382  e-126
gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Gly...  364  e-117
gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Gly...  363  e-117
gb|PNX91084.1| hypothetical protein L195_g047213, partial [Trifo...  352  e-113
gb|KHN34741.1| Retrovirus-related Pol polyprotein from transposo...  340  e-107
gb|PNX71325.1| hypothetical protein L195_g027200, partial [Trifo...  323  2e-99
ref|XP_014621696.1| PREDICTED: uncharacterized protein LOC106795...  298  2e-94
ref|XP_014626210.1| PREDICTED: uncharacterized protein LOC106797...  296  7e-94
ref|XP_014627175.1| PREDICTED: uncharacterized protein LOC106797...  294  4e-93
ref|XP_014632403.1| PREDICTED: uncharacterized protein LOC106798...  276  5e-86
gb|KYP65733.1| hypothetical protein KK1_011995 [Cajanus cajan] >...  279  8e-85
dbj|GAU50616.1| hypothetical protein TSUD_290710 [Trifolium subt...
271 2e-82 ref|XP_006576053.1| PREDICTED: uncharacterized protein LOC102662... 266 3e-80 ref|XP_019455138.1| PREDICTED: uncharacterized protein LOC109356... 273 9e-79 ref|XP_016673106.1| PREDICTED: uncharacterized protein LOC107892... 251 3e-75 >gb|PNX82129.1| hypothetical protein L195_g038157 [Trifolium pratense] Length = 392 Score = 458 bits (1178), Expect = e-155 Identities = 243/409 (59%), Positives = 283/409 (69%), Gaps = 14/409 (3%) Frame = +3 Query: 159 MATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEE 314 MAT +YSDFSTNSANPYYLHPNENPALIL P L N H L + Sbjct: 1 MATINYSDFSTNSANPYYLHPNENPALILVS------PPLDHKNYHTWARSMNIALISKN 54 Query: 315 QRKIC*RNPPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRT 479 + K + PK TDP+Y PWIRCNTMVLAWIHRS+S+SIA+S+ AGVWKNLR Sbjct: 55 KDKFIDGSFPKPSITDPLYGPWIRCNTMVLAWIHRSISDSIARSVLWIDTAAGVWKNLRI 114 Query: 480 RFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGA 659 RFSQGDIFRISDIQ+ELYKFRQG L+ISDYFTQLKV WDELE+YRP+P CKC+IACTCGA Sbjct: 115 RFSQGDIFRISDIQEELYKFRQGTLDISDYFTQLKVLWDELENYRPIPHCKCSIACTCGA 174 Query: 660 VDSVKIYREQDYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVLD 836 +DS+ IYR+QDYVIRFLKGLND+FS TKSQIM M PLP+IDTVFSMLIQQEREI +SV+D Sbjct: 175 IDSINIYRQQDYVIRFLKGLNDKFSHTKSQIMLMNPLPDIDTVFSMLIQQEREIGNSVID 234 Query: 837 PIVHDAPETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTI 1016 IV+DAP+ S LANS Y +KG+NR CTHC TNH + Sbjct: 235 SIVNDAPDKNSSNVFLANSSYGNFHGKYNSKGKGQHSG----SKGSNRFCTHCQGTNHIV 290 Query: 1017 DS*WVKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQYQG 1196 ++ W+KHGYP G+KGKGKN FQ +Q+N+ S Q DS ++S K PFG TQEQY G Sbjct: 291 ENCWIKHGYPIGYKGKGKNSFQSTQANSAAVPNSPMQLDS--TTSSTKPPFGFTQEQYHG 348 Query: 1197 ILSMVXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSSDHDWLQGSDWYS 1343 IL + N VST+PLA NSQSS+ ++ QGSDWYS Sbjct: 349 ILGL-----FQQLKHQPTPASNSVSTSPLAFNSQSSNGNELYQGSDWYS 392 >gb|PNX71626.1| flavonol sulfotransferase-like protein, partial [Trifolium pratense] Length = 591 Score = 461 bits (1186), Expect = e-154 Identities = 240/403 (59%), 
Positives = 284/403 (70%), Gaps = 14/403 (3%) Frame = +3 Query: 171 SYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHRLDLEEQRKIC*RNPPKL 350 +YSDFS+NSANPYYLHPNENPA+IL P L N H Q + +N K Sbjct: 3 TYSDFSSNSANPYYLHPNENPAVILVS------PPLDHKNYHTWSRSMQIALISKNKDKF 56 Query: 351 QN--------TDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQ 491 + DP+Y+PWIRCNTMVLAWIHRSLSESIA+S+ AG+WKNLRTRFSQ Sbjct: 57 IDGTLVKPSPLDPLYSPWIRCNTMVLAWIHRSLSESIARSVLWIDSAAGLWKNLRTRFSQ 116 Query: 492 GDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSV 671 GDIFRISD+Q+ELY+ RQGNL++SDYFT+LKV WDELE+YRP+P CKC+IACTCGA++S Sbjct: 117 GDIFRISDLQEELYRLRQGNLDVSDYFTKLKVLWDELENYRPIPFCKCSIACTCGAIESF 176 Query: 672 KIYREQDYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVLDPIVH 848 K+YREQDYVIRFLKGLNDRFS TKSQIM M PLP++DTVFSMLIQQEREI +S+LDPI H Sbjct: 177 KVYREQDYVIRFLKGLNDRFSNTKSQIMLMNPLPDVDTVFSMLIQQEREIAYSILDPITH 236 Query: 849 DAPETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*W 1028 DAPE + ST LLANS Y KG NR+CT+C TNH + + W Sbjct: 237 DAPEVDSSTALLANSHYRNQNGKTNYYGKGKGQAPNSAPKGYNRLCTYCKGTNHIVQNCW 296 Query: 1029 VKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSM 1208 +K+GYPPG+K KGKN Q S+ V A +SSTQ DS Q+S + PFGLTQ+QY GILSM Sbjct: 297 IKYGYPPGYKNKGKNSSQ--PSHTVAAVDSSTQPDS-QSSTTATPPFGLTQDQYDGILSM 353 Query: 1209 VXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSSDHDWLQGSDW 1337 + N VSTTPLAL+SQSS+ +DW QGS W Sbjct: 354 I-----QQSKSQPTPTVNSVSTTPLALHSQSSTSNDWYQGSXW 391 >dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subterraneum] Length = 1178 Score = 448 bits (1152), Expect = e-142 Identities = 236/402 (58%), Positives = 279/402 (69%), Gaps = 14/402 (3%) Frame = +3 Query: 171 SYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHRLDLEEQRKIC*RNPPKL 350 +YSDFSTNSANPYYLHPNENPA+IL P L N H Q + +N K Sbjct: 3 TYSDFSTNSANPYYLHPNENPAVILVS------PPLDHKNYHTWSRSMQIALISKNKDKF 56 Query: 351 QN--------TDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQ 491 + DP+Y+PWIRCNTMVLAWIHRSLS+SIA+S+ A +WKNLRTRFSQ Sbjct: 57 
IDGTLVKPSPLDPLYSPWIRCNTMVLAWIHRSLSDSIARSVLWIDSAASLWKNLRTRFSQ 116 Query: 492 GDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSV 671 GDIFRISD+Q+ELY+ RQGNL++SDYFT+L+V WDELE+YRP+PLCKC+IACTCGAV+S Sbjct: 117 GDIFRISDLQEELYRLRQGNLDVSDYFTKLQVLWDELENYRPIPLCKCSIACTCGAVESF 176 Query: 672 KIYREQDYVIRFLKGLNDRFSQTKSQIMM-KPLPEIDTVFSMLIQQEREITHSVLDPIVH 848 K+YREQDYVIRFLKGLNDRFS TKSQIM+ PLP++DTVFSMLIQQEREI +S+LDPI H Sbjct: 177 KLYREQDYVIRFLKGLNDRFSNTKSQIMLINPLPDVDTVFSMLIQQEREIAYSILDPITH 236 Query: 849 DAPETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*W 1028 DAPE + ST LLANS Y KG NR+CTHC TNH + W Sbjct: 237 DAPEVDFSTALLANSHYKNQNGKSNYYGKGRGQAPNSAPKGHNRLCTHCRGTNHIVQDCW 296 Query: 1029 VKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSM 1208 +K+GYPPG+K KN Q S+ V A +SSTQ+DS Q S + PFGLTQ QY GI+SM Sbjct: 297 IKYGYPPGYKNNRKNSSQ--PSHIVAAVDSSTQHDS-QFSNTATPPFGLTQVQYDGIISM 353 Query: 1209 VXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSSDHDWLQGSD 1334 + N VSTTPLA +SQSS+ +DW QGSD Sbjct: 354 I-----QQSKSQPTPTVNSVSTTPLAFHSQSSNSNDWYQGSD 390 >gb|PNY08535.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 1205 Score = 443 bits (1140), Expect = e-140 Identities = 242/424 (57%), Positives = 287/424 (67%), Gaps = 22/424 (5%) Frame = +3 Query: 171 SYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHRLDLEEQRKIC*RNPPKL 350 +YSDFSTNSANPYYLHPNENPA+IL P L N H Q + +N K Sbjct: 3 TYSDFSTNSANPYYLHPNENPAMILVS------PPLDHKNYHTWSRSMQIALISKNKDKF 56 Query: 351 QN--------TDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQ 491 + DP+++PWIRCNTMVLAW+HRS+SESIA+SI AGVWKNLR RFSQ Sbjct: 57 IDGTLVKPSPLDPLFSPWIRCNTMVLAWLHRSVSESIARSILWIDSAAGVWKNLRIRFSQ 116 Query: 492 GDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSV 671 GDIFRISDIQ+ELY+FRQGNL+ISDYFT+LKV WDELE+YRP+PLCKC+I CTCGA+DS Sbjct: 117 GDIFRISDIQEELYRFRQGNLDISDYFTKLKVLWDELENYRPIPLCKCSIPCTCGAIDSF 176 Query: 672 KIYREQDYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVLDPIVH 848 K+YREQDYVIRFLKGLNDRFS TKSQIM M PLP++DTVFSMLIQQEREI 
+S+LDPI H Sbjct: 177 KVYREQDYVIRFLKGLNDRFSNTKSQIMLMNPLPDVDTVFSMLIQQEREIAYSILDPITH 236 Query: 849 DAPETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*W 1028 DAPE + ST LLANS KG +R+CT+C TNH + + W Sbjct: 237 DAPEVDSSTALLANSHSRNQNGKSNYYGKGKGQAPNSAPKGHDRLCTYCKGTNHVVQNCW 296 Query: 1029 VKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSM 1208 +K+GYPPG+K KGKN Q S+ V A +SSTQ DS Q+S + PFGLTQ+QY GILSM Sbjct: 297 IKYGYPPGYKNKGKNSSQ--PSHTVAAVDSSTQLDS-QSSTTATPPFGLTQDQYDGILSM 353 Query: 1209 VXXXXXXXXXXXXXXXXNFVSTTPLALNSQSS-----SDHDWL---QGSDWYSWDAKGIV 1364 + N VSTTPLAL+SQSS S + W+ +D ++D K Sbjct: 354 I-----RQSKSQPTPTVNSVSTTPLALHSQSSTNNGKSSNFWILDTGATDHITYDIKTFN 408 Query: 1365 YLRH 1376 RH Sbjct: 409 SYRH 412 >gb|PNX55412.1| hypothetical protein L195_g049041, partial [Trifolium pratense] Length = 338 Score = 384 bits (986), Expect = e-127 Identities = 200/331 (60%), Positives = 234/331 (70%), Gaps = 9/331 (2%) Frame = +3 Query: 342 PKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFR 506 PK TDP+Y PWIRCNTMVLAWIHRS+S+SIA+S+ AGVWKNL+ RFSQGDIFR Sbjct: 19 PKPSITDPLYGPWIRCNTMVLAWIHRSISDSIARSVLWIDTAAGVWKNLKIRFSQGDIFR 78 Query: 507 ISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSVKIYRE 686 ISDIQ+ELYKFRQG L+ISDYFTQLKV WDELE+YRP+P CKC+IACTCGA+DS+ IYR+ Sbjct: 79 ISDIQEELYKFRQGTLDISDYFTQLKVLWDELENYRPIPHCKCSIACTCGAIDSINIYRQ 138 Query: 687 QDYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPET 863 QDYVIRFLKGLNDRFS TKSQIM M PLP+IDTVFSMLIQQEREI +SV+D IV+DAP+ Sbjct: 139 QDYVIRFLKGLNDRFSHTKSQIMLMNPLPDIDTVFSMLIQQEREIGNSVIDSIVNDAPDR 198 Query: 864 EVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGY 1043 S LLANS Y +KG NR CT+C TNH +++ W+KHGY Sbjct: 199 NSSNVLLANSYYGKYNSKGKGQNSG--------SKGGNRFCTYCKGTNHIVENCWIKHGY 250 Query: 1044 PPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQ---ASASNKAPFGLTQEQYQGILSMVX 1214 P G+KGKGKN Q +Q N+V A + S Q ++S K FG TQEQY GIL + Sbjct: 251 PIGYKGKGKNLSQSTQVNSVAAPNAVVPKSSLQLDSTTSSTKPLFGFTQEQYHGILGL-- 308 Query: 1215 XXXXXXXXXXXXXXXNFVSTTPLALNSQSSS 
1307 N VST+PL NSQSS+ Sbjct: 309 ---FQQLQSQPSPSSNSVSTSPLVFNSQSSN 336 >ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798459 [Glycine max] Length = 389 Score = 382 bits (981), Expect = e-126 Identities = 209/413 (50%), Positives = 267/413 (64%), Gaps = 18/413 (4%) Frame = +3 Query: 159 MATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEE 314 MA ++ DFSTNSANPYYLHPNENPAL+L P L N H L + Sbjct: 1 MALQNFVDFSTNSANPYYLHPNENPALVLVS------PSLTAKNYHTWSHSMHIALISKN 54 Query: 315 QRKIC*RNPPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRT 479 + K + PK +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+ AGVWKNLR Sbjct: 55 KDKFIDGSLPKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRI 114 Query: 480 RFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGA 659 RFSQ DIFRISD+Q++LY+FRQG L++SDYFTQLK++WDELE+YRP+P CKC+I C+CG Sbjct: 115 RFSQSDIFRISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGG 174 Query: 660 VDSVKIYREQDYVIRFLKGLNDRFSQTKSQI-MMKPLPEIDTVFSMLIQQEREITHSVLD 836 +DSV++YREQDYV+RFLKGLNDRFS +KSQI MM PLP+ID VFS++IQQERE+ S D Sbjct: 175 IDSVRVYREQDYVVRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSD 234 Query: 837 PIVHDAPETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTI 1016 + ++ ++ + +N +KG NRVCTHC +TNH + Sbjct: 235 SVSEATSDSAMAMQVNSNQSNFNGKGGYYNKGKG-------SSKGGNRVCTHCGKTNHIV 287 Query: 1017 DS*WVKHGYPPGFK-GKGKNPFQQSQSN---NVNASESSTQNDSAQASASNKAPFGLTQE 1184 D+ + K GYPPG+K K KN SQ+N N +A ES+ Q SAQ+S F TQE Sbjct: 288 DNCFEKIGYPPGYKTNKSKNSSSSSQANNTSNASALESTQQGSSAQSS------FQFTQE 341 Query: 1185 QYQGILSMVXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSSDHDWLQGSDWYS 1343 YQGIL + N V+T+P AL+S SS+ ++ G+DWYS Sbjct: 342 MYQGILEAL-----QQSKVGSQPKANSVTTSPFALHSPSSNPNESFSGNDWYS 389 >gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Glycine soja] Length = 484 Score = 364 bits (935), Expect = e-117 Identities = 202/393 (51%), Positives = 254/393 (64%), Gaps = 18/393 (4%) Frame = +3 Query: 183 FSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEEQRKIC*RN 338 FSTNSANPYYLHPNENPAL+L P L N H L + + K + Sbjct: 
1 FSTNSANPYYLHPNENPALVLVS------PSLTAKNYHTWSRSMHIALISKNKDKFIDGS 54 Query: 339 PPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIF 503 PK +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+ AGVWKNLR RFSQ DIF Sbjct: 55 LPKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIF 114 Query: 504 RISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSVKIYR 683 RISD+Q++LY+FRQG L++SDYFTQLK++WDELE+YRP+P CKC+I C+CG +DSV++YR Sbjct: 115 RISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYR 174 Query: 684 EQDYVIRFLKGLNDRFSQTKSQI-MMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPE 860 EQDYVIRFLKGLNDRFS +KSQI MM PLP+ID VFS++IQQERE+ S D + + Sbjct: 175 EQDYVIRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSD 234 Query: 861 TEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHG 1040 + ++ + +N +KG NRVCTHC +TNH +D+ + K G Sbjct: 235 SAMAMQVNSNQSNFNGKGGYYNKGKG-------SSKGGNRVCTHCGKTNHIVDNCFEKIG 287 Query: 1041 YPPGFK-GKGKNPFQQSQSN---NVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSM 1208 YPPG+K K KN SQ+N N +A ES+ Q SAQ+S F TQE YQGIL Sbjct: 288 YPPGYKTNKSKNSSSSSQANNTSNASALESTQQGSSAQSS------FQFTQEMYQGILEA 341 Query: 1209 VXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSS 1307 + N V+T+P AL+S SS+ Sbjct: 342 L-----QQSKVGSQPKANLVTTSPFALHSPSSN 369 Score = 87.0 bits (214), Expect = 2e-14 Identities = 42/89 (47%), Positives = 55/89 (61%) Frame = +1 Query: 1321 SKAVIGIAGMRRGLYILDIEDPXXXXXXXXXXXXXXXNVSHGDSQLWHLRLGHISDIGLK 1500 S IG A ++RGLY++D D ++S +LWH RLGH+S+ G++ Sbjct: 404 SLETIGTAKLQRGLYVIDTAD----------MIRSCNSISSHSFELWHSRLGHVSNSGMQ 453 Query: 1501 TISKQFPFISSSNNMLPCDSCHFAKQKKL 1587 ISKQFPFI NNM PCDSCHF+KQK+L Sbjct: 454 AISKQFPFIPCKNNMSPCDSCHFSKQKRL 482 >gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Glycine soja] Length = 484 Score = 363 bits (933), Expect = e-117 Identities = 202/393 (51%), Positives = 254/393 (64%), Gaps = 18/393 (4%) Frame = +3 Query: 183 FSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEEQRKIC*RN 338 FSTNSANPYYLHPNENPAL+L P L N H L + + K + Sbjct: 1 
FSTNSANPYYLHPNENPALVLVS------PSLTAKNYHTWSRSMHIALISKNKDKFIDGS 54 Query: 339 PPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIF 503 PK +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+ AGVWKNLR RFSQ DIF Sbjct: 55 LPKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIF 114 Query: 504 RISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSVKIYR 683 RISD+Q++LY+FRQG L++SDYFTQLK++WDELE+YRP+P CKC+I C+CG +DSV++YR Sbjct: 115 RISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYR 174 Query: 684 EQDYVIRFLKGLNDRFSQTKSQI-MMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPE 860 EQDYVIRFLKGLNDRFS +KSQI MM PLP+ID VFS++IQQERE+ S D + + Sbjct: 175 EQDYVIRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSD 234 Query: 861 TEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHG 1040 + ++ + +N +KG NRVCTHC +TNH +D+ + K G Sbjct: 235 SAMAMQVNSNQSNFNGKGGYYNKGKG-------SSKGGNRVCTHCGKTNHIVDNCFEKIG 287 Query: 1041 YPPGFK-GKGKNPFQQSQSN---NVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSM 1208 YPPG+K K KN SQ+N N +A ES+ Q SAQ+S F TQE YQGIL Sbjct: 288 YPPGYKTNKSKNSSSSSQANNTSNASALESTQQGSSAQSS------FQFTQEMYQGILEA 341 Query: 1209 VXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSS 1307 + N V+T+P AL+S SS+ Sbjct: 342 L-----QQSKVGSQPKANSVTTSPFALHSPSSN 369 Score = 87.0 bits (214), Expect = 2e-14 Identities = 42/89 (47%), Positives = 55/89 (61%) Frame = +1 Query: 1321 SKAVIGIAGMRRGLYILDIEDPXXXXXXXXXXXXXXXNVSHGDSQLWHLRLGHISDIGLK 1500 S IG A ++RGLY++D D ++S +LWH RLGH+S+ G++ Sbjct: 404 SLETIGTAKLQRGLYVIDTAD----------MIRSCNSISSHSFELWHSRLGHVSNSGMQ 453 Query: 1501 TISKQFPFISSSNNMLPCDSCHFAKQKKL 1587 ISKQFPFI NNM PCDSCHF+KQK+L Sbjct: 454 AISKQFPFIPCKNNMSPCDSCHFSKQKRL 482 >gb|PNX91084.1| hypothetical protein L195_g047213, partial [Trifolium pratense] Length = 417 Score = 352 bits (903), Expect = e-113 Identities = 194/377 (51%), Positives = 252/377 (66%), Gaps = 19/377 (5%) Frame = +3 Query: 129 LRLKKPLLPIMATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--- 299 +++++ L+ MA +Y DF TNSANPYYLHPNENPAL+L P L N H Sbjct: 48 
VKIRRLLVGTMALQNYIDFPTNSANPYYLHPNENPALVLVS------PSLTAKNYHTWSR 101 Query: 300 -----LDLEEQRKIC*RNPPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI----- 449 L + + K + PK +DP+YAPWIRCNTMVLAWIHRS+SESIA+S+ Sbjct: 102 SMHIALISKNKEKFIDGSLPKPPVSDPLYAPWIRCNTMVLAWIHRSISESIARSVLWIET 161 Query: 450 PAGVWKNLRTRFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLC 629 AGVWKNLR RFSQ DIFRISD+Q+++Y+FRQG L++SDYFTQLKV+WDELE+YRPLP C Sbjct: 162 AAGVWKNLRVRFSQSDIFRISDLQEDMYRFRQGTLDVSDYFTQLKVYWDELENYRPLPYC 221 Query: 630 KCAIACTCGAVDSVKIYREQDYVIRFLKGLNDRFSQTKSQI-MMKPLPEIDTVFSMLIQQ 806 KC+I C+CG +DSV+ YREQD+VIRFLKGLN+RFS +KSQI MM PLP+ID FS++IQQ Sbjct: 222 KCSIPCSCGVIDSVRAYREQDFVIRFLKGLNERFSHSKSQIMMMNPLPDIDRAFSLVIQQ 281 Query: 807 EREI-----THSVLDPIVHDAPETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKG 971 ERE+ + SV + A +V++T +NS ++ Sbjct: 282 EREMLSFNNSDSVSEATSDSAMVMQVNST-KSNSHGKKSFXYKEKGQG--------SSQS 332 Query: 972 TNRVCTHCNRTNHTIDS*WVKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASA 1151 NRVCTHC +TNH +D+ + K GYPPG+K N F +S S+ VN + S++ +S Q + Sbjct: 333 GNRVCTHCGKTNHIVDNCFEKIGYPPGYK---TNKF-KSSSSQVNNTSSASALESVQQGS 388 Query: 1152 SNKAPFGLTQEQYQGIL 1202 S ++ F TQE YQGIL Sbjct: 389 SAQSNFQFTQEMYQGIL 405 >gb|KHN34741.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 495 Score = 340 bits (871), Expect = e-107 Identities = 182/346 (52%), Positives = 231/346 (66%), Gaps = 18/346 (5%) Frame = +3 Query: 192 NSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEEQRKIC*RNPPK 347 NSANPYYLHPNENPAL+L P L N H L + + K + PK Sbjct: 1 NSANPYYLHPNENPALVLVS------PSLTAKNYHTWSRSMHIALISKNKDKFIDGSLPK 54 Query: 348 LQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFRIS 512 +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+ AGVWKNLR RFS DIFRIS Sbjct: 55 PPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSHSDIFRIS 114 Query: 513 DIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSVKIYREQD 692 D+Q++LY+FRQG L++SDYFTQLK++WDELE+YRP+P CKC+I C+CG +DSV++YREQD Sbjct: 115 DLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPYCKCSIPCSCGGIDSVRVYREQD 174 
Query: 693 YVIRFLKGLNDRFSQTKSQI-MMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPETEV 869 YVIRFLKGLNDRFS +KSQI MM PLP+ID VFS++IQQERE+ S D + ++ + Sbjct: 175 YVIRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAM 234 Query: 870 STTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGYPP 1049 + + +N +KG NRVCTHC +TNH +D+ + K GYPP Sbjct: 235 AMQVNSNQSNFNGKGGYYNKGKG-------SSKGGNRVCTHCGKTNHIVDNCFEKIGYPP 287 Query: 1050 GFK-GKGKNPFQQSQSN---NVNASESSTQNDSAQASASNKAPFGL 1175 G+K K KN SQ+N N +A ES+ Q SAQ+ + +PF L Sbjct: 288 GYKTNKSKNSSSSSQANNTSNASALESTQQGSSAQSITT--SPFAL 331 Score = 64.7 bits (156), Expect = 3e-07 Identities = 26/39 (66%), Positives = 33/39 (84%) Frame = +1 Query: 1471 LGHISDIGLKTISKQFPFISSSNNMLPCDSCHFAKQKKL 1587 +GH+S+ G++ ISKQFPFI NNM PCDSCHF+KQK+L Sbjct: 443 IGHVSNSGMQAISKQFPFIPCKNNMSPCDSCHFSKQKRL 481 >gb|PNX71325.1| hypothetical protein L195_g027200, partial [Trifolium pratense] Length = 655 Score = 323 bits (828), Expect = 2e-99 Identities = 189/450 (42%), Positives = 244/450 (54%), Gaps = 54/450 (12%) Frame = +3 Query: 156 IMATTSYSDFSTNSANPYYLHPNENPALILFL--------HH*IELPHLGKINAHRLDLE 311 IMA +Y+D+ TN +NP+YLHPNENP+++L H+ L H+ I+ ++ Sbjct: 224 IMAFPNYTDYLTNPSNPFYLHPNENPSVVLVTPLLDNKNYHNWARLMHIALISKNK---- 279 Query: 312 EQRKIC*RNPPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLR 476 K K DPM+A WIRCN MVLAW HRS+SESIA+SI AGVW +L+ Sbjct: 280 --EKFIDGTFSKPPTNDPMFAQWIRCNNMVLAWFHRSVSESIAKSILSISTAAGVWSDLK 337 Query: 477 TRFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCG 656 RFSQGDIFRISDIQ+ELY+FRQGNL++SDYFT L+V+WDELE YRP+P CKC+IACTCG Sbjct: 338 NRFSQGDIFRISDIQEELYRFRQGNLDVSDYFTGLRVYWDELEDYRPIPYCKCSIACTCG 397 Query: 657 AVDSVKIYREQDYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVL 833 S+K +REQDYVIRFLKGLN+RF+ TKS IM M PLP + FS+++QQERE+ + + Sbjct: 398 GYTSMKQFREQDYVIRFLKGLNERFTHTKSHIMAMDPLPTVSKAFSLVLQQERELLGNGI 457 Query: 834 DPIVHDAPETEVSTT----------------------------LLANSQYXXXXXXXXXX 929 D ++ +LAN Sbjct: 458 TTSQTDENAIALAANASRNASNYGSKNASNYGSGTSRNRGNPPVLANPSNFSGNNAANGH 
517 Query: 930 XXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGYPPGFKGKGKNPFQQSQSNNVNA 1109 G NR+CT+C RTNH ID + HG+PPG+K KGK SQ+N+ Sbjct: 518 GRGKNFYANKGPSGQNRMCTYCGRTNHIIDGCFELHGFPPGYKPKGK-----SQANSAQT 572 Query: 1110 SESSTQNDSAQASASNKAPFGLTQEQYQGILSMV-XXXXXXXXXXXXXXXXNFVSTTPLA 1286 S Q+ + Q S G TQEQ+QGIL+++ N V T P A Sbjct: 573 DASVAQHQAPQFS-------GFTQEQFQGILTLIQQSQQPHSGSTSAVHQSNSVMTHPFA 625 Query: 1287 LNSQSSS-----------DHDWLQGSDWYS 1343 N S+ D + Q DWYS Sbjct: 626 FNCDSNKTSGKSPFVWILDTEQFQEDDWYS 655 >ref|XP_014621696.1| PREDICTED: uncharacterized protein LOC106795617 [Glycine max] Length = 275 Score = 298 bits (762), Expect = 2e-94 Identities = 151/258 (58%), Positives = 190/258 (73%), Gaps = 14/258 (5%) Frame = +3 Query: 159 MATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEE 314 MA ++ DFSTNSANPYYLHPNENPAL+L P L N H L + Sbjct: 1 MALQNFVDFSTNSANPYYLHPNENPALVLVS------PSLTAKNYHTWSRSMHIALISKN 54 Query: 315 QRKIC*RNPPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRT 479 + K + PK +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+ AGVWKNLR Sbjct: 55 KDKFIDGSLPKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRI 114 Query: 480 RFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGA 659 RFSQ DIFRISD+Q++LY+FRQG L++SDYFTQLK++WDELE+YRP+P CKC+I C+CG Sbjct: 115 RFSQSDIFRISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGG 174 Query: 660 VDSVKIYREQDYVIRFLKGLNDRFSQTKSQI-MMKPLPEIDTVFSMLIQQEREITHSVLD 836 +DSV++Y EQDYVIRFLKGLNDRFS +KSQI MM PLP+ID VFS++IQQERE+ S D Sbjct: 175 IDSVRVYCEQDYVIRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSD 234 Query: 837 PIVHDAPETEVSTTLLAN 890 + ++ ++ + +N Sbjct: 235 SVSEATSDSAMAMQVNSN 252 >ref|XP_014626210.1| PREDICTED: uncharacterized protein LOC106797041 [Glycine max] Length = 275 Score = 296 bits (758), Expect = 7e-94 Identities = 150/258 (58%), Positives = 190/258 (73%), Gaps = 14/258 (5%) Frame = +3 Query: 159 MATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEE 314 MA +++DFSTNSANPYYLHPNENP L+L P L N H L + Sbjct: 1 
MALQNFADFSTNSANPYYLHPNENPTLVLVS------PSLTAKNYHTWSRSMHIALISKN 54 Query: 315 QRKIC*RNPPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRT 479 + K + PK +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+ AGVWKNLR Sbjct: 55 KDKFIDGSLPKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRI 114 Query: 480 RFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGA 659 RFSQ DIFRISD+Q++LY+FRQG L++SDYFTQLK++WDELE+YRP+P KC+I C+CG Sbjct: 115 RFSQSDIFRISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHYKCSIPCSCGG 174 Query: 660 VDSVKIYREQDYVIRFLKGLNDRFSQTKSQI-MMKPLPEIDTVFSMLIQQEREITHSVLD 836 +DSV++YREQDYVIRFLKGLNDRFS +KSQI MM PLP+ID VFS++IQQERE+ S D Sbjct: 175 IDSVRVYREQDYVIRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSD 234 Query: 837 PIVHDAPETEVSTTLLAN 890 + ++ ++ + +N Sbjct: 235 SVSEATSDSAMAMQVNSN 252 >ref|XP_014627175.1| PREDICTED: uncharacterized protein LOC106797397 [Glycine max] Length = 275 Score = 294 bits (753), Expect = 4e-93 Identities = 150/258 (58%), Positives = 189/258 (73%), Gaps = 14/258 (5%) Frame = +3 Query: 159 MATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEE 314 MA ++ DFSTNSANPYYLHPNENPAL+L P L N H L + Sbjct: 1 MALQNFVDFSTNSANPYYLHPNENPALVLVS------PSLTAKNYHTWSRSMHIALISKN 54 Query: 315 QRKIC*RNPPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRT 479 + K + PK +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+ AGVWKNLR Sbjct: 55 KDKFIDGSLPKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRI 114 Query: 480 RFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGA 659 RFSQ DIFRISD+Q++LY+FRQG L++SDYFTQLK++WDELE+YRP+P CKC+I +CG Sbjct: 115 RFSQSDIFRISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPYSCGG 174 Query: 660 VDSVKIYREQDYVIRFLKGLNDRFSQTKSQI-MMKPLPEIDTVFSMLIQQEREITHSVLD 836 +DSV++YREQDYVIR LKGLNDRFS +KSQI MM PLP+ID VFS++IQQERE+ S D Sbjct: 175 IDSVRVYREQDYVIRLLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSD 234 Query: 837 PIVHDAPETEVSTTLLAN 890 + ++ ++ + +N Sbjct: 235 SVSEATSDSAMAMQVNSN 252 >ref|XP_014632403.1| PREDICTED: uncharacterized protein LOC106798995 [Glycine max] Length = 277 Score = 276 bits 
(706), Expect = 5e-86 Identities = 138/219 (63%), Positives = 166/219 (75%), Gaps = 14/219 (6%) Frame = +3 Query: 159 MATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEE 314 MA ++ DFSTNSANPYYLHPNENPAL+L P L N H L + Sbjct: 1 MALQNFVDFSTNSANPYYLHPNENPALVLVS------PSLTAKNYHTWSRSMHIALISKN 54 Query: 315 QRKIC*RNPPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRT 479 + K + PK DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+ AGVWKNLR Sbjct: 55 KDKFIDGSLPKPPVFDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRI 114 Query: 480 RFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGA 659 RFSQ DIFRISD+Q++LY+FRQG L++SDYFTQLK++WDELE+YRP+P CKC+I C+CG Sbjct: 115 RFSQSDIFRISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGG 174 Query: 660 VDSVKIYREQDYVIRFLKGLNDRFSQTKSQI-MMKPLPE 773 +DSV++YREQDYVIRFLKGLNDRFS +KSQI MM PLP+ Sbjct: 175 IDSVRVYREQDYVIRFLKGLNDRFSHSKSQIMMMNPLPD 213 >gb|KYP65733.1| hypothetical protein KK1_011995 [Cajanus cajan] gb|KYP72745.1| hypothetical protein KK1_005345 [Cajanus cajan] Length = 445 Score = 279 bits (713), Expect = 8e-85 Identities = 158/404 (39%), Positives = 233/404 (57%), Gaps = 21/404 (5%) Frame = +3 Query: 159 MATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHRLDLEEQRKIC*RN 338 M SY+DF+TN NPYYLHPNE P+L+L + + A R+ L + K+ + Sbjct: 1 MEDQSYADFTTNPYNPYYLHPNETPSLVLVTPLLDGKNYHTRARAMRMALMSKHKVKFID 60 Query: 339 ----PPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQ 491 PP ++ ++ PW RCNTMV++W+ S+SE I +SI + +W++L+ RFSQ Sbjct: 61 GTLTPP--HSSSILFEPWGRCNTMVISWLQHSISEKIVKSILWFDTASDIWQDLKARFSQ 118 Query: 492 GDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSV 671 GD+FR++ +Q++LYKF QG+L++++YFTQLK WDE+++ RPL CKC+IAC+CGAVDS Sbjct: 119 GDVFRVAQLQEDLYKFHQGSLDVTEYFTQLKEMWDEIDNLRPLSRCKCSIACSCGAVDSS 178 Query: 672 KIYREQDYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVLDPIVH 848 YREQD VIRFL+GLND+++ +SQIM M PLP + FS++ QQER + S +H Sbjct: 179 YKYREQDAVIRFLRGLNDQYTHVRSQIMLMDPLPSLSKTFSLVGQQERHLNQSA----IH 234 Query: 849 
DAPETEVSTTLLA-----------NSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHC 995 D + +T+ + + Q G+ ++CTHC Sbjct: 235 DDTKVLAATSFGSLPQTPTTQQHQSPQQQQFGFRRGGYSHGRGRGRGGRTHGSIKICTHC 294 Query: 996 NRTNHTIDS*WVKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGL 1175 R NHT+D+ + KHG+PPG++ KG S + VNA E T + S+ SN FG Sbjct: 295 GRNNHTVDTCYFKHGFPPGYQSKGGT----SANFTVNAVE--TTSPSSMVPESNNPNFGF 348 Query: 1176 TQEQYQGILSMVXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSS 1307 TQEQ Q +LS++ N V ++PLA+N S++ Sbjct: 349 TQEQCQELLSLL--QQSKTIPTPSSHSANSVVSSPLAMNFNSNA 390 >dbj|GAU50616.1| hypothetical protein TSUD_290710 [Trifolium subterraneum] Length = 404 Score = 271 bits (693), Expect = 2e-82 Identities = 161/417 (38%), Positives = 227/417 (54%), Gaps = 22/417 (5%) Frame = +3 Query: 159 MATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHRLDLEEQRKIC*RN 338 MA Y+DF+TN NPYY+HPNENP++IL P L N + + +N Sbjct: 1 MANQPYADFATNPTNPYYIHPNENPSIILVT------PLLDHKNYQTWSRSMKVALISKN 54 Query: 339 PPKLQN--------TDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRT 479 K + +DP++ PWIRCN MVL+WI RS+SE+I +SI A VWK L Sbjct: 55 KLKFVDGTLPLPHVSDPLHEPWIRCNNMVLSWIQRSISETIVKSIMWCDCAAVVWKCLER 114 Query: 480 RFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGA 659 RF+ GDIFRI+DI +E+ +++QG L+IS YFT L W+ELE++RPL C CAI CTCGA Sbjct: 115 RFAHGDIFRIADILEEIARYQQGTLDISSYFTHLTTLWEELENFRPLKDCSCAIPCTCGA 174 Query: 660 VDSVKIYREQDYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVL- 833 +K Y+EQD VI+FLKGLN++++ +SQIM + PLP+ID FS+++QQER++ ++ Sbjct: 175 ASDLKKYKEQDKVIKFLKGLNEQYASVRSQIMLLDPLPDIDRCFSLVLQQERQMLIPIIT 234 Query: 834 -DPIVHDAPETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKG-TNRVCTHCNRTN 1007 + + A +V T + ++ +G NR CTHC R N Sbjct: 235 DNSVDQQASIMQVRQTSYNHGKHYTSFSSTHHGGRGRGRGNHHGGRGPNNRTCTHCGRHN 294 Query: 1008 HTIDS*WVKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQ 1187 H +D+ + HGYPPG++ K S+S NV A+ S+ + ++ A QEQ Sbjct: 295 HIVDTCFELHGYPPGYQHK------NSKSVNVAATASNATLKEGHINLTS-ATINTIQEQ 347 Query: 1188 YQGIL-----SMVXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSSDHDWLQGSDWYS 1343 Y IL S + N + + P 
ALNS SS D+ SDW S Sbjct: 348 YNQILQLLQHSALQASSTPSNPSPTQASANSIISLPTALNSSSSPTFDFNPNSDWCS 404 >ref|XP_006576053.1| PREDICTED: uncharacterized protein LOC102662412 [Glycine max] Length = 424 Score = 266 bits (680), Expect = 3e-80 Identities = 151/389 (38%), Positives = 224/389 (57%), Gaps = 14/389 (3%) Frame = +3 Query: 171 SYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHRLDLEEQRKIC*----RN 338 SYSDF+TN +NPYY+HPNENP+LIL + + ++ L + K+ + Sbjct: 4 SYSDFATNPSNPYYMHPNENPSLILVQPVLDNKNYQIWCRSMKVALISKNKVKFVDGTLS 63 Query: 339 PPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIF 503 PP + +DP+Y PW+RCN +VL+W+ RS SE IA+S+ + VWK+L RFSQGDIF Sbjct: 64 PPPI--SDPLYEPWLRCNNLVLSWLQRSTSEEIAKSLLWCDRASFVWKSLENRFSQGDIF 121 Query: 504 RISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSVKIYR 683 R++DIQ+E+ +QG L+IS YFT+L W+E+E++RP+ C CAI C+CGA ++ ++ Sbjct: 122 RVADIQEEVACLQQGTLDISSYFTKLMTLWEEIENFRPIRDCTCAIPCSCGAATDLRKFK 181 Query: 684 EQDYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPE 860 EQD VI+FLKGL D++S +SQIM M PLP +D F++++QQER+ + + Sbjct: 182 EQDKVIKFLKGLGDQYSHVRSQIMLMSPLPTLDNAFNLILQQERQFN-------LPSTTD 234 Query: 861 TEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHG 1040 + + N +G NR+CTHCNRTNHT+++ ++KHG Sbjct: 235 SSIENQSSVNHFSQTPSRPSNNSGCGRGRGYSSGGRG-NRLCTHCNRTNHTVETCFIKHG 293 Query: 1041 YPPGFKGKGKNPF-QQSQSNNVNASESSTQNDSAQASAS---NKAPFGLTQEQYQGILSM 1208 YPPGF+ + N S N+V + S+ + S+ AS S + A QEQY IL + Sbjct: 294 YPPGFQHRKSNSSGNASVVNSVQDAGSAHISSSSSASTSTNGSSASLSTIQEQYTQILQL 353 Query: 1209 VXXXXXXXXXXXXXXXXNFVSTTPLALNS 1295 + N ST+P ++NS Sbjct: 354 L-------------QQSNLQSTSPSSVNS 369 >ref|XP_019455138.1| PREDICTED: uncharacterized protein LOC109356267 [Lupinus angustifolius] Length = 834 Score = 273 bits (698), Expect = 9e-79 Identities = 156/367 (42%), Positives = 211/367 (57%), Gaps = 20/367 (5%) Frame = +3 Query: 171 SYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHRLDLEEQRKIC*RNP--P 344 ++S S NP++LH NENPAL+L + A RL LE + K+ N P Sbjct: 3 
NHSILSDQLNNPFFLHSNENPALVLVTPPMNTKDYHSWARAMRLTLESKNKLNFINGSLP 62 Query: 345 KLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFRI 509 + DP+Y PW+RCNTMVL+WI + ESI +SI A WK+L RFS GDIFRI Sbjct: 63 RPSPKDPLYGPWVRCNTMVLSWIQHCVDESIVKSILWIDTTAEAWKDLHDRFSHGDIFRI 122 Query: 510 SDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSVKIYREQ 689 + +Q E Y QGNL+ISDYFT+LK WDE+E +RP P CKC C CGA+DS+K Y+EQ Sbjct: 123 AALQKEFYHLDQGNLDISDYFTKLKTLWDEIEDFRPFPSCKCNTPCICGAMDSLKTYKEQ 182 Query: 690 DYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPETE 866 DYVIRFL+GLN++F+ KSQIM M PLP I F++LIQQER+ V + D Sbjct: 183 DYVIRFLEGLNEQFAHVKSQIMLMDPLPNITKAFALLIQQERQTQLPVPPSLEPDNRVMN 242 Query: 867 VSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKG------------TNRVCTHCNRTNH 1010 VS+ +SQY +P +G NR CT+C RTNH Sbjct: 243 VSSR--QDSQY-----RNNSTNNSFRGRGIIPFRGRGNRAAGFGRGQNNRFCTYCERTNH 295 Query: 1011 TIDS*WVKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQY 1190 TI++ ++KHGYPPG++ + + + ++ST N++A + +N F T+EQ Sbjct: 296 TIETCYLKHGYPPGYQSTRSSKMVNHTTG--YSFDTSTNNEAAHQTQNNSTSF--TKEQV 351 Query: 1191 QGILSMV 1211 QGIL ++ Sbjct: 352 QGILDLL 358 >ref|XP_016673106.1| PREDICTED: uncharacterized protein LOC107892548 [Gossypium hirsutum] Length = 366 Score = 251 bits (642), Expect = 3e-75 Identities = 147/401 (36%), Positives = 219/401 (54%), Gaps = 16/401 (3%) Frame = +3 Query: 189 TNSANPYYLHPNENPALILFLHH*IELPHLGKINAHRLDLEEQRKIC*RNPPKLQN---- 356 T ++PYYLHPNENPAL+L P L N H + +N + N Sbjct: 7 TLPSSPYYLHPNENPALVLVS------PVLSSSNYHSWSRAMTMALLSKNKLQFVNGTIT 60 Query: 357 ----TDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFRI 509 TDP+Y+ W RCNTMVL+W+H S+S SI S+ + VW++LR RFSQGD+FRI Sbjct: 61 VPLRTDPLYSAWERCNTMVLSWLHHSISPSIMNSVLWLDFASDVWRDLRERFSQGDVFRI 120 Query: 510 SDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSVKIYREQ 689 SD+Q+E+ F+Q + ++DYFT+LK+ WDEL ++RP+P+C C +C+CG +++ Y + Sbjct: 121 SDLQEEINSFKQEDRSVTDYFTELKILWDELMNFRPIPVCSCPTSCSCGVFATLQKYHDN 180 Query: 690 DYVIRFLKGLNDRFSQTKSQIMM-KPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPETE 
866 DYVIRFLKGL+DRF+ +SQIM+ PLP I+ FS++IQQER + + + + Sbjct: 181 DYVIRFLKGLHDRFAAVRSQIMLIDPLPTINKAFSLVIQQERHL-------LAASSSQLF 233 Query: 867 VSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGYP 1046 VS TL + + +R CT C ++ HT+D+ + KHGYP Sbjct: 234 VSNTLRQHPSSRKSQP---------------KSASDSRQCTFCGKSRHTVDTCYEKHGYP 278 Query: 1047 PGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASA--SNKAPFGLTQEQYQGILSMVXXX 1220 PG+K +G+ S+++ V + D++Q+ A +P LTQ+Q Q +L+++ Sbjct: 279 PGYKSRGRT----SRAHAVLTDGEAPSLDASQSVAILPPDSPVTLTQDQLQQLLTLL--- 331 Query: 1221 XXXXXXXXXXXXXNFVSTTPLALNSQSSSDHDWLQGSDWYS 1343 N S+ PL S S + DWYS Sbjct: 332 ---PSSTSPTHVTNTASSLPL---QPSVSSGPSIFDDDWYS 366
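
The report above follows the legacy NCBI BLAST text layout: a hit summary table (ID, description, bit score, E-value), then per-hit headers carrying "Identities", "Positives" and "Gaps" counts. A minimal sketch of reading those fields is given here, on the assumption that each summary row and each statistics header is available as a single line of text. The function names (`parse_evalue`, `parse_hit_row`, `parse_alignment_stats`) are hypothetical helpers written for illustration and are not part of BLAST or of any library. The one format quirk handled explicitly is that this BLAST version prints very small E-values without a mantissa (for example `e-155`).

```python
import re

def parse_evalue(text: str) -> float:
    """Convert a legacy BLAST E-value string to float.

    Values such as "e-155" lack a leading mantissa, so "1" is prepended
    before conversion; values such as "2e-99" are converted unchanged.
    """
    text = text.strip()
    if text.startswith("e"):
        text = "1" + text
    return float(text)

# One summary row: <id> <description ...> <bit score> <E-value>
# (assumed layout; the description is matched lazily so that the last
# two whitespace-separated tokens are taken as score and E-value).
HIT_ROW = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)

def parse_hit_row(line: str) -> dict:
    """Split one row of the hit summary table into its fields."""
    match = HIT_ROW.match(line.strip())
    if match is None:
        raise ValueError(f"not a hit summary row: {line!r}")
    return {
        "id": match.group("id"),
        "description": match.group("desc"),
        "bits": float(match.group("bits")),
        "evalue": parse_evalue(match.group("evalue")),
    }

# Matches e.g. "Identities = 243/409" and captures name, count, length.
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+)")

def parse_alignment_stats(header: str) -> dict:
    """Return {name: (count, alignment_length)} from an alignment header."""
    return {name: (int(n), int(d)) for name, n, d in STAT.findall(header)}

if __name__ == "__main__":
    row = parse_hit_row(
        "gb|PNX82129.1| hypothetical protein L195_g038157 [Trifolium prat... 458 e-155"
    )
    print(row["id"], row["bits"], row["evalue"])
    stats = parse_alignment_stats(
        "Identities = 243/409 (59%), Positives = 283/409 (69%), Gaps = 14/409 (3%)"
    )
    for name, (count, length) in stats.items():
        print(name, count, length, round(100 * count / length, 1))
```

Rows that wrap across lines, as some do in this flattened copy, would need to be rejoined before being passed to `parse_hit_row`; the sketch does not attempt that.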