BLASTX nr result
ID: Glycyrrhiza35_contig00003970
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza35_contig00003970
         (1354 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score   E
Sequences producing significant alignments:                         (bits)  Value

XP_004504099.1 PREDICTED: uncharacterized protein LOC101515258 [...   271   9e-85
XP_013446772.1 transmembrane protein, putative [Medicago truncat...   235   4e-71
GAU19129.1 hypothetical protein TSUD_79520 [Trifolium subterraneum]   235   5e-71
XP_003525047.1 PREDICTED: uncharacterized protein LOC100790782 i...   233   4e-70
XP_003531342.1 PREDICTED: uncharacterized protein LOC100809936 i...   227   5e-68
KYP49274.1 hypothetical protein KK1_029015 [Cajanus cajan]            209   4e-61
XP_014505798.1 PREDICTED: uncharacterized protein LOC106765624 [...   208   9e-61
XP_017430673.1 PREDICTED: uncharacterized protein LOC108338336 [...   207   4e-60
XP_015957621.1 PREDICTED: uncharacterized protein LOC107481815 [...   199   6e-57
XP_013446773.1 transmembrane protein, putative [Medicago truncat...   197   6e-57
XP_016190686.1 PREDICTED: uncharacterized protein LOC107631679 [...   199   8e-57
BAT73405.1 hypothetical protein VIGAN_01088400 [Vigna angularis ...   191   1e-53
XP_006580260.1 PREDICTED: uncharacterized protein LOC100790782 i...   185   4e-52
XP_006585264.1 PREDICTED: uncharacterized protein LOC100809936 i...   183   1e-51
XP_019464590.1 PREDICTED: uncharacterized protein LOC109362943 [...   180   6e-50
XP_007159675.1 hypothetical protein PHAVU_002G257700g [Phaseolus...   169   9e-46
EOX92049.1 Uncharacterized protein TCM_001072 isoform 1 [Theobro...   162   8e-43
KHN06564.1 hypothetical protein glysoja_010908 [Glycine soja]         155   2e-42
XP_017969923.1 PREDICTED: uncharacterized protein LOC18611534 [T...
159 2e-41 EOX92050.1 Uncharacterized protein TCM_001072 isoform 2 [Theobro... 158 3e-41 >XP_004504099.1 PREDICTED: uncharacterized protein LOC101515258 [Cicer arietinum] Length = 296 Score = 271 bits (692), Expect = 9e-85 Identities = 160/271 (59%), Positives = 178/271 (65%), Gaps = 1/271 (0%) Frame = +3 Query: 306 LIPLRPLHLSTPILNLR-PSTPHRLTSFTAHADDPFRRRSQSSAPKHVAXXXXXXXXXXX 482 +IPLRPLHL+T ILNL+ P+T HR TS T HAD FRRRSQ S PKHV Sbjct: 33 IIPLRPLHLNTQILNLKLPTTTHRFTSITVHADS-FRRRSQISVPKHVTGDSNFDSFLSF 91 Query: 483 XXXXXERSCLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXXXXXXXXXRRRQ 662 E SCL SS IVSA AVI WKKEL IG+R RRR+ Sbjct: 92 L----ELSCLLSSVIVSASVAVIAVWKKELFVAIGNRVSPWSVLLLVVGVLTGALIRRRK 147 Query: 663 WRETVLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLGIRFRLTRKALKE 842 WR+TV+ GGF S+ V ++R+EKLEED+RSSA VVR SRQLEKLGIRFR+TRK+LKE Sbjct: 148 WRQTVVDGGFPVSE--VNFLQRMEKLEEDLRSSAMVVRVLSRQLEKLGIRFRVTRKSLKE 205 Query: 843 PITETAILAQKNSEAARALAVQSDILEKELGEIXXXXXXXXXXXXXXXXXXXAIAKAGKL 1022 PITETA LAQKNSEAARALA+QSDILEKELGEI AI KAGKL Sbjct: 206 PITETAALAQKNSEAARALAMQSDILEKELGEIQKVLLAMQEQQRKQLDLILAIGKAGKL 265 Query: 1023 WESKLATIEEHDTFEMSNSAADEVIQEVHQI 1115 WESK T EEH T EMSNSAA+EV Q+VHQI Sbjct: 266 WESKRETSEEHGTIEMSNSAANEVKQQVHQI 296 >XP_013446772.1 transmembrane protein, putative [Medicago truncatula] KEH20799.1 transmembrane protein, putative [Medicago truncatula] Length = 290 Score = 235 bits (600), Expect = 4e-71 Identities = 148/273 (54%), Positives = 169/273 (61%), Gaps = 3/273 (1%) Frame = +3 Query: 306 LIPLRPLHLSTPILNLRP-STPHRLTSFTAHADDPFRRRSQSSAPKHVAXXXXXXXXXXX 482 +IP RPLHL+TPILNL+P STPHR TS+T H + F RRSQ S Sbjct: 30 IIPSRPLHLNTPILNLKPLSTPHRFTSYTVHTNS-FPRRSQLSV---------VDSNFDS 79 Query: 483 XXXXXERSCLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXXXXXXXXXRRRQ 662 E S L SS +VSA AV WKK L IG+R RRR+ Sbjct: 80 FLSFLELSALLSSLVVSAAVAVTAIWKKGLYLAIGNRVAPWSLLLLVVGVLTGALIRRRK 139 Query: 663 WRETVLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLGIRFRLTRKALKE 842 WRETVL+G + S+ V ++RIEKLEED++S+ATVVR SRQLEKLGIRFR+TRK 
LKE Sbjct: 140 WRETVLNGVVSVSE--VDFLQRIEKLEEDLKSNATVVRVLSRQLEKLGIRFRVTRKGLKE 197 Query: 843 PITETAILAQKNSEAARALAVQSDILEKELGEIXXXXXXXXXXXXXXXXXXXAIAKAGKL 1022 PITETA LAQKNSEAARALA+QSDILEKELGE+ AIAKAG L Sbjct: 198 PITETASLAQKNSEAARALALQSDILEKELGEVQKVLLAMQEQQQKQLDLILAIAKAGNL 257 Query: 1023 WESKLATIEEHDTFEMSNSAA--DEVIQEVHQI 1115 WESK T EEH T EMSN+AA + V QEV QI Sbjct: 258 WESKRETSEEHGTIEMSNTAANLNVVNQEVRQI 290 >GAU19129.1 hypothetical protein TSUD_79520 [Trifolium subterraneum] Length = 297 Score = 235 bits (600), Expect = 5e-71 Identities = 144/259 (55%), Positives = 163/259 (62%), Gaps = 1/259 (0%) Frame = +3 Query: 315 LRPLHLSTPILNLR-PSTPHRLTSFTAHADDPFRRRSQSSAPKHVAXXXXXXXXXXXXXX 491 LRPLHL+TPILN + ST HRLTSFT HAD FR RSQ S HV Sbjct: 21 LRPLHLNTPILNFKLQSTSHRLTSFTVHADS-FRHRSQVSVSNHVGDSNFDSFLSFL--- 76 Query: 492 XXERSCLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXXXXXXXXXRRRQWRE 671 E SCL SS IVSA AV + WKKEL I +R RRR+WRE Sbjct: 77 --ELSCLLSSVIVSATAAVTSVWKKELFVSISNRIAPWSLLMFVVGVFTGALIRRRKWRE 134 Query: 672 TVLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLGIRFRLTRKALKEPIT 851 TV +G F GS+ V L++RIEKLEED+R+S TVVR SRQLEKLGIRFR+TRK++KEPI+ Sbjct: 135 TV-NGRFTGSE--VNLLQRIEKLEEDLRNSTTVVRVLSRQLEKLGIRFRVTRKSMKEPIS 191 Query: 852 ETAILAQKNSEAARALAVQSDILEKELGEIXXXXXXXXXXXXXXXXXXXAIAKAGKLWES 1031 ETA LAQKNSEAAR LA+QS+ILEKELGE AIAKAGKLW+S Sbjct: 192 ETAALAQKNSEAARTLAMQSEILEKELGETQKVLLAMQVQQQKQLDLILAIAKAGKLWDS 251 Query: 1032 KLATIEEHDTFEMSNSAAD 1088 K T EEH T MSNSAA+ Sbjct: 252 KRETSEEHGTTGMSNSAAN 270 >XP_003525047.1 PREDICTED: uncharacterized protein LOC100790782 isoform X1 [Glycine max] KRH59305.1 hypothetical protein GLYMA_05G176600 [Glycine max] Length = 293 Score = 233 bits (594), Expect = 4e-70 Identities = 149/281 (53%), Positives = 165/281 (58%), Gaps = 5/281 (1%) Frame = +3 Query: 288 NRNCN-----NLIPLRPLHLSTPILNLRPSTPHRLTSFTAHADDPFRRRSQSSAPKHVAX 452 NRNCN +++ RPLHL+T LNL T R S T AD F RS+ A V+ Sbjct: 26 NRNCNLSLSFSIVTSRPLHLTTQNLNL---TAQRFNSLTVRADS-FCLRSEHVAVAGVSN 81 Query: 453 
XXXXXXXXXXXXXXXERSCLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXXX 632 E SCL SSA+ SA AV+ G K ELL IG+R Sbjct: 82 FDSLLSLL-------EFSCLLSSAVASAAAAVVAGSKNELLVGIGTRAAPFGGALLVVGV 134 Query: 633 XXXXXXRRRQWRETVLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLGIR 812 RRRQWR + G G + V L+ERIEKLEED+RSSATVVR SRQLEKLG+R Sbjct: 135 LVGAWIRRRQWRRACVETGKGGLE--VNLLERIEKLEEDMRSSATVVRVLSRQLEKLGVR 192 Query: 813 FRLTRKALKEPITETAILAQKNSEAARALAVQSDILEKELGEIXXXXXXXXXXXXXXXXX 992 FR+TRKALK+PI ETA LAQKNSEAARALAVQSDILEKELGEI Sbjct: 193 FRVTRKALKDPIAETAALAQKNSEAARALAVQSDILEKELGEIQQVLLAMQEQQRKQLDL 252 Query: 993 XXAIAKAGKLWESKLATIEEHDTFEMSNSAADEVIQEVHQI 1115 AI KA KLWESK T E HDT EMSNSA DEV QEVHQI Sbjct: 253 ILAIGKASKLWESKHETSERHDTLEMSNSAEDEVKQEVHQI 293 >XP_003531342.1 PREDICTED: uncharacterized protein LOC100809936 isoform X1 [Glycine max] KRH43154.1 hypothetical protein GLYMA_08G134000 [Glycine max] Length = 287 Score = 227 bits (579), Expect = 5e-68 Identities = 144/281 (51%), Positives = 162/281 (57%), Gaps = 5/281 (1%) Frame = +3 Query: 288 NRNCN-----NLIPLRPLHLSTPILNLRPSTPHRLTSFTAHADDPFRRRSQSSAPKHVAX 452 NRNCN +++ RPLHL+T T HR S T AD FR RS+ +A Sbjct: 26 NRNCNLSLSFSIVTSRPLHLTT-------HTAHRFNSLTVRADS-FRLRSEHAAADS--- 74 Query: 453 XXXXXXXXXXXXXXXERSCLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXXX 632 E SCL SSAI SA AV+ G K EL+A IG+R Sbjct: 75 ------NFDSLLSLLEFSCLLSSAISSAAAAVLAGSKNELIAGIGARAAPFGGALLVVGV 128 Query: 633 XXXXXXRRRQWRETVLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLGIR 812 RRRQWR + G G + V L+ERIEKLEED+RSSATVVR SRQLEKLG+R Sbjct: 129 LVGAWIRRRQWRRVSVEAGKGGLE--VNLLERIEKLEEDLRSSATVVRVLSRQLEKLGVR 186 Query: 813 FRLTRKALKEPITETAILAQKNSEAARALAVQSDILEKELGEIXXXXXXXXXXXXXXXXX 992 FR+TRK LK+PI ETA LAQKNSEAARALAVQSDILEKELGEI Sbjct: 187 FRVTRKGLKDPIAETAALAQKNSEAARALAVQSDILEKELGEIQQVLLAMQEQQRKQLDL 246 Query: 993 XXAIAKAGKLWESKLATIEEHDTFEMSNSAADEVIQEVHQI 1115 A+ KA KLWESK T E HDT E+SNSA D V QEVHQI Sbjct: 247 ILAVGKASKLWESKQETNERHDTLELSNSAEDGVKQEVHQI 287 >KYP49274.1 hypothetical protein KK1_029015 [Cajanus cajan] Length 
= 287 Score = 209 bits (533), Expect = 4e-61 Identities = 140/283 (49%), Positives = 156/283 (55%), Gaps = 7/283 (2%) Frame = +3 Query: 288 NRNCNNLIPL-------RPLHLSTPILNLRPSTPHRLTSFTAHADDPFRRRSQSSAPKHV 446 NR CN + RPLHL+ HR S T AD F SQ +HV Sbjct: 24 NRICNLSLSFSLATSTSRPLHLTL--------AAHRFNSLTVRADS-FHLPSQ----QHV 70 Query: 447 AXXXXXXXXXXXXXXXXERSCLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXX 626 A E SCL SSA+VSA KK+LLA I +R Sbjct: 71 AADSNFDSLLSLL----EISCLLSSALVSAAALAFAASKKDLLAGIAARGAPLGVAMLVF 126 Query: 627 XXXXXXXXRRRQWRETVLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLG 806 RRRQWR + G G++ V L+ERIEKLEED+RSSATVVR SRQLEKLG Sbjct: 127 GVSVGAWIRRRQWRRVCVETGRGGTE--VNLLERIEKLEEDLRSSATVVRVLSRQLEKLG 184 Query: 807 IRFRLTRKALKEPITETAILAQKNSEAARALAVQSDILEKELGEIXXXXXXXXXXXXXXX 986 +RFR+TRK LK+PI ETA LAQKNSEAARALAVQSDILEKELGEI Sbjct: 185 VRFRVTRKGLKDPIAETAALAQKNSEAARALAVQSDILEKELGEIQQVLLAMQEQQRKQL 244 Query: 987 XXXXAIAKAGKLWESKLATIEEHDTFEMSNSAADEVIQEVHQI 1115 AI KAGKLWESK T E+ DT EMSNSA DEV +VHQI Sbjct: 245 DLILAIGKAGKLWESKHETSEQQDTLEMSNSAEDEVKSKVHQI 287 >XP_014505798.1 PREDICTED: uncharacterized protein LOC106765624 [Vigna radiata var. 
radiata] Length = 283 Score = 208 bits (530), Expect = 9e-61 Identities = 138/281 (49%), Positives = 158/281 (56%), Gaps = 5/281 (1%) Frame = +3 Query: 288 NRNCN-----NLIPLRPLHLSTPILNLRPSTPHRLTSFTAHADDPFRRRSQSSAPKHVAX 452 NRN N ++I RPL+L+TP L + T HR S T AD F RSQ HV Sbjct: 18 NRNYNLSLSLSIITSRPLYLTTPNLKV---TAHRFNSLTVSADS-FPLRSQ-----HVTA 68 Query: 453 XXXXXXXXXXXXXXXERSCLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXXX 632 E SCL SSA+ S+ V+ K +LLA IG+R Sbjct: 69 DSNFDSLLSFL----EFSCLLSSAVASSAATVVAASKNDLLAGIGTRAAPFGVTMLVIGV 124 Query: 633 XXXXXXRRRQWRETVLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLGIR 812 RRRQWR + G G + V ++RIEKLEED++SS TVVR SRQLEKLGIR Sbjct: 125 LIGVWIRRRQWRRVCVENGKGGLE--VNFLQRIEKLEEDLKSSLTVVRVLSRQLEKLGIR 182 Query: 813 FRLTRKALKEPITETAILAQKNSEAARALAVQSDILEKELGEIXXXXXXXXXXXXXXXXX 992 FR+TRKALK+PI ETA LAQKNSEAARALAVQSDILE+ELGEI Sbjct: 183 FRVTRKALKDPIAETAALAQKNSEAARALAVQSDILEQELGEIQQVLLAMQEQQRKQLDL 242 Query: 993 XXAIAKAGKLWESKLATIEEHDTFEMSNSAADEVIQEVHQI 1115 AI KAGKLWESK + DT EMSNSA V QEVHQI Sbjct: 243 ILAIGKAGKLWESKPEISDRQDTLEMSNSAEGVVKQEVHQI 283 >XP_017430673.1 PREDICTED: uncharacterized protein LOC108338336 [Vigna angularis] XP_017430758.1 PREDICTED: uncharacterized protein LOC108338336 [Vigna angularis] KOM30729.1 hypothetical protein LR48_Vigan01g028400 [Vigna angularis] Length = 283 Score = 207 bits (526), Expect = 4e-60 Identities = 138/281 (49%), Positives = 157/281 (55%), Gaps = 5/281 (1%) Frame = +3 Query: 288 NRNCN-----NLIPLRPLHLSTPILNLRPSTPHRLTSFTAHADDPFRRRSQSSAPKHVAX 452 NRN N ++I RPL+L+TP L L T HR S T AD F R Q HV Sbjct: 18 NRNHNLSLALSIITSRPLYLTTPNLKL---TAHRFNSLTVSADS-FPLRFQ-----HVTA 68 Query: 453 XXXXXXXXXXXXXXXERSCLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXXX 632 E SCL SSA+ S+ V+ K +LLA IG+R Sbjct: 69 DSNFDSLLSFL----EFSCLLSSAVASSAATVVAASKNDLLAGIGTRAAPFGVTMLVIGV 124 Query: 633 XXXXXXRRRQWRETVLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLGIR 812 RRRQWR + G G + V ++RIEKLEED++SS TVVR SRQLEKLGIR Sbjct: 125 
LIGVWIRRRQWRRVCVENGKGGLE--VNFLQRIEKLEEDLKSSLTVVRVLSRQLEKLGIR 182 Query: 813 FRLTRKALKEPITETAILAQKNSEAARALAVQSDILEKELGEIXXXXXXXXXXXXXXXXX 992 FR+TRKALK+PI ETA LAQKNSEAARALAVQSDILE+ELGEI Sbjct: 183 FRVTRKALKDPIAETAALAQKNSEAARALAVQSDILEQELGEIQHVLLAMQEQQRKQLDL 242 Query: 993 XXAIAKAGKLWESKLATIEEHDTFEMSNSAADEVIQEVHQI 1115 AI KAGKLWESK + DT EMSNSA V QEVHQI Sbjct: 243 ILAIGKAGKLWESKPEISDRQDTLEMSNSAEGVVKQEVHQI 283 >XP_015957621.1 PREDICTED: uncharacterized protein LOC107481815 [Arachis duranensis] Length = 302 Score = 199 bits (506), Expect = 6e-57 Identities = 138/286 (48%), Positives = 159/286 (55%), Gaps = 12/286 (4%) Frame = +3 Query: 294 NCN---NLIPLRPLHLSTPILNLRPSTPHRLTSFT--AHADDPFRRR---SQSSAPKHVA 449 +CN ++I LR H PIL+L PS P + + FT A D F S + PKH A Sbjct: 26 SCNLSRSIISLRSRH-HFPILHLNPSLPFKPSHFTFLATRADAFNLAAYDSDGALPKHAA 84 Query: 450 XXXXXXXXXXXXXXXXERSCLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXX 629 E SCL SSA S AV+ G KKELLA IGS+ Sbjct: 85 GAGGFDFDYFLSLI--EFSCLLSSAFASVCVAVVAGLKKELLAAIGSKAAVWGTLALVFG 142 Query: 630 XXXXXXXRRRQWR----ETVLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLE 797 RRRQWR ETV G V L++RIEKLEED+RS++ + R SR+LE Sbjct: 143 VLSGAWIRRRQWRRVCRETVKDG------LEVNLLQRIEKLEEDLRSTSAISRVLSRELE 196 Query: 798 KLGIRFRLTRKALKEPITETAILAQKNSEAARALAVQSDILEKELGEIXXXXXXXXXXXX 977 KL IRFR+TRK LKEP+TETA+LAQKNSEAARALAVQSDILEKE+ EI Sbjct: 197 KLAIRFRVTRKTLKEPVTETAVLAQKNSEAARALAVQSDILEKEMVEIQQVLLAMQEQQR 256 Query: 978 XXXXXXXAIAKAGKLWESKLATIEEHDTFEMSNSAADEVIQEVHQI 1115 AI K GKL ESKL T EE+DT E SNS DEV QEVH I Sbjct: 257 RQLDLILAIGKTGKLQESKLETREENDTLETSNSVDDEVKQEVHPI 302 >XP_013446773.1 transmembrane protein, putative [Medicago truncatula] KEH20800.1 transmembrane protein, putative [Medicago truncatula] Length = 257 Score = 197 bits (502), Expect = 6e-57 Identities = 120/213 (56%), Positives = 139/213 (65%), Gaps = 1/213 (0%) Frame = +3 Query: 306 LIPLRPLHLSTPILNLRP-STPHRLTSFTAHADDPFRRRSQSSAPKHVAXXXXXXXXXXX 482 +IP RPLHL+TPILNL+P STPHR TS+T H + F RRSQ S Sbjct: 30 
IIPSRPLHLNTPILNLKPLSTPHRFTSYTVHTNS-FPRRSQLSV---------VDSNFDS 79 Query: 483 XXXXXERSCLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXXXXXXXXXRRRQ 662 E S L SS +VSA AV WKK L IG+R RRR+ Sbjct: 80 FLSFLELSALLSSLVVSAAVAVTAIWKKGLYLAIGNRVAPWSLLLLVVGVLTGALIRRRK 139 Query: 663 WRETVLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLGIRFRLTRKALKE 842 WRETVL+G + S+ V ++RIEKLEED++S+ATVVR SRQLEKLGIRFR+TRK LKE Sbjct: 140 WRETVLNGVVSVSE--VDFLQRIEKLEEDLKSNATVVRVLSRQLEKLGIRFRVTRKGLKE 197 Query: 843 PITETAILAQKNSEAARALAVQSDILEKELGEI 941 PITETA LAQKNSEAARALA+QSDILEKELGE+ Sbjct: 198 PITETASLAQKNSEAARALALQSDILEKELGEV 230 >XP_016190686.1 PREDICTED: uncharacterized protein LOC107631679 [Arachis ipaensis] Length = 302 Score = 199 bits (505), Expect = 8e-57 Identities = 139/286 (48%), Positives = 158/286 (55%), Gaps = 12/286 (4%) Frame = +3 Query: 294 NCN---NLIPLRPLHLSTPILNLRPSTPHRLTSFT--AHADDPFRRR---SQSSAPKHVA 449 +CN ++I LR H PIL+L PS P + + FT A D F S + PKH A Sbjct: 26 SCNLSRSIISLRSRH-HFPILHLNPSLPFKPSHFTFLATRADAFNLAAYDSDGALPKHAA 84 Query: 450 XXXXXXXXXXXXXXXXERSCLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXX 629 E SCL SSA S AV+ G KKELLA IGS+ Sbjct: 85 GAGGFDFDDFLSLI--EFSCLLSSAFASVCVAVVAGLKKELLAAIGSKAAVWGTLALVFG 142 Query: 630 XXXXXXXRRRQWR----ETVLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLE 797 RRRQWR ETV G V L++RIEKLEED RS++ + R SR+LE Sbjct: 143 VLSGAWIRRRQWRRVCRETVKDG------LEVNLLQRIEKLEEDFRSTSAISRVLSRELE 196 Query: 798 KLGIRFRLTRKALKEPITETAILAQKNSEAARALAVQSDILEKELGEIXXXXXXXXXXXX 977 KL IRFR+TRK LKEPITETA+LAQKNSEAARALAVQSDILEKE+ EI Sbjct: 197 KLAIRFRVTRKTLKEPITETAVLAQKNSEAARALAVQSDILEKEMVEIQQVLLAMQEQQR 256 Query: 978 XXXXXXXAIAKAGKLWESKLATIEEHDTFEMSNSAADEVIQEVHQI 1115 AI K GKL ESKL T EE+DT E SNS DEV QEVH I Sbjct: 257 RQLGLILAIGKTGKLQESKLETREENDTLETSNSVDDEVKQEVHPI 302 >BAT73405.1 hypothetical protein VIGAN_01088400 [Vigna angularis var. 
angularis] Length = 314 Score = 191 bits (484), Expect = 1e-53 Identities = 138/312 (44%), Positives = 157/312 (50%), Gaps = 36/312 (11%) Frame = +3 Query: 288 NRNCN-----NLIPLRPLHLSTPILNLRPSTPHRLTSFTAHADDPFRRRSQSSAPKHVAX 452 NRN N ++I RPL+L+TP L L T HR S T AD F R Q HV Sbjct: 18 NRNHNLSLALSIITSRPLYLTTPNLKL---TAHRFNSLTVSADS-FPLRFQ-----HVTA 68 Query: 453 XXXXXXXXXXXXXXXERSCLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXXX 632 E SCL SSA+ S+ V+ K +LLA IG+R Sbjct: 69 DSNFDSLLSFL----EFSCLLSSAVASSAATVVAASKNDLLAGIGTRAAPFGVTMLVIGV 124 Query: 633 XXXXXXRRRQWRETVLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLGIR 812 RRRQWR + G G + V ++RIEKLEED++SS TVVR SRQLEKLGIR Sbjct: 125 LIGVWIRRRQWRRVCVENGKGGLE--VNFLQRIEKLEEDLKSSLTVVRVLSRQLEKLGIR 182 Query: 813 FRLTRKALKEPITE-------------------------------TAILAQKNSEAARAL 899 FR+TRKALK+PI E TA LAQKNSEAARAL Sbjct: 183 FRVTRKALKDPIAELPLQLLSTVSVLHCHPPRRSHLQHIHQYPLMTAALAQKNSEAARAL 242 Query: 900 AVQSDILEKELGEIXXXXXXXXXXXXXXXXXXXAIAKAGKLWESKLATIEEHDTFEMSNS 1079 AVQSDILE+ELGEI AI KAGKLWESK + DT EMSNS Sbjct: 243 AVQSDILEQELGEIQHVLLAMQEQQRKQLDLILAIGKAGKLWESKPEISDRQDTLEMSNS 302 Query: 1080 AADEVIQEVHQI 1115 A V QEVHQI Sbjct: 303 AEGVVKQEVHQI 314 >XP_006580260.1 PREDICTED: uncharacterized protein LOC100790782 isoform X2 [Glycine max] Length = 254 Score = 185 bits (469), Expect = 4e-52 Identities = 119/223 (53%), Positives = 135/223 (60%), Gaps = 5/223 (2%) Frame = +3 Query: 288 NRNCN-----NLIPLRPLHLSTPILNLRPSTPHRLTSFTAHADDPFRRRSQSSAPKHVAX 452 NRNCN +++ RPLHL+T LNL T R S T AD F RS+ A V+ Sbjct: 26 NRNCNLSLSFSIVTSRPLHLTTQNLNL---TAQRFNSLTVRADS-FCLRSEHVAVAGVSN 81 Query: 453 XXXXXXXXXXXXXXXERSCLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXXX 632 E SCL SSA+ SA AV+ G K ELL IG+R Sbjct: 82 FDSLLSLL-------EFSCLLSSAVASAAAAVVAGSKNELLVGIGTRAAPFGGALLVVGV 134 Query: 633 XXXXXXRRRQWRETVLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLGIR 812 RRRQWR + G G + V L+ERIEKLEED+RSSATVVR SRQLEKLG+R Sbjct: 135 LVGAWIRRRQWRRACVETGKGGLE--VNLLERIEKLEEDMRSSATVVRVLSRQLEKLGVR 192 Query: 813 
FRLTRKALKEPITETAILAQKNSEAARALAVQSDILEKELGEI 941 FR+TRKALK+PI ETA LAQKNSEAARALAVQSDILEKELGEI Sbjct: 193 FRVTRKALKDPIAETAALAQKNSEAARALAVQSDILEKELGEI 235 >XP_006585264.1 PREDICTED: uncharacterized protein LOC100809936 isoform X2 [Glycine max] KRH43153.1 hypothetical protein GLYMA_08G134000 [Glycine max] Length = 248 Score = 183 bits (465), Expect = 1e-51 Identities = 117/223 (52%), Positives = 133/223 (59%), Gaps = 5/223 (2%) Frame = +3 Query: 288 NRNCN-----NLIPLRPLHLSTPILNLRPSTPHRLTSFTAHADDPFRRRSQSSAPKHVAX 452 NRNCN +++ RPLHL+T T HR S T AD FR RS+ +A Sbjct: 26 NRNCNLSLSFSIVTSRPLHLTT-------HTAHRFNSLTVRADS-FRLRSEHAAADS--- 74 Query: 453 XXXXXXXXXXXXXXXERSCLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXXX 632 E SCL SSAI SA AV+ G K EL+A IG+R Sbjct: 75 ------NFDSLLSLLEFSCLLSSAISSAAAAVLAGSKNELIAGIGARAAPFGGALLVVGV 128 Query: 633 XXXXXXRRRQWRETVLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLGIR 812 RRRQWR + G G + V L+ERIEKLEED+RSSATVVR SRQLEKLG+R Sbjct: 129 LVGAWIRRRQWRRVSVEAGKGGLE--VNLLERIEKLEEDLRSSATVVRVLSRQLEKLGVR 186 Query: 813 FRLTRKALKEPITETAILAQKNSEAARALAVQSDILEKELGEI 941 FR+TRK LK+PI ETA LAQKNSEAARALAVQSDILEKELGEI Sbjct: 187 FRVTRKGLKDPIAETAALAQKNSEAARALAVQSDILEKELGEI 229 >XP_019464590.1 PREDICTED: uncharacterized protein LOC109362943 [Lupinus angustifolius] OIW00401.1 hypothetical protein TanjilG_05751 [Lupinus angustifolius] Length = 283 Score = 180 bits (457), Expect = 6e-50 Identities = 131/268 (48%), Positives = 149/268 (55%), Gaps = 6/268 (2%) Frame = +3 Query: 339 PILNLRPSTPHRLTSFTAHADDPFRRRSQSSAPKHVAXXXXXXXXXXXXXXXXERSCLFS 518 PILNL+P H+ SF AHA + P H E + L S Sbjct: 31 PILNLKPFRSHKFASFRAHAHS-----IEPIVPNH--DVSAGDFNFDSLLSLLEVTSLLS 83 Query: 519 SAIVSAGGAVITGWKKE-LLAEIGSRXXXXXXXXXXXXXXXXXXX-RRRQW----RETVL 680 S I++ AV T K+E LLA IG++ RRRQW RETV Sbjct: 84 STILTVAFAVNTVIKREILLAAIGNKSLLPLGVLLMVFGVLIGVWIRRRQWKRVCRETVK 143 Query: 681 SGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLGIRFRLTRKALKEPITETA 860 G V L+ERIEKLEED+RS+ T+VR SRQLEKLGIRFR+TRK+LKEPITETA Sbjct: 144 
DG------LEVNLLERIEKLEEDLRSAVTIVRVLSRQLEKLGIRFRVTRKSLKEPITETA 197 Query: 861 ILAQKNSEAARALAVQSDILEKELGEIXXXXXXXXXXXXXXXXXXXAIAKAGKLWESKLA 1040 LAQKNSEAARALAVQS+ILEKELGEI AI K GKL E+K Sbjct: 198 ALAQKNSEAARALAVQSEILEKELGEIQQVLLAMQEQQQKQLDLILAIVKNGKLGENKRK 257 Query: 1041 TIEEHDTFEMSNSAADEVIQEVHQI*SL 1124 T E+ E SNSAADEV QEVHQI SL Sbjct: 258 TSEK---LETSNSAADEVNQEVHQIRSL 282 >XP_007159675.1 hypothetical protein PHAVU_002G257700g [Phaseolus vulgaris] ESW31669.1 hypothetical protein PHAVU_002G257700g [Phaseolus vulgaris] Length = 267 Score = 169 bits (427), Expect = 9e-46 Identities = 118/259 (45%), Positives = 135/259 (52%), Gaps = 5/259 (1%) Frame = +3 Query: 288 NRNCN-----NLIPLRPLHLSTPILNLRPSTPHRLTSFTAHADDPFRRRSQSSAPKHVAX 452 NRN N +I LHL+TP LN T HR S T A+ F SQ HV Sbjct: 23 NRNNNLSLSLPIITSLSLHLATPNLN---HTAHRFNSLTVRAES-FPLLSQ-----HVTA 73 Query: 453 XXXXXXXXXXXXXXXERSCLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXXX 632 E SCL SSA+ S+ V+ K ELLA IG+ Sbjct: 74 DSKFDSLLSFV----EFSCLLSSAVASSAATVVAASKNELLARIGTIAAPFGLAMLVIGV 129 Query: 633 XXXXXXRRRQWRETVLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLGIR 812 RRR+WR + G G + V ++RIEKLEED+RSS TVVR SRQLEKLGIR Sbjct: 130 SVGVWIRRRRWRRVCVENGKGGLE--VNFLQRIEKLEEDLRSSLTVVRVLSRQLEKLGIR 187 Query: 813 FRLTRKALKEPITETAILAQKNSEAARALAVQSDILEKELGEIXXXXXXXXXXXXXXXXX 992 FR+TRK LK+PI ETA LA+KNSEA RALAVQSDILEKE+GEI Sbjct: 188 FRVTRKTLKDPIAETATLAKKNSEATRALAVQSDILEKEVGEIQKVLLAMQEQQQKQLDL 247 Query: 993 XXAIAKAGKLWESKLATIE 1049 I KA KLWESK T E Sbjct: 248 ILTIGKASKLWESKHETRE 266 >EOX92049.1 Uncharacterized protein TCM_001072 isoform 1 [Theobroma cacao] Length = 316 Score = 162 bits (411), Expect = 8e-43 Identities = 105/211 (49%), Positives = 122/211 (57%), Gaps = 4/211 (1%) Frame = +3 Query: 507 CLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXXXXXXXXXRRRQWR----ET 674 C+ SSA+VS GAV +GWK +L I R RRRQWR ET Sbjct: 100 CILSSAVVSVVGAV-SGWKGVILGGIWRRVMVWGIVGLVSGVAIGAWIRRRQWRRICAET 158 Query: 675 VLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLGIRFRLTRKALKEPITE 854 V GG + + L+ 
RIEKLEED+RS AT+ RA SRQLEKLGIRFR+TRKALKEPI E Sbjct: 159 VKGGGGGKN---LNLIGRIEKLEEDLRSYATITRALSRQLEKLGIRFRVTRKALKEPIAE 215 Query: 855 TAILAQKNSEAARALAVQSDILEKELGEIXXXXXXXXXXXXXXXXXXXAIAKAGKLWESK 1034 TA LAQKNSEA RALAVQ DILEKELGEI AI K+GKL+E K Sbjct: 216 TAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQGKQLELILAIGKSGKLFEDK 275 Query: 1035 LATIEEHDTFEMSNSAADEVIQEVHQI*SLG 1127 +E +T E N + E++Q LG Sbjct: 276 REPSQEKNTVEACNLTEEVNQMEINQTQPLG 306 >KHN06564.1 hypothetical protein glysoja_010908 [Glycine soja] Length = 122 Score = 155 bits (392), Expect = 2e-42 Identities = 86/122 (70%), Positives = 90/122 (73%) Frame = +3 Query: 750 VRSSATVVRAFSRQLEKLGIRFRLTRKALKEPITETAILAQKNSEAARALAVQSDILEKE 929 +RSSATVVR SRQLEKLG+RFR+TRKALK+PI ETA LAQKNSEAARALAVQSDILEKE Sbjct: 1 MRSSATVVRVLSRQLEKLGVRFRVTRKALKDPIAETAALAQKNSEAARALAVQSDILEKE 60 Query: 930 LGEIXXXXXXXXXXXXXXXXXXXAIAKAGKLWESKLATIEEHDTFEMSNSAADEVIQEVH 1109 LGEI AI KA KLWESK T E HDT EMSNSA DEV QEVH Sbjct: 61 LGEIQQVLLAMQEQQRKQLDLILAIGKASKLWESKHETSERHDTLEMSNSAEDEVKQEVH 120 Query: 1110 QI 1115 QI Sbjct: 121 QI 122 >XP_017969923.1 PREDICTED: uncharacterized protein LOC18611534 [Theobroma cacao] Length = 316 Score = 159 bits (402), Expect = 2e-41 Identities = 104/211 (49%), Positives = 121/211 (57%), Gaps = 4/211 (1%) Frame = +3 Query: 507 CLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXXXXXXXXXRRRQWR----ET 674 C+ SSA+VS AV +GWK +L I R RRRQWR ET Sbjct: 100 CILSSAVVSVVCAV-SGWKGVILGGIWRRVMVWGIVGLVSAVAIGAWIRRRQWRRICAET 158 Query: 675 VLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLGIRFRLTRKALKEPITE 854 V GG + + L+ RIEKLEED+RS AT+ RA SRQLEKLGIRFR+TRKALKEPI E Sbjct: 159 VKGGGGGKN---LNLIGRIEKLEEDLRSYATITRALSRQLEKLGIRFRVTRKALKEPIAE 215 Query: 855 TAILAQKNSEAARALAVQSDILEKELGEIXXXXXXXXXXXXXXXXXXXAIAKAGKLWESK 1034 TA LAQKNSEA RALAVQ DILEKELGEI AI K+GKL+E K Sbjct: 216 TAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQGKQLELILAIGKSGKLFEDK 275 Query: 1035 LATIEEHDTFEMSNSAADEVIQEVHQI*SLG 1127 +E +T E N + E++Q LG Sbjct: 276 REPSQEKNTVEACNLTEEVNQMEINQTQPLG 306 >EOX92050.1 
Uncharacterized protein TCM_001072 isoform 2 [Theobroma cacao] Length = 313 Score = 158 bits (400), Expect = 3e-41 Identities = 105/211 (49%), Positives = 122/211 (57%), Gaps = 4/211 (1%) Frame = +3 Query: 507 CLFSSAIVSAGGAVITGWKKELLAEIGSRXXXXXXXXXXXXXXXXXXXRRRQWR----ET 674 C+ SSA+VS GAV +GWK +L I R RRRQWR ET Sbjct: 100 CILSSAVVSVVGAV-SGWKGVILGGIWRRVMVWGIVGLVSGVAIGAWIRRRQWRRICAET 158 Query: 675 VLSGGFAGSDSVVYLMERIEKLEEDVRSSATVVRAFSRQLEKLGIRFRLTRKALKEPITE 854 V GG + + L+ RIEKLEED+RS AT+ RA SRQLEKLGIRFR+TRKALKEPI E Sbjct: 159 VKGGGGGKN---LNLIGRIEKLEEDLRSYATITRALSRQLEKLGIRFRVTRKALKEPIAE 215 Query: 855 TAILAQKNSEAARALAVQSDILEKELGEIXXXXXXXXXXXXXXXXXXXAIAKAGKLWESK 1034 TA LAQKNSEA RALAVQ DILEKELGEI AI K+GKL+E K Sbjct: 216 TAALAQKNSEATRALAVQEDILEKELGEI---QKVLLAMQGKQLELILAIGKSGKLFEDK 272 Query: 1035 LATIEEHDTFEMSNSAADEVIQEVHQI*SLG 1127 +E +T E N + E++Q LG Sbjct: 273 REPSQEKNTVEACNLTEEVNQMEINQTQPLG 303