BLASTX nr result

ID: Glycyrrhiza32_contig00031227 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza32_contig00031227
         (1467 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_004510669.1 PREDICTED: uncharacterized protein LOC101494537 [...   479   e-154
XP_007135264.1 hypothetical protein PHAVU_010G114600g [Phaseolus...   448   e-142
XP_014492515.1 PREDICTED: uncharacterized protein LOC106754957 [...   449   e-141
XP_017405818.1 PREDICTED: uncharacterized protein LOC108319257 [...   431   e-134
XP_013444623.1 embryo defective 1703 protein, putative [Medicago...   425   e-134
OIW04587.1 hypothetical protein TanjilG_18064 [Lupinus angustifo...   424   e-133
XP_019456207.1 PREDICTED: uncharacterized protein LOC109356989 [...   424   e-133
XP_003548415.1 PREDICTED: uncharacterized protein LOC100796285 [...   417   e-130
XP_016181516.1 PREDICTED: uncharacterized protein LOC107623680 [...   385   e-118
XP_015937679.1 PREDICTED: uncharacterized protein LOC107463403 [...   381   e-117
KRH47901.1 hypothetical protein GLYMA_07G055400 [Glycine max]         319   1e-95
GAU43060.1 hypothetical protein TSUD_350060 [Trifolium subterran...   313   6e-93
KHN06315.1 hypothetical protein glysoja_021153 [Glycine soja]         269   9e-84
OAY62160.1 hypothetical protein MANES_01G246200 [Manihot esculenta]   254   6e-71
XP_017975378.1 PREDICTED: uncharacterized protein LOC18613973 is...   254   7e-71
XP_017975372.1 PREDICTED: uncharacterized protein LOC18613973 is...   254   8e-71
XP_007051543.2 PREDICTED: uncharacterized protein LOC18613973 is...   254   8e-71
XP_004306670.1 PREDICTED: uncharacterized protein LOC101313638 [...   250   2e-69
XP_008233144.2 PREDICTED: uncharacterized protein LOC103332203 [...   248   1e-68
EOX95699.1 Embryo defective 1703, putative isoform 2 [Theobroma ...   248   1e-68

>XP_004510669.1 PREDICTED: uncharacterized protein LOC101494537 [Cicer arietinum]
          Length = 1203

 Score =  479 bits (1232), Expect = e-154
 Identities = 268/444 (60%), Positives = 316/444 (71%), Gaps = 13/444 (2%)
 Frame = +3

Query: 171  MDILNASTPKAIPFPTFCHPKSLNPKTPS---NSPFHRNLFPLYLTTSTARKFQTWAHFG 341
            MD LN S+ K I FP FC PK+LN K  S   N+PFH N FP YLT+ST+RKFQT+AHF 
Sbjct: 1    MDTLNVSSFKTIAFPFFCKPKTLNSKNISSNHNTPFHINPFPFYLTSSTSRKFQTFAHFR 60

Query: 342  RPTTXXXXXXXXXXXXHDHQVRPKHTSTDPSPVSGNGVEV-SDAGFQRVSVDD------V 500
            RP              +DHQV   H   DPS VS N VE  SD  FQRVS DD      V
Sbjct: 61   RPINRRNSLRNKLL--NDHQVTLIHIPNDPSSVSSNFVEKNSDVNFQRVSFDDDDDDNIV 118

Query: 501  EVEKPKPKLLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVFTVYQDSFGGVKRVSVDEEE 680
            E+E+ K KLLG+SVLLNKLE+WVD+Y KDIEYWGIG+ P+FTVY+DSFGGVKRV VDE+E
Sbjct: 119  ELEEEKSKLLGDSVLLNKLENWVDEYRKDIEYWGIGSNPIFTVYEDSFGGVKRVFVDEQE 178

Query: 681  ILRRSRVQRD--EIEEFPEVKDKIQDANNLAREMENGNNVISRNSSVAKFVVQGEEEGGF 854
            ILRR RVQR+  EIE   EVK KI DA  LARE+E+GNNVI+RNSSVAKFVVQGEEEGGF
Sbjct: 179  ILRRDRVQREGNEIEGLSEVKYKILDAKKLAREVESGNNVIARNSSVAKFVVQGEEEGGF 238

Query: 855  VKAASGFVAQPGLLPKLSGVGSRELCVLVVLWAVKELFSFGVKEAQYXXXXXXXXXXXXX 1034
            ++A  GFV QP L+PKL GVGS  LCVLV+L+AVK+LF FG K+ QY             
Sbjct: 239  IQAVRGFVVQPWLVPKLFGVGSTVLCVLVLLFAVKKLFRFGDKDVQYTEMEKKMMMRKVK 298

Query: 1035 XXXXXXXLVKGAVEVIPEPLETPVIDIKKPKLDMEQLKNNILKVKASADK-PVVVENSPA 1211
                   L+KGAVEVI E +ET VI +KKPKLD EQLKNNILK KAS+D   +VV+NS  
Sbjct: 299  ARKEKEVLMKGAVEVIHERVETSVIGVKKPKLDKEQLKNNILKAKASSDSDKLVVQNSFD 358

Query: 1212 EVRTGYMDVDYKVQXXXXXXXXXXXXXGRDHSLVSRDMEMDEPVIEKSSNETKVIKKDSE 1391
            EVR G MD+DYKV+             GRD S+VS+DMEMDEPVIEKSSNE++VIKK+S+
Sbjct: 359  EVRNGSMDMDYKVREIREMARRAREIEGRDGSVVSKDMEMDEPVIEKSSNESEVIKKNSK 418

Query: 1392 QDNSLSNHQSKFSRKTTGSNAILQ 1463
            QDN+L NHQ++ +R+TT ++ I Q
Sbjct: 419  QDNNLCNHQNEVARETTDTSGIWQ 442


>XP_007135264.1 hypothetical protein PHAVU_010G114600g [Phaseolus vulgaris]
            ESW07258.1 hypothetical protein PHAVU_010G114600g
            [Phaseolus vulgaris]
          Length = 1287

 Score =  448 bits (1152), Expect = e-142
 Identities = 246/432 (56%), Positives = 299/432 (69%), Gaps = 9/432 (2%)
 Frame = +3

Query: 171  MDILNASTPKAIPFPTFCHPKSLNPKTPSN-----SPFHRNLFPLYLTTSTARKFQTWAH 335
            MDIL  S P     P+FCHPK+LN K   N     SPF R  FPLYL+ STA KFQTWAH
Sbjct: 1    MDILRISNPTNFSVPSFCHPKTLNRKFSPNYDKPTSPFRRTPFPLYLSRSTAVKFQTWAH 60

Query: 336  FGRPTTXXXXXXXXXXXXHDHQVRPKHTSTDPSPVSGNGVEVSDAGFQRVSVDD--VEVE 509
             GRPT              DH+V P     DP  VSGNGVE S  G Q VSV D  VE E
Sbjct: 61   SGRPTKRRNSLRKKIL--RDHKVIPNQIPNDPLSVSGNGVEESGVGVQGVSVVDSVVEAE 118

Query: 510  KPKPKLLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVFTVYQDSFGGVKRVSVDEEEILR 689
            K K KLLGESVL NK E WVDQY +DIEYWG+G+GPVFT+Y+DS GGVKRV VDEEEIL+
Sbjct: 119  KTKSKLLGESVLWNKFESWVDQYKRDIEYWGVGSGPVFTIYEDSLGGVKRVFVDEEEILK 178

Query: 690  RSRVQRDEIEEFPEVKDKIQDANNLAREMENGNNVISRNSSVAKFVVQGEEEGGFVKAAS 869
            RS+V+RD I +FPEV+ KI +A N+AREME+GNNVI+RNSSVAKFVVQG+EEGGFVKA  
Sbjct: 179  RSKVRRDVIGDFPEVRSKILNAKNMAREMESGNNVIARNSSVAKFVVQGKEEGGFVKAVQ 238

Query: 870  GFVAQPGLLPKLSGVGSRELCVLVVLWAVKELFSFGV--KEAQYXXXXXXXXXXXXXXXX 1043
            GFVA+P LLP+LS VG   L  LVV+W VK+LF+FG   KE +Y                
Sbjct: 239  GFVAKPQLLPRLSRVGRYVLYGLVVMWGVKKLFAFGEGDKEVEYTAREKEMMRRKMKARK 298

Query: 1044 XXXXLVKGAVEVIPEPLETPVIDIKKPKLDMEQLKNNILKVKASADKPVVVENSPAEVRT 1223
                LVKGAVEVI EP ET ++DIK+PKLD EQL++NILK K S+DK +VV +S  +++T
Sbjct: 299  EKEKLVKGAVEVIVEPSETLMVDIKRPKLDKEQLRSNILKAKGSSDK-LVVRDSSDKIKT 357

Query: 1224 GYMDVDYKVQXXXXXXXXXXXXXGRDHSLVSRDMEMDEPVIEKSSNETKVIKKDSEQDNS 1403
              M+VDYKVQ             GRD  +V++D+EMD+ VI+KSS++ + IKK SEQD+S
Sbjct: 358  ISMEVDYKVQEIKEMARQAREIEGRDSVVVNKDLEMDDSVIKKSSDDNEFIKKKSEQDDS 417

Query: 1404 LSNHQSKFSRKT 1439
            LS++Q++ +R+T
Sbjct: 418  LSDNQNEIARET 429


>XP_014492515.1 PREDICTED: uncharacterized protein LOC106754957 [Vigna radiata var.
            radiata]
          Length = 1383

 Score =  449 bits (1155), Expect = e-141
 Identities = 245/441 (55%), Positives = 302/441 (68%), Gaps = 9/441 (2%)
 Frame = +3

Query: 171  MDILNASTPKAIPFPTFCHPKSLNPKTPSN-----SPFHRNLFPLYLTTSTARKFQTWAH 335
            MDILN S P     P+FC PK+L PK P N     SPF R  FPLYL+ STA KFQTWAH
Sbjct: 1    MDILNISNPSNFSIPSFCQPKALKPKFPPNYNKPTSPFRRTPFPLYLSRSTAVKFQTWAH 60

Query: 336  FGRPTTXXXXXXXXXXXXHDHQVRPKHTSTDPSPVSGNGVEVSDAGFQRVSVDD--VEVE 509
             GRPT              DH+V P     DP   SGNGVE S  G Q  SV D  VE E
Sbjct: 61   SGRPTKRRNSLRKKLL--RDHKVIPNQIPNDPLSFSGNGVEESGVGIQGDSVADSVVEAE 118

Query: 510  KPKPKLLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVFTVYQDSFGGVKRVSVDEEEILR 689
            K K KLLGESVL NKLE WVDQY +DIEYWG+G+GPVFTVY+DS GGVKRV VDEEEIL+
Sbjct: 119  KSKSKLLGESVLWNKLESWVDQYKRDIEYWGVGSGPVFTVYEDSLGGVKRVFVDEEEILK 178

Query: 690  RSRVQRDEIEEFPEVKDKIQDANNLAREMENGNNVISRNSSVAKFVVQGEEEGGFVKAAS 869
            RS+V+RD I +FPEV+ KI +A N+A EME+GNNVI+RNSSV KFVV G+EEGGFVKA  
Sbjct: 179  RSKVRRDVIGDFPEVRSKILNAKNMASEMESGNNVIARNSSVTKFVVHGKEEGGFVKAVR 238

Query: 870  GFVAQPGLLPKLSGVGSRELCVLVVLWAVKELFSFGV--KEAQYXXXXXXXXXXXXXXXX 1043
            GFVA+P LLP+LS VG   L VLVV+W VK+LF+FG   KE ++                
Sbjct: 239  GFVAKPQLLPRLSRVGRYVLYVLVVMWVVKKLFAFGEGDKEVEFTPLEKEMMRRKMKARK 298

Query: 1044 XXXXLVKGAVEVIPEPLETPVIDIKKPKLDMEQLKNNILKVKASADKPVVVENSPAEVRT 1223
                LVKG+VEVI EP ETPV+DIK+PKLD EQL+NNILK K S+DK +V+ +S  +++ 
Sbjct: 299  EKEKLVKGSVEVIVEPSETPVVDIKRPKLDKEQLRNNILKAKGSSDK-LVLGDSSDKIKA 357

Query: 1224 GYMDVDYKVQXXXXXXXXXXXXXGRDHSLVSRDMEMDEPVIEKSSNETKVIKKDSEQDNS 1403
              M++DYKVQ             GRD+ +V++D+E D+ VI KSS++ ++IK+ SE+D+S
Sbjct: 358  ISMEMDYKVQEIKEMARQARKIEGRDNVVVNKDLETDDSVIRKSSDDNELIKRKSERDDS 417

Query: 1404 LSNHQSKFSRKTTGSNAILQT 1466
            L+++Q +  R+TT SN ILQ+
Sbjct: 418  LTDNQIEVVRETTDSNVILQS 438


>XP_017405818.1 PREDICTED: uncharacterized protein LOC108319257 [Vigna angularis]
            KOM25752.1 hypothetical protein LR48_Vigan181s003000
            [Vigna angularis] BAT98106.1 hypothetical protein
            VIGAN_09172500 [Vigna angularis var. angularis]
          Length = 1413

 Score =  431 bits (1109), Expect = e-134
 Identities = 242/441 (54%), Positives = 295/441 (66%), Gaps = 9/441 (2%)
 Frame = +3

Query: 171  MDILNASTPKAIPFPTFCHPKSLNPKTPSN-----SPFHRNLFPLYLTTSTARKFQTWAH 335
            MDIL  S       P+FC PK+L  K P N     SPF R  F +YL+ STA KFQTWAH
Sbjct: 1    MDILKISNLSNFSIPSFCQPKALKLKFPPNYNKPTSPFRRTPFSVYLSRSTAVKFQTWAH 60

Query: 336  FGRPTTXXXXXXXXXXXXHDHQVRPKHTSTDPSPVSGNGVEVSDAGFQRVSVDD--VEVE 509
             GRPT              DH+V P     DP  VSGNG + S  G Q  SV D  VE E
Sbjct: 61   SGRPTKRRNSLRKKLL--RDHKVIPNQIPNDPLSVSGNGFKESGVGVQGDSVVDSVVEAE 118

Query: 510  KPKPKLLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVFTVYQDSFGGVKRVSVDEEEILR 689
            K K KLLGESVL NKLE WVDQY +DIEYWG+G+GPVFTVY+DS GGVKRV VDEEEIL+
Sbjct: 119  KSKSKLLGESVLWNKLESWVDQYKRDIEYWGVGSGPVFTVYEDSLGGVKRVFVDEEEILK 178

Query: 690  RSRVQRDEIEEFPEVKDKIQDANNLAREMENGNNVISRNSSVAKFVVQGEEEGGFVKAAS 869
            RS+V+RD I +FPEV+ KI +A N+AREME+GNNVI+RNSSV KFVV G+EEGGFVKA  
Sbjct: 179  RSKVRRDVIGDFPEVRSKILNAKNMAREMESGNNVIARNSSVTKFVVHGKEEGGFVKAVR 238

Query: 870  GFVAQPGLLPKLSGVGSRELCVLVVLWAVKELFSFGV--KEAQYXXXXXXXXXXXXXXXX 1043
             FVA+P LLP+LS VG   L VLVV+W VK+LF+FG   KE +                 
Sbjct: 239  VFVAKPQLLPRLSRVGRYVLYVLVVMWVVKKLFAFGEGDKEVECTALEKEMMRRKMKARK 298

Query: 1044 XXXXLVKGAVEVIPEPLETPVIDIKKPKLDMEQLKNNILKVKASADKPVVVENSPAEVRT 1223
                LVKGAVEVI EP ETPV+DIK PKLD EQL+NNILK K S+DK +VV +S  +++ 
Sbjct: 299  EKEKLVKGAVEVIVEPSETPVVDIKMPKLDKEQLRNNILKAKGSSDK-LVVGDSSDKIKA 357

Query: 1224 GYMDVDYKVQXXXXXXXXXXXXXGRDHSLVSRDMEMDEPVIEKSSNETKVIKKDSEQDNS 1403
              M++DYKVQ             GRD+ +V++D+EMD+ VI KSS++ + IK+  E+D+S
Sbjct: 358  ISMEMDYKVQEIKEMARQARKIEGRDNVVVNKDLEMDDSVIRKSSDDNEFIKRKRERDDS 417

Query: 1404 LSNHQSKFSRKTTGSNAILQT 1466
            LS++Q +  R+TT SN ILQ+
Sbjct: 418  LSDNQIEVVRETTDSNVILQS 438


>XP_013444623.1 embryo defective 1703 protein, putative [Medicago truncatula]
            KEH18648.1 embryo defective 1703 protein, putative
            [Medicago truncatula]
          Length = 1172

 Score =  425 bits (1092), Expect = e-134
 Identities = 246/435 (56%), Positives = 295/435 (67%), Gaps = 3/435 (0%)
 Frame = +3

Query: 171  MDILNASTPKAIPFPTFCHPKSLNPKTPSNSPFHRNLFPLYLTTSTARKFQTWAHFGRPT 350
            MDILN S PK I +P FC+P++L      N+PFH+N F  YLTTST+RKFQT AHF RPT
Sbjct: 1    MDILNFSPPKTISYPFFCNPRTLYTSN-RNTPFHKNTFSFYLTTSTSRKFQTLAHFRRPT 59

Query: 351  TXXXXXXXXXXXXHDHQVRPKHTSTDPSPVSGNGVE-VSDAGFQRVSVDDVEVEK-PKPK 524
                         HDHQV   H   DPS VS N VE + DA F       VE+EK  K +
Sbjct: 60   NRRNSLRNKLL--HDHQVSRNHIPNDPSSVSSNHVEEIDDASF-------VELEKLHKSE 110

Query: 525  LLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVFTVYQDSFGGVKRVSVDEEEILRRSRVQ 704
            LLGE+VLLNKL++WVDQY KDI++WGIG+ P+FTVYQD FGGVKRV VDE+EIL+  RV 
Sbjct: 111  LLGENVLLNKLDNWVDQYRKDIDFWGIGSAPIFTVYQDLFGGVKRVLVDEDEILK--RVG 168

Query: 705  RDEIEEFPEVKDKIQDANNLAREMENGNNVISRNSSVAKFVVQGEEE-GGFVKAASGFVA 881
             ++IE      DKI +A  LAREME+G NVI++NSSVAKF+VQGEEE G FVKA  GF+ 
Sbjct: 169  GNDIE------DKILEAKKLAREMESGENVIAKNSSVAKFIVQGEEEKGDFVKAVRGFIV 222

Query: 882  QPGLLPKLSGVGSRELCVLVVLWAVKELFSFGVKEAQYXXXXXXXXXXXXXXXXXXXXLV 1061
            QPGL+PKLSGVG   LCV  V++ VK+LF FG KE +Y                    L+
Sbjct: 223  QPGLVPKLSGVGGIVLCVF-VMFGVKKLFRFGDKEVRYTEMEKKMMMRKAKARKEKEMLM 281

Query: 1062 KGAVEVIPEPLETPVIDIKKPKLDMEQLKNNILKVKASADKPVVVENSPAEVRTGYMDVD 1241
            KGAVEVI E  ETPVI +KKP+LD EQLK NILK KAS+DK +VV+NS  EV TG MD+D
Sbjct: 282  KGAVEVIHESTETPVIGVKKPELDKEQLKYNILKAKASSDK-LVVQNSSGEVITGSMDMD 340

Query: 1242 YKVQXXXXXXXXXXXXXGRDHSLVSRDMEMDEPVIEKSSNETKVIKKDSEQDNSLSNHQS 1421
            YKV+             G D SLVS+DMEMD+ VI KSS E +VIK++S+QDNSLSN Q+
Sbjct: 341  YKVREIREMARRAREIEGGDRSLVSKDMEMDDSVIGKSSKEIEVIKENSKQDNSLSNRQN 400

Query: 1422 KFSRKTTGSNAILQT 1466
            + + KTT SN IL T
Sbjct: 401  EGASKTTDSNGILHT 415


>OIW04587.1 hypothetical protein TanjilG_18064 [Lupinus angustifolius]
          Length = 1199

 Score =  424 bits (1091), Expect = e-133
 Identities = 250/440 (56%), Positives = 292/440 (66%), Gaps = 8/440 (1%)
 Frame = +3

Query: 171  MDILNASTPKAIPFPTFCHPKSLNPKTPS--NSP---FHRNLFPLYLTTSTARKFQTWAH 335
            M ILN ST    PF  FCHPK+L+PK PS  + P   F RN FP YL+TST  KFQT AH
Sbjct: 2    MHILNVSTT---PFNFFCHPKTLHPKFPSYPHKPTFRFQRNTFPRYLSTSTTVKFQTLAH 58

Query: 336  FGRPTTXXXXXXXXXXXXHDH-QVRPKHTST-DPSPVSGNGVEVSDAGFQRVSVDDVEVE 509
            FGRPT             HDH QVRP      +PS +  N VE       +V ++    E
Sbjct: 59   FGRPTNRRNSLRKKLL--HDHNQVRPNQVEIQNPSSIVDNVVE-------KVEIE----E 105

Query: 510  KPKPKLLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVFTVYQDSFGGVKRVSVDEEEILR 689
            K +PKLLGESVLLNKLE+W++QY KDIEYWGIG+GP+FTVYQDSFG V+RV VDEEEILR
Sbjct: 106  KTEPKLLGESVLLNKLENWLEQYKKDIEYWGIGSGPIFTVYQDSFGNVQRVLVDEEEILR 165

Query: 690  RSRVQRDEIEEFPEVKDKIQDANNLAREMENGNNVISRNSSVAKFVVQGEE-EGGFVKAA 866
            RSRV R+ I++FPEV +KI  A N+AREMENGNNVI+RNSSVA FVVQGEE +G FVK  
Sbjct: 166  RSRVLREVIDDFPEVSNKILYAKNMAREMENGNNVIARNSSVANFVVQGEEGKGDFVKGI 225

Query: 867  SGFVAQPGLLPKLSGVGSRELCVLVVLWAVKELFSFGVKEAQYXXXXXXXXXXXXXXXXX 1046
             GFV QPG LPK+ GVGSR L VLVVLWA K LFSFG KE ++                 
Sbjct: 226  RGFVVQPGFLPKVKGVGSRVLFVLVVLWAAKNLFSFGDKEVEHTEKEKEMMRRKIKARKE 285

Query: 1047 XXXLVKGAVEVIPEPLETPVIDIKKPKLDMEQLKNNILKVKASADKPVVVENSPAEVRTG 1226
               LVKGAVEVIPE  E+ VID+KKP LD EQL N+I+K KASADK +VV+ S A+    
Sbjct: 286  KEMLVKGAVEVIPEVSESLVIDMKKPNLDKEQLMNSIIKAKASADK-LVVQGSSAKGGNR 344

Query: 1227 YMDVDYKVQXXXXXXXXXXXXXGRDHSLVSRDMEMDEPVIEKSSNETKVIKKDSEQDNSL 1406
             MD+D+KVQ             G D S VS D EMD+P IE+ SNE +VIK + EQ  SL
Sbjct: 345  PMDMDFKVQEIREMAREARKIEGIDCSHVSSDTEMDDPGIEELSNEMEVIKMNGEQHKSL 404

Query: 1407 SNHQSKFSRKTTGSNAILQT 1466
            SNHQ++  RKT   N+ LQT
Sbjct: 405  SNHQNEVERKTKDCNSTLQT 424


>XP_019456207.1 PREDICTED: uncharacterized protein LOC109356989 [Lupinus
            angustifolius]
          Length = 1214

 Score =  424 bits (1091), Expect = e-133
 Identities = 250/440 (56%), Positives = 292/440 (66%), Gaps = 8/440 (1%)
 Frame = +3

Query: 171  MDILNASTPKAIPFPTFCHPKSLNPKTPS--NSP---FHRNLFPLYLTTSTARKFQTWAH 335
            M ILN ST    PF  FCHPK+L+PK PS  + P   F RN FP YL+TST  KFQT AH
Sbjct: 2    MHILNVSTT---PFNFFCHPKTLHPKFPSYPHKPTFRFQRNTFPRYLSTSTTVKFQTLAH 58

Query: 336  FGRPTTXXXXXXXXXXXXHDH-QVRPKHTST-DPSPVSGNGVEVSDAGFQRVSVDDVEVE 509
            FGRPT             HDH QVRP      +PS +  N VE       +V ++    E
Sbjct: 59   FGRPTNRRNSLRKKLL--HDHNQVRPNQVEIQNPSSIVDNVVE-------KVEIE----E 105

Query: 510  KPKPKLLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVFTVYQDSFGGVKRVSVDEEEILR 689
            K +PKLLGESVLLNKLE+W++QY KDIEYWGIG+GP+FTVYQDSFG V+RV VDEEEILR
Sbjct: 106  KTEPKLLGESVLLNKLENWLEQYKKDIEYWGIGSGPIFTVYQDSFGNVQRVLVDEEEILR 165

Query: 690  RSRVQRDEIEEFPEVKDKIQDANNLAREMENGNNVISRNSSVAKFVVQGEE-EGGFVKAA 866
            RSRV R+ I++FPEV +KI  A N+AREMENGNNVI+RNSSVA FVVQGEE +G FVK  
Sbjct: 166  RSRVLREVIDDFPEVSNKILYAKNMAREMENGNNVIARNSSVANFVVQGEEGKGDFVKGI 225

Query: 867  SGFVAQPGLLPKLSGVGSRELCVLVVLWAVKELFSFGVKEAQYXXXXXXXXXXXXXXXXX 1046
             GFV QPG LPK+ GVGSR L VLVVLWA K LFSFG KE ++                 
Sbjct: 226  RGFVVQPGFLPKVKGVGSRVLFVLVVLWAAKNLFSFGDKEVEHTEKEKEMMRRKIKARKE 285

Query: 1047 XXXLVKGAVEVIPEPLETPVIDIKKPKLDMEQLKNNILKVKASADKPVVVENSPAEVRTG 1226
               LVKGAVEVIPE  E+ VID+KKP LD EQL N+I+K KASADK +VV+ S A+    
Sbjct: 286  KEMLVKGAVEVIPEVSESLVIDMKKPNLDKEQLMNSIIKAKASADK-LVVQGSSAKGGNR 344

Query: 1227 YMDVDYKVQXXXXXXXXXXXXXGRDHSLVSRDMEMDEPVIEKSSNETKVIKKDSEQDNSL 1406
             MD+D+KVQ             G D S VS D EMD+P IE+ SNE +VIK + EQ  SL
Sbjct: 345  PMDMDFKVQEIREMAREARKIEGIDCSHVSSDTEMDDPGIEELSNEMEVIKMNGEQHKSL 404

Query: 1407 SNHQSKFSRKTTGSNAILQT 1466
            SNHQ++  RKT   N+ LQT
Sbjct: 405  SNHQNEVERKTKDCNSTLQT 424


>XP_003548415.1 PREDICTED: uncharacterized protein LOC100796285 [Glycine max]
            KHN15928.1 hypothetical protein glysoja_013144 [Glycine
            soja] KRH06458.1 hypothetical protein GLYMA_16G024100
            [Glycine max]
          Length = 1308

 Score =  417 bits (1072), Expect = e-130
 Identities = 237/443 (53%), Positives = 293/443 (66%), Gaps = 11/443 (2%)
 Frame = +3

Query: 171  MDILNASTPKAIPFPTFCHPKSLNPKTPSN-----SPFHRNLFPLYLTTSTARKFQTWAH 335
            M+ILN S P     PTFCHPK+L  K  SN     SPF R  F LYL+ S A KFQTWAH
Sbjct: 1    MEILNISNPTNFSIPTFCHPKTLTSKFTSNNIKPTSPFRRTSFSLYLSRSAAIKFQTWAH 60

Query: 336  FGRPTTXXXXXXXXXXXXHDHQVRPKHTSTDPSPVSGNGVEVSDAGFQRVSVDD--VEVE 509
             GRP+              DH+V P     DP  VSGNGVE S  G Q VSV +  VE E
Sbjct: 61   SGRPSNRRNSLRKKLL--RDHKVNPNQIPNDPFSVSGNGVEESGVGVQGVSVVNNVVEAE 118

Query: 510  KPKPKLLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVFTVYQDSFGGVKRVSVDEEEILR 689
            KPK K+L ESVL NKLE+WVDQY KD+EYWG+G+GP+FTVY+DS G V+RV VDE++IL+
Sbjct: 119  KPKSKILRESVLWNKLENWVDQYKKDVEYWGVGSGPIFTVYEDSLGAVERVVVDEDQILK 178

Query: 690  RSRVQRDEIEEFPEVKDKIQDANNLAREMENGNNVISRNSSVAKFVVQGEEE-GGFVKAA 866
            RS+V+RD +E   EV+ KI +A N+AREME+GNNVI+RNSSVAKFVV+G+EE GGFVKA 
Sbjct: 179  RSKVRRDAVENLAEVRSKILNAKNIAREMESGNNVIARNSSVAKFVVEGKEEGGGFVKAV 238

Query: 867  SGFVAQPGLLPKLSGVGSRELCVLVVLWAVKELF-SFGV--KEAQYXXXXXXXXXXXXXX 1037
             GFVA+P LLP+LS VG + L VLVV+W VK+LF +FG   KE +Y              
Sbjct: 239  QGFVAKPRLLPRLSWVGRKVLYVLVVVWVVKKLFVAFGERDKEVEYTATEKEMMRRKIKA 298

Query: 1038 XXXXXXLVKGAVEVIPEPLETPVIDIKKPKLDMEQLKNNILKVKASADKPVVVENSPAEV 1217
                  L K AVEV+ E  E PV+DIKKPKLD EQL+N+ILKV  SADK +VV +S  +V
Sbjct: 299  REEKEKLTKRAVEVVVESSEAPVVDIKKPKLDKEQLRNSILKVTGSADK-LVVHDSSDKV 357

Query: 1218 RTGYMDVDYKVQXXXXXXXXXXXXXGRDHSLVSRDMEMDEPVIEKSSNETKVIKKDSEQD 1397
            +T   ++DYKVQ             G +  + +RDME D+PVIE SS+       DSEQ 
Sbjct: 358  KTRSTEMDYKVQEIREMARQARKIEGSNGVVGNRDMETDDPVIEISSD-------DSEQY 410

Query: 1398 NSLSNHQSKFSRKTTGSNAILQT 1466
            + LSNHQ++ S++TT SN I+Q+
Sbjct: 411  DGLSNHQNEVSKETTDSNTIMQS 433


>XP_016181516.1 PREDICTED: uncharacterized protein LOC107623680 [Arachis ipaensis]
          Length = 1216

 Score =  385 bits (989), Expect = e-118
 Identities = 225/448 (50%), Positives = 282/448 (62%), Gaps = 16/448 (3%)
 Frame = +3

Query: 171  MDILNASTPKAIPFPTFCHPKSLN----PKTPSNSP-FHRN-LFPLYLTTSTARKFQTWA 332
            M +LN ST     F  FC+PK+L+    P   SN P FHR  LF  +  +S   KFQT+A
Sbjct: 1    MAVLNVST-----FSIFCNPKTLSDTKFPSKYSNKPRFHRTPLFSPHFFSSKTTKFQTFA 55

Query: 333  HFGRPTTXXXXXXXXXXXXHDHQVRPKHTSTDPSPVS--------GNGVEVSDAGFQRVS 488
             FGRPT             + H+V P    TDP P           NGVE S +   +V 
Sbjct: 56   QFGRPTNRRNYLRKKLLHDNHHRVSPNKPITDPPPSEFHEKSSSFNNGVEDSVSEGTKVE 115

Query: 489  VDDVEVEKPKPKLLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVFTVYQDSFGGVKRVSV 668
              +VE ++     LGESV+LNKLE+WV+QY KD EYWGIG+G +FTVY+DS GGVKRV V
Sbjct: 116  NFEVEKQQKSKFSLGESVMLNKLENWVEQYKKDFEYWGIGSGSIFTVYEDSNGGVKRVIV 175

Query: 669  DEEEILRRSRVQRDEIEEFPEVKDKIQDANNLAREMENGNNVISRNSSVAKFVVQGEE-- 842
            DE+EILRR++V R+ IEEFPEV  KI +A N+AREME GNNVISRNSSVAKFVVQGEE  
Sbjct: 176  DEDEILRRNKVDREVIEEFPEVIYKISNAKNMAREMEKGNNVISRNSSVAKFVVQGEEVA 235

Query: 843  EGGFVKAASGFVAQPGLLPKLSGVGSRELCVLVVLWAVKELFSFGVKEAQYXXXXXXXXX 1022
            E GF     GF+AQPGLLPK+S VG R LCV++V+WAVK+LF+ G +E +Y         
Sbjct: 236  ESGFFSGVGGFIAQPGLLPKISRVGGRVLCVMLVMWAVKKLFTIGGEEVEYTGMEKEMMR 295

Query: 1023 XXXXXXXXXXXLVKGAVEVIPEPLETPVIDIKKPKLDMEQLKNNILKVKASADKPVVVEN 1202
                       LVKGA+EVIPE  E+  +DIKKPKLD +QLKNNILK KA+ADK + V+ 
Sbjct: 296  RKIKARKEKEVLVKGAIEVIPEQSESLTMDIKKPKLDKDQLKNNILKAKATADK-LAVQG 354

Query: 1203 SPAEVRTGYMDVDYKVQXXXXXXXXXXXXXGRDHSLVSRDMEMDEPVIEKSSNETKVIKK 1382
            S A+V +     DYKVQ              R+ S VS+D + + PVIE+SSNE +V++K
Sbjct: 355  SSAKVTSKTTHFDYKVQEIQEMARQARRIEAREKSQVSKDTDRNGPVIEESSNEMEVVQK 414

Query: 1383 DSEQDNSLSNHQSKFSRKTTGSNAILQT 1466
            + E+D      Q +  RKTT SNAIL++
Sbjct: 415  NDEKD------QDEVERKTTDSNAILES 436


>XP_015937679.1 PREDICTED: uncharacterized protein LOC107463403 [Arachis duranensis]
          Length = 1221

 Score =  381 bits (978), Expect = e-117
 Identities = 225/449 (50%), Positives = 281/449 (62%), Gaps = 17/449 (3%)
 Frame = +3

Query: 171  MDILNASTPKAIPFPTFCHPKSLN----PKTPSNSP-FHR-NLFPLYLTTSTARKFQTWA 332
            M +LN ST     F TFC+P++L+    P   SN P FHR  LF  +  +S   KFQT+A
Sbjct: 1    MAVLNVST-----FSTFCNPQTLSNTKFPSKYSNKPRFHRIPLFSPHFFSSKTTKFQTFA 55

Query: 333  HFGRPTTXXXXXXXXXXXXHDHQVRPKHTSTDPSPVS--------GNGVEVSDAGFQRVS 488
             FGRPT             + H+V P    TDP P           NGVE S +   +V 
Sbjct: 56   QFGRPTNRRNYLRKKLLHDNHHRVSPNKLITDPRPSEFHEKSTSFNNGVEDSVSEGTKVE 115

Query: 489  VDDVEVEKPKPKLLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVFTVYQDSFGGVKRVSV 668
              +VE ++     LGESV+LNKLE+WV+QY KD EYWGIG+G +FTVY+DS GGVKRV V
Sbjct: 116  NFEVEKQQKSKFSLGESVMLNKLENWVEQYKKDFEYWGIGSGSIFTVYEDSNGGVKRVIV 175

Query: 669  DEEEILRRSRVQRDEIEEFPEVKDKIQDANNLAREMENGNNVISRNSSVAKFVVQGEE-- 842
            DE+EILRR++V R+ I+EFPEV  KI +A N+AREME GNNVISRNSSVAKFVVQGEE  
Sbjct: 176  DEDEILRRNKVDREVIDEFPEVIYKISNAKNMAREMEKGNNVISRNSSVAKFVVQGEEVV 235

Query: 843  -EGGFVKAASGFVAQPGLLPKLSGVGSRELCVLVVLWAVKELFSFGVKEAQYXXXXXXXX 1019
             E GFV    GF+AQPGLL K+S VG R LCVL+V+WAVK+LF+ G +E +Y        
Sbjct: 236  AESGFVSGVRGFIAQPGLLLKISRVGGRVLCVLLVMWAVKKLFTVGGEEVEYTGMEKEMM 295

Query: 1020 XXXXXXXXXXXXLVKGAVEVIPEPLETPVIDIKKPKLDMEQLKNNILKVKASADKPVVVE 1199
                        LVKGA+EVIPE  E+   DIKKPKLD +QLKNNILK KA+ADKP  V+
Sbjct: 296  RRKIKARKEKEVLVKGAIEVIPEQSESLTTDIKKPKLDKDQLKNNILKAKATADKP-AVQ 354

Query: 1200 NSPAEVRTGYMDVDYKVQXXXXXXXXXXXXXGRDHSLVSRDMEMDEPVIEKSSNETKVIK 1379
               A+V +     DYKVQ              R+ S VS+D + + PVIE+SSNE +V++
Sbjct: 355  GLSAKVTSKTTHFDYKVQEIQEMARRARRIEAREKSQVSKDTDRNGPVIEESSNEMEVVQ 414

Query: 1380 KDSEQDNSLSNHQSKFSRKTTGSNAILQT 1466
            K+ E+D      Q +  RKTT SNAIL++
Sbjct: 415  KNDEKD------QDEVERKTTDSNAILES 437


>KRH47901.1 hypothetical protein GLYMA_07G055400 [Glycine max]
          Length = 955

 Score =  319 bits (817), Expect = 1e-95
 Identities = 203/438 (46%), Positives = 259/438 (59%), Gaps = 6/438 (1%)
 Frame = +3

Query: 171  MDILNASTPKAIPFPTFCHPKSLNPKTPSN-----SPFHRNLFPLYLTTSTARKFQTWAH 335
            M+IL+ S P      T C P++L  K P N     SPF R  F LYL+     KFQTWAH
Sbjct: 1    MEILSISNP------TLCLPQTLTLKFPPNHSKPTSPFLRTPFSLYLSRFAVIKFQTWAH 54

Query: 336  FGRPTTXXXXXXXXXXXXHDHQVRPKHTSTDPSPVSGNGVEVSDAGFQRVSVDDVEVEKP 515
             GRP+              D +V P     DP  VSGNGVE S  G Q V  + VEVEKP
Sbjct: 55   SGRPSNRRNSLRKKLLL--DLKVNPNQIPNDPFSVSGNGVEESGVGVQGVD-NVVEVEKP 111

Query: 516  KPKLLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVFTVYQDSFGGVKRVSVDEEEILRRS 695
            K KLL ESVL NKL +W DQY +D+EYWG+G+G +FTVY+DS GG+KRV VDE+ IL+RS
Sbjct: 112  KSKLLRESVLWNKLGNWADQYKRDVEYWGVGSGRIFTVYEDSIGGIKRVVVDEDPILKRS 171

Query: 696  RVQRDEIEEFPEVKDKIQDANNLAREMENGNNVISRNSSVAKFVVQGEEEGGFVKAASGF 875
            +V                   N+AREME+GNNVI+RNSSVAKF+           A  GF
Sbjct: 172  KV-------------------NMAREMESGNNVIARNSSVAKFM-----------AVQGF 201

Query: 876  VAQPGLLPKLSGVGSRELCVLVVLWAVKELF-SFGVKEAQYXXXXXXXXXXXXXXXXXXX 1052
            VA+P LLP+LS +G + L VLVV+W VK+LF +FG  + +                    
Sbjct: 202  VAKPRLLPRLSELGRKVLYVLVVVWMVKKLFVAFGEGDKE--------------VEEEKE 247

Query: 1053 XLVKGAVEVIPEPLETPVIDIKKPKLDMEQLKNNILKVKASADKPVVVENSPAEVRTGYM 1232
             L KG VEV+ EP ETP +DIKK +LD EQL+N+ILKVK S  K VV ++S  +V+T + 
Sbjct: 248  KLAKGTVEVVVEPWETPAVDIKK-QLDKEQLRNSILKVKDSVYKSVVHDSSD-KVKTRFT 305

Query: 1233 DVDYKVQXXXXXXXXXXXXXGRDHSLVSRDMEMDEPVIEKSSNETKVIKKDSEQDNSLSN 1412
            ++DYK               G D  +V++D+EMD+PVIE SSN       DSEQD+ LSN
Sbjct: 306  EMDYK---------------GSDSVVVNKDIEMDDPVIEISSN-------DSEQDDGLSN 343

Query: 1413 HQSKFSRKTTGSNAILQT 1466
            HQ++ S++TT SN I+Q+
Sbjct: 344  HQNEVSKETTDSNTIMQS 361


>GAU43060.1 hypothetical protein TSUD_350060 [Trifolium subterraneum]
          Length = 1056

 Score =  313 bits (803), Expect = 6e-93
 Identities = 179/300 (59%), Positives = 214/300 (71%), Gaps = 5/300 (1%)
 Frame = +3

Query: 582  KDIEYWGIGTGPVFTVYQDSFGGVKRVSVDEEEILRRSRVQR--DEIEEFPEVKDKIQDA 755
            KD ++WGIG+ P+FTVY+DSFGGVKRV VDE+EIL+R RVQR   EIE   EVK KI DA
Sbjct: 8    KDSDFWGIGSSPIFTVYEDSFGGVKRVLVDEDEILKRIRVQRGGSEIENLSEVKCKILDA 67

Query: 756  NNLAREMENGNNVISRNSSVAKFVVQGEEE-GGFVKAASGFVAQPGLLPKLSGVGSRELC 932
              LAREMENG+NVI+R+SSVAKFVVQGEEE GGFV A  GFV QP L+PKL+GVG   LC
Sbjct: 68   KKLAREMENGDNVIARDSSVAKFVVQGEEEKGGFVTAVRGFVVQPRLVPKLTGVGGIVLC 127

Query: 933  VLVVLWAVKELFSFGVKEAQYXXXXXXXXXXXXXXXXXXXXLVKGAVEVIPEPLETP-VI 1109
            VLVV++A K+LFSFG KE +Y                     +KGAVEVI E  E P VI
Sbjct: 128  VLVVMFAAKKLFSFGSKEVEYTETEKKMMMRKVKARKEKERSMKGAVEVIHETTEIPAVI 187

Query: 1110 DIKKPKLDMEQLKNNILKVKASADKPVVVENSPAEVRTGYMDVDYKVQXXXXXXXXXXXX 1289
            D+KKPKLD EQLKNNI+  KAS+DK +VV+NS  EVRTG +D+DYK++            
Sbjct: 188  DVKKPKLDKEQLKNNIVNAKASSDK-LVVQNSSGEVRTGSVDMDYKIREIREMARRAREI 246

Query: 1290 XGRDHSLVSRDMEMDEPVIEKSSNETKVIKKDSEQDNSLSNHQSKFSRKTTGS-NAILQT 1466
             GRDHSL S+DME+++P+I KSS+E       SE DNSLSNHQ++ +RKTT S N ILQT
Sbjct: 247  EGRDHSLGSKDMEVEDPLIGKSSDE-------SEVDNSLSNHQNEVARKTTDSNNEILQT 299


>KHN06315.1 hypothetical protein glysoja_021153 [Glycine soja]
          Length = 291

 Score =  269 bits (688), Expect = 9e-84
 Identities = 158/315 (50%), Positives = 190/315 (60%), Gaps = 8/315 (2%)
 Frame = +3

Query: 171  MDILNASTPKAIPFPTFCHPKSLNPKTPSN-----SPFHRNLFPLYLTTSTARKFQTWAH 335
            M+IL+ S P      T C P++L  K P N     SPF R  F LYL+     KFQTWAH
Sbjct: 1    MEILSISNP------TLCLPQTLTLKFPPNHSKPTSPFLRTPFSLYLSRFAVIKFQTWAH 54

Query: 336  FGRPTTXXXXXXXXXXXXHDHQVRPKHTSTDPSPVSGNGVEVSDAGFQRVSVDDVEVEKP 515
             GRP+              D +V P     DP  VSGNGVE S  G Q V  + VEVEKP
Sbjct: 55   SGRPSNRRNSLSKKLL--RDRKVNPNQIPNDPFSVSGNGVEESGVGDQGVD-NVVEVEKP 111

Query: 516  KPKLLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVFTVYQDSFGGVKRVSVDEEEILRRS 695
            K KLL +SVL NKLE+W DQY +D+EYWG+G+GP+FTVY+DS GGVKRV VDE++IL+RS
Sbjct: 112  KSKLLRDSVLWNKLENWADQYKRDVEYWGVGSGPIFTVYEDSIGGVKRVVVDEDQILKRS 171

Query: 696  RVQRDEIEEFPEVKDKIQDANNLAREMENGNNVISRNSSVAKFVVQGEEEGGFVKAASGF 875
            +V                   N+AREME+GNNVI RNSSVAKF+V+G+EEGGFVKA  GF
Sbjct: 172  KV-------------------NMAREMESGNNVIVRNSSVAKFMVEGKEEGGFVKAVQGF 212

Query: 876  VAQPGLLPKLSGVGSRELCVLVVLWAVKEL---FSFGVKEAQYXXXXXXXXXXXXXXXXX 1046
            VA+P LLP LSG+G + L VLVV+W VK+L   F  G KE +Y                 
Sbjct: 213  VAKPRLLPWLSGLGRKVLYVLVVVWMVKKLFVAFGEGDKEVEYTAMEKEMMRRKMKAREE 272

Query: 1047 XXXLVKGAVEVIPEP 1091
               L KG VEV+ EP
Sbjct: 273  KEKLAKGTVEVVVEP 287


>OAY62160.1 hypothetical protein MANES_01G246200 [Manihot esculenta]
          Length = 1154

 Score =  254 bits (649), Expect = 6e-71
 Identities = 170/446 (38%), Positives = 235/446 (52%), Gaps = 29/446 (6%)
 Frame = +3

Query: 171  MDILNAS-------TPKAIPFPTFCHPKSLNPKTPSNSPFHRNL-FPLYLTTSTARKFQT 326
            M++LN S       TP++  F      K+   K PS S  H+NL  P +L+  T R    
Sbjct: 1    MELLNPSVSNRHLFTPRSSFFTRKFSFKTCKTKIPSKS--HKNLSVPFHLSFFTTRIVLV 58

Query: 327  WAHFGRPTTXXXXXXXXXXXXHDHQVRPKHT-STDPSPVSGN-GVEVSDAGFQRVSVDDV 500
             AHFGRPT              D QVR K+  S +PS    N  +   + G  + ++D  
Sbjct: 59   SAHFGRPTNRRNSLRKKHVD--DQQVRQKNPISLNPSSDFQNPNIHFDNIGNSQETLDYD 116

Query: 501  EVE-------------------KPKPKLLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVF 623
             +E                   K KPK LGESVL  KLEDWV QY+KD  YWG+G+ P+F
Sbjct: 117  SLEGIDSSYGVGLVEPGWEKTWKTKPKELGESVLSTKLEDWVHQYNKDTAYWGLGSSPIF 176

Query: 624  TVYQDSFGGVKRVSVDEEEILRRSRVQRDEIEEFPEVKDKIQDANNLAREMENGNNVISR 803
            T++ D  G VKRV VDE+EIL+RS+V++ E+ +  ++  KI  A +LAR ME G NVI R
Sbjct: 177  TLFHDLKGNVKRVIVDEDEILKRSQVKKRELGDITKLNSKISYAKDLARRMEEGGNVIPR 236

Query: 804  NSSVAKFVVQGEEEGGFVKAASGFVAQPGLLPKLSGVGSRELCVLVVLWAVKELFSFGVK 983
            NSSVAKFVV   EE GFV +    V QP  +P LSG+G    C  V +WA+K+LF+ G K
Sbjct: 237  NSSVAKFVV-SREESGFVNSIRDAVFQPQFVPVLSGLGKLTFCGFVAIWALKKLFTSGNK 295

Query: 984  EAQYXXXXXXXXXXXXXXXXXXXXLVKGAVEVIPEPLETPVIDIKKPKLDMEQLKNNILK 1163
            + Q                     L KG VEV+ EP E P++  +KPKLD ++L  NIL 
Sbjct: 296  KEQLTEVEKEMMRRKIKSRQEKEMLEKGRVEVVQEPSELPMLSTEKPKLDKQELMRNILD 355

Query: 1164 VKASADKPVVVENSPAEVRTGYMDVDYKVQXXXXXXXXXXXXXGRDHSLVSRDMEMDEPV 1343
             KAS D  V+V++S     T  MD D K+Q                  +V+++ E  + V
Sbjct: 356  AKASKDNLVLVDSSGCHT-TSSMDFDKKIQEIGAMAREAREIQSGGQPMVNKNREEKQSV 414

Query: 1344 IEKSSNETKVIKKDSEQDNSLSNHQS 1421
             ++SS  T++ +K +E+ +S+SN Q+
Sbjct: 415  KDESSGGTELFEKHTEEVSSISNTQN 440


>XP_017975378.1 PREDICTED: uncharacterized protein LOC18613973 isoform X3 [Theobroma
            cacao]
          Length = 1124

 Score =  254 bits (648), Expect = 7e-71
 Identities = 171/459 (37%), Positives = 242/459 (52%), Gaps = 27/459 (5%)
 Frame = +3

Query: 171  MDILNASTPKAIPFPTFCHPKSLNP----KTPSNSPFHR---------NLFPLYLTTSTA 311
            M++LN    K      FC   S  P    KT +  P HR           F   L  S  
Sbjct: 1    MELLNPPISKTPQL--FCSFSSFTPRLSTKTSNKKPLHRFHISKFREIPSFSRCLPLSGT 58

Query: 312  RKFQTWAHFGRPTTXXXXXXXXXXXXHDHQVRPKHTSTDPSP--VSGNGV-----EVSDA 470
            + F   AHFGRPT+            H  QVR     ++P+P   + NG       ++  
Sbjct: 59   KFFHVSAHFGRPTSRRNSLREKLLLDHQ-QVRQNPIPSNPTPDFQNPNGSFENFENLNSG 117

Query: 471  GFQRVSVDDVEVEKPKPKLLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVFTVYQDSFGG 650
            G +++ VD+ +V + K K LGESV+L+KLE+W+DQY KD ++WGIG+GP+FTV  D  G 
Sbjct: 118  GSKQIDVDN-DVGELKSKRLGESVMLSKLENWIDQYKKDADFWGIGSGPIFTVLHDLEGN 176

Query: 651  VKRVSVDEEEILRRSRVQRDEIEEFPEVKDKIQDANNLAREMENGNNVISRNSSVAKFVV 830
            VKRV+V+E+EIL+R      E E+  +V  K+  A NLAREME G NVI RNS VAKFVV
Sbjct: 177  VKRVTVNEDEILKRL-----EFEDLEKVNSKLSYAKNLAREMERGENVIPRNSLVAKFVV 231

Query: 831  QGEEEGGFVKAASGFVAQPGLLPKLSGVGSRELCVLVVLWAVKELFSFGVKEAQYXXXXX 1010
             G +E G V    G + +PG +PKLS  GS  LC  +VLWAVK+LF  G KE  Y     
Sbjct: 232  SG-QESGLVSGVHGVILRPGFMPKLSRGGSLLLCGFLVLWAVKKLFVLGNKEVAYTELEK 290

Query: 1011 XXXXXXXXXXXXXXXLVKGAVEVIPEPLETPVIDIKKPKLDMEQLKNNILKVKASADKPV 1190
                           L KG+VEV+    E P +  ++PKLD +QL NNILK KA+ DK  
Sbjct: 291  EMMRRKIKSRKEREMLEKGSVEVVQASEEPPNMSFQRPKLDRQQLLNNILKAKAAKDKLA 350

Query: 1191 VVENSPAEVRTGYMDVDYKVQXXXXXXXXXXXXXGRDHSLVSRDMEMDEPVIEKSSNETK 1370
            ++++S ++  +  +D +++VQ             GR+ S++ +D +  +   ++  NE +
Sbjct: 351  LLDSSGSQ-SSKSVDFEHEVQEIKVTAKEALETEGREQSVIGKDEKQVQAANKEFCNEMQ 409

Query: 1371 VIKKDSEQDNS-LSN------HQSKFSRKTTGSNAILQT 1466
             IK+D +   S LSN       Q K S +T  + +  +T
Sbjct: 410  AIKEDGQDGVSFLSNLSTEDSEQGKVSYRTVEATSPCET 448


>XP_017975372.1 PREDICTED: uncharacterized protein LOC18613973 isoform X2 [Theobroma
            cacao]
          Length = 1143

 Score =  254 bits (648), Expect = 8e-71
 Identities = 171/459 (37%), Positives = 242/459 (52%), Gaps = 27/459 (5%)
 Frame = +3

Query: 171  MDILNASTPKAIPFPTFCHPKSLNP----KTPSNSPFHR---------NLFPLYLTTSTA 311
            M++LN    K      FC   S  P    KT +  P HR           F   L  S  
Sbjct: 1    MELLNPPISKTPQL--FCSFSSFTPRLSTKTSNKKPLHRFHISKFREIPSFSRCLPLSGT 58

Query: 312  RKFQTWAHFGRPTTXXXXXXXXXXXXHDHQVRPKHTSTDPSP--VSGNGV-----EVSDA 470
            + F   AHFGRPT+            H  QVR     ++P+P   + NG       ++  
Sbjct: 59   KFFHVSAHFGRPTSRRNSLREKLLLDHQ-QVRQNPIPSNPTPDFQNPNGSFENFENLNSG 117

Query: 471  GFQRVSVDDVEVEKPKPKLLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVFTVYQDSFGG 650
            G +++ VD+ +V + K K LGESV+L+KLE+W+DQY KD ++WGIG+GP+FTV  D  G 
Sbjct: 118  GSKQIDVDN-DVGELKSKRLGESVMLSKLENWIDQYKKDADFWGIGSGPIFTVLHDLEGN 176

Query: 651  VKRVSVDEEEILRRSRVQRDEIEEFPEVKDKIQDANNLAREMENGNNVISRNSSVAKFVV 830
            VKRV+V+E+EIL+R      E E+  +V  K+  A NLAREME G NVI RNS VAKFVV
Sbjct: 177  VKRVTVNEDEILKRL-----EFEDLEKVNSKLSYAKNLAREMERGENVIPRNSLVAKFVV 231

Query: 831  QGEEEGGFVKAASGFVAQPGLLPKLSGVGSRELCVLVVLWAVKELFSFGVKEAQYXXXXX 1010
             G +E G V    G + +PG +PKLS  GS  LC  +VLWAVK+LF  G KE  Y     
Sbjct: 232  SG-QESGLVSGVHGVILRPGFMPKLSRGGSLLLCGFLVLWAVKKLFVLGNKEVAYTELEK 290

Query: 1011 XXXXXXXXXXXXXXXLVKGAVEVIPEPLETPVIDIKKPKLDMEQLKNNILKVKASADKPV 1190
                           L KG+VEV+    E P +  ++PKLD +QL NNILK KA+ DK  
Sbjct: 291  EMMRRKIKSRKEREMLEKGSVEVVQASEEPPNMSFQRPKLDRQQLLNNILKAKAAKDKLA 350

Query: 1191 VVENSPAEVRTGYMDVDYKVQXXXXXXXXXXXXXGRDHSLVSRDMEMDEPVIEKSSNETK 1370
            ++++S ++  +  +D +++VQ             GR+ S++ +D +  +   ++  NE +
Sbjct: 351  LLDSSGSQ-SSKSVDFEHEVQEIKVTAKEALETEGREQSVIGKDEKQVQAANKEFCNEMQ 409

Query: 1371 VIKKDSEQDNS-LSN------HQSKFSRKTTGSNAILQT 1466
             IK+D +   S LSN       Q K S +T  + +  +T
Sbjct: 410  AIKEDGQDGVSFLSNLSTEDSEQGKVSYRTVEATSPCET 448


>XP_007051543.2 PREDICTED: uncharacterized protein LOC18613973 isoform X1 [Theobroma
            cacao]
          Length = 1155

 Score =  254 bits (648), Expect = 8e-71
 Identities = 171/459 (37%), Positives = 242/459 (52%), Gaps = 27/459 (5%)
 Frame = +3

Query: 171  MDILNASTPKAIPFPTFCHPKSLNP----KTPSNSPFHR---------NLFPLYLTTSTA 311
            M++LN    K      FC   S  P    KT +  P HR           F   L  S  
Sbjct: 1    MELLNPPISKTPQL--FCSFSSFTPRLSTKTSNKKPLHRFHISKFREIPSFSRCLPLSGT 58

Query: 312  RKFQTWAHFGRPTTXXXXXXXXXXXXHDHQVRPKHTSTDPSP--VSGNGV-----EVSDA 470
            + F   AHFGRPT+            H  QVR     ++P+P   + NG       ++  
Sbjct: 59   KFFHVSAHFGRPTSRRNSLREKLLLDHQ-QVRQNPIPSNPTPDFQNPNGSFENFENLNSG 117

Query: 471  GFQRVSVDDVEVEKPKPKLLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVFTVYQDSFGG 650
            G +++ VD+ +V + K K LGESV+L+KLE+W+DQY KD ++WGIG+GP+FTV  D  G 
Sbjct: 118  GSKQIDVDN-DVGELKSKRLGESVMLSKLENWIDQYKKDADFWGIGSGPIFTVLHDLEGN 176

Query: 651  VKRVSVDEEEILRRSRVQRDEIEEFPEVKDKIQDANNLAREMENGNNVISRNSSVAKFVV 830
            VKRV+V+E+EIL+R      E E+  +V  K+  A NLAREME G NVI RNS VAKFVV
Sbjct: 177  VKRVTVNEDEILKRL-----EFEDLEKVNSKLSYAKNLAREMERGENVIPRNSLVAKFVV 231

Query: 831  QGEEEGGFVKAASGFVAQPGLLPKLSGVGSRELCVLVVLWAVKELFSFGVKEAQYXXXXX 1010
             G +E G V    G + +PG +PKLS  GS  LC  +VLWAVK+LF  G KE  Y     
Sbjct: 232  SG-QESGLVSGVHGVILRPGFMPKLSRGGSLLLCGFLVLWAVKKLFVLGNKEVAYTELEK 290

Query: 1011 XXXXXXXXXXXXXXXLVKGAVEVIPEPLETPVIDIKKPKLDMEQLKNNILKVKASADKPV 1190
                           L KG+VEV+    E P +  ++PKLD +QL NNILK KA+ DK  
Sbjct: 291  EMMRRKIKSRKEREMLEKGSVEVVQASEEPPNMSFQRPKLDRQQLLNNILKAKAAKDKLA 350

Query: 1191 VVENSPAEVRTGYMDVDYKVQXXXXXXXXXXXXXGRDHSLVSRDMEMDEPVIEKSSNETK 1370
            ++++S ++  +  +D +++VQ             GR+ S++ +D +  +   ++  NE +
Sbjct: 351  LLDSSGSQ-SSKSVDFEHEVQEIKVTAKEALETEGREQSVIGKDEKQVQAANKEFCNEMQ 409

Query: 1371 VIKKDSEQDNS-LSN------HQSKFSRKTTGSNAILQT 1466
             IK+D +   S LSN       Q K S +T  + +  +T
Sbjct: 410  AIKEDGQDGVSFLSNLSTEDSEQGKVSYRTVEATSPCET 448


>XP_004306670.1 PREDICTED: uncharacterized protein LOC101313638 [Fragaria vesca
            subsp. vesca]
          Length = 1166

 Score =  250 bits (638), Expect = 2e-69
 Identities = 174/469 (37%), Positives = 240/469 (51%), Gaps = 42/469 (8%)
 Frame = +3

Query: 171  MDILNASTPK-------AIPFPTFCHPKSLNPKT------PSNSPFHRNLFPLYLTTSTA 311
            M++L +S P          PFPT    KS NPKT      PS +P     F +Y  +   
Sbjct: 1    MELLCSSIPTNPNSLSFTTPFPTRFPNKSWNPKTTFRYRKPSKNPS----FSIYFLSRNT 56

Query: 312  RKFQTWAHFGRPTTXXXXXXXXXXXXHDHQVRPK---------HTSTDPSPVSGNGVEVS 464
             KFQ +A FGRPT+             D +V P          +T+ D S    N   V 
Sbjct: 57   TKFQAFAQFGRPTSRRNSLRKKLI--EDQKVNPLIPSFDFQLLNTNIDDSESKLNSDNVK 114

Query: 465  DAGFQRVSVDDV-----------------EVEKPKPKLLGESVLLNKLEDWVDQYSKDIE 593
            +  F+    DD                    E  + K  GESVLL KLE W++QY +D E
Sbjct: 115  EKNFRNWVADDKVKDGEFSNEGGGDSVAGASELKESKGFGESVLLRKLESWIEQYKRDTE 174

Query: 594  YWGIGTGPVFTVYQDSFGGVKRVSVDEEEILRRSRVQRDEIEEFPEVKDKIQDANNLARE 773
            YWGIG+G +FTVYQ S G V+RV V+E+EILRRSR++R  +E  PEV  KI  A +LA+E
Sbjct: 175  YWGIGSGQIFTVYQGSDGNVERVLVNEDEILRRSRIERWGLEGSPEVNLKILQAESLAKE 234

Query: 774  MENGNNVISRNSSVAKFVVQGEEEGGFVKAASGFVAQPGLLPKLSGVGSRELCVLVVLWA 953
            ME+G +VI  NSSVAKFVVQG EE GF+K   GF  QP  LPKLS VG   + VL+ LWA
Sbjct: 235  MESGLDVIPWNSSVAKFVVQG-EESGFLKTIRGFTLQPDFLPKLSRVGRLMVYVLIALWA 293

Query: 954  VKELFSFGVKEAQYXXXXXXXXXXXXXXXXXXXXLVKG--AVEVIPEPLETPVIDIKKPK 1127
            +K+L   G KE +Y                    L KG   VEV+ E  E P++  +KP 
Sbjct: 294  LKKLVGSGNKEEKYTELEKEMMRRKMKARQEKEVLEKGNLEVEVVQESSELPLVSFEKPY 353

Query: 1128 LDMEQLKNNILKVKASADKPVVVENSPAEVRTGYMDVDYKVQXXXXXXXXXXXXXGRDHS 1307
            LD ++L N+I+  K+   KP  +++S   + +   + D+KVQ               + S
Sbjct: 354  LDRKELMNSIVSAKSVNGKP-ALQDSSNSMTSKSSEFDFKVQEIKNMARKAREIEQMEQS 412

Query: 1308 LVSRDMEMDEPVIEKSSNETKVIKKDSEQD-NSLSNHQSKFSRKTTGSN 1451
            LV  D +  +PV +K  +E KV+++ +E+  N+L++      R+  GS+
Sbjct: 413  LVGNDEKETQPVNDKLLDEMKVVEQHTEEGANTLTHPLEGDCRQAMGSD 461


>XP_008233144.2 PREDICTED: uncharacterized protein LOC103332203 [Prunus mume]
          Length = 1193

 Score =  248 bits (633), Expect = 1e-68
 Identities = 174/479 (36%), Positives = 245/479 (51%), Gaps = 53/479 (11%)
 Frame = +3

Query: 171  MDILNASTPKA-------IPFPTFCHPKSLNPKTPS--NSP---FHRN-LFPLYLTTSTA 311
            M++  +STP          PF T    KS N K P   N P   FH+N  F +YL +  +
Sbjct: 1    MEVFYSSTPTNRKILSLNSPFLTNFPAKSWNKKNPCRYNIPSFGFHKNPSFSIYLLSCHS 60

Query: 312  RKFQTWAHFGRPTTXXXXXXXXXXXXHD-HQVRPKHTSTDPSPVSGNGVEVSDAGFQRVS 488
             KF+  AHFGRP +               +Q+      +       N  + + +  ++V+
Sbjct: 61   TKFRALAHFGRPMSRRNSLRKKLIDEQKVNQISVPLNPSSDFQFLNNNFDDTVSPLEKVN 120

Query: 489  VDDVE--------------------VEKPKPKLLGESVLLNKLEDWVDQYSKDIEYWGIG 608
             D V+                    V++P  K LG+SVLL+KL+ W++QY +D EYWGIG
Sbjct: 121  YDSVKESEFSNEVVADDSSVAETSSVKEPNAKSLGDSVLLSKLDSWMEQYKRDTEYWGIG 180

Query: 609  TGPVFTVYQDSFGGVKRVSVDEEEILRRSRVQRDEIEEFPEVKDKIQDANNLAREMENGN 788
            +G +FTV QDS G VK VSV+E+EILRRSRV+R E+E+  EV  KI  A +LAREME+G 
Sbjct: 181  SGHIFTVNQDSDGNVKVVSVNEDEILRRSRVERLELEDSAEVNLKILQAESLAREMESGK 240

Query: 789  NVISRNSSVAKFVVQGEEEGGFVKAASGFVAQPGLLPKLSGVGSRELCVLVVLWAVKELF 968
            NVI+RNSSVAKFVV+G E+ GF+K   GF  +P  LPK+S  G   L   + LWA+K+LF
Sbjct: 241  NVIARNSSVAKFVVEG-EDSGFMKGIRGFSFRPEFLPKISRFGRLVLYGFIALWALKKLF 299

Query: 969  SFGVKEAQYXXXXXXXXXXXXXXXXXXXXLVKGAVEVIPEPLETPVIDIKKPKLDMEQLK 1148
            +FG KE +Y                    L KG+VEV+    E P+   KKP +D ++L 
Sbjct: 300  TFGNKEERYSELDKEMMRRKIKSRKEKEMLEKGSVEVVQASSELPLGPFKKPSIDKQELM 359

Query: 1149 NNILKVKASADKPVVVENSPAEVRTGYMDVDYKVQXXXXXXXXXXXXXGRDHSLVSRDME 1328
              I++   S     + ++S + +     D D KVQ             GR+HSLV  D +
Sbjct: 360  KAIMRENLSNGNLALQDSSTSMIVAENTDFDDKVQEIRNMARQAREIEGREHSLVGTDRK 419

Query: 1329 ---------MDEPVIEKSS---------NETKVIKKDSEQD-NSLSNHQSKFSRKTTGS 1448
                      DE V +K S         +E KV+K+  E+  N+L+N  +   R+T GS
Sbjct: 420  EIQTVNDEISDETVNDKLSDEIVHDEILDEIKVVKQHEEEGANTLTNRLNGDCRQTKGS 478


>EOX95699.1 Embryo defective 1703, putative isoform 2 [Theobroma cacao]
          Length = 1154

 Score =  248 bits (632), Expect = 1e-68
 Identities = 168/459 (36%), Positives = 239/459 (52%), Gaps = 27/459 (5%)
 Frame = +3

Query: 171  MDILNASTPKAIPFPTFCHPKSLNP----KTPSNSPFHR---------NLFPLYLTTSTA 311
            M++LN    K      FC   S  P    KT +  P HR           F   L  S  
Sbjct: 1    MELLNPPISKTPQL--FCSFSSFTPRLSTKTSNKKPLHRFHISKFREIPSFSRCLPLSGT 58

Query: 312  RKFQTWAHFGRPTTXXXXXXXXXXXXHDHQVRPKHTSTDPSP--VSGNGV-----EVSDA 470
            + F   A FGRPT+            H  QVR     ++P+P   + NG       ++  
Sbjct: 59   KFFHVSAQFGRPTSRRNSLREKLLLDHQ-QVRQNPIPSNPTPDFQNPNGSFENFENLNSG 117

Query: 471  GFQRVSVDDVEVEKPKPKLLGESVLLNKLEDWVDQYSKDIEYWGIGTGPVFTVYQDSFGG 650
            G +++ VD+ +V + K K LGESV+L+KLE+W+DQY KD ++WGIG+GP+FTV  D  G 
Sbjct: 118  GSKQIDVDN-DVGELKSKRLGESVMLSKLENWIDQYKKDADFWGIGSGPIFTVLHDLEGN 176

Query: 651  VKRVSVDEEEILRRSRVQRDEIEEFPEVKDKIQDANNLAREMENGNNVISRNSSVAKFVV 830
            VKR +V+E+EIL+R      E E+  +V  K+  A NLAREME G NVI RNS VAKFVV
Sbjct: 177  VKRATVNEDEILKRL-----EFEDLEKVNSKLSYAKNLAREMERGENVIPRNSLVAKFVV 231

Query: 831  QGEEEGGFVKAASGFVAQPGLLPKLSGVGSRELCVLVVLWAVKELFSFGVKEAQYXXXXX 1010
             G +E G V    G + +PG +PKLS  GS  LC  +VLW VK+LF  G KE  Y     
Sbjct: 232  SG-QESGLVSGVHGVILRPGFMPKLSRGGSLLLCGFLVLWVVKKLFVLGNKEVAYTELEK 290

Query: 1011 XXXXXXXXXXXXXXXLVKGAVEVIPEPLETPVIDIKKPKLDMEQLKNNILKVKASADKPV 1190
                           L KG+VEV+    E P +  ++PKLD +QL NNILK KA+ DK  
Sbjct: 291  EMMRRKIKSRKEREMLEKGSVEVVQASEEPPNMSFQRPKLDRQQLLNNILKAKAAKDKLA 350

Query: 1191 VVENSPAEVRTGYMDVDYKVQXXXXXXXXXXXXXGRDHSLVSRDMEMDEPVIEKSSNETK 1370
            ++++S ++  +  +D +++VQ             GR+ S++ +D +  +   ++  NE +
Sbjct: 351  LLDSSGSQ-SSKSVDFEHEVQEIKVMAKEALETEGREQSVIGKDEKQVQAANKEFCNEMQ 409

Query: 1371 VIKKDSEQDNS-LSN------HQSKFSRKTTGSNAILQT 1466
             IK+D +   S LSN       Q K S +T  + +  +T
Sbjct: 410  AIKEDGQDGVSFLSNLSTEDSEQGKVSYRTVEATSPCET 448

