BLASTX nr result

ID: Wisteria21_contig00010896 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Wisteria21_contig00010896
         (1611 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KRH63274.1| hypothetical protein GLYMA_04G165100 [Glycine max...   326   3e-86
gb|KHN06666.1| hypothetical protein glysoja_047924 [Glycine soja]     326   3e-86
ref|XP_006578558.1| PREDICTED: uncharacterized protein LOC102662...   326   3e-86
ref|XP_007138150.1| hypothetical protein PHAVU_009G184400g [Phas...   300   3e-78
ref|XP_014522229.1| PREDICTED: uncharacterized protein LOC106778...   293   3e-76
ref|XP_013457948.1| DUF863 family protein [Medicago truncatula] ...   293   3e-76
gb|KOM40273.1| hypothetical protein LR48_Vigan04g047100 [Vigna a...   288   1e-74
ref|XP_006581984.1| PREDICTED: uncharacterized protein LOC102666...   216   4e-53
gb|KRH54614.1| hypothetical protein GLYMA_06G198000 [Glycine max]     213   4e-52
ref|XP_006581983.1| PREDICTED: uncharacterized protein LOC102666...   213   4e-52
gb|KHN06896.1| hypothetical protein glysoja_041276 [Glycine soja]     211   1e-51
ref|XP_004508706.1| PREDICTED: uncharacterized protein LOC101495...   148   1e-32
ref|XP_010258910.1| PREDICTED: uncharacterized protein LOC104598...   121   2e-24
ref|XP_010258199.1| PREDICTED: uncharacterized protein LOC104598...   120   5e-24
ref|XP_008374587.1| PREDICTED: uncharacterized protein LOC103437...   110   3e-21
ref|XP_007037462.1| Uncharacterized protein isoform 6 [Theobroma...   110   3e-21
ref|XP_007037460.1| Uncharacterized protein isoform 4 [Theobroma...   110   3e-21
ref|XP_007037457.1| Uncharacterized protein isoform 1 [Theobroma...   110   3e-21
ref|XP_007037461.1| Uncharacterized protein isoform 5 [Theobroma...   108   1e-20
ref|XP_007037459.1| Uncharacterized protein isoform 3 [Theobroma...   108   1e-20

>gb|KRH63274.1| hypothetical protein GLYMA_04G165100 [Glycine max]
            gi|947114973|gb|KRH63275.1| hypothetical protein
            GLYMA_04G165100 [Glycine max]
          Length = 916

 Score =  326 bits (836), Expect = 3e-86
 Identities = 235/569 (41%), Positives = 301/569 (52%), Gaps = 39/569 (6%)
 Frame = -3

Query: 1591 SKNAQKIYCSPNLPWSTAQSSVLIADSIQLPMAFAQEEGRHICPAH--ASTVTEESLKDY 1418
            S ++  +Y S N+PW T+QSSVL  + IQLP+A  QE+ R + P    A T  +ESL D 
Sbjct: 92   SSSSSSLYYSQNMPWLTSQSSVLNPERIQLPLASMQEKSRELSPTPLPAPTAIKESL-DP 150

Query: 1417 KLPEPKCRKIGKKILDLQLPADEYIDSXXXXXXXXXXEV-PQVSAYSLNEISQVVHNSHD 1241
            KL     RK+GKKILDLQLPADEYIDS             P +S Y++           +
Sbjct: 151  KLSGLTYRKVGKKILDLQLPADEYIDSEGDSCENERVIKQPPLSTYTV-----------E 199

Query: 1240 KPYGNNSRD-FTDLNVPFKLDEEAAAKSYDSEAPTHHMNNSFYDLSTRTKFGSQNLPNDA 1064
            KPY  NS + F DLN+P KL +E   KS D  A   H N++F+D+  R   GS N PND 
Sbjct: 200  KPYRINSNNGFADLNLPCKLQKETGVKSDDFGASIPHRNHTFHDMPGRMTPGSHNFPNDV 259

Query: 1063 I---NKRQDLEGCSDNPLPDHGKKHEWXXXXXXXXXLDS----FAK-----------DQI 938
            I    ++QD E   D PL +HG+KH W           S     AK           D +
Sbjct: 260  ILNLKRKQDHEAYPDLPLSNHGQKHGWLPSGTCAGHNGSDLGFLAKFNDMESQSVSIDSV 319

Query: 937  NRGPWRTEK-----------KFSSSESSAWTQGPTNNGLLGPNSASRTCALQQPVSGAYM 791
            ++   + +              S +ES A T  PT+ G L P+ AS TCA  + VS + M
Sbjct: 320  SKKLKQVKYCPCFHSIHQIVPRSRTESPAGTHDPTSGGWLRPSFASCTCAPHKLVSDSDM 379

Query: 790  ISYGISPAGLWKTPVSDFGQSQPEVQASSASLGQNSKSMMGISGFTRDELYQGTGVKSGP 611
             + GISP+ LWK+                                            S P
Sbjct: 380  KNSGISPSVLWKSTT------------------------------------------SDP 397

Query: 610  NLDGQNFLLS-SFCSRSKLLDLPSC-TNDPNNSDNHGSL-VSHEFRKYVEGSKDVGTPKN 440
            NLD Q++LL+ SFCSRS +LDLPS  T D N+ DN GS    HE RKYV+    VGT K+
Sbjct: 398  NLDLQHYLLNQSFCSRSNILDLPSISTGDLNSIDNFGSSSADHELRKYVKDLVYVGTHKS 457

Query: 439  LNLNTMPGGYSDPTE--FQSIQITGEENKFEDSTMGLPWLKEKPVGKGKPNEESKISTQI 266
            +NLN MP G SD T   FQS QITGEE+K +DS   L WLK KPV KGKPNEES++STQ+
Sbjct: 458  INLNIMPAGCSDKTAAAFQSDQITGEEDKCQDSR--LSWLKAKPVAKGKPNEESQLSTQV 515

Query: 265  EPGLLNPYNTGFIH-DLKLRKIEESNMGTEKRLAFHSNGKPHMSSDLHSFHDFPSELFQN 89
            +  LLNPY +G IH DL   K+E+S+  TEK LAF  NGKP             S++FQ+
Sbjct: 516  DSFLLNPYKSGCIHSDLMFNKVEKSDSCTEKTLAFDLNGKPQ-----------TSKVFQS 564

Query: 88   QSKNQRVEEIEKGCISDVKSSCIHVPDLG 2
             SKN  +EEI+K  IS++ S+C   PD+G
Sbjct: 565  LSKNHWIEEIKK--ISNINSACDSDPDMG 591


>gb|KHN06666.1| hypothetical protein glysoja_047924 [Glycine soja]
          Length = 787

 Score =  326 bits (836), Expect = 3e-86
 Identities = 235/569 (41%), Positives = 301/569 (52%), Gaps = 39/569 (6%)
 Frame = -3

Query: 1591 SKNAQKIYCSPNLPWSTAQSSVLIADSIQLPMAFAQEEGRHICPAH--ASTVTEESLKDY 1418
            S ++  +Y S N+PW T+QSSVL  + IQLP+A  QE+ R + P    A T  +ESL D 
Sbjct: 116  SSSSSSLYYSQNMPWLTSQSSVLNPERIQLPLASMQEKSRELSPTPLPAPTAIKESL-DP 174

Query: 1417 KLPEPKCRKIGKKILDLQLPADEYIDSXXXXXXXXXXEV-PQVSAYSLNEISQVVHNSHD 1241
            KL     RK+GKKILDLQLPADEYIDS             P +S Y++           +
Sbjct: 175  KLSGLTYRKVGKKILDLQLPADEYIDSEGESCENERVIKQPPLSTYTV-----------E 223

Query: 1240 KPYGNNSRD-FTDLNVPFKLDEEAAAKSYDSEAPTHHMNNSFYDLSTRTKFGSQNLPNDA 1064
            KPY  NS + F DLN+P KL +E   KS D  A   H N++F+D+  R   GS N PND 
Sbjct: 224  KPYRINSNNGFADLNLPCKLQKETGVKSDDFGASIPHRNHTFHDMPGRMTPGSHNFPNDV 283

Query: 1063 I---NKRQDLEGCSDNPLPDHGKKHEWXXXXXXXXXLDS----FAK-----------DQI 938
            I    ++QD E   D PL +HG+KH W           S     AK           D +
Sbjct: 284  ILNLKRKQDHEAYPDLPLSNHGQKHGWLPSGTCAGHNGSDLGFLAKFNDMESQSVSIDSV 343

Query: 937  NRGPWRTEK-----------KFSSSESSAWTQGPTNNGLLGPNSASRTCALQQPVSGAYM 791
            ++   + +              S +ES A T  PT+ G L P+ AS TCA  + VS + M
Sbjct: 344  SKKLKQVKYCPCFHSIHQIVPRSRTESPAGTHDPTSGGWLRPSFASCTCAPHKLVSDSDM 403

Query: 790  ISYGISPAGLWKTPVSDFGQSQPEVQASSASLGQNSKSMMGISGFTRDELYQGTGVKSGP 611
             + GISP+ LWK+                                            S P
Sbjct: 404  KNSGISPSVLWKSTT------------------------------------------SDP 421

Query: 610  NLDGQNFLLS-SFCSRSKLLDLPSC-TNDPNNSDNHGSL-VSHEFRKYVEGSKDVGTPKN 440
            NLD Q++LL+ SFCSRS +LDLPS  T D N+ DN GS    HE RKYV+    VGT K+
Sbjct: 422  NLDLQHYLLNQSFCSRSNILDLPSISTGDLNSIDNFGSSSADHELRKYVKDLVYVGTHKS 481

Query: 439  LNLNTMPGGYSDPTE--FQSIQITGEENKFEDSTMGLPWLKEKPVGKGKPNEESKISTQI 266
            +NLN MP G SD T   FQS QITGEE+K +DS   L WLK KPV KGKPNEES++STQ+
Sbjct: 482  INLNIMPAGCSDKTAAAFQSDQITGEEDKCQDSR--LSWLKAKPVAKGKPNEESQLSTQV 539

Query: 265  EPGLLNPYNTGFIH-DLKLRKIEESNMGTEKRLAFHSNGKPHMSSDLHSFHDFPSELFQN 89
            +  LLNPY +G IH DL   K+E+S+  TEK LAF  NGKP             S++FQ+
Sbjct: 540  DSFLLNPYKSGCIHSDLMFNKVEKSDSCTEKTLAFDLNGKPQ-----------TSKVFQS 588

Query: 88   QSKNQRVEEIEKGCISDVKSSCIHVPDLG 2
             SKN  +EEI+K  IS++ S+C   PD+G
Sbjct: 589  LSKNHWIEEIKK--ISNINSACDSDPDMG 615


>ref|XP_006578558.1| PREDICTED: uncharacterized protein LOC102662706 isoform X1 [Glycine
            max] gi|571450844|ref|XP_006578559.1| PREDICTED:
            uncharacterized protein LOC102662706 isoform X2 [Glycine
            max] gi|571450846|ref|XP_006578560.1| PREDICTED:
            uncharacterized protein LOC102662706 isoform X3 [Glycine
            max] gi|947114974|gb|KRH63276.1| hypothetical protein
            GLYMA_04G165100 [Glycine max] gi|947114975|gb|KRH63277.1|
            hypothetical protein GLYMA_04G165100 [Glycine max]
            gi|947114976|gb|KRH63278.1| hypothetical protein
            GLYMA_04G165100 [Glycine max]
          Length = 940

 Score =  326 bits (836), Expect = 3e-86
 Identities = 235/569 (41%), Positives = 301/569 (52%), Gaps = 39/569 (6%)
 Frame = -3

Query: 1591 SKNAQKIYCSPNLPWSTAQSSVLIADSIQLPMAFAQEEGRHICPAH--ASTVTEESLKDY 1418
            S ++  +Y S N+PW T+QSSVL  + IQLP+A  QE+ R + P    A T  +ESL D 
Sbjct: 116  SSSSSSLYYSQNMPWLTSQSSVLNPERIQLPLASMQEKSRELSPTPLPAPTAIKESL-DP 174

Query: 1417 KLPEPKCRKIGKKILDLQLPADEYIDSXXXXXXXXXXEV-PQVSAYSLNEISQVVHNSHD 1241
            KL     RK+GKKILDLQLPADEYIDS             P +S Y++           +
Sbjct: 175  KLSGLTYRKVGKKILDLQLPADEYIDSEGDSCENERVIKQPPLSTYTV-----------E 223

Query: 1240 KPYGNNSRD-FTDLNVPFKLDEEAAAKSYDSEAPTHHMNNSFYDLSTRTKFGSQNLPNDA 1064
            KPY  NS + F DLN+P KL +E   KS D  A   H N++F+D+  R   GS N PND 
Sbjct: 224  KPYRINSNNGFADLNLPCKLQKETGVKSDDFGASIPHRNHTFHDMPGRMTPGSHNFPNDV 283

Query: 1063 I---NKRQDLEGCSDNPLPDHGKKHEWXXXXXXXXXLDS----FAK-----------DQI 938
            I    ++QD E   D PL +HG+KH W           S     AK           D +
Sbjct: 284  ILNLKRKQDHEAYPDLPLSNHGQKHGWLPSGTCAGHNGSDLGFLAKFNDMESQSVSIDSV 343

Query: 937  NRGPWRTEK-----------KFSSSESSAWTQGPTNNGLLGPNSASRTCALQQPVSGAYM 791
            ++   + +              S +ES A T  PT+ G L P+ AS TCA  + VS + M
Sbjct: 344  SKKLKQVKYCPCFHSIHQIVPRSRTESPAGTHDPTSGGWLRPSFASCTCAPHKLVSDSDM 403

Query: 790  ISYGISPAGLWKTPVSDFGQSQPEVQASSASLGQNSKSMMGISGFTRDELYQGTGVKSGP 611
             + GISP+ LWK+                                            S P
Sbjct: 404  KNSGISPSVLWKSTT------------------------------------------SDP 421

Query: 610  NLDGQNFLLS-SFCSRSKLLDLPSC-TNDPNNSDNHGSL-VSHEFRKYVEGSKDVGTPKN 440
            NLD Q++LL+ SFCSRS +LDLPS  T D N+ DN GS    HE RKYV+    VGT K+
Sbjct: 422  NLDLQHYLLNQSFCSRSNILDLPSISTGDLNSIDNFGSSSADHELRKYVKDLVYVGTHKS 481

Query: 439  LNLNTMPGGYSDPTE--FQSIQITGEENKFEDSTMGLPWLKEKPVGKGKPNEESKISTQI 266
            +NLN MP G SD T   FQS QITGEE+K +DS   L WLK KPV KGKPNEES++STQ+
Sbjct: 482  INLNIMPAGCSDKTAAAFQSDQITGEEDKCQDSR--LSWLKAKPVAKGKPNEESQLSTQV 539

Query: 265  EPGLLNPYNTGFIH-DLKLRKIEESNMGTEKRLAFHSNGKPHMSSDLHSFHDFPSELFQN 89
            +  LLNPY +G IH DL   K+E+S+  TEK LAF  NGKP             S++FQ+
Sbjct: 540  DSFLLNPYKSGCIHSDLMFNKVEKSDSCTEKTLAFDLNGKPQ-----------TSKVFQS 588

Query: 88   QSKNQRVEEIEKGCISDVKSSCIHVPDLG 2
             SKN  +EEI+K  IS++ S+C   PD+G
Sbjct: 589  LSKNHWIEEIKK--ISNINSACDSDPDMG 615


>ref|XP_007138150.1| hypothetical protein PHAVU_009G184400g [Phaseolus vulgaris]
            gi|593329449|ref|XP_007138151.1| hypothetical protein
            PHAVU_009G184400g [Phaseolus vulgaris]
            gi|561011237|gb|ESW10144.1| hypothetical protein
            PHAVU_009G184400g [Phaseolus vulgaris]
            gi|561011238|gb|ESW10145.1| hypothetical protein
            PHAVU_009G184400g [Phaseolus vulgaris]
          Length = 930

 Score =  300 bits (767), Expect = 3e-78
 Identities = 218/571 (38%), Positives = 283/571 (49%), Gaps = 40/571 (7%)
 Frame = -3

Query: 1594 SSKNAQKIYCSPNLPWSTAQSSVLIADSIQLPMAFAQEEGRHICP----AHASTVTEESL 1427
            +S ++  +Y S NL   T +SS++ A+ IQLP+A  QE  R +CP    A +     +S 
Sbjct: 115  TSSSSSSLYYSQNLHRFTCKSSIIHAEGIQLPLASMQEMNRQLCPTPLPAPSPAAIVQSP 174

Query: 1426 KDYKLPEPKCRKIGKKILDLQLPADEYIDSXXXXXXXXXXEV-PQVSAYSLNEISQVVHN 1250
            KD KL     RK+GKKILDLQLPADEYIDS             P +S Y+LN IS+ V+N
Sbjct: 175  KDPKLSVSAYRKVGKKILDLQLPADEYIDSEGESCENERVIKEPALSTYALNGISKAVYN 234

Query: 1249 SHDKPYGNNSRDFTDLNVPFKLDEEAAAKSYDSEAPTHHMNNSFYDLSTRTKFGSQNLPN 1070
            + +KP+  N   F DLN+PFK +EE   KS D  A  HH N +F+D+  R + GS + PN
Sbjct: 235  TVEKPFRTNFNGFADLNLPFKFEEETCVKSDDFGASIHHRNYTFHDMPRRMRPGSHSFPN 294

Query: 1069 DAI---NKRQDLEGCSDNPLPDHGKKHEWXXXXXXXXXLDS------FAKDQINRGP--W 923
            D I    ++QDL+ CSD PL + G+KH W           S         D  N+     
Sbjct: 295  DVIQNLKRKQDLQACSDPPLQNQGEKHGWLPLGISAGKNGSDLGSVAIFNDTENQSVSIE 354

Query: 922  RTEKKFS------------------SSESSAWTQGPTNNGLLGPNSASRTCALQQPVSGA 797
               KK                     ++S A T  PT+   LGP+ AS  CA  Q VS  
Sbjct: 355  SLSKKLKQVNNCSCFHSTHQIVPGLETDSLAGTHNPTSGVWLGPSYASCPCASHQLVSET 414

Query: 796  YMISYGISPAGLWKTPVSDFGQSQPEVQASSASLGQNSKSMMGISGFTRDELYQGTGVKS 617
             M S  ISP+ LWK+                                          + S
Sbjct: 415  DMKSSRISPSVLWKS------------------------------------------IAS 432

Query: 616  GPNLDGQNFLL-SSFCSRSKLLDLPSCT-NDPNNSDNHG-SLVSHEFRKYVEGSKDVGTP 446
            G NLD QN+LL S FC RS LLDLPS + +DPN  DN G S   HE R YVE      T 
Sbjct: 433  GSNLDCQNYLLHSKFCRRSNLLDLPSVSADDPNCCDNCGPSSAGHELRNYVE------TR 486

Query: 445  KNLNLNTMPGGYSD--PTEFQSIQITGEENKFEDSTMGLPWLKEKPVGKGKPNEESKIST 272
            KN+NLNTMP G+S+    EFQSI                 WLKEKPV KGKP++E + ST
Sbjct: 487  KNINLNTMPVGFSETKAVEFQSI-----------------WLKEKPVPKGKPSDECEAST 529

Query: 271  QIEPGLLNPYNTGFIH-DLKLRKIEESNMGTEKRLAFHSNGKPHMSSDLHSFHDFPSELF 95
             I+  +LNP  +G IH DL+L K+++S++  ++ LAF  NGKP             S++ 
Sbjct: 530  PIDSSILNPLKSGCIHSDLELNKVQKSDLCRDQTLAFDLNGKPR-----------TSKVV 578

Query: 94   QNQSKNQRVEEIEKGCISDVKSSCIHVPDLG 2
            Q+ S N   EEIEK  +S V S     PD+G
Sbjct: 579  QSLSANHWFEEIEK--MSIVNSPSDDYPDMG 607


>ref|XP_014522229.1| PREDICTED: uncharacterized protein LOC106778758 [Vigna radiata var.
            radiata] gi|951058622|ref|XP_014522230.1| PREDICTED:
            uncharacterized protein LOC106778758 [Vigna radiata var.
            radiata] gi|951058627|ref|XP_014522231.1| PREDICTED:
            uncharacterized protein LOC106778758 [Vigna radiata var.
            radiata] gi|951058634|ref|XP_014522232.1| PREDICTED:
            uncharacterized protein LOC106778758 [Vigna radiata var.
            radiata]
          Length = 941

 Score =  293 bits (750), Expect = 3e-76
 Identities = 215/571 (37%), Positives = 284/571 (49%), Gaps = 40/571 (7%)
 Frame = -3

Query: 1594 SSKNAQKIYCSPNLPWSTAQSSVLIADSIQLPMAFAQEEGRHI----CPAHASTVTEESL 1427
            +S ++  +Y S NL W T+QSS+L A+ IQLP+A  QE  R +     PA A     ES 
Sbjct: 115  TSSSSSSLYYSQNLHWFTSQSSILKAEGIQLPLASMQEMSRQLHPTPVPAPAPAAVIESS 174

Query: 1426 KDYKLPEPKCRKIGKKILDLQLPADEYIDSXXXXXXXXXXEVP-QVSAYSLNEISQVVHN 1250
            KD  L     RK+GKKILDLQLPADEYIDS               +S Y+LN IS+ V+N
Sbjct: 175  KDTTLSVSAYRKVGKKILDLQLPADEYIDSEGESCENERAIKELALSTYTLNGISKAVYN 234

Query: 1249 SHDKPYGNNSRDFTDLNVPFKLDEEAAAKSYDSEAPTHHMNNSFYDLSTRTKFGSQNLPN 1070
            + +KP+  N   F+DLN+PFKL+EE   KS D  A  HH N +F+D+  R + GS +  N
Sbjct: 235  TVEKPFRTNFNGFSDLNLPFKLEEETGVKSDDFGASIHHKNYTFHDMPRRIRPGSHSFTN 294

Query: 1069 DAI---NKRQDLEGCSDNPLPDHGKKHE--------------------WXXXXXXXXXLD 959
            D I    ++QDL+ C D P+ + G KH                     +         ++
Sbjct: 295  DVIQNLERKQDLQACLDPPIQNKGTKHGRLPLGTGSGQNGSDLGSLAIFNDTESQSVSIE 354

Query: 958  SFAK--DQINRGPWRTEKKF-----SSSESSAWTQGPTNNGLLGPNSASRTCALQQPVSG 800
            S +K   Q+N   + +  +      +  +S A    PTN   LGP+ AS  C+  Q VS 
Sbjct: 355  SISKKLKQVNSSRFHSTNQIVPGLRTDMDSFAGRHNPTNGVWLGPSYASCPCSSHQLVSE 414

Query: 799  AYMISYGISPAGLWKTPVSDFGQSQPEVQASSASLGQNSKSMMGISGFTRDELYQGTGVK 620
            + + S  ISP  LWK+                                          + 
Sbjct: 415  SDLKSSRISPPVLWKS------------------------------------------IA 432

Query: 619  SGPNLDGQNFLLSSFCSRSKLLDLPS-CTNDPNNSDNHGSLVSHEFRKYVEGSKDVGTPK 443
            SG NLD QN L S FC+RS LL LPS  T+DPN  D   S   HE  KYV+ S+ V T  
Sbjct: 433  SGCNLDCQNCLHSKFCNRSNLLGLPSISTDDPNCCDRGPSSAGHELWKYVKDSEYVETNN 492

Query: 442  NLNLNTMPGGYSD--PTEFQSIQITGEENKFEDSTMGLPWLKEKP-VGKGKPNEESKIST 272
            N+NLN MP   S+    EFQSI+IT E +KF+DS   LPWLKEKP V KG P  E + ST
Sbjct: 493  NINLNVMPVSSSETKAAEFQSIRITVEYDKFQDSR--LPWLKEKPAVPKGNPTGEREAST 550

Query: 271  QIEPGLLNPYNTGFIH-DLKLRKIEESNMGTEKRLAFHSNGKPHMSSDLHSFHDFPSELF 95
             I+   LNP   G +H DL+L K+++SN+      AF  NGKP              ++ 
Sbjct: 551  PIDCSFLNPSKFGCVHSDLELNKVQKSNL-----CAFDLNGKPQ-----------TPKVV 594

Query: 94   QNQSKNQRVEEIEKGCISDVKSSCIHVPDLG 2
            Q+ S + R EEI K  IS+VK      PD+G
Sbjct: 595  QSLSTDHRTEEINK--ISNVKLPSDGYPDMG 623


>ref|XP_013457948.1| DUF863 family protein [Medicago truncatula]
            gi|657390458|gb|KEH31979.1| DUF863 family protein
            [Medicago truncatula]
          Length = 832

 Score =  293 bits (750), Expect = 3e-76
 Identities = 169/292 (57%), Positives = 191/292 (65%), Gaps = 38/292 (13%)
 Frame = -3

Query: 1609 SSSALSSKNAQKIYCSPNLPWSTAQSSVLIADSIQLPMAFAQEEGRHICPAHASTVTEES 1430
            SSSAL SKNA+K + SPN PWST+QSSVL A+SIQLP+AFAQE+ + I PAHASTVTEE 
Sbjct: 117  SSSALLSKNAEKTFYSPNRPWSTSQSSVLFAESIQLPLAFAQEKSKQIFPAHASTVTEEP 176

Query: 1429 LKDYKLPEPKCRKIGKKILDLQLPADEYIDSXXXXXXXXXXEVPQVSAYSLNEISQVVHN 1250
            LKDYKL E  CRK+GKK+LDL+LPADEYIDS          EV Q SAYSLN +SQV+ +
Sbjct: 177  LKDYKLLESMCRKVGKKVLDLELPADEYIDSDEGEENVRVTEVLQDSAYSLNGVSQVLCD 236

Query: 1249 SHDKPYGNNSRDF--------TDLNVPFKLDEEAAAKSYDSEAPTHHMNNSFYDLSTRTK 1094
            +HDKP GN+SR           DLNVPF+L+ EAA KS D E P+ HMNN  YDLS +T 
Sbjct: 237  NHDKPRGNSSRGSDNLNVSFKLDLNVPFRLEVEAATKSSDKEVPSLHMNNCLYDLSMKTI 296

Query: 1093 FGSQNLPNDAINKRQDLEGCSDNPLPDHGKKHEWXXXXXXXXXLDSFAKD---------- 944
            FGSQNL NDAINKRQDLEG S N  PD+ KK EW         LDSFAK           
Sbjct: 297  FGSQNLHNDAINKRQDLEGGSHNQRPDNEKKCEWKFSGHNGGLLDSFAKSIHTEKQYFSV 356

Query: 943  --------------------QINRGPWRTEKKFSSSESSAWTQGPTNNGLLG 848
                                QINRGPW TE+KFSSS SS  TQ PT+ GLLG
Sbjct: 357  DSLSKNMEQFVDLSCFHSSHQINRGPW-TERKFSSSASSTQTQCPTSKGLLG 407



 Score =  146 bits (369), Expect = 5e-32
 Identities = 73/114 (64%), Positives = 83/114 (72%)
 Frame = -3

Query: 343 MGLPWLKEKPVGKGKPNEESKISTQIEPGLLNPYNTGFIHDLKLRKIEESNMGTEKRLAF 164
           MGLP LKE           SK STQIE  ++NPY TG  H L+L+KIEESN+G EK LAF
Sbjct: 409 MGLPCLKE-----------SKFSTQIESAVVNPYETGVTHGLELKKIEESNLGAEKTLAF 457

Query: 163 HSNGKPHMSSDLHSFHDFPSELFQNQSKNQRVEEIEKGCISDVKSSCIHVPDLG 2
           HSNG P MSSDLH FHDF ++LFQN  KNQR+E+IEK CI+DVKS C  VPDLG
Sbjct: 458 HSNGNPRMSSDLHYFHDFATKLFQNHPKNQRIEDIEKDCIADVKSPCADVPDLG 511


>gb|KOM40273.1| hypothetical protein LR48_Vigan04g047100 [Vigna angularis]
          Length = 943

 Score =  288 bits (737), Expect = 1e-74
 Identities = 215/578 (37%), Positives = 289/578 (50%), Gaps = 47/578 (8%)
 Frame = -3

Query: 1594 SSKNAQKIYCSPNLPWSTAQSSVLIADSIQLPMAFAQEEGRHICP----AHASTVTEESL 1427
            +S ++  +Y S NL W T+QSS+L A+ IQLP+A  QE  + + P    A A     ES 
Sbjct: 115  TSSSSSSLYYSQNLHWFTSQSSILNAEGIQLPLASMQEMSKQLHPTLVAAPAPAAIIESS 174

Query: 1426 KDYKLPEPKCRKIGKKILDLQLPADEYIDSXXXXXXXXXXEVP-QVSAYSLNEISQVVHN 1250
            KD  L     RK+GKKILDLQLPADEYIDS               +S Y+LN IS+ V+N
Sbjct: 175  KDTTLSVSAYRKVGKKILDLQLPADEYIDSEGESCENERAIKELALSTYTLNGISKAVYN 234

Query: 1249 SH-DKPYGNNSRDFTDLNVPFKLDEEAAAKSYDSEAPTHHMNNSFYDLSTRTKFGSQNLP 1073
            +  +KP+  N   F+DLN+PFKL+EE   K  D  A  HH N +F+D+  R K GS +  
Sbjct: 235  NTVEKPFRTNFNGFSDLNLPFKLEEETGVKYADFGASIHHKNYTFHDMPRRIKPGSHSFT 294

Query: 1072 NDAI---NKRQDLEGCSDNPLPDHGKKHE--------------------WXXXXXXXXXL 962
            ND I    ++QDL+ C D PLP+ G KH                     +         +
Sbjct: 295  NDVIQNLERKQDLQSCLDPPLPNKGTKHGRLPLGTGSGQNGSDLGSLAIFNDTENQSVSI 354

Query: 961  DSFAK--DQINRGPW-----------RTEKKFSSSESSAWTQGPTNNGLLGPNSASRTCA 821
            +S +K   Q+N   +           RT++     +S A    PT+   +GP+ AS  C+
Sbjct: 355  ESISKKLKQVNNSSYFHSTNQTVPGLRTDR-----DSFAGRHNPTSGVWIGPSYASCPCS 409

Query: 820  LQQPVSGAYMISYGISPAGLWKTPVSDFGQSQPEVQASSASLGQNSKSMMGISGFTRDEL 641
              Q +S + + S  ISP+ LWK+                                     
Sbjct: 410  SHQLLSESDLKSSRISPSVLWKS------------------------------------- 432

Query: 640  YQGTGVKSGPNLDGQNFLLSSFCSRSKLLDLPSCT-NDPNNSDNHGSLVSHEFRKYVEGS 464
                 + SG NLD QN L S FC+RS LL LPS + +DPN  D   S   HE  KYV+ S
Sbjct: 433  -----IGSGSNLDCQNCLHSKFCNRSNLLGLPSISADDPNCCDRGPSSAGHELWKYVKDS 487

Query: 463  KDVGTPKNLNLNTMPGGYSD--PTEFQSIQITGEENKFEDSTMGLPWLKEKP-VGKGKPN 293
            + V T KN+NLN MP G S+    EFQSI+IT E +KF+DS   LPWLKEKP V KG P 
Sbjct: 488  EYVETNKNINLNVMPVGSSETKAAEFQSIRITFEYDKFQDSR--LPWLKEKPAVPKGNPT 545

Query: 292  EESKISTQIEPGLLNPYNTGFIH-DLKLRKIEESNMGTEKRLAFHSNGKPHMSSDLHSFH 116
            +E + ST I+   LNP  +G +H DL+L K+++S+M      AF  NGKP          
Sbjct: 546  DECEASTPIDSSFLNPSKSGCVHSDLELNKVQKSHM-----CAFDLNGKPQ--------- 591

Query: 115  DFPSELFQNQSKNQRVEEIEKGCISDVKSSCIHVPDLG 2
                ++ Q+ S + R EEI K  IS+V       PD+G
Sbjct: 592  --TPKVVQSLSTDHRTEEINK--ISNVNLHSDGYPDMG 625


>ref|XP_006581984.1| PREDICTED: uncharacterized protein LOC102666418 isoform X2 [Glycine
            max]
          Length = 941

 Score =  216 bits (551), Expect = 4e-53
 Identities = 144/359 (40%), Positives = 191/359 (53%), Gaps = 37/359 (10%)
 Frame = -3

Query: 1594 SSKNAQKIYCSPNLPWSTAQSSVLIADSIQLPMAFAQEEGRHICPAH----ASTVTEESL 1427
            +S ++  +Y S N+PW T+QSSVL A+ IQLP+A  QE+ R +CP      A T  +ESL
Sbjct: 115  TSLSSSSLYYSQNMPWLTSQSSVLNAELIQLPLASMQEKSRELCPTPLAVPAPTAIKESL 174

Query: 1426 KDYKLPEPKCRKIGKKILDLQLPADEYIDSXXXXXXXXXXEV-PQVSAYSLNEISQVVHN 1250
            +D KL    CRK+GKKILDLQLPADEYIDS             P +S Y+ N IS+VV+N
Sbjct: 175  EDTKLSGLTCRKVGKKILDLQLPADEYIDSEGESCENERVIKQPPLSTYTSNGISKVVYN 234

Query: 1249 SHDKPYGNNSRDFTDLNVPFKLDEEAAAKSYDSEAPTHHMNNSFYDLSTRTKFGSQNLPN 1070
            + +KPY  NS  F DLN+PFKL +E   +S D  A   H N++F+ +  R   GS N PN
Sbjct: 235  TVEKPYRINSNGFADLNLPFKLQKETGVESDDFGASIPHRNHTFHGMLGRMTSGSHNFPN 294

Query: 1069 DAI---NKRQDLEGCSDNPLPDHGKKHEWXXXXXXXXXLDSFAK---------------- 947
            D I    +RQD E   D PLP+ G+KH W         L   AK                
Sbjct: 295  DVIPNLKRRQDHEAYPDLPLPNQGQKHGWLPSGQNGSNLGFLAKFNDMESQSVSIDFISK 354

Query: 946  --DQINRGP-WRTEKKF---SSSESSAWTQGPTNNGLLGPNSASRTCALQQPVSGAYMIS 785
               Q+N  P + +  +    S ++S A T  PT+ GLLGP+ AS TCA  + VS + M S
Sbjct: 355  KLKQVNNCPCFHSTSQIVPGSRTKSPAGTHDPTSGGLLGPSYASCTCAPHKLVSDSDMKS 414

Query: 784  YGISPAGLWKTPVS----DFGQSQPEVQASSASLGQN---SKSMMGISGFTRDELYQGT 629
             GISP+ LWK+  S    D     P + A   + G N   S +   +  + +D  Y GT
Sbjct: 415  SGISPSVLWKSTTSGPNLDRRNYLPPISAGDLNSGDNFGSSSAGHELRKYVKDSEYVGT 473


>gb|KRH54614.1| hypothetical protein GLYMA_06G198000 [Glycine max]
          Length = 918

 Score =  213 bits (542), Expect = 4e-52
 Identities = 144/363 (39%), Positives = 191/363 (52%), Gaps = 41/363 (11%)
 Frame = -3

Query: 1594 SSKNAQKIYCSPNLPWSTAQSSVLIADSIQLPMAFAQEEGRHICPAH----ASTVTEESL 1427
            +S ++  +Y S N+PW T+QSSVL A+ IQLP+A  QE+ R +CP      A T  +ESL
Sbjct: 115  TSLSSSSLYYSQNMPWLTSQSSVLNAELIQLPLASMQEKSRELCPTPLAVPAPTAIKESL 174

Query: 1426 KDYKLPEPKCRKIGKKILDLQLPADEYIDSXXXXXXXXXXEV-PQVSAYSLNEISQVVHN 1250
            +D KL    CRK+GKKILDLQLPADEYIDS             P +S Y+ N IS+VV+N
Sbjct: 175  EDTKLSGLTCRKVGKKILDLQLPADEYIDSEGESCENERVIKQPPLSTYTSNGISKVVYN 234

Query: 1249 SHDKPYGNNSRDFTDLNVPFKLDEEAAAKSYDSEAPTHHMNNSFYDLSTRTKFGSQNLPN 1070
            + +KPY  NS  F DLN+PFKL +E   +S D  A   H N++F+ +  R   GS N PN
Sbjct: 235  TVEKPYRINSNGFADLNLPFKLQKETGVESDDFGASIPHRNHTFHGMLGRMTSGSHNFPN 294

Query: 1069 DAI---NKRQDLEGCSDNPLPDHGKKHEWXXXXXXXXXLDS----FAK------------ 947
            D I    +RQD E   D PLP+ G+KH W           S     AK            
Sbjct: 295  DVIPNLKRRQDHEAYPDLPLPNQGQKHGWLPSGTHAGQNGSNLGFLAKFNDMESQSVSID 354

Query: 946  ------DQINRGP-WRTEKKF---SSSESSAWTQGPTNNGLLGPNSASRTCALQQPVSGA 797
                   Q+N  P + +  +    S ++S A T  PT+ GLLGP+ AS TCA  + VS +
Sbjct: 355  FISKKLKQVNNCPCFHSTSQIVPGSRTKSPAGTHDPTSGGLLGPSYASCTCAPHKLVSDS 414

Query: 796  YMISYGISPAGLWKTPVS----DFGQSQPEVQASSASLGQN---SKSMMGISGFTRDELY 638
             M S GISP+ LWK+  S    D     P + A   + G N   S +   +  + +D  Y
Sbjct: 415  DMKSSGISPSVLWKSTTSGPNLDRRNYLPPISAGDLNSGDNFGSSSAGHELRKYVKDSEY 474

Query: 637  QGT 629
             GT
Sbjct: 475  VGT 477


>ref|XP_006581983.1| PREDICTED: uncharacterized protein LOC102666418 isoform X1 [Glycine
            max] gi|947106230|gb|KRH54613.1| hypothetical protein
            GLYMA_06G198000 [Glycine max]
          Length = 945

 Score =  213 bits (542), Expect = 4e-52
 Identities = 144/363 (39%), Positives = 191/363 (52%), Gaps = 41/363 (11%)
 Frame = -3

Query: 1594 SSKNAQKIYCSPNLPWSTAQSSVLIADSIQLPMAFAQEEGRHICPAH----ASTVTEESL 1427
            +S ++  +Y S N+PW T+QSSVL A+ IQLP+A  QE+ R +CP      A T  +ESL
Sbjct: 115  TSLSSSSLYYSQNMPWLTSQSSVLNAELIQLPLASMQEKSRELCPTPLAVPAPTAIKESL 174

Query: 1426 KDYKLPEPKCRKIGKKILDLQLPADEYIDSXXXXXXXXXXEV-PQVSAYSLNEISQVVHN 1250
            +D KL    CRK+GKKILDLQLPADEYIDS             P +S Y+ N IS+VV+N
Sbjct: 175  EDTKLSGLTCRKVGKKILDLQLPADEYIDSEGESCENERVIKQPPLSTYTSNGISKVVYN 234

Query: 1249 SHDKPYGNNSRDFTDLNVPFKLDEEAAAKSYDSEAPTHHMNNSFYDLSTRTKFGSQNLPN 1070
            + +KPY  NS  F DLN+PFKL +E   +S D  A   H N++F+ +  R   GS N PN
Sbjct: 235  TVEKPYRINSNGFADLNLPFKLQKETGVESDDFGASIPHRNHTFHGMLGRMTSGSHNFPN 294

Query: 1069 DAI---NKRQDLEGCSDNPLPDHGKKHEWXXXXXXXXXLDS----FAK------------ 947
            D I    +RQD E   D PLP+ G+KH W           S     AK            
Sbjct: 295  DVIPNLKRRQDHEAYPDLPLPNQGQKHGWLPSGTHAGQNGSNLGFLAKFNDMESQSVSID 354

Query: 946  ------DQINRGP-WRTEKKF---SSSESSAWTQGPTNNGLLGPNSASRTCALQQPVSGA 797
                   Q+N  P + +  +    S ++S A T  PT+ GLLGP+ AS TCA  + VS +
Sbjct: 355  FISKKLKQVNNCPCFHSTSQIVPGSRTKSPAGTHDPTSGGLLGPSYASCTCAPHKLVSDS 414

Query: 796  YMISYGISPAGLWKTPVS----DFGQSQPEVQASSASLGQN---SKSMMGISGFTRDELY 638
             M S GISP+ LWK+  S    D     P + A   + G N   S +   +  + +D  Y
Sbjct: 415  DMKSSGISPSVLWKSTTSGPNLDRRNYLPPISAGDLNSGDNFGSSSAGHELRKYVKDSEY 474

Query: 637  QGT 629
             GT
Sbjct: 475  VGT 477


>gb|KHN06896.1| hypothetical protein glysoja_041276 [Glycine soja]
          Length = 945

 Score =  211 bits (538), Expect = 1e-51
 Identities = 144/363 (39%), Positives = 191/363 (52%), Gaps = 41/363 (11%)
 Frame = -3

Query: 1594 SSKNAQKIYCSPNLPWSTAQSSVLIADSIQLPMAFAQEEGRHICPAH----ASTVTEESL 1427
            +S ++  +Y S N+PW T+QSSVL A+ IQLP+A  QE+ R +CP      A T  +ESL
Sbjct: 115  TSLSSSFLYYSQNMPWLTSQSSVLNAELIQLPLASMQEKSRELCPTPLAVPAPTAIKESL 174

Query: 1426 KDYKLPEPKCRKIGKKILDLQLPADEYIDSXXXXXXXXXXEV-PQVSAYSLNEISQVVHN 1250
            +D KL    CRK+GKKILDLQLPADEYIDS             P +S Y+ N IS+VV+N
Sbjct: 175  EDTKLSGLTCRKVGKKILDLQLPADEYIDSEGESCENERVIKQPPLSTYTSNGISKVVYN 234

Query: 1249 SHDKPYGNNSRDFTDLNVPFKLDEEAAAKSYDSEAPTHHMNNSFYDLSTRTKFGSQNLPN 1070
            + +KPY  NS  F DLN+PFKL +E   +S D  A   H N++F+ +  R   GS N PN
Sbjct: 235  TVEKPYRINSNGFADLNLPFKLQKETGVESDDFGASIPHRNHTFHGMLGRMTSGSHNFPN 294

Query: 1069 DAI---NKRQDLEGCSDNPLPDHGKKHEWXXXXXXXXXLDS----FAK------------ 947
            D I    +RQD E   D PLP+ G+KH W           S     AK            
Sbjct: 295  DVIPNLKRRQDHEAYPDLPLPNQGQKHGWLPSGTHAGQNGSNLGFLAKFNDMESQSVSID 354

Query: 946  ------DQINRGP-WRTEKKF---SSSESSAWTQGPTNNGLLGPNSASRTCALQQPVSGA 797
                   Q+N  P + +  +    S ++S A T  PT+ GLLGP+ AS TCA  + VS +
Sbjct: 355  FISKKLKQVNNCPCFHSTSQIVPGSRTKSPAGTHDPTSGGLLGPSYASCTCAPHKLVSDS 414

Query: 796  YMISYGISPAGLWKTPVS----DFGQSQPEVQASSASLGQN---SKSMMGISGFTRDELY 638
             M S GISP+ LWK+  S    D     P + A   + G N   S +   +  + +D  Y
Sbjct: 415  DMKSSGISPSLLWKSTTSGPNLDRRNYLPPISAGDLNSGDNFGSSSAGHELRKYVKDSEY 474

Query: 637  QGT 629
             GT
Sbjct: 475  VGT 477


>ref|XP_004508706.1| PREDICTED: uncharacterized protein LOC101495925 [Cicer arietinum]
          Length = 536

 Score =  148 bits (374), Expect = 1e-32
 Identities = 104/215 (48%), Positives = 120/215 (55%), Gaps = 28/215 (13%)
 Frame = -3

Query: 565 SKLLDLPSCTND--PNN--------SDNHGSLVSHEFRKYVEGSKDVGTPKNL--NLNTM 422
           +K LDL  C+++  PNN        S+  G L+   F K +   K   T  +L  N+N  
Sbjct: 25  NKRLDLEGCSHNKLPNNGKKCEWKPSELCGGLLD-SFAKSIHTEKQYVTVDSLRKNMNQF 83

Query: 421 PGGYSDPTEFQSIQITGEENKFEDS----------------TMGLPWLKEKPVGKGKPNE 290
                  +  Q  Q    E KF  S                 M LP LKE P  KGK +E
Sbjct: 84  DDLPFFHSSHQIDQRLCTERKFFGSESFARTQSLTSNGLVGAMELPCLKELPAVKGKLSE 143

Query: 289 ESKISTQIEPGLLNPYNTGFIHDLKLRKIEESNMGTEKRLAFHSNGKPHMSSDLHSFHDF 110
           ESKISTQIE  +LNP N G IH LKLRKIEESN+GTEK LA  SNGKPHMSS+LHSF   
Sbjct: 144 ESKISTQIESVVLNPNNKGVIHGLKLRKIEESNLGTEKTLALQSNGKPHMSSNLHSF--- 200

Query: 109 PSELFQNQSKNQRVEEIEKGCISDVKSSCIHVPDL 5
               FQNQ +NQR+EEIEKG ISDVKS CI V DL
Sbjct: 201 ----FQNQPENQRIEEIEKGFISDVKSPCIDVSDL 231



 Score = 90.1 bits (222), Expect = 5e-15
 Identities = 59/124 (47%), Positives = 67/124 (54%), Gaps = 29/124 (23%)
 Frame = -3

Query: 1132 MNNSFYDLSTRTKFGSQNLPNDAINKRQDLEGCSDNPLPDHGKKHEWXXXXXXXXXLDSF 953
            MNN   DLS RTKFGSQNL NDAINKR DLEGCS N LP++GKK EW         LDSF
Sbjct: 1    MNNCCNDLSMRTKFGSQNLHNDAINKRLDLEGCSHNKLPNNGKKCEWKPSELCGGLLDSF 60

Query: 952  AK------------------DQINRGPW-----------RTEKKFSSSESSAWTQGPTNN 860
            AK                  +Q +  P+            TE+KF  SES A TQ  T+N
Sbjct: 61   AKSIHTEKQYVTVDSLRKNMNQFDDLPFFHSSHQIDQRLCTERKFFGSESFARTQSLTSN 120

Query: 859  GLLG 848
            GL+G
Sbjct: 121  GLVG 124


>ref|XP_010258910.1| PREDICTED: uncharacterized protein LOC104598504 [Nelumbo nucifera]
          Length = 1093

 Score =  121 bits (304), Expect = 2e-24
 Identities = 172/644 (26%), Positives = 245/644 (38%), Gaps = 110/644 (17%)
 Frame = -3

Query: 1606 SSALSSKNAQKIYCSPNLPW---STAQSSVLIADSIQLPMAFAQEEGRHICPAHASTVTE 1436
            SS LSS++AQK    P+LP    + ++ SV  ++  Q P +F +E G H CPA   T   
Sbjct: 117  SSQLSSEDAQKTRHIPSLPLVNSACSKPSVCDSEKTQPPFSFRKENGIHTCPA--PTQKG 174

Query: 1435 ESLKDYKLPEPKCRKIGKKILDLQLPADEYIDSXXXXXXXXXXE--VPQVSAYSL----- 1277
             S KD KL E   +K  +K+ DLQLPADEYIDS             +  V+ Y L     
Sbjct: 175  SSSKD-KLSESSSKKFPRKMFDLQLPADEYIDSEEGEPLEEEKASEISVVTNYPLRNSGA 233

Query: 1276 --------------NEISQVVHNSHDKPYGNNSRDFTDLNVPFKLDEEAAAKSYDSEAPT 1139
                          N  SQ      D    +      DLN P  ++EE  +   D   P 
Sbjct: 234  AHDRDVKLSLGSGGNPGSQADSLRSDSCLQSTHHGLADLNEPIPVEEEIVSAPVDFLHPV 293

Query: 1138 -------------------HHMNNSFYD-----LSTRTKFGSQNLPNDAI------NKRQ 1049
                               H  +  F+          T   SQ   N+        N   
Sbjct: 294  TCHGEIKGQSLPITPNSGFHGSSRDFFQDKQKGRDNETSSNSQRSENERSRQEWPPNLEA 353

Query: 1048 DLEGCSDNPLPDH-------GKKHEWXXXXXXXXXLDSFA-KDQINRGPWRTEKKFSSSE 893
                CS   LP              +         L SF   DQ  R PWR EK   S E
Sbjct: 354  GQSNCSLKSLPQGFYPEKLPAPSAPFQFEHKKALELPSFVLSDQSKREPWR-EKTSYSLE 412

Query: 892  SSAWTQGPTNNGLLGPNSASRTCALQQPVSGAYMISYGISPAGLWKTPVSDFGQSQP-EV 716
            SS   Q   +   LG  + +    L   +  + + + G S A  W+ P S   Q +P  V
Sbjct: 413  SSQREQNLQSFNFLGSVADAHVPGLHPSIPQSDVANSGSSLASSWRKPTSSLIQKKPIAV 472

Query: 715  QASSA-----SLGQNSKSMMGISGFTRDELYQGTGVKSGPNL-----DGQNFLLSSFCSR 566
            Q  S+      + +NSK+    SG   D+ +  +   S P+      +G+N       S 
Sbjct: 473  QELSSVNPFSPMSKNSKTSYQGSGVIEDKWHLNSNFGSNPSFGSEISNGKNGFCHGSQSE 532

Query: 565  SKLLDLPS---------CTNDPNNSDNHGSLVSHEFRKYVEGSK--DVGTPKNLNLN-TM 422
            SKLL + S         C+ D  ++  H    +H   K+ +GS   DV   K++NLN  +
Sbjct: 533  SKLLQVCSPSVGFGYLNCSIDNTSAYEHFG--NHGLAKHYKGSDSVDVKIVKDINLNMVL 590

Query: 421  PGGYSDPTEFQSIQITGEENKFEDSTMGLPWLKEKP-----VGKGKPNEESKISTQI--- 266
            P G+ D    + + I   E K ED   GLPWL  KP       KG  N +   S  +   
Sbjct: 591  PNGFQDMVLQRDLVIIDGEGKHEDPPGGLPWLGAKPACNDTTTKGSRNLDKTGSDSLQVC 650

Query: 265  ------EPGLLNPYNTGFIHDL-----------KLRKIEESNMGTEKRLAFHSNGKPHMS 137
                  E    N  N  FI D            K+ K+ +S    +K L F    KPH+S
Sbjct: 651  PQHFADEVEARNGRNPSFIQDFTLASCTRYTEAKIVKMADS-PSDKKILGFPIFDKPHVS 709

Query: 136  SDLHSFHDFPSELFQNQSKNQRVEEIEKGCISDVKSSCIHVPDL 5
             +  S     ++L  ++S+ + +E   K  + ++  S  H P L
Sbjct: 710  HNHSSSQCSSAKLCHHRSEIEDIENNVKVKVLNIDLS--HNPSL 751


>ref|XP_010258199.1| PREDICTED: uncharacterized protein LOC104598027 [Nelumbo nucifera]
          Length = 1077

 Score =  120 bits (300), Expect = 5e-24
 Identities = 159/643 (24%), Positives = 251/643 (39%), Gaps = 108/643 (16%)
 Frame = -3

Query: 1606 SSALSSKNAQKIYCSPNLPW---STAQSSVLIADSIQLPMAFAQEEGRHICPAHASTVTE 1436
            SS +SS+++QK++   +LP    + +++SV   D +Q P +F ++    +    A T   
Sbjct: 117  SSQMSSEDSQKMWHISSLPLVNSACSRASVSGTDKMQPPFSFCKDNNMQV--DLAPTQNG 174

Query: 1435 ESLKDYKLPEPKCRKIGKKILDLQLPADEYIDSXXXXXXXXXXEVPQVSAY--------- 1283
            +S KD KL E K +K  +K+ DLQLPADEYIDS              V A          
Sbjct: 175  DSSKDCKLLESKSKKFPRKVFDLQLPADEYIDSEGETLEEEKVSEISVVANYTQRNSGIA 234

Query: 1282 -----------SLNEISQVVHNSHDKPYGNNSRDFTDLNVPFKLDEEAAAKSYDSEAPTH 1136
                       S N  SQ   +  +    +      DLN P +++E   +   D   P +
Sbjct: 235  PDRDVNLSLGSSRNHSSQGESSRSESSLRSKHHGLADLNEPIQVEEVTDSAPVDFLHPVN 294

Query: 1135 -HMNNSFYDLSTRTKFGSQNLPNDAINKRQDLEGCSDNPLPD------HGKKHEWXXXXX 977
             H      +L T    G +  P D     Q  +G S+    +       G+K EW     
Sbjct: 295  CHKEIKGQNLPTVPNSGLKGSPRDFFKDTQ--KGRSNETSSNIQNQENEGRKREWLSYNL 352

Query: 976  XXXXLDSFAK----------------------------------DQINRGPWRTEKKFSS 899
                 +S  K                                  D   + PWR EK   S
Sbjct: 353  EAGQSNSNLKPLPQGVHSENLLASSAPIQVELKKAHEFPRFLISDHDKKEPWR-EKAICS 411

Query: 898  SESSAWTQGPTNNGLLGPNSASRTCALQQPVSGAYMISYGISPAGLWKTPVSDFGQSQP- 722
               S   Q   N     P            VS + M ++G+  A  W+ P+    Q  P 
Sbjct: 412  LGISDKDQNLPNFNNPYP-----------VVSQSDMANFGVPSASSWRRPMCSLSQKNPI 460

Query: 721  EVQASS-----ASLGQNSKSMMGISGFTRDELYQGTGVKSGPNLDGQ-NFLLSSFCSRSK 560
             V+A       A L  NSKS +  SG   D+ +    ++  PN   + +   + FC  S+
Sbjct: 461  AVEALPCVNPFAPLSNNSKSSLEGSGVIEDKWHLNGNLRLNPNFGSEISHKRNGFCHGSQ 520

Query: 559  LLDLP-------------SCTNDP----NNSDNHGSLVSHEFRKYVEGSKDVGTPKNLNL 431
            L   P             +C+N+      N ++HGS+  ++   +V    DV T K+ NL
Sbjct: 521  LESKPLQVCSPSVGFDYLNCSNEKALTSENFEDHGSVKRYKGSDFV----DVKTAKDRNL 576

Query: 430  N-TMPGGYSDPTEFQ-SIQITGEENKFEDSTMGLPWLKEKP-----VGKGKPNEE----- 287
            N  +P G++D    Q  + I   E K ED +  LPWL+ KP       K + N E     
Sbjct: 577  NMVLPSGFNDTVVPQRDLVIIDGERKHEDPSAVLPWLRGKPACNDMTPKARGNSERMGLD 636

Query: 286  ------SKISTQIEPGLLNPYNTGFIHDLKLRKIEESN-MGTEKRLAFHSNGKPHMSSDL 128
                     S ++E G  N  +  ++HD + + ++ ++ +G +K L      KP  S++ 
Sbjct: 637  FLQVNHQHFSDKVEAG--NGPSLHYVHDTEPKSVKVADRLGDKKILGVPIFEKPCASNNH 694

Query: 127  HSFHDFPSELFQNQSKNQRVEEIEKGCISDVKSSCIHV-PDLG 2
             SF   P+ +    S+ + VE   K  +  +  SC  + P+LG
Sbjct: 695  SSFQLSPARINHYPSRVEDVENNGKATVLHIDLSCDPILPNLG 737


>ref|XP_008374587.1| PREDICTED: uncharacterized protein LOC103437859 [Malus domestica]
            gi|657965836|ref|XP_008374588.1| PREDICTED:
            uncharacterized protein LOC103437859 [Malus domestica]
            gi|657965838|ref|XP_008374589.1| PREDICTED:
            uncharacterized protein LOC103437859 [Malus domestica]
          Length = 985

 Score =  110 bits (276), Expect = 3e-21
 Identities = 145/502 (28%), Positives = 208/502 (41%), Gaps = 54/502 (10%)
 Frame = -3

Query: 1606 SSALSSKNA---QKIYCSPNLPW---STAQSSVLIADSIQLPMAFAQEEGRHICPAHAST 1445
            ++ALS K++   QK   +P+LP    + +Q SV  A+SI+ P  F +  GR+I      T
Sbjct: 105  TTALSQKSSVYVQKTLHAPSLPLVNPACSQISVSAAESIESPSCFVR--GRNIQTCSYPT 162

Query: 1444 VTEESLKDYKLPEPKCRKIGKKILDLQLPADEYID--SXXXXXXXXXXEVPQVSAYSLNE 1271
             TE    D +L E KC+K  K   DL+LPAD YID             E P+VS+  L  
Sbjct: 163  QTEGRSGDCELLESKCKKFQKNF-DLELPADAYIDDEGEGFLVDGKVSEAPEVSSSRLKR 221

Query: 1270 ISQVVHNSHDKPY----------GNNSRDFTDLNVPFKLDEEAAAKSYDSEAPTHHMNNS 1121
              +V+ N   K +           +  +D  DLN  +KL++          +PT      
Sbjct: 222  FPEVLCNGDAKQFLGSEDDTSTSASLEKDSFDLNYVYKLEK--------GTSPT------ 267

Query: 1120 FYDLSTRTKFGSQNLPNDAINKRQDLEGCSDNPLPDHGKKHEW---XXXXXXXXXLDSFA 950
                    +F S+ L  D   KRQD E  S+  L +  +K EW            LDSF 
Sbjct: 268  -----PGRQFLSKELIQDT-RKRQDFEVFSNVLLQERKRKQEWSSIYEAGKRKRTLDSFP 321

Query: 949  K----DQIN--RGPWRTEKK------FSSSESSAWT-------QGPTNNGLLGPNSASRT 827
            +    DQ          E K      F  S  ++W+       Q P +    GP+ AS  
Sbjct: 322  QGSHADQFYPLSSLLHEELKAAEPPPFHQSNQNSWSGRTLFGLQKPGSYSQSGPSRASSL 381

Query: 826  CALQQPVSGAYMISYGISPAGLWKTPVSDFGQSQPEVQA-----SSASLGQNSKSMMGIS 662
            C   Q +    M + G S     + P+ DF +    VQA     +   LG +SKS     
Sbjct: 382  CTPYQHIPQGEMENSGASCIAALRKPIHDFARFPIAVQALPCFNTPIQLGNSSKSSTIRP 441

Query: 661  GFTRDELYQGTGVKSGPNLDGQNFLLSSFCSRSKLLDLPSCTNDPN--------NSDNHG 506
            G   D L     ++S         L  +F + S+L    S  + P+         +DN G
Sbjct: 442  GINGDRLQLKHDLRSSTKHGSAFSLDDNFSNGSQLESKNSEVHRPHISLDNLIRTNDNIG 501

Query: 505  SLVSHEFRKYVEGSKDVGTPKNLNLNTMPGGYS-DPTEFQSIQITGEENKFEDSTMGLPW 329
             +  H   K V  S  V + K++NLN +P G S D    QS Q T E  K E S+ GLPW
Sbjct: 502  -IEHHGVTKDVLDSASVKSCKDINLNCVPTGCSLDAAVSQSFQATTESEKLEASSEGLPW 560

Query: 328  LKEKPVGKGKPNEESKISTQIE 263
             +      GK ++    STQ++
Sbjct: 561  HRWN--HNGKTDKGCDNSTQVD 580


>ref|XP_007037462.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508774707|gb|EOY21963.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 954

 Score =  110 bits (276), Expect = 3e-21
 Identities = 158/594 (26%), Positives = 236/594 (39%), Gaps = 59/594 (9%)
 Frame = -3

Query: 1606 SSALSSKNAQKIYCSPNLPWSTAQSSVLIADSIQLPMAFAQEEGRHICPAHASTVTEESL 1427
            +  LS K++  +     LP  T+ S  L+    Q P++ A++  R     H     E S 
Sbjct: 47   NQVLSPKSSNHV----QLPQHTSTSINLVHS--QSPVSEAKDGNRGQA-GHDPIHIECSS 99

Query: 1426 KDYKLPEPKCRKIGKKILDLQLPADEYIDSXXXXXXXXXXEVPQVSAYSLNEISQV--VH 1253
            K  +  E  C+  GKKILDL+LPADEYIDS            P+V+    N + ++  V 
Sbjct: 100  KAPEFMESNCKMFGKKILDLELPADEYIDSEEEGFSEVKM-APEVTDIPTNALKKIPEVK 158

Query: 1252 NSHDK-------------PYGN--------NSRDFTDLNVPFKLDEEAAAKSYDSEAPT- 1139
            +  DK             P GN         S+   DLN+P KL+E+   +  D + P  
Sbjct: 159  DRGDKELPISASGCNSVFPEGNFIPSSISLKSKVLADLNIPVKLEEDKIPELSDFQDPII 218

Query: 1138 HHMNNSFYDLSTRTKFGSQNLPNDAINKRQ---DLEGCSDNPLPDHGKKHEWXXXXXXXX 968
             H   S  DLS ++    + L  + I   Q   D E   D+   D   KH          
Sbjct: 219  GHRETSLQDLSGKSNSSFEVLSKEVIPNSQIMRDPEADLDSLFLD---KHNMQRERITCN 275

Query: 967  XLDSFAKDQINRG--PWRTEKKF------SSSESSAW------TQGPTNNGLLGPNSASR 830
                 +++ +N       TEK          +E S+        +G   N +L       
Sbjct: 276  DKAGQSRNDLNSSCQDLYTEKLSIEHIDDEQAEDSSTPHGLDEAKGKLCNEILQCVGGDI 335

Query: 829  TCALQQPVSGAYM-ISYGISP-----------AGLWKTPVSDFGQSQPEVQASSASLGQN 686
            +    +PV+   M  SY I P              W+    D  +S   VQA     G++
Sbjct: 336  SSHSYKPVATVDMRSSYQIVPLADKMNSESSSVSSWRR---DLKRSPIAVQALPCFKGKS 392

Query: 685  SKSMMGISGFTRDELYQGTGVKSGPNL-DGQNFLLSSFCSRSKLLDLPSCTNDPN---NS 518
            SKS     G   +EL   T + S P L     F   S+ +  +L   P  T+  +   N+
Sbjct: 393  SKSFTRSLGLAGNELCLSTKLLSRPKLCSAATFPQESWQNDFQLEGQPPSTSSVSLNCNN 452

Query: 517  DNHGSLVSHEFRKYVEGSKDVGTPKNLNLN-TMPGGYSDPTEFQSIQITGEENKFEDSTM 341
            DN  +   H   KY +  K V + K+L+LN  +P   +D    Q       E   E+ST 
Sbjct: 453  DNGSAFERHSPAKYTKDFKYVMSVKSLDLNFVLPSFSTDVACSQGASSILGEKTLENST- 511

Query: 340  GLPWLKEKPVGKGKPNEESKISTQIEPGLLNPYNTGFIHDLKLRKIEESNMGTEKR-LAF 164
            G   + E P+   K  E    S  +E  +L   N+  +HD +L K+E SN    KR L F
Sbjct: 512  GCSQIAETPIHDSKSGERKDQSVPLEC-VLKQANSVCVHDAELDKVEASNSLDFKRILGF 570

Query: 163  HSNGKPHMSSDLHSFHDFPSELFQNQSKNQRVEEIEKGCISDVKSSCIHVPDLG 2
            H   KP + +   S H  P+    N    + +++ EK  + D+     HVP  G
Sbjct: 571  HRYNKPPIPNGQCSSHASPAGNHSNSCAKEDIKDKEKDRLPDMNLEVDHVPFRG 624


>ref|XP_007037460.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508774705|gb|EOY21961.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 1016

 Score =  110 bits (276), Expect = 3e-21
 Identities = 158/594 (26%), Positives = 236/594 (39%), Gaps = 59/594 (9%)
 Frame = -3

Query: 1606 SSALSSKNAQKIYCSPNLPWSTAQSSVLIADSIQLPMAFAQEEGRHICPAHASTVTEESL 1427
            +  LS K++  +     LP  T+ S  L+    Q P++ A++  R     H     E S 
Sbjct: 109  NQVLSPKSSNHV----QLPQHTSTSINLVHS--QSPVSEAKDGNRGQA-GHDPIHIECSS 161

Query: 1426 KDYKLPEPKCRKIGKKILDLQLPADEYIDSXXXXXXXXXXEVPQVSAYSLNEISQV--VH 1253
            K  +  E  C+  GKKILDL+LPADEYIDS            P+V+    N + ++  V 
Sbjct: 162  KAPEFMESNCKMFGKKILDLELPADEYIDSEEEGFSEVKM-APEVTDIPTNALKKIPEVK 220

Query: 1252 NSHDK-------------PYGN--------NSRDFTDLNVPFKLDEEAAAKSYDSEAPT- 1139
            +  DK             P GN         S+   DLN+P KL+E+   +  D + P  
Sbjct: 221  DRGDKELPISASGCNSVFPEGNFIPSSISLKSKVLADLNIPVKLEEDKIPELSDFQDPII 280

Query: 1138 HHMNNSFYDLSTRTKFGSQNLPNDAINKRQ---DLEGCSDNPLPDHGKKHEWXXXXXXXX 968
             H   S  DLS ++    + L  + I   Q   D E   D+   D   KH          
Sbjct: 281  GHRETSLQDLSGKSNSSFEVLSKEVIPNSQIMRDPEADLDSLFLD---KHNMQRERITCN 337

Query: 967  XLDSFAKDQINRG--PWRTEKKF------SSSESSAW------TQGPTNNGLLGPNSASR 830
                 +++ +N       TEK          +E S+        +G   N +L       
Sbjct: 338  DKAGQSRNDLNSSCQDLYTEKLSIEHIDDEQAEDSSTPHGLDEAKGKLCNEILQCVGGDI 397

Query: 829  TCALQQPVSGAYM-ISYGISP-----------AGLWKTPVSDFGQSQPEVQASSASLGQN 686
            +    +PV+   M  SY I P              W+    D  +S   VQA     G++
Sbjct: 398  SSHSYKPVATVDMRSSYQIVPLADKMNSESSSVSSWRR---DLKRSPIAVQALPCFKGKS 454

Query: 685  SKSMMGISGFTRDELYQGTGVKSGPNL-DGQNFLLSSFCSRSKLLDLPSCTNDPN---NS 518
            SKS     G   +EL   T + S P L     F   S+ +  +L   P  T+  +   N+
Sbjct: 455  SKSFTRSLGLAGNELCLSTKLLSRPKLCSAATFPQESWQNDFQLEGQPPSTSSVSLNCNN 514

Query: 517  DNHGSLVSHEFRKYVEGSKDVGTPKNLNLN-TMPGGYSDPTEFQSIQITGEENKFEDSTM 341
            DN  +   H   KY +  K V + K+L+LN  +P   +D    Q       E   E+ST 
Sbjct: 515  DNGSAFERHSPAKYTKDFKYVMSVKSLDLNFVLPSFSTDVACSQGASSILGEKTLENST- 573

Query: 340  GLPWLKEKPVGKGKPNEESKISTQIEPGLLNPYNTGFIHDLKLRKIEESNMGTEKR-LAF 164
            G   + E P+   K  E    S  +E  +L   N+  +HD +L K+E SN    KR L F
Sbjct: 574  GCSQIAETPIHDSKSGERKDQSVPLEC-VLKQANSVCVHDAELDKVEASNSLDFKRILGF 632

Query: 163  HSNGKPHMSSDLHSFHDFPSELFQNQSKNQRVEEIEKGCISDVKSSCIHVPDLG 2
            H   KP + +   S H  P+    N    + +++ EK  + D+     HVP  G
Sbjct: 633  HRYNKPPIPNGQCSSHASPAGNHSNSCAKEDIKDKEKDRLPDMNLEVDHVPFRG 686


>ref|XP_007037457.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508774702|gb|EOY21958.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1025

 Score =  110 bits (276), Expect = 3e-21
 Identities = 158/594 (26%), Positives = 236/594 (39%), Gaps = 59/594 (9%)
 Frame = -3

Query: 1606 SSALSSKNAQKIYCSPNLPWSTAQSSVLIADSIQLPMAFAQEEGRHICPAHASTVTEESL 1427
            +  LS K++  +     LP  T+ S  L+    Q P++ A++  R     H     E S 
Sbjct: 118  NQVLSPKSSNHV----QLPQHTSTSINLVHS--QSPVSEAKDGNRGQA-GHDPIHIECSS 170

Query: 1426 KDYKLPEPKCRKIGKKILDLQLPADEYIDSXXXXXXXXXXEVPQVSAYSLNEISQV--VH 1253
            K  +  E  C+  GKKILDL+LPADEYIDS            P+V+    N + ++  V 
Sbjct: 171  KAPEFMESNCKMFGKKILDLELPADEYIDSEEEGFSEVKM-APEVTDIPTNALKKIPEVK 229

Query: 1252 NSHDK-------------PYGN--------NSRDFTDLNVPFKLDEEAAAKSYDSEAPT- 1139
            +  DK             P GN         S+   DLN+P KL+E+   +  D + P  
Sbjct: 230  DRGDKELPISASGCNSVFPEGNFIPSSISLKSKVLADLNIPVKLEEDKIPELSDFQDPII 289

Query: 1138 HHMNNSFYDLSTRTKFGSQNLPNDAINKRQ---DLEGCSDNPLPDHGKKHEWXXXXXXXX 968
             H   S  DLS ++    + L  + I   Q   D E   D+   D   KH          
Sbjct: 290  GHRETSLQDLSGKSNSSFEVLSKEVIPNSQIMRDPEADLDSLFLD---KHNMQRERITCN 346

Query: 967  XLDSFAKDQINRG--PWRTEKKF------SSSESSAW------TQGPTNNGLLGPNSASR 830
                 +++ +N       TEK          +E S+        +G   N +L       
Sbjct: 347  DKAGQSRNDLNSSCQDLYTEKLSIEHIDDEQAEDSSTPHGLDEAKGKLCNEILQCVGGDI 406

Query: 829  TCALQQPVSGAYM-ISYGISP-----------AGLWKTPVSDFGQSQPEVQASSASLGQN 686
            +    +PV+   M  SY I P              W+    D  +S   VQA     G++
Sbjct: 407  SSHSYKPVATVDMRSSYQIVPLADKMNSESSSVSSWRR---DLKRSPIAVQALPCFKGKS 463

Query: 685  SKSMMGISGFTRDELYQGTGVKSGPNL-DGQNFLLSSFCSRSKLLDLPSCTNDPN---NS 518
            SKS     G   +EL   T + S P L     F   S+ +  +L   P  T+  +   N+
Sbjct: 464  SKSFTRSLGLAGNELCLSTKLLSRPKLCSAATFPQESWQNDFQLEGQPPSTSSVSLNCNN 523

Query: 517  DNHGSLVSHEFRKYVEGSKDVGTPKNLNLN-TMPGGYSDPTEFQSIQITGEENKFEDSTM 341
            DN  +   H   KY +  K V + K+L+LN  +P   +D    Q       E   E+ST 
Sbjct: 524  DNGSAFERHSPAKYTKDFKYVMSVKSLDLNFVLPSFSTDVACSQGASSILGEKTLENST- 582

Query: 340  GLPWLKEKPVGKGKPNEESKISTQIEPGLLNPYNTGFIHDLKLRKIEESNMGTEKR-LAF 164
            G   + E P+   K  E    S  +E  +L   N+  +HD +L K+E SN    KR L F
Sbjct: 583  GCSQIAETPIHDSKSGERKDQSVPLEC-VLKQANSVCVHDAELDKVEASNSLDFKRILGF 641

Query: 163  HSNGKPHMSSDLHSFHDFPSELFQNQSKNQRVEEIEKGCISDVKSSCIHVPDLG 2
            H   KP + +   S H  P+    N    + +++ EK  + D+     HVP  G
Sbjct: 642  HRYNKPPIPNGQCSSHASPAGNHSNSCAKEDIKDKEKDRLPDMNLEVDHVPFRG 695


>ref|XP_007037461.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508774706|gb|EOY21962.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 999

 Score =  108 bits (270), Expect = 1e-20
 Identities = 150/561 (26%), Positives = 222/561 (39%), Gaps = 59/561 (10%)
 Frame = -3

Query: 1507 QLPMAFAQEEGRHICPAHASTVTEESLKDYKLPEPKCRKIGKKILDLQLPADEYIDSXXX 1328
            Q P++ A++  R     H     E S K  +  E  C+  GKKILDL+LPADEYIDS   
Sbjct: 119  QSPVSEAKDGNRGQA-GHDPIHIECSSKAPEFMESNCKMFGKKILDLELPADEYIDSEEE 177

Query: 1327 XXXXXXXEVPQVSAYSLNEISQV--VHNSHDK-------------PYGN--------NSR 1217
                     P+V+    N + ++  V +  DK             P GN         S+
Sbjct: 178  GFSEVKM-APEVTDIPTNALKKIPEVKDRGDKELPISASGCNSVFPEGNFIPSSISLKSK 236

Query: 1216 DFTDLNVPFKLDEEAAAKSYDSEAPT-HHMNNSFYDLSTRTKFGSQNLPNDAINKRQ--- 1049
               DLN+P KL+E+   +  D + P   H   S  DLS ++    + L  + I   Q   
Sbjct: 237  VLADLNIPVKLEEDKIPELSDFQDPIIGHRETSLQDLSGKSNSSFEVLSKEVIPNSQIMR 296

Query: 1048 DLEGCSDNPLPDHGKKHEWXXXXXXXXXLDSFAKDQINRG--PWRTEKKF------SSSE 893
            D E   D+   D   KH               +++ +N       TEK          +E
Sbjct: 297  DPEADLDSLFLD---KHNMQRERITCNDKAGQSRNDLNSSCQDLYTEKLSIEHIDDEQAE 353

Query: 892  SSAW------TQGPTNNGLLGPNSASRTCALQQPVSGAYM-ISYGISP-----------A 767
             S+        +G   N +L       +    +PV+   M  SY I P            
Sbjct: 354  DSSTPHGLDEAKGKLCNEILQCVGGDISSHSYKPVATVDMRSSYQIVPLADKMNSESSSV 413

Query: 766  GLWKTPVSDFGQSQPEVQASSASLGQNSKSMMGISGFTRDELYQGTGVKSGPNL-DGQNF 590
              W+    D  +S   VQA     G++SKS     G   +EL   T + S P L     F
Sbjct: 414  SSWRR---DLKRSPIAVQALPCFKGKSSKSFTRSLGLAGNELCLSTKLLSRPKLCSAATF 470

Query: 589  LLSSFCSRSKLLDLPSCTNDPN---NSDNHGSLVSHEFRKYVEGSKDVGTPKNLNLN-TM 422
               S+ +  +L   P  T+  +   N+DN  +   H   KY +  K V + K+L+LN  +
Sbjct: 471  PQESWQNDFQLEGQPPSTSSVSLNCNNDNGSAFERHSPAKYTKDFKYVMSVKSLDLNFVL 530

Query: 421  PGGYSDPTEFQSIQITGEENKFEDSTMGLPWLKEKPVGKGKPNEESKISTQIEPGLLNPY 242
            P   +D    Q       E   E+ST G   + E P+   K  E    S  +E  +L   
Sbjct: 531  PSFSTDVACSQGASSILGEKTLENST-GCSQIAETPIHDSKSGERKDQSVPLEC-VLKQA 588

Query: 241  NTGFIHDLKLRKIEESNMGTEKR-LAFHSNGKPHMSSDLHSFHDFPSELFQNQSKNQRVE 65
            N+  +HD +L K+E SN    KR L FH   KP + +   S H  P+    N    + ++
Sbjct: 589  NSVCVHDAELDKVEASNSLDFKRILGFHRYNKPPIPNGQCSSHASPAGNHSNSCAKEDIK 648

Query: 64   EIEKGCISDVKSSCIHVPDLG 2
            + EK  + D+     HVP  G
Sbjct: 649  DKEKDRLPDMNLEVDHVPFRG 669


>ref|XP_007037459.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508774704|gb|EOY21960.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 928

 Score =  108 bits (270), Expect = 1e-20
 Identities = 150/561 (26%), Positives = 222/561 (39%), Gaps = 59/561 (10%)
 Frame = -3

Query: 1507 QLPMAFAQEEGRHICPAHASTVTEESLKDYKLPEPKCRKIGKKILDLQLPADEYIDSXXX 1328
            Q P++ A++  R     H     E S K  +  E  C+  GKKILDL+LPADEYIDS   
Sbjct: 48   QSPVSEAKDGNRGQA-GHDPIHIECSSKAPEFMESNCKMFGKKILDLELPADEYIDSEEE 106

Query: 1327 XXXXXXXEVPQVSAYSLNEISQV--VHNSHDK-------------PYGN--------NSR 1217
                     P+V+    N + ++  V +  DK             P GN         S+
Sbjct: 107  GFSEVKM-APEVTDIPTNALKKIPEVKDRGDKELPISASGCNSVFPEGNFIPSSISLKSK 165

Query: 1216 DFTDLNVPFKLDEEAAAKSYDSEAPT-HHMNNSFYDLSTRTKFGSQNLPNDAINKRQ--- 1049
               DLN+P KL+E+   +  D + P   H   S  DLS ++    + L  + I   Q   
Sbjct: 166  VLADLNIPVKLEEDKIPELSDFQDPIIGHRETSLQDLSGKSNSSFEVLSKEVIPNSQIMR 225

Query: 1048 DLEGCSDNPLPDHGKKHEWXXXXXXXXXLDSFAKDQINRG--PWRTEKKF------SSSE 893
            D E   D+   D   KH               +++ +N       TEK          +E
Sbjct: 226  DPEADLDSLFLD---KHNMQRERITCNDKAGQSRNDLNSSCQDLYTEKLSIEHIDDEQAE 282

Query: 892  SSAW------TQGPTNNGLLGPNSASRTCALQQPVSGAYM-ISYGISP-----------A 767
             S+        +G   N +L       +    +PV+   M  SY I P            
Sbjct: 283  DSSTPHGLDEAKGKLCNEILQCVGGDISSHSYKPVATVDMRSSYQIVPLADKMNSESSSV 342

Query: 766  GLWKTPVSDFGQSQPEVQASSASLGQNSKSMMGISGFTRDELYQGTGVKSGPNL-DGQNF 590
              W+    D  +S   VQA     G++SKS     G   +EL   T + S P L     F
Sbjct: 343  SSWRR---DLKRSPIAVQALPCFKGKSSKSFTRSLGLAGNELCLSTKLLSRPKLCSAATF 399

Query: 589  LLSSFCSRSKLLDLPSCTNDPN---NSDNHGSLVSHEFRKYVEGSKDVGTPKNLNLN-TM 422
               S+ +  +L   P  T+  +   N+DN  +   H   KY +  K V + K+L+LN  +
Sbjct: 400  PQESWQNDFQLEGQPPSTSSVSLNCNNDNGSAFERHSPAKYTKDFKYVMSVKSLDLNFVL 459

Query: 421  PGGYSDPTEFQSIQITGEENKFEDSTMGLPWLKEKPVGKGKPNEESKISTQIEPGLLNPY 242
            P   +D    Q       E   E+ST G   + E P+   K  E    S  +E  +L   
Sbjct: 460  PSFSTDVACSQGASSILGEKTLENST-GCSQIAETPIHDSKSGERKDQSVPLEC-VLKQA 517

Query: 241  NTGFIHDLKLRKIEESNMGTEKR-LAFHSNGKPHMSSDLHSFHDFPSELFQNQSKNQRVE 65
            N+  +HD +L K+E SN    KR L FH   KP + +   S H  P+    N    + ++
Sbjct: 518  NSVCVHDAELDKVEASNSLDFKRILGFHRYNKPPIPNGQCSSHASPAGNHSNSCAKEDIK 577

Query: 64   EIEKGCISDVKSSCIHVPDLG 2
            + EK  + D+     HVP  G
Sbjct: 578  DKEKDRLPDMNLEVDHVPFRG 598

