BLASTX nr result

ID: Sinomenium21_contig00027556 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00027556
         (945 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270804.2| PREDICTED: uncharacterized protein LOC100258...   243   8e-62
emb|CBI25523.3| unnamed protein product [Vitis vinifera]              243   8e-62
ref|XP_002520903.1| hypothetical protein RCOM_0690420 [Ricinus c...   228   3e-57
ref|XP_006371340.1| hypothetical protein POPTR_0019s09240g [Popu...   216   1e-53
ref|XP_007011434.1| Uncharacterized protein isoform 4 [Theobroma...   213   7e-53
ref|XP_007011433.1| Uncharacterized protein isoform 3 [Theobroma...   213   7e-53
ref|XP_007011431.1| Uncharacterized protein isoform 1 [Theobroma...   213   7e-53
ref|XP_006578854.1| PREDICTED: uncharacterized protein LOC100793...   212   2e-52
ref|XP_006486110.1| PREDICTED: uncharacterized protein LOC102622...   211   3e-52
ref|XP_006435980.1| hypothetical protein CICLE_v10030611mg [Citr...   211   3e-52
ref|XP_002319873.1| hypothetical protein POPTR_0013s09310g [Popu...   211   4e-52
ref|XP_006578855.1| PREDICTED: uncharacterized protein LOC100793...   210   8e-52
ref|XP_006581697.1| PREDICTED: uncharacterized protein LOC100784...   207   6e-51
ref|XP_006581699.1| PREDICTED: uncharacterized protein LOC100784...   207   6e-51
ref|XP_007220911.1| hypothetical protein PRUPE_ppa000661mg [Prun...   199   2e-48
emb|CAN75588.1| hypothetical protein VITISV_042879 [Vitis vinifera]   195   3e-47
ref|XP_007136387.1| hypothetical protein PHAVU_009G041000g [Phas...   192   1e-46
ref|XP_004309001.1| PREDICTED: uncharacterized protein LOC101294...   192   1e-46
ref|XP_007136388.1| hypothetical protein PHAVU_009G041000g [Phas...   191   5e-46
ref|XP_007011432.1| Uncharacterized protein isoform 2 [Theobroma...   189   1e-45

>ref|XP_002270804.2| PREDICTED: uncharacterized protein LOC100258677 [Vitis vinifera]
          Length = 958

 Score =  243 bits (620), Expect = 8e-62
 Identities = 137/265 (51%), Positives = 165/265 (62%)
 Frame = -3

Query: 934  EPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXXX 755
            E E  VNS + C   EN DQ  + GD       ES ++++ +E   ISP+AW        
Sbjct: 703  ESETQVNSPQKCGNIENLDQVTADGDDK-KKMVESSLKMEGEEESAISPIAWVEIEEHQD 761

Query: 754  XXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVVYQ 575
               PCD    Q+  P +IAP+A          SQMLQE++ E +  EWG AENPP VVY 
Sbjct: 762  SHIPCDDITSQLISPASIAPVALSSPRVRHSLSQMLQEESSEPDSIEWGNAENPPAVVYH 821

Query: 574  KDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASLQ 395
            KD PKGFKRLLKFARKS+GD N TGWSSPS FSEGEDDAEE KA +KR+AD LLKKA+L 
Sbjct: 822  KDAPKGFKRLLKFARKSRGDGNTTGWSSPSAFSEGEDDAEEAKAINKRNADTLLKKATLH 881

Query: 394  ANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSVSTTRKYSAQSSHKLQEGRDISAGA 215
            A  +G QK    SS  GG     A+  +LS    + S   K++ QSSHKLQEG+ +SA A
Sbjct: 882  AKNYGQQK----SSLSGGYERNVAARELLS----AQSNISKFNTQSSHKLQEGQ-VSATA 932

Query: 214  TSTKASRSFFSLPTFRSSKSNEGKL 140
             +TKA+RSFFSL  FR SK NE KL
Sbjct: 933  PTTKATRSFFSLSAFRGSKPNETKL 957


>emb|CBI25523.3| unnamed protein product [Vitis vinifera]
          Length = 1121

 Score =  243 bits (620), Expect = 8e-62
 Identities = 137/265 (51%), Positives = 165/265 (62%)
 Frame = -3

Query: 934  EPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXXX 755
            E E  VNS + C   EN DQ  + GD       ES ++++ +E   ISP+AW        
Sbjct: 866  ESETQVNSPQKCGNIENLDQVTADGDDK-KKMVESSLKMEGEEESAISPIAWVEIEEHQD 924

Query: 754  XXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVVYQ 575
               PCD    Q+  P +IAP+A          SQMLQE++ E +  EWG AENPP VVY 
Sbjct: 925  SHIPCDDITSQLISPASIAPVALSSPRVRHSLSQMLQEESSEPDSIEWGNAENPPAVVYH 984

Query: 574  KDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASLQ 395
            KD PKGFKRLLKFARKS+GD N TGWSSPS FSEGEDDAEE KA +KR+AD LLKKA+L 
Sbjct: 985  KDAPKGFKRLLKFARKSRGDGNTTGWSSPSAFSEGEDDAEEAKAINKRNADTLLKKATLH 1044

Query: 394  ANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSVSTTRKYSAQSSHKLQEGRDISAGA 215
            A  +G QK    SS  GG     A+  +LS    + S   K++ QSSHKLQEG+ +SA A
Sbjct: 1045 AKNYGQQK----SSLSGGYERNVAARELLS----AQSNISKFNTQSSHKLQEGQ-VSATA 1095

Query: 214  TSTKASRSFFSLPTFRSSKSNEGKL 140
             +TKA+RSFFSL  FR SK NE KL
Sbjct: 1096 PTTKATRSFFSLSAFRGSKPNETKL 1120


>ref|XP_002520903.1| hypothetical protein RCOM_0690420 [Ricinus communis]
            gi|223540034|gb|EEF41612.1| hypothetical protein
            RCOM_0690420 [Ricinus communis]
          Length = 1051

 Score =  228 bits (581), Expect = 3e-57
 Identities = 127/265 (47%), Positives = 163/265 (61%)
 Frame = -3

Query: 937  LEPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXX 758
            +EPEA V S ENC+++   ++    GD SF +TAES  +I+SQ+   ISP+AW       
Sbjct: 795  MEPEALVKSHENCDESVKINELAIDGDDSFKDTAESSTKIESQKESVISPIAWEEIDECQ 854

Query: 757  XXXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVVY 578
                   +   Q+  P ++ P+           SQMLQE++ E +  EWG AENPP + Y
Sbjct: 855  HVHSSYGNGASQLASPVHVEPVGLSSPRVRHSLSQMLQEESSEPDTFEWGNAENPPAMAY 914

Query: 577  QKDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASL 398
            QKD PKG KRLLKFARKSKGDANV GWSSPSVFSEGEDDAEE KA SKR+ D LL+KA+L
Sbjct: 915  QKDAPKGLKRLLKFARKSKGDANVAGWSSPSVFSEGEDDAEESKATSKRNTDNLLRKAAL 974

Query: 397  QANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSVSTTRKYSAQSSHKLQEGRDISAG 218
             +  +G Q+ T V +       TR         L + S   K+  Q+S KLQ+G ++S  
Sbjct: 975  HSKNYG-QQTTSVCAGPEKKIDTRL--------LSAESNLSKFGVQNSEKLQKG-NVSTA 1024

Query: 217  ATSTKASRSFFSLPTFRSSKSNEGK 143
            A++TKA+RSFFSL  FR SK NE K
Sbjct: 1025 ASTTKATRSFFSLSAFRGSKPNETK 1049


>ref|XP_006371340.1| hypothetical protein POPTR_0019s09240g [Populus trichocarpa]
            gi|550317093|gb|ERP49137.1| hypothetical protein
            POPTR_0019s09240g [Populus trichocarpa]
          Length = 1099

 Score =  216 bits (550), Expect = 1e-53
 Identities = 122/266 (45%), Positives = 159/266 (59%), Gaps = 1/266 (0%)
 Frame = -3

Query: 937  LEPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXX 758
            ++ E   NS +N  + EN  +  +  D  F +T +S    QS+E   ISP AW       
Sbjct: 841  MDSETVGNSHQNSGEVENFKELATDVDDGFKDTVQSSANFQSEEDSVISPSAWVEIEEQK 900

Query: 757  XXXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNG-ESEIAEWGKAENPPTVV 581
                       Q++ P   AP+           SQMLQEDN  E +I EWG AENPP+VV
Sbjct: 901  DLPSIHGDATIQLSPPVRAAPVGFPSQGVRHSLSQMLQEDNNSEPDIVEWGNAENPPSVV 960

Query: 580  YQKDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKAS 401
            YQKD PKG KRLLKFARKSKGDAN+TGWSSPSV+SEGEDD EE KA +KR+ D LL+KA+
Sbjct: 961  YQKDAPKGLKRLLKFARKSKGDANMTGWSSPSVYSEGEDDGEESKAINKRNTDNLLRKAA 1020

Query: 400  LQANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSVSTTRKYSAQSSHKLQEGRDISA 221
              +   G Q+ +F   Y   + +  A + +L+      S   K++AQSSH+LQ+G ++S 
Sbjct: 1021 HHSKDSGQQQTSFFEGY---DRNVNAHELLLAQ-----SNISKFNAQSSHQLQKG-NVST 1071

Query: 220  GATSTKASRSFFSLPTFRSSKSNEGK 143
              ++TKA+RSFFSL  FR SK NE K
Sbjct: 1072 ATSTTKATRSFFSLSAFRGSKPNETK 1097


>ref|XP_007011434.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508728347|gb|EOY20244.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 934

 Score =  213 bits (543), Expect = 7e-53
 Identities = 124/267 (46%), Positives = 160/267 (59%)
 Frame = -3

Query: 937  LEPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXX 758
            ++ E  VN  +  +  E+ D+     D    N AES    + +E +TISP AW       
Sbjct: 684  IQLETQVNGHQKSDVIESIDELAPDVDDGLKNIAESS---KCEEELTISPAAWVEIEEHQ 740

Query: 757  XXXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVVY 578
                 CD    + T   +IAP+           SQMLQE++ E++  EWG AENPP +VY
Sbjct: 741  DLPNQCDDNTGENTSSASIAPVGSASPRVRHSLSQMLQEESSEADTTEWGNAENPPAMVY 800

Query: 577  QKDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASL 398
            QKD PKG KRLLKFARKSKGDAN+TGWSSPSVFSEGEDDAEE KA +KR+AD LL+KA+L
Sbjct: 801  QKDAPKGLKRLLKFARKSKGDANITGWSSPSVFSEGEDDAEESKAINKRNADNLLRKAAL 860

Query: 397  QANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSVSTTRKYSAQSSHKLQEGRDISAG 218
            QA  +G QK   +S  G  N        + +H+LPS  +    S   +HK+ +G  +S  
Sbjct: 861  QAKNYGQQK---MSCEGYEN-------HLGAHELPSAQS--GISTFDAHKMHKG-SVSTA 907

Query: 217  ATSTKASRSFFSLPTFRSSKSNEGKLR 137
            A++TK +RSFFSL  FR SK +E KLR
Sbjct: 908  ASTTKGTRSFFSLSAFRGSKPSEMKLR 934


>ref|XP_007011433.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508728346|gb|EOY20243.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 1100

 Score =  213 bits (543), Expect = 7e-53
 Identities = 124/267 (46%), Positives = 160/267 (59%)
 Frame = -3

Query: 937  LEPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXX 758
            ++ E  VN  +  +  E+ D+     D    N AES    + +E +TISP AW       
Sbjct: 850  IQLETQVNGHQKSDVIESIDELAPDVDDGLKNIAESS---KCEEELTISPAAWVEIEEHQ 906

Query: 757  XXXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVVY 578
                 CD    + T   +IAP+           SQMLQE++ E++  EWG AENPP +VY
Sbjct: 907  DLPNQCDDNTGENTSSASIAPVGSASPRVRHSLSQMLQEESSEADTTEWGNAENPPAMVY 966

Query: 577  QKDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASL 398
            QKD PKG KRLLKFARKSKGDAN+TGWSSPSVFSEGEDDAEE KA +KR+AD LL+KA+L
Sbjct: 967  QKDAPKGLKRLLKFARKSKGDANITGWSSPSVFSEGEDDAEESKAINKRNADNLLRKAAL 1026

Query: 397  QANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSVSTTRKYSAQSSHKLQEGRDISAG 218
            QA  +G QK   +S  G  N        + +H+LPS  +    S   +HK+ +G  +S  
Sbjct: 1027 QAKNYGQQK---MSCEGYEN-------HLGAHELPSAQS--GISTFDAHKMHKG-SVSTA 1073

Query: 217  ATSTKASRSFFSLPTFRSSKSNEGKLR 137
            A++TK +RSFFSL  FR SK +E KLR
Sbjct: 1074 ASTTKGTRSFFSLSAFRGSKPSEMKLR 1100


>ref|XP_007011431.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508728344|gb|EOY20241.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1099

 Score =  213 bits (543), Expect = 7e-53
 Identities = 124/267 (46%), Positives = 160/267 (59%)
 Frame = -3

Query: 937  LEPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXX 758
            ++ E  VN  +  +  E+ D+     D    N AES    + +E +TISP AW       
Sbjct: 849  IQLETQVNGHQKSDVIESIDELAPDVDDGLKNIAESS---KCEEELTISPAAWVEIEEHQ 905

Query: 757  XXXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVVY 578
                 CD    + T   +IAP+           SQMLQE++ E++  EWG AENPP +VY
Sbjct: 906  DLPNQCDDNTGENTSSASIAPVGSASPRVRHSLSQMLQEESSEADTTEWGNAENPPAMVY 965

Query: 577  QKDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASL 398
            QKD PKG KRLLKFARKSKGDAN+TGWSSPSVFSEGEDDAEE KA +KR+AD LL+KA+L
Sbjct: 966  QKDAPKGLKRLLKFARKSKGDANITGWSSPSVFSEGEDDAEESKAINKRNADNLLRKAAL 1025

Query: 397  QANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSVSTTRKYSAQSSHKLQEGRDISAG 218
            QA  +G QK   +S  G  N        + +H+LPS  +    S   +HK+ +G  +S  
Sbjct: 1026 QAKNYGQQK---MSCEGYEN-------HLGAHELPSAQS--GISTFDAHKMHKG-SVSTA 1072

Query: 217  ATSTKASRSFFSLPTFRSSKSNEGKLR 137
            A++TK +RSFFSL  FR SK +E KLR
Sbjct: 1073 ASTTKGTRSFFSLSAFRGSKPSEMKLR 1099


>ref|XP_006578854.1| PREDICTED: uncharacterized protein LOC100793207 isoform X1 [Glycine
            max]
          Length = 1091

 Score =  212 bits (540), Expect = 2e-52
 Identities = 122/264 (46%), Positives = 157/264 (59%)
 Frame = -3

Query: 934  EPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXXX 755
            EP+  +++Q  C +TEN DQ  + G+   T T ES + I+++E  TISP AW        
Sbjct: 843  EPDPQIHNQLQCSETENLDQNPTDGE-VLTYTEESSLNIRNEE-STISPSAWVETEEDLE 900

Query: 754  XXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVVYQ 575
               PC+    Q     N AP+           SQMLQE++ E +  EWG AENPP ++YQ
Sbjct: 901  MPKPCEDDTFQSVSLANAAPVGSASPRVRHSLSQMLQEESSEPDTCEWGNAENPPAMIYQ 960

Query: 574  KDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASLQ 395
            KD PKGFKRLLKFARKSKGDA  TGWSSPSVFSEGEDDAEE K ++KR+AD LL+KA+L 
Sbjct: 961  KDAPKGFKRLLKFARKSKGDAGSTGWSSPSVFSEGEDDAEEFKNSNKRNADNLLRKAALN 1020

Query: 394  ANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSVSTTRKYSAQSSHKLQEGRDISAGA 215
               +G  KN+    Y                +L       +   + S+K+Q+GRD+ AG+
Sbjct: 1021 VKSYGQPKNSVHEGY--------------ERNLDFCHAAGRDDGKGSYKMQDGRDLGAGS 1066

Query: 214  TSTKASRSFFSLPTFRSSKSNEGK 143
            T T+ASRSFFSL  FR SK +E K
Sbjct: 1067 T-TRASRSFFSLSAFRGSKPSESK 1089


>ref|XP_006486110.1| PREDICTED: uncharacterized protein LOC102622185 isoform X1 [Citrus
            sinensis] gi|568865498|ref|XP_006486111.1| PREDICTED:
            uncharacterized protein LOC102622185 isoform X2 [Citrus
            sinensis] gi|568865500|ref|XP_006486112.1| PREDICTED:
            uncharacterized protein LOC102622185 isoform X3 [Citrus
            sinensis]
          Length = 1122

 Score =  211 bits (537), Expect = 3e-52
 Identities = 122/267 (45%), Positives = 157/267 (58%), Gaps = 1/267 (0%)
 Frame = -3

Query: 937  LEPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXX 758
            +E E +++SQ+ C + EN ++  +  D +  N  E P++IQ +E   ISP AW       
Sbjct: 866  MESETTISSQQICNEVENFNEPAADNDDALKNMTEMPLQIQVEEESIISPSAWVEIEEDN 925

Query: 757  XXXXPCD-SCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVV 581
                        Q+  P NI P+           SQMLQED+ E E  EWG AENP  +V
Sbjct: 926  HDLPNPHHDSTSQLANPANIVPIGLSSPRVRHSLSQMLQEDSSEPETTEWGIAENPRALV 985

Query: 580  YQKDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKAS 401
            YQKD PKG KRLLKFARKSK DAN +GWSSPSVFSEGE D EE KA+SKR+AD LL+KA+
Sbjct: 986  YQKDAPKGLKRLLKFARKSKTDANSSGWSSPSVFSEGESDVEESKASSKRNADNLLRKAA 1045

Query: 400  LQANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSVSTTRKYSAQSSHKLQEGRDISA 221
            L A  +G QK + +  Y            + +H L + S   ++ A +S KLQ+   ++A
Sbjct: 1046 LNAKIYGMQKTSVLEDY---------EKHMDAHLLSAQSDISRFDANNSEKLQKNH-VAA 1095

Query: 220  GATSTKASRSFFSLPTFRSSKSNEGKL 140
             A +TKASRSFFSL  FR SK NE KL
Sbjct: 1096 VAPTTKASRSFFSLSAFRGSKPNETKL 1122


>ref|XP_006435980.1| hypothetical protein CICLE_v10030611mg [Citrus clementina]
            gi|557538176|gb|ESR49220.1| hypothetical protein
            CICLE_v10030611mg [Citrus clementina]
          Length = 1016

 Score =  211 bits (537), Expect = 3e-52
 Identities = 122/267 (45%), Positives = 157/267 (58%), Gaps = 1/267 (0%)
 Frame = -3

Query: 937  LEPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXX 758
            +E E +++SQ+ C + EN ++  +  D +  N  E P++IQ +E   ISP AW       
Sbjct: 760  MESETTISSQQICNEVENFNEPAADNDDALKNMTEMPLQIQVEEESIISPSAWVEIEEDN 819

Query: 757  XXXXPCD-SCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVV 581
                        Q+  P NI P+           SQMLQED+ E E  EWG AENP  +V
Sbjct: 820  HDLPNPHHDSTSQLANPANIVPIGLSSPRVRHSLSQMLQEDSSEPETTEWGIAENPRALV 879

Query: 580  YQKDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKAS 401
            YQKD PKG KRLLKFARKSK DAN +GWSSPSVFSEGE D EE KA+SKR+AD LL+KA+
Sbjct: 880  YQKDAPKGLKRLLKFARKSKTDANSSGWSSPSVFSEGESDVEESKASSKRNADNLLRKAA 939

Query: 400  LQANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSVSTTRKYSAQSSHKLQEGRDISA 221
            L A  +G QK + +  Y            + +H L + S   ++ A +S KLQ+   ++A
Sbjct: 940  LNAKIYGMQKTSVLEDY---------EKHMDAHLLSAQSDISRFDANNSEKLQKNH-VAA 989

Query: 220  GATSTKASRSFFSLPTFRSSKSNEGKL 140
             A +TKASRSFFSL  FR SK NE KL
Sbjct: 990  VAPTTKASRSFFSLSAFRGSKPNETKL 1016


>ref|XP_002319873.1| hypothetical protein POPTR_0013s09310g [Populus trichocarpa]
            gi|222858249|gb|EEE95796.1| hypothetical protein
            POPTR_0013s09310g [Populus trichocarpa]
          Length = 858

 Score =  211 bits (536), Expect = 4e-52
 Identities = 124/266 (46%), Positives = 155/266 (58%), Gaps = 2/266 (0%)
 Frame = -3

Query: 934  EPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXXX 755
            E E   N  +N  + EN ++ V+  D SF    +S    Q  E   ISP AW        
Sbjct: 602  ELETVENGHQNSGEMENFNELVTDADDSFKYMVQSSASFQFHEDSVISPSAWVEIEEQQN 661

Query: 754  XXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNG-ESEIAEWGKAENPPTVVY 578
                 D    Q + P  +AP+           SQMLQEDN  E +  EWG AENPP VVY
Sbjct: 662  LPSTNDDTT-QHSSPVLVAPVGLPSQGVRHSLSQMLQEDNNSEPDTVEWGNAENPPAVVY 720

Query: 577  QKDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASL 398
            QKD PKG KRLLKFARKSKGDAN+TGWSSP VFSEGEDD EE KA +KR+ D L +KA+L
Sbjct: 721  QKDAPKGLKRLLKFARKSKGDANMTGWSSPYVFSEGEDDGEESKAINKRNTDNLQRKAAL 780

Query: 397  QANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLP-SVSTTRKYSAQSSHKLQEGRDISA 221
             +N  G Q+++F   Y           ++ +H+LP + S   K++AQSSH+L +G   S 
Sbjct: 781  HSNDHGKQQSSFFEGY---------DRNLKAHELPLAQSNISKFNAQSSHQLHKGH-FST 830

Query: 220  GATSTKASRSFFSLPTFRSSKSNEGK 143
             A++TKA+RSFFSL  FR SK NE K
Sbjct: 831  AASTTKATRSFFSLSAFRGSKPNETK 856


>ref|XP_006578855.1| PREDICTED: uncharacterized protein LOC100793207 isoform X2 [Glycine
            max]
          Length = 1085

 Score =  210 bits (534), Expect = 8e-52
 Identities = 123/264 (46%), Positives = 157/264 (59%)
 Frame = -3

Query: 934  EPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXXX 755
            EP+  +++Q  C +TEN DQ  + G+   T T ES + I+++E  TISP AW        
Sbjct: 843  EPDPQIHNQLQCSETENLDQNPTDGE-VLTYTEESSLNIRNEE-STISPSAWVETEEDLE 900

Query: 754  XXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVVYQ 575
               PC+    Q     N AP+           SQMLQE++ E +  EWG AENPP ++YQ
Sbjct: 901  MPKPCEDDTFQSVSLANAAPVGSASPRVRHSLSQMLQEESSEPDTCEWGNAENPPAMIYQ 960

Query: 574  KDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASLQ 395
            KD PKGFKRLLKFARKSKGDA  TGWSSPSVFSEGEDDAEE K ++KR+AD LL+KA+L 
Sbjct: 961  KDAPKGFKRLLKFARKSKGDAGSTGWSSPSVFSEGEDDAEEFKNSNKRNADNLLRKAALN 1020

Query: 394  ANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSVSTTRKYSAQSSHKLQEGRDISAGA 215
               +G  KN+    Y          +  L  D            + S+K+Q+GRD+ AG+
Sbjct: 1021 VKSYGQPKNSVHEGY----------ERNLGRD----------DGKGSYKMQDGRDLGAGS 1060

Query: 214  TSTKASRSFFSLPTFRSSKSNEGK 143
            T T+ASRSFFSL  FR SK +E K
Sbjct: 1061 T-TRASRSFFSLSAFRGSKPSESK 1083


>ref|XP_006581697.1| PREDICTED: uncharacterized protein LOC100784082 isoform X1 [Glycine
            max] gi|571460435|ref|XP_006581698.1| PREDICTED:
            uncharacterized protein LOC100784082 isoform X2 [Glycine
            max]
          Length = 1093

 Score =  207 bits (526), Expect = 6e-51
 Identities = 119/264 (45%), Positives = 154/264 (58%)
 Frame = -3

Query: 934  EPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXXX 755
            EP+  +++Q  C +TEN DQ  + G+   T T ES + I+++E  TISP AW        
Sbjct: 850  EPDPQIHNQLQCGETENLDQNPTDGE-VLTYTGESSINIRNEEESTISPSAWLETEEDLE 908

Query: 754  XXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVVYQ 575
               PC+    Q     N AP+           SQMLQE++ E +  EWG AENPP ++YQ
Sbjct: 909  MPKPCEDDTFQSASLANAAPVGSASPRVRHSLSQMLQEESSEPDTCEWGNAENPPAMIYQ 968

Query: 574  KDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASLQ 395
            K+ PKG KRLLKFARKSKGD   TGWSSPSVFSEGEDDAEE K ++KR+AD LL+KA+  
Sbjct: 969  KNAPKGLKRLLKFARKSKGDTGSTGWSSPSVFSEGEDDAEEFKNSNKRNADNLLRKAAQN 1028

Query: 394  ANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSVSTTRKYSAQSSHKLQEGRDISAGA 215
               +G  KN+    Y          +  L  D            + SHK+++GRD+ AG+
Sbjct: 1029 VKSYGQPKNSVHEGY----------ERNLGRD----------DGKGSHKMRDGRDLGAGS 1068

Query: 214  TSTKASRSFFSLPTFRSSKSNEGK 143
            T T+ASRSFFSL  FR SK +E K
Sbjct: 1069 T-TRASRSFFSLSAFRGSKPSESK 1091


>ref|XP_006581699.1| PREDICTED: uncharacterized protein LOC100784082 isoform X3 [Glycine
            max]
          Length = 1084

 Score =  207 bits (526), Expect = 6e-51
 Identities = 119/264 (45%), Positives = 154/264 (58%)
 Frame = -3

Query: 934  EPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXXX 755
            EP+  +++Q  C +TEN DQ  + G+   T T ES + I+++E  TISP AW        
Sbjct: 841  EPDPQIHNQLQCGETENLDQNPTDGE-VLTYTGESSINIRNEEESTISPSAWLETEEDLE 899

Query: 754  XXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVVYQ 575
               PC+    Q     N AP+           SQMLQE++ E +  EWG AENPP ++YQ
Sbjct: 900  MPKPCEDDTFQSASLANAAPVGSASPRVRHSLSQMLQEESSEPDTCEWGNAENPPAMIYQ 959

Query: 574  KDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASLQ 395
            K+ PKG KRLLKFARKSKGD   TGWSSPSVFSEGEDDAEE K ++KR+AD LL+KA+  
Sbjct: 960  KNAPKGLKRLLKFARKSKGDTGSTGWSSPSVFSEGEDDAEEFKNSNKRNADNLLRKAAQN 1019

Query: 394  ANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSVSTTRKYSAQSSHKLQEGRDISAGA 215
               +G  KN+    Y          +  L  D            + SHK+++GRD+ AG+
Sbjct: 1020 VKSYGQPKNSVHEGY----------ERNLGRD----------DGKGSHKMRDGRDLGAGS 1059

Query: 214  TSTKASRSFFSLPTFRSSKSNEGK 143
            T T+ASRSFFSL  FR SK +E K
Sbjct: 1060 T-TRASRSFFSLSAFRGSKPSESK 1082


>ref|XP_007220911.1| hypothetical protein PRUPE_ppa000661mg [Prunus persica]
            gi|462417373|gb|EMJ22110.1| hypothetical protein
            PRUPE_ppa000661mg [Prunus persica]
          Length = 1048

 Score =  199 bits (505), Expect = 2e-48
 Identities = 120/269 (44%), Positives = 149/269 (55%), Gaps = 2/269 (0%)
 Frame = -3

Query: 937  LEPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXX 758
            +E EA +N    C +T++ D   +  +      AES ++IQ++E  TISP AW       
Sbjct: 809  VESEALINDNLTCSETQHIDPVSADSNDDLKYVAESSLQIQAEEESTISPSAWVEIEEHQ 868

Query: 757  XXXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVVY 578
                 C+    Q+T  TN+AP            SQMLQE++ E +  EWG AENPP++V+
Sbjct: 869  PISP-CNDSSSQLTTSTNVAPAGLSSPRVRHSLSQMLQEESNEPDTIEWGNAENPPSIVF 927

Query: 577  QKDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASL 398
            QKD PKG KRLLKFARKSKGD N  GWSSPSVFSEGEDD           AD +L+KASL
Sbjct: 928  QKDAPKGLKRLLKFARKSKGDGNTAGWSSPSVFSEGEDD-----------ADSVLRKASL 976

Query: 397  QANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSV-STTRKYSAQS-SHKLQEGRDIS 224
             A  +G QK +    Y              + +L S  S   K+  QS SHKLQE RD  
Sbjct: 977  NARNYGQQKTSLGEGYD-------------ARELYSAQSNISKFDGQSCSHKLQESRD-- 1021

Query: 223  AGATSTKASRSFFSLPTFRSSKSNEGKLR 137
              A +TKA+RSFFSL  FR SK NE K R
Sbjct: 1022 --APATKATRSFFSLSAFRGSKPNEMKFR 1048


>emb|CAN75588.1| hypothetical protein VITISV_042879 [Vitis vinifera]
          Length = 927

 Score =  195 bits (495), Expect = 3e-47
 Identities = 106/210 (50%), Positives = 126/210 (60%)
 Frame = -3

Query: 934  EPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXXX 755
            E E  VNS + C   EN DQ  + GD       ES ++ + +E   ISP+AW        
Sbjct: 703  ESETQVNSPQKCGNIENLDQVTADGDDK-KKMVESSLKXEGEEESAISPIAWVEIEEHQD 761

Query: 754  XXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVVYQ 575
               PCD    Q+  P +IAP+A          SQMLQE++ E +  EWG AENPP VVY 
Sbjct: 762  SHIPCDDITSQLISPASIAPVALSSPRVRHSLSQMLQEESSEPDSIEWGNAENPPAVVYH 821

Query: 574  KDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASLQ 395
            KD PKGFKRLLKFARKS+GD N TGWSSPS FSEGEDDAEE KA +KR+AD LLKKA+L 
Sbjct: 822  KDAPKGFKRLLKFARKSRGDGNTTGWSSPSAFSEGEDDAEEAKAINKRNADTLLKKATLH 881

Query: 394  ANGFGHQKNTFVSSYGGGNSSTRASDSVLS 305
            A  +G QK    SS  GG     A+  +LS
Sbjct: 882  AKNYGQQK----SSLSGGYERNVAARELLS 907


>ref|XP_007136387.1| hypothetical protein PHAVU_009G041000g [Phaseolus vulgaris]
            gi|561009474|gb|ESW08381.1| hypothetical protein
            PHAVU_009G041000g [Phaseolus vulgaris]
          Length = 1082

 Score =  192 bits (489), Expect = 1e-46
 Identities = 116/264 (43%), Positives = 145/264 (54%)
 Frame = -3

Query: 934  EPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXXX 755
            EP+  +N+Q  C + E  DQ    GD   T   ES + I+++E  TISP AW        
Sbjct: 842  EPDPQINNQSQCSEPEKLDQNPIDGD-VVTYFEESSLSIRNEEESTISPSAWVDAEEDLL 900

Query: 754  XXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVVYQ 575
               PC+    Q     N  P+           SQML E++ E +  EWG AENPP ++YQ
Sbjct: 901  MPKPCEDDTFQSESLANAVPVGSSSPRVRHSLSQMLLEESSEPDTCEWGNAENPPAMIYQ 960

Query: 574  KDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASLQ 395
            KD PKG KRLLKFARKSKGD   TGWSSPSVFSEGEDDAEE K ++KR+AD LL+KA+L 
Sbjct: 961  KDAPKGLKRLLKFARKSKGDTGSTGWSSPSVFSEGEDDAEELKNSNKRNADNLLRKAALN 1020

Query: 394  ANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSVSTTRKYSAQSSHKLQEGRDISAGA 215
               +G  KN+                    HD    +   +   + SHK+Q+G    AG 
Sbjct: 1021 VKSYGQPKNSV-------------------HDGYERNLAGRGDGKGSHKMQDG----AGP 1057

Query: 214  TSTKASRSFFSLPTFRSSKSNEGK 143
            T T+ASRSFFSL  FR SK +E K
Sbjct: 1058 T-TRASRSFFSLSAFRGSKPSESK 1080


>ref|XP_004309001.1| PREDICTED: uncharacterized protein LOC101294123 [Fragaria vesca
            subsp. vesca]
          Length = 1034

 Score =  192 bits (489), Expect = 1e-46
 Identities = 119/269 (44%), Positives = 148/269 (55%), Gaps = 2/269 (0%)
 Frame = -3

Query: 937  LEPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXX 758
            +E +A +N    C++T+  D   +  +    + AES  +IQ +E + ISP AW       
Sbjct: 793  VESKAPINDNLTCDETQEIDPVSADSNDDVKDVAESTTKIQVEEELLISPRAWVEIEEHQ 852

Query: 757  XXXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVVY 578
                   S + Q+    N+AP            SQMLQE++ E +  EWG AENPP +++
Sbjct: 853  AMSPYNHS-KSQLITSANVAPTGLSSPRVRHSLSQMLQEESNEPDNIEWGNAENPPAIIF 911

Query: 577  QKDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASL 398
            QKD PKG KRLLKFARKSKGDAN TGWSSPSVFSEGEDD            D +L+KASL
Sbjct: 912  QKDAPKGLKRLLKFARKSKGDANSTGWSSPSVFSEGEDD------------DTVLRKASL 959

Query: 397  QANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSV-STTRKYSAQ-SSHKLQEGRDIS 224
             A  +G QK +    Y              + DL S  S   K+ AQ SSHK QE RDI+
Sbjct: 960  HAKNYGQQKTSLGEGYD-------------ARDLYSAQSNISKFDAQSSSHKYQESRDIA 1006

Query: 223  AGATSTKASRSFFSLPTFRSSKSNEGKLR 137
            A A +TKA RSFFSL  FR SK NE K R
Sbjct: 1007 A-APTTKAPRSFFSLSAFRGSKPNEMKFR 1034


>ref|XP_007136388.1| hypothetical protein PHAVU_009G041000g [Phaseolus vulgaris]
            gi|561009475|gb|ESW08382.1| hypothetical protein
            PHAVU_009G041000g [Phaseolus vulgaris]
          Length = 1081

 Score =  191 bits (484), Expect = 5e-46
 Identities = 115/264 (43%), Positives = 143/264 (54%)
 Frame = -3

Query: 934  EPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXXX 755
            EP+  +N+Q  C + E  DQ    GD   T   ES + I+++E  TISP AW        
Sbjct: 842  EPDPQINNQSQCSEPEKLDQNPIDGD-VVTYFEESSLSIRNEEESTISPSAWVDAEEDLL 900

Query: 754  XXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVVYQ 575
               PC+    Q     N  P+           SQML E++ E +  EWG AENPP ++YQ
Sbjct: 901  MPKPCEDDTFQSESLANAVPVGSSSPRVRHSLSQMLLEESSEPDTCEWGNAENPPAMIYQ 960

Query: 574  KDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASLQ 395
            KD PKG KRLLKFARKSKGD   TGWSSPSVFSEGEDDAEE K ++KR+AD LL+KA+L 
Sbjct: 961  KDAPKGLKRLLKFARKSKGDTGSTGWSSPSVFSEGEDDAEELKNSNKRNADNLLRKAALN 1020

Query: 394  ANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSVSTTRKYSAQSSHKLQEGRDISAGA 215
               +G  KN+    Y                         +   + SHK+Q+G    AG 
Sbjct: 1021 VKSYGQPKNSVHDGY--------------------ERNLGRGDGKGSHKMQDG----AGP 1056

Query: 214  TSTKASRSFFSLPTFRSSKSNEGK 143
            T T+ASRSFFSL  FR SK +E K
Sbjct: 1057 T-TRASRSFFSLSAFRGSKPSESK 1079


>ref|XP_007011432.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508728345|gb|EOY20242.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 1088

 Score =  189 bits (480), Expect = 1e-45
 Identities = 110/248 (44%), Positives = 145/248 (58%)
 Frame = -3

Query: 937  LEPEASVNSQENCEKTENHDQFVSQGDGSFTNTAESPVEIQSQEVMTISPVAWXXXXXXX 758
            ++ E  VN  +  +  E+ D+     D    N AES    + +E +TISP AW       
Sbjct: 849  IQLETQVNGHQKSDVIESIDELAPDVDDGLKNIAESS---KCEEELTISPAAWVEIEEHQ 905

Query: 757  XXXXPCDSCEPQITLPTNIAPLAXXXXXXXXXXSQMLQEDNGESEIAEWGKAENPPTVVY 578
                 CD    + T   +IAP+           SQMLQE++ E++  EWG AENPP +VY
Sbjct: 906  DLPNQCDDNTGENTSSASIAPVGSASPRVRHSLSQMLQEESSEADTTEWGNAENPPAMVY 965

Query: 577  QKDVPKGFKRLLKFARKSKGDANVTGWSSPSVFSEGEDDAEEPKAASKRHADVLLKKASL 398
            QKD PKG KRLLKFARKSKGDAN+TGWSSPSVFSEGEDDAEE KA +KR+AD LL+KA+L
Sbjct: 966  QKDAPKGLKRLLKFARKSKGDANITGWSSPSVFSEGEDDAEESKAINKRNADNLLRKAAL 1025

Query: 397  QANGFGHQKNTFVSSYGGGNSSTRASDSVLSHDLPSVSTTRKYSAQSSHKLQEGRDISAG 218
            QA  +G QK   +S  G  N        + +H+LPS  +    S   +HK+ +G  +S  
Sbjct: 1026 QAKNYGQQK---MSCEGYEN-------HLGAHELPSAQS--GISTFDAHKMHKG-SVSTA 1072

Query: 217  ATSTKASR 194
            A++TK  +
Sbjct: 1073 ASTTKGDK 1080


Top