BLASTX nr result

ID: Forsythia23_contig00022617

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00022617
         (1322 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011084440.1| PREDICTED: uncharacterized protein LOC105166...   559   e-156
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   501   e-139
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   499   e-138
ref|XP_009788653.1| PREDICTED: uncharacterized protein LOC104236...   496   e-137
ref|XP_009625236.1| PREDICTED: uncharacterized protein LOC104116...   493   e-136
ref|XP_010317637.1| PREDICTED: uncharacterized protein LOC101260...   493   e-136
ref|XP_009788654.1| PREDICTED: uncharacterized protein LOC104236...   478   e-132
ref|XP_010317636.1| PREDICTED: uncharacterized protein LOC101260...   475   e-131
emb|CDP05166.1| unnamed protein product [Coffea canephora]            470   e-130
ref|XP_011076135.1| PREDICTED: uncharacterized protein LOC105160...   459   e-126
ref|XP_012090213.1| PREDICTED: uncharacterized protein LOC105648...   425   e-116
ref|XP_008234199.1| PREDICTED: uncharacterized protein LOC103333...   424   e-116
ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun...   424   e-116
ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citr...   423   e-115
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   421   e-115
gb|KDO51973.1| hypothetical protein CISIN_1g010808mg [Citrus sin...   419   e-114
ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-...   419   e-114
gb|KHG09821.1| hypothetical protein F383_13171 [Gossypium arboreum]   418   e-114
ref|XP_012440132.1| PREDICTED: uncharacterized protein LOC105765...   417   e-113
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   415   e-113

>ref|XP_011084440.1| PREDICTED: uncharacterized protein LOC105166690 [Sesamum indicum]
          Length = 479

 Score =  559 bits (1441), Expect = e-156
 Identities = 284/405 (70%), Positives = 309/405 (76%), Gaps = 30/405 (7%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            MSSVHNS             AESRVQPSTVQKRRWGSCWS+YWCFGSHK +KRIGHAVLV
Sbjct: 1    MSSVHNSVETVNAAATAIVTAESRVQPSTVQKRRWGSCWSIYWCFGSHKQSKRIGHAVLV 60

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795
             EP   GV AP++ENRN S+TIVLPFI       SFLQSDPPSATQSPAGL         
Sbjct: 61   SEPAAAGVAAPISENRNQSSTIVLPFIAPPSSPASFLQSDPPSATQSPAGLISLASLSVH 120

Query: 794  ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                 L SPPVFS FTTEPSTA FTPPPE VQ+TTPSSPEVPFA
Sbjct: 121  ANSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTPSSPEVPFA 180

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 498
            QLL+SSLARNRRN GTNLK+SLSQY+FQPYQYPGSPGG +KSPGSA+STSGTSSPFPDK 
Sbjct: 181  QLLSSSLARNRRNCGTNLKYSLSQYEFQPYQYPGSPGGHIKSPGSALSTSGTSSPFPDKH 240

Query: 497  PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 318
            PI+EFR GEAPKFLGYEHF N KWGSRVGSGSLTP GWGSRLGSG LTPNGGLSRLGSGT
Sbjct: 241  PIMEFRMGEAPKFLGYEHFPNYKWGSRVGSGSLTPNGWGSRLGSGALTPNGGLSRLGSGT 300

Query: 317  VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 138
            +TPNGGEPPS+D  LLE+QI EVASLANSD +S++ + VVD+RVSFEL GEDIPTC + E
Sbjct: 301  LTPNGGEPPSRDGNLLENQIYEVASLANSDRKSQNDDAVVDHRVSFELFGEDIPTCVVTE 360

Query: 137  SVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSEV 3
            S PS+K A      A   G N KDL  KN D CRE++DGET +EV
Sbjct: 361  SAPSHKNASGYPGVATAEGTNNKDLTTKNADSCREHNDGETTNEV 405


>ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum]
          Length = 470

 Score =  501 bits (1291), Expect = e-139
 Identities = 263/390 (67%), Positives = 284/390 (72%), Gaps = 32/390 (8%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            MSSV N+             AESRVQPSTVQKRRWGSCWSLYWCFGSHK +KRIGHAVLV
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLX------- 789
            PEP  PG   PV EN NHSATIV+PFI       SFL SDPPSATQSPAGLL        
Sbjct: 61   PEPAAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSLSIN 120

Query: 788  -----------------------SPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                   SPPVFS FTTEPSTA FTPPPE V +TTP SPEVPFA
Sbjct: 121  AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPFA 180

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 498
            QLLTSSLARNRR +G+N KF LSQY+F PYQ PGSPG  L SPGS +S SGTSSPFP K 
Sbjct: 181  QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKC 240

Query: 497  PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 318
            PI+EFRKGE PKFLGYEHF+ RKWGSRVGSGSLTP+GWGSRLGSGTLTPNGG+SRLGSGT
Sbjct: 241  PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGT 300

Query: 317  VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 138
            VTPNGGEPPS+DSYLLE QISEVASLANSDN S+ GE V+D+RVSFEL GED+P+C  KE
Sbjct: 301  VTPNGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREKE 360

Query: 137  SVPSY--KTALKTVPEAAPNGMNQKDLMAK 54
             V S+  +T    V     N M     MA+
Sbjct: 361  PVMSHSQQTLPMDVSNLLANEMKSGSSMAE 390


>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 isoform X2 [Solanum
            lycopersicum]
          Length = 470

 Score =  499 bits (1285), Expect = e-138
 Identities = 255/365 (69%), Positives = 276/365 (75%), Gaps = 30/365 (8%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            MSSV N+             AESRVQPSTVQKRRWGSCWSLYWCFGSHK +KRIGHAVLV
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLX------- 789
            PEP  PG   PV EN NHSATIV+PFI       SFL SDPPSATQSPAGLL        
Sbjct: 61   PEPVAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSIN 120

Query: 788  -----------------------SPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                   SPPVFS FTTEPSTA FTPPPE V +TTP SPEVPFA
Sbjct: 121  AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFA 180

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 498
            QLLTSSLARNRR +G+N KF LSQY+F PYQ PGSPG  L SPGS +S SGTSSPFP K 
Sbjct: 181  QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKC 240

Query: 497  PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 318
            PI+EFRKGE PKFLGYEHF+ RKWGSRVGSGS+TP+GWGSRLGSGTLTPNGG+SRLGSGT
Sbjct: 241  PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGT 300

Query: 317  VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 138
            VTPNGGEPPS+DSYLLE+QISEVASLANSDN S+ GE V+D+RVSFEL  ED+P+C  KE
Sbjct: 301  VTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKE 360

Query: 137  SVPSY 123
             V S+
Sbjct: 361  PVMSH 365


>ref|XP_009788653.1| PREDICTED: uncharacterized protein LOC104236433 isoform X1 [Nicotiana
            sylvestris]
          Length = 470

 Score =  496 bits (1277), Expect = e-137
 Identities = 260/399 (65%), Positives = 290/399 (72%), Gaps = 30/399 (7%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            MSSV N+             AESRVQPS+VQKRRWGSCWSLYWCFGS+K +KRIGHAVLV
Sbjct: 1    MSSVQNTVDTVNAAATAIVTAESRVQPSSVQKRRWGSCWSLYWCFGSYKHSKRIGHAVLV 60

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLX------- 789
            PEP  PG   PV EN N SATIV+PFI       SFL SDPPSATQSPAGLL        
Sbjct: 61   PEPAAPGPAVPVTENPNRSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSFSIN 120

Query: 788  -----------------------SPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                   SPPVFS FTTEPSTA FTPPPE V +TTP SPEVPFA
Sbjct: 121  AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFA 180

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 498
            QLLTSSLARNRR +G+N KF LSQY+F PYQ PGSPG  L SPGS +S SGTSSPFP K 
Sbjct: 181  QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKC 240

Query: 497  PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 318
            PI+EFRKGE PKFLGYEHF+ RKWGSRVGSGSLTP+GWGSRLGSGTLTPNGG+SRLGSGT
Sbjct: 241  PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGT 300

Query: 317  VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 138
            VTPNGGEPPS+D YLLE+QISEVASLANSDN S+  E V+D+RVSFEL GED+P+C  KE
Sbjct: 301  VTPNGGEPPSRDCYLLENQISEVASLANSDNGSEIAEGVIDHRVSFELTGEDVPSCREKE 360

Query: 137  SVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDG 21
             V S+  + +T+P   P   N++  M  ++    E +DG
Sbjct: 361  PVMSH--SQQTLPMDVPAPSNKE--MRSSSSIVEEKTDG 395


>ref|XP_009625236.1| PREDICTED: uncharacterized protein LOC104116142 [Nicotiana
            tomentosiformis]
          Length = 470

 Score =  493 bits (1269), Expect = e-136
 Identities = 257/399 (64%), Positives = 290/399 (72%), Gaps = 30/399 (7%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            MSSV N+             AESRVQPS++QK+RWGSCWSLYWCFGS+K +KRIGHA+LV
Sbjct: 1    MSSVQNTVDTVNAAATAIITAESRVQPSSIQKKRWGSCWSLYWCFGSYKHSKRIGHAILV 60

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLX------- 789
            PEP  PG   PV EN N SATIV+PFI       SFL SDPPSATQSPAGLL        
Sbjct: 61   PEPAAPGPAVPVTENPNRSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSFSIN 120

Query: 788  -----------------------SPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                   SPPVFS FTTEPSTA FTPPPE V +TTP SPEVPFA
Sbjct: 121  AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFA 180

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 498
            QLLTSSLARNRR +G+N KF LSQY+F PYQ PGSPG  L SPGS +S SGTSSPFP K 
Sbjct: 181  QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSSLISPGSVVSNSGTSSPFPGKC 240

Query: 497  PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 318
            PI+EFRKGE PKFLGYEHF+ RKWGSRVGSGSLTP+GWGSRLGSGTLTPNGG+SRLGSGT
Sbjct: 241  PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGT 300

Query: 317  VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 138
            VTPNGGEPPS+D YLLE+QISEVASLANSDN S+  E V+D+RVSFEL GED+P+C  KE
Sbjct: 301  VTPNGGEPPSRDCYLLENQISEVASLANSDNGSEIAEGVIDHRVSFELTGEDVPSCREKE 360

Query: 137  SVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDG 21
             V S+  + +T+P   P   N++  M  ++    E +DG
Sbjct: 361  PVMSH--SQQTLPMDVPAPSNKE--MRSSSSNVEEKTDG 395


>ref|XP_010317637.1| PREDICTED: uncharacterized protein LOC101260903 isoform X3 [Solanum
            lycopersicum]
          Length = 469

 Score =  493 bits (1268), Expect = e-136
 Identities = 254/365 (69%), Positives = 275/365 (75%), Gaps = 30/365 (8%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            MSSV N+             AESRVQPSTVQ RRWGSCWSLYWCFGSHK +KRIGHAVLV
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQ-RRWGSCWSLYWCFGSHKHSKRIGHAVLV 59

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLX------- 789
            PEP  PG   PV EN NHSATIV+PFI       SFL SDPPSATQSPAGLL        
Sbjct: 60   PEPVAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSIN 119

Query: 788  -----------------------SPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                   SPPVFS FTTEPSTA FTPPPE V +TTP SPEVPFA
Sbjct: 120  AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFA 179

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 498
            QLLTSSLARNRR +G+N KF LSQY+F PYQ PGSPG  L SPGS +S SGTSSPFP K 
Sbjct: 180  QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKC 239

Query: 497  PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 318
            PI+EFRKGE PKFLGYEHF+ RKWGSRVGSGS+TP+GWGSRLGSGTLTPNGG+SRLGSGT
Sbjct: 240  PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGT 299

Query: 317  VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 138
            VTPNGGEPPS+DSYLLE+QISEVASLANSDN S+ GE V+D+RVSFEL  ED+P+C  KE
Sbjct: 300  VTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKE 359

Query: 137  SVPSY 123
             V S+
Sbjct: 360  PVMSH 364


>ref|XP_009788654.1| PREDICTED: uncharacterized protein LOC104236433 isoform X2 [Nicotiana
            sylvestris]
          Length = 442

 Score =  478 bits (1229), Expect = e-132
 Identities = 246/370 (66%), Positives = 275/370 (74%), Gaps = 30/370 (8%)
 Frame = -1

Query: 1040 VQKRRWGSCWSLYWCFGSHKPNKRIGHAVLVPEPTPPGVEAPVAENRNHSATIVLPFIXX 861
            +QKRRWGSCWSLYWCFGS+K +KRIGHAVLVPEP  PG   PV EN N SATIV+PFI  
Sbjct: 2    MQKRRWGSCWSLYWCFGSYKHSKRIGHAVLVPEPAAPGPAVPVTENPNRSATIVIPFIAP 61

Query: 860  XXXXXSFLQSDPPSATQSPAGLLX------------------------------SPPVFS 771
                 SFL SDPPSATQSPAGLL                               SPPVFS
Sbjct: 62   PSSPASFLPSDPPSATQSPAGLLSLKSFSINAYSPGGTASIFAIGPYAHETQLVSPPVFS 121

Query: 770  AFTTEPSTACFTPPPESVQITTPSSPEVPFAQLLTSSLARNRRNNGTNLKFSLSQYDFQP 591
             FTTEPSTA FTPPPE V +TTP SPEVPFAQLLTSSLARNRR +G+N KF LSQY+F P
Sbjct: 122  TFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYEFVP 181

Query: 590  YQYPGSPGGRLKSPGSAISTSGTSSPFPDKLPIVEFRKGEAPKFLGYEHFANRKWGSRVG 411
            YQ PGSPG  L SPGS +S SGTSSPFP K PI+EFRKGE PKFLGYEHF+ RKWGSRVG
Sbjct: 182  YQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVG 241

Query: 410  SGSLTPTGWGSRLGSGTLTPNGGLSRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANS 231
            SGSLTP+GWGSRLGSGTLTPNGG+SRLGSGTVTPNGGEPPS+D YLLE+QISEVASLANS
Sbjct: 242  SGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDCYLLENQISEVASLANS 301

Query: 230  DNESKDGEVVVDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKN 51
            DN S+  E V+D+RVSFEL GED+P+C  KE V S+  + +T+P   P   N++  M  +
Sbjct: 302  DNGSEIAEGVIDHRVSFELTGEDVPSCREKEPVMSH--SQQTLPMDVPAPSNKE--MRSS 357

Query: 50   TDCCREYSDG 21
            +    E +DG
Sbjct: 358  SSIVEEKTDG 367


>ref|XP_010317636.1| PREDICTED: uncharacterized protein LOC101260903 isoform X1 [Solanum
            lycopersicum]
          Length = 476

 Score =  475 bits (1223), Expect = e-131
 Identities = 238/334 (71%), Positives = 259/334 (77%), Gaps = 30/334 (8%)
 Frame = -1

Query: 1034 KRRWGSCWSLYWCFGSHKPNKRIGHAVLVPEPTPPGVEAPVAENRNHSATIVLPFIXXXX 855
            +RRWGSCWSLYWCFGSHK +KRIGHAVLVPEP  PG   PV EN NHSATIV+PFI    
Sbjct: 38   ERRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPVAPGPAVPVTENPNHSATIVIPFIAPPS 97

Query: 854  XXXSFLQSDPPSATQSPAGLLX------------------------------SPPVFSAF 765
               SFL SDPPSATQSPAGLL                               SPPVFS F
Sbjct: 98   SPASFLPSDPPSATQSPAGLLSLKALSINAYSPGGTASIFAIGPYAHETQLVSPPVFSTF 157

Query: 764  TTEPSTACFTPPPESVQITTPSSPEVPFAQLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ 585
            TTEPSTA FTPPPE V +TTP SPEVPFAQLLTSSLARNRR +G+N KF LSQY+F PYQ
Sbjct: 158  TTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQ 217

Query: 584  YPGSPGGRLKSPGSAISTSGTSSPFPDKLPIVEFRKGEAPKFLGYEHFANRKWGSRVGSG 405
             PGSPG  L SPGS +S SGTSSPFP K PI+EFRKGE PKFLGYEHF+ RKWGSRVGSG
Sbjct: 218  DPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSG 277

Query: 404  SLTPTGWGSRLGSGTLTPNGGLSRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDN 225
            S+TP+GWGSRLGSGTLTPNGG+SRLGSGTVTPNGGEPPS+DSYLLE+QISEVASLANSDN
Sbjct: 278  SVTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLENQISEVASLANSDN 337

Query: 224  ESKDGEVVVDYRVSFELIGEDIPTCTIKESVPSY 123
             S+ GE V+D+RVSFEL  ED+P+C  KE V S+
Sbjct: 338  GSEIGEAVIDHRVSFELTEEDVPSCREKEPVMSH 371


>emb|CDP05166.1| unnamed protein product [Coffea canephora]
          Length = 452

 Score =  470 bits (1210), Expect = e-130
 Identities = 244/360 (67%), Positives = 269/360 (74%), Gaps = 30/360 (8%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            MSSVHNS             AESRVQP TVQKRRWGSCWS YWCFGS K +KRIG+AVLV
Sbjct: 1    MSSVHNSVETVNAAATAIVTAESRVQPPTVQKRRWGSCWSFYWCFGSVKNSKRIGNAVLV 60

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLX------- 789
            PEPT PG   PV +N NHSATIV+PFI       SFLQSDPPSATQSPA  L        
Sbjct: 61   PEPTVPGSAVPVPDNLNHSATIVIPFIAPPSSPASFLQSDPPSATQSPAKFLPLASFSVN 120

Query: 788  -----------------------SPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                   SPPVFSAFTTEPSTA FTPPPE VQ+TTPSSPEVPFA
Sbjct: 121  TYSPSGAASIFAIGPYAHETQLVSPPVFSAFTTEPSTASFTPPPEPVQLTTPSSPEVPFA 180

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 498
            QLL SSL  NRR++GT++KF LSQY+FQPYQ PGSPG  L SPGSAIS SGTSSPFP+K 
Sbjct: 181  QLLVSSLTHNRRHSGTSIKFPLSQYEFQPYQCPGSPGSHLISPGSAISNSGTSSPFPEKR 240

Query: 497  PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 318
            PI+EFR GEAPKFLGYE F  RKWGSRVGSGSLTP GWGSRLGSG+LTPNGG+SRLGSGT
Sbjct: 241  PIIEFRIGEAPKFLGYELF-TRKWGSRVGSGSLTPNGWGSRLGSGSLTPNGGISRLGSGT 299

Query: 317  VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 138
            +TPNGGEP ++DSYLLE+QISEVASLANSDN + + E ++D+RVSFEL  E +P C  +E
Sbjct: 300  LTPNGGEPAARDSYLLENQISEVASLANSDNGTHNEEGLMDHRVSFELTAEHVPNCVEEE 359


>ref|XP_011076135.1| PREDICTED: uncharacterized protein LOC105160458 [Sesamum indicum]
          Length = 466

 Score =  459 bits (1180), Expect = e-126
 Identities = 251/418 (60%), Positives = 285/418 (68%), Gaps = 44/418 (10%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            M+SVHNS             AE+R QPSTVQKRRWGSCWSLYWCFGS+K +KRIGHAVL+
Sbjct: 1    MTSVHNSAETLNAAATAIVTAENRAQPSTVQKRRWGSCWSLYWCFGSYKHSKRIGHAVLI 60

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795
             EPT     APV EN N SAT++LPFI       SFLQSDPPSATQS AGL         
Sbjct: 61   SEPTAQVAVAPVVENLNRSATLMLPFIAPPSSPASFLQSDPPSATQSAAGLVSLAALSVH 120

Query: 794  ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                 L SPPVFSAFTTEPSTA FTPPPE VQ+TTPSSPEVPFA
Sbjct: 121  TYSPGGTAPIFTIGPYAYETQLVSPPVFSAFTTEPSTASFTPPPEPVQMTTPSSPEVPFA 180

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 498
            QLL+SSLARNRRN+G N+K SLSQY+F  Y+          SPGSA+S+SGTSSPFPDK 
Sbjct: 181  QLLSSSLARNRRNSG-NMKSSLSQYEFLAYE----------SPGSALSSSGTSSPFPDKW 229

Query: 497  PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 318
            P+VE R+GEAP F+GYEHF N KWGSRVGSGSLTP G GSRLGSG LTPNGGLSRLGSG 
Sbjct: 230  PVVEIRRGEAPIFIGYEHFFNHKWGSRVGSGSLTPNGRGSRLGSGALTPNGGLSRLGSGA 289

Query: 317  VTPNGG--------------EPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSF 180
            +TPNGG              EPPS+D  LL + ISEV SLANS NE ++ + VVD+RVSF
Sbjct: 290  LTPNGGLSRLGSGALTPNGGEPPSRDCNLLGNPISEVVSLANSGNELQNCDAVVDHRVSF 349

Query: 179  ELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSE 6
            EL GEDIPTC + E+VPS K   + + EA     N  D MAK ++  R+ S+GET+ E
Sbjct: 350  ELSGEDIPTCVVSETVPSPKMESRDLQEATAEVTNHSDFMAKVSETYRKLSNGETMHE 407


>ref|XP_012090213.1| PREDICTED: uncharacterized protein LOC105648441 [Jatropha curcas]
            gi|643706116|gb|KDP22248.1| hypothetical protein
            JCGZ_26079 [Jatropha curcas]
          Length = 498

 Score =  425 bits (1092), Expect = e-116
 Identities = 238/426 (55%), Positives = 281/426 (65%), Gaps = 52/426 (12%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            M SV+NS             AESRVQP+ VQKRRWG CWSLYWCFGSHK +KRIGHAVLV
Sbjct: 1    MRSVNNSVETINAAATAIISAESRVQPTVVQKRRWGGCWSLYWCFGSHKNSKRIGHAVLV 60

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795
            PEP  P      AEN+ HS    +PFI       SFLQSDPPS TQSPAGL         
Sbjct: 61   PEPEVPQAVVTSAENQTHSTAAAVPFIAPPSSPASFLQSDPPSVTQSPAGLLSLTALSVS 120

Query: 794  ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                 L +PPVFSAFTTEPSTA FTPPPESVQ+TTPSSPEVPFA
Sbjct: 121  AYSPGGPASIFAIGPYAHETQLVTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFA 180

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 501
            QLLTSSL R RRN+G N KF+LS Y+FQ Y  YPGSPGG+L SPGS IS SGTSSPFPD+
Sbjct: 181  QLLTSSLERARRNSGANQKFALSHYEFQSYPLYPGSPGGQLISPGSIISNSGTSSPFPDR 240

Query: 500  LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP------------------TGWGSR 375
             P++EFR GEAPK LG+EHF  RKWGSR+GSG+LTP                   G GSR
Sbjct: 241  HPLLEFRMGEAPKLLGFEHFTTRKWGSRLGSGTLTPDGVGLGSRLCSGTATPDGVGLGSR 300

Query: 374  LGSGTLTPNG-GL-SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVV 201
            LGSG++TP+G GL SRLGSG++TP+   P SQD  LLE+QISEVASLANS+N SK+ E +
Sbjct: 301  LGSGSVTPDGVGLRSRLGSGSLTPDCVVPASQDGLLLENQISEVASLANSENASKNDENI 360

Query: 200  VDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEA-APNGMNQKDLMAKNTDCCREYSD 24
            VD+RVSFEL GE++  C   +S+ S +T  +   ++ A   +N ++++  + DC      
Sbjct: 361  VDHRVSFELSGEEVARCLESKSMTSSRTFSECPQDSMAEEQINSEEILINSNDCLH---I 417

Query: 23   GETVSE 6
            GET +E
Sbjct: 418  GETSNE 423


>ref|XP_008234199.1| PREDICTED: uncharacterized protein LOC103333182 [Prunus mume]
          Length = 499

 Score =  424 bits (1091), Expect = e-116
 Identities = 233/426 (54%), Positives = 279/426 (65%), Gaps = 51/426 (11%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            M SV++S             AE+R QP+TV KRRWGSCWSLYWCFGSHK NKRIGHAVLV
Sbjct: 1    MRSVNSSVDTINAAATAIVSAEARAQPTTVPKRRWGSCWSLYWCFGSHK-NKRIGHAVLV 59

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLX------- 789
            PEP  PG      +N+  S  IV+PFI       SFL SDPPSATQSPAG L        
Sbjct: 60   PEPVVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLSAN 119

Query: 788  -----------------------SPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                   SPPVFS F TEPSTA FTPPPESVQ+TTPSSPEVPFA
Sbjct: 120  AYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVPFA 179

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPY-QYPGSPGGRLKSPGSAISTSGTSSPFPDK 501
            QLLTSSL RNRRN+GTN KF+LS Y+FQPY QYPGSPGG L SPGSA+S SGTSSPFPD+
Sbjct: 180  QLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFPDR 239

Query: 500  LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNGGL---- 339
             P++EF  GEAPK  G++HF  RKWGSR+GSGSLTP   G GSRLGSG+LTP+G      
Sbjct: 240  HPVLEFHMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELGSR 299

Query: 338  --------------SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVV 201
                          SRLGSG +TP+G  P S+DS+LLE+QISEVASLANS++  +  E V
Sbjct: 300  LGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVETV 359

Query: 200  VDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDG 21
             D+RVSFEL GED+  C   +++ S +TA  +    A +  +++D ++ ++    E+S  
Sbjct: 360  FDHRVSFELTGEDVACCLANKAMASNRTASGSSKVIASDYPSERDALSSDSSNHCEFSVE 419

Query: 20   ETVSEV 3
            E+ S +
Sbjct: 420  ESSSRI 425


>ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica]
            gi|462415503|gb|EMJ20240.1| hypothetical protein
            PRUPE_ppa004616mg [Prunus persica]
          Length = 499

 Score =  424 bits (1091), Expect = e-116
 Identities = 234/426 (54%), Positives = 278/426 (65%), Gaps = 51/426 (11%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            M SV++S             AE+R QP+TV KRRWGSCWSLYWCFG HK NKRIGHAVLV
Sbjct: 1    MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFGPHK-NKRIGHAVLV 59

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLX------- 789
            PEP  PG      +N+  S  IV+PFI       SFL SDPPSATQSPAG L        
Sbjct: 60   PEPVVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLSAN 119

Query: 788  -----------------------SPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                   SPPVFS F TEPSTA FTPPPESVQ+TTPSSPEVPFA
Sbjct: 120  AYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVPFA 179

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPY-QYPGSPGGRLKSPGSAISTSGTSSPFPDK 501
            QLLTSSL RNRRN+GTN KF+LS Y+FQPY QYPGSPGG L SPGSA+S SGTSSPFPD+
Sbjct: 180  QLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFPDR 239

Query: 500  LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNGGL---- 339
             P++EFR GEAPK  G++HF  RKWGSR+GSGSLTP   G GSRLGSG+LTP+G      
Sbjct: 240  HPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELGSR 299

Query: 338  --------------SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVV 201
                          SRLGSG +TP+G  P S+DS+LLE+QISEVASLANS++  +  E V
Sbjct: 300  LGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVETV 359

Query: 200  VDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDG 21
             D+RVSFEL GED+  C   ++V S +TA  +    A    +++D ++ ++    E+S  
Sbjct: 360  FDHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCEFSVE 419

Query: 20   ETVSEV 3
            E+ S +
Sbjct: 420  ESSSRI 425


>ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citrus clementina]
            gi|557523850|gb|ESR35217.1| hypothetical protein
            CICLE_v10004813mg [Citrus clementina]
          Length = 500

 Score =  423 bits (1087), Expect = e-115
 Identities = 230/405 (56%), Positives = 270/405 (66%), Gaps = 51/405 (12%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            MSSVH+S             AESR++P+ +QKRRWGSCWSLYWCFGSHK +KRI HAVLV
Sbjct: 1    MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLV 60

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795
            PEP   G  AP AE + HS  IVLPFI       SFLQSDPPSATQSPAGL         
Sbjct: 61   PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLNSLSVN 120

Query: 794  ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                 L +PPVFSAFTTEPSTA  TPPPESVQ+TTPSSPEVPFA
Sbjct: 121  AYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFA 180

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 501
            QLLTSSL R RRN+GTN K SLS Y +QPYQ YPGSPGG+L SPGS +S SGTSSPFPD+
Sbjct: 181  QLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDR 240

Query: 500  LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP------------------TGWGSR 375
             PI++F    APK LG+EHF  RKWGSR+GSGS+TP                   G GSR
Sbjct: 241  HPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSR 300

Query: 374  LGSGTLTPNG-GL-SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVV 201
            LGSGT+TP+G GL SRLGSG++TP+G  P S+D ++ E+QISEVASLANSDN +K  E +
Sbjct: 301  LGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHI 360

Query: 200  VDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKD 66
            +D+RVSFEL GE++  C   +S  S +   +   +  P G  ++D
Sbjct: 361  IDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRD 405


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  421 bits (1082), Expect = e-115
 Identities = 233/409 (56%), Positives = 275/409 (67%), Gaps = 35/409 (8%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            M SV++S             A+SRVQP+TVQK+RWGSCW LYWCFGS K +KRIGHAVLV
Sbjct: 1    MRSVNDSVETVNAAATAIVSADSRVQPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLV 60

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795
            PEP  PG     AEN ++   I+LPFI       SFLQSDPPSATQSPAGL         
Sbjct: 61   PEPVVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVN 120

Query: 794  ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                 L +PPVFSA TTEPSTA FTPPPESVQ+TTPSSPEVPFA
Sbjct: 121  AYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFA 180

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 501
            QLLTSSL R RRN+G N KF LS Y+FQ YQ YPGSPGG L SPGSAIS SGTSSPFPD+
Sbjct: 181  QLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDR 240

Query: 500  LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNG-GL-SR 333
             PI+EFR GEAPK LG+E+F  RKWGSR+GSGSLTP   G GSRLGSG++TP+G GL SR
Sbjct: 241  RPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSR 300

Query: 332  LGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPT 153
            LGSG++TP+G  P S+D +L+ SQISEVA LAN  N  K+ E +VD+RVSFEL GED+  
Sbjct: 301  LGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAP 360

Query: 152  CTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSE 6
            C   +S+   +   +   +    G  ++D + K+ +   E    ET +E
Sbjct: 361  CLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNE 409


>gb|KDO51973.1| hypothetical protein CISIN_1g010808mg [Citrus sinensis]
          Length = 500

 Score =  419 bits (1077), Expect = e-114
 Identities = 228/405 (56%), Positives = 269/405 (66%), Gaps = 51/405 (12%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            MSSVH+S             AESR++P+ +QKRRWGSCWSLYWCFGSHK +KRI HAVL+
Sbjct: 1    MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLL 60

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795
            PEP   G  AP AE + HS  IVLPFI       SFLQSDP SATQSPAGL         
Sbjct: 61   PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPSSATQSPAGLLCLNSLSVN 120

Query: 794  ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                 L +PPVFSAFTTEPSTA  TPPPESVQ+TTPSSPEVPFA
Sbjct: 121  AYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFA 180

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 501
            QLLTSSL R RRN+GTN K SLS Y +QPYQ YPGSPGG+L SPGS +S SGTSSPFPD+
Sbjct: 181  QLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDR 240

Query: 500  LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP------------------TGWGSR 375
             PI++F    APK LG+EHF  RKWGSR+GSGS+TP                   G GSR
Sbjct: 241  RPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSR 300

Query: 374  LGSGTLTPNG-GL-SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVV 201
            LGSGT+TP+G GL SRLGSG++TP+G  P S+D ++ E+QISEVASLANSDN +K  E +
Sbjct: 301  LGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHI 360

Query: 200  VDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKD 66
            +D+RVSFEL GE++  C   +S  S +   +   +  P G  ++D
Sbjct: 361  IDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRD 405


>ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-like [Citrus sinensis]
          Length = 500

 Score =  419 bits (1076), Expect = e-114
 Identities = 228/405 (56%), Positives = 269/405 (66%), Gaps = 51/405 (12%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            MSSVH+S             AESR++P+ +QKRRWGSCWSLYWCFGSHK +KRI HAVL+
Sbjct: 1    MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLL 60

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795
            PEP   G  AP AE + HS  IVLPFI       SFLQSDP SATQSPAGL         
Sbjct: 61   PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPSSATQSPAGLLSLNSLSVN 120

Query: 794  ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                 L +PPVFSAFTTEPSTA  TPPPESVQ+TTPSSPEVPFA
Sbjct: 121  AYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFA 180

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 501
            QLLTSSL R RRN+GTN K SLS Y +QPYQ YPGSPGG+L SPGS +S SGTSSPFPD+
Sbjct: 181  QLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDR 240

Query: 500  LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP------------------TGWGSR 375
             PI++F    APK LG+EHF  RKWGSR+GSGS+TP                   G GSR
Sbjct: 241  HPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSR 300

Query: 374  LGSGTLTPNG-GL-SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVV 201
            LGSGT+TP+G GL SRLGSG++TP+G  P S+D ++ E+QISEVASLANSDN +K  E +
Sbjct: 301  LGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHI 360

Query: 200  VDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKD 66
            +D+RVSFEL GE++  C   +S  S +   +   +  P G  ++D
Sbjct: 361  IDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRD 405


>gb|KHG09821.1| hypothetical protein F383_13171 [Gossypium arboreum]
          Length = 465

 Score =  418 bits (1075), Expect = e-114
 Identities = 232/393 (59%), Positives = 268/393 (68%), Gaps = 35/393 (8%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            M SV++S             AESRVQP+TVQK+RWGSCWS YWCFGSHK +KRIGHAVLV
Sbjct: 1    MRSVNDSVETVNAAASAIVSAESRVQPTTVQKKRWGSCWSFYWCFGSHKSSKRIGHAVLV 60

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795
            PEP  PG     AEN ++   IV+PFI       SFLQSDPPSATQSPAGL         
Sbjct: 61   PEPVVPGASVSTAENASNPTGIVMPFIAPPSSPASFLQSDPPSATQSPAGLLSLTALSVN 120

Query: 794  ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                 L +PPVFSA TTEPSTA FTPPPESVQ+TTPSSPEVPFA
Sbjct: 121  AYSPRGPASIFSIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFA 180

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 501
            QLLTSSL R RRN+G N KF LS Y+FQ YQ YPGSPGG L SPGS IS SGTSSPFPD+
Sbjct: 181  QLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSVISNSGTSSPFPDR 240

Query: 500  LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNG-GL-SR 333
             PI+EFR GEAPK LG+EHF  RKWGSR+GSGSLTP   G GSRLGS  +TP+G GL SR
Sbjct: 241  RPILEFRMGEAPKTLGFEHFTTRKWGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGLGSR 300

Query: 332  LGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPT 153
            LGSG++TP+G  P S+D + +ESQ SEVA L+N  N  K+ E++VD+RVSFEL GED+  
Sbjct: 301  LGSGSLTPDGLGPASRDGFPIESQNSEVALLSNPPNGPKNDEIIVDHRVSFELSGEDVAR 360

Query: 152  CTIKESVPSYKTALKTVPEAAPNGMNQKDLMAK 54
            C   +S+ S +T         P+    KDL+A+
Sbjct: 361  CLKNKSLVSSRT--------MPDYEYPKDLVAQ 385


>ref|XP_012440132.1| PREDICTED: uncharacterized protein LOC105765522 [Gossypium raimondii]
            gi|763785675|gb|KJB52746.1| hypothetical protein
            B456_008G275500 [Gossypium raimondii]
          Length = 465

 Score =  417 bits (1071), Expect = e-113
 Identities = 227/372 (61%), Positives = 260/372 (69%), Gaps = 35/372 (9%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948
            M SV++S             AESRVQP+TVQK+RWGSCWS YWCFGSHK +KRIGHAVLV
Sbjct: 1    MRSVNDSVETVNAAASAIVSAESRVQPTTVQKKRWGSCWSFYWCFGSHKSSKRIGHAVLV 60

Query: 947  PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795
            PEP  PG     AEN ++   IV+PFI       SFLQSDPPSATQSPAGL         
Sbjct: 61   PEPVVPGALVSTAENASNPTGIVMPFIAPPSSPASFLQSDPPSATQSPAGLLSLTALSVN 120

Query: 794  ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678
                                 L +PPVFSA TTEPSTA FTPPPESVQ+TTPSSPEVPFA
Sbjct: 121  AYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFA 180

Query: 677  QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 501
            QLLTSSL R RRN+G N KF LS Y+FQ YQ YPGSPGG L SPGS IS SGTSSPFPD+
Sbjct: 181  QLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSVISNSGTSSPFPDR 240

Query: 500  LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNG-GL-SR 333
             PI+EFR GEAPK LG+EHF  RKWGSR+GSGSLTP   G GSRLGS  +TP+G GL SR
Sbjct: 241  RPILEFRMGEAPKTLGFEHFTTRKWGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGLGSR 300

Query: 332  LGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPT 153
            LGSG++TP+G  P S+D + +ESQ SEVA L+N  N  K+ E++VD+RVSFEL GED+  
Sbjct: 301  LGSGSLTPDGLGPASRDGFPIESQNSEVALLSNPPNGPKNDEIIVDHRVSFELSGEDVAR 360

Query: 152  CTIKESVPSYKT 117
            C   +S+ S +T
Sbjct: 361  CLKNKSLVSSRT 372


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  415 bits (1067), Expect = e-113
 Identities = 233/413 (56%), Positives = 275/413 (66%), Gaps = 39/413 (9%)
 Frame = -1

Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQ----KRRWGSCWSLYWCFGSHKPNKRIGH 960
            M SV++S             A+SRVQP+TVQ    K+RWGSCW LYWCFGS K +KRIGH
Sbjct: 1    MRSVNDSVETVNAAATAIVSADSRVQPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGH 60

Query: 959  AVLVPEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL----- 795
            AVLVPEP  PG     AEN ++   I+LPFI       SFLQSDPPSATQSPAGL     
Sbjct: 61   AVLVPEPVVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTS 120

Query: 794  -------------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPE 690
                                     L +PPVFSA TTEPSTA FTPPPESVQ+TTPSSPE
Sbjct: 121  LSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPE 180

Query: 689  VPFAQLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSP 513
            VPFAQLLTSSL R RRN+G N KF LS Y+FQ YQ YPGSPGG L SPGSAIS SGTSSP
Sbjct: 181  VPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSP 240

Query: 512  FPDKLPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNG-G 342
            FPD+ PI+EFR GEAPK LG+E+F  RKWGSR+GSGSLTP   G GSRLGSG++TP+G G
Sbjct: 241  FPDRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMG 300

Query: 341  L-SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGE 165
            L SRLGSG++TP+G  P S+D +L+ SQISEVA LAN  N  K+ E +VD+RVSFEL GE
Sbjct: 301  LGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGE 360

Query: 164  DIPTCTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSE 6
            D+  C   +S+   +   +   +    G  ++D + K+ +   E    ET +E
Sbjct: 361  DVAPCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNE 413

