BLASTX nr result

ID: Rehmannia27_contig00003712 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia27_contig00003712
         (2320 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011084440.1| PREDICTED: uncharacterized protein LOC105166...   466   e-154
ref|XP_011076135.1| PREDICTED: uncharacterized protein LOC105160...   434   e-142
ref|XP_009788653.1| PREDICTED: uncharacterized protein LOC104236...   386   e-123
ref|XP_009625236.1| PREDICTED: uncharacterized protein LOC104116...   384   e-122
ref|XP_015070691.1| PREDICTED: uncharacterized protein LOC107015...   382   e-121
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   379   e-120
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   379   e-120
emb|CDP05166.1| unnamed protein product [Coffea canephora]            376   e-119
ref|XP_009788654.1| PREDICTED: uncharacterized protein LOC104236...   375   e-119
ref|XP_015070692.1| PREDICTED: uncharacterized protein LOC107015...   375   e-119
ref|XP_010317637.1| PREDICTED: uncharacterized protein LOC101260...   373   e-118
ref|XP_010317636.1| PREDICTED: uncharacterized protein LOC101260...   363   e-114
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   326   e-100
gb|KHG09821.1| hypothetical protein F383_13171 [Gossypium arboreum]   320   5e-98
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   320   9e-98
ref|XP_012090213.1| PREDICTED: uncharacterized protein LOC105648...   320   2e-97
ref|XP_012440132.1| PREDICTED: uncharacterized protein LOC105765...   318   2e-97
ref|XP_015888763.1| PREDICTED: uncharacterized protein LOC107423...   317   4e-96
gb|KJB52747.1| hypothetical protein B456_008G275500 [Gossypium r...   313   3e-95
ref|XP_009368760.1| PREDICTED: uncharacterized protein LOC103958...   313   3e-95

>ref|XP_011084440.1| PREDICTED: uncharacterized protein LOC105166690 [Sesamum indicum]
          Length = 479

 Score =  466 bits (1198), Expect = e-154
 Identities = 261/480 (54%), Positives = 292/480 (60%), Gaps = 54/480 (11%)
 Frame = +3

Query: 387  MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566
            MSSV NS                  QPSTVQKRRW SCWS+Y C GS+K SKRIGHAVLV
Sbjct: 1    MSSVHNSVETVNAAATAIVTAESRVQPSTVQKRRWGSCWSIYWCFGSHKQSKRIGHAVLV 60

Query: 567  SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746
            S+P      AP +  N N+S+T                QSDPPSAT              
Sbjct: 61   SEPAAAGVAAP-ISENRNQSSTIVLPFIAPPSSPASFLQSDPPSATQSPAGLISLASLSV 119

Query: 747  HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926
            H  SPGGTAPIFTIGPYA+ETQLVSPPVFSTFTTEPSTASF               EV F
Sbjct: 120  HANSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTPSSPEVPF 179

Query: 927  XXXXXXXXXXXXXXXWTN----------------------IKSPGSAISTSGTSSPFPDK 1040
                            TN                      IKSPGSA+STSGTSSPFPDK
Sbjct: 180  AQLLSSSLARNRRNCGTNLKYSLSQYEFQPYQYPGSPGGHIKSPGSALSTSGTSSPFPDK 239

Query: 1041 RAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGS--------------LTPNGGISRLGSG 1178
              I+EF +GE  KF+GYEHF NYKWGSRVGSGS              LTPNGG+SRLGSG
Sbjct: 240  HPIMEFRMGEAPKFLGYEHFPNYKWGSRVGSGSLTPNGWGSRLGSGALTPNGGLSRLGSG 299

Query: 1179 SLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVK 1358
            +LTPNG EP SRD NLLE+QI EVASLANSD +S+NDD VV+HRVSFEL GEDIPTCVV 
Sbjct: 300  TLTPNGGEPPSRDGNLLENQIYEVASLANSDRKSQNDDAVVDHRVSFELFGEDIPTCVVT 359

Query: 1359 ETVRSPK---------IALDTNQKDLMTKNEDSCRENNNAKNVNAIP---------GFDQ 1484
            E+  S K          A  TN KDL TKN DSCRE+N+ +  N +P            Q
Sbjct: 360  ESAPSHKNASGYPGVATAEGTNNKDLTTKNADSCREHNDGETTNEVPEIPLDGEGGELHQ 419

Query: 1485 KHRTVSMGSSKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664
            K RTVS+GSSKDFNFNN+K E+ +KS+++CEWW N+KVV KELGP NSW+FFPMLQSG S
Sbjct: 420  KQRTVSLGSSKDFNFNNAKGEIPEKSSINCEWWTNEKVVRKELGPRNSWSFFPMLQSGAS 479


>ref|XP_011076135.1| PREDICTED: uncharacterized protein LOC105160458 [Sesamum indicum]
          Length = 466

 Score =  434 bits (1116), Expect = e-142
 Identities = 246/474 (51%), Positives = 282/474 (59%), Gaps = 48/474 (10%)
 Frame = +3

Query: 387  MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566
            M+SV NS                  QPSTVQKRRW SCWSLY C GSYKHSKRIGHAVL+
Sbjct: 1    MTSVHNSAETLNAAATAIVTAENRAQPSTVQKRRWGSCWSLYWCFGSYKHSKRIGHAVLI 60

Query: 567  SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746
            S+PT QV+VAP V+N  NRSAT                QSDPPSAT              
Sbjct: 61   SEPTAQVAVAPVVENL-NRSATLMLPFIAPPSSPASFLQSDPPSATQSAAGLVSLAALSV 119

Query: 747  HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926
            HT+SPGGTAPIFTIGPYAYETQLVSPPVFS FTTEPSTASF               EV F
Sbjct: 120  HTYSPGGTAPIFTIGPYAYETQLVSPPVFSAFTTEPSTASFTPPPEPVQMTTPSSPEVPF 179

Query: 927  -----------XXXXXXXXXXXXXXXWTNIKSPGSAISTSGTSSPFPDKRAIIEFCVGEG 1073
                                      +   +SPGSA+S+SGTSSPFPDK  ++E   GE 
Sbjct: 180  AQLLSSSLARNRRNSGNMKSSLSQYEFLAYESPGSALSSSGTSSPFPDKWPVVEIRRGEA 239

Query: 1074 SKFIGYEHFLNYKWGSRVGSGS----------------------------LTPNGGISRL 1169
              FIGYEHF N+KWGSRVGSGS                            LTPNGG+SRL
Sbjct: 240  PIFIGYEHFFNHKWGSRVGSGSLTPNGRGSRLGSGALTPNGGLSRLGSGALTPNGGLSRL 299

Query: 1170 GSGSLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTC 1349
            GSG+LTPNG EP SRDCNLL + ISEV SLANS  E +N D VV+HRVSFEL GEDIPTC
Sbjct: 300  GSGALTPNGGEPPSRDCNLLGNPISEVVSLANSGNELQNCDAVVDHRVSFELSGEDIPTC 359

Query: 1350 VVKETVRSPKI---------ALDTNQKDLMTKNEDSCRENNNAKNVNAIPGFDQKHRTVS 1502
            VV ETV SPK+         A  TN  D M K  ++ R+ +N + ++       ++ T+S
Sbjct: 360  VVSETVPSPKMESRDLQEATAEVTNHSDFMAKVSETYRKLSNGETMH-------ENHTIS 412

Query: 1503 MGSSKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664
            +GSS+DFNFNN+  E++ +  VDCEWW ND VV KEL P N+W FFPMLQSGVS
Sbjct: 413  LGSSRDFNFNNADGELSARIAVDCEWWTNDDVVGKELAPRNNWTFFPMLQSGVS 466


>ref|XP_009788653.1| PREDICTED: uncharacterized protein LOC104236433 isoform X1 [Nicotiana
            sylvestris]
          Length = 470

 Score =  386 bits (991), Expect = e-123
 Identities = 228/473 (48%), Positives = 265/473 (56%), Gaps = 47/473 (9%)
 Frame = +3

Query: 387  MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566
            MSSV+N+                  QPS+VQKRRW SCWSLY C GSYKHSKRIGHAVLV
Sbjct: 1    MSSVQNTVDTVNAAATAIVTAESRVQPSSVQKRRWGSCWSLYWCFGSYKHSKRIGHAVLV 60

Query: 567  SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746
             +P       P V  NPNRSAT                 SDPPSAT              
Sbjct: 61   PEPAAPGPAVP-VTENPNRSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSFSI 119

Query: 747  HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926
            + +SPGGTA IF IGPYA+ETQLVSPPVFSTFTTEPSTA+F               EV F
Sbjct: 120  NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPF 179

Query: 927  XXXXXXXXXXXXXXXWTNIK----------------------SPGSAISTSGTSSPFPDK 1040
                            +N K                      SPGS +S SGTSSPFP K
Sbjct: 180  AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGK 239

Query: 1041 RAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGS--------------LTPNGGISRLGSG 1178
              IIEF  GE  KF+GYEHF   KWGSRVGSGS              LTPNGGISRLGSG
Sbjct: 240  CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 299

Query: 1179 SLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVK 1358
            ++TPNG EP SRDC LLE+QISEVASLANSD  S   +GV++HRVSFEL GED+P+C  K
Sbjct: 300  TVTPNGGEPPSRDCYLLENQISEVASLANSDNGSEIAEGVIDHRVSFELTGEDVPSCREK 359

Query: 1359 ETVRSPKIALDTNQKDLMTKNEDSCRENNN--AKNVNAIP------GFDQ---KHRTVSM 1505
            E V S   +  T   D+   +    R +++   +  + +P      G DQ   KHR ++ 
Sbjct: 360  EPVMSH--SQQTLPMDVPAPSNKEMRSSSSIVEEKTDGLPEKASERGDDQCHRKHRNITF 417

Query: 1506 GSSKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664
            GSSKDF+F+N K EV +K +VDCEWW +DK   KE    N+W FFP+LQ GVS
Sbjct: 418  GSSKDFDFDNVKIEVLEKHSVDCEWWTSDKATGKESSIQNNWTFFPVLQPGVS 470


>ref|XP_009625236.1| PREDICTED: uncharacterized protein LOC104116142 [Nicotiana
            tomentosiformis]
          Length = 470

 Score =  384 bits (986), Expect = e-122
 Identities = 222/471 (47%), Positives = 263/471 (55%), Gaps = 45/471 (9%)
 Frame = +3

Query: 387  MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566
            MSSV+N+                  QPS++QK+RW SCWSLY C GSYKHSKRIGHA+LV
Sbjct: 1    MSSVQNTVDTVNAAATAIITAESRVQPSSIQKKRWGSCWSLYWCFGSYKHSKRIGHAILV 60

Query: 567  SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746
             +P       P V  NPNRSAT                 SDPPSAT              
Sbjct: 61   PEPAAPGPAVP-VTENPNRSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSFSI 119

Query: 747  HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926
            + +SPGGTA IF IGPYA+ETQLVSPPVFSTFTTEPSTA+F               EV F
Sbjct: 120  NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPF 179

Query: 927  XXXXXXXXXXXXXXXWTNIK----------------------SPGSAISTSGTSSPFPDK 1040
                            +N K                      SPGS +S SGTSSPFP K
Sbjct: 180  AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSSLISPGSVVSNSGTSSPFPGK 239

Query: 1041 RAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGS--------------LTPNGGISRLGSG 1178
              IIEF  GE  KF+GYEHF   KWGSRVGSGS              LTPNGGISRLGSG
Sbjct: 240  CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 299

Query: 1179 SLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVK 1358
            ++TPNG EP SRDC LLE+QISEVASLANSD  S   +GV++HRVSFEL GED+P+C  K
Sbjct: 300  TVTPNGGEPPSRDCYLLENQISEVASLANSDNGSEIAEGVIDHRVSFELTGEDVPSCREK 359

Query: 1359 ETVRS------PKIALDTNQKDLMTKNEDSCRENNNAKNVNAIPGFDQ---KHRTVSMGS 1511
            E V S      P      + K++ + + +   + +      +  G DQ   KHR ++ GS
Sbjct: 360  EPVMSHSQQTLPMDVPAPSNKEMRSSSSNVEEKTDGLPEKASERGDDQCHRKHRNITFGS 419

Query: 1512 SKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664
            SKDF+F+N K EV ++ +VDCEWW +DK   KE    N+W FFP+LQ GVS
Sbjct: 420  SKDFDFDNVKIEVLEEDSVDCEWWTSDKATGKESSIQNNWTFFPVLQPGVS 470


>ref|XP_015070691.1| PREDICTED: uncharacterized protein LOC107015045 isoform X1 [Solanum
            pennellii]
          Length = 470

 Score =  382 bits (981), Expect = e-121
 Identities = 223/471 (47%), Positives = 261/471 (55%), Gaps = 45/471 (9%)
 Frame = +3

Query: 387  MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566
            MSSV+N+                  QPSTVQKRRW SCWSLY C GS+KHSKRIGHAVLV
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60

Query: 567  SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746
             +P       P V  NPN SAT                 SDPPSAT              
Sbjct: 61   PEPVAPGPAVP-VTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSI 119

Query: 747  HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926
            + +SPGGTA IF IGPYA+ETQLVSPPVFSTFTTEPSTA+F               EV F
Sbjct: 120  NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPF 179

Query: 927  XXXXXXXXXXXXXXXWTNIK----------------------SPGSAISTSGTSSPFPDK 1040
                            +N K                      SPGS +S SGTSSPFP K
Sbjct: 180  AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGK 239

Query: 1041 RAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGS--------------LTPNGGISRLGSG 1178
              IIEF  GE  KF+GYEHF   KWGSRVGSGS              LTPNGGISRLGSG
Sbjct: 240  CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 299

Query: 1179 SLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVK 1358
            ++TPNG EP SRD  LLE+QISEVASLANSD  S   +GV++HRVSFEL GED+P+C  K
Sbjct: 300  TVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREK 359

Query: 1359 ETVRS---PKIALD------TNQKDLMTKNEDSCRENNNAKNVNAIPGFDQKHRTVSMGS 1511
            E V S   P + +D      +  K   +  E+    +    + +      +KHR ++ GS
Sbjct: 360  EPVMSHSQPTLPMDVSNLLASEMKSGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGS 419

Query: 1512 SKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664
            SKDF+F+N K EV +K ++DCEWW +DK   KE G  N+W FFP+LQ GVS
Sbjct: 420  SKDFDFDNVKIEVLEKDSIDCEWWTSDKAAGKESGIQNNWTFFPVLQPGVS 470


>ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum]
          Length = 470

 Score =  379 bits (974), Expect = e-120
 Identities = 227/471 (48%), Positives = 262/471 (55%), Gaps = 45/471 (9%)
 Frame = +3

Query: 387  MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566
            MSSV+N+                  QPSTVQKRRW SCWSLY C GS+KHSKRIGHAVLV
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60

Query: 567  SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746
             +P       P V  NPN SAT                 SDPPSAT              
Sbjct: 61   PEPAAPGPAVP-VTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSLSI 119

Query: 747  HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926
            + +SPGGTA IF IGPYA+ETQLVSPPVFSTFTTEPSTA+F               EV F
Sbjct: 120  NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPF 179

Query: 927  XXXXXXXXXXXXXXXWTNIK----------------------SPGSAISTSGTSSPFPDK 1040
                            +N K                      SPGS +S SGTSSPFP K
Sbjct: 180  AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGK 239

Query: 1041 RAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGS--------------LTPNGGISRLGSG 1178
              IIEF  GE  KF+GYEHF   KWGSRVGSGS              LTPNGGISRLGSG
Sbjct: 240  CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 299

Query: 1179 SLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVK 1358
            ++TPNG EP SRD  LLE QISEVASLANSD  S   +GV++HRVSFEL GED+P+C  K
Sbjct: 300  TVTPNGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREK 359

Query: 1359 ETVRS-PKIALDTNQKDLMT---KNEDSCRENN--NAKNVNAIPGFDQ---KHRTVSMGS 1511
            E V S  +  L  +  +L+    K+  S  E     +    +  G DQ   KHR ++ GS
Sbjct: 360  EPVMSHSQQTLPMDVSNLLANEMKSGSSMAEEKTYGSPRKASESGEDQCHRKHRNITFGS 419

Query: 1512 SKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664
            SKDF+F+N K EV +K ++DCEWW +DK   KE G  N+W FFP+LQ GVS
Sbjct: 420  SKDFDFDNVKIEVLEKDSIDCEWWTSDKAAGKESGIQNNWTFFPVLQPGVS 470


>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 isoform X2 [Solanum
            lycopersicum]
          Length = 470

 Score =  379 bits (974), Expect = e-120
 Identities = 222/471 (47%), Positives = 263/471 (55%), Gaps = 45/471 (9%)
 Frame = +3

Query: 387  MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566
            MSSV+N+                  QPSTVQKRRW SCWSLY C GS+KHSKRIGHAVLV
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60

Query: 567  SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746
             +P       P V  NPN SAT                 SDPPSAT              
Sbjct: 61   PEPVAPGPAVP-VTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSI 119

Query: 747  HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926
            + +SPGGTA IF IGPYA+ETQLVSPPVFSTFTTEPSTA+F               EV F
Sbjct: 120  NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPF 179

Query: 927  XXXXXXXXXXXXXXXWTNIK----------------------SPGSAISTSGTSSPFPDK 1040
                            +N K                      SPGS +S SGTSSPFP K
Sbjct: 180  AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGK 239

Query: 1041 RAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGS--------------LTPNGGISRLGSG 1178
              IIEF  GE  KF+GYEHF   KWGSRVGSGS              LTPNGGISRLGSG
Sbjct: 240  CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSG 299

Query: 1179 SLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVK 1358
            ++TPNG EP SRD  LLE+QISEVASLANSD  S   + V++HRVSFEL  ED+P+C  K
Sbjct: 300  TVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREK 359

Query: 1359 ETVRS---PKIALDTNQ---KDLMTKNEDSCRENNNAKNVNAIPGFDQ---KHRTVSMGS 1511
            E V S   P + +D +     ++ + +  +  +   +    +  G D+   KHR ++ GS
Sbjct: 360  EPVMSHSQPTLPMDVSNLLASEMRSGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGS 419

Query: 1512 SKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664
            SKDF+F+N K EV +K ++DCEWW +DK  VKE G  N+W FFP+LQ GVS
Sbjct: 420  SKDFDFDNVKIEVLEKDSIDCEWWTSDKAAVKESGIQNNWTFFPVLQPGVS 470


>emb|CDP05166.1| unnamed protein product [Coffea canephora]
          Length = 452

 Score =  376 bits (965), Expect = e-119
 Identities = 223/470 (47%), Positives = 260/470 (55%), Gaps = 44/470 (9%)
 Frame = +3

Query: 387  MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566
            MSSV NS                  QP TVQKRRW SCWS Y C GS K+SKRIG+AVLV
Sbjct: 1    MSSVHNSVETVNAAATAIVTAESRVQPPTVQKRRWGSCWSFYWCFGSVKNSKRIGNAVLV 60

Query: 567  SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746
             +PT   S  P V +N N SAT                QSDPPSAT              
Sbjct: 61   PEPTVPGSAVP-VPDNLNHSATIVIPFIAPPSSPASFLQSDPPSATQSPAKFLPLASFSV 119

Query: 747  HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926
            +T+SP G A IF IGPYA+ETQLVSPPVFS FTTEPSTASF               EV F
Sbjct: 120  NTYSPSGAASIFAIGPYAHETQLVSPPVFSAFTTEPSTASFTPPPEPVQLTTPSSPEVPF 179

Query: 927  XXXXXXXXXXXXXXXWTNIK----------------------SPGSAISTSGTSSPFPDK 1040
                            T+IK                      SPGSAIS SGTSSPFP+K
Sbjct: 180  AQLLVSSLTHNRRHSGTSIKFPLSQYEFQPYQCPGSPGSHLISPGSAISNSGTSSPFPEK 239

Query: 1041 RAIIEFCVGEGSKFIGYEHFLNY-------------KWGSRVGSGSLTPNGGISRLGSGS 1181
            R IIEF +GE  KF+GYE F                 WGSR+GSGSLTPNGGISRLGSG+
Sbjct: 240  RPIIEFRIGEAPKFLGYELFTRKWGSRVGSGSLTPNGWGSRLGSGSLTPNGGISRLGSGT 299

Query: 1182 LTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVKE 1361
            LTPNG EP +RD  LLE+QISEVASLANSD  + N++G+++HRVSFEL  E +P CV +E
Sbjct: 300  LTPNGGEPAARDSYLLENQISEVASLANSDNGTHNEEGLMDHRVSFELTAEHVPNCVEEE 359

Query: 1362 TVRSPKIALDTNQKDLMTKNEDSCRE--NNNAKNV--NAIPGFDQK-----HRTVSMGSS 1514
                              K ++ C +   ++  N+   A+ G + K     +RT S+GSS
Sbjct: 360  -----------------MKGQNFCEDCTGDSIHNITRKALDGQEGKQCLKNNRTFSLGSS 402

Query: 1515 KDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664
            KDFNF+N K+E  DKST+DCEWW N+    KELG  N W FFPMLQ GVS
Sbjct: 403  KDFNFDNMKQESPDKSTIDCEWWTNETAAAKELGSKNKWTFFPMLQPGVS 452


>ref|XP_009788654.1| PREDICTED: uncharacterized protein LOC104236433 isoform X2 [Nicotiana
            sylvestris]
          Length = 442

 Score =  375 bits (964), Expect = e-119
 Identities = 219/444 (49%), Positives = 254/444 (57%), Gaps = 47/444 (10%)
 Frame = +3

Query: 474  VQKRRWASCWSLYSCNGSYKHSKRIGHAVLVSQPTPQVSVAPFVDNNPNRSATXXXXXXX 653
            +QKRRW SCWSLY C GSYKHSKRIGHAVLV +P       P V  NPNRSAT       
Sbjct: 2    MQKRRWGSCWSLYWCFGSYKHSKRIGHAVLVPEPAAPGPAVP-VTENPNRSATIVIPFIA 60

Query: 654  XXXXXXXXXQSDPPSATHXXXXXXXXXXXXXHTFSPGGTAPIFTIGPYAYETQLVSPPVF 833
                      SDPPSAT              + +SPGGTA IF IGPYA+ETQLVSPPVF
Sbjct: 61   PPSSPASFLPSDPPSATQSPAGLLSLKSFSINAYSPGGTASIFAIGPYAHETQLVSPPVF 120

Query: 834  STFTTEPSTASFXXXXXXXXXXXXXXXEVSFXXXXXXXXXXXXXXXWTNIK--------- 986
            STFTTEPSTA+F               EV F                +N K         
Sbjct: 121  STFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYEFV 180

Query: 987  -------------SPGSAISTSGTSSPFPDKRAIIEFCVGEGSKFIGYEHFLNYKWGSRV 1127
                         SPGS +S SGTSSPFP K  IIEF  GE  KF+GYEHF   KWGSRV
Sbjct: 181  PYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRKWGSRV 240

Query: 1128 GSGS--------------LTPNGGISRLGSGSLTPNGLEPLSRDCNLLESQISEVASLAN 1265
            GSGS              LTPNGGISRLGSG++TPNG EP SRDC LLE+QISEVASLAN
Sbjct: 241  GSGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDCYLLENQISEVASLAN 300

Query: 1266 SDEESRNDDGVVEHRVSFELIGEDIPTCVVKETVRSPKIALDTNQKDLMTKNEDSCRENN 1445
            SD  S   +GV++HRVSFEL GED+P+C  KE V S   +  T   D+   +    R ++
Sbjct: 301  SDNGSEIAEGVIDHRVSFELTGEDVPSCREKEPVMSH--SQQTLPMDVPAPSNKEMRSSS 358

Query: 1446 N--AKNVNAIP------GFDQ---KHRTVSMGSSKDFNFNNSKEEVTDKSTVDCEWWIND 1592
            +   +  + +P      G DQ   KHR ++ GSSKDF+F+N K EV +K +VDCEWW +D
Sbjct: 359  SIVEEKTDGLPEKASERGDDQCHRKHRNITFGSSKDFDFDNVKIEVLEKHSVDCEWWTSD 418

Query: 1593 KVVVKELGPHNSWNFFPMLQSGVS 1664
            K   KE    N+W FFP+LQ GVS
Sbjct: 419  KATGKESSIQNNWTFFPVLQPGVS 442


>ref|XP_015070692.1| PREDICTED: uncharacterized protein LOC107015045 isoform X2 [Solanum
            pennellii]
          Length = 469

 Score =  375 bits (964), Expect = e-119
 Identities = 222/471 (47%), Positives = 260/471 (55%), Gaps = 45/471 (9%)
 Frame = +3

Query: 387  MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566
            MSSV+N+                  QPSTVQ RRW SCWSLY C GS+KHSKRIGHAVLV
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQ-RRWGSCWSLYWCFGSHKHSKRIGHAVLV 59

Query: 567  SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746
             +P       P V  NPN SAT                 SDPPSAT              
Sbjct: 60   PEPVAPGPAVP-VTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSI 118

Query: 747  HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926
            + +SPGGTA IF IGPYA+ETQLVSPPVFSTFTTEPSTA+F               EV F
Sbjct: 119  NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPF 178

Query: 927  XXXXXXXXXXXXXXXWTNIK----------------------SPGSAISTSGTSSPFPDK 1040
                            +N K                      SPGS +S SGTSSPFP K
Sbjct: 179  AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGK 238

Query: 1041 RAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGS--------------LTPNGGISRLGSG 1178
              IIEF  GE  KF+GYEHF   KWGSRVGSGS              LTPNGGISRLGSG
Sbjct: 239  CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 298

Query: 1179 SLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVK 1358
            ++TPNG EP SRD  LLE+QISEVASLANSD  S   +GV++HRVSFEL GED+P+C  K
Sbjct: 299  TVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREK 358

Query: 1359 ETVRS---PKIALD------TNQKDLMTKNEDSCRENNNAKNVNAIPGFDQKHRTVSMGS 1511
            E V S   P + +D      +  K   +  E+    +    + +      +KHR ++ GS
Sbjct: 359  EPVMSHSQPTLPMDVSNLLASEMKSGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGS 418

Query: 1512 SKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664
            SKDF+F+N K EV +K ++DCEWW +DK   KE G  N+W FFP+LQ GVS
Sbjct: 419  SKDFDFDNVKIEVLEKDSIDCEWWTSDKAAGKESGIQNNWTFFPVLQPGVS 469


>ref|XP_010317637.1| PREDICTED: uncharacterized protein LOC101260903 isoform X3 [Solanum
            lycopersicum]
          Length = 469

 Score =  373 bits (957), Expect = e-118
 Identities = 221/471 (46%), Positives = 262/471 (55%), Gaps = 45/471 (9%)
 Frame = +3

Query: 387  MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566
            MSSV+N+                  QPSTVQ RRW SCWSLY C GS+KHSKRIGHAVLV
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQ-RRWGSCWSLYWCFGSHKHSKRIGHAVLV 59

Query: 567  SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746
             +P       P V  NPN SAT                 SDPPSAT              
Sbjct: 60   PEPVAPGPAVP-VTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSI 118

Query: 747  HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926
            + +SPGGTA IF IGPYA+ETQLVSPPVFSTFTTEPSTA+F               EV F
Sbjct: 119  NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPF 178

Query: 927  XXXXXXXXXXXXXXXWTNIK----------------------SPGSAISTSGTSSPFPDK 1040
                            +N K                      SPGS +S SGTSSPFP K
Sbjct: 179  AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGK 238

Query: 1041 RAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGS--------------LTPNGGISRLGSG 1178
              IIEF  GE  KF+GYEHF   KWGSRVGSGS              LTPNGGISRLGSG
Sbjct: 239  CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSG 298

Query: 1179 SLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVK 1358
            ++TPNG EP SRD  LLE+QISEVASLANSD  S   + V++HRVSFEL  ED+P+C  K
Sbjct: 299  TVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREK 358

Query: 1359 ETVRS---PKIALDTNQ---KDLMTKNEDSCRENNNAKNVNAIPGFDQ---KHRTVSMGS 1511
            E V S   P + +D +     ++ + +  +  +   +    +  G D+   KHR ++ GS
Sbjct: 359  EPVMSHSQPTLPMDVSNLLASEMRSGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGS 418

Query: 1512 SKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664
            SKDF+F+N K EV +K ++DCEWW +DK  VKE G  N+W FFP+LQ GVS
Sbjct: 419  SKDFDFDNVKIEVLEKDSIDCEWWTSDKAAVKESGIQNNWTFFPVLQPGVS 469


>ref|XP_010317636.1| PREDICTED: uncharacterized protein LOC101260903 isoform X1 [Solanum
            lycopersicum]
          Length = 476

 Score =  363 bits (933), Expect = e-114
 Identities = 210/440 (47%), Positives = 250/440 (56%), Gaps = 45/440 (10%)
 Frame = +3

Query: 480  KRRWASCWSLYSCNGSYKHSKRIGHAVLVSQPTPQVSVAPFVDNNPNRSATXXXXXXXXX 659
            +RRW SCWSLY C GS+KHSKRIGHAVLV +P       P V  NPN SAT         
Sbjct: 38   ERRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPVAPGPAVP-VTENPNHSATIVIPFIAPP 96

Query: 660  XXXXXXXQSDPPSATHXXXXXXXXXXXXXHTFSPGGTAPIFTIGPYAYETQLVSPPVFST 839
                    SDPPSAT              + +SPGGTA IF IGPYA+ETQLVSPPVFST
Sbjct: 97   SSPASFLPSDPPSATQSPAGLLSLKALSINAYSPGGTASIFAIGPYAHETQLVSPPVFST 156

Query: 840  FTTEPSTASFXXXXXXXXXXXXXXXEVSFXXXXXXXXXXXXXXXWTNIK----------- 986
            FTTEPSTA+F               EV F                +N K           
Sbjct: 157  FTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYEFVPY 216

Query: 987  -----------SPGSAISTSGTSSPFPDKRAIIEFCVGEGSKFIGYEHFLNYKWGSRVGS 1133
                       SPGS +S SGTSSPFP K  IIEF  GE  KF+GYEHF   KWGSRVGS
Sbjct: 217  QDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVGS 276

Query: 1134 GS--------------LTPNGGISRLGSGSLTPNGLEPLSRDCNLLESQISEVASLANSD 1271
            GS              LTPNGGISRLGSG++TPNG EP SRD  LLE+QISEVASLANSD
Sbjct: 277  GSVTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLENQISEVASLANSD 336

Query: 1272 EESRNDDGVVEHRVSFELIGEDIPTCVVKETVRS---PKIALDTNQ---KDLMTKNEDSC 1433
              S   + V++HRVSFEL  ED+P+C  KE V S   P + +D +     ++ + +  + 
Sbjct: 337  NGSEIGEAVIDHRVSFELTEEDVPSCREKEPVMSHSQPTLPMDVSNLLASEMRSGSSMAE 396

Query: 1434 RENNNAKNVNAIPGFDQ---KHRTVSMGSSKDFNFNNSKEEVTDKSTVDCEWWINDKVVV 1604
             +   +    +  G D+   KHR ++ GSSKDF+F+N K EV +K ++DCEWW +DK  V
Sbjct: 397  EKTYGSPRKASESGEDECHRKHRNITFGSSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAV 456

Query: 1605 KELGPHNSWNFFPMLQSGVS 1664
            KE G  N+W FFP+LQ GVS
Sbjct: 457  KESGIQNNWTFFPVLQPGVS 476


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  326 bits (836), Expect = e-100
 Identities = 204/463 (44%), Positives = 248/463 (53%), Gaps = 62/463 (13%)
 Frame = +3

Query: 462  QPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLVSQPT-PQVSVAPFVDNNPNRSATXX 638
            QP+TVQK+RW SCW LY C GS K+SKRIGHAVLV +P  P  SV+     N +      
Sbjct: 26   QPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVS--TAENVSNPTGII 83

Query: 639  XXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXXHTFSPGGTAPIFTIGPYAYETQLV 818
                          QSDPPSAT              + +SP G A IF IGPYA+ETQLV
Sbjct: 84   LPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLV 143

Query: 819  SPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSFXXXXXXXXXXXXXXXWTNIK---- 986
            +PPVFS  TTEPSTA F               EV F                 N K    
Sbjct: 144  TPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLS 203

Query: 987  -------------------SPGSAISTSGTSSPFPDKRAIIEFCVGEGSKFIGYEHFLNY 1109
                               SPGSAIS SGTSSPFPD+R I+EF +GE  K +G+E+F   
Sbjct: 204  HYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGEAPKLLGFENFTTR 263

Query: 1110 KWGSRVGSGSLTPNG-----------------GI-SRLGSGSLTPNGLEPLSRDCNLLES 1235
            KWGSR+GSGSLTP+G                 G+ SRLGSGSLTP+GL P SRD  L+ S
Sbjct: 264  KWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGS 323

Query: 1236 QISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVKETVRSPKIALDTNQKDLMT 1415
            QISEVA LAN     +ND+ +V+HRVSFEL GED+  C+  +++  P  A+    KDL+ 
Sbjct: 324  QISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSL-LPSRAVSEYPKDLVA 382

Query: 1416 KN-----------EDSC----RENNNAKNVNAIPGFD-----QKHRTVSMGSSKDFNFNN 1535
            +            E SC    RE +N     A    +     QKHR+V++GS K+FNF+N
Sbjct: 383  EGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDN 442

Query: 1536 SKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664
            +K E +DK T+  EWW N+KV  KE  P NSW FFPMLQ  VS
Sbjct: 443  TKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485


>gb|KHG09821.1| hypothetical protein F383_13171 [Gossypium arboreum]
          Length = 465

 Score =  320 bits (821), Expect = 5e-98
 Identities = 196/446 (43%), Positives = 243/446 (54%), Gaps = 45/446 (10%)
 Frame = +3

Query: 462  QPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLVSQPT-PQVSVAPFVDNNPNRSATXX 638
            QP+TVQK+RW SCWS Y C GS+K SKRIGHAVLV +P  P  SV+     N +      
Sbjct: 26   QPTTVQKKRWGSCWSFYWCFGSHKSSKRIGHAVLVPEPVVPGASVS--TAENASNPTGIV 83

Query: 639  XXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXXHTFSPGGTAPIFTIGPYAYETQLV 818
                          QSDPPSAT              + +SP G A IF+IGPYA+ETQLV
Sbjct: 84   MPFIAPPSSPASFLQSDPPSATQSPAGLLSLTALSVNAYSPRGPASIFSIGPYAHETQLV 143

Query: 819  SPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSFXXXXXXXXXXXXXXXWTNIK---- 986
            +PPVFS  TTEPSTA F               EV F                 N K    
Sbjct: 144  TPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLS 203

Query: 987  -------------------SPGSAISTSGTSSPFPDKRAIIEFCVGEGSKFIGYEHFLNY 1109
                               SPGS IS SGTSSPFPD+R I+EF +GE  K +G+EHF   
Sbjct: 204  HYEFQSYQIYPGSPGGNLISPGSVISNSGTSSPFPDRRPILEFRMGEAPKTLGFEHFTTR 263

Query: 1110 KWGSRVGSGSLTPNG-----------------GI-SRLGSGSLTPNGLEPLSRDCNLLES 1235
            KWGSR+GSGSLTP+G                 G+ SRLGSGSLTP+GL P SRD   +ES
Sbjct: 264  KWGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGLGSRLGSGSLTPDGLGPASRDGFPIES 323

Query: 1236 QISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVKETVRSPKIALDTN-QKDLM 1412
            Q SEVA L+N     +ND+ +V+HRVSFEL GED+  C+  +++ S +   D    KDL+
Sbjct: 324  QNSEVALLSNPPNGPKNDEIIVDHRVSFELSGEDVARCLKNKSLVSSRTMPDYEYPKDLV 383

Query: 1413 TKN--EDSCRENNNAKNVNAIPGFDQKHRTVSMGSSKDFNFNNSKEEVTDKSTVDCEWWI 1586
             +   E   + +  A+  +      QKHR+V++GS K+FNF+N K E ++K TV  EWW 
Sbjct: 384  AQGRIEKDEKVSGEAEEDHCY----QKHRSVTLGSIKEFNFDNRKGEASEKPTVRSEWWA 439

Query: 1587 NDKVVVKELGPHNSWNFFPMLQSGVS 1664
            N+KV  KE  P N+W FFPMLQ  VS
Sbjct: 440  NEKVAGKEARPGNNWTFFPMLQPEVS 465


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  320 bits (821), Expect = 9e-98
 Identities = 204/467 (43%), Positives = 248/467 (53%), Gaps = 66/467 (14%)
 Frame = +3

Query: 462  QPSTVQ----KRRWASCWSLYSCNGSYKHSKRIGHAVLVSQPT-PQVSVAPFVDNNPNRS 626
            QP+TVQ    K+RW SCW LY C GS K+SKRIGHAVLV +P  P  SV+     N +  
Sbjct: 26   QPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVS--TAENVSNP 83

Query: 627  ATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXXHTFSPGGTAPIFTIGPYAYE 806
                              QSDPPSAT              + +SP G A IF IGPYA+E
Sbjct: 84   TGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHE 143

Query: 807  TQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSFXXXXXXXXXXXXXXXWTNIK 986
            TQLV+PPVFS  TTEPSTA F               EV F                 N K
Sbjct: 144  TQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQK 203

Query: 987  -----------------------SPGSAISTSGTSSPFPDKRAIIEFCVGEGSKFIGYEH 1097
                                   SPGSAIS SGTSSPFPD+R I+EF +GE  K +G+E+
Sbjct: 204  FGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGEAPKLLGFEN 263

Query: 1098 FLNYKWGSRVGSGSLTPNG-----------------GI-SRLGSGSLTPNGLEPLSRDCN 1223
            F   KWGSR+GSGSLTP+G                 G+ SRLGSGSLTP+GL P SRD  
Sbjct: 264  FTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGF 323

Query: 1224 LLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVKETVRSPKIALDTNQK 1403
            L+ SQISEVA LAN     +ND+ +V+HRVSFEL GED+  C+  +++  P  A+    K
Sbjct: 324  LVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSL-LPSRAVSEYPK 382

Query: 1404 DLMTKN-----------EDSC----RENNNAKNVNAIPGFD-----QKHRTVSMGSSKDF 1523
            DL+ +            E SC    RE +N     A    +     QKHR+V++GS K+F
Sbjct: 383  DLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEF 442

Query: 1524 NFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664
            NF+N+K E +DK T+  EWW N+KV  KE  P NSW FFPMLQ  VS
Sbjct: 443  NFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489


>ref|XP_012090213.1| PREDICTED: uncharacterized protein LOC105648441 [Jatropha curcas]
            gi|643706116|gb|KDP22248.1| hypothetical protein
            JCGZ_26079 [Jatropha curcas]
          Length = 498

 Score =  320 bits (819), Expect = 2e-97
 Identities = 194/500 (38%), Positives = 251/500 (50%), Gaps = 74/500 (14%)
 Frame = +3

Query: 387  MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566
            M SV NS                  QP+ VQKRRW  CWSLY C GS+K+SKRIGHAVLV
Sbjct: 1    MRSVNNSVETINAAATAIISAESRVQPTVVQKRRWGGCWSLYWCFGSHKNSKRIGHAVLV 60

Query: 567  SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746
             +P    +V    +N  + +A                 QSDPPS T              
Sbjct: 61   PEPEVPQAVVTSAENQTHSTAAAVPFIAPPSSPASFL-QSDPPSVTQSPAGLLSLTALSV 119

Query: 747  HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926
              +SPGG A IF IGPYA+ETQLV+PPVFS FTTEPSTA F               EV F
Sbjct: 120  SAYSPGGPASIFAIGPYAHETQLVTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPF 179

Query: 927  XXXXXXXXXXXXXXXWTNIK-----------------------SPGSAISTSGTSSPFPD 1037
                             N K                       SPGS IS SGTSSPFPD
Sbjct: 180  AQLLTSSLERARRNSGANQKFALSHYEFQSYPLYPGSPGGQLISPGSIISNSGTSSPFPD 239

Query: 1038 KRAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGSLTPNG---------------GI---- 1160
            +  ++EF +GE  K +G+EHF   KWGSR+GSG+LTP+G               G+    
Sbjct: 240  RHPLLEFRMGEAPKLLGFEHFTTRKWGSRLGSGTLTPDGVGLGSRLCSGTATPDGVGLGS 299

Query: 1161 ---------------SRLGSGSLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDG 1295
                           SRLGSGSLTP+ + P S+D  LLE+QISEVASLANS+  S+ND+ 
Sbjct: 300  RLGSGSVTPDGVGLRSRLGSGSLTPDCVVPASQDGLLLENQISEVASLANSENASKNDEN 359

Query: 1296 VVEHRVSFELIGEDIPTCVVKETVRSPKIALD----------TNQKDLMTKNEDSCRENN 1445
            +V+HRVSFEL GE++  C+  +++ S +   +           N ++++  + D      
Sbjct: 360  IVDHRVSFELSGEEVARCLESKSMTSSRTFSECPQDSMAEEQINSEEILINSNDCLHIGE 419

Query: 1446 NAKNVNAIPGFD-------QKHRTVSMGSSKDFNFNNSKEEVTDKSTVDCEWWINDKVVV 1604
             +      P  +       +KHR++++GS K+FNF+NSK EV DK T+  EWW N+ +  
Sbjct: 420  TSNETPEKPSGETEEEPCYRKHRSITLGSIKEFNFDNSK-EVPDKPTISSEWWANETIAG 478

Query: 1605 KELGPHNSWNFFPMLQSGVS 1664
            KE  P N+W FFP+LQ  VS
Sbjct: 479  KEARPANNWTFFPLLQPEVS 498


>ref|XP_012440132.1| PREDICTED: uncharacterized protein LOC105765522 [Gossypium raimondii]
            gi|763785675|gb|KJB52746.1| hypothetical protein
            B456_008G275500 [Gossypium raimondii]
          Length = 465

 Score =  318 bits (816), Expect = 2e-97
 Identities = 193/445 (43%), Positives = 241/445 (54%), Gaps = 44/445 (9%)
 Frame = +3

Query: 462  QPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLVSQPTPQVSVAPFVDNNPNRSATXXX 641
            QP+TVQK+RW SCWS Y C GS+K SKRIGHAVLV +P    ++    +N  N +     
Sbjct: 26   QPTTVQKKRWGSCWSFYWCFGSHKSSKRIGHAVLVPEPVVPGALVSTAENASNPTGIVMP 85

Query: 642  XXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXXHTFSPGGTAPIFTIGPYAYETQLVS 821
                         QSDPPSAT              + +SP G A IF IGPYA+ETQLV+
Sbjct: 86   FIAPPSSPASFL-QSDPPSATQSPAGLLSLTALSVNAYSPRGPASIFAIGPYAHETQLVT 144

Query: 822  PPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSFXXXXXXXXXXXXXXXWTNIK----- 986
            PPVFS  TTEPSTA F               EV F                 N K     
Sbjct: 145  PPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSH 204

Query: 987  ------------------SPGSAISTSGTSSPFPDKRAIIEFCVGEGSKFIGYEHFLNYK 1112
                              SPGS IS SGTSSPFPD+R I+EF +GE  K +G+EHF   K
Sbjct: 205  YEFQSYQIYPGSPGGNLISPGSVISNSGTSSPFPDRRPILEFRMGEAPKTLGFEHFTTRK 264

Query: 1113 WGSRVGSGSLTPNG-----------------GI-SRLGSGSLTPNGLEPLSRDCNLLESQ 1238
            WGSR+GSGSLTP+G                 G+ SRLGSGSLTP+GL P SRD   +ESQ
Sbjct: 265  WGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGLGSRLGSGSLTPDGLGPASRDGFPIESQ 324

Query: 1239 ISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVKETVRSPKIALDTN-QKDLMT 1415
             SEVA L+N     +ND+ +V+HRVSFEL GED+  C+  +++ S +   D     DL+ 
Sbjct: 325  NSEVALLSNPPNGPKNDEIIVDHRVSFELSGEDVARCLKNKSLVSSRTMPDYEYPNDLVA 384

Query: 1416 KN--EDSCRENNNAKNVNAIPGFDQKHRTVSMGSSKDFNFNNSKEEVTDKSTVDCEWWIN 1589
            +   E   + +  A+  +      QKHR+V++GS K+FNF+N K E ++K TV  EWW N
Sbjct: 385  QGRIEKDEKVSGEAEEDHCY----QKHRSVTLGSIKEFNFDNRKGEASEKPTVRSEWWAN 440

Query: 1590 DKVVVKELGPHNSWNFFPMLQSGVS 1664
            +KV  KE  P N+W FFPMLQ  VS
Sbjct: 441  EKVAGKEARPGNNWTFFPMLQPEVS 465


>ref|XP_015888763.1| PREDICTED: uncharacterized protein LOC107423668 [Ziziphus jujuba]
          Length = 501

 Score =  317 bits (811), Expect = 4e-96
 Identities = 201/506 (39%), Positives = 248/506 (49%), Gaps = 80/506 (15%)
 Frame = +3

Query: 387  MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566
            M SV NS                  QP+ V KRRW SCWSLY C GS+K++KRI HAVLV
Sbjct: 1    MRSVNNSVETINAAASAIVSAETRAQPTAVPKRRWGSCWSLYWCFGSHKNTKRISHAVLV 60

Query: 567  SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746
             +     +  P  +N    +A                 QSDPPSAT              
Sbjct: 61   PEQVVPGAAVPAAENQIPSTAVVLPFIAPPSSPASFL-QSDPPSATQSPAGLLSLTSLSV 119

Query: 747  HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926
            + +SPGG A IF IGPYAYETQLVSPPVFSTFTTEPSTA F               EV F
Sbjct: 120  NAYSPGGPASIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPF 179

Query: 927  XXXXXXXXXXXXXXXWTNIK-----------------------SPGSAISTSGTSSPFPD 1037
                            TN K                       SPGS IS SGTSSPFPD
Sbjct: 180  AQLLTSSLDRTRRNNGTNQKFALSHCEFQPYQPYPGSPGGQLISPGSVISNSGTSSPFPD 239

Query: 1038 KRAIIEFCVGEGSKFIGYEHFLNYKW--------------------------------GS 1121
            +  I+EF +GE  + +G+EHF   KW                                GS
Sbjct: 240  RHPILEFRMGEAPRLLGFEHFTTRKWGSRLGSGSITPDGLGLGSRLGSGCLTPDGNGLGS 299

Query: 1122 RVGSGSLTPNGG--ISRLGSGSLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDG 1295
            R+GSGSLTPNG    SRLGSG LTP+G+ P S D   +E+QISEVASLANS+   + D  
Sbjct: 300  RIGSGSLTPNGAGLASRLGSGCLTPDGVGPASGDSFPMENQISEVASLANSESGCQLDGN 359

Query: 1296 VVEHRVSFELIGEDIPTCVVKETVRSPKIALDT---------NQKDLMTKN-----EDSC 1433
            V+ HRVSFEL GED+  C+  +++ S + A D           +KD M         +SC
Sbjct: 360  VINHRVSFELTGEDVARCLANKSMASVRTASDPLKDTPSECGVKKDRMISTGTDHFSESC 419

Query: 1434 RENNNAKNVNAIPGFD---------QKHRTVSMGSSKDFNFNNSKEEVTDKSTVDCEWWI 1586
             E  + +    +P  D         +KHR++++GS K+FNF+++K E +DK T   EWW 
Sbjct: 420  VEETSVE----LPENDHGEWEDQCYRKHRSITLGSIKEFNFDSTKSEFSDKPTNGSEWWA 475

Query: 1587 NDKVVVKELGPHNSWNFFPMLQSGVS 1664
            N+KV  KE  P N W FFP+LQ GVS
Sbjct: 476  NEKVAGKESKPGNGWTFFPILQPGVS 501


>gb|KJB52747.1| hypothetical protein B456_008G275500 [Gossypium raimondii]
          Length = 464

 Score =  313 bits (802), Expect = 3e-95
 Identities = 193/445 (43%), Positives = 240/445 (53%), Gaps = 44/445 (9%)
 Frame = +3

Query: 462  QPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLVSQPTPQVSVAPFVDNNPNRSATXXX 641
            QP+TVQKR W SCWS Y C GS+K SKRIGHAVLV +P    ++    +N  N +     
Sbjct: 26   QPTTVQKR-WGSCWSFYWCFGSHKSSKRIGHAVLVPEPVVPGALVSTAENASNPTGIVMP 84

Query: 642  XXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXXHTFSPGGTAPIFTIGPYAYETQLVS 821
                         QSDPPSAT              + +SP G A IF IGPYA+ETQLV+
Sbjct: 85   FIAPPSSPASFL-QSDPPSATQSPAGLLSLTALSVNAYSPRGPASIFAIGPYAHETQLVT 143

Query: 822  PPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSFXXXXXXXXXXXXXXXWTNIK----- 986
            PPVFS  TTEPSTA F               EV F                 N K     
Sbjct: 144  PPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSH 203

Query: 987  ------------------SPGSAISTSGTSSPFPDKRAIIEFCVGEGSKFIGYEHFLNYK 1112
                              SPGS IS SGTSSPFPD+R I+EF +GE  K +G+EHF   K
Sbjct: 204  YEFQSYQIYPGSPGGNLISPGSVISNSGTSSPFPDRRPILEFRMGEAPKTLGFEHFTTRK 263

Query: 1113 WGSRVGSGSLTPNG-----------------GI-SRLGSGSLTPNGLEPLSRDCNLLESQ 1238
            WGSR+GSGSLTP+G                 G+ SRLGSGSLTP+GL P SRD   +ESQ
Sbjct: 264  WGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGLGSRLGSGSLTPDGLGPASRDGFPIESQ 323

Query: 1239 ISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVKETVRSPKIALDTN-QKDLMT 1415
             SEVA L+N     +ND+ +V+HRVSFEL GED+  C+  +++ S +   D     DL+ 
Sbjct: 324  NSEVALLSNPPNGPKNDEIIVDHRVSFELSGEDVARCLKNKSLVSSRTMPDYEYPNDLVA 383

Query: 1416 KN--EDSCRENNNAKNVNAIPGFDQKHRTVSMGSSKDFNFNNSKEEVTDKSTVDCEWWIN 1589
            +   E   + +  A+  +      QKHR+V++GS K+FNF+N K E ++K TV  EWW N
Sbjct: 384  QGRIEKDEKVSGEAEEDHCY----QKHRSVTLGSIKEFNFDNRKGEASEKPTVRSEWWAN 439

Query: 1590 DKVVVKELGPHNSWNFFPMLQSGVS 1664
            +KV  KE  P N+W FFPMLQ  VS
Sbjct: 440  EKVAGKEARPGNNWTFFPMLQPEVS 464


>ref|XP_009368760.1| PREDICTED: uncharacterized protein LOC103958237 isoform X2 [Pyrus x
            bretschneideri]
          Length = 478

 Score =  313 bits (803), Expect = 3e-95
 Identities = 189/457 (41%), Positives = 246/457 (53%), Gaps = 57/457 (12%)
 Frame = +3

Query: 462  QPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLVSQPT-PQVSVAPFVDNNPNRSATXX 638
            QP+ + KRRW SCWSLY C GS+K SKRIGHAVLV +P  P  +V+    +N   S    
Sbjct: 23   QPTNISKRRWGSCWSLYWCFGSHKTSKRIGHAVLVPEPVVPGAAVS--TSDNQTTSTAIV 80

Query: 639  XXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXXHTFSPGGTAPIFTIGPYAYETQLV 818
                           SDPPSA+              + +S G  A +F+IGPYAYETQLV
Sbjct: 81   LPFIAPPSSPASFLPSDPPSASQSPAGFLSLTSLSVNAYSSGEPASMFSIGPYAYETQLV 140

Query: 819  SPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSFXXXXXXXXXXXXXXXWTNIK---- 986
            SPPVFSTF TEPSTA F               EV F                 N K    
Sbjct: 141  SPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLSSSLDRQRRNSSNNQKFPLS 200

Query: 987  -------------------SPGSAISTSGTSSPFPDKRAIIEFCVGEGSKFIGYEHFLNY 1109
                               SPGSAIS SGTSSPFPD+  ++EF +GEG K  G++HF N+
Sbjct: 201  QYEYQPYQQYPGSPGGDLISPGSAISNSGTSSPFPDRHPMLEFRMGEGPKLYGFDHFTNH 260

Query: 1110 KWGSRVGSGSLTPNG---------------GI---SRLGSGSLTPNGLEPLSRDCNLLES 1235
            KWGSR+GSG+LTP+G               G+   SR+ SG LTP+G  P SRD   +E+
Sbjct: 261  KWGSRLGSGTLTPDGYELGSRLGSGCLTPNGVGVGSRMSSGCLTPDGTGPASRDGFHMEN 320

Query: 1236 QISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVKETVR-----SPKIALDTN- 1397
            QISEVASLAN++    N   + +HRVSFEL GED+  C+  + +R     S  IA + + 
Sbjct: 321  QISEVASLANTESGCHNGGTIFDHRVSFELTGEDVACCLANKALRTATESSNDIAAENSI 380

Query: 1398 QKDLMTKNEDSCRENNNAKNVNAIP------GFDQ---KHRTVSMGSSKDFNFNNSKEEV 1550
            + D +  + ++ RE N  ++++ IP      G DQ   K R++++GS+K+FNF+++K EV
Sbjct: 381  ETDGLLTDSNNHREFNVEESLSRIPENASGEGEDQGYRKQRSITLGSTKEFNFDHTKAEV 440

Query: 1551 TDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGV 1661
              KS +  EWW N  V  KE  P N W FFP+LQ GV
Sbjct: 441  PSKSNIGSEWWANKNVAAKESKPCNDWTFFPILQPGV 477

