BLASTX nr result

ID: Rehmannia27_contig00013953 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia27_contig00013953
         (1469 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011084440.1| PREDICTED: uncharacterized protein LOC105166...   582   0.0  
ref|XP_015070691.1| PREDICTED: uncharacterized protein LOC107015...   470   e-160
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   465   e-158
ref|XP_009788653.1| PREDICTED: uncharacterized protein LOC104236...   465   e-158
ref|XP_009625236.1| PREDICTED: uncharacterized protein LOC104116...   465   e-158
ref|XP_015070692.1| PREDICTED: uncharacterized protein LOC107015...   464   e-157
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   463   e-157
ref|XP_010317637.1| PREDICTED: uncharacterized protein LOC101260...   459   e-155
emb|CDP05166.1| unnamed protein product [Coffea canephora]            452   e-153
ref|XP_011076135.1| PREDICTED: uncharacterized protein LOC105160...   451   e-152
ref|XP_009788654.1| PREDICTED: uncharacterized protein LOC104236...   446   e-151
ref|XP_010317636.1| PREDICTED: uncharacterized protein LOC101260...   441   e-148
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   404   e-134
ref|XP_011470333.1| PREDICTED: uncharacterized protein LOC101312...   403   e-133
gb|KHG09821.1| hypothetical protein F383_13171 [Gossypium arboreum]   400   e-132
ref|XP_008234199.1| PREDICTED: uncharacterized protein LOC103333...   401   e-132
ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun...   401   e-132
ref|XP_012440132.1| PREDICTED: uncharacterized protein LOC105765...   399   e-132
ref|XP_012090213.1| PREDICTED: uncharacterized protein LOC105648...   399   e-132
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   399   e-131

>ref|XP_011084440.1| PREDICTED: uncharacterized protein LOC105166690 [Sesamum indicum]
          Length = 479

 Score =  582 bits (1501), Expect = 0.0
 Identities = 311/438 (71%), Positives = 332/438 (75%), Gaps = 30/438 (6%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLI 423
            MSSVHNS              ESRVQPSTVQKRRWGSCWSIYWCFGS+KQSKRIGHAVL+
Sbjct: 1    MSSVHNSVETVNAAATAIVTAESRVQPSTVQKRRWGSCWSIYWCFGSHKQSKRIGHAVLV 60

Query: 424  SEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLSL--- 594
            SEP+A  +  PISENRN SS   TIVLPFI        FLQSDPPSAT SP GL+SL   
Sbjct: 61   SEPAAAGVAAPISENRNQSS---TIVLPFIAPPSSPASFLQSDPPSATQSPAGLISLASL 117

Query: 595  SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEV 774
            SVHA+SPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTT SSPEV
Sbjct: 118  SVHANSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTPSSPEV 177

Query: 775  PFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQYXXXXXXXXXXXXXAMSTSGTSSPFP 954
            PFAQLLSSSLARNRR  G +LK  LSQYEFQPYQY             A+STSGTSSPFP
Sbjct: 178  PFAQLLSSSLARNRRNCGTNLKYSLSQYEFQPYQYPGSPGGHIKSPGSALSTSGTSSPFP 237

Query: 955  DKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSGSLTPNDWG--------------SRLD 1092
            DK PIMEFR+GEAPKFLGYE+FPNYKW SRVGSGSLTPN WG              SRL 
Sbjct: 238  DKHPIMEFRMGEAPKFLGYEHFPNYKWGSRVGSGSLTPNGWGSRLGSGALTPNGGLSRLG 297

Query: 1093 SGTLTPNGGEPPSRDNKVLENQIYEVASLANSDRKSQNED-LVDHRVSFELFGEDIPTCI 1269
            SGTLTPNGGEPPSRD  +LENQIYEVASLANSDRKSQN+D +VDHRVSFELFGEDIPTC+
Sbjct: 298  SGTLTPNGGEPPSRDGNLLENQIYEVASLANSDRKSQNDDAVVDHRVSFELFGEDIPTCV 357

Query: 1270 V--RAPSLKNAPGYLQEEIVEGTSKRDLLASKNANSFREHFNGEIVNE----------EE 1413
            V   APS KNA GY      EGT+ +D L +KNA+S REH +GE  NE           E
Sbjct: 358  VTESAPSHKNASGYPGVATAEGTNNKD-LTTKNADSCREHNDGETTNEVPEIPLDGEGGE 416

Query: 1414 FYQKHRTISLGSSKDFNF 1467
             +QK RT+SLGSSKDFNF
Sbjct: 417  LHQKQRTVSLGSSKDFNF 434


>ref|XP_015070691.1| PREDICTED: uncharacterized protein LOC107015045 isoform X1 [Solanum
            pennellii]
          Length = 470

 Score =  470 bits (1210), Expect = e-160
 Identities = 258/434 (59%), Positives = 297/434 (68%), Gaps = 26/434 (5%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLI 423
            MSSV N+              ESRVQPSTVQKRRWGSCWS+YWCFGS+K SKRIGHAVL+
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60

Query: 424  SEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLSL--- 594
             EP AP    P++EN NHS+   TIV+PFI        FL SDPPSAT SP GLLSL   
Sbjct: 61   PEPVAPGPAVPVTENPNHSA---TIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKAL 117

Query: 595  SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEV 774
            S++A+SPGGTA IF IGPYAHETQLVSPPVFSTFTTEPSTA+FTPPPEPV MTT  SPEV
Sbjct: 118  SINAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEV 177

Query: 775  PFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQYXXXXXXXXXXXXXAMSTSGTSSPFP 954
            PFAQLL+SSLARNRR SG + K PLSQYEF PYQ               +S SGTSSPFP
Sbjct: 178  PFAQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFP 237

Query: 955  DKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSGSLTPNDWGSRLDSGTL---------- 1104
             K PI+EFR GE PKFLGYE+F   KW SRVGSGSLTP+ WGSRL SGTL          
Sbjct: 238  GKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLG 297

Query: 1105 ----TPNGGEPPSRDNKVLENQIYEVASLANSDRKSQ-NEDLVDHRVSFELFGEDIPTCI 1269
                TPNGGEPPSRD+ +LENQI EVASLANSD  S+  E ++DHRVSFEL GED+P+C 
Sbjct: 298  SGTVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCR 357

Query: 1270 VRAPSLKNAPGYLQEEIVEGTSKRDLLAS--KNANSFREH------FNGEIVNEEEFYQK 1425
             + P + ++   L  ++       +LLAS  K+ +S  E              E+E ++K
Sbjct: 358  EKEPVMSHSQPTLPMDV------SNLLASEMKSGSSMAEEKTYGSPRKASESGEDECHRK 411

Query: 1426 HRTISLGSSKDFNF 1467
            HR I+ GSSKDF+F
Sbjct: 412  HRNITFGSSKDFDF 425


>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 isoform X2 [Solanum
            lycopersicum]
          Length = 470

 Score =  465 bits (1197), Expect = e-158
 Identities = 254/431 (58%), Positives = 293/431 (67%), Gaps = 23/431 (5%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLI 423
            MSSV N+              ESRVQPSTVQKRRWGSCWS+YWCFGS+K SKRIGHAVL+
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60

Query: 424  SEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLSL--- 594
             EP AP    P++EN NHS+   TIV+PFI        FL SDPPSAT SP GLLSL   
Sbjct: 61   PEPVAPGPAVPVTENPNHSA---TIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKAL 117

Query: 595  SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEV 774
            S++A+SPGGTA IF IGPYAHETQLVSPPVFSTFTTEPSTA+FTPPPEPV MTT  SPEV
Sbjct: 118  SINAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEV 177

Query: 775  PFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQYXXXXXXXXXXXXXAMSTSGTSSPFP 954
            PFAQLL+SSLARNRR SG + K PLSQYEF PYQ               +S SGTSSPFP
Sbjct: 178  PFAQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFP 237

Query: 955  DKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSGSLTPNDWGSRLDSGTL---------- 1104
             K PI+EFR GE PKFLGYE+F   KW SRVGSGS+TP+ WGSRL SGTL          
Sbjct: 238  GKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLG 297

Query: 1105 ----TPNGGEPPSRDNKVLENQIYEVASLANSDRKSQ-NEDLVDHRVSFELFGEDIPTCI 1269
                TPNGGEPPSRD+ +LENQI EVASLANSD  S+  E ++DHRVSFEL  ED+P+C 
Sbjct: 298  SGTVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCR 357

Query: 1270 VRAPSLKNAPGYLQEEI-----VEGTSKRDLLASKNANSFREHFNGEIVNEEEFYQKHRT 1434
             + P + ++   L  ++      E  S   +   K   S R+        E+E ++KHR 
Sbjct: 358  EKEPVMSHSQPTLPMDVSNLLASEMRSGSSMAEEKTYGSPRKASES---GEDECHRKHRN 414

Query: 1435 ISLGSSKDFNF 1467
            I+ GSSKDF+F
Sbjct: 415  ITFGSSKDFDF 425


>ref|XP_009788653.1| PREDICTED: uncharacterized protein LOC104236433 isoform X1 [Nicotiana
            sylvestris]
          Length = 470

 Score =  465 bits (1196), Expect = e-158
 Identities = 253/429 (58%), Positives = 296/429 (68%), Gaps = 21/429 (4%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLI 423
            MSSV N+              ESRVQPS+VQKRRWGSCWS+YWCFGSYK SKRIGHAVL+
Sbjct: 1    MSSVQNTVDTVNAAATAIVTAESRVQPSSVQKRRWGSCWSLYWCFGSYKHSKRIGHAVLV 60

Query: 424  SEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLSL--- 594
             EP+AP    P++EN N S+   TIV+PFI        FL SDPPSAT SP GLLSL   
Sbjct: 61   PEPAAPGPAVPVTENPNRSA---TIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSF 117

Query: 595  SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEV 774
            S++A+SPGGTA IF IGPYAHETQLVSPPVFSTFTTEPSTA+FTPPPEPV MTT  SPEV
Sbjct: 118  SINAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEV 177

Query: 775  PFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQYXXXXXXXXXXXXXAMSTSGTSSPFP 954
            PFAQLL+SSLARNRR SG + K PLSQYEF PYQ               +S SGTSSPFP
Sbjct: 178  PFAQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFP 237

Query: 955  DKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSGSLTPNDWGSRLDSGTL---------- 1104
             K PI+EFR GE PKFLGYE+F   KW SRVGSGSLTP+ WGSRL SGTL          
Sbjct: 238  GKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLG 297

Query: 1105 ----TPNGGEPPSRDNKVLENQIYEVASLANSDRKSQ-NEDLVDHRVSFELFGEDIPTCI 1269
                TPNGGEPPSRD  +LENQI EVASLANSD  S+  E ++DHRVSFEL GED+P+C 
Sbjct: 298  SGTVTPNGGEPPSRDCYLLENQISEVASLANSDNGSEIAEGVIDHRVSFELTGEDVPSCR 357

Query: 1270 VRAPSLKNAPGYLQEEIVEGTSKRDLLASKNANSFREHFNGEIVNE---EEFYQKHRTIS 1440
             + P + ++   L  + V   S +++ +S +    +     E  +E   ++ ++KHR I+
Sbjct: 358  EKEPVMSHSQQTLPMD-VPAPSNKEMRSSSSIVEEKTDGLPEKASERGDDQCHRKHRNIT 416

Query: 1441 LGSSKDFNF 1467
             GSSKDF+F
Sbjct: 417  FGSSKDFDF 425


>ref|XP_009625236.1| PREDICTED: uncharacterized protein LOC104116142 [Nicotiana
            tomentosiformis]
          Length = 470

 Score =  465 bits (1196), Expect = e-158
 Identities = 248/428 (57%), Positives = 293/428 (68%), Gaps = 20/428 (4%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLI 423
            MSSV N+              ESRVQPS++QK+RWGSCWS+YWCFGSYK SKRIGHA+L+
Sbjct: 1    MSSVQNTVDTVNAAATAIITAESRVQPSSIQKKRWGSCWSLYWCFGSYKHSKRIGHAILV 60

Query: 424  SEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLSL--- 594
             EP+AP    P++EN N S+   TIV+PFI        FL SDPPSAT SP GLLSL   
Sbjct: 61   PEPAAPGPAVPVTENPNRSA---TIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSF 117

Query: 595  SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEV 774
            S++A+SPGGTA IF IGPYAHETQLVSPPVFSTFTTEPSTA+FTPPPEPV MTT  SPEV
Sbjct: 118  SINAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEV 177

Query: 775  PFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQYXXXXXXXXXXXXXAMSTSGTSSPFP 954
            PFAQLL+SSLARNRR SG + K PLSQYEF PYQ               +S SGTSSPFP
Sbjct: 178  PFAQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSSLISPGSVVSNSGTSSPFP 237

Query: 955  DKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSGSLTPNDWGSRLDSGTL---------- 1104
             K PI+EFR GE PKFLGYE+F   KW SRVGSGSLTP+ WGSRL SGTL          
Sbjct: 238  GKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLG 297

Query: 1105 ----TPNGGEPPSRDNKVLENQIYEVASLANSDRKSQ-NEDLVDHRVSFELFGEDIPTCI 1269
                TPNGGEPPSRD  +LENQI EVASLANSD  S+  E ++DHRVSFEL GED+P+C 
Sbjct: 298  SGTVTPNGGEPPSRDCYLLENQISEVASLANSDNGSEIAEGVIDHRVSFELTGEDVPSCR 357

Query: 1270 VRAPSLKNAPGYLQEEIVEGTSKRDLLASKNANSFREHF--NGEIVNEEEFYQKHRTISL 1443
             + P + ++   L  ++   ++K    +S N     +          +++ ++KHR I+ 
Sbjct: 358  EKEPVMSHSQQTLPMDVPAPSNKEMRSSSSNVEEKTDGLPEKASERGDDQCHRKHRNITF 417

Query: 1444 GSSKDFNF 1467
            GSSKDF+F
Sbjct: 418  GSSKDFDF 425


>ref|XP_015070692.1| PREDICTED: uncharacterized protein LOC107015045 isoform X2 [Solanum
            pennellii]
          Length = 469

 Score =  464 bits (1193), Expect = e-157
 Identities = 257/434 (59%), Positives = 296/434 (68%), Gaps = 26/434 (5%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLI 423
            MSSV N+              ESRVQPSTVQ RRWGSCWS+YWCFGS+K SKRIGHAVL+
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQ-RRWGSCWSLYWCFGSHKHSKRIGHAVLV 59

Query: 424  SEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLSL--- 594
             EP AP    P++EN NHS+   TIV+PFI        FL SDPPSAT SP GLLSL   
Sbjct: 60   PEPVAPGPAVPVTENPNHSA---TIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKAL 116

Query: 595  SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEV 774
            S++A+SPGGTA IF IGPYAHETQLVSPPVFSTFTTEPSTA+FTPPPEPV MTT  SPEV
Sbjct: 117  SINAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEV 176

Query: 775  PFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQYXXXXXXXXXXXXXAMSTSGTSSPFP 954
            PFAQLL+SSLARNRR SG + K PLSQYEF PYQ               +S SGTSSPFP
Sbjct: 177  PFAQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFP 236

Query: 955  DKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSGSLTPNDWGSRLDSGTL---------- 1104
             K PI+EFR GE PKFLGYE+F   KW SRVGSGSLTP+ WGSRL SGTL          
Sbjct: 237  GKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLG 296

Query: 1105 ----TPNGGEPPSRDNKVLENQIYEVASLANSDRKSQ-NEDLVDHRVSFELFGEDIPTCI 1269
                TPNGGEPPSRD+ +LENQI EVASLANSD  S+  E ++DHRVSFEL GED+P+C 
Sbjct: 297  SGTVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCR 356

Query: 1270 VRAPSLKNAPGYLQEEIVEGTSKRDLLAS--KNANSFREH------FNGEIVNEEEFYQK 1425
             + P + ++   L  ++       +LLAS  K+ +S  E              E+E ++K
Sbjct: 357  EKEPVMSHSQPTLPMDV------SNLLASEMKSGSSMAEEKTYGSPRKASESGEDECHRK 410

Query: 1426 HRTISLGSSKDFNF 1467
            HR I+ GSSKDF+F
Sbjct: 411  HRNITFGSSKDFDF 424


>ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum]
          Length = 470

 Score =  463 bits (1191), Expect = e-157
 Identities = 253/431 (58%), Positives = 293/431 (67%), Gaps = 23/431 (5%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLI 423
            MSSV N+              ESRVQPSTVQKRRWGSCWS+YWCFGS+K SKRIGHAVL+
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60

Query: 424  SEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLSL--- 594
             EP+AP    P++EN NHS+   TIV+PFI        FL SDPPSAT SP GLLSL   
Sbjct: 61   PEPAAPGPAVPVTENPNHSA---TIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSL 117

Query: 595  SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEV 774
            S++A+SPGGTA IF IGPYAHETQLVSPPVFSTFTTEPSTA+FTPPPE V MTT  SPEV
Sbjct: 118  SINAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEV 177

Query: 775  PFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQYXXXXXXXXXXXXXAMSTSGTSSPFP 954
            PFAQLL+SSLARNRR SG + K PLSQYEF PYQ               +S SGTSSPFP
Sbjct: 178  PFAQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFP 237

Query: 955  DKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSGSLTPNDWGSRLDSGTL---------- 1104
             K PI+EFR GE PKFLGYE+F   KW SRVGSGSLTP+ WGSRL SGTL          
Sbjct: 238  GKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLG 297

Query: 1105 ----TPNGGEPPSRDNKVLENQIYEVASLANSDRKSQ-NEDLVDHRVSFELFGEDIPTCI 1269
                TPNGGEPPSRD+ +LE QI EVASLANSD  S+  E ++DHRVSFEL GED+P+C 
Sbjct: 298  SGTVTPNGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCR 357

Query: 1270 VRAPSLKNAPGYLQEEIV-----EGTSKRDLLASKNANSFREHFNGEIVNEEEFYQKHRT 1434
             + P + ++   L  ++      E  S   +   K   S R+        E++ ++KHR 
Sbjct: 358  EKEPVMSHSQQTLPMDVSNLLANEMKSGSSMAEEKTYGSPRKASES---GEDQCHRKHRN 414

Query: 1435 ISLGSSKDFNF 1467
            I+ GSSKDF+F
Sbjct: 415  ITFGSSKDFDF 425


>ref|XP_010317637.1| PREDICTED: uncharacterized protein LOC101260903 isoform X3 [Solanum
            lycopersicum]
          Length = 469

 Score =  459 bits (1180), Expect = e-155
 Identities = 253/431 (58%), Positives = 292/431 (67%), Gaps = 23/431 (5%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLI 423
            MSSV N+              ESRVQPSTVQ RRWGSCWS+YWCFGS+K SKRIGHAVL+
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQ-RRWGSCWSLYWCFGSHKHSKRIGHAVLV 59

Query: 424  SEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLSL--- 594
             EP AP    P++EN NHS+   TIV+PFI        FL SDPPSAT SP GLLSL   
Sbjct: 60   PEPVAPGPAVPVTENPNHSA---TIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKAL 116

Query: 595  SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEV 774
            S++A+SPGGTA IF IGPYAHETQLVSPPVFSTFTTEPSTA+FTPPPEPV MTT  SPEV
Sbjct: 117  SINAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEV 176

Query: 775  PFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQYXXXXXXXXXXXXXAMSTSGTSSPFP 954
            PFAQLL+SSLARNRR SG + K PLSQYEF PYQ               +S SGTSSPFP
Sbjct: 177  PFAQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFP 236

Query: 955  DKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSGSLTPNDWGSRLDSGTL---------- 1104
             K PI+EFR GE PKFLGYE+F   KW SRVGSGS+TP+ WGSRL SGTL          
Sbjct: 237  GKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLG 296

Query: 1105 ----TPNGGEPPSRDNKVLENQIYEVASLANSDRKSQ-NEDLVDHRVSFELFGEDIPTCI 1269
                TPNGGEPPSRD+ +LENQI EVASLANSD  S+  E ++DHRVSFEL  ED+P+C 
Sbjct: 297  SGTVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCR 356

Query: 1270 VRAPSLKNAPGYLQEEI-----VEGTSKRDLLASKNANSFREHFNGEIVNEEEFYQKHRT 1434
             + P + ++   L  ++      E  S   +   K   S R+        E+E ++KHR 
Sbjct: 357  EKEPVMSHSQPTLPMDVSNLLASEMRSGSSMAEEKTYGSPRKASES---GEDECHRKHRN 413

Query: 1435 ISLGSSKDFNF 1467
            I+ GSSKDF+F
Sbjct: 414  ITFGSSKDFDF 424


>emb|CDP05166.1| unnamed protein product [Coffea canephora]
          Length = 452

 Score =  452 bits (1162), Expect = e-153
 Identities = 249/426 (58%), Positives = 281/426 (65%), Gaps = 18/426 (4%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLI 423
            MSSVHNS              ESRVQP TVQKRRWGSCWS YWCFGS K SKRIG+AVL+
Sbjct: 1    MSSVHNSVETVNAAATAIVTAESRVQPPTVQKRRWGSCWSFYWCFGSVKNSKRIGNAVLV 60

Query: 424  SEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLL---SL 594
             EP+ P    P+ +N NHS+   TIV+PFI        FLQSDPPSAT SP   L   S 
Sbjct: 61   PEPTVPGSAVPVPDNLNHSA---TIVIPFIAPPSSPASFLQSDPPSATQSPAKFLPLASF 117

Query: 595  SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEV 774
            SV+ +SP G A IF IGPYAHETQLVSPPVFS FTTEPSTASFTPPPEPVQ+TT SSPEV
Sbjct: 118  SVNTYSPSGAASIFAIGPYAHETQLVSPPVFSAFTTEPSTASFTPPPEPVQLTTPSSPEV 177

Query: 775  PFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQYXXXXXXXXXXXXXAMSTSGTSSPFP 954
            PFAQLL SSL  NRR SG  +K PLSQYEFQPYQ              A+S SGTSSPFP
Sbjct: 178  PFAQLLVSSLTHNRRHSGTSIKFPLSQYEFQPYQCPGSPGSHLISPGSAISNSGTSSPFP 237

Query: 955  DKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSGSLTPNDWG--------------SRLD 1092
            +KRPI+EFR+GEAPKFLGYE F   KW SRVGSGSLTPN WG              SRL 
Sbjct: 238  EKRPIIEFRIGEAPKFLGYELFTR-KWGSRVGSGSLTPNGWGSRLGSGSLTPNGGISRLG 296

Query: 1093 SGTLTPNGGEPPSRDNKVLENQIYEVASLANSDRKSQNED-LVDHRVSFELFGEDIPTCI 1269
            SGTLTPNGGEP +RD+ +LENQI EVASLANSD  + NE+ L+DHRVSFEL  E +P C+
Sbjct: 297  SGTLTPNGGEPAARDSYLLENQISEVASLANSDNGTHNEEGLMDHRVSFELTAEHVPNCV 356

Query: 1270 VRAPSLKNAPGYLQEEIVEGTSKRDLLASKNANSFREHFNGEIVNEEEFYQKHRTISLGS 1449
                         +EE+       D       N  R+  +G+    ++  + +RT SLGS
Sbjct: 357  -------------EEEMKGQNFCEDCTGDSIHNITRKALDGQ--EGKQCLKNNRTFSLGS 401

Query: 1450 SKDFNF 1467
            SKDFNF
Sbjct: 402  SKDFNF 407


>ref|XP_011076135.1| PREDICTED: uncharacterized protein LOC105160458 [Sesamum indicum]
          Length = 466

 Score =  451 bits (1161), Expect = e-152
 Identities = 262/442 (59%), Positives = 299/442 (67%), Gaps = 34/442 (7%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLI 423
            M+SVHNS              E+R QPSTVQKRRWGSCWS+YWCFGSYK SKRIGHAVLI
Sbjct: 1    MTSVHNSAETLNAAATAIVTAENRAQPSTVQKRRWGSCWSLYWCFGSYKHSKRIGHAVLI 60

Query: 424  SEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLSL--- 594
            SEP+A     P+ EN N S+   T++LPFI        FLQSDPPSAT S  GL+SL   
Sbjct: 61   SEPTAQVAVAPVVENLNRSA---TLMLPFIAPPSSPASFLQSDPPSATQSAAGLVSLAAL 117

Query: 595  SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEV 774
            SVH +SPGGTAPIFTIGPYA+ETQLVSPPVFS FTTEPSTASFTPPPEPVQMTT SSPEV
Sbjct: 118  SVHTYSPGGTAPIFTIGPYAYETQLVSPPVFSAFTTEPSTASFTPPPEPVQMTTPSSPEV 177

Query: 775  PFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQYXXXXXXXXXXXXXAMSTSGTSSPFP 954
            PFAQLLSSSLARNRR SG ++K  LSQYEF  Y+              A+S+SGTSSPFP
Sbjct: 178  PFAQLLSSSLARNRRNSG-NMKSSLSQYEFLAYE----------SPGSALSSSGTSSPFP 226

Query: 955  DKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSG--------------SLTPN------- 1071
            DK P++E R GEAP F+GYE+F N+KW SRVGSG              +LTPN       
Sbjct: 227  DKWPVVEIRRGEAPIFIGYEHFFNHKWGSRVGSGSLTPNGRGSRLGSGALTPNGGLSRLG 286

Query: 1072 ------DWG-SRLDSGTLTPNGGEPPSRDNKVLENQIYEVASLANSDRKSQNED-LVDHR 1227
                  + G SRL SG LTPNGGEPPSRD  +L N I EV SLANS  + QN D +VDHR
Sbjct: 287  SGALTPNGGLSRLGSGALTPNGGEPPSRDCNLLGNPISEVVSLANSGNELQNCDAVVDHR 346

Query: 1228 VSFELFGEDIPTCIV--RAPSLKNAPGYLQEEIVEGTSKRDLLASKNANSFREHFNGEIV 1401
            VSFEL GEDIPTC+V    PS K     LQE   E T+  D +A K + ++R+  NGE +
Sbjct: 347  VSFELSGEDIPTCVVSETVPSPKMESRDLQEATAEVTNHSDFMA-KVSETYRKLSNGETM 405

Query: 1402 NEEEFYQKHRTISLGSSKDFNF 1467
            +E      + TISLGSS+DFNF
Sbjct: 406  HE------NHTISLGSSRDFNF 421


>ref|XP_009788654.1| PREDICTED: uncharacterized protein LOC104236433 isoform X2 [Nicotiana
            sylvestris]
          Length = 442

 Score =  446 bits (1148), Expect = e-151
 Identities = 240/400 (60%), Positives = 282/400 (70%), Gaps = 21/400 (5%)
 Frame = +1

Query: 331  VQKRRWGSCWSIYWCFGSYKQSKRIGHAVLISEPSAPRITPPISENRNHSSNSSTIVLPF 510
            +QKRRWGSCWS+YWCFGSYK SKRIGHAVL+ EP+AP    P++EN N S+   TIV+PF
Sbjct: 2    MQKRRWGSCWSLYWCFGSYKHSKRIGHAVLVPEPAAPGPAVPVTENPNRSA---TIVIPF 58

Query: 511  IXXXXXXXXFLQSDPPSATHSPGGLLSL---SVHAHSPGGTAPIFTIGPYAHETQLVSPP 681
            I        FL SDPPSAT SP GLLSL   S++A+SPGGTA IF IGPYAHETQLVSPP
Sbjct: 59   IAPPSSPASFLPSDPPSATQSPAGLLSLKSFSINAYSPGGTASIFAIGPYAHETQLVSPP 118

Query: 682  VFSTFTTEPSTASFTPPPEPVQMTTLSSPEVPFAQLLSSSLARNRRKSGPHLKCPLSQYE 861
            VFSTFTTEPSTA+FTPPPEPV MTT  SPEVPFAQLL+SSLARNRR SG + K PLSQYE
Sbjct: 119  VFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYE 178

Query: 862  FQPYQYXXXXXXXXXXXXXAMSTSGTSSPFPDKRPIMEFRVGEAPKFLGYEYFPNYKWDS 1041
            F PYQ               +S SGTSSPFP K PI+EFR GE PKFLGYE+F   KW S
Sbjct: 179  FVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRKWGS 238

Query: 1042 RVGSGSLTPNDWGSRLDSGTL--------------TPNGGEPPSRDNKVLENQIYEVASL 1179
            RVGSGSLTP+ WGSRL SGTL              TPNGGEPPSRD  +LENQI EVASL
Sbjct: 239  RVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDCYLLENQISEVASL 298

Query: 1180 ANSDRKSQ-NEDLVDHRVSFELFGEDIPTCIVRAPSLKNAPGYLQEEIVEGTSKRDLLAS 1356
            ANSD  S+  E ++DHRVSFEL GED+P+C  + P + ++   L  + V   S +++ +S
Sbjct: 299  ANSDNGSEIAEGVIDHRVSFELTGEDVPSCREKEPVMSHSQQTLPMD-VPAPSNKEMRSS 357

Query: 1357 KNANSFREHFNGEIVNE---EEFYQKHRTISLGSSKDFNF 1467
             +    +     E  +E   ++ ++KHR I+ GSSKDF+F
Sbjct: 358  SSIVEEKTDGLPEKASERGDDQCHRKHRNITFGSSKDFDF 397


>ref|XP_010317636.1| PREDICTED: uncharacterized protein LOC101260903 isoform X1 [Solanum
            lycopersicum]
          Length = 476

 Score =  441 bits (1135), Expect = e-148
 Identities = 238/400 (59%), Positives = 277/400 (69%), Gaps = 23/400 (5%)
 Frame = +1

Query: 337  KRRWGSCWSIYWCFGSYKQSKRIGHAVLISEPSAPRITPPISENRNHSSNSSTIVLPFIX 516
            +RRWGSCWS+YWCFGS+K SKRIGHAVL+ EP AP    P++EN NHS+   TIV+PFI 
Sbjct: 38   ERRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPVAPGPAVPVTENPNHSA---TIVIPFIA 94

Query: 517  XXXXXXXFLQSDPPSATHSPGGLLSL---SVHAHSPGGTAPIFTIGPYAHETQLVSPPVF 687
                   FL SDPPSAT SP GLLSL   S++A+SPGGTA IF IGPYAHETQLVSPPVF
Sbjct: 95   PPSSPASFLPSDPPSATQSPAGLLSLKALSINAYSPGGTASIFAIGPYAHETQLVSPPVF 154

Query: 688  STFTTEPSTASFTPPPEPVQMTTLSSPEVPFAQLLSSSLARNRRKSGPHLKCPLSQYEFQ 867
            STFTTEPSTA+FTPPPEPV MTT  SPEVPFAQLL+SSLARNRR SG + K PLSQYEF 
Sbjct: 155  STFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYEFV 214

Query: 868  PYQYXXXXXXXXXXXXXAMSTSGTSSPFPDKRPIMEFRVGEAPKFLGYEYFPNYKWDSRV 1047
            PYQ               +S SGTSSPFP K PI+EFR GE PKFLGYE+F   KW SRV
Sbjct: 215  PYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRKWGSRV 274

Query: 1048 GSGSLTPNDWGSRLDSGTL--------------TPNGGEPPSRDNKVLENQIYEVASLAN 1185
            GSGS+TP+ WGSRL SGTL              TPNGGEPPSRD+ +LENQI EVASLAN
Sbjct: 275  GSGSVTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLENQISEVASLAN 334

Query: 1186 SDRKSQ-NEDLVDHRVSFELFGEDIPTCIVRAPSLKNAPGYLQEEI-----VEGTSKRDL 1347
            SD  S+  E ++DHRVSFEL  ED+P+C  + P + ++   L  ++      E  S   +
Sbjct: 335  SDNGSEIGEAVIDHRVSFELTEEDVPSCREKEPVMSHSQPTLPMDVSNLLASEMRSGSSM 394

Query: 1348 LASKNANSFREHFNGEIVNEEEFYQKHRTISLGSSKDFNF 1467
               K   S R+        E+E ++KHR I+ GSSKDF+F
Sbjct: 395  AEEKTYGSPRKASES---GEDECHRKHRNITFGSSKDFDF 431


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  404 bits (1039), Expect = e-134
 Identities = 234/444 (52%), Positives = 279/444 (62%), Gaps = 36/444 (8%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLI 423
            M SV++S              +SRVQP+TVQK+RWGSCW +YWCFGS K SKRIGHAVL+
Sbjct: 1    MRSVNDSVETVNAAATAIVSADSRVQPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLV 60

Query: 424  SEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLSL--- 594
             EP  P  +   +EN    SN + I+LPFI        FLQSDPPSAT SP GLLSL   
Sbjct: 61   PEPVVPGASVSTAEN---VSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSL 117

Query: 595  SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEV 774
            SV+A+SP G A IF IGPYAHETQLV+PPVFS  TTEPSTA FTPPPE VQ+TT SSPEV
Sbjct: 118  SVNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEV 177

Query: 775  PFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQ-YXXXXXXXXXXXXXAMSTSGTSSPF 951
            PFAQLL+SSL R RR SG + K  LS YEFQ YQ Y             A+S SGTSSPF
Sbjct: 178  PFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPF 237

Query: 952  PDKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSGSLTPN------------------DW 1077
            PD+RPI+EFR+GEAPK LG+E F   KW SR+GSGSLTP+                    
Sbjct: 238  PDRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGL 297

Query: 1078 GSRLDSGTLTPNGGEPPSRDNKVLENQIYEVASLANSDRKSQN-EDLVDHRVSFELFGED 1254
            GSRL SG+LTP+G  P SRD  ++ +QI EVA LAN     +N E +VDHRVSFEL GED
Sbjct: 298  GSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGED 357

Query: 1255 IPTCIVRAPSL--KNAPGYLQEEIVEGTSKRD-----------LLASKNANSFREHFNGE 1395
            +  C+     L  +    Y ++ + EG  +RD           L   + +N   E  +GE
Sbjct: 358  VAPCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGE 417

Query: 1396 IVNEEEFYQKHRTISLGSSKDFNF 1467
               EE  YQKHR+++LGS K+FNF
Sbjct: 418  -AEEEHSYQKHRSVTLGSIKEFNF 440


>ref|XP_011470333.1| PREDICTED: uncharacterized protein LOC101312100 isoform X1 [Fragaria
            vesca subsp. vesca]
          Length = 499

 Score =  403 bits (1035), Expect = e-133
 Identities = 227/433 (52%), Positives = 283/433 (65%), Gaps = 46/433 (10%)
 Frame = +1

Query: 307  ESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLISEPSAPRITPPISENRNHSSN 486
            E+RVQPS+V KRRWGSCWS+YWCFG  K SKRIGHAVL+ EP+ P +  P +EN+   ++
Sbjct: 25   EARVQPSSVPKRRWGSCWSLYWCFGYQKNSKRIGHAVLVPEPTVPGVAVPAAENQ---TS 81

Query: 487  SSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLS---LSVHAHSPGGTAPIFTIGPYAH 657
            S++IVLPFI        FL S+PPS+T SPGG +S   LS +A+SPGG   +FTIGPYA+
Sbjct: 82   STSIVLPFIAPPSSPASFLPSEPPSSTQSPGGFMSFAALSANAYSPGGALSMFTIGPYAY 141

Query: 658  ETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEVPFAQLLSSSLARNRRKSGPHL 837
            ETQLVSPPVFSTF TEPSTA +TPPPE VQ+TT SSPEVPFAQLL+SSL R+RR SG   
Sbjct: 142  ETQLVSPPVFSTFNTEPSTAPYTPPPESVQLTTPSSPEVPFAQLLTSSLDRSRRHSGGSQ 201

Query: 838  KCPLSQYEFQPY-QYXXXXXXXXXXXXXAMSTSGTSSPFPDKRPIMEFRVGEAPKFLGYE 1014
            K  LS  EFQPY QY             A+S SGTSSPFPD+ P++EFR+GEAPK LG+E
Sbjct: 202  KFALSYGEFQPYQQYPGSPGGQLRSPGSAISNSGTSSPFPDRYPVLEFRMGEAPKLLGFE 261

Query: 1015 YFPNYKWDSRVGSGSLTPN--DWGSRLDSGTLTPNGGE---------------------- 1122
            +F  YKW SR+GSGSLTP+    GSRL SGTLTP+G E                      
Sbjct: 262  HFAAYKWGSRLGSGSLTPDGAGLGSRLGSGTLTPDGYELGSRLASGSMTPNGVGVGSRLG 321

Query: 1123 ----------PPSRDNKVLENQIYEVASLANSDRKSQNE-DLVDHRVSFELFGEDIPTCI 1269
                      P SR+  +LEN+I EVASLANS+ + QN+ ++ DHRVSFEL  ED+  C+
Sbjct: 322  SGCLTPDGTGPASREAGLLENKISEVASLANSESECQNDGNVFDHRVSFELTCEDVVCCL 381

Query: 1270 VRAP--SLKNAPGYLQEEIVEGTSKRDLLASKNANSFREHF-----NGEIVNEEEFYQKH 1428
               P  S K A    +++  E  ++R   +  N +S  + F     N     E+  Y+KH
Sbjct: 382  ANKPGASFKTASESSKDKSAEFPNERHGSSITNKSSAGDSFSRIPENALAEGEDHCYRKH 441

Query: 1429 RTISLGSSKDFNF 1467
            R+I+LGS+KDFNF
Sbjct: 442  RSITLGSTKDFNF 454


>gb|KHG09821.1| hypothetical protein F383_13171 [Gossypium arboreum]
          Length = 465

 Score =  400 bits (1029), Expect = e-132
 Identities = 230/431 (53%), Positives = 276/431 (64%), Gaps = 23/431 (5%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLI 423
            M SV++S              ESRVQP+TVQK+RWGSCWS YWCFGS+K SKRIGHAVL+
Sbjct: 1    MRSVNDSVETVNAAASAIVSAESRVQPTTVQKKRWGSCWSFYWCFGSHKSSKRIGHAVLV 60

Query: 424  SEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLSL--- 594
             EP  P  +   +EN   +SN + IV+PFI        FLQSDPPSAT SP GLLSL   
Sbjct: 61   PEPVVPGASVSTAEN---ASNPTGIVMPFIAPPSSPASFLQSDPPSATQSPAGLLSLTAL 117

Query: 595  SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEV 774
            SV+A+SP G A IF+IGPYAHETQLV+PPVFS  TTEPSTA FTPPPE VQ+TT SSPEV
Sbjct: 118  SVNAYSPRGPASIFSIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEV 177

Query: 775  PFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQ-YXXXXXXXXXXXXXAMSTSGTSSPF 951
            PFAQLL+SSL R RR SG + K  LS YEFQ YQ Y              +S SGTSSPF
Sbjct: 178  PFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSVISNSGTSSPF 237

Query: 952  PDKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSGSLTPN------------------DW 1077
            PD+RPI+EFR+GEAPK LG+E+F   KW SR+GSGSLTP+                    
Sbjct: 238  PDRRPILEFRMGEAPKTLGFEHFTTRKWGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGL 297

Query: 1078 GSRLDSGTLTPNGGEPPSRDNKVLENQIYEVASLANSDRKSQNED-LVDHRVSFELFGED 1254
            GSRL SG+LTP+G  P SRD   +E+Q  EVA L+N     +N++ +VDHRVSFEL GED
Sbjct: 298  GSRLGSGSLTPDGLGPASRDGFPIESQNSEVALLSNPPNGPKNDEIIVDHRVSFELSGED 357

Query: 1255 IPTCIVRAPSLKNAPGYLQEEIVEGTSKRDLLASKNANSFREHFNGEIVNEEEFYQKHRT 1434
            +  C      LKN        + +    +DL+A        E  +GE   E+  YQKHR+
Sbjct: 358  VARC------LKNKSLVSSRTMPDYEYPKDLVAQGRIEK-DEKVSGE-AEEDHCYQKHRS 409

Query: 1435 ISLGSSKDFNF 1467
            ++LGS K+FNF
Sbjct: 410  VTLGSIKEFNF 420


>ref|XP_008234199.1| PREDICTED: uncharacterized protein LOC103333182 [Prunus mume]
          Length = 499

 Score =  401 bits (1030), Expect = e-132
 Identities = 231/460 (50%), Positives = 284/460 (61%), Gaps = 52/460 (11%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLI 423
            M SV++S              E+R QP+TV KRRWGSCWS+YWCFGS+K +KRIGHAVL+
Sbjct: 1    MRSVNSSVDTINAAATAIVSAEARAQPTTVPKRRWGSCWSLYWCFGSHK-NKRIGHAVLV 59

Query: 424  SEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLSL--- 594
             EP  P       +N+   + S+ IV+PFI        FL SDPPSAT SP G LSL   
Sbjct: 60   PEPVVPGAAVSAIDNQ---TTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSL 116

Query: 595  SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEV 774
            S +A+SPGG A IF+IGPYA+ETQLVSPPVFSTF TEPSTA FTPPPE VQ+TT SSPEV
Sbjct: 117  SANAYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEV 176

Query: 775  PFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQ-YXXXXXXXXXXXXXAMSTSGTSSPF 951
            PFAQLL+SSL RNRR SG + K  LS YEFQPYQ Y             A+S SGTSSPF
Sbjct: 177  PFAQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPF 236

Query: 952  PDKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSGSLTP------------------NDW 1077
            PD+ P++EF +GEAPK  G+++F   KW SR+GSGSLTP                  N+ 
Sbjct: 237  PDRHPVLEFHMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNEL 296

Query: 1078 GSRLDSGTLTPNGGE----------------PPSRDNKVLENQIYEVASLANSDRKSQN- 1206
            GSRL SG +TPNG                  P SRD+ +LENQI EVASLANS+   Q  
Sbjct: 297  GSRLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTV 356

Query: 1207 EDLVDHRVSFELFGEDIPTCIVRAPSLKNAPGYLQEEIV--EGTSKRDLLASKNAN---- 1368
            E + DHRVSFEL GED+  C+       N       +++  +  S+RD L+S ++N    
Sbjct: 357  ETVFDHRVSFELTGEDVACCLANKAMASNRTASGSSKVIASDYPSERDALSSDSSNHCEF 416

Query: 1369 -------SFREHFNGEIVNEEEFYQKHRTISLGSSKDFNF 1467
                      E+ +GE   E++ Y+KHR+I+LGS+KDFNF
Sbjct: 417  SVEESSSRIPENVSGE--GEDQGYRKHRSITLGSTKDFNF 454


>ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica]
            gi|462415503|gb|EMJ20240.1| hypothetical protein
            PRUPE_ppa004616mg [Prunus persica]
          Length = 499

 Score =  401 bits (1030), Expect = e-132
 Identities = 234/460 (50%), Positives = 285/460 (61%), Gaps = 52/460 (11%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLI 423
            M SV++S              E+R QP+TV KRRWGSCWS+YWCFG +K +KRIGHAVL+
Sbjct: 1    MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFGPHK-NKRIGHAVLV 59

Query: 424  SEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLSL--- 594
             EP  P       +N+   + S+ IV+PFI        FL SDPPSAT SP G LSL   
Sbjct: 60   PEPVVPGAAVSAIDNQ---TTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSL 116

Query: 595  SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEV 774
            S +A+SPGG A IF+IGPYA+ETQLVSPPVFSTF TEPSTA FTPPPE VQ+TT SSPEV
Sbjct: 117  SANAYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEV 176

Query: 775  PFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQ-YXXXXXXXXXXXXXAMSTSGTSSPF 951
            PFAQLL+SSL RNRR SG + K  LS YEFQPYQ Y             A+S SGTSSPF
Sbjct: 177  PFAQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPF 236

Query: 952  PDKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSGSLTP------------------NDW 1077
            PD+ P++EFR+GEAPK  G+++F   KW SR+GSGSLTP                  N+ 
Sbjct: 237  PDRHPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNEL 296

Query: 1078 GSRLDSGTLTPNGGE----------------PPSRDNKVLENQIYEVASLANSDRKSQN- 1206
            GSRL SG +TPNG                  P SRD+ +LENQI EVASLANS+   Q  
Sbjct: 297  GSRLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTV 356

Query: 1207 EDLVDHRVSFELFGEDIPTCIVR--APSLKNAPGYLQEEIVEGTSKRDLLASKNAN---- 1368
            E + DHRVSFEL GED+  C+      S + A G  +    E  S+RD L+S ++N    
Sbjct: 357  ETVFDHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCEF 416

Query: 1369 -------SFREHFNGEIVNEEEFYQKHRTISLGSSKDFNF 1467
                      E+ +GE   E++ Y+KHR+I+LGS+KDFNF
Sbjct: 417  SVEESSSRIPENVSGE--GEDQGYRKHRSITLGSTKDFNF 454


>ref|XP_012440132.1| PREDICTED: uncharacterized protein LOC105765522 [Gossypium raimondii]
            gi|763785675|gb|KJB52746.1| hypothetical protein
            B456_008G275500 [Gossypium raimondii]
          Length = 465

 Score =  399 bits (1024), Expect = e-132
 Identities = 230/431 (53%), Positives = 273/431 (63%), Gaps = 23/431 (5%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLI 423
            M SV++S              ESRVQP+TVQK+RWGSCWS YWCFGS+K SKRIGHAVL+
Sbjct: 1    MRSVNDSVETVNAAASAIVSAESRVQPTTVQKKRWGSCWSFYWCFGSHKSSKRIGHAVLV 60

Query: 424  SEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLSL--- 594
             EP  P      +EN   +SN + IV+PFI        FLQSDPPSAT SP GLLSL   
Sbjct: 61   PEPVVPGALVSTAEN---ASNPTGIVMPFIAPPSSPASFLQSDPPSATQSPAGLLSLTAL 117

Query: 595  SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEV 774
            SV+A+SP G A IF IGPYAHETQLV+PPVFS  TTEPSTA FTPPPE VQ+TT SSPEV
Sbjct: 118  SVNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEV 177

Query: 775  PFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQ-YXXXXXXXXXXXXXAMSTSGTSSPF 951
            PFAQLL+SSL R RR SG + K  LS YEFQ YQ Y              +S SGTSSPF
Sbjct: 178  PFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSVISNSGTSSPF 237

Query: 952  PDKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSGSLTPN------------------DW 1077
            PD+RPI+EFR+GEAPK LG+E+F   KW SR+GSGSLTP+                    
Sbjct: 238  PDRRPILEFRMGEAPKTLGFEHFTTRKWGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGL 297

Query: 1078 GSRLDSGTLTPNGGEPPSRDNKVLENQIYEVASLANSDRKSQNED-LVDHRVSFELFGED 1254
            GSRL SG+LTP+G  P SRD   +E+Q  EVA L+N     +N++ +VDHRVSFEL GED
Sbjct: 298  GSRLGSGSLTPDGLGPASRDGFPIESQNSEVALLSNPPNGPKNDEIIVDHRVSFELSGED 357

Query: 1255 IPTCIVRAPSLKNAPGYLQEEIVEGTSKRDLLASKNANSFREHFNGEIVNEEEFYQKHRT 1434
            +  C      LKN        + +     DL+A        E  +GE   E+  YQKHR+
Sbjct: 358  VARC------LKNKSLVSSRTMPDYEYPNDLVAQGRIEK-DEKVSGE-AEEDHCYQKHRS 409

Query: 1435 ISLGSSKDFNF 1467
            ++LGS K+FNF
Sbjct: 410  VTLGSIKEFNF 420


>ref|XP_012090213.1| PREDICTED: uncharacterized protein LOC105648441 [Jatropha curcas]
            gi|643706116|gb|KDP22248.1| hypothetical protein
            JCGZ_26079 [Jatropha curcas]
          Length = 498

 Score =  399 bits (1026), Expect = e-132
 Identities = 231/458 (50%), Positives = 279/458 (60%), Gaps = 50/458 (10%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQKRRWGSCWSIYWCFGSYKQSKRIGHAVLI 423
            M SV+NS              ESRVQP+ VQKRRWG CWS+YWCFGS+K SKRIGHAVL+
Sbjct: 1    MRSVNNSVETINAAATAIISAESRVQPTVVQKRRWGGCWSLYWCFGSHKNSKRIGHAVLV 60

Query: 424  SEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLSL--- 594
             EP  P+     +EN+ HS+ ++   +PFI        FLQSDPPS T SP GLLSL   
Sbjct: 61   PEPEVPQAVVTSAENQTHSTAAA---VPFIAPPSSPASFLQSDPPSVTQSPAGLLSLTAL 117

Query: 595  SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLSSPEV 774
            SV A+SPGG A IF IGPYAHETQLV+PPVFS FTTEPSTA FTPPPE VQ+TT SSPEV
Sbjct: 118  SVSAYSPGGPASIFAIGPYAHETQLVTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEV 177

Query: 775  PFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQ-YXXXXXXXXXXXXXAMSTSGTSSPF 951
            PFAQLL+SSL R RR SG + K  LS YEFQ Y  Y              +S SGTSSPF
Sbjct: 178  PFAQLLTSSLERARRNSGANQKFALSHYEFQSYPLYPGSPGGQLISPGSIISNSGTSSPF 237

Query: 952  PDKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSGSLTPND------------------W 1077
            PD+ P++EFR+GEAPK LG+E+F   KW SR+GSG+LTP+                    
Sbjct: 238  PDRHPLLEFRMGEAPKLLGFEHFTTRKWGSRLGSGTLTPDGVGLGSRLCSGTATPDGVGL 297

Query: 1078 GSRLDSGTLTPNG----------------GEPPSRDNKVLENQIYEVASLANSDRKSQN- 1206
            GSRL SG++TP+G                  P S+D  +LENQI EVASLANS+  S+N 
Sbjct: 298  GSRLGSGSVTPDGVGLRSRLGSGSLTPDCVVPASQDGLLLENQISEVASLANSENASKND 357

Query: 1207 EDLVDHRVSFELFGEDIPTCI-----------VRAPSLKNAPGYLQEEIVEGTSKRDLLA 1353
            E++VDHRVSFEL GE++  C+              P    A   +  E +   S   L  
Sbjct: 358  ENIVDHRVSFELSGEEVARCLESKSMTSSRTFSECPQDSMAEEQINSEEILINSNDCLHI 417

Query: 1354 SKNANSFREHFNGEIVNEEEFYQKHRTISLGSSKDFNF 1467
             + +N   E  +GE   EE  Y+KHR+I+LGS K+FNF
Sbjct: 418  GETSNETPEKPSGE-TEEEPCYRKHRSITLGSIKEFNF 454


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  399 bits (1024), Expect = e-131
 Identities = 234/448 (52%), Positives = 279/448 (62%), Gaps = 40/448 (8%)
 Frame = +1

Query: 244  MSSVHNSXXXXXXXXXXXXXXESRVQPSTVQ----KRRWGSCWSIYWCFGSYKQSKRIGH 411
            M SV++S              +SRVQP+TVQ    K+RWGSCW +YWCFGS K SKRIGH
Sbjct: 1    MRSVNDSVETVNAAATAIVSADSRVQPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGH 60

Query: 412  AVLISEPSAPRITPPISENRNHSSNSSTIVLPFIXXXXXXXXFLQSDPPSATHSPGGLLS 591
            AVL+ EP  P  +   +EN    SN + I+LPFI        FLQSDPPSAT SP GLLS
Sbjct: 61   AVLVPEPVVPGASVSTAEN---VSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLS 117

Query: 592  L---SVHAHSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTLS 762
            L   SV+A+SP G A IF IGPYAHETQLV+PPVFS  TTEPSTA FTPPPE VQ+TT S
Sbjct: 118  LTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPS 177

Query: 763  SPEVPFAQLLSSSLARNRRKSGPHLKCPLSQYEFQPYQ-YXXXXXXXXXXXXXAMSTSGT 939
            SPEVPFAQLL+SSL R RR SG + K  LS YEFQ YQ Y             A+S SGT
Sbjct: 178  SPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGT 237

Query: 940  SSPFPDKRPIMEFRVGEAPKFLGYEYFPNYKWDSRVGSGSLTPN---------------- 1071
            SSPFPD+RPI+EFR+GEAPK LG+E F   KW SR+GSGSLTP+                
Sbjct: 238  SSPFPDRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPD 297

Query: 1072 --DWGSRLDSGTLTPNGGEPPSRDNKVLENQIYEVASLANSDRKSQN-EDLVDHRVSFEL 1242
                GSRL SG+LTP+G  P SRD  ++ +QI EVA LAN     +N E +VDHRVSFEL
Sbjct: 298  GMGLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFEL 357

Query: 1243 FGEDIPTCIVRAPSL--KNAPGYLQEEIVEGTSKRD-----------LLASKNANSFREH 1383
             GED+  C+     L  +    Y ++ + EG  +RD           L   + +N   E 
Sbjct: 358  SGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEK 417

Query: 1384 FNGEIVNEEEFYQKHRTISLGSSKDFNF 1467
             +GE   EE  YQKHR+++LGS K+FNF
Sbjct: 418  ASGE-AEEEHSYQKHRSVTLGSIKEFNF 444

