BLASTX nr result

ID: Achyranthes23_contig00003437 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00003437
         (1589 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   381   e-103
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   380   e-102
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   369   2e-99
gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus pe...   364   5e-98
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   348   4e-93
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   348   4e-93
gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [...   337   9e-90
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     336   2e-89
emb|CBI34651.3| unnamed protein product [Vitis vinifera]              336   2e-89
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   332   3e-88
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   328   4e-87
gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein i...   316   2e-83
gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein i...   316   2e-83
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   316   2e-83
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   315   3e-83
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   315   3e-83
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   311   7e-82
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   308   6e-81
ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Popu...   294   8e-77
ref|XP_006580574.1| PREDICTED: uncharacterized protein LOC100791...   292   2e-76

>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
            gi|557541785|gb|ESR52763.1| hypothetical protein
            CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  381 bits (978), Expect = e-103
 Identities = 218/415 (52%), Positives = 259/415 (62%), Gaps = 21/415 (5%)
 Frame = -1

Query: 1583 WCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXXXX 1404
            WCFG QK  KR GHA LVPEP  T + +  +   NSTQ  A+ +                
Sbjct: 51   WCFGFQKHRKRIGHAVLVPEP--TASRSNASEAVNSTQATAISLPFVAPPSSPASFLQSE 108

Query: 1403 XXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPSTAPF 1224
                 QSPA  V LNS+S NMYSPGGP SIFAIGPYAHE QLV+PPVFST+TTEPSTAPF
Sbjct: 109  PPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPF 168

Query: 1223 TPPPESVHLTTPSSPEVPFARLLDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPMGPLIXX 1044
            TPPPESVHLTTPSSPEVPFA+LLDP    G+ G +FP S+++FQSY LHPGSP+G LI  
Sbjct: 169  TPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISP 228

Query: 1043 XXXXXXXXXXXPFPD-------PYFPDIRHGVPQKLLTLEVGPMREWDSRQASGSGTPDA 885
                       PFPD       P FPD   G P KLL L+   +REW SRQ SG+ TPDA
Sbjct: 229  SSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLTPDA 288

Query: 884  IQPKSRGHQVLLDRQDSVVSALPHLPNGHRNNDLLIDHRVSFELTSEDVVRCVEKKPMAL 705
            ++   R +    +RQ S V+  PH  NG R  D ++DHRVSFELT+EDVVRCVEKKP  L
Sbjct: 289  VRSTPR-NGFFQNRQISEVALRPHSENGLR-KDQIVDHRVSFELTTEDVVRCVEKKPTTL 346

Query: 704  PKA----LQN-----PDERDYNANELVDDSASSTGTSEKGQTNI---EGQRHQKQRAASL 561
             +A    LQN      +E    A  +    A      E  +T +   E  RHQKQ++ +L
Sbjct: 347  AEAVSESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAPRHQKQQSITL 406

Query: 560  GSIKEFNFDHVNGLDSDKPCINSNWWANEKVIGNEDGPCKNWSFFPAMQ--PGVS 402
            GS KEFNFD  +G DS +P I S+WWANEKV+G + G  KNW+FFP +Q  PGVS
Sbjct: 407  GSTKEFNFDSADG-DSHEPTIASDWWANEKVVGKDSGAIKNWAFFPVIQPAPGVS 460


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  380 bits (975), Expect = e-102
 Identities = 218/415 (52%), Positives = 258/415 (62%), Gaps = 21/415 (5%)
 Frame = -1

Query: 1583 WCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXXXX 1404
            WCFG QK  KR GHA LVPEP  T + +  +   NSTQ  A+ +                
Sbjct: 51   WCFGFQKHRKRIGHAVLVPEP--TASRSNASEAVNSTQAAAISLPFVAPPSSPASFLQSE 108

Query: 1403 XXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPSTAPF 1224
                 QSPA  V LNS+S NMYSPGGP SIFAIGPYAHE QLV+PPVFST+TTEPSTAPF
Sbjct: 109  PPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPF 168

Query: 1223 TPPPESVHLTTPSSPEVPFARLLDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPMGPLIXX 1044
            TPPPESVHLTTPSSPEVPFA+LLDP    G+ G +FP S+++FQSY LHPGSP+G LI  
Sbjct: 169  TPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISP 228

Query: 1043 XXXXXXXXXXXPFPD-------PYFPDIRHGVPQKLLTLEVGPMREWDSRQASGSGTPDA 885
                       PFPD       P FPD   G P KLL L+   +REW SRQ SG+ TPDA
Sbjct: 229  SSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLTPDA 288

Query: 884  IQPKSRGHQVLLDRQDSVVSALPHLPNGHRNNDLLIDHRVSFELTSEDVVRCVEKKPMAL 705
            +    R +    +RQ S V+  PH  NG R  D ++DHRVSFELT+EDVVRCVEKKP  L
Sbjct: 289  VGSTPR-NGFFQNRQISEVALRPHSENGLR-KDQIVDHRVSFELTTEDVVRCVEKKPTTL 346

Query: 704  PKA----LQN-----PDERDYNANELVDDSASSTGTSEKGQTNI---EGQRHQKQRAASL 561
             +A    LQN      +E    A  +    A      E  +T +   E  RHQKQ++ +L
Sbjct: 347  AEAVSESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAPRHQKQQSITL 406

Query: 560  GSIKEFNFDHVNGLDSDKPCINSNWWANEKVIGNEDGPCKNWSFFPAMQ--PGVS 402
            GS KEFNFD  +G DS +P I S+WWANEKV+G + G  KNW+FFP +Q  PGVS
Sbjct: 407  GSTKEFNFDSADG-DSHEPTIASDWWANEKVVGKDSGAIKNWAFFPVIQPAPGVS 460


>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  369 bits (948), Expect = 2e-99
 Identities = 220/434 (50%), Positives = 259/434 (59%), Gaps = 39/434 (8%)
 Frame = -1

Query: 1586 YWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXXX 1407
            YWCF S K  KR GHA L PE     +G   A  EN TQ P + +               
Sbjct: 49   YWCFRSPK-DKRIGHAVLAPESRAPGSGVPAA--ENLTQAPTIVLPFVAPPSSPASFLQS 105

Query: 1406 XXXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPSTAP 1227
                  QSP+  + L S++ N+YSPGGP SIFAIGPYAHE QLV+PPVFST+TTEPSTAP
Sbjct: 106  EPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAP 165

Query: 1226 FTPPPESVHLTTPSSPEVPFARLLDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPMGPLIX 1047
            FTPPPESVHLTTPSSPEVPFA+L DP + NG+ G RF  S ++FQSY L+PGSP+G LI 
Sbjct: 166  FTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLIS 225

Query: 1046 XXXXXXXXXXXXPFPD--------PYFPDIRHGVPQKLLTLEVGPMREWDSRQASGSGTP 891
                        PFPD          F + R G P KLLTL+     EW SR  SGS TP
Sbjct: 226  PSSGISGSGTSSPFPDRDFVCSGSSQFLEFRAGGPPKLLTLDKLSNHEWGSRIGSGSITP 285

Query: 890  DAIQPKSRGHQVL-----------------LDRQDSVVSALPHLPNGHRNNDLLIDHRVS 762
            DA+ P SR   VL                 LDRQ S V++     +G  NN++++DHRVS
Sbjct: 286  DALGPPSRDGSVLDRQVSDVIHPPSGDDSVLDRQISDVASHSLSDSGCPNNEIMVDHRVS 345

Query: 761  FELTSEDVVRCVEKKPMALPKA----LQNPD--ERDYNANELVDDSASSTGTS-----EK 615
            FELT+EDVVRCVEK   AL KA    LQNP   E D N+ E+V DS    G +     EK
Sbjct: 346  FELTAEDVVRCVEKDSAALVKAVSASLQNPATVEIDENSREVVVDSEGRVGETANNPPEK 405

Query: 614  GQTNI---EGQRHQKQRAASLGSIKEFNFDHVNGLDSDKPCINSNWWANEKVIGNEDGPC 444
               +    EGQ H KQR+ +LGS KEFNFD+ +G  SDKP I+S+WWANEKV+G E G  
Sbjct: 406  APEDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGAS 465

Query: 443  KNWSFFPAMQPGVS 402
            KNWS F  MQP VS
Sbjct: 466  KNWSIFHMMQPSVS 479


>gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  364 bits (935), Expect = 5e-98
 Identities = 212/415 (51%), Positives = 250/415 (60%), Gaps = 19/415 (4%)
 Frame = -1

Query: 1589 MYWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXX 1410
            MYWCFG Q+  KR GHA LVPE   T  G      EN  Q P++ +              
Sbjct: 48   MYWCFGFQRHKKRIGHAVLVPET--TDRGGDAPRAENPIQTPSIVLPFVAPPSSPASFLQ 105

Query: 1409 XXXXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPSTA 1230
                   QSPA   G  SL+ +MYSP GP SIFAIGPYAHE QLV+PPVFST+TTEPSTA
Sbjct: 106  SEPPSATQSPA---GFFSLTASMYSPSGPTSIFAIGPYAHETQLVSPPVFSTFTTEPSTA 162

Query: 1229 PFTPPPESVHLTTPSSPEVPFARLLDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPMGPLI 1050
            PFTPPPESVHLTTPSSPEVPFA+LLDP   NG+ G RFP SH++FQSY L+PGSP+G LI
Sbjct: 163  PFTPPPESVHLTTPSSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQLI 222

Query: 1049 XXXXXXXXXXXXXPFPD-------PYFPDIRHGVPQKLLTLEVGPMREWDSRQASGSGTP 891
                         PFPD        +F + R G P KLL L++   R+W SR  SGS TP
Sbjct: 223  SPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGDPPKLLNLDILSTRDWGSRLGSGSVTP 282

Query: 890  DAIQPKSRGHQVLLDRQDSVVSALPHLPNGHRNNDLLIDHRVSFELTSEDVVRCVEKKPM 711
            D  +  S     LL  Q   V   P   N  RNND+ I+HRVSFEL+SE+V+RCVEKKP+
Sbjct: 283  DGAKSTS-SDGFLLKPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEEVIRCVEKKPV 341

Query: 710  ALPKALQNPDERDYNANELVDDS-----------ASSTGTSEKGQTN-IEGQRHQKQRAA 567
            AL +A+    E    A    D S            +S   +EK   +  E Q H KQR+ 
Sbjct: 342  ALAEAVSTSLEDTEKAQSKEDPSKVVSSSICPVGETSNDAAEKAVADGEEAQLHPKQRSI 401

Query: 566  SLGSIKEFNFDHVNGLDSDKPCINSNWWANEKVIGNEDGPCKNWSFFPAMQPGVS 402
            +LGS+KEFNFD+ +G DS    I S+WWANEKV   E+GP KNWSFFP MQPGVS
Sbjct: 402  TLGSVKEFNFDNPDGGDSGN-SIGSDWWANEKVDAKENGPTKNWSFFPMMQPGVS 455


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346902|gb|ERP65330.1| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  348 bits (893), Expect = 4e-93
 Identities = 203/415 (48%), Positives = 251/415 (60%), Gaps = 19/415 (4%)
 Frame = -1

Query: 1589 MYWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXX 1410
            +Y CFG QK  K+ GHA L PEP+   NG   +  EN TQ PA+ +              
Sbjct: 47   IYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPAS--ENPTQAPAVTLPFAAPPSSPASFFQ 104

Query: 1409 XXXXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPSTA 1230
                   QSPA  V L S+S +MYSP GP SIFAIGPYAHE QLV+PPVFST+TTEPSTA
Sbjct: 105  SEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTA 164

Query: 1229 PFTPPPESVHLTTPSSPEVPFARLLDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPMGPLI 1050
            PFTPPPESVHLTTPSSPEVPFA+ LDP   NGD G RFP   FDFQSY  HPGSP+G LI
Sbjct: 165  PFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFP---FDFQSYQFHPGSPVGQLI 221

Query: 1049 XXXXXXXXXXXXXPFPD-------PYFPDIRHGVPQKLLTLEVGPMREWDSRQASGSGTP 891
                         PFPD        +FP+ R G P KLL L+     EW S Q SG+ TP
Sbjct: 222  SPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTP 281

Query: 890  DAIQPKSRGHQVLLDRQDSVVSALPHLPNGHRNNDLLIDHRVSFELTSEDVVRCVEKKP- 714
            ++++  S     LL RQ S V + P   NGH+N   +++HRVSFELT+ED  RCVE+KP 
Sbjct: 282  ESVRRGS--PNFLLHRQFSDVPSRPRSGNGHKNGQ-VVNHRVSFELTAEDASRCVEEKPA 338

Query: 713  ---MALPKALQNPDE--RDYNANELVDD-----SASSTGTSEKGQTNIE-GQRHQKQRAA 567
                 +P+ ++N  +   + N+ E +         +S  + E   T+ E   +H+KQ++ 
Sbjct: 339  FSIKTVPEYVENGTQAKEEKNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHRKQQSI 398

Query: 566  SLGSIKEFNFDHVNGLDSDKPCINSNWWANEKVIGNEDGPCKNWSFFPAMQPGVS 402
            +LGS+KEFNFD+ +  DS KP  +SNWWAN  VIG E    KNWSFFP +Q GVS
Sbjct: 399  TLGSVKEFNFDNADEGDSRKPS-SSNWWANGSVIGKEGETTKNWSFFPMVQSGVS 452


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346901|gb|EEE82832.2| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  348 bits (893), Expect = 4e-93
 Identities = 203/415 (48%), Positives = 251/415 (60%), Gaps = 19/415 (4%)
 Frame = -1

Query: 1589 MYWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXX 1410
            +Y CFG QK  K+ GHA L PEP+   NG   +  EN TQ PA+ +              
Sbjct: 48   IYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPAS--ENPTQAPAVTLPFAAPPSSPASFFQ 105

Query: 1409 XXXXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPSTA 1230
                   QSPA  V L S+S +MYSP GP SIFAIGPYAHE QLV+PPVFST+TTEPSTA
Sbjct: 106  SEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTA 165

Query: 1229 PFTPPPESVHLTTPSSPEVPFARLLDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPMGPLI 1050
            PFTPPPESVHLTTPSSPEVPFA+ LDP   NGD G RFP   FDFQSY  HPGSP+G LI
Sbjct: 166  PFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFP---FDFQSYQFHPGSPVGQLI 222

Query: 1049 XXXXXXXXXXXXXPFPD-------PYFPDIRHGVPQKLLTLEVGPMREWDSRQASGSGTP 891
                         PFPD        +FP+ R G P KLL L+     EW S Q SG+ TP
Sbjct: 223  SPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTP 282

Query: 890  DAIQPKSRGHQVLLDRQDSVVSALPHLPNGHRNNDLLIDHRVSFELTSEDVVRCVEKKP- 714
            ++++  S     LL RQ S V + P   NGH+N   +++HRVSFELT+ED  RCVE+KP 
Sbjct: 283  ESVRRGS--PNFLLHRQFSDVPSRPRSGNGHKNGQ-VVNHRVSFELTAEDASRCVEEKPA 339

Query: 713  ---MALPKALQNPDE--RDYNANELVDD-----SASSTGTSEKGQTNIE-GQRHQKQRAA 567
                 +P+ ++N  +   + N+ E +         +S  + E   T+ E   +H+KQ++ 
Sbjct: 340  FSIKTVPEYVENGTQAKEEKNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHRKQQSI 399

Query: 566  SLGSIKEFNFDHVNGLDSDKPCINSNWWANEKVIGNEDGPCKNWSFFPAMQPGVS 402
            +LGS+KEFNFD+ +  DS KP  +SNWWAN  VIG E    KNWSFFP +Q GVS
Sbjct: 400  TLGSVKEFNFDNADEGDSRKPS-SSNWWANGSVIGKEGETTKNWSFFPMVQSGVS 453


>gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  337 bits (864), Expect = 9e-90
 Identities = 197/414 (47%), Positives = 248/414 (59%), Gaps = 19/414 (4%)
 Frame = -1

Query: 1589 MYWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXX 1410
            +YWCFGS K  KR G A L  E +   +G  +   EN TQ PA+ +              
Sbjct: 48   IYWCFGSYKQKKRIGPAVLTSETS--FSGANVPAAENPTQAPAIALPFVAPPSSPASFLP 105

Query: 1409 XXXXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPSTA 1230
                   QSPA  V L S+S +MYSPG P SIFAIGPYAHE QLV+PPVFST+TTEPSTA
Sbjct: 106  SEPPSATQSPAGLVSLTSISASMYSPG-PASIFAIGPYAHETQLVSPPVFSTFTTEPSTA 164

Query: 1229 PFTPPPESVHLTTPSSPEVPFARLLDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPMGPLI 1050
            PFTPPPESVHLTTPSSPEVPFA+LL P    G+   RFP SH++FQSY LHPGSP+G LI
Sbjct: 165  PFTPPPESVHLTTPSSPEVPFAQLLGPNLQYGEGVQRFPISHYEFQSYQLHPGSPVGQLI 224

Query: 1049 XXXXXXXXXXXXXPFPDP------YFPDIRHGVPQKLLTLEVGPMREWDSRQASGSGTPD 888
                         PF D       +FP+ R G P KLL L+     EW S   SG+ TPD
Sbjct: 225  SPSSGISGSGTSSPFRDGEFAASLHFPEFRMGDPPKLLNLDKHSSCEWGSHHGSGTLTPD 284

Query: 887  AIQPKSRGHQVLLDRQDSVVSALPHLPNGHRNNDLLI-DHRVSFELTSEDVVRCVEKKPM 711
            A +   R +  LLD Q S +++ PHL N    ND +  +HRVSFELT+E+VVR +E +  
Sbjct: 285  ATRSTPR-NGFLLDHQISEITSHPHLKNKEVQNDQVAHNHRVSFELTTEEVVRSLEMETA 343

Query: 710  ALPKA------LQNPDERDYNANELVDDSASSTGTS-----EKGQTNIEGQ-RHQKQRAA 567
               +A      ++   E + +  ++VDD     G +     EK   + EG+ +H K ++ 
Sbjct: 344  TPSEAVSGSLQIEATRESEEHDTKVVDDYECRVGETSNERPEKALADREGKPQHHKHQSI 403

Query: 566  SLGSIKEFNFDHVNGLDSDKPCINSNWWANEKVIGNEDGPCKNWSFFPAMQPGV 405
            +LGS KEFNFD+V+G D+ KP + S+WWAN+KV G   G  +NWSFFP MQPGV
Sbjct: 404  TLGSAKEFNFDNVDGGDAHKPILTSDWWANDKVAGKGGGVPRNWSFFPMMQPGV 457


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  336 bits (861), Expect = 2e-89
 Identities = 197/420 (46%), Positives = 242/420 (57%), Gaps = 24/420 (5%)
 Frame = -1

Query: 1589 MYWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXX 1410
            +YWCFG+ K   R GH  LVPE     N    A  ENSTQ  A+ +              
Sbjct: 50   IYWCFGTPKNRTRIGHGVLVPETAQPGNSAPRA--ENSTQTHAVILPFIAPPSSPASFLQ 107

Query: 1409 XXXXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPSTA 1230
                   QSPA  + L S+S +MYSPGGP SIFAIGPYAHE QLV+PPVFST+TTEPSTA
Sbjct: 108  SEPPSATQSPAGLLSLTSVSASMYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTA 167

Query: 1229 PFTPPPESVHLTTPSSPEVPFARLLDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPMGPLI 1050
            PFTPPPESVHLTTPSSPEVPFA+LLDP   NG+ G RFP  H +FQSY   PGSP+G LI
Sbjct: 168  PFTPPPESVHLTTPSSPEVPFAQLLDPNIHNGEPGQRFPIFHNEFQSYYFQPGSPIGQLI 227

Query: 1049 XXXXXXXXXXXXXPFPD-------PYFPDIRHGVPQKLLTLEVGPMREWDSRQASGSGTP 891
                         PFPD       P+F + R G P KLL L+     +W SRQ SGS TP
Sbjct: 228  SPSSGISGSGTSSPFPDPEFAARGPHFLEFRTGDPPKLLNLDKLSKFDWGSRQGSGSLTP 287

Query: 890  DAIQPKSRGHQVLLDRQDSVVSALPHL-PNGH-RNNDLLIDHRVSFELTSEDVVRCVEKK 717
            D+++P             S     PHL PNG  RN + + D RVSF++++EDV+R VEKK
Sbjct: 288  DSVKP------------ISTFEVAPHLKPNGRCRNAENVADRRVSFDVSTEDVIRYVEKK 335

Query: 716  PMALPKAL----------QNPDERDYNANELV--DDSASSTGTSEKGQTNIEGQ---RHQ 582
             + L +A+          Q  +  D N  E +  ++    T   E  +    G+   +HQ
Sbjct: 336  TVPLAEAMLTSLKDTTMGQREENSDSNKVEEIGCENRVGETSNEEPDKAPTSGEEVLQHQ 395

Query: 581  KQRAASLGSIKEFNFDHVNGLDSDKPCINSNWWANEKVIGNEDGPCKNWSFFPAMQPGVS 402
            K R+ +LGS KEFNFD+ +  D  K    S+WWAN+KV G E  P +NWSFFP +QPGVS
Sbjct: 396  KHRSITLGSSKEFNFDNADAGDLHKSDSVSDWWANQKVAGKEGAPSQNWSFFPMIQPGVS 455


>emb|CBI34651.3| unnamed protein product [Vitis vinifera]
          Length = 412

 Score =  336 bits (861), Expect = 2e-89
 Identities = 202/409 (49%), Positives = 237/409 (57%), Gaps = 14/409 (3%)
 Frame = -1

Query: 1586 YWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXXX 1407
            YWCF S K  KR GHA L PE     +G   A  EN TQ P + +               
Sbjct: 49   YWCFRSPK-DKRIGHAVLAPESRAPGSGVPAA--ENLTQAPTIVLPFVAPPSSPASFLQS 105

Query: 1406 XXXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPSTAP 1227
                  QSP+  + L S++ N+YSPGGP SIFAIGPYAHE QLV+PPVFST+TTEPSTAP
Sbjct: 106  EPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAP 165

Query: 1226 FTPPPESVHLTTPSSPEVPFARLLDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPMGPLIX 1047
            FTPPPESVHLTTPSSPEVPFA+L DP + NG+ G RF  S ++FQSY L+PGSP+G LI 
Sbjct: 166  FTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLIS 225

Query: 1046 XXXXXXXXXXXXPFPDPYFPDIRHGVPQKLLTLEVGPMREWDSRQASGSGTPDAIQPKSR 867
                        PFPD                              SGS TPDA+ P SR
Sbjct: 226  PSSGISGSGTSSPFPD-----------------------------RSGSITPDALGPPSR 256

Query: 866  GHQVLLDRQDSVVSALPHLPNGHRNNDLLIDHRVSFELTSEDVVRCVEKKPMALPKA--- 696
                         S L H  +G  NN++++DHRVSFELT+EDVVRCVEK   AL KA   
Sbjct: 257  DG-----------SVLDH--SGCPNNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAVSA 303

Query: 695  -LQNPD--ERDYNANELVDDSASSTGTS-----EKGQTNI---EGQRHQKQRAASLGSIK 549
             LQNP   E D N+ E+V DS    G +     EK   +    EGQ H KQR+ +LGS K
Sbjct: 304  SLQNPATVEIDENSREVVVDSEGRVGETANNPPEKAPEDANGEEGQPHHKQRSITLGSAK 363

Query: 548  EFNFDHVNGLDSDKPCINSNWWANEKVIGNEDGPCKNWSFFPAMQPGVS 402
            EFNFD+ +G  SDKP I+S+WWANEKV+G E G  KNWS F  MQP VS
Sbjct: 364  EFNFDNADGGHSDKPNISSDWWANEKVVGKEVGASKNWSIFHMMQPSVS 412


>ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum]
          Length = 443

 Score =  332 bits (851), Expect = 3e-88
 Identities = 199/413 (48%), Positives = 245/413 (59%), Gaps = 17/413 (4%)
 Frame = -1

Query: 1589 MYWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXX 1410
            MYWCFGSQK +KR GHA  +PE   T +G        S+Q P++ +              
Sbjct: 48   MYWCFGSQKQTKRIGHAVFIPET--TASGADRPSSNTSSQAPSIVLPFIAPPSSPASFLP 105

Query: 1409 XXXXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPSTA 1230
                    SP   VG   LS + YSP GP SIFAIGPYAHE QLV+PPVFS +TTEPSTA
Sbjct: 106  SEPPSATHSP---VGSKCLSMSTYSPSGPASIFAIGPYAHETQLVSPPVFSAFTTEPSTA 162

Query: 1229 PFTPPPESVHLTTPSSPEVPFARLLDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPMGPLI 1050
            PFTPPPESVHLTTPSSPEVPFA+LLDP + N   G R+P + ++FQSY L PGSP+  LI
Sbjct: 163  PFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQPGSPVSNLI 222

Query: 1049 XXXXXXXXXXXXXPFPD-PYFPDIRHGVPQKLLTLEVGPMREWDSRQASGSGTPDAIQPK 873
                         PF D  Y P    G PQ L   ++ P  EW SRQ SG+ TP+A+ PK
Sbjct: 223  SPGSAISVSGTSSPFLDREYTP----GRPQFLNLEKIAP-HEWGSRQGSGTLTPEAVNPK 277

Query: 872  SRGHQVLLDRQDSVVSALPHLPNGHRNNDLLIDHRVSFELTSEDVVRCVEKKPMALPK-- 699
               +  LL+ Q+S V  LP   NG +N+  ++DHRVSFE+T+EDVVRCVEKKP  + +  
Sbjct: 278  YHDN-FLLNYQNSGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTMMMRTG 336

Query: 698  --ALQNPDERDYNANELVDDSAS------------STGTSEKGQTNIEGQRHQKQRAASL 561
              +LQ+ +        L + S                G+S  G+   +GQR QK R+ +L
Sbjct: 337  SVSLQDTERSTKRQENLAEMSNGHDHGGHEPSREIHEGSSTDGE---DGQRQQKHRSITL 393

Query: 560  GSIKEFNFDHVNGLDSDKPCINSNWWANEKVIGNEDGPCKNWSFFPAMQPGVS 402
            GS KEFNFD+V+G   DK  I S+WWANEKV+G E  PC NW  FP MQPGVS
Sbjct: 394  GSSKEFNFDNVDGGYPDKATIGSDWWANEKVLGKE--PCNNW-IFPMMQPGVS 443


>ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum
            lycopersicum]
          Length = 443

 Score =  328 bits (841), Expect = 4e-87
 Identities = 196/411 (47%), Positives = 243/411 (59%), Gaps = 15/411 (3%)
 Frame = -1

Query: 1589 MYWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPEN-STQPPAMRIXXXXXXXXXXXXX 1413
            MYWCFGSQK +KR GHA  +PE   T       P  N S+Q P++ +             
Sbjct: 48   MYWCFGSQKQTKRIGHAVFIPE---TTASAADRPSSNTSSQAPSIVLPFIAPPSSPASFL 104

Query: 1412 XXXXXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPST 1233
                     SP   VG   LS + YSP GP SIFAIGPYAHE QLV+PPVFS +TTEPST
Sbjct: 105  PSEPPSATHSP---VGSKCLSMSTYSPSGPASIFAIGPYAHETQLVSPPVFSAFTTEPST 161

Query: 1232 APFTPPPESVHLTTPSSPEVPFARLLDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPMGPL 1053
            APFTPPPESVHLTTPSSPEVPFA+LLDP + N   G R+P + ++FQSY L PGSP+  L
Sbjct: 162  APFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQPGSPVSNL 221

Query: 1052 IXXXXXXXXXXXXXPFPD-PYFPDIRHGVPQKLLTLEVGPMREWDSRQASGSGTPDAIQP 876
            I             PF +  Y P    G PQ L   ++ P  EW SRQ SG+ TP+A+ P
Sbjct: 222  ISPGSAISVSGTSSPFLEREYTP----GRPQFLNLEKIAP-HEWGSRQGSGTLTPEAVNP 276

Query: 875  KSRGHQVLLDRQDSVVSALPHLPNGHRNNDLLIDHRVSFELTSEDVVRCVEKKPMALPK- 699
            K      LL+ Q++ V  LP   NG +N+  ++DHRVSFE+T+EDVVRCVEKKP  + + 
Sbjct: 277  KYH-DSFLLNYQNTGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTMMMRT 335

Query: 698  ---ALQNPDERDYNANELVDDSASSTGTSEKGQTNI---------EGQRHQKQRAASLGS 555
               +LQ+ +        L + S +   +  +    I         +GQR QK R+ +LGS
Sbjct: 336  GSVSLQDTERSTKRQENLAEMSNAHDHSGHEPSREIHEGSSTDGEDGQRQQKHRSITLGS 395

Query: 554  IKEFNFDHVNGLDSDKPCINSNWWANEKVIGNEDGPCKNWSFFPAMQPGVS 402
             KEFNFD+V+G   DK  I S+WWANEKV+G E  PC NW  FP MQPGVS
Sbjct: 396  SKEFNFDNVDGGYPDKATIGSDWWANEKVLGKE--PCNNW-IFPMMQPGVS 443


>gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao]
          Length = 489

 Score =  316 bits (810), Expect = 2e-83
 Identities = 193/448 (43%), Positives = 242/448 (54%), Gaps = 52/448 (11%)
 Frame = -1

Query: 1589 MYWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXX 1410
            +YWCFGSQK SKR GHA LVPEP   + G  ++  EN + P  + +              
Sbjct: 45   LYWCFGSQKNSKRIGHAVLVPEP--VVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQ 102

Query: 1409 XXXXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPSTA 1230
                   QSPA  + L SLS N YSP GP SIFAIGPYAHE QLVTPPVFS  TTEPSTA
Sbjct: 103  SDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTA 162

Query: 1229 PFTPPPESVHLTTPSSPEVPFARL----LDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPM 1062
            PFTPPPESV LTTPSSPEVPFA+L    L+    N  +  +F  SH++FQSY ++PGSP 
Sbjct: 163  PFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPG 222

Query: 1061 GPLIXXXXXXXXXXXXXPFPDPY-FPDIRHGVPQKLLTLEVGPMREWDSRQASGSGTPDA 885
            G LI             PFPD     + R G   KLL  E    R+W SR  SGS TPD 
Sbjct: 223  GNLISPGSAISNSGTSSPFPDRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDG 282

Query: 884  IQPKSR-------------GHQV------------------LLDRQDSVVSALPHLPNGH 798
            +   SR             G ++                  L+  Q S V+ L +  NG 
Sbjct: 283  LGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGP 342

Query: 797  RNNDLLIDHRVSFELTSEDVVRCVEKKPMALPKAL---------QNPDERDYNANEL--- 654
            +N++ ++DHRVSFEL+ EDV  C+E K +   +A+         +   ERD    +L   
Sbjct: 343  KNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESS 402

Query: 653  ----VDDSASSTGTSEKGQTNIEGQRHQKQRAASLGSIKEFNFDHVNGLDSDKPCINSNW 486
                + ++++ T     G+   E   +QK R+ +LGSIKEFNFD+  G  SDKP I S W
Sbjct: 403  CELFIRETSNETVEKASGEAE-EEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEW 461

Query: 485  WANEKVIGNEDGPCKNWSFFPAMQPGVS 402
            WANEKV G E  P  +W+FFP +QP VS
Sbjct: 462  WANEKVAGKEARPGNSWTFFPMLQPEVS 489


>gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
          Length = 485

 Score =  316 bits (810), Expect = 2e-83
 Identities = 193/448 (43%), Positives = 242/448 (54%), Gaps = 52/448 (11%)
 Frame = -1

Query: 1589 MYWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXX 1410
            +YWCFGSQK SKR GHA LVPEP   + G  ++  EN + P  + +              
Sbjct: 41   LYWCFGSQKNSKRIGHAVLVPEP--VVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQ 98

Query: 1409 XXXXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPSTA 1230
                   QSPA  + L SLS N YSP GP SIFAIGPYAHE QLVTPPVFS  TTEPSTA
Sbjct: 99   SDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTA 158

Query: 1229 PFTPPPESVHLTTPSSPEVPFARL----LDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPM 1062
            PFTPPPESV LTTPSSPEVPFA+L    L+    N  +  +F  SH++FQSY ++PGSP 
Sbjct: 159  PFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPG 218

Query: 1061 GPLIXXXXXXXXXXXXXPFPDPY-FPDIRHGVPQKLLTLEVGPMREWDSRQASGSGTPDA 885
            G LI             PFPD     + R G   KLL  E    R+W SR  SGS TPD 
Sbjct: 219  GNLISPGSAISNSGTSSPFPDRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDG 278

Query: 884  IQPKSR-------------GHQV------------------LLDRQDSVVSALPHLPNGH 798
            +   SR             G ++                  L+  Q S V+ L +  NG 
Sbjct: 279  LGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGP 338

Query: 797  RNNDLLIDHRVSFELTSEDVVRCVEKKPMALPKAL---------QNPDERDYNANEL--- 654
            +N++ ++DHRVSFEL+ EDV  C+E K +   +A+         +   ERD    +L   
Sbjct: 339  KNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESS 398

Query: 653  ----VDDSASSTGTSEKGQTNIEGQRHQKQRAASLGSIKEFNFDHVNGLDSDKPCINSNW 486
                + ++++ T     G+   E   +QK R+ +LGSIKEFNFD+  G  SDKP I S W
Sbjct: 399  CELFIRETSNETVEKASGEAE-EEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEW 457

Query: 485  WANEKVIGNEDGPCKNWSFFPAMQPGVS 402
            WANEKV G E  P  +W+FFP +QP VS
Sbjct: 458  WANEKVAGKEARPGNSWTFFPMLQPEVS 485


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
            gi|223549721|gb|EEF51209.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 459

 Score =  316 bits (809), Expect = 2e-83
 Identities = 185/415 (44%), Positives = 246/415 (59%), Gaps = 20/415 (4%)
 Frame = -1

Query: 1589 MYWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPEN-STQPPAMRIXXXXXXXXXXXXX 1413
            +YWCFG  +  KR GHA LVPE +   N +  A  EN +TQ P + +             
Sbjct: 51   VYWCFGYHRHRKRIGHAVLVPENSAPGNDSSAA--ENPTTQAPTITLPFVAPPSSPASFL 108

Query: 1412 XXXXXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPST 1233
                    QSPA  + L S+S +MYSP GP SIFAIGPYAHE QLV+PP FST+TTEPST
Sbjct: 109  QSEPPSASQSPAGILSLTSVSASMYSPSGPASIFAIGPYAHETQLVSPPAFSTFTTEPST 168

Query: 1232 APFTPPPESVHLTTPSSPEVPFARLLDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPMGPL 1053
            APFTPPPESV LTTPSSPEVPFA+LL+P + NG+ G RFP S+++FQSY  +PGSP+G L
Sbjct: 169  APFTPPPESVQLTTPSSPEVPFAQLLEPSNRNGEAGLRFPFSNYEFQSYQFYPGSPVGQL 228

Query: 1052 IXXXXXXXXXXXXXPFPD-------PYFPDIRHGVPQKLLTLEVGPMREWDSRQASGSGT 894
            I             PFPD       P F + +  VP KLL L+   + E  SRQ SG+ T
Sbjct: 229  ISPSSGISGSGTSSPFPDGEFAAAGPRFLEFQMAVPPKLLNLDKLSVHECGSRQGSGTLT 288

Query: 893  PDAIQPKSRGHQVLLDRQDSVVSALPHLPNGHRNNDLLIDHRVSFELTSEDVVRCVEKKP 714
            PDA++  S      LDRQ S +++  H  N ++ +D + D RVSF+L++ED +R  E KP
Sbjct: 289  PDAVRATSCSFP--LDRQCSDIASNRHSDNENK-DDQVADLRVSFDLSAEDALRYAEPKP 345

Query: 713  MA----LPKALQN--PDERDYNANELVDDSASSTGTSEKG---QTNIEGQ---RHQKQRA 570
             +    +P++++N    E+   ++E+  +     G +  G   Q +  G+   RHQK R 
Sbjct: 346  ASPVKIMPESMKNEIAAEKVQKSSEIRHNFECRVGETSNGILEQASTGGEKTPRHQKHRT 405

Query: 569  ASLGSIKEFNFDHVNGLDSDKPCINSNWWANEKVIGNEDGPCKNWSFFPAMQPGV 405
             +LG+ KEFNFD+ +G+   KP    +WW N   +G ED   KNWSFFP MQP +
Sbjct: 406  LTLGTFKEFNFDNADGV--PKPSAGPDWWDNGSDVGKEDFTAKNWSFFPVMQPSI 458


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 422

 Score =  315 bits (808), Expect = 3e-83
 Identities = 194/420 (46%), Positives = 251/420 (59%), Gaps = 24/420 (5%)
 Frame = -1

Query: 1589 MYWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXX 1410
            +YWCFG Q+  KR GHA ++PE   T  G      EN TQ  ++ +              
Sbjct: 12   VYWCFGFQRHRKRIGHAVILPET--TSPGHNDPRAENLTQASSIVLPFAAPPSSPASFLQ 69

Query: 1409 XXXXXXLQSPAASVGLN-SLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPST 1233
                  +QSP    G N SLS +MYSPG P SIFAIGPYAHE QLV+PPVFST+TTEPST
Sbjct: 70   SEPPSAMQSP----GFNFSLSASMYSPG-PSSIFAIGPYAHETQLVSPPVFSTFTTEPST 124

Query: 1232 APFTPPPESVHLTTPSSPEVPFARLLDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPMGPL 1053
            APFTPP ESVHLT PSSPEVPFA+LLD     G+ G R+P SH++FQSY  +PGSP+G L
Sbjct: 125  APFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQL 184

Query: 1052 IXXXXXXXXXXXXXPFPDP-------YFPDIRHGVPQKLLTLEVGPMREWDSRQASGSGT 894
            I             PF D        +F + R G   K+L L++   R+W SR  SGS T
Sbjct: 185  ISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGSRLCSGSVT 244

Query: 893  PDAIQ-PKSRGHQVLLDRQDSVVSALPHLPNGHRNNDLLIDHRVSFELTSEDVVRCVEKK 717
            PDA +   S G  +     + V++A  +  +  RN+   I HRVSFEL++E+VVRCVEKK
Sbjct: 245  PDAAKSTSSEGFTLKPYTPEGVLNARSN--SRRRNDGASIGHRVSFELSAEEVVRCVEKK 302

Query: 716  PMALPKA----LQNPD--ERDYNANE---------LVDDSASSTGTSEKGQTNIEGQRHQ 582
            P+AL +A    LQ+ +  ER+   N+         +VD S  S+  +  G       R+Q
Sbjct: 303  PVALAEAVSTSLQSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDAEELSYRYQ 362

Query: 581  KQRAASLGSIKEFNFDHVNGLDSDKPCINSNWWANEKVIGNEDGPCKNWSFFPAMQPGVS 402
            K+R+ +LGS KEFNFD+ +G DS    I+++WWANEKV+  E+G  KNWSFFP +QPG+S
Sbjct: 363  KERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGESKNWSFFPMIQPGMS 422


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 459

 Score =  315 bits (808), Expect = 3e-83
 Identities = 194/420 (46%), Positives = 251/420 (59%), Gaps = 24/420 (5%)
 Frame = -1

Query: 1589 MYWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXX 1410
            +YWCFG Q+  KR GHA ++PE   T  G      EN TQ  ++ +              
Sbjct: 49   VYWCFGFQRHRKRIGHAVILPET--TSPGHNDPRAENLTQASSIVLPFAAPPSSPASFLQ 106

Query: 1409 XXXXXXLQSPAASVGLN-SLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPST 1233
                  +QSP    G N SLS +MYSPG P SIFAIGPYAHE QLV+PPVFST+TTEPST
Sbjct: 107  SEPPSAMQSP----GFNFSLSASMYSPG-PSSIFAIGPYAHETQLVSPPVFSTFTTEPST 161

Query: 1232 APFTPPPESVHLTTPSSPEVPFARLLDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPMGPL 1053
            APFTPP ESVHLT PSSPEVPFA+LLD     G+ G R+P SH++FQSY  +PGSP+G L
Sbjct: 162  APFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQL 221

Query: 1052 IXXXXXXXXXXXXXPFPDP-------YFPDIRHGVPQKLLTLEVGPMREWDSRQASGSGT 894
            I             PF D        +F + R G   K+L L++   R+W SR  SGS T
Sbjct: 222  ISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGSRLCSGSVT 281

Query: 893  PDAIQ-PKSRGHQVLLDRQDSVVSALPHLPNGHRNNDLLIDHRVSFELTSEDVVRCVEKK 717
            PDA +   S G  +     + V++A  +  +  RN+   I HRVSFEL++E+VVRCVEKK
Sbjct: 282  PDAAKSTSSEGFTLKPYTPEGVLNARSN--SRRRNDGASIGHRVSFELSAEEVVRCVEKK 339

Query: 716  PMALPKA----LQNPD--ERDYNANE---------LVDDSASSTGTSEKGQTNIEGQRHQ 582
            P+AL +A    LQ+ +  ER+   N+         +VD S  S+  +  G       R+Q
Sbjct: 340  PVALAEAVSTSLQSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDAEELSYRYQ 399

Query: 581  KQRAASLGSIKEFNFDHVNGLDSDKPCINSNWWANEKVIGNEDGPCKNWSFFPAMQPGVS 402
            K+R+ +LGS KEFNFD+ +G DS    I+++WWANEKV+  E+G  KNWSFFP +QPG+S
Sbjct: 400  KERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGESKNWSFFPMIQPGMS 459


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
            gi|223547583|gb|EEF49078.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 510

 Score =  311 bits (796), Expect = 7e-82
 Identities = 195/461 (42%), Positives = 242/461 (52%), Gaps = 65/461 (14%)
 Frame = -1

Query: 1589 MYWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXX 1410
            +YWCFGS K +KR GHA L PEP   + G V+   EN +Q  A+ +              
Sbjct: 55   LYWCFGSHK-TKRIGHAVLAPEPE--VQGAVVTSAENQSQSTAITVPFIAPPSSPASFLQ 111

Query: 1409 XXXXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPSTA 1230
                   QSPA  + L SLS N YSPGGP SIFAIGPYAHE QLVTPP FS +TTEPSTA
Sbjct: 112  SDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLVTPPAFSAFTTEPSTA 171

Query: 1229 PFTPPPESVHLTTPSSPEVPFARL----LDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPM 1062
            PFTPPPESV LTTPSSPEVPFA+L    L+    N     +F  SH++FQSY L+PGSP 
Sbjct: 172  PFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALSHYEFQSYPLYPGSPG 231

Query: 1061 GPLIXXXXXXXXXXXXXPFPDPY-FPDIRHGVPQKLLTLEVGPMREWDSR---------- 915
            G LI             PFPD Y   + R G   KLL  E    R+W SR          
Sbjct: 232  GQLISPGSVISNSGTSSPFPDRYPILEFRMGEAPKLLGFEHFTTRKWGSRLGSGTVTPDG 291

Query: 914  --------------------------------------QASGSGTPDAIQPKSRGHQVLL 849
                                                    SGS TPDA+ P SR     L
Sbjct: 292  VGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGSGSLTPDAVGPASR-DGFFL 350

Query: 848  DRQDSVVSALPHLPNGHRNNDLLIDHRVSFELTSEDVVRCVEKKPMALPKALQN--PD-- 681
            + Q S V++L +  NG + ++ ++DHRVSFEL+ E+V RC+E K +A  +A     PD  
Sbjct: 351  ENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLESKSLASCRAFSECPPDSM 410

Query: 680  --ERDYNANELVDDSASSTG-----TSEKGQTNIEGQR-HQKQRAASLGSIKEFNFDHVN 525
              ++  +   L+ D    TG     T EK    +E +  ++K R+ +LGSIKEFNFD+  
Sbjct: 411  AEDQIKSGKMLMTDENLPTGETSGETPEKPSGEMEEEHCYRKHRSITLGSIKEFNFDNSK 470

Query: 524  GLDSDKPCINSNWWANEKVIGNEDGPCKNWSFFPAMQPGVS 402
             +  DKP INS WWANE + G E  P  NW+FFP +QP VS
Sbjct: 471  EV-PDKPSINSEWWANETIAGKEARPANNWTFFPLLQPEVS 510


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  308 bits (788), Expect = 6e-81
 Identities = 187/419 (44%), Positives = 224/419 (53%), Gaps = 23/419 (5%)
 Frame = -1

Query: 1589 MYWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXX 1410
            +YWCFGS + SKR GHA LVPEP   + G V    EN     ++ +              
Sbjct: 41   LYWCFGSHRHSKRIGHAVLVPEP--MVPGAVAPASENLNLSTSIVLPFIAPPSSPASFLQ 98

Query: 1409 XXXXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPSTA 1230
                   QSPA  + L +LS N YSP GP S+FAIGPYAHE QLV+PPVFST+ TEPSTA
Sbjct: 99   SDPPSSTQSPAGFLSLTALSVNAYSPSGPASMFAIGPYAHETQLVSPPVFSTFPTEPSTA 158

Query: 1229 PFTPPPESVHLTTPSSPEVPFARL----LDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPM 1062
            PFTPPPESV LTTPSSPEVPFA+L    LD    N     +   S+++FQ Y L+P SP+
Sbjct: 159  PFTPPPESVQLTTPSSPEVPFAQLLTSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPV 218

Query: 1061 GPLIXXXXXXXXXXXXXPFPDPYFPDIRHGV-PQKLLTLEVGPMREWDSRQASGSGTPDA 885
            G LI                   FPD R  V   KLL  E    R W SR  SGS TPD 
Sbjct: 219  GHLISPISNSGTSSP--------FPDRRPIVEAPKLLGFEHFSTRRWGSRLGSGSLTPDG 270

Query: 884  IQPKSRGHQVLLDRQDSVVSALPHLPNGHRNNDLLIDHRVSFELTSEDVVRCVEKKPMAL 705
              P SR    LL+ Q S V++L +  +G +N + +IDHRVSFEL  EDV  CVEKKP+A 
Sbjct: 271  AGPASR-DSFLLENQISEVASLANSESGSQNGETVIDHRVSFELAGEDVAVCVEKKPVAS 329

Query: 704  PKALQNP-----------------DERDYNANELVDDSASSTGTSEKGQTNIEGQRHQKQ 576
             + +QN                   E   N  E     A    + +      E Q H+K 
Sbjct: 330  AETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAASEKASAEGEEEQCHKKH 389

Query: 575  RAASLGSIKEFNFDHVNGLDSDKP-CINSNWWANEKVIGNEDGPCKNWSFFPAMQPGVS 402
                 GSIKEFNFD+  G  S KP  I S WW NEKV+G   GP  NW+FFP +QPG+S
Sbjct: 390  PPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGKGTGPQTNWTFFPLLQPGIS 448


>ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Populus trichocarpa]
            gi|222841936|gb|EEE79483.1| hypothetical protein
            POPTR_0003s12950g [Populus trichocarpa]
          Length = 441

 Score =  294 bits (752), Expect = 8e-77
 Identities = 181/403 (44%), Positives = 229/403 (56%), Gaps = 18/403 (4%)
 Frame = -1

Query: 1589 MYWCFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXX 1410
            +YWCFG QK  ++ GHA L PE +   +G   A  ENS Q P +                
Sbjct: 48   IYWCFGYQKSKRQIGHAVLFPESSAPGSGAPAA--ENSAQAPEVTFPFVAPPSSPASFFQ 105

Query: 1409 XXXXXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPSTA 1230
                   QSPA  V   S+S +MYSP GP SIFAIGPYAHE QLV+PPVFST+TTEPSTA
Sbjct: 106  SEPPSVTQSPAGLVSRTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTA 165

Query: 1229 PFTPPPESVHLTTPSSPEVPFARLLDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPMGPLI 1050
            PFTPPPESVHLTTPSSPEVPFA+L+DP   NG  G RFP   FDFQSY  HPGS +G LI
Sbjct: 166  PFTPPPESVHLTTPSSPEVPFAQLIDPTLRNGVTGLRFP---FDFQSYQFHPGSSVGQLI 222

Query: 1049 XXXXXXXXXXXXXPFPD-------PYFPDIRHGVPQKLLTLEVGPMREWDSRQASGSGTP 891
                         PFPD       P+ P+ R G   KLL L+    REW S Q SG+ TP
Sbjct: 223  SPSSGISGSGTSSPFPDGEFAVGGPHSPEFRMG--PKLLNLDKLSTREWGSYQDSGALTP 280

Query: 890  DAIQPKSRGHQVLLDRQDSVVSALPHLPNGHRNNDLLIDHRVSFELTSEDVVRCVEKKP- 714
            D+++  S     LL RQ S V++ P   NGH ++D +++HR SFEL+ +D  RCVE+KP 
Sbjct: 281  DSVRHGS--PNFLLHRQFSDVASHPRSENGH-DDDQVVNHRFSFELSVKDASRCVEEKPA 337

Query: 713  ---MALPKALQN--PDERDYNANELVD-----DSASSTGTSEKGQTNIEGQRHQKQRAAS 564
                 +P+ ++N    + + N  EL+         +S  T E   T+ E  +H+KQ+  +
Sbjct: 338  CSIKTVPEYVENGTKAKEEENYGELIQSFERRSGDTSNDTPETPSTDGEAPQHRKQQPIT 397

Query: 563  LGSIKEFNFDHVNGLDSDKPCINSNWWANEKVIGNEDGPCKNW 435
            LGS+ EFNFD+ +  DS  P  +SNW    +      GP   W
Sbjct: 398  LGSVNEFNFDNADEGDSHNPS-SSNWVKQPRT-----GPSSLW 434


>ref|XP_006580574.1| PREDICTED: uncharacterized protein LOC100791666 isoform X2 [Glycine
            max]
          Length = 441

 Score =  292 bits (748), Expect = 2e-76
 Identities = 181/418 (43%), Positives = 234/418 (55%), Gaps = 25/418 (5%)
 Frame = -1

Query: 1580 CFGSQKISKRFGHAALVPEPNPTINGTVIAPPENSTQPPAMRIXXXXXXXXXXXXXXXXX 1401
            CFG +K  KR GHA LVPEP  T NG   A   +S Q P++ +                 
Sbjct: 33   CFGYKKTRKRIGHAVLVPEP--TTNGADPAAAASSIQAPSITLPFVAPPSSPASFFQSEP 90

Query: 1400 XXXLQSPAASVGLNSLSPNMYSPGGPRSIFAIGPYAHEPQLVTPPVFSTYTTEPSTAPFT 1221
                QSP   V    +S ++YSPGGP SIFAIGPYAHE QLV+PPVFS      STAPFT
Sbjct: 91   PSTAQSPIGKVSHTCVSASIYSPGGPASIFAIGPYAHETQLVSPPVFSA----SSTAPFT 146

Query: 1220 PPPESVHLTTPSSPEVPFARLLDPIHGNGDVGPRFPRSHFDFQSYLLHPGSPMGPLIXXX 1041
            PPPESVH+TTPSSPEVPFA+LLDP + N +   RF  SH+DFQSY  HPGSP+G LI   
Sbjct: 147  PPPESVHMTTPSSPEVPFAQLLDPNNKNSETFQRFQISHYDFQSYQFHPGSPVGQLISPR 206

Query: 1040 XXXXXXXXXXPFPDPYFP-------DIRHGVPQKLLTLE--VGPMREWDSRQASGSGTPD 888
                      P PD  F        D +   P KLL L+  +       S   SGS TPD
Sbjct: 207  SAISVSGTSSPLPDSEFNATFAHILDFQRADPPKLLNLDNKLSSCENQKSNHGSGSLTPD 266

Query: 887  AIQPKSRGHQVLLDRQDSVVSALPHLPNGHRNNDLLIDHRVSFELTSEDVVRCVEKKPMA 708
            A +  ++    L +   S +   PH P+ +R N++ I+HRVSFEL+++ V++ +E KP A
Sbjct: 267  AARSTTQS-GFLSNHWVSEIKMSPH-PSNNRLNEISINHRVSFELSAQKVLKSLENKPAA 324

Query: 707  ------LPKALQN----PDERDYNANELVDDS--ASSTGTSEKGQTNIEGQR----HQKQ 576
                  LPK L+N     D+ + +    +DD    S     +  +T + G +    H+K 
Sbjct: 325  SAWTNVLPK-LKNDAPTTDKEEKSEESALDDKQVVSEAHNDQPLETTLGGDKATTVHEKD 383

Query: 575  RAASLGSIKEFNFDHVNGLDSDKPCINSNWWANEKVIGNEDGPCKNWSFFPAMQPGVS 402
            ++ +L S KEFNFD+ +G DS  P I ++WWANEKV G E    K+WSFFP +QPGVS
Sbjct: 384  QSLTLSSAKEFNFDNADGGDSLAPNIVADWWANEKVAGKEREASKDWSFFPMIQPGVS 441

