BLASTX nr result

ID: Rauwolfia21_contig00010303 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00010303
         (2144 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus pe...   511   e-142
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   502   e-139
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   501   e-139
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   500   e-138
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   498   e-138
gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [...   493   e-136
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   493   e-136
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   491   e-136
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   486   e-134
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     486   e-134
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   485   e-134
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   462   e-127
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   452   e-124
emb|CBI34651.3| unnamed protein product [Vitis vinifera]              440   e-120
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   423   e-115
gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein i...   417   e-114
gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein i...   415   e-113
ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Popu...   400   e-108
ref|XP_003525388.1| PREDICTED: uncharacterized protein LOC100791...   391   e-106
ref|XP_006580574.1| PREDICTED: uncharacterized protein LOC100791...   388   e-105

>gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  511 bits (1317), Expect = e-142
 Identities = 268/457 (58%), Positives = 314/457 (68%)
 Frame = -2

Query: 1906 GDMRGGGVNNPLXXXXXXXXXXXXXXNRPPHAAVQKRRWGGCWSLYWCFGSPKHRKRIGH 1727
            G+ R G  NN L              NR P A VQKRRWG  WS+YWCFG  +H+KRIGH
Sbjct: 6    GESRTG--NNALETINAAASAIAAAENRVPQATVQKRRWGSWWSMYWCFGFQRHKKRIGH 63

Query: 1726 AVLVPEPIVSSVDAPAAENPPQAGSVVLPFIXXXXXXXSFLQSEPPSATQSPAGGLSLTS 1547
            AVLVPE      DAP AENP Q  S+VLPF+       SFLQSEPPSATQSPAG  SLT 
Sbjct: 64   AVLVPETTDRGGDAPRAENPIQTPSIVLPFVAPPSSPASFLQSEPPSATQSPAGFFSLT- 122

Query: 1546 ISASMYSPGGPASMFAIGPYAHETQLVTPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPE 1367
              ASMYSP GP S+FAIGPYAHETQLV+PPVFSTFTTEPSTAPFTPPPESVH+TTPSSPE
Sbjct: 123  --ASMYSPSGPTSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPE 180

Query: 1366 VPFAKLLDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDG 1187
            VPFA+LLDP  +NGE G RFP S YEFQSYQL PGSPV  L              PFPD 
Sbjct: 181  VPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDL 240

Query: 1186 EFFSGRPHFLQFRSGDPPNLLNLEKMASHEWGSQQGSGTMTPDAGVHRSQNNFLLDHHHS 1007
            EF +   HFL+FR+GDPP LLNL+ +++ +WGS+ GSG++TPD     S + FLL     
Sbjct: 241  EFAARGHHFLEFRTGDPPKLLNLDILSTRDWGSRLGSGSVTPDGAKSTSSDGFLLKPQTP 300

Query: 1006 DSTPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPLALKKTVLSSSENAESVVKK 827
            +      S N  RN++  ++HRVSFE++ EEV+RCVEKKP+AL + V +S E+ E    K
Sbjct: 301  EVVLNPRSNNRGRNNDISINHRVSFELSSEEVIRCVEKKPVALAEAVSTSLEDTEKAQSK 360

Query: 826  EGSPKEMANGHACRVGETSNSASERASTDGEGGERHQKHRSITLGSAKEFNFDSVDGVNS 647
            E  P ++ +   C VGETSN A+E+A  DGE  + H K RSITLGS KEFNFD+ DG +S
Sbjct: 361  E-DPSKVVSSSICPVGETSNDAAEKAVADGEEAQLHPKQRSITLGSVKEFNFDNPDGGDS 419

Query: 646  DKPNIGSDWWANEKVLGKEIETGKNWTFFPVMQPGVS 536
               +IGSDWWANEKV  KE    KNW+FFP+MQPGVS
Sbjct: 420  GN-SIGSDWWANEKVDAKENGPTKNWSFFPMMQPGVS 455


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  502 bits (1292), Expect = e-139
 Identities = 266/459 (57%), Positives = 310/459 (67%), Gaps = 2/459 (0%)
 Frame = -2

Query: 1906 GDMRGGGVNNPLXXXXXXXXXXXXXXNRPPHAAVQKRRWGGCWSLYWCFGSPKHRKRIGH 1727
            GD R   +NN L              NR   A  QKRRWGGCWS+ WCFG  KHRKRIGH
Sbjct: 7    GDSRA--LNNSLETINAAATAIASAENRVHQATSQKRRWGGCWSISWCFGFQKHRKRIGH 64

Query: 1726 AVLVPEPIVSSVDAPAAENPPQAGSVVLPFIXXXXXXXSFLQSEPPSATQSPAGGLSLTS 1547
            AVLVPEP  S  +A  A N  QA ++ LPF+       SFLQSEPPSATQSPAG +SL S
Sbjct: 65   AVLVPEPTASRSNASEAVNSTQAAAISLPFVAPPSSPASFLQSEPPSATQSPAGLVSLNS 124

Query: 1546 ISASMYSPGGPASMFAIGPYAHETQLVTPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPE 1367
            IS +MYSPGGP+S+FAIGPYAHETQLV+PPVFSTFTTEPSTAPFTPPPESVH+TTPSSPE
Sbjct: 125  ISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPE 184

Query: 1366 VPFAKLLDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDG 1187
            VPFA+LLDP  + GE G +FPFS YEFQSY L PGSPV +L              PFPDG
Sbjct: 185  VPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPFPDG 244

Query: 1186 EFFSGRPHFLQFRSGDPPNLLNLEKMASHEWGSQQGSGTMTPDAGVHRSQNNFLLDHHHS 1007
            EF +  P F  F  GDPP LLNL+K++  EWGS+QGSGT+TPDA     +N F  +   S
Sbjct: 245  EFATAGPQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLTPDAVGSTPRNGFFQNRQIS 304

Query: 1006 DSTPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPLALKKTVLSSSENAESVVKK 827
            +     +S N  R D+ +VDHRVSFE+T E+VVRCVEKKP  L + V  S +N  +V K+
Sbjct: 305  EVALRPHSENGLRKDQ-IVDHRVSFELTTEDVVRCVEKKPTTLAEAVSESLQNGTTVEKE 363

Query: 826  EGSPKEMANGHACRVGETSNSASERASTDGEGGERHQKHRSITLGSAKEFNFDSVDGVNS 647
            E S +     H+C  GE +N    +   D E   RHQK +SITLGS KEFNFDS DG +S
Sbjct: 364  ESSGEAENVHHSC-AGEAANDEPLKTPVDVEEAPRHQKQQSITLGSTKEFNFDSADG-DS 421

Query: 646  DKPNIGSDWWANEKVLGKEIETGKNWTFFPVMQ--PGVS 536
             +P I SDWWANEKV+GK+    KNW FFPV+Q  PGVS
Sbjct: 422  HEPTIASDWWANEKVVGKDSGAIKNWAFFPVIQPAPGVS 460


>ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum]
          Length = 443

 Score =  501 bits (1290), Expect = e-139
 Identities = 261/430 (60%), Positives = 301/430 (70%)
 Frame = -2

Query: 1825 RPPHAAVQKRRWGGCWSLYWCFGSPKHRKRIGHAVLVPEPIVSSVDAPAAENPPQAGSVV 1646
            R P A++QKRRWGGCWS+YWCFGS K  KRIGHAV +PE   S  D P++    QA S+V
Sbjct: 31   RVPQASIQKRRWGGCWSMYWCFGSQKQTKRIGHAVFIPETTASGADRPSSNTSSQAPSIV 90

Query: 1645 LPFIXXXXXXXSFLQSEPPSATQSPAGGLSLTSISASMYSPGGPASMFAIGPYAHETQLV 1466
            LPFI       SFL SEPPSAT SP G   L   S S YSP GPAS+FAIGPYAHETQLV
Sbjct: 91   LPFIAPPSSPASFLPSEPPSATHSPVGSKCL---SMSTYSPSGPASIFAIGPYAHETQLV 147

Query: 1465 TPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPEVPFAKLLDPIHQNGEDGPRFPFSQYEF 1286
            +PPVFS FTTEPSTAPFTPPPESVH+TTPSSPEVPFAKLLDP +QN   G R+PF+QYEF
Sbjct: 148  SPPVFSAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEF 207

Query: 1285 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFFSGRPHFLQFRSGDPPNLLNLEKMA 1106
            QSYQLQPGSPVS+L              PF D E+  GRP FL           NLEK+A
Sbjct: 208  QSYQLQPGSPVSNLISPGSAISVSGTSSPFLDREYTPGRPQFL-----------NLEKIA 256

Query: 1105 SHEWGSQQGSGTMTPDAGVHRSQNNFLLDHHHSDSTPPSNSYNFWRNDETVVDHRVSFEI 926
             HEWGS+QGSGT+TP+A   +  +NFLL++ +S        +N W+ND TVVDHRVSFEI
Sbjct: 257  PHEWGSRQGSGTLTPEAVNPKYHDNFLLNYQNSGVHRLPKPFNGWKNDLTVVDHRVSFEI 316

Query: 925  TEEEVVRCVEKKPLALKKTVLSSSENAESVVKKEGSPKEMANGHACRVGETSNSASERAS 746
            T E+VVRCVEKKP  + +T   S ++ E   K++ +  EM+NGH     E S    E +S
Sbjct: 317  TAEDVVRCVEKKPTMMMRTGSVSLQDTERSTKRQENLAEMSNGHDHGGHEPSREIHEGSS 376

Query: 745  TDGEGGERHQKHRSITLGSAKEFNFDSVDGVNSDKPNIGSDWWANEKVLGKEIETGKNWT 566
            TDGE G+R QKHRSITLGS+KEFNFD+VDG   DK  IGSDWWANEKVLGK  E   NW 
Sbjct: 377  TDGEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLGK--EPCNNW- 433

Query: 565  FFPVMQPGVS 536
             FP+MQPGVS
Sbjct: 434  IFPMMQPGVS 443


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
            gi|557541785|gb|ESR52763.1| hypothetical protein
            CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  500 bits (1287), Expect = e-138
 Identities = 265/459 (57%), Positives = 310/459 (67%), Gaps = 2/459 (0%)
 Frame = -2

Query: 1906 GDMRGGGVNNPLXXXXXXXXXXXXXXNRPPHAAVQKRRWGGCWSLYWCFGSPKHRKRIGH 1727
            GD R   +NN L              NR   A  QKRRWGGCW++ WCFG  KHRKRIGH
Sbjct: 7    GDSRA--LNNSLETISAAATAIASAENRVHQATSQKRRWGGCWNISWCFGFQKHRKRIGH 64

Query: 1726 AVLVPEPIVSSVDAPAAENPPQAGSVVLPFIXXXXXXXSFLQSEPPSATQSPAGGLSLTS 1547
            AVLVPEP  S  +A  A N  QA ++ LPF+       SFLQSEPPSATQSPAG +SL S
Sbjct: 65   AVLVPEPTASRSNASEAVNSTQATAISLPFVAPPSSPASFLQSEPPSATQSPAGLVSLNS 124

Query: 1546 ISASMYSPGGPASMFAIGPYAHETQLVTPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPE 1367
            IS +MYSPGGP+S+FAIGPYAHETQLV+PPVFSTFTTEPSTAPFTPPPESVH+TTPSSPE
Sbjct: 125  ISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPE 184

Query: 1366 VPFAKLLDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDG 1187
            VPFA+LLDP  + GE G +FPFS YEFQSY L PGSPV +L              PFPDG
Sbjct: 185  VPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPFPDG 244

Query: 1186 EFFSGRPHFLQFRSGDPPNLLNLEKMASHEWGSQQGSGTMTPDAGVHRSQNNFLLDHHHS 1007
            EF +  P F  F  GDPP LLNL+K++  EWGS+QGSGT+TPDA     +N F  +   S
Sbjct: 245  EFATAGPQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLTPDAVRSTPRNGFFQNRQIS 304

Query: 1006 DSTPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPLALKKTVLSSSENAESVVKK 827
            +     +S N  R D+ +VDHRVSFE+T E+VVRCVEKKP  L + V  S +N  +V K+
Sbjct: 305  EVALRPHSENGLRKDQ-IVDHRVSFELTTEDVVRCVEKKPTTLAEAVSESLQNGTTVEKE 363

Query: 826  EGSPKEMANGHACRVGETSNSASERASTDGEGGERHQKHRSITLGSAKEFNFDSVDGVNS 647
            E S +     H+C  GE +N    +   D E   RHQK +SITLGS KEFNFDS DG +S
Sbjct: 364  ESSGEAENVHHSC-AGEAANDEPLKTPVDVEEAPRHQKQQSITLGSTKEFNFDSADG-DS 421

Query: 646  DKPNIGSDWWANEKVLGKEIETGKNWTFFPVMQ--PGVS 536
             +P I SDWWANEKV+GK+    KNW FFPV+Q  PGVS
Sbjct: 422  HEPTIASDWWANEKVVGKDSGAIKNWAFFPVIQPAPGVS 460


>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  498 bits (1283), Expect = e-138
 Identities = 268/451 (59%), Positives = 314/451 (69%), Gaps = 21/451 (4%)
 Frame = -2

Query: 1825 RPPHAAVQKRRWGGCWSLYWCFGSPKHRKRIGHAVLVPEPIVSSVDAPAAENPPQAGSVV 1646
            R P   VQKRRWG CW  YWCF SPK  KRIGHAVL PE        PAAEN  QA ++V
Sbjct: 31   RVPQPTVQKRRWGSCWGEYWCFRSPKD-KRIGHAVLAPESRAPGSGVPAAENLTQAPTIV 89

Query: 1645 LPFIXXXXXXXSFLQSEPPSATQSPAGGLSLTSISASMYSPGGPASMFAIGPYAHETQLV 1466
            LPF+       SFLQSEPPSATQSP+G LSLTSI+A++YSPGGPAS+FAIGPYAHETQLV
Sbjct: 90   LPFVAPPSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLV 149

Query: 1465 TPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPEVPFAKLLDPIHQNGEDGPRFPFSQYEF 1286
            +PPVFSTFTTEPSTAPFTPPPESVH+TTPSSPEVPFA+L DP ++NGE G RF  SQYEF
Sbjct: 150  SPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEF 209

Query: 1285 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFF-SGRPHFLQFRSGDPPNLLNLEKM 1109
            QSYQL PGSPV HL              PFPD +F  SG   FL+FR+G PP LL L+K+
Sbjct: 210  QSYQLYPGSPVGHLISPSSGISGSGTSSPFPDRDFVCSGSSQFLEFRAGGPPKLLTLDKL 269

Query: 1108 ASHEWGSQQGSGTMTPDAGVHRSQNNFLLDHHHSDST-PPSN-----------------S 983
            ++HEWGS+ GSG++TPDA    S++  +LD   SD   PPS                  S
Sbjct: 270  SNHEWGSRIGSGSITPDALGPPSRDGSVLDRQVSDVIHPPSGDDSVLDRQISDVASHSLS 329

Query: 982  YNFWRNDETVVDHRVSFEITEEEVVRCVEKKPLALKKTVLSSSENAESVVKKEGSPKEMA 803
             +   N+E +VDHRVSFE+T E+VVRCVEK   AL K V +S +N  +V   E S +E+ 
Sbjct: 330  DSGCPNNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAVSASLQNPATVEIDENS-REVV 388

Query: 802  NGHACRVGETSNSASERASTD--GEGGERHQKHRSITLGSAKEFNFDSVDGVNSDKPNIG 629
                 RVGET+N+  E+A  D  GE G+ H K RSITLGSAKEFNFD+ DG +SDKPNI 
Sbjct: 389  VDSEGRVGETANNPPEKAPEDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNIS 448

Query: 628  SDWWANEKVLGKEIETGKNWTFFPVMQPGVS 536
            SDWWANEKV+GKE+   KNW+ F +MQP VS
Sbjct: 449  SDWWANEKVVGKEVGASKNWSIFHMMQPSVS 479


>gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  493 bits (1269), Expect = e-136
 Identities = 263/450 (58%), Positives = 306/450 (68%), Gaps = 1/450 (0%)
 Frame = -2

Query: 1885 VNNPLXXXXXXXXXXXXXXNRPPHAAVQKRRWGGCWSLYWCFGSPKHRKRIGHAVLVPEP 1706
            +NN L              NR P A VQKRRWGGCWS+YWCFGS K +KRIG AVL  E 
Sbjct: 11   MNNTLETIHAAANAIASAENRVPQATVQKRRWGGCWSIYWCFGSYKQKKRIGPAVLTSET 70

Query: 1705 IVSSVDAPAAENPPQAGSVVLPFIXXXXXXXSFLQSEPPSATQSPAGGLSLTSISASMYS 1526
              S  + PAAENP QA ++ LPF+       SFL SEPPSATQSPAG +SLTSISASMYS
Sbjct: 71   SFSGANVPAAENPTQAPAIALPFVAPPSSPASFLPSEPPSATQSPAGLVSLTSISASMYS 130

Query: 1525 PGGPASMFAIGPYAHETQLVTPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPEVPFAKLL 1346
            PG PAS+FAIGPYAHETQLV+PPVFSTFTTEPSTAPFTPPPESVH+TTPSSPEVPFA+LL
Sbjct: 131  PG-PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLL 189

Query: 1345 DPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFFSGRP 1166
             P  Q GE   RFP S YEFQSYQL PGSPV  L              PF DGE F+   
Sbjct: 190  GPNLQYGEGVQRFPISHYEFQSYQLHPGSPVGQLISPSSGISGSGTSSPFRDGE-FAASL 248

Query: 1165 HFLQFRSGDPPNLLNLEKMASHEWGSQQGSGTMTPDAGVHRSQNNFLLDHHHSDSTP-PS 989
            HF +FR GDPP LLNL+K +S EWGS  GSGT+TPDA     +N FLLDH  S+ T  P 
Sbjct: 249  HFPEFRMGDPPKLLNLDKHSSCEWGSHHGSGTLTPDATRSTPRNGFLLDHQISEITSHPH 308

Query: 988  NSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPLALKKTVLSSSENAESVVKKEGSPKE 809
                  +ND+   +HRVSFE+T EEVVR +E +  A     +S S   E+  + E    +
Sbjct: 309  LKNKEVQNDQVAHNHRVSFELTTEEVVRSLEME-TATPSEAVSGSLQIEATRESEEHDTK 367

Query: 808  MANGHACRVGETSNSASERASTDGEGGERHQKHRSITLGSAKEFNFDSVDGVNSDKPNIG 629
            + + + CRVGETSN   E+A  D EG  +H KH+SITLGSAKEFNFD+VDG ++ KP + 
Sbjct: 368  VVDDYECRVGETSNERPEKALADREGKPQHHKHQSITLGSAKEFNFDNVDGGDAHKPILT 427

Query: 628  SDWWANEKVLGKEIETGKNWTFFPVMQPGV 539
            SDWWAN+KV GK     +NW+FFP+MQPGV
Sbjct: 428  SDWWANDKVAGKGGGVPRNWSFFPMMQPGV 457


>ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum
            lycopersicum]
          Length = 443

 Score =  493 bits (1268), Expect = e-136
 Identities = 256/430 (59%), Positives = 300/430 (69%)
 Frame = -2

Query: 1825 RPPHAAVQKRRWGGCWSLYWCFGSPKHRKRIGHAVLVPEPIVSSVDAPAAENPPQAGSVV 1646
            R P A++QKRRWG CWS+YWCFGS K  KRIGHAV +PE   S+ D P++    QA S+V
Sbjct: 31   RVPQASIQKRRWGSCWSMYWCFGSQKQTKRIGHAVFIPETTASAADRPSSNTSSQAPSIV 90

Query: 1645 LPFIXXXXXXXSFLQSEPPSATQSPAGGLSLTSISASMYSPGGPASMFAIGPYAHETQLV 1466
            LPFI       SFL SEPPSAT SP G   L   S S YSP GPAS+FAIGPYAHETQLV
Sbjct: 91   LPFIAPPSSPASFLPSEPPSATHSPVGSKCL---SMSTYSPSGPASIFAIGPYAHETQLV 147

Query: 1465 TPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPEVPFAKLLDPIHQNGEDGPRFPFSQYEF 1286
            +PPVFS FTTEPSTAPFTPPPESVH+TTPSSPEVPFAKLLDP +QN   G R+PF+QYEF
Sbjct: 148  SPPVFSAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEF 207

Query: 1285 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFFSGRPHFLQFRSGDPPNLLNLEKMA 1106
            QSYQLQPGSPVS+L              PF + E+  GRP FL           NLEK+A
Sbjct: 208  QSYQLQPGSPVSNLISPGSAISVSGTSSPFLEREYTPGRPQFL-----------NLEKIA 256

Query: 1105 SHEWGSQQGSGTMTPDAGVHRSQNNFLLDHHHSDSTPPSNSYNFWRNDETVVDHRVSFEI 926
             HEWGS+QGSGT+TP+A   +  ++FLL++ ++        +N W+ND TVVDHRVSFEI
Sbjct: 257  PHEWGSRQGSGTLTPEAVNPKYHDSFLLNYQNTGVHRLPKPFNGWKNDLTVVDHRVSFEI 316

Query: 925  TEEEVVRCVEKKPLALKKTVLSSSENAESVVKKEGSPKEMANGHACRVGETSNSASERAS 746
            T E+VVRCVEKKP  + +T   S ++ E   K++ +  EM+N H     E S    E +S
Sbjct: 317  TAEDVVRCVEKKPTMMMRTGSVSLQDTERSTKRQENLAEMSNAHDHSGHEPSREIHEGSS 376

Query: 745  TDGEGGERHQKHRSITLGSAKEFNFDSVDGVNSDKPNIGSDWWANEKVLGKEIETGKNWT 566
            TDGE G+R QKHRSITLGS+KEFNFD+VDG   DK  IGSDWWANEKVLGK  E   NW 
Sbjct: 377  TDGEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLGK--EPCNNW- 433

Query: 565  FFPVMQPGVS 536
             FP+MQPGVS
Sbjct: 434  IFPMMQPGVS 443


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346901|gb|EEE82832.2| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  491 bits (1265), Expect = e-136
 Identities = 258/430 (60%), Positives = 301/430 (70%)
 Frame = -2

Query: 1825 RPPHAAVQKRRWGGCWSLYWCFGSPKHRKRIGHAVLVPEPIVSSVDAPAAENPPQAGSVV 1646
            R P A VQKRRWG CWS+Y CFG  KH+K+IGHAVL PEP      APA+ENP QA +V 
Sbjct: 31   RVPQATVQKRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVT 90

Query: 1645 LPFIXXXXXXXSFLQSEPPSATQSPAGGLSLTSISASMYSPGGPASMFAIGPYAHETQLV 1466
            LPF        SF QSEPPS TQSPAG +SLTSISASMYSP GPAS+FAIGPYAHETQLV
Sbjct: 91   LPFAAPPSSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLV 150

Query: 1465 TPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPEVPFAKLLDPIHQNGEDGPRFPFSQYEF 1286
            +PPVFSTFTTEPSTAPFTPPPESVH+TTPSSPEVPFA+ LDP  +NG+ G RFPF   +F
Sbjct: 151  SPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFPF---DF 207

Query: 1285 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFFSGRPHFLQFRSGDPPNLLNLEKMA 1106
            QSYQ  PGSPV  L              PFPDGEF  G  HF +FR G+PP LLNL+K++
Sbjct: 208  QSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKLS 267

Query: 1105 SHEWGSQQGSGTMTPDAGVHRSQNNFLLDHHHSDSTPPSNSYNFWRNDETVVDHRVSFEI 926
            + EWGS QGSG +TP++ V R   NFLL    SD      S N  +N + VV+HRVSFE+
Sbjct: 268  TCEWGSYQGSGALTPES-VRRGSPNFLLHRQFSDVPSRPRSGNGHKNGQ-VVNHRVSFEL 325

Query: 925  TEEEVVRCVEKKPLALKKTVLSSSENAESVVKKEGSPKEMANGHACRVGETSNSASERAS 746
            T E+  RCVE+KP    KTV    EN  +  K+E +  E      CRVG TSN + E AS
Sbjct: 326  TAEDASRCVEEKPAFSIKTVPEYVENG-TQAKEEKNSGESIQSFECRVGVTSNDSPEMAS 384

Query: 745  TDGEGGERHQKHRSITLGSAKEFNFDSVDGVNSDKPNIGSDWWANEKVLGKEIETGKNWT 566
            TDGE   +H+K +SITLGS KEFNFD+ D  +S KP+  S+WWAN  V+GKE ET KNW+
Sbjct: 385  TDGEAAPQHRKQQSITLGSVKEFNFDNADEGDSRKPS-SSNWWANGSVIGKEGETTKNWS 443

Query: 565  FFPVMQPGVS 536
            FFP++Q GVS
Sbjct: 444  FFPMVQSGVS 453


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
            gi|223549721|gb|EEF51209.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 459

 Score =  486 bits (1252), Expect = e-134
 Identities = 253/430 (58%), Positives = 301/430 (70%), Gaps = 1/430 (0%)
 Frame = -2

Query: 1825 RPPHAAVQKRRWGGCWSLYWCFGSPKHRKRIGHAVLVPEPIVSSVDAPAAENPP-QAGSV 1649
            R P A +QKRRWG CWS+YWCFG  +HRKRIGHAVLVPE      D+ AAENP  QA ++
Sbjct: 34   RVPQATIQKRRWGSCWSVYWCFGYHRHRKRIGHAVLVPENSAPGNDSSAAENPTTQAPTI 93

Query: 1648 VLPFIXXXXXXXSFLQSEPPSATQSPAGGLSLTSISASMYSPGGPASMFAIGPYAHETQL 1469
             LPF+       SFLQSEPPSA+QSPAG LSLTS+SASMYSP GPAS+FAIGPYAHETQL
Sbjct: 94   TLPFVAPPSSPASFLQSEPPSASQSPAGILSLTSVSASMYSPSGPASIFAIGPYAHETQL 153

Query: 1468 VTPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPEVPFAKLLDPIHQNGEDGPRFPFSQYE 1289
            V+PP FSTFTTEPSTAPFTPPPESV +TTPSSPEVPFA+LL+P ++NGE G RFPFS YE
Sbjct: 154  VSPPAFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLEPSNRNGEAGLRFPFSNYE 213

Query: 1288 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFFSGRPHFLQFRSGDPPNLLNLEKM 1109
            FQSYQ  PGSPV  L              PFPDGEF +  P FL+F+   PP LLNL+K+
Sbjct: 214  FQSYQFYPGSPVGQLISPSSGISGSGTSSPFPDGEFAAAGPRFLEFQMAVPPKLLNLDKL 273

Query: 1108 ASHEWGSQQGSGTMTPDAGVHRSQNNFLLDHHHSDSTPPSNSYNFWRNDETVVDHRVSFE 929
            + HE GS+QGSGT+TPDA V  +  +F LD   SD     +S N    D+ V D RVSF+
Sbjct: 274  SVHECGSRQGSGTLTPDA-VRATSCSFPLDRQCSDIASNRHSDN-ENKDDQVADLRVSFD 331

Query: 928  ITEEEVVRCVEKKPLALKKTVLSSSENAESVVKKEGSPKEMANGHACRVGETSNSASERA 749
            ++ E+ +R  E KP +  K +  S +N E   +K     E+ +   CRVGETSN   E+A
Sbjct: 332  LSAEDALRYAEPKPASPVKIMPESMKN-EIAAEKVQKSSEIRHNFECRVGETSNGILEQA 390

Query: 748  STDGEGGERHQKHRSITLGSAKEFNFDSVDGVNSDKPNIGSDWWANEKVLGKEIETGKNW 569
            ST GE   RHQKHR++TLG+ KEFNFD+ DGV   KP+ G DWW N   +GKE  T KNW
Sbjct: 391  STGGEKTPRHQKHRTLTLGTFKEFNFDNADGV--PKPSAGPDWWDNGSDVGKEDFTAKNW 448

Query: 568  TFFPVMQPGV 539
            +FFPVMQP +
Sbjct: 449  SFFPVMQPSI 458


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  486 bits (1250), Expect = e-134
 Identities = 256/459 (55%), Positives = 306/459 (66%), Gaps = 6/459 (1%)
 Frame = -2

Query: 1894 GGG----VNNPLXXXXXXXXXXXXXXNRPPHAAVQKRRWGGCWSLYWCFGSPKHRKRIGH 1727
            GGG    +NN L              NR P A V+KRRWGGC S+YWCFG+PK+R RIGH
Sbjct: 6    GGGDSRTMNNALETINAAATAIAMAENRVPQATVRKRRWGGCLSIYWCFGTPKNRTRIGH 65

Query: 1726 AVLVPEPIVSSVDAPAAENPPQAGSVVLPFIXXXXXXXSFLQSEPPSATQSPAGGLSLTS 1547
             VLVPE       AP AEN  Q  +V+LPFI       SFLQSEPPSATQSPAG LSLTS
Sbjct: 66   GVLVPETAQPGNSAPRAENSTQTHAVILPFIAPPSSPASFLQSEPPSATQSPAGLLSLTS 125

Query: 1546 ISASMYSPGGPASMFAIGPYAHETQLVTPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPE 1367
            +SASMYSPGGPAS+FAIGPYAHETQLV+PPVFSTFTTEPSTAPFTPPPESVH+TTPSSPE
Sbjct: 126  VSASMYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPE 185

Query: 1366 VPFAKLLDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDG 1187
            VPFA+LLDP   NGE G RFP    EFQSY  QPGSP+  L              PFPD 
Sbjct: 186  VPFAQLLDPNIHNGEPGQRFPIFHNEFQSYYFQPGSPIGQLISPSSGISGSGTSSPFPDP 245

Query: 1186 EFFSGRPHFLQFRSGDPPNLLNLEKMASHEWGSQQGSGTMTPDAGVHRSQNNFLLDHHHS 1007
            EF +  PHFL+FR+GDPP LLNL+K++  +WGS+QGSG++TPD+    S           
Sbjct: 246  EFAARGPHFLEFRTGDPPKLLNLDKLSKFDWGSRQGSGSLTPDSVKPIST---------F 296

Query: 1006 DSTPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPLALKKTVLSSSENAESVVKK 827
            +  P        RN E V D RVSF+++ E+V+R VEKK + L + +L+S ++     ++
Sbjct: 297  EVAPHLKPNGRCRNAENVADRRVSFDVSTEDVIRYVEKKTVPLAEAMLTSLKDTTMGQRE 356

Query: 826  EGSPKEMANGHAC--RVGETSNSASERASTDGEGGERHQKHRSITLGSAKEFNFDSVDGV 653
            E S         C  RVGETSN   ++A T GE   +HQKHRSITLGS+KEFNFD+ D  
Sbjct: 357  ENSDSNKVEEIGCENRVGETSNEEPDKAPTSGEEVLQHQKHRSITLGSSKEFNFDNADAG 416

Query: 652  NSDKPNIGSDWWANEKVLGKEIETGKNWTFFPVMQPGVS 536
            +  K +  SDWWAN+KV GKE    +NW+FFP++QPGVS
Sbjct: 417  DLHKSDSVSDWWANQKVAGKEGAPSQNWSFFPMIQPGVS 455


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346902|gb|ERP65330.1| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  485 bits (1248), Expect = e-134
 Identities = 257/430 (59%), Positives = 300/430 (69%)
 Frame = -2

Query: 1825 RPPHAAVQKRRWGGCWSLYWCFGSPKHRKRIGHAVLVPEPIVSSVDAPAAENPPQAGSVV 1646
            R P A VQ RRWG CWS+Y CFG  KH+K+IGHAVL PEP      APA+ENP QA +V 
Sbjct: 31   RVPQATVQ-RRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVT 89

Query: 1645 LPFIXXXXXXXSFLQSEPPSATQSPAGGLSLTSISASMYSPGGPASMFAIGPYAHETQLV 1466
            LPF        SF QSEPPS TQSPAG +SLTSISASMYSP GPAS+FAIGPYAHETQLV
Sbjct: 90   LPFAAPPSSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLV 149

Query: 1465 TPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPEVPFAKLLDPIHQNGEDGPRFPFSQYEF 1286
            +PPVFSTFTTEPSTAPFTPPPESVH+TTPSSPEVPFA+ LDP  +NG+ G RFPF   +F
Sbjct: 150  SPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFPF---DF 206

Query: 1285 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFFSGRPHFLQFRSGDPPNLLNLEKMA 1106
            QSYQ  PGSPV  L              PFPDGEF  G  HF +FR G+PP LLNL+K++
Sbjct: 207  QSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKLS 266

Query: 1105 SHEWGSQQGSGTMTPDAGVHRSQNNFLLDHHHSDSTPPSNSYNFWRNDETVVDHRVSFEI 926
            + EWGS QGSG +TP++ V R   NFLL    SD      S N  +N + VV+HRVSFE+
Sbjct: 267  TCEWGSYQGSGALTPES-VRRGSPNFLLHRQFSDVPSRPRSGNGHKNGQ-VVNHRVSFEL 324

Query: 925  TEEEVVRCVEKKPLALKKTVLSSSENAESVVKKEGSPKEMANGHACRVGETSNSASERAS 746
            T E+  RCVE+KP    KTV    EN  +  K+E +  E      CRVG TSN + E AS
Sbjct: 325  TAEDASRCVEEKPAFSIKTVPEYVENG-TQAKEEKNSGESIQSFECRVGVTSNDSPEMAS 383

Query: 745  TDGEGGERHQKHRSITLGSAKEFNFDSVDGVNSDKPNIGSDWWANEKVLGKEIETGKNWT 566
            TDGE   +H+K +SITLGS KEFNFD+ D  +S KP+  S+WWAN  V+GKE ET KNW+
Sbjct: 384  TDGEAAPQHRKQQSITLGSVKEFNFDNADEGDSRKPS-SSNWWANGSVIGKEGETTKNWS 442

Query: 565  FFPVMQPGVS 536
            FFP++Q GVS
Sbjct: 443  FFPMVQSGVS 452


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 459

 Score =  462 bits (1190), Expect = e-127
 Identities = 247/461 (53%), Positives = 305/461 (66%), Gaps = 3/461 (0%)
 Frame = -2

Query: 1909 RGDMRGGGVNNPLXXXXXXXXXXXXXXNRPPHAAVQKRRWGGCWSLYWCFGSPKHRKRIG 1730
            R  + GG  NN L              +R P A VQKRRW   W +YWCFG  +HRKRIG
Sbjct: 4    RRGVNGGDGNNALDTINAAASAIAAAESRVPQATVQKRRWAKGWGVYWCFGFQRHRKRIG 63

Query: 1729 HAVLVPEPIVSSVDAPAAENPPQAGSVVLPFIXXXXXXXSFLQSEPPSATQSPAGGLSLT 1550
            HAV++PE      + P AEN  QA S+VLPF        SFLQSEPPSA QSP    SL 
Sbjct: 64   HAVILPETTSPGHNDPRAENLTQASSIVLPFAAPPSSPASFLQSEPPSAMQSPGFNFSL- 122

Query: 1549 SISASMYSPGGPASMFAIGPYAHETQLVTPPVFSTFTTEPSTAPFTPPPESVHMTTPSSP 1370
              SASMYSPG P+S+FAIGPYAHETQLV+PPVFSTFTTEPSTAPFTPP ESVH+T PSSP
Sbjct: 123  --SASMYSPG-PSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPAESVHLTRPSSP 179

Query: 1369 EVPFAKLLDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPD 1190
            EVPFA+LLD   + GE G R+P S YEFQSYQ  PGSPV  L              PF D
Sbjct: 180  EVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPFLD 239

Query: 1189 GEFFSGRPHFLQFRSGDPPNLLNLEKMASHEWGSQQGSGTMTPDAGVHRSQNNFLLDHHH 1010
             EF SG  HFL+FR+G+ P +LNL+ + + +WGS+  SG++TPDA    S   F L  + 
Sbjct: 240  SEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGSRLCSGSVTPDAAKSTSSEGFTLKPYT 299

Query: 1009 SDSTPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPLALKKTVLSSSENAESVVK 830
             +    + S +  RND   + HRVSFE++ EEVVRCVEKKP+AL + V +S ++AE   +
Sbjct: 300  PEGVLNARSNSRRRNDGASIGHRVSFELSAEEVVRCVEKKPVALAEAVSTSLQSAEKAER 359

Query: 829  KEGSPKEMANGHACRVGETSNSASERASTDGEGGE---RHQKHRSITLGSAKEFNFDSVD 659
            +EG  +E+++ H C V +TSN +SE+A   G+  E   R+QK RSITLGSAKEFNFD+ D
Sbjct: 360  EEGPNQEVSSSHECPVVDTSNDSSEKA-VGGDAEELSYRYQKERSITLGSAKEFNFDNAD 418

Query: 658  GVNSDKPNIGSDWWANEKVLGKEIETGKNWTFFPVMQPGVS 536
            G +S   +I +DWWANEKV+ KE    KNW+FFP++QPG+S
Sbjct: 419  GGDSGTSSISTDWWANEKVVLKENGESKNWSFFPMIQPGMS 459


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 422

 Score =  452 bits (1162), Expect = e-124
 Identities = 237/427 (55%), Positives = 294/427 (68%), Gaps = 3/427 (0%)
 Frame = -2

Query: 1807 VQKRRWGGCWSLYWCFGSPKHRKRIGHAVLVPEPIVSSVDAPAAENPPQAGSVVLPFIXX 1628
            +QKRRW   W +YWCFG  +HRKRIGHAV++PE      + P AEN  QA S+VLPF   
Sbjct: 1    MQKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAENLTQASSIVLPFAAP 60

Query: 1627 XXXXXSFLQSEPPSATQSPAGGLSLTSISASMYSPGGPASMFAIGPYAHETQLVTPPVFS 1448
                 SFLQSEPPSA QSP    SL   SASMYSPG P+S+FAIGPYAHETQLV+PPVFS
Sbjct: 61   PSSPASFLQSEPPSAMQSPGFNFSL---SASMYSPG-PSSIFAIGPYAHETQLVSPPVFS 116

Query: 1447 TFTTEPSTAPFTPPPESVHMTTPSSPEVPFAKLLDPIHQNGEDGPRFPFSQYEFQSYQLQ 1268
            TFTTEPSTAPFTPP ESVH+T PSSPEVPFA+LLD   + GE G R+P S YEFQSYQ  
Sbjct: 117  TFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWY 176

Query: 1267 PGSPVSHLXXXXXXXXXXXXXXPFPDGEFFSGRPHFLQFRSGDPPNLLNLEKMASHEWGS 1088
            PGSPV  L              PF D EF SG  HFL+FR+G+ P +LNL+ + + +WGS
Sbjct: 177  PGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGS 236

Query: 1087 QQGSGTMTPDAGVHRSQNNFLLDHHHSDSTPPSNSYNFWRNDETVVDHRVSFEITEEEVV 908
            +  SG++TPDA    S   F L  +  +    + S +  RND   + HRVSFE++ EEVV
Sbjct: 237  RLCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVV 296

Query: 907  RCVEKKPLALKKTVLSSSENAESVVKKEGSPKEMANGHACRVGETSNSASERASTDGEGG 728
            RCVEKKP+AL + V +S ++AE   ++EG  +E+++ H C V +TSN +SE+A   G+  
Sbjct: 297  RCVEKKPVALAEAVSTSLQSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEKA-VGGDAE 355

Query: 727  E---RHQKHRSITLGSAKEFNFDSVDGVNSDKPNIGSDWWANEKVLGKEIETGKNWTFFP 557
            E   R+QK RSITLGSAKEFNFD+ DG +S   +I +DWWANEKV+ KE    KNW+FFP
Sbjct: 356  ELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGESKNWSFFP 415

Query: 556  VMQPGVS 536
            ++QPG+S
Sbjct: 416  MIQPGMS 422


>emb|CBI34651.3| unnamed protein product [Vitis vinifera]
          Length = 412

 Score =  440 bits (1132), Expect = e-120
 Identities = 243/432 (56%), Positives = 280/432 (64%), Gaps = 2/432 (0%)
 Frame = -2

Query: 1825 RPPHAAVQKRRWGGCWSLYWCFGSPKHRKRIGHAVLVPEPIVSSVDAPAAENPPQAGSVV 1646
            R P   VQKRRWG CW  YWCF SPK  KRIGHAVL PE        PAAEN  QA ++V
Sbjct: 31   RVPQPTVQKRRWGSCWGEYWCFRSPKD-KRIGHAVLAPESRAPGSGVPAAENLTQAPTIV 89

Query: 1645 LPFIXXXXXXXSFLQSEPPSATQSPAGGLSLTSISASMYSPGGPASMFAIGPYAHETQLV 1466
            LPF+       SFLQSEPPSATQSP+G LSLTSI+A++YSPGGPAS+FAIGPYAHETQLV
Sbjct: 90   LPFVAPPSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLV 149

Query: 1465 TPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPEVPFAKLLDPIHQNGEDGPRFPFSQYEF 1286
            +PPVFSTFTTEPSTAPFTPPPESVH+TTPSSPEVPFA+L DP ++NGE G RF  SQYEF
Sbjct: 150  SPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEF 209

Query: 1285 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFFSGRPHFLQFRSGDPPNLLNLEKMA 1106
            QSYQL PGSPV HL              PFPD                            
Sbjct: 210  QSYQLYPGSPVGHLISPSSGISGSGTSSPFPD---------------------------- 241

Query: 1105 SHEWGSQQGSGTMTPDAGVHRSQNNFLLDHHHSDSTPPSNSYNFWRNDETVVDHRVSFEI 926
                     SG++TPDA    S++  +LDH                N+E +VDHRVSFE+
Sbjct: 242  --------RSGSITPDALGPPSRDGSVLDHSGCP------------NNEIMVDHRVSFEL 281

Query: 925  TEEEVVRCVEKKPLALKKTVLSSSENAESVVKKEGSPKEMANGHACRVGETSNSASERAS 746
            T E+VVRCVEK   AL K V +S +N  +V   E S +E+      RVGET+N+  E+A 
Sbjct: 282  TAEDVVRCVEKDSAALVKAVSASLQNPATVEIDENS-REVVVDSEGRVGETANNPPEKAP 340

Query: 745  TD--GEGGERHQKHRSITLGSAKEFNFDSVDGVNSDKPNIGSDWWANEKVLGKEIETGKN 572
             D  GE G+ H K RSITLGSAKEFNFD+ DG +SDKPNI SDWWANEKV+GKE+   KN
Sbjct: 341  EDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGASKN 400

Query: 571  WTFFPVMQPGVS 536
            W+ F +MQP VS
Sbjct: 401  WSIFHMMQPSVS 412


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  423 bits (1087), Expect = e-115
 Identities = 235/459 (51%), Positives = 292/459 (63%), Gaps = 9/459 (1%)
 Frame = -2

Query: 1885 VNNPLXXXXXXXXXXXXXXNRPPHAAVQKRRWGGCWSLYWCFGSPKHRKRIGHAVLVPEP 1706
            VNN +              +R     VQKRRWG C SLYWCFGS +H KRIGHAVLVPEP
Sbjct: 4    VNNSVETINAAATAIVSAESRVQPTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLVPEP 63

Query: 1705 IVSSVDAPAAENPPQAGSVVLPFIXXXXXXXSFLQSEPPSATQSPAGGLSLTSISASMYS 1526
            +V    APA+EN   + S+VLPFI       SFLQS+PPS+TQSPAG LSLT++S + YS
Sbjct: 64   MVPGAVAPASENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYS 123

Query: 1525 PGGPASMFAIGPYAHETQLVTPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPEVPFAKL- 1349
            P GPASMFAIGPYAHETQLV+PPVFSTF TEPSTAPFTPPPESV +TTPSSPEVPFA+L 
Sbjct: 124  PSGPASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLL 183

Query: 1348 ---LDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFF 1178
               LD   +N     +   S YEFQ YQL P SPV HL              PFPD    
Sbjct: 184  TSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHL---ISPISNSGTSSPFPD---- 236

Query: 1177 SGRPHFLQFRSGDPPNLLNLEKMASHEWGSQQGSGTMTPDAGVHRSQNNFLLDHHHSDST 998
              RP        + P LL  E  ++  WGS+ GSG++TPD     S+++FLL++  S+  
Sbjct: 237  -RRPIV------EAPKLLGFEHFSTRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVA 289

Query: 997  PPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPLALKKTVLSSSEN--AESVVKKE 824
              +NS +  +N ETV+DHRVSFE+  E+V  CVEKKP+A  +TV ++ ++   E  +++E
Sbjct: 290  SLANSESGSQNGETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERE 349

Query: 823  GSPKEMANGHACR--VGETSNSASERASTDGEGGERHQKHRSITLGSAKEFNFDSVDGVN 650
                  +  + C   VGE   +ASE+AS +GE  + H+KH  I  GS KEFNFD+  G  
Sbjct: 350  RDGISESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEV 409

Query: 649  SDKPN-IGSDWWANEKVLGKEIETGKNWTFFPVMQPGVS 536
            S KPN IGS+WW NEKV+GK      NWTFFP++QPG+S
Sbjct: 410  SAKPNIIGSEWWVNEKVVGKGTGPQTNWTFFPLLQPGIS 448


>gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
          Length = 485

 Score =  417 bits (1073), Expect = e-114
 Identities = 229/462 (49%), Positives = 283/462 (61%), Gaps = 38/462 (8%)
 Frame = -2

Query: 1807 VQKRRWGGCWSLYWCFGSPKHRKRIGHAVLVPEPIVSSVDAPAAENPPQAGSVVLPFIXX 1628
            VQK+RWG CW LYWCFGS K+ KRIGHAVLVPEP+V       AEN      ++LPFI  
Sbjct: 30   VQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFIAP 89

Query: 1627 XXXXXSFLQSEPPSATQSPAGGLSLTSISASMYSPGGPASMFAIGPYAHETQLVTPPVFS 1448
                 SFLQS+PPSATQSPAG LSLTS+S + YSP GPAS+FAIGPYAHETQLVTPPVFS
Sbjct: 90   PSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFS 149

Query: 1447 TFTTEPSTAPFTPPPESVHMTTPSSPEVPFAKLL----DPIHQNGEDGPRFPFSQYEFQS 1280
              TTEPSTAPFTPPPESV +TTPSSPEVPFA+LL    +   +N     +F  S YEFQS
Sbjct: 150  ALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQS 209

Query: 1279 YQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFFSGRPHFLQFRSGDPPNLLNLEKMASH 1100
            YQ+ PGSP  +L              PFPD      R   L+FR G+ P LL  E   + 
Sbjct: 210  YQIYPGSPGGNLISPGSAISNSGTSSPFPD------RRPILEFRMGEAPKLLGFENFTTR 263

Query: 1099 EWGSQQGS----------------GTMTPDA-GV---------------HRSQNNFLLDH 1016
            +WGS+ GS                G++TPD  G+                 S++ FL+  
Sbjct: 264  KWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGS 323

Query: 1015 HHSDSTPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPLALKKTVLSSSEN--AE 842
              S+    +N  N  +NDET+VDHRVSFE++ E+V  C+E K L   + V    ++  AE
Sbjct: 324  QISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAE 383

Query: 841  SVVKKEGSPKEMANGHACRVGETSNSASERASTDGEGGERHQKHRSITLGSAKEFNFDSV 662
               +++G  K++ +     + ETSN   E+AS + E    +QKHRS+TLGS KEFNFD+ 
Sbjct: 384  GRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDNT 443

Query: 661  DGVNSDKPNIGSDWWANEKVLGKEIETGKNWTFFPVMQPGVS 536
             G  SDKP I S+WWANEKV GKE   G +WTFFP++QP VS
Sbjct: 444  KGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485


>gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao]
          Length = 489

 Score =  415 bits (1067), Expect = e-113
 Identities = 228/462 (49%), Positives = 282/462 (61%), Gaps = 38/462 (8%)
 Frame = -2

Query: 1807 VQKRRWGGCWSLYWCFGSPKHRKRIGHAVLVPEPIVSSVDAPAAENPPQAGSVVLPFIXX 1628
            V K+RWG CW LYWCFGS K+ KRIGHAVLVPEP+V       AEN      ++LPFI  
Sbjct: 34   VYKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFIAP 93

Query: 1627 XXXXXSFLQSEPPSATQSPAGGLSLTSISASMYSPGGPASMFAIGPYAHETQLVTPPVFS 1448
                 SFLQS+PPSATQSPAG LSLTS+S + YSP GPAS+FAIGPYAHETQLVTPPVFS
Sbjct: 94   PSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFS 153

Query: 1447 TFTTEPSTAPFTPPPESVHMTTPSSPEVPFAKLL----DPIHQNGEDGPRFPFSQYEFQS 1280
              TTEPSTAPFTPPPESV +TTPSSPEVPFA+LL    +   +N     +F  S YEFQS
Sbjct: 154  ALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQS 213

Query: 1279 YQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFFSGRPHFLQFRSGDPPNLLNLEKMASH 1100
            YQ+ PGSP  +L              PFPD      R   L+FR G+ P LL  E   + 
Sbjct: 214  YQIYPGSPGGNLISPGSAISNSGTSSPFPD------RRPILEFRMGEAPKLLGFENFTTR 267

Query: 1099 EWGSQQGS----------------GTMTPDA-GV---------------HRSQNNFLLDH 1016
            +WGS+ GS                G++TPD  G+                 S++ FL+  
Sbjct: 268  KWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGS 327

Query: 1015 HHSDSTPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPLALKKTVLSSSEN--AE 842
              S+    +N  N  +NDET+VDHRVSFE++ E+V  C+E K L   + V    ++  AE
Sbjct: 328  QISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAE 387

Query: 841  SVVKKEGSPKEMANGHACRVGETSNSASERASTDGEGGERHQKHRSITLGSAKEFNFDSV 662
               +++G  K++ +     + ETSN   E+AS + E    +QKHRS+TLGS KEFNFD+ 
Sbjct: 388  GRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDNT 447

Query: 661  DGVNSDKPNIGSDWWANEKVLGKEIETGKNWTFFPVMQPGVS 536
             G  SDKP I S+WWANEKV GKE   G +WTFFP++QP VS
Sbjct: 448  KGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489


>ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Populus trichocarpa]
            gi|222841936|gb|EEE79483.1| hypothetical protein
            POPTR_0003s12950g [Populus trichocarpa]
          Length = 441

 Score =  400 bits (1028), Expect = e-108
 Identities = 221/402 (54%), Positives = 262/402 (65%)
 Frame = -2

Query: 1825 RPPHAAVQKRRWGGCWSLYWCFGSPKHRKRIGHAVLVPEPIVSSVDAPAAENPPQAGSVV 1646
            R P A VQK+RW   WS+YWCFG  K +++IGHAVL PE       APAAEN  QA  V 
Sbjct: 31   RVPQAMVQKQRWRSHWSIYWCFGYQKSKRQIGHAVLFPESSAPGSGAPAAENSAQAPEVT 90

Query: 1645 LPFIXXXXXXXSFLQSEPPSATQSPAGGLSLTSISASMYSPGGPASMFAIGPYAHETQLV 1466
             PF+       SF QSEPPS TQSPAG +S TSISASMYSP GPAS+FAIGPYAHETQLV
Sbjct: 91   FPFVAPPSSPASFFQSEPPSVTQSPAGLVSRTSISASMYSPSGPASIFAIGPYAHETQLV 150

Query: 1465 TPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPEVPFAKLLDPIHQNGEDGPRFPFSQYEF 1286
            +PPVFSTFTTEPSTAPFTPPPESVH+TTPSSPEVPFA+L+DP  +NG  G RFPF   +F
Sbjct: 151  SPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLIDPTLRNGVTGLRFPF---DF 207

Query: 1285 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFFSGRPHFLQFRSGDPPNLLNLEKMA 1106
            QSYQ  PGS V  L              PFPDGEF  G PH  +FR G  P LLNL+K++
Sbjct: 208  QSYQFHPGSSVGQLISPSSGISGSGTSSPFPDGEFAVGGPHSPEFRMG--PKLLNLDKLS 265

Query: 1105 SHEWGSQQGSGTMTPDAGVHRSQNNFLLDHHHSDSTPPSNSYNFWRNDETVVDHRVSFEI 926
            + EWGS Q SG +TPD+  H S  NFLL    SD      S N   +D+ VV+HR SFE+
Sbjct: 266  TREWGSYQDSGALTPDSVRHGSP-NFLLHRQFSDVASHPRSEN-GHDDDQVVNHRFSFEL 323

Query: 925  TEEEVVRCVEKKPLALKKTVLSSSENAESVVKKEGSPKEMANGHACRVGETSNSASERAS 746
            + ++  RCVE+KP    KTV    EN  +  K+E +  E+      R G+TSN   E  S
Sbjct: 324  SVKDASRCVEEKPACSIKTVPEYVENG-TKAKEEENYGELIQSFERRSGDTSNDTPETPS 382

Query: 745  TDGEGGERHQKHRSITLGSAKEFNFDSVDGVNSDKPNIGSDW 620
            TDGE   +H+K + ITLGS  EFNFD+ D  +S  P+  S+W
Sbjct: 383  TDGE-APQHRKQQPITLGSVNEFNFDNADEGDSHNPS-SSNW 422


>ref|XP_003525388.1| PREDICTED: uncharacterized protein LOC100791666 isoform X1 [Glycine
            max]
          Length = 461

 Score =  391 bits (1004), Expect = e-106
 Identities = 222/437 (50%), Positives = 273/437 (62%), Gaps = 7/437 (1%)
 Frame = -2

Query: 1825 RPPHAAVQKRRWGGCWSLYWCFGSPKHRKRIGHAVLVPEPIVSSVDAPAAENPPQAGSVV 1646
            R    + QK+RWG       CFG  K RKRIGHAVLVPEP  +  D  AA +  QA S+ 
Sbjct: 33   RVSQPSTQKKRWGSWLGKIGCFGYKKTRKRIGHAVLVPEPTTNGADPAAAASSIQAPSIT 92

Query: 1645 LPFIXXXXXXXSFLQSEPPSATQSPAGGLSLTSISASMYSPGGPASMFAIGPYAHETQLV 1466
            LPF+       SF QSEPPS  QSP G +S T +SAS+YSPGGPAS+FAIGPYAHETQLV
Sbjct: 93   LPFVAPPSSPASFFQSEPPSTAQSPIGKVSHTCVSASIYSPGGPASIFAIGPYAHETQLV 152

Query: 1465 TPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPEVPFAKLLDPIHQNGEDGPRFPFSQYEF 1286
            +PPVFS      STAPFTPPPESVHMTTPSSPEVPFA+LLDP ++N E   RF  S Y+F
Sbjct: 153  SPPVFSA----SSTAPFTPPPESVHMTTPSSPEVPFAQLLDPNNKNSETFQRFQISHYDF 208

Query: 1285 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFFSGRPHFLQFRSGDPPNLLNLE-KM 1109
            QSYQ  PGSPV  L              P PD EF +   H L F+  DPP LLNL+ K+
Sbjct: 209  QSYQFHPGSPVGQLISPRSAISVSGTSSPLPDSEFNATFAHILDFQRADPPKLLNLDNKL 268

Query: 1108 ASHE-WGSQQGSGTMTPDAGVHRSQNNFLLDHHHSD---STPPSNSYNFWRNDETVVDHR 941
            +S E   S  GSG++TPDA    +Q+ FL +H  S+   S  PSN+    R +E  ++HR
Sbjct: 269  SSCENQKSNHGSGSLTPDAARSTTQSGFLSNHWVSEIKMSPHPSNN----RLNEISINHR 324

Query: 940  VSFEITEEEVVRCVEKKPLALKKT-VLSSSENAESVVKKEGSPKEMANGHACRVGETSNS 764
            VSFE++ ++V++ +E KP A   T VL   +N      KE   +E A      V E  N 
Sbjct: 325  VSFELSAQKVLKSLENKPAASAWTNVLPKLKNDAPTTDKEEKSEESALDDKQVVSEAHND 384

Query: 763  ASERASTDGEGGER-HQKHRSITLGSAKEFNFDSVDGVNSDKPNIGSDWWANEKVLGKEI 587
                 +  G+     H+K +S+TL SAKEFNFD+ DG +S  PNI +DWWANEKV GKE 
Sbjct: 385  QPLETTLGGDKATTVHEKDQSLTLSSAKEFNFDNADGGDSLAPNIVADWWANEKVAGKER 444

Query: 586  ETGKNWTFFPVMQPGVS 536
            E  K+W+FFP++QPGVS
Sbjct: 445  EASKDWSFFPMIQPGVS 461


>ref|XP_006580574.1| PREDICTED: uncharacterized protein LOC100791666 isoform X2 [Glycine
            max]
          Length = 441

 Score =  388 bits (997), Expect = e-105
 Identities = 220/429 (51%), Positives = 270/429 (62%), Gaps = 7/429 (1%)
 Frame = -2

Query: 1801 KRRWGGCWSLYWCFGSPKHRKRIGHAVLVPEPIVSSVDAPAAENPPQAGSVVLPFIXXXX 1622
            K+RWG       CFG  K RKRIGHAVLVPEP  +  D  AA +  QA S+ LPF+    
Sbjct: 21   KKRWGSWLGKIGCFGYKKTRKRIGHAVLVPEPTTNGADPAAAASSIQAPSITLPFVAPPS 80

Query: 1621 XXXSFLQSEPPSATQSPAGGLSLTSISASMYSPGGPASMFAIGPYAHETQLVTPPVFSTF 1442
               SF QSEPPS  QSP G +S T +SAS+YSPGGPAS+FAIGPYAHETQLV+PPVFS  
Sbjct: 81   SPASFFQSEPPSTAQSPIGKVSHTCVSASIYSPGGPASIFAIGPYAHETQLVSPPVFSA- 139

Query: 1441 TTEPSTAPFTPPPESVHMTTPSSPEVPFAKLLDPIHQNGEDGPRFPFSQYEFQSYQLQPG 1262
                STAPFTPPPESVHMTTPSSPEVPFA+LLDP ++N E   RF  S Y+FQSYQ  PG
Sbjct: 140  ---SSTAPFTPPPESVHMTTPSSPEVPFAQLLDPNNKNSETFQRFQISHYDFQSYQFHPG 196

Query: 1261 SPVSHLXXXXXXXXXXXXXXPFPDGEFFSGRPHFLQFRSGDPPNLLNLE-KMASHE-WGS 1088
            SPV  L              P PD EF +   H L F+  DPP LLNL+ K++S E   S
Sbjct: 197  SPVGQLISPRSAISVSGTSSPLPDSEFNATFAHILDFQRADPPKLLNLDNKLSSCENQKS 256

Query: 1087 QQGSGTMTPDAGVHRSQNNFLLDHHHSD---STPPSNSYNFWRNDETVVDHRVSFEITEE 917
              GSG++TPDA    +Q+ FL +H  S+   S  PSN+    R +E  ++HRVSFE++ +
Sbjct: 257  NHGSGSLTPDAARSTTQSGFLSNHWVSEIKMSPHPSNN----RLNEISINHRVSFELSAQ 312

Query: 916  EVVRCVEKKPLALKKT-VLSSSENAESVVKKEGSPKEMANGHACRVGETSNSASERASTD 740
            +V++ +E KP A   T VL   +N      KE   +E A      V E  N      +  
Sbjct: 313  KVLKSLENKPAASAWTNVLPKLKNDAPTTDKEEKSEESALDDKQVVSEAHNDQPLETTLG 372

Query: 739  GEGGER-HQKHRSITLGSAKEFNFDSVDGVNSDKPNIGSDWWANEKVLGKEIETGKNWTF 563
            G+     H+K +S+TL SAKEFNFD+ DG +S  PNI +DWWANEKV GKE E  K+W+F
Sbjct: 373  GDKATTVHEKDQSLTLSSAKEFNFDNADGGDSLAPNIVADWWANEKVAGKEREASKDWSF 432

Query: 562  FPVMQPGVS 536
            FP++QPGVS
Sbjct: 433  FPMIQPGVS 441


Top