BLASTX nr result

ID: Atropa21_contig00011147

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00011147
         (1265 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   580   e-163
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   577   e-162
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   367   7e-99
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   360   5e-97
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   360   7e-97
gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus pe...   353   6e-95
emb|CBI34651.3| unnamed protein product [Vitis vinifera]              337   6e-90
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     328   3e-87
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   328   3e-87
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   328   3e-87
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   320   8e-85
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   320   8e-85
gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [...   319   1e-84
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   316   1e-83
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   311   3e-82
emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]   311   3e-82
gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein i...   306   9e-81
gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein i...   306   9e-81
ref|XP_003516706.1| PREDICTED: uncharacterized protein LOC100777...   285   4e-74
ref|XP_006580574.1| PREDICTED: uncharacterized protein LOC100791...   281   4e-73

>ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum]
          Length = 443

 Score =  580 bits (1496), Expect = e-163
 Identities = 283/319 (88%), Positives = 296/319 (92%), Gaps = 4/319 (1%)
 Frame = +1

Query: 1    SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
            SPSGPASIFAIGPYAHE QLVSPPVFS FTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL
Sbjct: 127  SPSGPASIFAIGPYAHETQLVSPPVFSAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 186

Query: 181  LDPNNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHESIAGR 360
            LDPN QNV AGHR+PF+QYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLD E   GR
Sbjct: 187  LDPNYQNVAAGHRYPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDREYTPGR 246

Query: 361  PQFLNLEKIAPHEWGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRLPKPYNGWKNDLT 540
            PQFLNLEKIAPHEWGSRQGSGTLTP+AV PKYHD+FLLN+QNSGV RLPKP+NGWKNDLT
Sbjct: 247  PQFLNLEKIAPHEWGSRQGSGTLTPEAVNPKYHDNFLLNYQNSGVHRLPKPFNGWKNDLT 306

Query: 541  VVDHRVSFEITAEDVVRCVEKKPTLMMRTGSMSLQDAERNTKREENLAEMSNGQEGDGH- 717
            VVDHRVSFEITAEDVVRCVEKKPT+MMRTGS+SLQD ER+TKR+ENLAEMSNG +  GH 
Sbjct: 307  VVDHRVSFEITAEDVVRCVEKKPTMMMRTGSVSLQDTERSTKRQENLAEMSNGHDHGGHE 366

Query: 718  ---ELRDGSSTDGEDGQRQQKQRSITLGSSKEFNFDNVDGVYPDKATIGSDWWANEKVLG 888
               E+ +GSSTDGEDGQRQQK RSITLGSSKEFNFDNVDG YPDKATIGSDWWANEKVLG
Sbjct: 367  PSREIHEGSSTDGEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLG 426

Query: 889  KEAGPCNNWIFPMMQPGVS 945
            KE  PCNNWIFPMMQPGVS
Sbjct: 427  KE--PCNNWIFPMMQPGVS 443


>ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum
            lycopersicum]
          Length = 443

 Score =  577 bits (1487), Expect = e-162
 Identities = 281/319 (88%), Positives = 295/319 (92%), Gaps = 4/319 (1%)
 Frame = +1

Query: 1    SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
            SPSGPASIFAIGPYAHE QLVSPPVFS FTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL
Sbjct: 127  SPSGPASIFAIGPYAHETQLVSPPVFSAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 186

Query: 181  LDPNNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHESIAGR 360
            LDPN QNV AGHR+PF+QYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFL+ E   GR
Sbjct: 187  LDPNYQNVAAGHRYPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLEREYTPGR 246

Query: 361  PQFLNLEKIAPHEWGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRLPKPYNGWKNDLT 540
            PQFLNLEKIAPHEWGSRQGSGTLTP+AV PKYHDSFLLN+QN+GV RLPKP+NGWKNDLT
Sbjct: 247  PQFLNLEKIAPHEWGSRQGSGTLTPEAVNPKYHDSFLLNYQNTGVHRLPKPFNGWKNDLT 306

Query: 541  VVDHRVSFEITAEDVVRCVEKKPTLMMRTGSMSLQDAERNTKREENLAEMSNGQEGDGH- 717
            VVDHRVSFEITAEDVVRCVEKKPT+MMRTGS+SLQD ER+TKR+ENLAEMSN  +  GH 
Sbjct: 307  VVDHRVSFEITAEDVVRCVEKKPTMMMRTGSVSLQDTERSTKRQENLAEMSNAHDHSGHE 366

Query: 718  ---ELRDGSSTDGEDGQRQQKQRSITLGSSKEFNFDNVDGVYPDKATIGSDWWANEKVLG 888
               E+ +GSSTDGEDGQRQQK RSITLGSSKEFNFDNVDG YPDKATIGSDWWANEKVLG
Sbjct: 367  PSREIHEGSSTDGEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLG 426

Query: 889  KEAGPCNNWIFPMMQPGVS 945
            KE  PCNNWIFPMMQPGVS
Sbjct: 427  KE--PCNNWIFPMMQPGVS 443


>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  367 bits (941), Expect = 7e-99
 Identities = 201/352 (57%), Positives = 233/352 (66%), Gaps = 37/352 (10%)
 Frame = +1

Query: 1    SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
            SP GPASIFAIGPYAHE QLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFA+L
Sbjct: 129  SPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQL 188

Query: 181  LDPNNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHESI--- 351
             DPNN+N +AGHRF  SQYEFQSYQL PGSPV +LISP S IS SGTSSPF D + +   
Sbjct: 189  FDPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSPFPDRDFVCSG 248

Query: 352  ---------AGRPQFLNLEKIAPHEWGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRL 504
                      G P+ L L+K++ HEWGSR GSG++TPDA+ P   D  +L+ Q S V   
Sbjct: 249  SSQFLEFRAGGPPKLLTLDKLSNHEWGSRIGSGSITPDALGPPSRDGSVLDRQVSDVIHP 308

Query: 505  PK------------------PYNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTLMMRTG 630
            P                     +G  N+  +VDHRVSFE+TAEDVVRCVEK    +++  
Sbjct: 309  PSGDDSVLDRQISDVASHSLSDSGCPNNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAV 368

Query: 631  SMSLQDAERNTKREENLAEMSNGQEGDGHELRDG------SSTDGEDGQRQQKQRSITLG 792
            S SLQ+     + +EN  E+    EG   E  +          +GE+GQ   KQRSITLG
Sbjct: 369  SASLQN-PATVEIDENSREVVVDSEGRVGETANNPPEKAPEDANGEEGQPHHKQRSITLG 427

Query: 793  SSKEFNFDNVDGVYPDKATIGSDWWANEKVLGKEAGPCNNW-IFPMMQPGVS 945
            S+KEFNFDN DG + DK  I SDWWANEKV+GKE G   NW IF MMQP VS
Sbjct: 428  SAKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGASKNWSIFHMMQPSVS 479


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
            gi|557541785|gb|ESR52763.1| hypothetical protein
            CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  360 bits (925), Expect = 5e-97
 Identities = 198/334 (59%), Positives = 230/334 (68%), Gaps = 19/334 (5%)
 Frame = +1

Query: 1    SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
            SP GP+SIFAIGPYAHE QLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFA+L
Sbjct: 131  SPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQL 190

Query: 181  LDPNNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHESIAGR 360
            LDP+ +  + G +FPFS YEFQSY L PGSPV NLISP S IS SGTSSPF D E     
Sbjct: 191  LDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAG 250

Query: 361  PQF-----------LNLEKIAPHEWGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRLP 507
            PQF           LNL+K++  EWGSRQGSGTLTPDAV     + F  N Q S V   P
Sbjct: 251  PQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLTPDAVRSTPRNGFFQNRQISEVALRP 310

Query: 508  KPYNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTLMMRTGSMSLQ-----DAERNTKRE 672
               NG + D  +VDHRVSFE+T EDVVRCVEKKPT +    S SLQ     + E ++   
Sbjct: 311  HSENGLRKD-QIVDHRVSFELTTEDVVRCVEKKPTTLAEAVSESLQNGTTVEKEESSGEA 369

Query: 673  ENLAEMSNGQEGDGHELRDGSSTDGEDGQRQQKQRSITLGSSKEFNFDNVDGVYPDKATI 852
            EN+     G+  +   L+  +  D E+  R QKQ+SITLGS+KEFNFD+ DG    + TI
Sbjct: 370  ENVHHSCAGEAANDEPLK--TPVDVEEAPRHQKQQSITLGSTKEFNFDSADG-DSHEPTI 426

Query: 853  GSDWWANEKVLGKEAGPCNNW-IFPMMQ--PGVS 945
             SDWWANEKV+GK++G   NW  FP++Q  PGVS
Sbjct: 427  ASDWWANEKVVGKDSGAIKNWAFFPVIQPAPGVS 460


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  360 bits (924), Expect = 7e-97
 Identities = 198/334 (59%), Positives = 230/334 (68%), Gaps = 19/334 (5%)
 Frame = +1

Query: 1    SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
            SP GP+SIFAIGPYAHE QLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFA+L
Sbjct: 131  SPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQL 190

Query: 181  LDPNNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHESIAGR 360
            LDP+ +  + G +FPFS YEFQSY L PGSPV NLISP S IS SGTSSPF D E     
Sbjct: 191  LDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAG 250

Query: 361  PQF-----------LNLEKIAPHEWGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRLP 507
            PQF           LNL+K++  EWGSRQGSGTLTPDAV     + F  N Q S V   P
Sbjct: 251  PQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLTPDAVGSTPRNGFFQNRQISEVALRP 310

Query: 508  KPYNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTLMMRTGSMSLQ-----DAERNTKRE 672
               NG + D  +VDHRVSFE+T EDVVRCVEKKPT +    S SLQ     + E ++   
Sbjct: 311  HSENGLRKD-QIVDHRVSFELTTEDVVRCVEKKPTTLAEAVSESLQNGTTVEKEESSGEA 369

Query: 673  ENLAEMSNGQEGDGHELRDGSSTDGEDGQRQQKQRSITLGSSKEFNFDNVDGVYPDKATI 852
            EN+     G+  +   L+  +  D E+  R QKQ+SITLGS+KEFNFD+ DG    + TI
Sbjct: 370  ENVHHSCAGEAANDEPLK--TPVDVEEAPRHQKQQSITLGSTKEFNFDSADG-DSHEPTI 426

Query: 853  GSDWWANEKVLGKEAGPCNNW-IFPMMQ--PGVS 945
             SDWWANEKV+GK++G   NW  FP++Q  PGVS
Sbjct: 427  ASDWWANEKVVGKDSGAIKNWAFFPVIQPAPGVS 460


>gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  353 bits (907), Expect = 6e-95
 Identities = 191/330 (57%), Positives = 222/330 (67%), Gaps = 15/330 (4%)
 Frame = +1

Query: 1    SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
            SPSGP SIFAIGPYAHE QLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFA+L
Sbjct: 127  SPSGPTSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQL 186

Query: 181  LDPNNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHESIA-- 354
            LDP+ +N + G RFP S YEFQSYQL PGSPV  LISP S IS SGTSSPF D E  A  
Sbjct: 187  LDPHFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDLEFAARG 246

Query: 355  ---------GRPQFLNLEKIAPHEWGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRLP 507
                       P+ LNL+ ++  +WGSR GSG++TPD       D FLL  Q   V   P
Sbjct: 247  HHFLEFRTGDPPKLLNLDILSTRDWGSRLGSGSVTPDGAKSTSSDGFLLKPQTPEVVLNP 306

Query: 508  KPYNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTLMMRTGSMSLQDAERNTKREENLAE 687
            +  N  +N+   ++HRVSFE+++E+V+RCVEKKP  +    S SL+D E+   +E+    
Sbjct: 307  RSNNRGRNNDISINHRVSFELSSEEVIRCVEKKPVALAEAVSTSLEDTEKAQSKEDPSKV 366

Query: 688  MSNGQEGDGHELRDGSS---TDGEDGQRQQKQRSITLGSSKEFNFDNVDGVYPDKATIGS 858
            +S+     G    D +     DGE+ Q   KQRSITLGS KEFNFDN DG      +IGS
Sbjct: 367  VSSSICPVGETSNDAAEKAVADGEEAQLHPKQRSITLGSVKEFNFDNPDG-GDSGNSIGS 425

Query: 859  DWWANEKVLGKEAGPCNNW-IFPMMQPGVS 945
            DWWANEKV  KE GP  NW  FPMMQPGVS
Sbjct: 426  DWWANEKVDAKENGPTKNWSFFPMMQPGVS 455


>emb|CBI34651.3| unnamed protein product [Vitis vinifera]
          Length = 412

 Score =  337 bits (864), Expect = 6e-90
 Identities = 188/322 (58%), Positives = 213/322 (66%), Gaps = 7/322 (2%)
 Frame = +1

Query: 1   SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
           SP GPASIFAIGPYAHE QLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFA+L
Sbjct: 129 SPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQL 188

Query: 181 LDPNNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHESIAGR 360
            DPNN+N +AGHRF  SQYEFQSYQL PGSPV +LISP S IS SGTSSPF D       
Sbjct: 189 FDPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSPFPDR------ 242

Query: 361 PQFLNLEKIAPHEWGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRLPKPYNGWKNDLT 540
                              SG++TPDA+ P   D  +L+H  SG P          N+  
Sbjct: 243 -------------------SGSITPDALGPPSRDGSVLDH--SGCP----------NNEI 271

Query: 541 VVDHRVSFEITAEDVVRCVEKKPTLMMRTGSMSLQDAERNTKREENLAEMSNGQEGDGHE 720
           +VDHRVSFE+TAEDVVRCVEK    +++  S SLQ+     + +EN  E+    EG   E
Sbjct: 272 MVDHRVSFELTAEDVVRCVEKDSAALVKAVSASLQN-PATVEIDENSREVVVDSEGRVGE 330

Query: 721 LRDG------SSTDGEDGQRQQKQRSITLGSSKEFNFDNVDGVYPDKATIGSDWWANEKV 882
             +          +GE+GQ   KQRSITLGS+KEFNFDN DG + DK  I SDWWANEKV
Sbjct: 331 TANNPPEKAPEDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWANEKV 390

Query: 883 LGKEAGPCNNW-IFPMMQPGVS 945
           +GKE G   NW IF MMQP VS
Sbjct: 391 VGKEVGASKNWSIFHMMQPSVS 412


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  328 bits (841), Expect = 3e-87
 Identities = 191/336 (56%), Positives = 216/336 (64%), Gaps = 21/336 (6%)
 Frame = +1

Query: 1    SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
            SP GPASIFAIGPYAHE QLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFA+L
Sbjct: 132  SPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQL 191

Query: 181  LDPNNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHESIAGR 360
            LDPN  N + G RFP    EFQSY  QPGSP+  LISP S IS SGTSSPF D E  A  
Sbjct: 192  LDPNIHNGEPGQRFPIFHNEFQSYYFQPGSPIGQLISPSSGISGSGTSSPFPDPEFAARG 251

Query: 361  PQF-----------LNLEKIAPHEWGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRLP 507
            P F           LNL+K++  +WGSRQGSG+LTPD+V P    +F +       P L 
Sbjct: 252  PHFLEFRTGDPPKLLNLDKLSKFDWGSRQGSGSLTPDSVKP--ISTFEV------APHL- 302

Query: 508  KPYNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTLMMRTGSMSLQDAERNTKREENLAE 687
            KP    +N   V D RVSF+++ EDV+R VEKK   +      SL+D     +REEN   
Sbjct: 303  KPNGRCRNAENVADRRVSFDVSTEDVIRYVEKKTVPLAEAMLTSLKDTTMG-QREEN--S 359

Query: 688  MSNGQEGDGHELR---------DGSSTDGEDGQRQQKQRSITLGSSKEFNFDNVDGVYPD 840
             SN  E  G E R         D + T GE+  + QK RSITLGSSKEFNFDN D     
Sbjct: 360  DSNKVEEIGCENRVGETSNEEPDKAPTSGEEVLQHQKHRSITLGSSKEFNFDNADAGDLH 419

Query: 841  KATIGSDWWANEKVLGKEAGPCNNW-IFPMMQPGVS 945
            K+   SDWWAN+KV GKE  P  NW  FPM+QPGVS
Sbjct: 420  KSDSVSDWWANQKVAGKEGAPSQNWSFFPMIQPGVS 455


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346902|gb|ERP65330.1| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  328 bits (841), Expect = 3e-87
 Identities = 190/336 (56%), Positives = 223/336 (66%), Gaps = 21/336 (6%)
 Frame = +1

Query: 1    SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
            SPSGPASIFAIGPYAHE QLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFA+ 
Sbjct: 129  SPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQF 188

Query: 181  LDPNNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHESIAG- 357
            LDP+ +N D G RFPF   +FQSYQ  PGSPV  LISP S IS SGTSSPF D E   G 
Sbjct: 189  LDPSLRNGDTGLRFPF---DFQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGG 245

Query: 358  ----------RPQFLNLEKIAPHEWGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRLP 507
                       P+ LNL+K++  EWGS QGSG LTP++V  +   +FLL+ Q S VP  P
Sbjct: 246  AHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTPESV-RRGSPNFLLHRQFSDVPSRP 304

Query: 508  KPYNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTLMMRTGSMSLQDAERNTKREENLAE 687
            +  NG KN   VV+HRVSFE+TAED  RCVE+KP   ++T     +  E  T+ +E   E
Sbjct: 305  RSGNGHKNG-QVVNHRVSFELTAEDASRCVEEKPAFSIKTVP---EYVENGTQAKE---E 357

Query: 688  MSNGQEGDGHELRDG---------SSTDGEDGQRQQKQRSITLGSSKEFNFDNVDGVYPD 840
             ++G+     E R G         +STDGE   + +KQ+SITLGS KEFNFDN D     
Sbjct: 358  KNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHRKQQSITLGSVKEFNFDNADEGDSR 417

Query: 841  KATIGSDWWANEKVLGKEAGPCNNW-IFPMMQPGVS 945
            K +  S+WWAN  V+GKE     NW  FPM+Q GVS
Sbjct: 418  KPS-SSNWWANGSVIGKEGETTKNWSFFPMVQSGVS 452


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346901|gb|EEE82832.2| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  328 bits (841), Expect = 3e-87
 Identities = 190/336 (56%), Positives = 223/336 (66%), Gaps = 21/336 (6%)
 Frame = +1

Query: 1    SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
            SPSGPASIFAIGPYAHE QLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFA+ 
Sbjct: 130  SPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQF 189

Query: 181  LDPNNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHESIAG- 357
            LDP+ +N D G RFPF   +FQSYQ  PGSPV  LISP S IS SGTSSPF D E   G 
Sbjct: 190  LDPSLRNGDTGLRFPF---DFQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGG 246

Query: 358  ----------RPQFLNLEKIAPHEWGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRLP 507
                       P+ LNL+K++  EWGS QGSG LTP++V  +   +FLL+ Q S VP  P
Sbjct: 247  AHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTPESV-RRGSPNFLLHRQFSDVPSRP 305

Query: 508  KPYNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTLMMRTGSMSLQDAERNTKREENLAE 687
            +  NG KN   VV+HRVSFE+TAED  RCVE+KP   ++T     +  E  T+ +E   E
Sbjct: 306  RSGNGHKNG-QVVNHRVSFELTAEDASRCVEEKPAFSIKTVP---EYVENGTQAKE---E 358

Query: 688  MSNGQEGDGHELRDG---------SSTDGEDGQRQQKQRSITLGSSKEFNFDNVDGVYPD 840
             ++G+     E R G         +STDGE   + +KQ+SITLGS KEFNFDN D     
Sbjct: 359  KNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHRKQQSITLGSVKEFNFDNADEGDSR 418

Query: 841  KATIGSDWWANEKVLGKEAGPCNNW-IFPMMQPGVS 945
            K +  S+WWAN  V+GKE     NW  FPM+Q GVS
Sbjct: 419  KPS-SSNWWANGSVIGKEGETTKNWSFFPMVQSGVS 453


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 422

 Score =  320 bits (820), Expect = 8e-85
 Identities = 178/331 (53%), Positives = 212/331 (64%), Gaps = 19/331 (5%)
 Frame = +1

Query: 10   GPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDP 189
            GP+SIFAIGPYAHE QLVSPPVFSTFTTEPSTAPFTPP ESVHLT PSSPEVPFA+LLD 
Sbjct: 93   GPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDS 152

Query: 190  NNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHESIAG---- 357
            N +  + G R+P S YEFQSYQ  PGSPV  LISP S IS SGTSSPFLD E  +G    
Sbjct: 153  NFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHF 212

Query: 358  -------RPQFLNLEKIAPHEWGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRLPKPY 516
                    P+ LNL+ +   +WGSR  SG++TPDA      + F L           +  
Sbjct: 213  LEFRTGEAPKVLNLDILFTRDWGSRLCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSN 272

Query: 517  NGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTLMMRTGSMSLQDAERNTKREENLAEMSN 696
            +  +ND   + HRVSFE++AE+VVRCVEKKP  +    S SLQ AE+  + E    E+S+
Sbjct: 273  SRRRNDGASIGHRVSFELSAEEVVRCVEKKPVALAEAVSTSLQSAEKAEREEGPNQEVSS 332

Query: 697  GQE-------GDGHELRDGSSTDGEDGQRQQKQRSITLGSSKEFNFDNVDGVYPDKATIG 855
              E        D  E   G   + E   R QK+RSITLGS+KEFNFDN DG     ++I 
Sbjct: 333  SHECPVVDTSNDSSEKAVGGDAE-ELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSIS 391

Query: 856  SDWWANEKVLGKEAGPCNNW-IFPMMQPGVS 945
            +DWWANEKV+ KE G   NW  FPM+QPG+S
Sbjct: 392  TDWWANEKVVLKENGESKNWSFFPMIQPGMS 422


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 459

 Score =  320 bits (820), Expect = 8e-85
 Identities = 178/331 (53%), Positives = 212/331 (64%), Gaps = 19/331 (5%)
 Frame = +1

Query: 10   GPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDP 189
            GP+SIFAIGPYAHE QLVSPPVFSTFTTEPSTAPFTPP ESVHLT PSSPEVPFA+LLD 
Sbjct: 130  GPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDS 189

Query: 190  NNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHESIAG---- 357
            N +  + G R+P S YEFQSYQ  PGSPV  LISP S IS SGTSSPFLD E  +G    
Sbjct: 190  NFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHF 249

Query: 358  -------RPQFLNLEKIAPHEWGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRLPKPY 516
                    P+ LNL+ +   +WGSR  SG++TPDA      + F L           +  
Sbjct: 250  LEFRTGEAPKVLNLDILFTRDWGSRLCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSN 309

Query: 517  NGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTLMMRTGSMSLQDAERNTKREENLAEMSN 696
            +  +ND   + HRVSFE++AE+VVRCVEKKP  +    S SLQ AE+  + E    E+S+
Sbjct: 310  SRRRNDGASIGHRVSFELSAEEVVRCVEKKPVALAEAVSTSLQSAEKAEREEGPNQEVSS 369

Query: 697  GQE-------GDGHELRDGSSTDGEDGQRQQKQRSITLGSSKEFNFDNVDGVYPDKATIG 855
              E        D  E   G   + E   R QK+RSITLGS+KEFNFDN DG     ++I 
Sbjct: 370  SHECPVVDTSNDSSEKAVGGDAE-ELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSIS 428

Query: 856  SDWWANEKVLGKEAGPCNNW-IFPMMQPGVS 945
            +DWWANEKV+ KE G   NW  FPM+QPG+S
Sbjct: 429  TDWWANEKVVLKENGESKNWSFFPMIQPGMS 459


>gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  319 bits (818), Expect = 1e-84
 Identities = 183/327 (55%), Positives = 210/327 (64%), Gaps = 16/327 (4%)
 Frame = +1

Query: 10   GPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDP 189
            GPASIFAIGPYAHE QLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFA+LL P
Sbjct: 132  GPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLGP 191

Query: 190  NNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHESIAG---- 357
            N Q  +   RFP S YEFQSYQL PGSPV  LISP S IS SGTSSPF D E  A     
Sbjct: 192  NLQYGEGVQRFPISHYEFQSYQLHPGSPVGQLISPSSGISGSGTSSPFRDGEFAASLHFP 251

Query: 358  ------RPQFLNLEKIAPHEWGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRLPKPYN 519
                   P+ LNL+K +  EWGS  GSGTLTPDA      + FLL+HQ S +   P   N
Sbjct: 252  EFRMGDPPKLLNLDKHSSCEWGSHHGSGTLTPDATRSTPRNGFLLDHQISEITSHPHLKN 311

Query: 520  -GWKNDLTVVDHRVSFEITAEDVVRCVEKKPTLMMRTGSMSLQDAERNTKREENLAEMSN 696
               +ND    +HRVSFE+T E+VVR +E +        S SLQ  E   + EE+  ++ +
Sbjct: 312  KEVQNDQVAHNHRVSFELTTEEVVRSLEMETATPSEAVSGSLQ-IEATRESEEHDTKVVD 370

Query: 697  GQE----GDGHELRDGSSTDGEDGQRQQKQRSITLGSSKEFNFDNVDGVYPDKATIGSDW 864
              E       +E  + +  D E   +  K +SITLGS+KEFNFDNVDG    K  + SDW
Sbjct: 371  DYECRVGETSNERPEKALADREGKPQHHKHQSITLGSAKEFNFDNVDGGDAHKPILTSDW 430

Query: 865  WANEKVLGKEAGPCNNW-IFPMMQPGV 942
            WAN+KV GK  G   NW  FPMMQPGV
Sbjct: 431  WANDKVAGKGGGVPRNWSFFPMMQPGV 457


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
            gi|223549721|gb|EEF51209.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 459

 Score =  316 bits (809), Expect = 1e-83
 Identities = 185/336 (55%), Positives = 218/336 (64%), Gaps = 22/336 (6%)
 Frame = +1

Query: 1    SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
            SPSGPASIFAIGPYAHE QLVSPP FSTFTTEPSTAPFTPPPESV LTTPSSPEVPFA+L
Sbjct: 134  SPSGPASIFAIGPYAHETQLVSPPAFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQL 193

Query: 181  LDPNNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHESIAGR 360
            L+P+N+N +AG RFPFS YEFQSYQ  PGSPV  LISP S IS SGTSSPF D E  A  
Sbjct: 194  LEPSNRNGEAGLRFPFSNYEFQSYQFYPGSPVGQLISPSSGISGSGTSSPFPDGEFAAAG 253

Query: 361  PQF-----------LNLEKIAPHEWGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRLP 507
            P+F           LNL+K++ HE GSRQGSGTLTPDAV      SF L+ Q S +    
Sbjct: 254  PRFLEFQMAVPPKLLNLDKLSVHECGSRQGSGTLTPDAVRAT-SCSFPLDRQCSDIASNR 312

Query: 508  KPYNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTLMMRTGSMSLQD---AERNTKREE- 675
               N  K+D  V D RVSF+++AED +R  E KP   ++    S+++   AE+  K  E 
Sbjct: 313  HSDNENKDD-QVADLRVSFDLSAEDALRYAEPKPASPVKIMPESMKNEIAAEKVQKSSEI 371

Query: 676  ------NLAEMSNGQEGDGHELRDGSSTDGEDGQRQQKQRSITLGSSKEFNFDNVDGVYP 837
                   + E SNG       + + +ST GE   R QK R++TLG+ KEFNFDN DGV  
Sbjct: 372  RHNFECRVGETSNG-------ILEQASTGGEKTPRHQKHRTLTLGTFKEFNFDNADGV-- 422

Query: 838  DKATIGSDWWANEKVLGKEAGPCNNW-IFPMMQPGV 942
             K + G DWW N   +GKE     NW  FP+MQP +
Sbjct: 423  PKPSAGPDWWDNGSDVGKEDFTAKNWSFFPVMQPSI 458


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  311 bits (798), Expect = 3e-82
 Identities = 177/329 (53%), Positives = 203/329 (61%), Gaps = 14/329 (4%)
 Frame = +1

Query: 1    SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
            SPSGPAS+FAIGPYAHE QLVSPPVFSTF TEPSTAPFTPPPESV LTTPSSPEVPFA+L
Sbjct: 123  SPSGPASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQL 182

Query: 181  ----LDPNNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHES 348
                LD + +N     +   S YEFQ YQL P SPV +LISP   IS SGTSSPF D   
Sbjct: 183  LTSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDRRP 239

Query: 349  IAGRPQFLNLEKIAPHEWGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRLPKPYNGWK 528
            I   P+ L  E  +   WGSR GSG+LTPD   P   DSFLL +Q S V  L    +G +
Sbjct: 240  IVEAPKLLGFEHFSTRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQ 299

Query: 529  NDLTVVDHRVSFEITAEDVVRCVEKKPTLMMRTGSMSLQDAERNTKREENLAEMSNGQEG 708
            N  TV+DHRVSFE+  EDV  CVEKKP     T   +LQD     + E     +S   E 
Sbjct: 300  NGETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTEN 359

Query: 709  -----DGHELRDGS---STDGEDGQRQQKQRSITLGSSKEFNFDNVDGVYPDKAT-IGSD 861
                  G  L+  S   S +GE+ Q  +K   I  GS KEFNFDN  G    K   IGS+
Sbjct: 360  CCEFCVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSE 419

Query: 862  WWANEKVLGKEAGPCNNW-IFPMMQPGVS 945
            WW NEKV+GK  GP  NW  FP++QPG+S
Sbjct: 420  WWVNEKVVGKGTGPQTNWTFFPLLQPGIS 448


>emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]
          Length = 385

 Score =  311 bits (798), Expect = 3e-82
 Identities = 177/329 (53%), Positives = 203/329 (61%), Gaps = 14/329 (4%)
 Frame = +1

Query: 1    SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
            SPSGPAS+FAIGPYAHE QLVSPPVFSTF TEPSTAPFTPPPESV LTTPSSPEVPFA+L
Sbjct: 60   SPSGPASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQL 119

Query: 181  ----LDPNNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHES 348
                LD + +N     +   S YEFQ YQL P SPV +LISP   IS SGTSSPF D   
Sbjct: 120  LTSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDRRP 176

Query: 349  IAGRPQFLNLEKIAPHEWGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRLPKPYNGWK 528
            I   P+ L  E  +   WGSR GSG+LTPD   P   DSFLL +Q S V  L    +G +
Sbjct: 177  IVEAPKLLGFEHFSTRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQ 236

Query: 529  NDLTVVDHRVSFEITAEDVVRCVEKKPTLMMRTGSMSLQDAERNTKREENLAEMSNGQEG 708
            N  TV+DHRVSFE+  EDV  CVEKKP     T   +LQD     + E     +S   E 
Sbjct: 237  NGETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTEN 296

Query: 709  -----DGHELRDGS---STDGEDGQRQQKQRSITLGSSKEFNFDNVDGVYPDKAT-IGSD 861
                  G  L+  S   S +GE+ Q  +K   I  GS KEFNFDN  G    K   IGS+
Sbjct: 297  CCEFCVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSE 356

Query: 862  WWANEKVLGKEAGPCNNW-IFPMMQPGVS 945
            WW NEKV+GK  GP  NW  FP++QPG+S
Sbjct: 357  WWVNEKVVGKGTGPQTNWTFFPLLQPGIS 385


>gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao]
          Length = 489

 Score =  306 bits (785), Expect = 9e-81
 Identities = 183/363 (50%), Positives = 213/363 (58%), Gaps = 48/363 (13%)
 Frame = +1

Query: 1    SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
            SP GPASIFAIGPYAHE QLV+PPVFS  TTEPSTAPFTPPPESV LTTPSSPEVPFA+L
Sbjct: 127  SPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQL 186

Query: 181  LDPN----NQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHES 348
            L  +     +N     +F  S YEFQSYQ+ PGSP  NLISPGSAIS SGTSSPF D   
Sbjct: 187  LTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRP 246

Query: 349  I-----AGRPQFLNLEKIAPHEWGSRQGS------------------------------- 420
            I        P+ L  E     +WGSR GS                               
Sbjct: 247  ILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLG 306

Query: 421  -GTLTPDAVYPKYHDSFLLNHQNSGVPRLPKPYNGWKNDLTVVDHRVSFEITAEDVVRCV 597
             G+LTPD + P   D FL+  Q S V  L  P NG KND T+VDHRVSFE++ EDV  C+
Sbjct: 307  SGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCL 366

Query: 598  EKKPTLMMRTGSMSLQD--AERNTKREENLAEMSNGQE----GDGHELRDGSSTDGEDGQ 759
            E K  L  R  S   +D  AE   +R+    ++ +  E       +E  + +S + E+  
Sbjct: 367  ESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEH 426

Query: 760  RQQKQRSITLGSSKEFNFDNVDGVYPDKATIGSDWWANEKVLGKEAGPCNNW-IFPMMQP 936
              QK RS+TLGS KEFNFDN  G   DK TI S+WWANEKV GKEA P N+W  FPM+QP
Sbjct: 427  SYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQP 486

Query: 937  GVS 945
             VS
Sbjct: 487  EVS 489


>gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
          Length = 485

 Score =  306 bits (785), Expect = 9e-81
 Identities = 183/363 (50%), Positives = 213/363 (58%), Gaps = 48/363 (13%)
 Frame = +1

Query: 1    SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
            SP GPASIFAIGPYAHE QLV+PPVFS  TTEPSTAPFTPPPESV LTTPSSPEVPFA+L
Sbjct: 123  SPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQL 182

Query: 181  LDPN----NQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHES 348
            L  +     +N     +F  S YEFQSYQ+ PGSP  NLISPGSAIS SGTSSPF D   
Sbjct: 183  LTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRP 242

Query: 349  I-----AGRPQFLNLEKIAPHEWGSRQGS------------------------------- 420
            I        P+ L  E     +WGSR GS                               
Sbjct: 243  ILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLG 302

Query: 421  -GTLTPDAVYPKYHDSFLLNHQNSGVPRLPKPYNGWKNDLTVVDHRVSFEITAEDVVRCV 597
             G+LTPD + P   D FL+  Q S V  L  P NG KND T+VDHRVSFE++ EDV  C+
Sbjct: 303  SGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCL 362

Query: 598  EKKPTLMMRTGSMSLQD--AERNTKREENLAEMSNGQE----GDGHELRDGSSTDGEDGQ 759
            E K  L  R  S   +D  AE   +R+    ++ +  E       +E  + +S + E+  
Sbjct: 363  ESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEH 422

Query: 760  RQQKQRSITLGSSKEFNFDNVDGVYPDKATIGSDWWANEKVLGKEAGPCNNW-IFPMMQP 936
              QK RS+TLGS KEFNFDN  G   DK TI S+WWANEKV GKEA P N+W  FPM+QP
Sbjct: 423  SYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQP 482

Query: 937  GVS 945
             VS
Sbjct: 483  EVS 485


>ref|XP_003516706.1| PREDICTED: uncharacterized protein LOC100777876 [Glycine max]
          Length = 431

 Score =  285 bits (728), Expect = 4e-74
 Identities = 157/323 (48%), Positives = 202/323 (62%), Gaps = 13/323 (4%)
 Frame = +1

Query: 1    SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
            SP GP SIFAIGPYAHE QLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFA+L
Sbjct: 117  SPCGPFSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQL 176

Query: 181  LDPNNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHE----- 345
            LDPN +N +   RF  SQY+F SYQL PGSPV  LISP SA S SGTSSPF D +     
Sbjct: 177  LDPNTKNSETYQRFQISQYDFHSYQLHPGSPVGQLISPRSAFSPSGTSSPFPDTDFNSRG 236

Query: 346  ------SIAGRPQFLNLEKIAPHE-WGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPRL 504
                   I    + LN +K + +E   S QGSG+LTPD++       FL +H  S +   
Sbjct: 237  SLLLDFQIGDPTKLLNFDKPSTNENHKSHQGSGSLTPDSIRSTTQAGFLPSHWVSDIIMS 296

Query: 505  PKPYNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTLMMRTGSMSLQDAERNTKREENLA 684
            P+P     N+++ V+HRVS E++A++V++CVE K   + +  +    DA    K++ ++ 
Sbjct: 297  PRPRKNHPNEIS-VNHRVSIEVSAQEVLKCVENKAVALSKLKT----DAPGEDKKDNSIE 351

Query: 685  EMSNGQEGDGHELRDGSSTDGEDGQRQQKQRSITLGSSKEFNFDNVDGVYPDKATIGSDW 864
             + +    D  +    ++ +G+  +   K   I   ++KEFNFDN +G       I +DW
Sbjct: 352  VLVSETPNDAPQ---QTADNGDVERAHHKDECIIFSAAKEFNFDNAEGGDSPAPNIVADW 408

Query: 865  WANEKVLGKEAGPCNNW-IFPMM 930
            WANEKV  KE G  NNW  FPM+
Sbjct: 409  WANEKVASKEGGSSNNWSFFPMI 431


>ref|XP_006580574.1| PREDICTED: uncharacterized protein LOC100791666 isoform X2 [Glycine
            max]
          Length = 441

 Score =  281 bits (719), Expect = 4e-73
 Identities = 168/335 (50%), Positives = 206/335 (61%), Gaps = 20/335 (5%)
 Frame = +1

Query: 1    SPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 180
            SP GPASIFAIGPYAHE QLVSPPVFS      STAPFTPPPESVH+TTPSSPEVPFA+L
Sbjct: 112  SPGGPASIFAIGPYAHETQLVSPPVFSA----SSTAPFTPPPESVHMTTPSSPEVPFAQL 167

Query: 181  LDPNNQNVDAGHRFPFSQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDHE----- 345
            LDPNN+N +   RF  S Y+FQSYQ  PGSPV  LISP SAISVSGTSSP  D E     
Sbjct: 168  LDPNNKNSETFQRFQISHYDFQSYQFHPGSPVGQLISPRSAISVSGTSSPLPDSEFNATF 227

Query: 346  ------SIAGRPQFLNLE-KIAPHE-WGSRQGSGTLTPDAVYPKYHDSFLLNHQNSGVPR 501
                    A  P+ LNL+ K++  E   S  GSG+LTPDA        FL NH  S +  
Sbjct: 228  AHILDFQRADPPKLLNLDNKLSSCENQKSNHGSGSLTPDAARSTTQSGFLSNHWVSEIKM 287

Query: 502  LPKPYNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTLMMRTGSM-SLQDAERNTKREEN 678
             P P N   N+++ ++HRVSFE++A+ V++ +E KP     T  +  L++    T +EE 
Sbjct: 288  SPHPSNNRLNEIS-INHRVSFELSAQKVLKSLENKPAASAWTNVLPKLKNDAPTTDKEEK 346

Query: 679  LAEMSNGQE---GDGHELRDGSSTDGEDGQR--QQKQRSITLGSSKEFNFDNVDGVYPDK 843
              E +   +    + H  +   +T G D      +K +S+TL S+KEFNFDN DG     
Sbjct: 347  SEESALDDKQVVSEAHNDQPLETTLGGDKATTVHEKDQSLTLSSAKEFNFDNADGGDSLA 406

Query: 844  ATIGSDWWANEKVLGKEAGPCNNW-IFPMMQPGVS 945
              I +DWWANEKV GKE     +W  FPM+QPGVS
Sbjct: 407  PNIVADWWANEKVAGKEREASKDWSFFPMIQPGVS 441

