BLASTX nr result

ID: Atropa21_contig00011148

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00011148
         (1097 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   424   e-116
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   421   e-115
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   221   3e-55
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   221   4e-55
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   219   1e-54
gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus pe...   218   4e-54
gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [...   204   4e-50
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   204   5e-50
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   204   5e-50
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     203   9e-50
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   196   1e-47
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   196   1e-47
gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein i...   196   2e-47
gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein i...   196   2e-47
emb|CBI34651.3| unnamed protein product [Vitis vinifera]              195   2e-47
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   189   1e-45
emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]   189   1e-45
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   185   2e-44
gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus pe...   184   5e-44
ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr...   176   1e-41

>ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum
           tuberosum]
          Length = 443

 Score =  424 bits (1090), Expect = e-116
 Identities = 208/250 (83%), Positives = 222/250 (88%), Gaps = 8/250 (3%)
 Frame = +2

Query: 5   AGQRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDREST--RPQFLNLEKI 178
           AG R+PFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSP LDRE T  RPQFLNLEKI
Sbjct: 196 AGHRYPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDREYTPGRPQFLNLEKI 255

Query: 179 APHEWGSRQGSRTLTPDTVYPKYHDSLLLNHQNSGIPRLPKPFNGWKNDLTVVEHRVSFE 358
           APHEWGSRQGS TLTP+ V PKYHD+ LLN+QNSG+ RLPKPFNGWKNDLTVV+HRVSFE
Sbjct: 256 APHEWGSRQGSGTLTPEAVNPKYHDNFLLNYQNSGVHRLPKPFNGWKNDLTVVDHRVSFE 315

Query: 359 ITAEDVVRCVEKKPSLLMKTGSVS------HNKREEKLAEMSNGQELDGHEPSSEIREGS 520
           ITAEDVVRCVEKKP+++M+TGSVS        KR+E LAEMSNG +  GHEPS EI EGS
Sbjct: 316 ITAEDVVRCVEKKPTMMMRTGSVSLQDTERSTKRQENLAEMSNGHDHGGHEPSREIHEGS 375

Query: 521 STDGEDGQRHQKHRSITLGSSKEFNFDNVDGGYPDKAIVGSDWWANEKVLGKEGGPCNNW 700
           STDGEDGQR QKHRSITLGSSKEFNFDNVDGGYPDKA +GSDWWANEKVLGKE  PCNNW
Sbjct: 376 STDGEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLGKE--PCNNW 433

Query: 701 IFPMIQPGVS 730
           IFPM+QPGVS
Sbjct: 434 IFPMMQPGVS 443


>ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum
           lycopersicum]
          Length = 443

 Score =  421 bits (1081), Expect = e-115
 Identities = 206/250 (82%), Positives = 221/250 (88%), Gaps = 8/250 (3%)
 Frame = +2

Query: 5   AGQRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDREST--RPQFLNLEKI 178
           AG R+PFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSP L+RE T  RPQFLNLEKI
Sbjct: 196 AGHRYPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLEREYTPGRPQFLNLEKI 255

Query: 179 APHEWGSRQGSRTLTPDTVYPKYHDSLLLNHQNSGIPRLPKPFNGWKNDLTVVEHRVSFE 358
           APHEWGSRQGS TLTP+ V PKYHDS LLN+QN+G+ RLPKPFNGWKNDLTVV+HRVSFE
Sbjct: 256 APHEWGSRQGSGTLTPEAVNPKYHDSFLLNYQNTGVHRLPKPFNGWKNDLTVVDHRVSFE 315

Query: 359 ITAEDVVRCVEKKPSLLMKTGSVS------HNKREEKLAEMSNGQELDGHEPSSEIREGS 520
           ITAEDVVRCVEKKP+++M+TGSVS        KR+E LAEMSN  +  GHEPS EI EGS
Sbjct: 316 ITAEDVVRCVEKKPTMMMRTGSVSLQDTERSTKRQENLAEMSNAHDHSGHEPSREIHEGS 375

Query: 521 STDGEDGQRHQKHRSITLGSSKEFNFDNVDGGYPDKAIVGSDWWANEKVLGKEGGPCNNW 700
           STDGEDGQR QKHRSITLGSSKEFNFDNVDGGYPDKA +GSDWWANEKVLGKE  PCNNW
Sbjct: 376 STDGEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLGKE--PCNNW 433

Query: 701 IFPMIQPGVS 730
           IFPM+QPGVS
Sbjct: 434 IFPMMQPGVS 443


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
           gi|557541785|gb|ESR52763.1| hypothetical protein
           CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  221 bits (564), Expect = 3e-55
 Identities = 131/264 (49%), Positives = 160/264 (60%), Gaps = 21/264 (7%)
 Frame = +2

Query: 2   DAGQRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDRE--STRPQF----- 160
           + GQ+FPF+ YEFQSY L PGSPV NLISP S IS SGTSSP  D E  +  PQF     
Sbjct: 199 EQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHR 258

Query: 161 ------LNLEKIAPHEWGSRQGSRTLTPDTVYPKYHDSLLLNHQNSGIPRLPKPFNGWKN 322
                 LNL+K++  EWGSRQGS TLTPD V     +    N Q S +   P   NG + 
Sbjct: 259 GDPPKLLNLDKLSIREWGSRQGSGTLTPDAVRSTPRNGFFQNRQISEVALRPHSENGLRK 318

Query: 323 DLTVVEHRVSFEITAEDVVRCVEKKPSLLMKTGSVSHN-----KREEKLAEMSNGQELDG 487
           D  +V+HRVSFE+T EDVVRCVEKKP+ L +  S S       ++EE   E  N      
Sbjct: 319 D-QIVDHRVSFELTTEDVVRCVEKKPTTLAEAVSESLQNGTTVEKEESSGEAENVHHSCA 377

Query: 488 HEPSSEIREGSSTDGEDGQRHQKHRSITLGSSKEFNFDNVDGGYPDKAIVGSDWWANEKV 667
            E +++    +  D E+  RHQK +SITLGS+KEFNFD+ DG   +  I  SDWWANEKV
Sbjct: 378 GEAANDEPLKTPVDVEEAPRHQKQQSITLGSTKEFNFDSADGDSHEPTI-ASDWWANEKV 436

Query: 668 LGKEGGPCNNW-IFPMIQ--PGVS 730
           +GK+ G   NW  FP+IQ  PGVS
Sbjct: 437 VGKDSGAIKNWAFFPVIQPAPGVS 460


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  221 bits (563), Expect = 4e-55
 Identities = 131/264 (49%), Positives = 160/264 (60%), Gaps = 21/264 (7%)
 Frame = +2

Query: 2   DAGQRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDRE--STRPQF----- 160
           + GQ+FPF+ YEFQSY L PGSPV NLISP S IS SGTSSP  D E  +  PQF     
Sbjct: 199 EQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHR 258

Query: 161 ------LNLEKIAPHEWGSRQGSRTLTPDTVYPKYHDSLLLNHQNSGIPRLPKPFNGWKN 322
                 LNL+K++  EWGSRQGS TLTPD V     +    N Q S +   P   NG + 
Sbjct: 259 GDPPKLLNLDKLSIREWGSRQGSGTLTPDAVGSTPRNGFFQNRQISEVALRPHSENGLRK 318

Query: 323 DLTVVEHRVSFEITAEDVVRCVEKKPSLLMKTGSVSHN-----KREEKLAEMSNGQELDG 487
           D  +V+HRVSFE+T EDVVRCVEKKP+ L +  S S       ++EE   E  N      
Sbjct: 319 D-QIVDHRVSFELTTEDVVRCVEKKPTTLAEAVSESLQNGTTVEKEESSGEAENVHHSCA 377

Query: 488 HEPSSEIREGSSTDGEDGQRHQKHRSITLGSSKEFNFDNVDGGYPDKAIVGSDWWANEKV 667
            E +++    +  D E+  RHQK +SITLGS+KEFNFD+ DG   +  I  SDWWANEKV
Sbjct: 378 GEAANDEPLKTPVDVEEAPRHQKQQSITLGSTKEFNFDSADGDSHEPTI-ASDWWANEKV 436

Query: 668 LGKEGGPCNNW-IFPMIQ--PGVS 730
           +GK+ G   NW  FP+IQ  PGVS
Sbjct: 437 VGKDSGAIKNWAFFPVIQPAPGVS 460


>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  219 bits (559), Expect = 1e-54
 Identities = 130/283 (45%), Positives = 161/283 (56%), Gaps = 40/283 (14%)
 Frame = +2

Query: 2    DAGQRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDREST----------- 148
            +AG RF  +QYEFQSYQL PGSPV +LISP S IS SGTSSP  DR+             
Sbjct: 197  EAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSPFPDRDFVCSGSSQFLEFR 256

Query: 149  ---RPQFLNLEKIAPHEWGSRQGSRTLTPDTVYPKYHDSLLLNHQNSGIPRLPK------ 301
                P+ L L+K++ HEWGSR GS ++TPD + P   D  +L+ Q S +   P       
Sbjct: 257  AGGPPKLLTLDKLSNHEWGSRIGSGSITPDALGPPSRDGSVLDRQVSDVIHPPSGDDSVL 316

Query: 302  ------------PFNGWKNDLTVVEHRVSFEITAEDVVRCVEKKPSLLMKTGSVS-HNKR 442
                          +G  N+  +V+HRVSFE+TAEDVVRCVEK  + L+K  S S  N  
Sbjct: 317  DRQISDVASHSLSDSGCPNNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAVSASLQNPA 376

Query: 443  EEKLAEMSNGQELDGHEPSSEIREG------SSTDGEDGQRHQKHRSITLGSSKEFNFDN 604
              ++ E S    +D      E             +GE+GQ H K RSITLGS+KEFNFDN
Sbjct: 377  TVEIDENSREVVVDSEGRVGETANNPPEKAPEDANGEEGQPHHKQRSITLGSAKEFNFDN 436

Query: 605  VDGGYPDKAIVGSDWWANEKVLGKEGGPCNNW-IFPMIQPGVS 730
             DGG+ DK  + SDWWANEKV+GKE G   NW IF M+QP VS
Sbjct: 437  ADGGHSDKPNISSDWWANEKVVGKEVGASKNWSIFHMMQPSVS 479


>gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  218 bits (554), Expect = 4e-54
 Identities = 123/262 (46%), Positives = 154/262 (58%), Gaps = 19/262 (7%)
 Frame = +2

Query: 2   DAGQRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLD-------------RE 142
           + GQRFP + YEFQSYQL PGSPV  LISP S IS SGTSSP  D             R 
Sbjct: 195 EGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEFRT 254

Query: 143 STRPQFLNLEKIAPHEWGSRQGSRTLTPDTVYPKYHDSLLLNHQNSGIPRLPKPFNGWKN 322
              P+ LNL+ ++  +WGSR GS ++TPD       D  LL  Q   +   P+  N  +N
Sbjct: 255 GDPPKLLNLDILSTRDWGSRLGSGSVTPDGAKSTSSDGFLLKPQTPEVVLNPRSNNRGRN 314

Query: 323 DLTVVEHRVSFEITAEDVVRCVEKKPSLLMKTGSVSHNKREEKLAEMSNGQELDGH---- 490
           +   + HRVSFE+++E+V+RCVEKKP  L +  S S    E+  ++    + +       
Sbjct: 315 NDISINHRVSFELSSEEVIRCVEKKPVALAEAVSTSLEDTEKAQSKEDPSKVVSSSICPV 374

Query: 491 -EPSSEIREGSSTDGEDGQRHQKHRSITLGSSKEFNFDNVDGGYPDKAIVGSDWWANEKV 667
            E S++  E +  DGE+ Q H K RSITLGS KEFNFDN DGG    +I GSDWWANEKV
Sbjct: 375 GETSNDAAEKAVADGEEAQLHPKQRSITLGSVKEFNFDNPDGGDSGNSI-GSDWWANEKV 433

Query: 668 LGKEGGPCNNW-IFPMIQPGVS 730
             KE GP  NW  FPM+QPGVS
Sbjct: 434 DAKENGPTKNWSFFPMMQPGVS 455


>gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  204 bits (520), Expect = 4e-50
 Identities = 123/258 (47%), Positives = 150/258 (58%), Gaps = 19/258 (7%)
 Frame = +2

Query: 11  QRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDRESTR------------P 154
           QRFP + YEFQSYQL PGSPV  LISP S IS SGTSSP  D E               P
Sbjct: 200 QRFPISHYEFQSYQLHPGSPVGQLISPSSGISGSGTSSPFRDGEFAASLHFPEFRMGDPP 259

Query: 155 QFLNLEKIAPHEWGSRQGSRTLTPDTVYPKYHDSLLLNHQNSGIPRLPKPFNGW-KNDLT 331
           + LNL+K +  EWGS  GS TLTPD       +  LL+HQ S I   P   N   +ND  
Sbjct: 260 KLLNLDKHSSCEWGSHHGSGTLTPDATRSTPRNGFLLDHQISEITSHPHLKNKEVQNDQV 319

Query: 332 VVEHRVSFEITAEDVVRCVEKKPSLLMKTGSVSHNKREEKLAEMSNGQELDGHE-----P 496
              HRVSFE+T E+VVR +E + +   +  S S      + +E  + + +D +E      
Sbjct: 320 AHNHRVSFELTTEEVVRSLEMETATPSEAVSGSLQIEATRESEEHDTKVVDDYECRVGET 379

Query: 497 SSEIREGSSTDGEDGQRHQKHRSITLGSSKEFNFDNVDGGYPDKAIVGSDWWANEKVLGK 676
           S+E  E +  D E   +H KH+SITLGS+KEFNFDNVDGG   K I+ SDWWAN+KV GK
Sbjct: 380 SNERPEKALADREGKPQHHKHQSITLGSAKEFNFDNVDGGDAHKPILTSDWWANDKVAGK 439

Query: 677 EGGPCNNW-IFPMIQPGV 727
            GG   NW  FPM+QPGV
Sbjct: 440 GGGVPRNWSFFPMMQPGV 457


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
           gi|550346902|gb|ERP65330.1| hypothetical protein
           POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  204 bits (519), Expect = 5e-50
 Identities = 124/262 (47%), Positives = 154/262 (58%), Gaps = 19/262 (7%)
 Frame = +2

Query: 2   DAGQRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDRE------------- 142
           D G RFPF   +FQSYQ  PGSPV  LISP S IS SGTSSP  D E             
Sbjct: 197 DTGLRFPF---DFQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRI 253

Query: 143 STRPQFLNLEKIAPHEWGSRQGSRTLTPDTVYPKYHDSLLLNHQNSGIPRLPKPFNGWKN 322
              P+ LNL+K++  EWGS QGS  LTP++V  +   + LL+ Q S +P  P+  NG KN
Sbjct: 254 GEPPKLLNLDKLSTCEWGSYQGSGALTPESV-RRGSPNFLLHRQFSDVPSRPRSGNGHKN 312

Query: 323 DLTVVEHRVSFEITAEDVVRCVEKKPSLLMKTGSVSHNKREEKLAEMSNGQELDGHE--- 493
              VV HRVSFE+TAED  RCVE+KP+  +KT         +   E ++G+ +   E   
Sbjct: 313 G-QVVNHRVSFELTAEDASRCVEEKPAFSIKTVPEYVENGTQAKEEKNSGESIQSFECRV 371

Query: 494 --PSSEIREGSSTDGEDGQRHQKHRSITLGSSKEFNFDNVDGGYPDKAIVGSDWWANEKV 667
              S++  E +STDGE   +H+K +SITLGS KEFNFDN D G   K    S+WWAN  V
Sbjct: 372 GVTSNDSPEMASTDGEAAPQHRKQQSITLGSVKEFNFDNADEGDSRKP-SSSNWWANGSV 430

Query: 668 LGKEGGPCNNW-IFPMIQPGVS 730
           +GKEG    NW  FPM+Q GVS
Sbjct: 431 IGKEGETTKNWSFFPMVQSGVS 452


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
           gi|550346901|gb|EEE82832.2| hypothetical protein
           POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  204 bits (519), Expect = 5e-50
 Identities = 124/262 (47%), Positives = 154/262 (58%), Gaps = 19/262 (7%)
 Frame = +2

Query: 2   DAGQRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDRE------------- 142
           D G RFPF   +FQSYQ  PGSPV  LISP S IS SGTSSP  D E             
Sbjct: 198 DTGLRFPF---DFQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRI 254

Query: 143 STRPQFLNLEKIAPHEWGSRQGSRTLTPDTVYPKYHDSLLLNHQNSGIPRLPKPFNGWKN 322
              P+ LNL+K++  EWGS QGS  LTP++V  +   + LL+ Q S +P  P+  NG KN
Sbjct: 255 GEPPKLLNLDKLSTCEWGSYQGSGALTPESV-RRGSPNFLLHRQFSDVPSRPRSGNGHKN 313

Query: 323 DLTVVEHRVSFEITAEDVVRCVEKKPSLLMKTGSVSHNKREEKLAEMSNGQELDGHE--- 493
              VV HRVSFE+TAED  RCVE+KP+  +KT         +   E ++G+ +   E   
Sbjct: 314 G-QVVNHRVSFELTAEDASRCVEEKPAFSIKTVPEYVENGTQAKEEKNSGESIQSFECRV 372

Query: 494 --PSSEIREGSSTDGEDGQRHQKHRSITLGSSKEFNFDNVDGGYPDKAIVGSDWWANEKV 667
              S++  E +STDGE   +H+K +SITLGS KEFNFDN D G   K    S+WWAN  V
Sbjct: 373 GVTSNDSPEMASTDGEAAPQHRKQQSITLGSVKEFNFDNADEGDSRKP-SSSNWWANGSV 431

Query: 668 LGKEGGPCNNW-IFPMIQPGVS 730
           +GKEG    NW  FPM+Q GVS
Sbjct: 432 IGKEGETTKNWSFFPMVQSGVS 453


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  203 bits (517), Expect = 9e-50
 Identities = 126/267 (47%), Positives = 155/267 (58%), Gaps = 24/267 (8%)
 Frame = +2

Query: 2   DAGQRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLD-------------RE 142
           + GQRFP    EFQSY  QPGSP+  LISP S IS SGTSSP  D             R 
Sbjct: 200 EPGQRFPIFHNEFQSYYFQPGSPIGQLISPSSGISGSGTSSPFPDPEFAARGPHFLEFRT 259

Query: 143 STRPQFLNLEKIAPHEWGSRQGSRTLTPDTVYPKYHDSLLLNHQNSGIPRLPKPFNGWKN 322
              P+ LNL+K++  +WGSRQGS +LTPD+V P     +         P L KP    +N
Sbjct: 260 GDPPKLLNLDKLSKFDWGSRQGSGSLTPDSVKPISTFEVA--------PHL-KPNGRCRN 310

Query: 323 DLTVVEHRVSFEITAEDVVRCVEKKP-----SLLMKTGSVSHNKREE-----KLAEMSNG 472
              V + RVSF+++ EDV+R VEKK      ++L      +  +REE     K+ E+  G
Sbjct: 311 AENVADRRVSFDVSTEDVIRYVEKKTVPLAEAMLTSLKDTTMGQREENSDSNKVEEI--G 368

Query: 473 QELDGHEPSSEIREGSSTDGEDGQRHQKHRSITLGSSKEFNFDNVDGGYPDKAIVGSDWW 652
            E    E S+E  + + T GE+  +HQKHRSITLGSSKEFNFDN D G   K+   SDWW
Sbjct: 369 CENRVGETSNEEPDKAPTSGEEVLQHQKHRSITLGSSKEFNFDNADAGDLHKSDSVSDWW 428

Query: 653 ANEKVLGKEGGPCNNW-IFPMIQPGVS 730
           AN+KV GKEG P  NW  FPMIQPGVS
Sbjct: 429 ANQKVAGKEGAPSQNWSFFPMIQPGVS 455


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
           vesca subsp. vesca]
          Length = 422

 Score =  196 bits (498), Expect = 1e-47
 Identities = 118/266 (44%), Positives = 146/266 (54%), Gaps = 23/266 (8%)
 Frame = +2

Query: 2   DAGQRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDRE------------- 142
           + GQR+P + YEFQSYQ  PGSPV  LISP S IS SGTSSP LD E             
Sbjct: 158 EGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRT 217

Query: 143 STRPQFLNLEKIAPHEWGSRQGSRTLTPDTVYPKYHDSLLLNHQNSGIPRLPKPFNGWKN 322
              P+ LNL+ +   +WGSR  S ++TPD       +   L           +  +  +N
Sbjct: 218 GEAPKVLNLDILFTRDWGSRLCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSNSRRRN 277

Query: 323 DLTVVEHRVSFEITAEDVVRCVEKKPSLLMKTGSVSHNKREEKLAEMSNGQEL------- 481
           D   + HRVSFE++AE+VVRCVEKKP  L +  S S    E+   E    QE+       
Sbjct: 278 DGASIGHRVSFELSAEEVVRCVEKKPVALAEAVSTSLQSAEKAEREEGPNQEVSSSHECP 337

Query: 482 --DGHEPSSEIREGSSTDGEDGQRHQKHRSITLGSSKEFNFDNVDGGYPDKAIVGSDWWA 655
             D    SSE   G   + E   R+QK RSITLGS+KEFNFDN DGG    + + +DWWA
Sbjct: 338 VVDTSNDSSEKAVGGDAE-ELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWA 396

Query: 656 NEKVLGKEGGPCNNW-IFPMIQPGVS 730
           NEKV+ KE G   NW  FPMIQPG+S
Sbjct: 397 NEKVVLKENGESKNWSFFPMIQPGMS 422


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
           vesca subsp. vesca]
          Length = 459

 Score =  196 bits (498), Expect = 1e-47
 Identities = 118/266 (44%), Positives = 146/266 (54%), Gaps = 23/266 (8%)
 Frame = +2

Query: 2   DAGQRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDRE------------- 142
           + GQR+P + YEFQSYQ  PGSPV  LISP S IS SGTSSP LD E             
Sbjct: 195 EGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRT 254

Query: 143 STRPQFLNLEKIAPHEWGSRQGSRTLTPDTVYPKYHDSLLLNHQNSGIPRLPKPFNGWKN 322
              P+ LNL+ +   +WGSR  S ++TPD       +   L           +  +  +N
Sbjct: 255 GEAPKVLNLDILFTRDWGSRLCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSNSRRRN 314

Query: 323 DLTVVEHRVSFEITAEDVVRCVEKKPSLLMKTGSVSHNKREEKLAEMSNGQEL------- 481
           D   + HRVSFE++AE+VVRCVEKKP  L +  S S    E+   E    QE+       
Sbjct: 315 DGASIGHRVSFELSAEEVVRCVEKKPVALAEAVSTSLQSAEKAEREEGPNQEVSSSHECP 374

Query: 482 --DGHEPSSEIREGSSTDGEDGQRHQKHRSITLGSSKEFNFDNVDGGYPDKAIVGSDWWA 655
             D    SSE   G   + E   R+QK RSITLGS+KEFNFDN DGG    + + +DWWA
Sbjct: 375 VVDTSNDSSEKAVGGDAE-ELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWA 433

Query: 656 NEKVLGKEGGPCNNW-IFPMIQPGVS 730
           NEKV+ KE G   NW  FPMIQPG+S
Sbjct: 434 NEKVVLKENGESKNWSFFPMIQPGMS 459


>gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao]
          Length = 489

 Score =  196 bits (497), Expect = 2e-47
 Identities = 122/288 (42%), Positives = 153/288 (53%), Gaps = 48/288 (16%)
 Frame = +2

Query: 11   QRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDRE-------STRPQFLNL 169
            Q+F  + YEFQSYQ+ PGSP  NLISPGSAIS SGTSSP  DR           P+ L  
Sbjct: 202  QKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGEAPKLLGF 261

Query: 170  EKIAPHEWGSRQGS--------------------------------RTLTPDTVYPKYHD 253
            E     +WGSR GS                                 +LTPD + P   D
Sbjct: 262  ENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRD 321

Query: 254  SLLLNHQNSGIPRLPKPFNGWKNDLTVVEHRVSFEITAEDVVRCVEKKPSLLMKTGS--- 424
              L+  Q S +  L  P NG KND T+V+HRVSFE++ EDV  C+E K  L  +  S   
Sbjct: 322  GFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYP 381

Query: 425  ---VSHNKREEK--LAEMSNGQELDGHEPSSEIREGSSTDGEDGQRHQKHRSITLGSSKE 589
               V+  ++E      ++ +  EL   E S+E  E +S + E+   +QKHRS+TLGS KE
Sbjct: 382  KDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKE 441

Query: 590  FNFDNVDGGYPDKAIVGSDWWANEKVLGKEGGPCNNW-IFPMIQPGVS 730
            FNFDN  G   DK  + S+WWANEKV GKE  P N+W  FPM+QP VS
Sbjct: 442  FNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489


>gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
          Length = 485

 Score =  196 bits (497), Expect = 2e-47
 Identities = 122/288 (42%), Positives = 153/288 (53%), Gaps = 48/288 (16%)
 Frame = +2

Query: 11   QRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDRE-------STRPQFLNL 169
            Q+F  + YEFQSYQ+ PGSP  NLISPGSAIS SGTSSP  DR           P+ L  
Sbjct: 198  QKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGEAPKLLGF 257

Query: 170  EKIAPHEWGSRQGS--------------------------------RTLTPDTVYPKYHD 253
            E     +WGSR GS                                 +LTPD + P   D
Sbjct: 258  ENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRD 317

Query: 254  SLLLNHQNSGIPRLPKPFNGWKNDLTVVEHRVSFEITAEDVVRCVEKKPSLLMKTGS--- 424
              L+  Q S +  L  P NG KND T+V+HRVSFE++ EDV  C+E K  L  +  S   
Sbjct: 318  GFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYP 377

Query: 425  ---VSHNKREEK--LAEMSNGQELDGHEPSSEIREGSSTDGEDGQRHQKHRSITLGSSKE 589
               V+  ++E      ++ +  EL   E S+E  E +S + E+   +QKHRS+TLGS KE
Sbjct: 378  KDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKE 437

Query: 590  FNFDNVDGGYPDKAIVGSDWWANEKVLGKEGGPCNNW-IFPMIQPGVS 730
            FNFDN  G   DK  + S+WWANEKV GKE  P N+W  FPM+QP VS
Sbjct: 438  FNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485


>emb|CBI34651.3| unnamed protein product [Vitis vinifera]
          Length = 412

 Score =  195 bits (496), Expect = 2e-47
 Identities = 119/251 (47%), Positives = 143/251 (56%), Gaps = 8/251 (3%)
 Frame = +2

Query: 2   DAGQRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDRESTRPQFLNLEKIA 181
           +AG RF  +QYEFQSYQL PGSPV +LISP S IS SGTSSP  DR              
Sbjct: 197 EAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSPFPDR-------------- 242

Query: 182 PHEWGSRQGSRTLTPDTVYPKYHDSLLLNHQNSGIPRLPKPFNGWKNDLTVVEHRVSFEI 361
                    S ++TPD + P   D  +L+H  SG P          N+  +V+HRVSFE+
Sbjct: 243 ---------SGSITPDALGPPSRDGSVLDH--SGCP----------NNEIMVDHRVSFEL 281

Query: 362 TAEDVVRCVEKKPSLLMKTGSVS-HNKREEKLAEMSNGQELDGHEPSSEIREG------S 520
           TAEDVVRCVEK  + L+K  S S  N    ++ E S    +D      E           
Sbjct: 282 TAEDVVRCVEKDSAALVKAVSASLQNPATVEIDENSREVVVDSEGRVGETANNPPEKAPE 341

Query: 521 STDGEDGQRHQKHRSITLGSSKEFNFDNVDGGYPDKAIVGSDWWANEKVLGKEGGPCNNW 700
             +GE+GQ H K RSITLGS+KEFNFDN DGG+ DK  + SDWWANEKV+GKE G   NW
Sbjct: 342 DANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGASKNW 401

Query: 701 -IFPMIQPGVS 730
            IF M+QP VS
Sbjct: 402 SIFHMMQPSVS 412


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  189 bits (481), Expect = 1e-45
 Identities = 114/257 (44%), Positives = 144/257 (56%), Gaps = 17/257 (6%)
 Frame = +2

Query: 11  QRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDREST--RPQFLNLEKIAP 184
           Q+   + YEFQ YQL P SPV +LISP   IS SGTSSP  DR      P+ L  E  + 
Sbjct: 198 QKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDRRPIVEAPKLLGFEHFST 254

Query: 185 HEWGSRQGSRTLTPDTVYPKYHDSLLLNHQNSGIPRLPKPFNGWKNDLTVVEHRVSFEIT 364
             WGSR GS +LTPD   P   DS LL +Q S +  L    +G +N  TV++HRVSFE+ 
Sbjct: 255 RRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFELA 314

Query: 365 AEDVVRCVEKKPSLLMKTGSVSHNKREEKLAEMSNGQELDGHEPSSE------------- 505
            EDV  CVEKKP   + +     N  ++ + E    +E DG   S+E             
Sbjct: 315 GEDVAVCVEKKP---VASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKA 371

Query: 506 IREGSSTDGEDGQRHQKHRSITLGSSKEFNFDNVDGGYPDKA-IVGSDWWANEKVLGKEG 682
             E +S +GE+ Q H+KH  I  GS KEFNFDN  G    K  I+GS+WW NEKV+GK  
Sbjct: 372 ASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGKGT 431

Query: 683 GPCNNW-IFPMIQPGVS 730
           GP  NW  FP++QPG+S
Sbjct: 432 GPQTNWTFFPLLQPGIS 448


>emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]
          Length = 385

 Score =  189 bits (481), Expect = 1e-45
 Identities = 114/257 (44%), Positives = 144/257 (56%), Gaps = 17/257 (6%)
 Frame = +2

Query: 11  QRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDREST--RPQFLNLEKIAP 184
           Q+   + YEFQ YQL P SPV +LISP   IS SGTSSP  DR      P+ L  E  + 
Sbjct: 135 QKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDRRPIVEAPKLLGFEHFST 191

Query: 185 HEWGSRQGSRTLTPDTVYPKYHDSLLLNHQNSGIPRLPKPFNGWKNDLTVVEHRVSFEIT 364
             WGSR GS +LTPD   P   DS LL +Q S +  L    +G +N  TV++HRVSFE+ 
Sbjct: 192 RRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFELA 251

Query: 365 AEDVVRCVEKKPSLLMKTGSVSHNKREEKLAEMSNGQELDGHEPSSE------------- 505
            EDV  CVEKKP   + +     N  ++ + E    +E DG   S+E             
Sbjct: 252 GEDVAVCVEKKP---VASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKA 308

Query: 506 IREGSSTDGEDGQRHQKHRSITLGSSKEFNFDNVDGGYPDKA-IVGSDWWANEKVLGKEG 682
             E +S +GE+ Q H+KH  I  GS KEFNFDN  G    K  I+GS+WW NEKV+GK  
Sbjct: 309 ASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGKGT 368

Query: 683 GPCNNW-IFPMIQPGVS 730
           GP  NW  FP++QPG+S
Sbjct: 369 GPQTNWTFFPLLQPGIS 385


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
           gi|223549721|gb|EEF51209.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 459

 Score =  185 bits (470), Expect = 2e-44
 Identities = 118/261 (45%), Positives = 149/261 (57%), Gaps = 19/261 (7%)
 Frame = +2

Query: 2   DAGQRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDRE------------- 142
           +AG RFPF+ YEFQSYQ  PGSPV  LISP S IS SGTSSP  D E             
Sbjct: 202 EAGLRFPFSNYEFQSYQFYPGSPVGQLISPSSGISGSGTSSPFPDGEFAAAGPRFLEFQM 261

Query: 143 STRPQFLNLEKIAPHEWGSRQGSRTLTPDTVYPKYHDSLLLNHQNSGIPRLPKPFNGWKN 322
           +  P+ LNL+K++ HE GSRQGS TLTPD V      S  L+ Q S I       N  K+
Sbjct: 262 AVPPKLLNLDKLSVHECGSRQGSGTLTPDAVRAT-SCSFPLDRQCSDIASNRHSDNENKD 320

Query: 323 DLTVVEHRVSFEITAEDVVRCVEKKPSLLMKTGSVSHN-----KREEKLAEMSNGQELDG 487
           D  V + RVSF+++AED +R  E KP+  +K    S       ++ +K +E+ +  E   
Sbjct: 321 D-QVADLRVSFDLSAEDALRYAEPKPASPVKIMPESMKNEIAAEKVQKSSEIRHNFECRV 379

Query: 488 HEPSSEIREGSSTDGEDGQRHQKHRSITLGSSKEFNFDNVDGGYPDKAIVGSDWWANEKV 667
            E S+ I E +ST GE   RHQKHR++TLG+ KEFNFDN DG    K   G DWW N   
Sbjct: 380 GETSNGILEQASTGGEKTPRHQKHRTLTLGTFKEFNFDNADG--VPKPSAGPDWWDNGSD 437

Query: 668 LGKEGGPCNNW-IFPMIQPGV 727
           +GKE     NW  FP++QP +
Sbjct: 438 VGKEDFTAKNWSFFPVMQPSI 458


>gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica]
          Length = 499

 Score =  184 bits (467), Expect = 5e-44
 Identities = 117/303 (38%), Positives = 149/303 (49%), Gaps = 64/303 (21%)
 Frame = +2

Query: 11   QRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDREST-------RPQFLNL 169
            Q+F  + YEFQ YQ  PGSP  NLISPGSA+S SGTSSP  DR           P+    
Sbjct: 197  QKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFPDRHPVLEFRMGEAPKLFGF 256

Query: 170  EKIAPHEWGSRQGSRTLTPDTVY------------------------------------- 238
            +     +WGSR GS +LTPD V                                      
Sbjct: 257  DHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELGSRLGSGCVTPNGAGIGSRL 316

Query: 239  -----------PKYHDSLLLNHQNSGIPRLPKPFNGWKNDLTVVEHRVSFEITAEDVVRC 385
                       P   DS LL +Q S +  L    +G +   TV +HRVSFE+T EDV  C
Sbjct: 317  GSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVETVFDHRVSFELTGEDVACC 376

Query: 386  VEKKPSLLMKTGSVSH--------NKREEKLAEMSNGQELDGHEPSSEIREGSSTDGEDG 541
            +  K     +T S S         ++R+   ++ SN  E    E SS I E  S +GED 
Sbjct: 377  LANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCEFSVEESSSRIPENVSGEGED- 435

Query: 542  QRHQKHRSITLGSSKEFNFDNVDGGYPDKAIVGSDWWANEKVLGKEGGPCNNW-IFPMIQ 718
            Q ++KHRSITLGS+K+FNFDN     P+K  +GS+WWAN+ V  KE  PCN+W  FP++Q
Sbjct: 436  QGYRKHRSITLGSTKDFNFDNTKAEVPNKPNIGSEWWANKNVAAKESKPCNDWTFFPILQ 495

Query: 719  PGV 727
            PGV
Sbjct: 496  PGV 498


>ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|222858882|gb|EEE96429.1| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 507

 Score =  176 bits (447), Expect = 1e-41
 Identities = 116/305 (38%), Positives = 146/305 (47%), Gaps = 65/305 (21%)
 Frame = +2

Query: 11   QRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPLLDRE-------STRPQFLNL 169
            Q+F  + YEFQSY L PGSP   +ISPGSAIS SGTSSP  DR           P+ L  
Sbjct: 204  QKFSLSHYEFQSYHLYPGSPGGQIISPGSAISNSGTSSPFPDRHPMLEFRMGEAPKLLGF 263

Query: 170  EKIAPHEWGSRQGSR--------------------------------------------- 214
            E  +  +WGSR GS                                              
Sbjct: 264  EHFSTRKWGSRLGSGSLTPDATPDGMGLSRLGSGTVTPDGMGLSRLCSGTATPDGAGLRS 323

Query: 215  -----TLTPDTVYPKYHDSLLLNHQNSGIPRLPKPFNGWKNDLTVVEHRVSFEITAEDVV 379
                 TLTPD   P      LL +Q S +  L    NG K +  VV HRVSFE++ E+V 
Sbjct: 324  RLGSGTLTPDCFVPASQIGFLLENQISEVASLTNSENGSKTEENVVHHRVSFELSGEEVA 383

Query: 380  RCVEKKPSLLMKT-------GSVSHNKREEKLAEMSNGQELDGHEPSSEIREGSSTDGED 538
            RC+E K     +T              R ++LA M+  + L   E SSE+ E +S + E+
Sbjct: 384  RCLEIKSVASTRTFPEYPQDTMPEDPVRGDRLA-MNGERCLQNGEASSEMPEKNSEETEE 442

Query: 539  GQRHQKHRSITLGSSKEFNFDNVDGGYPDKAIVGSDWWANEKVLGKEGGPCNNW-IFPMI 715
               ++KHRSITLGS KEFNFDN  G   DK  + S+WWANE + GKE  P N+W  FP++
Sbjct: 443  DHVYRKHRSITLGSIKEFNFDNSKGEVSDKPAISSEWWANETIAGKEARPANSWTFFPLL 502

Query: 716  QPGVS 730
            QP VS
Sbjct: 503  QPEVS 507

