BLASTX nr result

ID: Papaver31_contig00038612 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver31_contig00038612
         (1122 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_008238921.1| PREDICTED: uncharacterized protein LOC103337...   147   1e-32
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   145   8e-32
ref|XP_011030157.1| PREDICTED: uncharacterized protein LOC105129...   144   1e-31
ref|XP_012090213.1| PREDICTED: uncharacterized protein LOC105648...   144   1e-31
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   143   3e-31
ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr...   142   4e-31
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   140   2e-30
emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]   140   2e-30
ref|XP_010102658.1| hypothetical protein L484_004326 [Morus nota...   140   2e-30
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   139   3e-30
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   139   3e-30
ref|XP_010255317.1| PREDICTED: uncharacterized protein LOC104596...   138   7e-30
ref|XP_007038760.1| Hydroxyproline-rich glycoprotein family prot...   134   1e-28
emb|CDP12040.1| unnamed protein product [Coffea canephora]            133   2e-28
gb|KDO51973.1| hypothetical protein CISIN_1g010808mg [Citrus sin...   133   2e-28
ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-...   133   2e-28
ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citr...   133   2e-28
ref|XP_011029309.1| PREDICTED: uncharacterized protein LOC105129...   132   4e-28
ref|XP_011029307.1| PREDICTED: uncharacterized protein LOC105129...   132   4e-28
gb|KDO51974.1| hypothetical protein CISIN_1g010808mg [Citrus sin...   130   2e-27

>ref|XP_008238921.1| PREDICTED: uncharacterized protein LOC103337539 [Prunus mume]
          Length = 455

 Score =  147 bits (372), Expect = 1e-32
 Identities = 93/210 (44%), Positives = 119/210 (56%), Gaps = 9/210 (4%)
 Frame = -1

Query: 990 TGDVSNLWNLNELSASRWVTRTRQSSGSRTPDPAVAFYDKEFILANQISEVASLANSDNG 811
           TGD   L NL+ LS   W +R    SGS TPD A +     F+L  Q  EV     S+N 
Sbjct: 254 TGDPPKLLNLDILSTRDWGSRL--GSGSVTPDGAKSTSSDGFLLKPQTPEVVLNPRSNNR 311

Query: 810 SEINDSTINHRVSFELNAMEVATCLEKELGLPLRTISEASPAAAVNASTPERDGHPPNQT 631
              ND +INHRVSFEL++ EV  C+EK+   P+  ++EA   +  +A   + +  P    
Sbjct: 312 GRNNDISINHRVSFELSSEEVIRCVEKK---PV-ALAEAVSTSLEDAEKAQSEEDPSKVV 367

Query: 630 E------GPTSNDLAEKDSVDGEEEQLHIQQPSVLTSVGSVKEFKFENSDEGTSDKP--- 478
                  G TSND AEK   DGEE QLH +Q S+  ++GSVKEF F+N D G S      
Sbjct: 368 SSSICPVGETSNDAAEKAVADGEEAQLHPKQRSI--TLGSVKEFNFDNPDGGDSGNSIGS 425

Query: 477 DWWAKENVVTKESGPHDNWTFFPVLQPGVS 388
           DWWA E V  KE+GP  NW+FFP++QPGVS
Sbjct: 426 DWWANEKVDAKENGPTKNWSFFPMMQPGVS 455


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
           gi|462404864|gb|EMJ10328.1| hypothetical protein
           PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  145 bits (365), Expect = 8e-32
 Identities = 91/206 (44%), Positives = 114/206 (55%), Gaps = 5/206 (2%)
 Frame = -1

Query: 990 TGDVSNLWNLNELSASRWVTRTRQSSGSRTPDPAVAFYDKEFILANQISEVASLANSDNG 811
           TGD   L NL+ LS   W +R    SGS TPD A +     F+L  Q  EV     S+N 
Sbjct: 254 TGDPPKLLNLDILSTRDWGSRL--GSGSVTPDGAKSTSSDGFLLKPQTPEVVLNPRSNNR 311

Query: 810 SEINDSTINHRVSFELNAMEVATCLEKELGLPLRTISEA--SPAAAVNASTPERDGHPPN 637
              ND +INHRVSFEL++ EV  C+EK+       +S +      A +   P +      
Sbjct: 312 GRNNDISINHRVSFELSSEEVIRCVEKKPVALAEAVSTSLEDTEKAQSKEDPSKVVSSSI 371

Query: 636 QTEGPTSNDLAEKDSVDGEEEQLHIQQPSVLTSVGSVKEFKFENSDEGTSDKP---DWWA 466
              G TSND AEK   DGEE QLH +Q S+  ++GSVKEF F+N D G S      DWWA
Sbjct: 372 CPVGETSNDAAEKAVADGEEAQLHPKQRSI--TLGSVKEFNFDNPDGGDSGNSIGSDWWA 429

Query: 465 KENVVTKESGPHDNWTFFPVLQPGVS 388
            E V  KE+GP  NW+FFP++QPGVS
Sbjct: 430 NEKVDAKENGPTKNWSFFPMMQPGVS 455


>ref|XP_011030157.1| PREDICTED: uncharacterized protein LOC105129679 [Populus euphratica]
          Length = 519

 Score =  144 bits (364), Expect = 1e-31
 Identities = 96/251 (38%), Positives = 136/251 (54%), Gaps = 19/251 (7%)
 Frame = -1

Query: 1083 TRQSSGSLTPDPAGPTSNY------KEFLSENQISEGTGDVSNLWNLNELSASRWVT--- 931
            +R  SGSLTPD  G  S         + +  +++  GT     +  L+ L +        
Sbjct: 273  SRLGSGSLTPDGVGLGSRLGSGTATPDGMGLSRLGSGTVTPDGM-GLSRLCSGTATPDGA 331

Query: 930  --RTRQSSGSRTPDPAVAFYDKEFILANQISEVASLANSDNGSEINDSTINHRVSFELNA 757
              R+R  SG+ TPD  V      F+L NQISEVASL NS+NGS+  ++ ++HRVSFEL+ 
Sbjct: 332  GLRSRLGSGTLTPDCFVPASQIGFLLENQISEVASLTNSENGSKTEENVVHHRVSFELSG 391

Query: 756  MEVATCLEKELGLPLRTISE----ASPAAAVNASTPERDGHPPNQTEGPTSNDLAEKDSV 589
             EVA CLE +     RT  E      P   V       +G    Q  G  S+++ EK+S 
Sbjct: 392  EEVARCLESKSAASTRTFPEYPQDTMPDDPVRGDRLAMNGERCIQ-NGEASSEMPEKNSE 450

Query: 588  DGEEEQLHIQQPSVLTSVGSVKEFKFENSDEGTSDKP----DWWAKENVVTKESGPHDNW 421
            + EE+  + +  S+  ++GS+KEF F+NS    SDKP    +WWA E +  KE+GP ++W
Sbjct: 451  ETEEDHGYRKHRSI--TLGSIKEFNFDNSKGEVSDKPAISSEWWANETIAGKEAGPANSW 508

Query: 420  TFFPVLQPGVS 388
            TFFP+LQP VS
Sbjct: 509  TFFPLLQPEVS 519


>ref|XP_012090213.1| PREDICTED: uncharacterized protein LOC105648441 [Jatropha curcas]
            gi|643706116|gb|KDP22248.1| hypothetical protein
            JCGZ_26079 [Jatropha curcas]
          Length = 498

 Score =  144 bits (363), Expect = 1e-31
 Identities = 98/240 (40%), Positives = 133/240 (55%), Gaps = 8/240 (3%)
 Frame = -1

Query: 1083 TRQSSGSLTPDPAGPTSNYKEFLSENQISEGTGDVSNLWNLNELSASRWVTRTRQSSGSR 904
            +R  SG+LTPD  G  S      S     +G G  S L +   ++      R+R  SGS 
Sbjct: 267  SRLGSGTLTPDGVGLGSR---LCSGTATPDGVGLGSRLGS-GSVTPDGVGLRSRLGSGSL 322

Query: 903  TPDPAVAFYDKEFILANQISEVASLANSDNGSEINDSTINHRVSFELNAMEVATCLEKEL 724
            TPD  V       +L NQISEVASLANS+N S+ +++ ++HRVSFEL+  EVA CLE + 
Sbjct: 323  TPDCVVPASQDGLLLENQISEVASLANSENASKNDENIVDHRVSFELSGEEVARCLESKS 382

Query: 723  GLPLRTISEASPAAAVNASTPERDGHPPNQTE----GPTSNDLAEKDSVDGEEEQLHIQQ 556
                RT SE  P  ++       +    N  +    G TSN+  EK S + EEE  + + 
Sbjct: 383  MTSSRTFSEC-PQDSMAEEQINSEEILINSNDCLHIGETSNETPEKPSGETEEEPCYRKH 441

Query: 555  PSVLTSVGSVKEFKFENSDEGTSDKP----DWWAKENVVTKESGPHDNWTFFPVLQPGVS 388
             S+  ++GS+KEF F+NS E   DKP    +WWA E +  KE+ P +NWTFFP+LQP VS
Sbjct: 442  RSI--TLGSIKEFNFDNSKE-VPDKPTISSEWWANETIAGKEARPANNWTFFPLLQPEVS 498


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
            gi|223547583|gb|EEF49078.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 510

 Score =  143 bits (360), Expect = 3e-31
 Identities = 97/240 (40%), Positives = 130/240 (54%), Gaps = 8/240 (3%)
 Frame = -1

Query: 1083 TRQSSGSLTPDPAGPTSNYKEFLSENQISEGTGDVSNLWNLNELSASRWVTRTRQSSGSR 904
            +R  SG++TPD  G  S      S     +G G  S L +   ++      R+   SGS 
Sbjct: 280  SRLGSGTVTPDGVGLGSRLG---SGTVTPDGVGQGSRLGS-GTVTPDGVGLRSMLGSGSL 335

Query: 903  TPDPAVAFYDKEFILANQISEVASLANSDNGSEINDSTINHRVSFELNAMEVATCLEKEL 724
            TPD         F L NQISEVASLANS+NGS+ +++ ++HRVSFEL+  EVA CLE + 
Sbjct: 336  TPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLESKS 395

Query: 723  GLPLRTISEASPAAAVNASTPERDGH----PPNQTEGPTSNDLAEKDSVDGEEEQLHIQQ 556
                R  SE  P +   A    + G       N   G TS +  EK S + EEE  + + 
Sbjct: 396  LASCRAFSECPPDSM--AEDQIKSGKMLMTDENLPTGETSGETPEKPSGEMEEEHCYRKH 453

Query: 555  PSVLTSVGSVKEFKFENSDEGTSDKP----DWWAKENVVTKESGPHDNWTFFPVLQPGVS 388
             S+  ++GS+KEF F+NS E   DKP    +WWA E +  KE+ P +NWTFFP+LQP VS
Sbjct: 454  RSI--TLGSIKEFNFDNSKE-VPDKPSINSEWWANETIAGKEARPANNWTFFPLLQPEVS 510


>ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|222858882|gb|EEE96429.1| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 507

 Score =  142 bits (359), Expect = 4e-31
 Identities = 94/240 (39%), Positives = 131/240 (54%), Gaps = 8/240 (3%)
 Frame = -1

Query: 1083 TRQSSGSLTPDPAGPTSNYKEFLSENQISEGTGDVSNLWNLNELSASRWVTRTRQSSGSR 904
            +R  SGSLTPD            S     +G G +S L +         + R+R  SG+ 
Sbjct: 273  SRLGSGSLTPDATPDGMGLSRLGSGTVTPDGMG-LSRLCSGTATPDGAGL-RSRLGSGTL 330

Query: 903  TPDPAVAFYDKEFILANQISEVASLANSDNGSEINDSTINHRVSFELNAMEVATCLEKEL 724
            TPD  V      F+L NQISEVASL NS+NGS+  ++ ++HRVSFEL+  EVA CLE + 
Sbjct: 331  TPDCFVPASQIGFLLENQISEVASLTNSENGSKTEENVVHHRVSFELSGEEVARCLEIKS 390

Query: 723  GLPLRTISE----ASPAAAVNASTPERDGHPPNQTEGPTSNDLAEKDSVDGEEEQLHIQQ 556
                RT  E      P   V       +G    Q  G  S+++ EK+S + EE+ ++ + 
Sbjct: 391  VASTRTFPEYPQDTMPEDPVRGDRLAMNGERCLQ-NGEASSEMPEKNSEETEEDHVYRKH 449

Query: 555  PSVLTSVGSVKEFKFENSDEGTSDKP----DWWAKENVVTKESGPHDNWTFFPVLQPGVS 388
             S+  ++GS+KEF F+NS    SDKP    +WWA E +  KE+ P ++WTFFP+LQP VS
Sbjct: 450  RSI--TLGSIKEFNFDNSKGEVSDKPAISSEWWANETIAGKEARPANSWTFFPLLQPEVS 507


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
           gi|731424007|ref|XP_010662699.1| PREDICTED:
           uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  140 bits (353), Expect = 2e-30
 Identities = 86/211 (40%), Positives = 111/211 (52%), Gaps = 12/211 (5%)
 Frame = -1

Query: 984 DVSNLWNLNELSASRWVTRTRQSSGSRTPDPAVAFYDKEFILANQISEVASLANSDNGSE 805
           +   L      S  RW +R    SGS TPD A       F+L NQISEVASLANS++GS+
Sbjct: 242 EAPKLLGFEHFSTRRWGSRL--GSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQ 299

Query: 804 INDSTINHRVSFELNAMEVATCLEKELGLPLRTISEASPAAAVNASTP-ERDGHPPNQTE 628
             ++ I+HRVSFEL   +VA C+EK+      T+               ERDG   +   
Sbjct: 300 NGETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTEN 359

Query: 627 ------GPTSNDLAEKDSVDGEEEQLHIQQPSVLTSVGSVKEFKFENSDEGTSDKP---- 478
                 G      +EK S +GEEEQ H + P +    GS+KEF F+N+    S KP    
Sbjct: 360 CCEFCVGEALKAASEKASAEGEEEQCHKKHPPI--RHGSIKEFNFDNTKGEVSAKPNIIG 417

Query: 477 -DWWAKENVVTKESGPHDNWTFFPVLQPGVS 388
            +WW  E VV K +GP  NWTFFP+LQPG+S
Sbjct: 418 SEWWVNEKVVGKGTGPQTNWTFFPLLQPGIS 448


>emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]
          Length = 385

 Score =  140 bits (353), Expect = 2e-30
 Identities = 86/211 (40%), Positives = 111/211 (52%), Gaps = 12/211 (5%)
 Frame = -1

Query: 984 DVSNLWNLNELSASRWVTRTRQSSGSRTPDPAVAFYDKEFILANQISEVASLANSDNGSE 805
           +   L      S  RW +R    SGS TPD A       F+L NQISEVASLANS++GS+
Sbjct: 179 EAPKLLGFEHFSTRRWGSRL--GSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQ 236

Query: 804 INDSTINHRVSFELNAMEVATCLEKELGLPLRTISEASPAAAVNASTP-ERDGHPPNQTE 628
             ++ I+HRVSFEL   +VA C+EK+      T+               ERDG   +   
Sbjct: 237 NGETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTEN 296

Query: 627 ------GPTSNDLAEKDSVDGEEEQLHIQQPSVLTSVGSVKEFKFENSDEGTSDKP---- 478
                 G      +EK S +GEEEQ H + P +    GS+KEF F+N+    S KP    
Sbjct: 297 CCEFCVGEALKAASEKASAEGEEEQCHKKHPPI--RHGSIKEFNFDNTKGEVSAKPNIIG 354

Query: 477 -DWWAKENVVTKESGPHDNWTFFPVLQPGVS 388
            +WW  E VV K +GP  NWTFFP+LQPG+S
Sbjct: 355 SEWWVNEKVVGKGTGPQTNWTFFPLLQPGIS 385


>ref|XP_010102658.1| hypothetical protein L484_004326 [Morus notabilis]
            gi|587905707|gb|EXB93840.1| hypothetical protein
            L484_004326 [Morus notabilis]
          Length = 521

 Score =  140 bits (352), Expect = 2e-30
 Identities = 98/266 (36%), Positives = 136/266 (51%), Gaps = 28/266 (10%)
 Frame = -1

Query: 1101 SNWVTRTRQSSGSLTPDPAGPTSNYKEFLSENQISEGTGDVSNL---------WNLNELS 949
            + W   +R  SGSLTPD  G  S      S +   +G G  S L         + L    
Sbjct: 262  TTWKWGSRLGSGSLTPDGVGLGSRLG---SGSVTPDGVGLGSRLGSGSLTPDGYGLGSRL 318

Query: 948  ASRWVTR------TRQSSGSRTPDPAVAFYDKEFILANQISEVASLANSDNGSEINDSTI 787
             S  +T       +R  SG+ TPD  +      F+L NQISEVASLANSDNG + + S +
Sbjct: 319  GSGCMTPNGPGLGSRLGSGTLTPDGFLVVSGDSFLLENQISEVASLANSDNGCQNDGSVV 378

Query: 786  NHRVSFELNAMEVATCL-EKELGLPLRTISEASPAAAVNASTPERDGHPPNQTEGP---- 622
            +HRVSFEL   +VA CL  K      RT SE+   +     T ++DG   N  + P    
Sbjct: 379  DHRVSFELTGEDVARCLASKSASSNGRTTSESLEDSPAECPT-KKDGISANNVDSPNDQS 437

Query: 621  ----TSNDLAEKDSVDGEEEQLHIQQPSVLTSVGSVKEFKFENSDEGTSDKP----DWWA 466
                TSN   + D  +GE++  + +  S+  ++GS+KEF F+N+    S KP    +WWA
Sbjct: 438  CVEETSNKTPQSDCREGEDDHFYQKHRSI--TLGSIKEFNFDNTKADVSVKPTIGSEWWA 495

Query: 465  KENVVTKESGPHDNWTFFPVLQPGVS 388
             E V  KE+   ++W+FFP+LQPGVS
Sbjct: 496  NEKVAGKEAKAGNSWSFFPILQPGVS 521


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  139 bits (351), Expect = 3e-30
 Identities = 99/254 (38%), Positives = 132/254 (51%), Gaps = 13/254 (5%)
 Frame = -1

Query: 1110 LSASNWVTR---TRQSSGSLTPDPAGPTSNYKEFLSENQISEGTGDVSNLWNLNELSASR 940
            L   N+ TR   +R  SGSLTPD  G  S             G+G V+            
Sbjct: 259  LGFENFTTRKWGSRLGSGSLTPDGLGQGSRL-----------GSGSVT---------PDG 298

Query: 939  WVTRTRQSSGSRTPDPAVAFYDKEFILANQISEVASLANSDNGSEINDSTINHRVSFELN 760
                +R  SGS TPD         F++ +QISEVA LAN  NG + +++ ++HRVSFEL+
Sbjct: 299  MGLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELS 358

Query: 759  AMEVATCLEKELGLPLRTISEASPAAAVNASTPERDGHPPNQTEG------PTSNDLAEK 598
              +VA CLE +  LP R +SE  P   V     ERDG   +           TSN+  EK
Sbjct: 359  GEDVAPCLESKSLLPSRAVSE-YPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEK 417

Query: 597  DSVDGEEEQLHIQQPSVLTSVGSVKEFKFENSDEGTSDKP----DWWAKENVVTKESGPH 430
             S + EEE  + +  SV  ++GS+KEF F+N+    SDKP    +WWA E V  KE+ P 
Sbjct: 418  ASGEAEEEHSYQKHRSV--TLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPG 475

Query: 429  DNWTFFPVLQPGVS 388
            ++WTFFP+LQP VS
Sbjct: 476  NSWTFFPMLQPEVS 489


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  139 bits (351), Expect = 3e-30
 Identities = 99/254 (38%), Positives = 132/254 (51%), Gaps = 13/254 (5%)
 Frame = -1

Query: 1110 LSASNWVTR---TRQSSGSLTPDPAGPTSNYKEFLSENQISEGTGDVSNLWNLNELSASR 940
            L   N+ TR   +R  SGSLTPD  G  S             G+G V+            
Sbjct: 255  LGFENFTTRKWGSRLGSGSLTPDGLGQGSRL-----------GSGSVT---------PDG 294

Query: 939  WVTRTRQSSGSRTPDPAVAFYDKEFILANQISEVASLANSDNGSEINDSTINHRVSFELN 760
                +R  SGS TPD         F++ +QISEVA LAN  NG + +++ ++HRVSFEL+
Sbjct: 295  MGLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELS 354

Query: 759  AMEVATCLEKELGLPLRTISEASPAAAVNASTPERDGHPPNQTEG------PTSNDLAEK 598
              +VA CLE +  LP R +SE  P   V     ERDG   +           TSN+  EK
Sbjct: 355  GEDVAPCLESKSLLPSRAVSE-YPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEK 413

Query: 597  DSVDGEEEQLHIQQPSVLTSVGSVKEFKFENSDEGTSDKP----DWWAKENVVTKESGPH 430
             S + EEE  + +  SV  ++GS+KEF F+N+    SDKP    +WWA E V  KE+ P 
Sbjct: 414  ASGEAEEEHSYQKHRSV--TLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPG 471

Query: 429  DNWTFFPVLQPGVS 388
            ++WTFFP+LQP VS
Sbjct: 472  NSWTFFPMLQPEVS 485


>ref|XP_010255317.1| PREDICTED: uncharacterized protein LOC104596031 [Nelumbo nucifera]
          Length = 452

 Score =  138 bits (348), Expect = 7e-30
 Identities = 82/188 (43%), Positives = 110/188 (58%), Gaps = 10/188 (5%)
 Frame = -1

Query: 921 QSSGSRTPDPAVAFYDKEFILANQISEVASLANSDNGSEINDSTINHRVSFELNAMEVAT 742
           Q SGS TPD A       F   NQISEVASLANSD+GS+  D  I+HRVSFEL   EV +
Sbjct: 275 QGSGSLTPDAAGPTSRDGFPQDNQISEVASLANSDSGSQNGDIVIDHRVSFELTTGEVPS 334

Query: 741 CLEKELGLPLRTISEASPAAAVNASTPERDGHPPNQTE------GPTSNDLAEKDSVDGE 580
           C+EK +      + E+   +  + +T E DG      +      G TSN+  +K   DGE
Sbjct: 335 CVEKAI------LDESISQSLPSGTTAEVDGISRKTEDVAETRIGETSNNTPQKSMEDGE 388

Query: 579 EEQLHIQQPSVLTSVGSVKEFKFENSDEGTSDKP----DWWAKENVVTKESGPHDNWTFF 412
           E++ H  Q +   ++GSVKEF F+N+D G +DKP    +WW  +    KE+ P + WTFF
Sbjct: 389 EQRQHHHQKNPSLTLGSVKEFNFDNADGGAADKPTISSEWWINQ----KEARPCNQWTFF 444

Query: 411 PVLQPGVS 388
           P++QPGVS
Sbjct: 445 PMMQPGVS 452


>ref|XP_007038760.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
            gi|508776005|gb|EOY23261.1| Hydroxyproline-rich
            glycoprotein family protein [Theobroma cacao]
          Length = 540

 Score =  134 bits (337), Expect = 1e-28
 Identities = 95/250 (38%), Positives = 127/250 (50%), Gaps = 10/250 (4%)
 Frame = -1

Query: 1110 LSASNWVTRTRQSSGSLTPDPAGPTSNYKEFLSENQISEGTGDVSNLWNLNELSASRWVT 931
            L+   W +R    SGSLTPD  G  S             G+G V+               
Sbjct: 315  LTTRKWCSRL--GSGSLTPDGLGRGSRL-----------GSGSVT---------PDGMGL 352

Query: 930  RTRQSSGSRTPDPAVAFYDKEFILANQISEVASLANSDNGSEINDSTINHRVSFELNAME 751
             +R  SGS TPD         F+L +QISEVA L N  NG + +++ ++HRVSFEL+  +
Sbjct: 353  GSRLGSGSLTPDGLGPPSRDGFLLGSQISEVALLTNQANGPKNDETIVDHRVSFELSGED 412

Query: 750  VATCLEKELGLPLRTISEASPAAAVNASTPERDGHPPNQTEG------PTSNDLAEKDSV 589
            VA CLE +  LP RT+SE  P   V     ERDG   +           TSN+  EK S 
Sbjct: 413  VARCLESKSLLPSRTVSE-YPKDLVAEGRIERDGIKKDLESSCELFIRETSNETVEKASG 471

Query: 588  DGEEEQLHIQQPSVLTSVGSVKEFKFENSDEGTSDKP----DWWAKENVVTKESGPHDNW 421
              EEE  + +  SV  ++GS+KEF F+N+    SDKP    +WWA E    KE+ P ++W
Sbjct: 472  KAEEEHSYQKHRSV--TLGSIKEFNFDNTKGEASDKPTIRSEWWANEKFARKEARPGNSW 529

Query: 420  TFFPVLQPGV 391
            TFFP+ +PGV
Sbjct: 530  TFFPMFRPGV 539


>emb|CDP12040.1| unnamed protein product [Coffea canephora]
          Length = 466

 Score =  133 bits (335), Expect = 2e-28
 Identities = 83/240 (34%), Positives = 128/240 (53%), Gaps = 12/240 (5%)
 Frame = -1

Query: 1071 SGSLTPDPAGP----TSNYKEFLSENQISEGTGDVSNLWNLNELSASRWVTRTRQSSGSR 904
            SG+ +P P G       ++ EF S        GD   L NL +++   W   +RQ SG+ 
Sbjct: 239  SGTSSPFPDGEFVYGRPHFLEFRS--------GDPPKLLNLEKIAPHEW--GSRQGSGTI 288

Query: 903  TPDPAVAFYDKEFILANQISEVASLANSDNGSEINDSTINHRVSFELNAMEVATCLEKEL 724
            TPD     Y   F+L NQ S+ ++++NS N + ++++ ++HRVSFE+ A EV  C+EK  
Sbjct: 289  TPDTVAPRYRNGFLLDNQKSDASTVSNSYNVTRVDETAVDHRVSFEITAEEVVRCVEKTP 348

Query: 723  GLPLRTISEASPA----AAVNASTPERDGHPPNQTEGPTSNDLAEKDSVDGEEEQLHIQQ 556
             +  + +   +P+           P+   +      G  S   + + SVDG+  Q H +Q
Sbjct: 349  AVFPKAVLATTPSNTECVVKTEDNPKEMANGHEGCAGEASRIGSGRASVDGDGGQWHQKQ 408

Query: 555  PSVLTSVGSVKEFKFENSDEGTSDKP----DWWAKENVVTKESGPHDNWTFFPVLQPGVS 388
             ++  ++GS KEF F++ DEG SD P    DWWA E V+ K+     +WTFFPV+QPGVS
Sbjct: 409  RTI--TLGSAKEFNFDSVDEGNSDTPNIGSDWWANEKVMGKDGVAGKSWTFFPVMQPGVS 466


>gb|KDO51973.1| hypothetical protein CISIN_1g010808mg [Citrus sinensis]
          Length = 500

 Score =  133 bits (335), Expect = 2e-28
 Identities = 96/242 (39%), Positives = 129/242 (53%), Gaps = 10/242 (4%)
 Frame = -1

Query: 1083 TRQSSGSLTPDPAGPTSNYKEFLSENQISEGTGDVSNLWNLNELSASRWVTRTRQSSGSR 904
            +R  SGS+TPD  G  S      S +   +G G  S L +   ++       +R  SGS 
Sbjct: 267  SRLGSGSVTPDGVGIGSRMG---SGSLTPDGVGLGSRLGS-GTVTPDGAGLGSRLGSGSL 322

Query: 903  TPDPAVAFYDKEFILANQISEVASLANSDNGSEINDSTINHRVSFELNAMEVATCLEKEL 724
            TPD         F+  NQISEVASLANSDNG++ ++  I+HRVSFEL+  EVA CL  + 
Sbjct: 323  TPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHIIDHRVSFELSGEEVARCLANKS 382

Query: 723  GLPLRTISEASPAAAVNASTPERDGHPPNQTE------GPTSNDLAEKDSVDGEEEQLHI 562
                R + E  P   V      RDG   +           +SN + EK   DGEEE  + 
Sbjct: 383  AASPRIVPE-FPQDIVPEGEIRRDGKLTDSENHFELCPEESSNRMPEKTMRDGEEEYCYR 441

Query: 561  QQPSVLTSVGSVKEFKFENSDEGTSDKP----DWWAKENVVTKESGPHDNWTFFPVLQPG 394
            +  S+  ++GS+KEF F+N++   S+KP    +WWA EN V KES P +NWTFFP+LQ  
Sbjct: 442  KHRSI--TLGSIKEFNFDNTEGEVSNKPSINSEWWANEN-VGKESKPSNNWTFFPMLQSE 498

Query: 393  VS 388
             S
Sbjct: 499  AS 500


>ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-like [Citrus sinensis]
          Length = 500

 Score =  133 bits (335), Expect = 2e-28
 Identities = 96/242 (39%), Positives = 129/242 (53%), Gaps = 10/242 (4%)
 Frame = -1

Query: 1083 TRQSSGSLTPDPAGPTSNYKEFLSENQISEGTGDVSNLWNLNELSASRWVTRTRQSSGSR 904
            +R  SGS+TPD  G  S      S +   +G G  S L +   ++       +R  SGS 
Sbjct: 267  SRLGSGSVTPDGVGIGSRMG---SGSLTPDGVGLGSRLGS-GTVTPDGAGLGSRLGSGSL 322

Query: 903  TPDPAVAFYDKEFILANQISEVASLANSDNGSEINDSTINHRVSFELNAMEVATCLEKEL 724
            TPD         F+  NQISEVASLANSDNG++ ++  I+HRVSFEL+  EVA CL  + 
Sbjct: 323  TPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHIIDHRVSFELSGEEVARCLANKS 382

Query: 723  GLPLRTISEASPAAAVNASTPERDGHPPNQTE------GPTSNDLAEKDSVDGEEEQLHI 562
                R + E  P   V      RDG   +           +SN + EK   DGEEE  + 
Sbjct: 383  AASPRIVPE-FPQDIVPEGEIRRDGKLTDSENHFELCPEESSNRMPEKTMRDGEEEYCYR 441

Query: 561  QQPSVLTSVGSVKEFKFENSDEGTSDKP----DWWAKENVVTKESGPHDNWTFFPVLQPG 394
            +  S+  ++GS+KEF F+N++   S+KP    +WWA EN V KES P +NWTFFP+LQ  
Sbjct: 442  KHRSI--TLGSIKEFNFDNTEGEVSNKPSINSEWWANEN-VGKESKPSNNWTFFPMLQSE 498

Query: 393  VS 388
             S
Sbjct: 499  AS 500


>ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citrus clementina]
            gi|557523850|gb|ESR35217.1| hypothetical protein
            CICLE_v10004813mg [Citrus clementina]
          Length = 500

 Score =  133 bits (335), Expect = 2e-28
 Identities = 96/242 (39%), Positives = 129/242 (53%), Gaps = 10/242 (4%)
 Frame = -1

Query: 1083 TRQSSGSLTPDPAGPTSNYKEFLSENQISEGTGDVSNLWNLNELSASRWVTRTRQSSGSR 904
            +R  SGS+TPD  G  S      S +   +G G  S L +   ++       +R  SGS 
Sbjct: 267  SRLGSGSVTPDGVGIGSRMG---SGSLTPDGVGLGSRLGS-GTVTPDGAGLGSRLGSGSL 322

Query: 903  TPDPAVAFYDKEFILANQISEVASLANSDNGSEINDSTINHRVSFELNAMEVATCLEKEL 724
            TPD         F+  NQISEVASLANSDNG++ ++  I+HRVSFEL+  EVA CL  + 
Sbjct: 323  TPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHIIDHRVSFELSGEEVARCLANKS 382

Query: 723  GLPLRTISEASPAAAVNASTPERDGHPPNQTE------GPTSNDLAEKDSVDGEEEQLHI 562
                R + E  P   V      RDG   +           +SN + EK   DGEEE  + 
Sbjct: 383  AASPRIVPE-FPQDIVPEGEIRRDGKLTDSENHFELCPEESSNRMPEKTMRDGEEEYCYR 441

Query: 561  QQPSVLTSVGSVKEFKFENSDEGTSDKP----DWWAKENVVTKESGPHDNWTFFPVLQPG 394
            +  S+  ++GS+KEF F+N++   S+KP    +WWA EN V KES P +NWTFFP+LQ  
Sbjct: 442  KHRSI--TLGSIKEFNFDNTEGEVSNKPSINSEWWANEN-VGKESKPSNNWTFFPMLQSE 498

Query: 393  VS 388
             S
Sbjct: 499  AS 500


>ref|XP_011029309.1| PREDICTED: uncharacterized protein LOC105129075 isoform X2 [Populus
            euphratica]
          Length = 452

 Score =  132 bits (333), Expect = 4e-28
 Identities = 89/234 (38%), Positives = 126/234 (53%), Gaps = 6/234 (2%)
 Frame = -1

Query: 1071 SGSLTPDPAGPTSNYKEFLSENQISEGTGDVSNLWNLNELSASRWVTRTRQSSGSRTPDP 892
            SG+ +P P G  S      +E ++    G+   L +L++LS   W   + Q SG+ TP+ 
Sbjct: 230  SGTSSPFPDGEFSVGGAHFTEFRM----GEPPKLLSLDKLSTCEW--GSYQGSGALTPE- 282

Query: 891  AVAFYDKEFILANQISEVASLANSDNGSEINDSTINHRVSFELNAMEVATCLEKELGLPL 712
            +V      F+L  Q S+V S   SDNG + ND  +NHRVSFEL A + + C+E++    +
Sbjct: 283  SVRRGSPNFLLHRQFSDVPSRPRSDNGHK-NDQVVNHRVSFELTAEDASRCVEEKPAFSI 341

Query: 711  RTISEASPAAAVNASTPERDGHPPNQTE---GPTSNDLAEKDSVDGEEEQLHIQQPSVLT 541
            +T+ E        A   +  G      E   G TSND  E  S DGE    H +Q S+  
Sbjct: 342  KTVPEYVENGT-QAKEEKNSGESIQSLECRVGVTSNDSPEMASTDGEVAPQHRKQQSI-- 398

Query: 540  SVGSVKEFKFENSDEGTSDKP---DWWAKENVVTKESGPHDNWTFFPVLQPGVS 388
            ++GSVKEF F+N+DEG S KP   +WWA   V+ KE     NW+FFP++Q GVS
Sbjct: 399  TLGSVKEFNFDNADEGDSRKPSTSNWWANGGVIGKEGETTKNWSFFPMVQSGVS 452


>ref|XP_011029307.1| PREDICTED: uncharacterized protein LOC105129075 isoform X1 [Populus
            euphratica]
          Length = 453

 Score =  132 bits (333), Expect = 4e-28
 Identities = 89/234 (38%), Positives = 126/234 (53%), Gaps = 6/234 (2%)
 Frame = -1

Query: 1071 SGSLTPDPAGPTSNYKEFLSENQISEGTGDVSNLWNLNELSASRWVTRTRQSSGSRTPDP 892
            SG+ +P P G  S      +E ++    G+   L +L++LS   W   + Q SG+ TP+ 
Sbjct: 231  SGTSSPFPDGEFSVGGAHFTEFRM----GEPPKLLSLDKLSTCEW--GSYQGSGALTPE- 283

Query: 891  AVAFYDKEFILANQISEVASLANSDNGSEINDSTINHRVSFELNAMEVATCLEKELGLPL 712
            +V      F+L  Q S+V S   SDNG + ND  +NHRVSFEL A + + C+E++    +
Sbjct: 284  SVRRGSPNFLLHRQFSDVPSRPRSDNGHK-NDQVVNHRVSFELTAEDASRCVEEKPAFSI 342

Query: 711  RTISEASPAAAVNASTPERDGHPPNQTE---GPTSNDLAEKDSVDGEEEQLHIQQPSVLT 541
            +T+ E        A   +  G      E   G TSND  E  S DGE    H +Q S+  
Sbjct: 343  KTVPEYVENGT-QAKEEKNSGESIQSLECRVGVTSNDSPEMASTDGEVAPQHRKQQSI-- 399

Query: 540  SVGSVKEFKFENSDEGTSDKP---DWWAKENVVTKESGPHDNWTFFPVLQPGVS 388
            ++GSVKEF F+N+DEG S KP   +WWA   V+ KE     NW+FFP++Q GVS
Sbjct: 400  TLGSVKEFNFDNADEGDSRKPSTSNWWANGGVIGKEGETTKNWSFFPMVQSGVS 453


>gb|KDO51974.1| hypothetical protein CISIN_1g010808mg [Citrus sinensis]
          Length = 421

 Score =  130 bits (327), Expect = 2e-27
 Identities = 95/242 (39%), Positives = 124/242 (51%), Gaps = 10/242 (4%)
 Frame = -1

Query: 1083 TRQSSGSLTPDPAGPTSNYKEFLSENQISEGTGDVSNLWNLNELSASRWVTRTRQSSGSR 904
            +R  SGS+TPD  G  S      S +   +G G  S L                  SGS 
Sbjct: 204  SRLGSGSVTPDGVGIGSRMG---SGSLTPDGVGLGSRL-----------------GSGSL 243

Query: 903  TPDPAVAFYDKEFILANQISEVASLANSDNGSEINDSTINHRVSFELNAMEVATCLEKEL 724
            TPD         F+  NQISEVASLANSDNG++ ++  I+HRVSFEL+  EVA CL  + 
Sbjct: 244  TPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHIIDHRVSFELSGEEVARCLANKS 303

Query: 723  GLPLRTISEASPAAAVNASTPERDGHPPNQTE------GPTSNDLAEKDSVDGEEEQLHI 562
                R + E  P   V      RDG   +           +SN + EK   DGEEE  + 
Sbjct: 304  AASPRIVPE-FPQDIVPEGEIRRDGKLTDSENHFELCPEESSNRMPEKTMRDGEEEYCYR 362

Query: 561  QQPSVLTSVGSVKEFKFENSDEGTSDKP----DWWAKENVVTKESGPHDNWTFFPVLQPG 394
            +  S+  ++GS+KEF F+N++   S+KP    +WWA EN V KES P +NWTFFP+LQ  
Sbjct: 363  KHRSI--TLGSIKEFNFDNTEGEVSNKPSINSEWWANEN-VGKESKPSNNWTFFPMLQSE 419

Query: 393  VS 388
             S
Sbjct: 420  AS 421


Top