BLASTX nr result

ID: Mentha27_contig00038567 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00038567
         (495 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261...    80   3e-13
ref|XP_007203847.1| hypothetical protein PRUPE_ppa004367m1g, par...    74   3e-11
ref|XP_007012730.1| Hydroxyproline-rich glycoprotein family prot...    72   6e-11
gb|EXB38898.1| hypothetical protein L484_027333 [Morus notabilis]      72   8e-11
ref|XP_002514089.1| conserved hypothetical protein [Ricinus comm...    71   1e-10
ref|XP_004243732.1| PREDICTED: uncharacterized protein LOC101260...    70   2e-10
ref|XP_006342342.1| PREDICTED: pollen-specific leucine-rich repe...    68   1e-09
ref|XP_002309203.1| hydroxyproline-rich glycoprotein [Populus tr...    63   4e-08
ref|XP_002521366.1| conserved hypothetical protein [Ricinus comm...    59   5e-07
ref|XP_007138687.1| hypothetical protein PHAVU_009G229500g [Phas...    58   2e-06
ref|XP_007024556.1| Hydroxyproline-rich glycoprotein family prot...    56   4e-06

>ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261010 [Vitis vinifera]
          Length = 555

 Score = 80.1 bits (196), Expect = 3e-13
 Identities = 53/127 (41%), Positives = 72/127 (56%), Gaps = 3/127 (2%)
 Frame = -3

Query: 493 LNVVLVLFAVLCGVFARRNDEAPPSEEAVAADGRRIPATASTVSNFVNRPRQSVPEWM-D 317
           LNV+LVLFA+LCGVFAR+NDE   +++ +   G       S+ S  + +  +S+   + +
Sbjct: 74  LNVLLVLFAILCGVFARKNDE--KNDDVLENHG-------SSGSVVMGKSHESISHSLFE 124

Query: 316 FPDRREY--XXXXXXXXXXXXXXSYPDLRQESVWENEGANRSRFFDDFEVNFYRSPPPEN 143
           F DR+ Y                SYPDLRQES+W   G +R RFFDDFEVN YRSP   +
Sbjct: 125 FSDRKIYDPPIQSGSVRLRRSSSSYPDLRQESLW-GAGDDRRRFFDDFEVNNYRSPASSD 183

Query: 142 FEPSRRR 122
           +    RR
Sbjct: 184 YVRRHRR 190


>ref|XP_007203847.1| hypothetical protein PRUPE_ppa004367m1g, partial [Prunus persica]
           gi|462399378|gb|EMJ05046.1| hypothetical protein
           PRUPE_ppa004367m1g, partial [Prunus persica]
          Length = 339

 Score = 73.6 bits (179), Expect = 3e-11
 Identities = 48/126 (38%), Positives = 68/126 (53%), Gaps = 11/126 (8%)
 Frame = -3

Query: 493 LNVVLVLFAVLCGVFARRNDEAPPSEE---AVAADGRRIPATASTVSNFVNRPRQSVPEW 323
           LNV+LV+FA+LCG+FA+RND+  P+EE     A+D       A+  +N          +W
Sbjct: 57  LNVLLVVFAILCGIFAKRNDDGSPAEEDPIQNASDPLNNSIAANNTTNTSEAEVLLPQQW 116

Query: 322 MDFPDRREYXXXXXXXXXXXXXXSYPDLR---QESVWENEGANRS--RFFDDFEVN---F 167
             F +R                 SYPDLR   Q+S WE+   ++S  RFFDDFE+N   +
Sbjct: 117 FGFSER---PPETRGGRLRRSSSSYPDLRQLGQQSSWESGDHSKSQFRFFDDFEINNTTY 173

Query: 166 YRSPPP 149
           +R+PPP
Sbjct: 174 HRTPPP 179


>ref|XP_007012730.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
           gi|508783093|gb|EOY30349.1| Hydroxyproline-rich
           glycoprotein family protein [Theobroma cacao]
          Length = 610

 Score = 72.4 bits (176), Expect = 6e-11
 Identities = 44/119 (36%), Positives = 61/119 (51%), Gaps = 7/119 (5%)
 Frame = -3

Query: 493 LNVVLVLFAVLCGVFARRNDEAPPSEEAVAADGRRIPATASTVSNFVNRPRQSVPEWMDF 314
           LN+ LVLFA+LCGVFARRND+      + ++    +    +   N  +    +  +W  +
Sbjct: 72  LNIFLVLFAILCGVFARRNDD--DDNNSGSSGNNNVRNDNNNNKNEASSHPVNSQQWFGY 129

Query: 313 PDRREY-------XXXXXXXXXXXXXXSYPDLRQESVWENEGANRSRFFDDFEVNFYRS 158
           P R+ Y                     SYPDLR+ES+WE    +R RFFDDFE+N YRS
Sbjct: 130 PGRKIYDDDPPMNASGTSVRRLKRSSSSYPDLRKESLWET-SEHRFRFFDDFEINKYRS 187


>gb|EXB38898.1| hypothetical protein L484_027333 [Morus notabilis]
          Length = 509

 Score = 72.0 bits (175), Expect = 8e-11
 Identities = 51/134 (38%), Positives = 66/134 (49%), Gaps = 12/134 (8%)
 Frame = -3

Query: 493 LNVVLVLFAVLCGVFARRNDEAPPSEEAVAADGRRIPATASTVSNFVNRPRQSVPEWMDF 314
           LN+ LVLFA+LCG+FARRND+   + + V    R      S  +N    P++    W  F
Sbjct: 77  LNIFLVLFAILCGIFARRNDDESANNDVVPTARRSGGVEESEPAN----PQR----WFAF 128

Query: 313 PDRR----------EYXXXXXXXXXXXXXXSYPDLRQESVWE--NEGANRSRFFDDFEVN 170
            D R                          SYPDLRQES+WE  ++   + RFFDDFE+N
Sbjct: 129 SDDRRSEKIYDSVDRTAESGSLRRLRRSSSSYPDLRQESLWETGDDPRFQFRFFDDFEIN 188

Query: 169 FYRSPPPENFEPSR 128
            YR   P  F+PSR
Sbjct: 189 KYRVTAP--FDPSR 200


>ref|XP_002514089.1| conserved hypothetical protein [Ricinus communis]
           gi|223546545|gb|EEF48043.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 831

 Score = 71.2 bits (173), Expect = 1e-10
 Identities = 43/113 (38%), Positives = 61/113 (53%), Gaps = 1/113 (0%)
 Frame = -3

Query: 493 LNVVLVLFAVLCGVFARRNDEAPPSEEAVAADGRRIPATASTVSNFVNRPRQSVPEWMDF 314
           LNV LVLFA+LCG+FARRND+    + A + D     +     SN       +V     +
Sbjct: 57  LNVFLVLFAILCGIFARRNDD----DSAPSGDHSNSSSVLHNNSNNNKERDHAVSNHSHW 112

Query: 313 PDRREYXXXXXXXXXXXXXXSYPDLRQESVWEN-EGANRSRFFDDFEVNFYRS 158
            D  ++              SYPDLRQES+W++ +  +R RFFDDFE++ +RS
Sbjct: 113 LDDNQFASATPMRRLKRSSSSYPDLRQESLWQSGDDIDRFRFFDDFELSKFRS 165


>ref|XP_004243732.1| PREDICTED: uncharacterized protein LOC101260449 [Solanum
           lycopersicum]
          Length = 608

 Score = 70.5 bits (171), Expect = 2e-10
 Identities = 49/129 (37%), Positives = 65/129 (50%), Gaps = 6/129 (4%)
 Frame = -3

Query: 490 NVVLVLFAVLCGVFARRNDEAPPSEEAVAADGRRIPATASTVSNFVNR---PRQSVPEWM 320
           N++LV+FA+LCG+FAR+ND+        AA+  R  +T  + SNF +    P  S   W 
Sbjct: 77  NILLVVFAILCGIFARKNDDNS------AAERNRNVSTTESSSNFNDHHMPPTVSNDRWF 130

Query: 319 DFPDRREY---XXXXXXXXXXXXXXSYPDLRQESVWENEGANRSRFFDDFEVNFYRSPPP 149
           +    + Y                 SYPDLRQ   WE  G N SRF DDF VN YRS   
Sbjct: 131 ETSHDKTYNFGVPETSVNRLRRSSSSYPDLRQVPQWET-GQNHSRFSDDFGVNLYRSTAS 189

Query: 148 ENFEPSRRR 122
           E ++  R+R
Sbjct: 190 E-YDTHRQR 197


>ref|XP_006342342.1| PREDICTED: pollen-specific leucine-rich repeat extensin-like
           protein 1-like [Solanum tuberosum]
          Length = 642

 Score = 68.2 bits (165), Expect = 1e-09
 Identities = 49/138 (35%), Positives = 66/138 (47%), Gaps = 15/138 (10%)
 Frame = -3

Query: 490 NVVLVLFAVLCGVFARRNDEAPPSEEAVAADGRRIPATAST------------VSNFVNR 347
           N++LV+FA+LCG+FAR+ND+    E       R +  T S+            V + + R
Sbjct: 77  NILLVVFAILCGIFARKNDDNSAVER-----NRNVSTTESSNFNDGSASADVDVDHDMRR 131

Query: 346 PRQSVPEWMDFPDRREY---XXXXXXXXXXXXXXSYPDLRQESVWENEGANRSRFFDDFE 176
           P  S   W +  D + Y                 SYPDLRQ   WE  G N SRF+DDF 
Sbjct: 132 P-VSNDRWFEASDEKTYHFGVPETSVNRLRRSSSSYPDLRQVPQWET-GENHSRFYDDFG 189

Query: 175 VNFYRSPPPENFEPSRRR 122
           VN YRS   E ++  R+R
Sbjct: 190 VNLYRSTASE-YDTHRQR 206


>ref|XP_002309203.1| hydroxyproline-rich glycoprotein [Populus trichocarpa]
           gi|222855179|gb|EEE92726.1| hydroxyproline-rich
           glycoprotein [Populus trichocarpa]
          Length = 547

 Score = 63.2 bits (152), Expect = 4e-08
 Identities = 44/126 (34%), Positives = 59/126 (46%), Gaps = 13/126 (10%)
 Frame = -3

Query: 493 LNVVLVLFAVLCGVFARRND-EAPPSEEAVAADGRRIPATASTVSNFVNRPRQSVPEWMD 317
           L + LVLF +LCG+FARRND E+  +E+  +   +  P + S    F +          D
Sbjct: 59  LYIFLVLFTILCGIFARRNDDESTTNEDNPSNHDKSKPHSVSNAPWFAD----------D 108

Query: 316 FPDRREYXXXXXXXXXXXXXXS------------YPDLRQESVWENEGANRSRFFDDFEV 173
           F D + Y              +            YPDL Q+S WE    +R RFFDDFE+
Sbjct: 109 FSDPKIYANTNNSTPLGGTATAATGDRLKMNSRSYPDLMQDSFWETPD-DRFRFFDDFEI 167

Query: 172 NFYRSP 155
           N YRSP
Sbjct: 168 NKYRSP 173


>ref|XP_002521366.1| conserved hypothetical protein [Ricinus communis]
           gi|223539444|gb|EEF41034.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 553

 Score = 59.3 bits (142), Expect = 5e-07
 Identities = 42/121 (34%), Positives = 58/121 (47%), Gaps = 6/121 (4%)
 Frame = -3

Query: 493 LNVVLVLFAVLCGVFARRN----DEAPPSEEAVAADGRRIPATASTVSNFVNRPRQSVP- 329
           LN VLVLFA++CG   R +    +E+  S + +++      A++S V   V R   S P 
Sbjct: 77  LNFVLVLFAIVCGFLGRNSPNTSNESSTSYQRLSSSSS---ASSSNVQQDVQRSYPSTPA 133

Query: 328 -EWMDFPDRREYXXXXXXXXXXXXXXSYPDLRQESVWENEGANRSRFFDDFEVNFYRSPP 152
             W D    ++               SYPDLRQES+W N    R RF+DD  VN Y+   
Sbjct: 134 YRWYDDGQYQDRTASYNTFNRLRSFRSYPDLRQESLWSNND-ERWRFYDDTRVNGYKFSS 192

Query: 151 P 149
           P
Sbjct: 193 P 193


>ref|XP_007138687.1| hypothetical protein PHAVU_009G229500g [Phaseolus vulgaris]
           gi|561011774|gb|ESW10681.1| hypothetical protein
           PHAVU_009G229500g [Phaseolus vulgaris]
          Length = 570

 Score = 57.8 bits (138), Expect = 2e-06
 Identities = 50/130 (38%), Positives = 67/130 (51%), Gaps = 12/130 (9%)
 Frame = -3

Query: 493 LNVVLVLFAVLCGVFARRND-EAPPSEE---AVA---ADGRRIPATASTVSNFVNRPRQS 335
           LN++LV+FA+LCGVFARRND E  PS     AV    A  RR+P+     S ++  P ++
Sbjct: 58  LNILLVVFAILCGVFARRNDDEQTPSNNHHHAVPDRNAAFRRVPSQGQ--SRWLGIPGET 115

Query: 334 VPEWMDFPDRREYXXXXXXXXXXXXXXS---YPDLRQESVWEN-EGANRSRFFDDFEVN- 170
                D P  R                +   YPDLRQ   WE  +  N+ RFFDDFE++ 
Sbjct: 116 KDFINDTPLNRFQSPPTAGATRLRMRRNSSSYPDLRQ---WETADDRNKFRFFDDFEIDK 172

Query: 169 FYRSPPPENF 140
            +RSP  + F
Sbjct: 173 QFRSPARDYF 182


>ref|XP_007024556.1| Hydroxyproline-rich glycoprotein family protein, putative
           [Theobroma cacao] gi|508779922|gb|EOY27178.1|
           Hydroxyproline-rich glycoprotein family protein,
           putative [Theobroma cacao]
          Length = 553

 Score = 56.2 bits (134), Expect = 4e-06
 Identities = 40/116 (34%), Positives = 54/116 (46%), Gaps = 4/116 (3%)
 Frame = -3

Query: 493 LNVVLVLFAVLCGVFARRNDEAPPSEEAVAADGRRIPATASTVSNFVNRPRQSVP-EWMD 317
           LN+VLVLFA++CG F  +N+    S+     +  +   T     + V R   S P +W D
Sbjct: 75  LNLVLVLFAIICG-FLGKNNGNNDSDTRSTYEDYKFSTTPKHDRDHVGRSNPSTPRQWYD 133

Query: 316 FP---DRREYXXXXXXXXXXXXXXSYPDLRQESVWENEGANRSRFFDDFEVNFYRS 158
           +    DR  Y               YPDLR ES W   G +R RF+DD  +  YRS
Sbjct: 134 YSSSSDRTAYNSLQRLRSSNS----YPDLRPESSWMMNGDDRWRFYDDTPLYNYRS 185


Top