BLASTX nr result

ID: Mentha26_contig00012172 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00012172
         (974 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU23414.1| hypothetical protein MIMGU_mgv1a020543mg [Mimulus...   164   7e-38
ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578...   121   5e-25
ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241...   119   1e-24
ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-r...   108   3e-21
ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620...   104   6e-20
ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citr...   103   8e-20
ref|XP_007210661.1| hypothetical protein PRUPE_ppa022176mg [Prun...   103   1e-19
ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r...   100   1e-18
ref|XP_002509872.1| conserved hypothetical protein [Ricinus comm...    97   1e-17
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...    96   2e-17
ref|XP_006573794.1| PREDICTED: uncharacterized protein LOC102667...    96   2e-17
gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis]      96   3e-17
ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-r...    96   3e-17
ref|XP_002887922.1| hypothetical protein ARALYDRAFT_474948 [Arab...    95   4e-17
ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, part...    94   1e-16
ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297...    94   1e-16
ref|NP_974086.1| late embryogenesis abundant hydroxyproline-rich...    91   7e-16
ref|XP_006300370.1| hypothetical protein CARUB_v10022353mg [Caps...    91   9e-16
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...    90   1e-15
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...    90   1e-15

>gb|EYU23414.1| hypothetical protein MIMGU_mgv1a020543mg [Mimulus guttatus]
          Length = 198

 Score =  164 bits (414), Expect = 7e-38
 Identities = 99/211 (46%), Positives = 125/211 (59%), Gaps = 2/211 (0%)
 Frame = -3

Query: 762 MDEETY--LNPEAKSDQKIVTNKALHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLV 589
           M+EE++  +NP  KSD++  T    +     +   SSK LVYIL+A V  S+ FL+FGLV
Sbjct: 1   MEEESHRIINPYIKSDEEEFTTTTKN-NRRGKGGGSSKCLVYILVAVVLQSVAFLVFGLV 59

Query: 588 VLRINAPSLRLSNVVVKDLRYSNSSFNATFIADIRLHNMNFGRFDFRGGSTALYYGNATX 409
            LRI+ PSLRLS+  V  LR+ ++S N T +A IRL N NFG F+F GGS +L YG AT 
Sbjct: 60  ALRISNPSLRLSSAAVAVLRHDSASLNMTVVAGIRLRNPNFGDFEFNGGSASLLYGEATV 119

Query: 408 XXXXXXXXXXXXXXXRNIDAAVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRV 229
                          + I+  +EV+GG                 LVKL  +AELRGE+RV
Sbjct: 120 GVASIYGGRVGRRDKKEINVTMEVIGGGGG------------GELVKLRSMAELRGEVRV 167

Query: 228 MKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 136
           +KIV R R A MNCTMDLNLT QA Q LSCQ
Sbjct: 168 VKIVKRRRIAFMNCTMDLNLTSQAFQDLSCQ 198


>ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578735 [Solanum tuberosum]
          Length = 223

 Score =  121 bits (303), Expect = 5e-25
 Identities = 71/205 (34%), Positives = 114/205 (55%), Gaps = 8/205 (3%)
 Frame = -3

Query: 729 KSDQKIVTNKALH---PTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLR 559
           KSDQ +  +K+++        +  +S K  VY L   V LSI+ LIF +V  R  +PS  
Sbjct: 18  KSDQGLNLSKSINYYNNHNNNKNRKSGKCFVYFLSTIVILSIIMLIFSMVFFRFKSPSFE 77

Query: 558 LSNVVVKDLRYSNS----SFNATFIADIRLHNMNFGRFDFRGGSTALY-YGNATXXXXXX 394
           L ++ V++LR+SNS    SFN     +I + N NFG+ +++  S +++ Y N T      
Sbjct: 78  LDHINVQNLRFSNSTNSSSFNMNMGGEIIVDNDNFGQINYQDSSMSVFLYDNVTIGIANV 137

Query: 393 XXXXXXXXXXRNIDAAVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVN 214
                     + I  ++++        +Y N+S DI S ++KL    E RG+++ MKI++
Sbjct: 138 NVGRVEARKSKRIGISLQLRTNYQLNYSYGNLSSDINSRMLKLTSFGEFRGKVKAMKIIS 197

Query: 213 RWRTAMMNCTMDLNLTGQAIQGLSC 139
           + +T++MNCTM+LNLT QAIQ L C
Sbjct: 198 KHKTSIMNCTMNLNLTSQAIQDLLC 222


>ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241699 [Vitis vinifera]
          Length = 213

 Score =  119 bits (299), Expect = 1e-24
 Identities = 76/218 (34%), Positives = 109/218 (50%), Gaps = 9/218 (4%)
 Frame = -3

Query: 762 MDEETYLNPEA------KSDQKIVTNKALHPTPEQEEHRSSKTLVYILLAAVALSIVFLI 601
           M E+    P A      KSD++    K   P   +   RSSK  VY+L   V L+ + L+
Sbjct: 1   MPEDNQFQPLAPARLHGKSDEEFGVFK---PRASKPPRRSSKCPVYVLAGLVTLAAIALV 57

Query: 600 FGLVVLRINAPSLRLSNVVVKDLRYSNS---SFNATFIADIRLHNMNFGRFDFRGGSTAL 430
           F L VLR+ AP + L +V VK+L +  S   SFN T  A++ + N NFG F+F  G+  +
Sbjct: 58  FALAVLRVEAPDVELKSVAVKNLTHGTSPSPSFNVTLTAEVSVQNKNFGAFNFENGTATV 117

Query: 429 YYGNATXXXXXXXXXXXXXXXXRNIDAAVEVMGGPSSAANYLNISRDIESNLVKLVGVAE 250
            Y                    + ++  ++V        N  N+S DI S  V L   A+
Sbjct: 118 LYEGMVVGDEEFSKAHVESRKTKRMNVTLDVRS--DRLWNDKNLSSDISSGSVNLTTYAQ 175

Query: 249 LRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 136
           + G++RVMK+V R  TA MNC+M LNLT  +IQ L C+
Sbjct: 176 VTGKVRVMKVVRRRTTARMNCSMTLNLTSSSIQDLVCR 213


>ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777616|gb|EOY24872.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 213

 Score =  108 bits (270), Expect = 3e-21
 Identities = 62/190 (32%), Positives = 97/190 (51%), Gaps = 3/190 (1%)
 Frame = -3

Query: 696 LHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS 517
           + PT  Q + +SSK LVY+L+  V    V LIF  +VLR   P + + +V V++L+Y NS
Sbjct: 26  IKPTASQRKEKSSKCLVYVLVGMVIQGAVLLIFASIVLRARTPDVEIVSVTVRNLKYGNS 85

Query: 516 ---SFNATFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAA 346
              SFN T + ++ + N NFG F F   +  ++ G+                    ++ +
Sbjct: 86  SAPSFNLTLVTEVTVENSNFGDFKFENTTGTVWCGSVVVGKMKIPTGRAQARATERLNVS 145

Query: 345 VEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLT 166
           V+V   P    +  N+S +I S L++L    +L G++ +M  + R R   MNC M LNLT
Sbjct: 146 VDVSSLP--LPDTKNVSCNISSGLLELNSHVKLSGKVSIMNFMKRRRHPEMNCFMTLNLT 203

Query: 165 GQAIQGLSCQ 136
           GQ  Q   C+
Sbjct: 204 GQTKQDFPCE 213


>ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620163 [Citrus sinensis]
          Length = 214

 Score =  104 bits (259), Expect = 6e-20
 Identities = 59/204 (28%), Positives = 106/204 (51%), Gaps = 3/204 (1%)
 Frame = -3

Query: 741 NPEAKSDQKIVTNKALHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSL 562
           N   +SDQ+        P   + + +SSK LVY+L+  V +S   LI   + LR N P +
Sbjct: 15  NEYPRSDQEYA------PAVIESQRKSSKCLVYVLVTIVTVSAALLISASIFLRPNTPEV 68

Query: 561 RLSNVVVKDLRYSNS---SFNATFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXX 391
           +L +V VK+L + N    SFN T + ++ + N N+G F+++  S +++YG+ T       
Sbjct: 69  QLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGSVFYGSVTVGDVKIR 128

Query: 390 XXXXXXXXXRNIDAAVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNR 211
                    + I+  V+V    +   +  N+  DI S +VKL   A+L G + +  ++ +
Sbjct: 129 DGRVEAREVKRINVTVDVDVRSNGNLDNQNLRSDINSGIVKLNSYAKLHGNVSLFNVLKK 188

Query: 210 WRTAMMNCTMDLNLTGQAIQGLSC 139
            +T  ++C+M+L L  +A++ L C
Sbjct: 189 TKTPELDCSMNLVLARRAVEDLVC 212


>ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citrus clementina]
           gi|557541714|gb|ESR52692.1| hypothetical protein
           CICLE_v10023929mg [Citrus clementina]
          Length = 214

 Score =  103 bits (258), Expect = 8e-20
 Identities = 59/204 (28%), Positives = 106/204 (51%), Gaps = 3/204 (1%)
 Frame = -3

Query: 741 NPEAKSDQKIVTNKALHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSL 562
           N   +SDQ+        P   + + +SSK LVY+L+  V +S   LI   + LR N P +
Sbjct: 15  NEYPRSDQEYA------PAVIESQRKSSKCLVYVLVTIVTVSAALLISASIFLRPNTPEV 68

Query: 561 RLSNVVVKDLRYSNS---SFNATFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXX 391
           +L +V VK+L + N    SFN T + ++ + N N+G F+++  S +++YG+ T       
Sbjct: 69  QLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGSVFYGSVTVGDVKIR 128

Query: 390 XXXXXXXXXRNIDAAVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNR 211
                    + I+  V+V    +   +  N+S D  S +VKL   A+L G + +  ++ +
Sbjct: 129 DGRVEAREVKRINVTVDVDVRSNGNLDNQNLSSDRNSGIVKLNSYAKLHGNVNLFNVLKK 188

Query: 210 WRTAMMNCTMDLNLTGQAIQGLSC 139
            +T  ++C+M+L L  +A++ L C
Sbjct: 189 TKTPELDCSMNLVLARRAVEDLVC 212


>ref|XP_007210661.1| hypothetical protein PRUPE_ppa022176mg [Prunus persica]
           gi|462406396|gb|EMJ11860.1| hypothetical protein
           PRUPE_ppa022176mg [Prunus persica]
          Length = 213

 Score =  103 bits (256), Expect = 1e-19
 Identities = 63/203 (31%), Positives = 104/203 (51%), Gaps = 5/203 (2%)
 Frame = -3

Query: 729 KSDQKIVTNKALHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSN 550
           +SD++  T +A+         RS+K  VY+  A V  SI  L+F LVVLR+ +P   LS+
Sbjct: 19  RSDEENPTFRAIR------RERSNKCFVYVFAAIVLQSIFILVFALVVLRVKSPGFNLSS 72

Query: 549 VVVKDLRYS---NSSFNATFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXX 379
           V VK L+++    SS NAT + ++ + N NFG + F G S +L+YG              
Sbjct: 73  VSVKSLKHTTSPTSSLNATLVTELAIKNKNFGEYKFEGSSASLWYGGFKVGEAKIGKGRV 132

Query: 378 XXXXXRNIDAAVEVMGG--PSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWR 205
                R +  +++V     P  A N      ++ S  +K+   A+L G++ +MKI+ + +
Sbjct: 133 KARGTRRVSLSIDVRSNRLPQEAKN--GFEGEMNSGYLKISSYAKLTGKVNLMKIMKKRK 190

Query: 204 TAMMNCTMDLNLTGQAIQGLSCQ 136
           T   NCTM + L  + ++ L C+
Sbjct: 191 TIDTNCTMVVVLKSRTVKDLFCR 213


>ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777614|gb|EOY24870.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 188

 Score =  100 bits (248), Expect = 1e-18
 Identities = 58/180 (32%), Positives = 91/180 (50%), Gaps = 3/180 (1%)
 Frame = -3

Query: 666 RSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS---FNATFI 496
           ++ K   YI+   V  +I+ L+F L V+RI  PS RL +V V+ L Y+ S    FN   I
Sbjct: 11  QNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASGVPHFNMRLI 70

Query: 495 ADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAAVEVMGGPSSA 316
            +I + N NFG F F   +  + +G+                  + ++  V+V     S 
Sbjct: 71  MEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVSSSAVSD 130

Query: 315 ANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 136
            + L     + S  + L GVA LRG++ +MK++ + +TA MNCTM +NL   A+Q L C+
Sbjct: 131 EDELRTK--LSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQDLDCE 188


>ref|XP_002509872.1| conserved hypothetical protein [Ricinus communis]
           gi|223549771|gb|EEF51259.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 221

 Score = 97.1 bits (240), Expect = 1e-17
 Identities = 61/189 (32%), Positives = 94/189 (49%), Gaps = 11/189 (5%)
 Frame = -3

Query: 672 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS------- 514
           + RSSK LVY+L   V LS V L+F LVVLR   P+  LS V +KDL Y+  S       
Sbjct: 33  QERSSKCLVYVLAGIVILSAVILVFALVVLRPVNPNAELSFVRLKDLNYAAGSGGNGNNV 92

Query: 513 ----FNATFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAA 346
               FN T  +++++ N NFG F +   S  ++YG                     ++  
Sbjct: 93  SLPAFNMTLESELKIENSNFGEFKYDNTSARVFYGGMAVGEAILREGRVSARDTLRMNVK 152

Query: 345 VEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLT 166
           VEV        N  +++ DI S ++KL   A+  G + +++I  + R+A M+C+  L+L 
Sbjct: 153 VEVR-SHKYIYNGTDLTSDINSGILKLNSHAKFSGRVNLLQIAKKRRSASMDCSFSLDLR 211

Query: 165 GQAIQGLSC 139
            ++IQ L C
Sbjct: 212 SRSIQDLVC 220


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score = 96.3 bits (238), Expect = 2e-17
 Identities = 55/194 (28%), Positives = 95/194 (48%), Gaps = 4/194 (2%)
 Frame = -3

Query: 705 NKALHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRY 526
           N  +    E +  +  K   Y     V  +IV L+F L V+RI  P  R+ ++ V+D+ Y
Sbjct: 10  NIDMESAAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAY 69

Query: 525 SNS----SFNATFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRN 358
           +++    SFN  F A++ + N NFG F F   + +  YG                   + 
Sbjct: 70  TSTPNPPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKK 129

Query: 357 IDAAVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMD 178
           ++  V++      A +  N++ DI S  + L    +L G++ +MK++ + ++A MNCTM 
Sbjct: 130 MNVTVDLNSNNIPANS--NLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMT 187

Query: 177 LNLTGQAIQGLSCQ 136
           +NL  +AIQ + CQ
Sbjct: 188 VNLASRAIQDIKCQ 201


>ref|XP_006573794.1| PREDICTED: uncharacterized protein LOC102667904 [Glycine max]
          Length = 184

 Score = 95.9 bits (237), Expect = 2e-17
 Identities = 59/182 (32%), Positives = 88/182 (48%), Gaps = 1/182 (0%)
 Frame = -3

Query: 678 QEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS-SFNAT 502
           Q+E RS K  VY+L A V L  + L+F  + LR+  P L+L +     + YS S SFNAT
Sbjct: 6   QQERRSGKCFVYLLAAFVILCALVLVFASL-LRVKNPYLKLRSATSNKISYSTSPSFNAT 64

Query: 501 FIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAAVEVMGGPS 322
            I  + + N NFG F +     ++ Y                    + I+  V++M   +
Sbjct: 65  LIIFLGIKNPNFGAFSYNNNRVSVLYAGVKIADRQINGGRVRFRETKEINVTVKLMSAKA 124

Query: 321 SAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLS 142
             +   N+S DI S  + L    +  G + ++KI+N  +T  M C M LNLT   IQG+ 
Sbjct: 125 PISE--NLSIDISSGSLNLTSNVKFSGTVHMLKIINIRKTIEMACAMKLNLTSHTIQGIQ 182

Query: 141 CQ 136
           CQ
Sbjct: 183 CQ 184


>gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis]
          Length = 213

 Score = 95.5 bits (236), Expect = 3e-17
 Identities = 63/201 (31%), Positives = 97/201 (48%), Gaps = 4/201 (1%)
 Frame = -3

Query: 729 KSDQKIVTNKALHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSN 550
           +SD++    KAL       + R++K  VYI    V L  + LIF L+VLR  +P ++L +
Sbjct: 19  RSDEENPAFKALR------KERTNKCFVYIFAGIVILGAILLIFALIVLRSKSPEIKLKS 72

Query: 549 VVVKDLRYSNS---SFNATFIADIRLHNMNFGRFDFRGGSTALY-YGNATXXXXXXXXXX 382
           V VK L YS S   S NAT IA + + N NFG + F   ++A++ YG             
Sbjct: 73  VTVKSLDYSTSPWPSLNATLIATVAIKNPNFGPYRFGSNNSAVFLYGGGKLGEQRIRQGK 132

Query: 381 XXXXXXRNIDAAVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRT 202
                 + ++  VE+        +  N+  D+ S +V L    +  G + ++KI    +T
Sbjct: 133 ATAKATKRVNVTVEIRTSRLPQGSN-NLGGDLSSGMVNLSSYCKFTGRVHLIKIFENRKT 191

Query: 201 AMMNCTMDLNLTGQAIQGLSC 139
           A MNC M L L  + I+ L C
Sbjct: 192 AEMNCAMTLVLKTKMIKNLRC 212


>ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721845|gb|EOY13742.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 192

 Score = 95.5 bits (236), Expect = 3e-17
 Identities = 55/181 (30%), Positives = 93/181 (51%), Gaps = 4/181 (2%)
 Frame = -3

Query: 666 RSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSSFNATFI--- 496
           R+ K    ++   +A +I+ L+F L+V+RI  P +RL  V V++LR S+SS + +F    
Sbjct: 12  RNIKCWAIVVAGVIAKTIIILLFVLIVMRIRNPKVRLGGVTVENLRASSSSSSPSFSTKL 71

Query: 495 -ADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAAVEVMGGPSS 319
            A + + N NFG F F+  +  + Y  +                 +  +  + V      
Sbjct: 72  NAQVSVKNTNFGHFKFKNSTLTISYNGSPVGKATIVEGLARARSTKKFNVTILVSSNNKI 131

Query: 318 AANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSC 139
           + N   +S DIES  + L   A+L G+I + KI  + ++A MNCTMD+N + + IQ L+C
Sbjct: 132 SRNSDQLSSDIESGTINLSSHAKLEGKIHLFKIFKKKKSAEMNCTMDVNTSLKQIQKLTC 191

Query: 138 Q 136
           +
Sbjct: 192 K 192


>ref|XP_002887922.1| hypothetical protein ARALYDRAFT_474948 [Arabidopsis lyrata subsp.
           lyrata] gi|297333763|gb|EFH64181.1| hypothetical protein
           ARALYDRAFT_474948 [Arabidopsis lyrata subsp. lyrata]
          Length = 214

 Score = 95.1 bits (235), Expect = 4e-17
 Identities = 61/185 (32%), Positives = 92/185 (49%), Gaps = 6/185 (3%)
 Frame = -3

Query: 672 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS----FNA 505
           E    K LVY L   V +  + LI   + LRI+ P +   ++  +DLR+  +S    FNA
Sbjct: 33  EEPPGKCLVYSLTIIVIIFALCLILSSIFLRISKPEIETRSISTRDLRFGGNSTNPYFNA 92

Query: 504 TFIADIRLHNMNFGRFDFRGGSTALYYGN--ATXXXXXXXXXXXXXXXXRNIDAAVEVMG 331
           T ++DI + N NFG F+F   S  + Y +                    R  D  VE+  
Sbjct: 93  TLVSDISIRNSNFGAFEFEDSSLRVVYADHGVVGETKIAGRRVEAHKTVRITDVVVEI-- 150

Query: 330 GPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQ 151
           G     N  ++  D+    ++L  VAE+RG I+V+    RW+ ++M+CTM LNLTG+ IQ
Sbjct: 151 GSFRLLNTKDLDSDLRLGFLELRSVAEVRGRIKVLGR-RRWKVSVMSCTMRLNLTGRFIQ 209

Query: 150 GLSCQ 136
            L C+
Sbjct: 210 NLLCE 214


>ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, partial [Populus
           trichocarpa] gi|550346930|gb|ERP65340.1| hypothetical
           protein POPTR_0001s09980g, partial [Populus trichocarpa]
          Length = 173

 Score = 93.6 bits (231), Expect = 1e-16
 Identities = 58/170 (34%), Positives = 92/170 (54%), Gaps = 3/170 (1%)
 Frame = -3

Query: 639 LLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS---SFNATFIADIRLHNMN 469
           L   V LS + L+F ++V +   P ++LS+V V+ L Y N+   SFN T  A++ + N N
Sbjct: 7   LALIVILSAIILVFAIIV-KPRTPRVKLSSVAVEHLSYGNNPIPSFNMTLAAEVSVKNSN 65

Query: 468 FGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAAVEVMGGPSSAANYLNISRD 289
           F RF F   S++  Y                    R ++  V++ G P S +   N+S D
Sbjct: 66  FVRFKFENTSSSALYKGMVVGEAKLRSGRVGARKTRRMNIVVKI-GSPGSLSEAKNLSSD 124

Query: 288 IESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSC 139
           I S ++K+   A L+G++R+  IV   RTA+M+C M+LNL+ ++IQ L C
Sbjct: 125 INSGMLKMNSYATLKGDVRLFGIVKN-RTAVMSCGMNLNLSSRSIQDLEC 173


>ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297644 [Fragaria vesca
           subsp. vesca]
          Length = 210

 Score = 93.6 bits (231), Expect = 1e-16
 Identities = 60/212 (28%), Positives = 103/212 (48%), Gaps = 4/212 (1%)
 Frame = -3

Query: 759 DEETYLNPEAKSDQKIVTNKALHPTPEQ-EEHRSSKTLVYILLAAVALSIVFLIFGLVVL 583
           D+E+ + P A    K+      +PT +     RS+K  VY+    V   +  L+F L+VL
Sbjct: 3   DQESQIWPLAPG--KLHQRSEENPTFKAIRRERSNKCFVYVFSGIVFFCVTVLVFALLVL 60

Query: 582 RINAPSLRLSNVVVKDLRYSNS--SFNATFIADIRLHNMNFGRFDFRGGSTALYYGNATX 409
           R+ +P +RL +V VK L+Y++S  SFN +    + + N NFG ++F   + +  Y     
Sbjct: 61  RVKSPEIRLRSVTVKSLKYTSSPPSFNVSLSGQMSVKNPNFGDYEFVPTTVSFLYSRGAV 120

Query: 408 XXXXXXXXXXXXXXXRNIDAAVEVMGGP-SSAANYLNISRDIESNLVKLVGVAELRGEIR 232
                            +   V++        AN   +  DI S ++KL G  ++ G++ 
Sbjct: 121 GSTKVAKGLAKVKKTERLSFGVDLRSNKLPEGAN--TLKSDINSGMLKLTGTGKVSGKVT 178

Query: 231 VMKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 136
           + KI+N+ +T  M+CTM L L  + I+ L C+
Sbjct: 179 LWKIINKRKTGKMDCTMTLVLKSKTIKDLVCR 210


>ref|NP_974086.1| late embryogenesis abundant hydroxyproline-rich glycoprotein
           [Arabidopsis thaliana] gi|49823490|gb|AAT68728.1|
           hypothetical protein At1g64065 [Arabidopsis thaliana]
           gi|55740529|gb|AAV63857.1| hypothetical protein
           At1g64065 [Arabidopsis thaliana]
           gi|332196066|gb|AEE34187.1| late embryogenesis abundant
           hydroxyproline-rich glycoprotein [Arabidopsis thaliana]
          Length = 214

 Score = 90.9 bits (224), Expect = 7e-16
 Identities = 57/183 (31%), Positives = 89/183 (48%), Gaps = 4/183 (2%)
 Frame = -3

Query: 672 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS----FNA 505
           E    K LVY L   V +  + LI   + LRI+ P +   ++  +DLR   +S    FNA
Sbjct: 33  EEPPGKCLVYSLTIIVIIFALCLILSSIFLRISKPEIETRSISTRDLRSGGNSTNPYFNA 92

Query: 504 TFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAAVEVMGGP 325
           T ++DI + N NFG F+F   +  + Y +                        V V  G 
Sbjct: 93  TLVSDISIRNSNFGAFEFEDSTLRVVYADHGVVGETKIEGRRVEAHKTVRITGVVVEIGS 152

Query: 324 SSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGL 145
               +  ++ +D+    ++L  VAE+RG I+V+    RW+ ++M+CTM LNLTG+ IQ L
Sbjct: 153 FRLLDTKDLDKDLRLGFLELRSVAEVRGRIKVLGR-KRWKVSVMSCTMRLNLTGRFIQNL 211

Query: 144 SCQ 136
            C+
Sbjct: 212 LCE 214


>ref|XP_006300370.1| hypothetical protein CARUB_v10022353mg [Capsella rubella]
           gi|482569080|gb|EOA33268.1| hypothetical protein
           CARUB_v10022353mg [Capsella rubella]
          Length = 215

 Score = 90.5 bits (223), Expect = 9e-16
 Identities = 60/182 (32%), Positives = 86/182 (47%), Gaps = 4/182 (2%)
 Frame = -3

Query: 672 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS----FNA 505
           E    K LVY L   V +  V LI   + LRI+ P +   +V  +DLR   +S    FNA
Sbjct: 34  EEPPGKCLVYSLTIIVIVFAVCLILSSIFLRISKPEIETRSVSTRDLRSGGNSTNPYFNA 93

Query: 504 TFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAAVEVMGGP 325
           T ++DI + N NFG F+F   S  + Y +                        V V  G 
Sbjct: 94  TLVSDISIRNSNFGAFEFEDSSLRVVYADHGVVGEATIPGRRVEAHKTVRITGVVVEIGS 153

Query: 324 SSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGL 145
               +   +  D+ S  ++L  VAE+RG I+V+    RW+ ++M+CTM LNLT + IQ L
Sbjct: 154 FRLLDRKGLELDLRSGFLELRSVAEVRGRIKVLG-RRRWKVSVMSCTMRLNLTNRFIQNL 212

Query: 144 SC 139
            C
Sbjct: 213 FC 214


>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777615|gb|EOY24871.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 215

 Score = 90.1 bits (222), Expect = 1e-15
 Identities = 54/206 (26%), Positives = 101/206 (49%), Gaps = 4/206 (1%)
 Frame = -3

Query: 741 NPEAKSDQKIVTNKALHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSL 562
           N   +SD++     A   + E +  +  K  VYI   AV  ++V LIF L V+R+  P +
Sbjct: 15  NGHPRSDEE----SASLQSKELKRKKRIKYAVYIAAFAVFQTVVILIFALTVMRVKNPKV 70

Query: 561 RLSNVVVKDLRYSN----SSFNATFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXX 394
           R+  V V+ +  SN    +SFN  FI  + + N NFG + F   + +  Y          
Sbjct: 71  RIGKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFLYDGVMVGEAII 130

Query: 393 XXXXXXXXXXRNIDAAVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVN 214
                     + +D  VEV    +  +    +  ++ S+++ L   A+L+G++ +MK++ 
Sbjct: 131 PKARARARSTKKLDVTVEV-NSSALTSTTTGLGSELSSSVLTLNSQAKLKGKVELMKVMK 189

Query: 213 RWRTAMMNCTMDLNLTGQAIQGLSCQ 136
           + ++  MNCT+  N++ +++Q L C+
Sbjct: 190 KKKSPEMNCTLIFNVSTRSLQDLKCK 215


>ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721844|gb|EOY13741.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 259

 Score = 90.1 bits (222), Expect = 1e-15
 Identities = 48/188 (25%), Positives = 88/188 (46%), Gaps = 2/188 (1%)
 Frame = -3

Query: 696 LHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS 517
           +  + E +  +  K L Y+    +  + + L+F L V+RI  P  R+ +V+V DL ++NS
Sbjct: 10  MEQSKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNS 69

Query: 516 S--FNATFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAAV 343
           S  FN  FIA + + N NFG + F   +    Y  +                       V
Sbjct: 70  SPSFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNV 129

Query: 342 EVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTG 163
            +    +  AN  ++  D+ S  + L   + L G++ +MK++ + ++  MNCTM +NL  
Sbjct: 130 TMDLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQ 189

Query: 162 QAIQGLSC 139
           + ++ + C
Sbjct: 190 KLVRDIKC 197

