BLASTX nr result

ID: Mentha28_contig00010996 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00010996
         (1015 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU23414.1| hypothetical protein MIMGU_mgv1a020543mg [Mimulus...   164   4e-38
ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241...   122   3e-25
ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578...   118   3e-24
ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-r...   110   1e-21
ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620...   104   5e-20
ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citr...   104   7e-20
ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r...   102   3e-19
ref|XP_007210661.1| hypothetical protein PRUPE_ppa022176mg [Prun...   101   4e-19
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...    97   8e-18
ref|XP_002509872.1| conserved hypothetical protein [Ricinus comm...    97   8e-18
ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-r...    97   1e-17
ref|XP_002887922.1| hypothetical protein ARALYDRAFT_474948 [Arab...    96   2e-17
ref|XP_006573794.1| PREDICTED: uncharacterized protein LOC102667...    96   3e-17
gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis]      94   7e-17
ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, part...    92   3e-16
ref|NP_974086.1| late embryogenesis abundant hydroxyproline-rich...    92   4e-16
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...    91   6e-16
ref|XP_006300370.1| hypothetical protein CARUB_v10022353mg [Caps...    91   6e-16
ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297...    91   8e-16
ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...    91   1e-15

>gb|EYU23414.1| hypothetical protein MIMGU_mgv1a020543mg [Mimulus guttatus]
          Length = 198

 Score =  164 bits (416), Expect = 4e-38
 Identities = 103/216 (47%), Positives = 127/216 (58%), Gaps = 7/216 (3%)
 Frame = -3

Query: 779 MDEETY--LNPEAKSDQKIVTNKGLHPTPEQEEHR-----SSKTLVYILLAAVALSIVFL 621
           M+EE++  +NP  KSD++  T      T  +   R     SSK LVYIL+A V  S+ FL
Sbjct: 1   MEEESHRIINPYIKSDEEEFT------TTTKNNRRGKGGGSSKCLVYILVAVVLQSVAFL 54

Query: 620 IFGLVVLRINAPSLRLSNVVVKDLRYSNSSFNATFIADIRLHNMNFGRFDFRGGSAALYY 441
           +FGLV LRI+ PSLRLS+  V  LR+ ++S N T +A IRL N NFG F+F GGSA+L Y
Sbjct: 55  VFGLVALRISNPSLRLSSAAVAVLRHDSASLNMTVVAGIRLRNPNFGDFEFNGGSASLLY 114

Query: 440 GNATXXXXXXXXXXXXXXXXXXIDATVEVMGGPSSAANYLNISRDIESNLVKLVGVAELR 261
           G AT                  I+ T+EV+GG                 LVKL  +AELR
Sbjct: 115 GEATVGVASIYGGRVGRRDKKEINVTMEVIGGGGG------------GELVKLRSMAELR 162

Query: 260 GEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 153
           GE+RV+KIV R R A MNCTMDLNLT QA Q LSCQ
Sbjct: 163 GEVRVVKIVKRRRIAFMNCTMDLNLTSQAFQDLSCQ 198


>ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241699 [Vitis vinifera]
          Length = 213

 Score =  122 bits (305), Expect = 3e-25
 Identities = 78/218 (35%), Positives = 110/218 (50%), Gaps = 9/218 (4%)
 Frame = -3

Query: 779 MDEETYLNPEA------KSDQKIVTNKGLHPTPEQEEHRSSKTLVYILLAAVALSIVFLI 618
           M E+    P A      KSD++    K   P   +   RSSK  VY+L   V L+ + L+
Sbjct: 1   MPEDNQFQPLAPARLHGKSDEEFGVFK---PRASKPPRRSSKCPVYVLAGLVTLAAIALV 57

Query: 617 FGLVVLRINAPSLRLSNVVVKDLRYSNS---SFNATFIADIRLHNMNFGRFDFRGGSAAL 447
           F L VLR+ AP + L +V VK+L +  S   SFN T  A++ + N NFG F+F  G+A +
Sbjct: 58  FALAVLRVEAPDVELKSVAVKNLTHGTSPSPSFNVTLTAEVSVQNKNFGAFNFENGTATV 117

Query: 446 YYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGPSSAANYLNISRDIESNLVKLVGVAE 267
            Y                      ++ T++V        N  N+S DI S  V L   A+
Sbjct: 118 LYEGMVVGDEEFSKAHVESRKTKRMNVTLDVRS--DRLWNDKNLSSDISSGSVNLTTYAQ 175

Query: 266 LRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 153
           + G++RVMK+V R  TA MNC+M LNLT  +IQ L C+
Sbjct: 176 VTGKVRVMKVVRRRTTARMNCSMTLNLTSSSIQDLVCR 213


>ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578735 [Solanum tuberosum]
          Length = 223

 Score =  118 bits (296), Expect = 3e-24
 Identities = 71/205 (34%), Positives = 112/205 (54%), Gaps = 8/205 (3%)
 Frame = -3

Query: 746 KSDQKIVTNKGLH---PTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLR 576
           KSDQ +  +K ++        +  +S K  VY L   V LSI+ LIF +V  R  +PS  
Sbjct: 18  KSDQGLNLSKSINYYNNHNNNKNRKSGKCFVYFLSTIVILSIIMLIFSMVFFRFKSPSFE 77

Query: 575 LSNVVVKDLRYSNS----SFNATFIADIRLHNMNFGRFDFRGGSAALY-YGNATXXXXXX 411
           L ++ V++LR+SNS    SFN     +I + N NFG+ +++  S +++ Y N T      
Sbjct: 78  LDHINVQNLRFSNSTNSSSFNMNMGGEIIVDNDNFGQINYQDSSMSVFLYDNVTIGIANV 137

Query: 410 XXXXXXXXXXXXIDATVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVN 231
                       I  ++++        +Y N+S DI S ++KL    E RG+++ MKI++
Sbjct: 138 NVGRVEARKSKRIGISLQLRTNYQLNYSYGNLSSDINSRMLKLTSFGEFRGKVKAMKIIS 197

Query: 230 RWRTAMMNCTMDLNLTGQAIQGLSC 156
           + +T++MNCTM+LNLT QAIQ L C
Sbjct: 198 KHKTSIMNCTMNLNLTSQAIQDLLC 222


>ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777616|gb|EOY24872.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 213

 Score =  110 bits (274), Expect = 1e-21
 Identities = 63/191 (32%), Positives = 98/191 (51%), Gaps = 3/191 (1%)
 Frame = -3

Query: 716 GLHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSN 537
           G+ PT  Q + +SSK LVY+L+  V    V LIF  +VLR   P + + +V V++L+Y N
Sbjct: 25  GIKPTASQRKEKSSKCLVYVLVGMVIQGAVLLIFASIVLRARTPDVEIVSVTVRNLKYGN 84

Query: 536 S---SFNATFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDA 366
           S   SFN T + ++ + N NFG F F   +  ++ G+                    ++ 
Sbjct: 85  SSAPSFNLTLVTEVTVENSNFGDFKFENTTGTVWCGSVVVGKMKIPTGRAQARATERLNV 144

Query: 365 TVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNL 186
           +V+V   P    +  N+S +I S L++L    +L G++ +M  + R R   MNC M LNL
Sbjct: 145 SVDVSSLP--LPDTKNVSCNISSGLLELNSHVKLSGKVSIMNFMKRRRHPEMNCFMTLNL 202

Query: 185 TGQAIQGLSCQ 153
           TGQ  Q   C+
Sbjct: 203 TGQTKQDFPCE 213


>ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620163 [Citrus sinensis]
          Length = 214

 Score =  104 bits (260), Expect = 5e-20
 Identities = 60/204 (29%), Positives = 106/204 (51%), Gaps = 3/204 (1%)
 Frame = -3

Query: 758 NPEAKSDQKIVTNKGLHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSL 579
           N   +SDQ+        P   + + +SSK LVY+L+  V +S   LI   + LR N P +
Sbjct: 15  NEYPRSDQEYA------PAVIESQRKSSKCLVYVLVTIVTVSAALLISASIFLRPNTPEV 68

Query: 578 RLSNVVVKDLRYSNS---SFNATFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXX 408
           +L +V VK+L + N    SFN T + ++ + N N+G F+++  S +++YG+ T       
Sbjct: 69  QLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGSVFYGSVTVGDVKIR 128

Query: 407 XXXXXXXXXXXIDATVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNR 228
                      I+ TV+V    +   +  N+  DI S +VKL   A+L G + +  ++ +
Sbjct: 129 DGRVEAREVKRINVTVDVDVRSNGNLDNQNLRSDINSGIVKLNSYAKLHGNVSLFNVLKK 188

Query: 227 WRTAMMNCTMDLNLTGQAIQGLSC 156
            +T  ++C+M+L L  +A++ L C
Sbjct: 189 TKTPELDCSMNLVLARRAVEDLVC 212


>ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citrus clementina]
           gi|557541714|gb|ESR52692.1| hypothetical protein
           CICLE_v10023929mg [Citrus clementina]
          Length = 214

 Score =  104 bits (259), Expect = 7e-20
 Identities = 60/204 (29%), Positives = 106/204 (51%), Gaps = 3/204 (1%)
 Frame = -3

Query: 758 NPEAKSDQKIVTNKGLHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSL 579
           N   +SDQ+        P   + + +SSK LVY+L+  V +S   LI   + LR N P +
Sbjct: 15  NEYPRSDQEYA------PAVIESQRKSSKCLVYVLVTIVTVSAALLISASIFLRPNTPEV 68

Query: 578 RLSNVVVKDLRYSNS---SFNATFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXX 408
           +L +V VK+L + N    SFN T + ++ + N N+G F+++  S +++YG+ T       
Sbjct: 69  QLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGSVFYGSVTVGDVKIR 128

Query: 407 XXXXXXXXXXXIDATVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNR 228
                      I+ TV+V    +   +  N+S D  S +VKL   A+L G + +  ++ +
Sbjct: 129 DGRVEAREVKRINVTVDVDVRSNGNLDNQNLSSDRNSGIVKLNSYAKLHGNVNLFNVLKK 188

Query: 227 WRTAMMNCTMDLNLTGQAIQGLSC 156
            +T  ++C+M+L L  +A++ L C
Sbjct: 189 TKTPELDCSMNLVLARRAVEDLVC 212


>ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777614|gb|EOY24870.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 188

 Score =  102 bits (254), Expect = 3e-19
 Identities = 60/180 (33%), Positives = 92/180 (51%), Gaps = 3/180 (1%)
 Frame = -3

Query: 683 RSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS---FNATFI 513
           ++ K   YI+   V  +I+ L+F L V+RI  PS RL +V V+ L Y+ S    FN   I
Sbjct: 11  QNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASGVPHFNMRLI 70

Query: 512 ADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGPSSA 333
            +I + N NFG F F   +A + +G+                    ++ TV+V     S 
Sbjct: 71  MEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVSSSAVSD 130

Query: 332 ANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 153
            + L     + S  + L GVA LRG++ +MK++ + +TA MNCTM +NL   A+Q L C+
Sbjct: 131 EDELRTK--LSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQDLDCE 188


>ref|XP_007210661.1| hypothetical protein PRUPE_ppa022176mg [Prunus persica]
           gi|462406396|gb|EMJ11860.1| hypothetical protein
           PRUPE_ppa022176mg [Prunus persica]
          Length = 213

 Score =  101 bits (252), Expect = 4e-19
 Identities = 59/182 (32%), Positives = 95/182 (52%), Gaps = 5/182 (2%)
 Frame = -3

Query: 683 RSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYS---NSSFNATFI 513
           RS+K  VY+  A V  SI  L+F LVVLR+ +P   LS+V VK L+++    SS NAT +
Sbjct: 34  RSNKCFVYVFAAIVLQSIFILVFALVVLRVKSPGFNLSSVSVKSLKHTTSPTSSLNATLV 93

Query: 512 ADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGG--PS 339
            ++ + N NFG + F G SA+L+YG                     +  +++V     P 
Sbjct: 94  TELAIKNKNFGEYKFEGSSASLWYGGFKVGEAKIGKGRVKARGTRRVSLSIDVRSNRLPQ 153

Query: 338 SAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLS 159
            A N      ++ S  +K+   A+L G++ +MKI+ + +T   NCTM + L  + ++ L 
Sbjct: 154 EAKN--GFEGEMNSGYLKISSYAKLTGKVNLMKIMKKRKTIDTNCTMVVVLKSRTVKDLF 211

Query: 158 CQ 153
           C+
Sbjct: 212 CR 213


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score = 97.4 bits (241), Expect = 8e-18
 Identities = 56/194 (28%), Positives = 95/194 (48%), Gaps = 4/194 (2%)
 Frame = -3

Query: 722 NKGLHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRY 543
           N  +    E +  +  K   Y     V  +IV L+F L V+RI  P  R+ ++ V+D+ Y
Sbjct: 10  NIDMESAAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAY 69

Query: 542 SNS----SFNATFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXX 375
           +++    SFN  F A++ + N NFG F F   + +  YG                     
Sbjct: 70  TSTPNPPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKK 129

Query: 374 IDATVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMD 195
           ++ TV++      A +  N++ DI S  + L    +L G++ +MK++ + ++A MNCTM 
Sbjct: 130 MNVTVDLNSNNIPANS--NLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMT 187

Query: 194 LNLTGQAIQGLSCQ 153
           +NL  +AIQ + CQ
Sbjct: 188 VNLASRAIQDIKCQ 201


>ref|XP_002509872.1| conserved hypothetical protein [Ricinus communis]
           gi|223549771|gb|EEF51259.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 221

 Score = 97.4 bits (241), Expect = 8e-18
 Identities = 62/189 (32%), Positives = 95/189 (50%), Gaps = 11/189 (5%)
 Frame = -3

Query: 689 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS------- 531
           + RSSK LVY+L   V LS V L+F LVVLR   P+  LS V +KDL Y+  S       
Sbjct: 33  QERSSKCLVYVLAGIVILSAVILVFALVVLRPVNPNAELSFVRLKDLNYAAGSGGNGNNV 92

Query: 530 ----FNATFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDAT 363
               FN T  +++++ N NFG F +   SA ++YG                     ++  
Sbjct: 93  SLPAFNMTLESELKIENSNFGEFKYDNTSARVFYGGMAVGEAILREGRVSARDTLRMNVK 152

Query: 362 VEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLT 183
           VEV        N  +++ DI S ++KL   A+  G + +++I  + R+A M+C+  L+L 
Sbjct: 153 VEVR-SHKYIYNGTDLTSDINSGILKLNSHAKFSGRVNLLQIAKKRRSASMDCSFSLDLR 211

Query: 182 GQAIQGLSC 156
            ++IQ L C
Sbjct: 212 SRSIQDLVC 220


>ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721845|gb|EOY13742.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 192

 Score = 96.7 bits (239), Expect = 1e-17
 Identities = 56/181 (30%), Positives = 93/181 (51%), Gaps = 4/181 (2%)
 Frame = -3

Query: 683 RSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSSFNATFI--- 513
           R+ K    ++   +A +I+ L+F L+V+RI  P +RL  V V++LR S+SS + +F    
Sbjct: 12  RNIKCWAIVVAGVIAKTIIILLFVLIVMRIRNPKVRLGGVTVENLRASSSSSSPSFSTKL 71

Query: 512 -ADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGPSS 336
            A + + N NFG F F+  +  + Y  +                    + T+ V      
Sbjct: 72  NAQVSVKNTNFGHFKFKNSTLTISYNGSPVGKATIVEGLARARSTKKFNVTILVSSNNKI 131

Query: 335 AANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSC 156
           + N   +S DIES  + L   A+L G+I + KI  + ++A MNCTMD+N + + IQ L+C
Sbjct: 132 SRNSDQLSSDIESGTINLSSHAKLEGKIHLFKIFKKKKSAEMNCTMDVNTSLKQIQKLTC 191

Query: 155 Q 153
           +
Sbjct: 192 K 192


>ref|XP_002887922.1| hypothetical protein ARALYDRAFT_474948 [Arabidopsis lyrata subsp.
           lyrata] gi|297333763|gb|EFH64181.1| hypothetical protein
           ARALYDRAFT_474948 [Arabidopsis lyrata subsp. lyrata]
          Length = 214

 Score = 95.9 bits (237), Expect = 2e-17
 Identities = 59/183 (32%), Positives = 89/183 (48%), Gaps = 4/183 (2%)
 Frame = -3

Query: 689 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS----FNA 522
           E    K LVY L   V +  + LI   + LRI+ P +   ++  +DLR+  +S    FNA
Sbjct: 33  EEPPGKCLVYSLTIIVIIFALCLILSSIFLRISKPEIETRSISTRDLRFGGNSTNPYFNA 92

Query: 521 TFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGP 342
           T ++DI + N NFG F+F   S  + Y +                        V V  G 
Sbjct: 93  TLVSDISIRNSNFGAFEFEDSSLRVVYADHGVVGETKIAGRRVEAHKTVRITDVVVEIGS 152

Query: 341 SSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGL 162
               N  ++  D+    ++L  VAE+RG I+V+    RW+ ++M+CTM LNLTG+ IQ L
Sbjct: 153 FRLLNTKDLDSDLRLGFLELRSVAEVRGRIKVLGR-RRWKVSVMSCTMRLNLTGRFIQNL 211

Query: 161 SCQ 153
            C+
Sbjct: 212 LCE 214


>ref|XP_006573794.1| PREDICTED: uncharacterized protein LOC102667904 [Glycine max]
          Length = 184

 Score = 95.5 bits (236), Expect = 3e-17
 Identities = 60/182 (32%), Positives = 88/182 (48%), Gaps = 1/182 (0%)
 Frame = -3

Query: 695 QEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS-SFNAT 519
           Q+E RS K  VY+L A V L  + L+F  + LR+  P L+L +     + YS S SFNAT
Sbjct: 6   QQERRSGKCFVYLLAAFVILCALVLVFASL-LRVKNPYLKLRSATSNKISYSTSPSFNAT 64

Query: 518 FIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGPS 339
            I  + + N NFG F +     ++ Y                      I+ TV++M   +
Sbjct: 65  LIIFLGIKNPNFGAFSYNNNRVSVLYAGVKIADRQINGGRVRFRETKEINVTVKLMSAKA 124

Query: 338 SAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLS 159
             +   N+S DI S  + L    +  G + ++KI+N  +T  M C M LNLT   IQG+ 
Sbjct: 125 PISE--NLSIDISSGSLNLTSNVKFSGTVHMLKIINIRKTIEMACAMKLNLTSHTIQGIQ 182

Query: 158 CQ 153
           CQ
Sbjct: 183 CQ 184


>gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis]
          Length = 213

 Score = 94.4 bits (233), Expect = 7e-17
 Identities = 59/182 (32%), Positives = 89/182 (48%), Gaps = 4/182 (2%)
 Frame = -3

Query: 689 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS---SFNAT 519
           + R++K  VYI    V L  + LIF L+VLR  +P ++L +V VK L YS S   S NAT
Sbjct: 32  KERTNKCFVYIFAGIVILGAILLIFALIVLRSKSPEIKLKSVTVKSLDYSTSPWPSLNAT 91

Query: 518 FIADIRLHNMNFGRFDFRGGSAALY-YGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGP 342
            IA + + N NFG + F   ++A++ YG                     ++ TVE+    
Sbjct: 92  LIATVAIKNPNFGPYRFGSNNSAVFLYGGGKLGEQRIRQGKATAKATKRVNVTVEIRTSR 151

Query: 341 SSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGL 162
               +  N+  D+ S +V L    +  G + ++KI    +TA MNC M L L  + I+ L
Sbjct: 152 LPQGSN-NLGGDLSSGMVNLSSYCKFTGRVHLIKIFENRKTAEMNCAMTLVLKTKMIKNL 210

Query: 161 SC 156
            C
Sbjct: 211 RC 212


>ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, partial [Populus
           trichocarpa] gi|550346930|gb|ERP65340.1| hypothetical
           protein POPTR_0001s09980g, partial [Populus trichocarpa]
          Length = 173

 Score = 92.4 bits (228), Expect = 3e-16
 Identities = 57/170 (33%), Positives = 91/170 (53%), Gaps = 3/170 (1%)
 Frame = -3

Query: 656 LLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS---SFNATFIADIRLHNMN 486
           L   V LS + L+F ++V +   P ++LS+V V+ L Y N+   SFN T  A++ + N N
Sbjct: 7   LALIVILSAIILVFAIIV-KPRTPRVKLSSVAVEHLSYGNNPIPSFNMTLAAEVSVKNSN 65

Query: 485 FGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGPSSAANYLNISRD 306
           F RF F   S++  Y                      ++  V++ G P S +   N+S D
Sbjct: 66  FVRFKFENTSSSALYKGMVVGEAKLRSGRVGARKTRRMNIVVKI-GSPGSLSEAKNLSSD 124

Query: 305 IESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSC 156
           I S ++K+   A L+G++R+  IV   RTA+M+C M+LNL+ ++IQ L C
Sbjct: 125 INSGMLKMNSYATLKGDVRLFGIVKN-RTAVMSCGMNLNLSSRSIQDLEC 173


>ref|NP_974086.1| late embryogenesis abundant hydroxyproline-rich glycoprotein
           [Arabidopsis thaliana] gi|49823490|gb|AAT68728.1|
           hypothetical protein At1g64065 [Arabidopsis thaliana]
           gi|55740529|gb|AAV63857.1| hypothetical protein
           At1g64065 [Arabidopsis thaliana]
           gi|332196066|gb|AEE34187.1| late embryogenesis abundant
           hydroxyproline-rich glycoprotein [Arabidopsis thaliana]
          Length = 214

 Score = 91.7 bits (226), Expect = 4e-16
 Identities = 57/183 (31%), Positives = 89/183 (48%), Gaps = 4/183 (2%)
 Frame = -3

Query: 689 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS----FNA 522
           E    K LVY L   V +  + LI   + LRI+ P +   ++  +DLR   +S    FNA
Sbjct: 33  EEPPGKCLVYSLTIIVIIFALCLILSSIFLRISKPEIETRSISTRDLRSGGNSTNPYFNA 92

Query: 521 TFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGP 342
           T ++DI + N NFG F+F   +  + Y +                        V V  G 
Sbjct: 93  TLVSDISIRNSNFGAFEFEDSTLRVVYADHGVVGETKIEGRRVEAHKTVRITGVVVEIGS 152

Query: 341 SSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGL 162
               +  ++ +D+    ++L  VAE+RG I+V+    RW+ ++M+CTM LNLTG+ IQ L
Sbjct: 153 FRLLDTKDLDKDLRLGFLELRSVAEVRGRIKVLGR-KRWKVSVMSCTMRLNLTGRFIQNL 211

Query: 161 SCQ 153
            C+
Sbjct: 212 LCE 214


>ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721844|gb|EOY13741.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 259

 Score = 91.3 bits (225), Expect = 6e-16
 Identities = 48/188 (25%), Positives = 88/188 (46%), Gaps = 2/188 (1%)
 Frame = -3

Query: 713 LHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS 534
           +  + E +  +  K L Y+    +  + + L+F L V+RI  P  R+ +V+V DL ++NS
Sbjct: 10  MEQSKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNS 69

Query: 533 S--FNATFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATV 360
           S  FN  FIA + + N NFG + F   +    Y  +                       V
Sbjct: 70  SPSFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNV 129

Query: 359 EVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTG 180
            +    +  AN  ++  D+ S  + L   + L G++ +MK++ + ++  MNCTM +NL  
Sbjct: 130 TMDLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQ 189

Query: 179 QAIQGLSC 156
           + ++ + C
Sbjct: 190 KLVRDIKC 197


>ref|XP_006300370.1| hypothetical protein CARUB_v10022353mg [Capsella rubella]
           gi|482569080|gb|EOA33268.1| hypothetical protein
           CARUB_v10022353mg [Capsella rubella]
          Length = 215

 Score = 91.3 bits (225), Expect = 6e-16
 Identities = 60/182 (32%), Positives = 86/182 (47%), Gaps = 4/182 (2%)
 Frame = -3

Query: 689 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS----FNA 522
           E    K LVY L   V +  V LI   + LRI+ P +   +V  +DLR   +S    FNA
Sbjct: 34  EEPPGKCLVYSLTIIVIVFAVCLILSSIFLRISKPEIETRSVSTRDLRSGGNSTNPYFNA 93

Query: 521 TFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGP 342
           T ++DI + N NFG F+F   S  + Y +                        V V  G 
Sbjct: 94  TLVSDISIRNSNFGAFEFEDSSLRVVYADHGVVGEATIPGRRVEAHKTVRITGVVVEIGS 153

Query: 341 SSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGL 162
               +   +  D+ S  ++L  VAE+RG I+V+    RW+ ++M+CTM LNLT + IQ L
Sbjct: 154 FRLLDRKGLELDLRSGFLELRSVAEVRGRIKVLG-RRRWKVSVMSCTMRLNLTNRFIQNL 212

Query: 161 SC 156
            C
Sbjct: 213 FC 214


>ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297644 [Fragaria vesca
           subsp. vesca]
          Length = 210

 Score = 90.9 bits (224), Expect = 8e-16
 Identities = 53/180 (29%), Positives = 90/180 (50%), Gaps = 3/180 (1%)
 Frame = -3

Query: 683 RSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS--SFNATFIA 510
           RS+K  VY+    V   +  L+F L+VLR+ +P +RL +V VK L+Y++S  SFN +   
Sbjct: 33  RSNKCFVYVFSGIVFFCVTVLVFALLVLRVKSPEIRLRSVTVKSLKYTSSPPSFNVSLSG 92

Query: 509 DIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGP-SSA 333
            + + N NFG ++F   + +  Y                      +   V++        
Sbjct: 93  QMSVKNPNFGDYEFVPTTVSFLYSRGAVGSTKVAKGLAKVKKTERLSFGVDLRSNKLPEG 152

Query: 332 ANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 153
           AN   +  DI S ++KL G  ++ G++ + KI+N+ +T  M+CTM L L  + I+ L C+
Sbjct: 153 AN--TLKSDINSGMLKLTGTGKVSGKVTLWKIINKRKTGKMDCTMTLVLKSKTIKDLVCR 210


>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 185

 Score = 90.5 bits (223), Expect = 1e-15
 Identities = 49/179 (27%), Positives = 88/179 (49%), Gaps = 3/179 (1%)
 Frame = -3

Query: 680 SSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS---FNATFIA 510
           ++K L Y+ +  V  + + LIF L V+RI  P +R   V V++    NSS   F+   +A
Sbjct: 9   NAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMRLMA 68

Query: 509 DIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGPSSAA 330
            + + N NFG F +   S  + YG                      D T+++     S  
Sbjct: 69  QVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKLSTN 128

Query: 329 NYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 153
           +  N+  DI S ++ L   A+L G++ +MK++ + +++ M+CTM +N+  + +Q L C+
Sbjct: 129 S--NLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQDLKCK 185


Top