BLASTX nr result

ID: Mentha29_contig00006567 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00006567
         (990 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007051691.1| Late embryogenesis abundant (LEA) hydroxypro...   194   5e-47
ref|XP_002303614.1| hypothetical protein POPTR_0003s13320g [Popu...   184   6e-44
ref|XP_002272605.2| PREDICTED: uncharacterized protein LOC100246...   181   5e-43
gb|EYU35113.1| hypothetical protein MIMGU_mgv1a013462mg [Mimulus...   180   7e-43
ref|XP_002277286.1| PREDICTED: uncharacterized protein LOC100258...   180   9e-43
ref|XP_004306727.1| PREDICTED: uncharacterized protein LOC101306...   179   1e-42
ref|XP_002511928.1| conserved hypothetical protein [Ricinus comm...   179   2e-42
gb|EXC34332.1| hypothetical protein L484_006687 [Morus notabilis]     177   8e-42
ref|XP_006477512.1| PREDICTED: uncharacterized protein LOC102619...   176   1e-41
ref|XP_006439451.1| hypothetical protein CICLE_v10023484mg [Citr...   176   2e-41
ref|XP_002509874.1| conserved hypothetical protein [Ricinus comm...   174   4e-41
ref|XP_002299505.1| hypothetical protein POPTR_0001s09990g [Popu...   174   5e-41
ref|XP_006352263.1| PREDICTED: uncharacterized protein LOC102590...   174   7e-41
ref|XP_002301379.1| hypothetical protein POPTR_0002s16660g [Popu...   174   7e-41
ref|XP_007210919.1| hypothetical protein PRUPE_ppa021244mg [Prun...   172   1e-40
ref|XP_007040372.1| Late embryogenesis abundant hydroxyproline-r...   172   2e-40
ref|XP_003548621.1| PREDICTED: uncharacterized protein LOC100799...   171   4e-40
gb|ACU19833.1| unknown [Glycine max]                                  171   4e-40
gb|EXB52691.1| hypothetical protein L484_022468 [Morus notabilis]     170   7e-40
ref|XP_007219590.1| hypothetical protein PRUPE_ppa023497mg [Prun...   167   6e-39

>ref|XP_007051691.1| Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein
           family, putative isoform 1 [Theobroma cacao]
           gi|590721708|ref|XP_007051692.1| Late embryogenesis
           abundant (LEA) hydroxyproline-rich glycoprotein family,
           putative isoform 1 [Theobroma cacao]
           gi|508703952|gb|EOX95848.1| Late embryogenesis abundant
           (LEA) hydroxyproline-rich glycoprotein family, putative
           isoform 1 [Theobroma cacao] gi|508703953|gb|EOX95849.1|
           Late embryogenesis abundant (LEA) hydroxyproline-rich
           glycoprotein family, putative isoform 1 [Theobroma
           cacao]
          Length = 220

 Score =  194 bits (493), Expect = 5e-47
 Identities = 93/222 (41%), Positives = 147/222 (66%), Gaps = 1/222 (0%)
 Frame = +1

Query: 40  MAERREQEKPLSPFTP-PLAVDKRTPFAAEFISRRRYRCLKCCGCCAALFLILAVTILIL 216
           M   R+Q +PL+P +  P + D       + + R++  C+KCCGC AAL +I AV I+IL
Sbjct: 1   MVVDRDQVRPLAPASDLPSSDDGEAALQLKKVQRKK--CVKCCGCIAALMIIQAVVIIIL 58

Query: 217 MITVLHIKDPTLKLNSVKINGLDLLSNTTNNSAQNLTLVADISIKNPNTASFKFDASTTD 396
           + TV  +KDP +K+N V +  L+L++ TT     N++L+AD+S+KNPN ASFK+  +TT 
Sbjct: 59  VFTVFRVKDPVIKMNGVAVTHLELINGTTPKPGSNISLIADVSVKNPNVASFKYKNTTTT 118

Query: 397 VFYDGTVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLSTSTSL 576
           ++Y GT+VGE R P G A ARRT RMN+SVD++ +R++      +D+ +  LT+S+ + +
Sbjct: 119 LYYYGTIVGEARGPAGRAKARRTMRMNISVDIITDRLLASPNLVADVNSGTLTMSSYSRI 178

Query: 577 RGVVKIADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRRDVAL 702
            G V + +++K+   VKMNC+++VN+ S+ IQ   C+R V L
Sbjct: 179 GGRVNMLNIIKKHVTVKMNCSMTVNISSQAIQEQKCKRKVDL 220


>ref|XP_002303614.1| hypothetical protein POPTR_0003s13320g [Populus trichocarpa]
           gi|222841046|gb|EEE78593.1| hypothetical protein
           POPTR_0003s13320g [Populus trichocarpa]
          Length = 219

 Score =  184 bits (466), Expect = 6e-44
 Identities = 98/222 (44%), Positives = 142/222 (63%), Gaps = 1/222 (0%)
 Frame = +1

Query: 40  MAERREQEKPLSPFTPPLAVDKRTPFAAEFISRRRYRCLKCCGCCAALFLILAVTILILM 219
           MAE  EQ KPL+P    +  D+      +  +RRR  C+KCCGC  A+ LI+AVTI++L+
Sbjct: 1   MAET-EQVKPLAPAAFQIRSDEEETMPVQLKTRRR-NCIKCCGCITAMLLIVAVTIVVLV 58

Query: 220 ITVLHIKDPTLKLNSVKINGLDLLSNTTNNSAQNLTLVADISIKNPNTASFKFDASTTDV 399
           +TV H+KDP +K+N V +  L+L + T      N+TL+AD+S+KNPN ASFKF+  TT +
Sbjct: 59  LTVFHVKDPVIKMNRVFVQRLELANGTLRTDV-NVTLLADVSVKNPNAASFKFEKGTTTI 117

Query: 400 FYDGTVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDL-IARNLTLSTSTSL 576
           +Y G VVGE   P G A ARRT  MNV+VD++  +++ V R  SD+  A  L +++ST +
Sbjct: 118 YYGGAVVGEANTPPGMAKARRTLHMNVTVDLIPAKLLAVPRSFSDIRSAGELNMTSSTII 177

Query: 577 RGVVKIADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRRDVAL 702
            G V+I    K+  VV +NCTV+ NL S  I   NC+  +++
Sbjct: 178 SGKVRILHTFKKYIVVGVNCTVTYNLASREIHGGNCKPHLSI 219


>ref|XP_002272605.2| PREDICTED: uncharacterized protein LOC100246818 [Vitis vinifera]
          Length = 214

 Score =  181 bits (458), Expect = 5e-43
 Identities = 94/217 (43%), Positives = 141/217 (64%)
 Frame = +1

Query: 40  MAERREQEKPLSPFTPPLAVDKRTPFAAEFISRRRYRCLKCCGCCAALFLILAVTILILM 219
           MAE+ +Q KPL+P    L  D+    + E       + +KCCGC  AL LI AV +L+L 
Sbjct: 1   MAEK-DQSKPLAPAGYNLRSDEEEAMSRELKKLPPRKYIKCCGCITALILIQAVILLVLA 59

Query: 220 ITVLHIKDPTLKLNSVKINGLDLLSNTTNNSAQNLTLVADISIKNPNTASFKFDASTTDV 399
            T+  +KDP +K+N ++I  L+L  +T      NLTLVAD+S+KNPN ASF F  +TT +
Sbjct: 60  FTIFRVKDPVIKMNGMRIGPLELEDST------NLTLVADVSVKNPNVASFIFSNATTSI 113

Query: 400 FYDGTVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLSTSTSLR 579
            Y+GTV+G+ R P G+A ARRT RMNV+V+++ E+++ +    + L +R + +S+ T + 
Sbjct: 114 TYNGTVIGQARTPPGKAKARRTLRMNVTVEIIPEKIMAIMAVTTLLSSRAINISSYTRIS 173

Query: 580 GVVKIADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRR 690
           G VKI  ++K++ VVK+NCT+ VNL S  IQ  +C+R
Sbjct: 174 GRVKIIKIIKKNVVVKLNCTMMVNLTSREIQGQSCKR 210


>gb|EYU35113.1| hypothetical protein MIMGU_mgv1a013462mg [Mimulus guttatus]
          Length = 220

 Score =  180 bits (457), Expect = 7e-43
 Identities = 97/226 (42%), Positives = 144/226 (63%), Gaps = 5/226 (2%)
 Frame = +1

Query: 40  MAERREQEKPLSPFTPPLAVDKRTPF---AAEFISRRRYRCLKCCGCCAALFLILAVTIL 210
           M E  EQ +PL+P     A D+ T     +A     RR RCLKCCGC AA+ LI AV ++
Sbjct: 1   MVETSEQVRPLAP-----AYDRHTSSDDESATVTGIRRRRCLKCCGCAAAVILIQAVVVV 55

Query: 211 ILMITVLHIKDPTLKLNSVKINGLDLLSNTTNNS--AQNLTLVADISIKNPNTASFKFDA 384
           IL+ TV  +K+P +++N V +  LDL++ TT       N+TL AD+S+KNPN ASF++  
Sbjct: 56  ILIFTVFKVKEPVIRMNKVTVLNLDLVNGTTTTPRPGSNMTLNADVSVKNPNHASFRYMN 115

Query: 385 STTDVFYDGTVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLST 564
           +TT ++Y G V+GE R P G A ARRT RMNV+V+V+ +R++      +D+    LT+ST
Sbjct: 116 TTTTLYYRGAVIGEARGPPGSARARRTMRMNVTVEVITDRVLSNPDLGTDINTGLLTMST 175

Query: 565 STSLRGVVKIADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRRDVAL 702
            T + G VK+  ++K+   V+MNC++++N+ S+ IQ   C+R V L
Sbjct: 176 YTVVGGRVKML-MIKKHLTVRMNCSMTINITSQAIQEQKCKRKVKL 220


>ref|XP_002277286.1| PREDICTED: uncharacterized protein LOC100258567 [Vitis vinifera]
           gi|147832282|emb|CAN73280.1| hypothetical protein
           VITISV_040609 [Vitis vinifera]
           gi|297745338|emb|CBI40418.3| unnamed protein product
           [Vitis vinifera]
          Length = 220

 Score =  180 bits (456), Expect = 9e-43
 Identities = 92/219 (42%), Positives = 144/219 (65%), Gaps = 2/219 (0%)
 Frame = +1

Query: 52  REQEKPLSPFTPPLAVDKRTPFAAEFISR-RRYRCLKCCGCCAALFLILAVTILILMITV 228
           REQ +PL+P +  L+ +         +SR RR RC+KC GC AA  LI A  ++IL+ TV
Sbjct: 4   REQVRPLAPASHRLSSEDDK--VTNHLSRLRRRRCIKCWGCIAATILIQAAVVIILVFTV 61

Query: 229 LHIKDPTLKLNSVKINGLDLLSNTTN-NSAQNLTLVADISIKNPNTASFKFDASTTDVFY 405
             +KDP +KLN   ++ L+L++ TT      N++L AD+S+KNPN ASF++  +TT +FY
Sbjct: 62  FRVKDPVIKLNGFTVDKLELINGTTTPGPGVNMSLTADVSVKNPNFASFRYKNTTTTLFY 121

Query: 406 DGTVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLSTSTSLRGV 585
            GTV+GE R P G+A ARRT +MNV+++++++ ++      +D+ +  L ++T + + G 
Sbjct: 122 SGTVIGEARGPPGQAKARRTMKMNVTIEIILDSLMSNPSLLTDISSGILPMNTYSRVPGR 181

Query: 586 VKIADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRRDVAL 702
           VK+  ++K+  VVKMNC+V+VN+ S  IQ   C+RDV L
Sbjct: 182 VKMLKIIKKHVVVKMNCSVTVNITSRSIQEQKCKRDVNL 220


>ref|XP_004306727.1| PREDICTED: uncharacterized protein LOC101306460 [Fragaria vesca
           subsp. vesca]
          Length = 219

 Score =  179 bits (455), Expect = 1e-42
 Identities = 90/217 (41%), Positives = 140/217 (64%)
 Frame = +1

Query: 52  REQEKPLSPFTPPLAVDKRTPFAAEFISRRRYRCLKCCGCCAALFLILAVTILILMITVL 231
           +EQ +PL+P     + D         I+RR+ + + CCGC  A+ LI AV I+IL  TV 
Sbjct: 4   KEQARPLAPAGYRPSSDDNEAALHMKIARRK-KFINCCGCITAIVLIQAVVIIILAFTVF 62

Query: 232 HIKDPTLKLNSVKINGLDLLSNTTNNSAQNLTLVADISIKNPNTASFKFDASTTDVFYDG 411
            +K+P + +N V +  L+L++ TT     N++L AD+S+KNPN ASFK+  +TT ++Y G
Sbjct: 63  RVKEPKIMMNKVTVTKLELVNGTTPKPGTNISLTADVSVKNPNVASFKYSNTTTTLYYHG 122

Query: 412 TVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLSTSTSLRGVVK 591
           TVVGE R P G A ARRT RMN++VD++ + +      ++D+ +  LT+S+ + + G V 
Sbjct: 123 TVVGEARGPPGRAKARRTMRMNITVDIITDILTTNPNLKTDVGSGLLTMSSYSRIPGRVN 182

Query: 592 IADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRRDVAL 702
           + ++VK+  VVKMNCT++VN+ S+ IQ   C+R V+L
Sbjct: 183 MLNIVKKHVVVKMNCTMTVNISSQAIQEQKCKRKVSL 219


>ref|XP_002511928.1| conserved hypothetical protein [Ricinus communis]
           gi|223549108|gb|EEF50597.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 218

 Score =  179 bits (453), Expect = 2e-42
 Identities = 87/216 (40%), Positives = 135/216 (62%)
 Frame = +1

Query: 55  EQEKPLSPFTPPLAVDKRTPFAAEFISRRRYRCLKCCGCCAALFLILAVTILILMITVLH 234
           EQ +PL+P     + D            RR RC+KCCGC  A  L+ A+ I+IL+ TV  
Sbjct: 5   EQVRPLAPSADRTSSDDEEA-TIHLKKTRRRRCIKCCGCITASLLVPAIVIVILIFTVFR 63

Query: 235 IKDPTLKLNSVKINGLDLLSNTTNNSAQNLTLVADISIKNPNTASFKFDASTTDVFYDGT 414
           +KDPT+KLN+V I  ++L++NT      N++LVAD+S+KNPN  SFK+D +T+ ++Y G 
Sbjct: 64  VKDPTIKLNNVIITHMELINNTIPKPGTNISLVADLSVKNPNIVSFKYDNTTSALYYHGV 123

Query: 415 VVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLSTSTSLRGVVKI 594
           +VGE R P G + ARRT R+N ++D++ ++++      +D     LT+ + T L G VKI
Sbjct: 124 LVGEARGPPGHSKARRTMRLNATIDLVADKLISNPNLNTDAATGLLTVDSYTKLPGKVKI 183

Query: 595 ADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRRDVAL 702
             ++K+   +KMNC+++VN+ S+ IQ   C+  V L
Sbjct: 184 L-IIKKHVTIKMNCSLTVNISSQAIQSQKCKNKVDL 218


>gb|EXC34332.1| hypothetical protein L484_006687 [Morus notabilis]
          Length = 228

 Score =  177 bits (448), Expect = 8e-42
 Identities = 97/228 (42%), Positives = 142/228 (62%), Gaps = 7/228 (3%)
 Frame = +1

Query: 40  MAERREQEKPLSPFTPPLAVDKRTPFAAE-------FISRRRYRCLKCCGCCAALFLILA 198
           MA+R+EQ KPL+P       D+      +       F  RRR  C+K CGC +A+ +I A
Sbjct: 1   MADRKEQVKPLAPAFYLFRSDEEDNTNNDDNKNKSFFADRRRNSCVKRCGCASAILVIAA 60

Query: 199 VTILILMITVLHIKDPTLKLNSVKINGLDLLSNTTNNSAQNLTLVADISIKNPNTASFKF 378
           VT++IL ITV H+K P +K+ SV ++ L   +N T ++ +N+TLVA +S+KNPN ASF++
Sbjct: 61  VTMMILAITVFHVKGPIVKMTSVTVDPLQTYANGTIDTDKNVTLVAGVSVKNPNAASFRY 120

Query: 379 DASTTDVFYDGTVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTL 558
             +TT VFY G  VGE     G+A ARRT +MN++V++   +M+       D  +  LT 
Sbjct: 121 ANTTTTVFYGGAAVGEGWNAAGKAKARRTVKMNLTVEISTAKMLESPGLLKDWGSGELTF 180

Query: 559 STSTSLRGVVKIADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRRDVAL 702
            + T + G VKI DVVK+  VVK+NCTVS N+ S+ I+  +C+R V+L
Sbjct: 181 DSYTRIEGRVKITDVVKKKVVVKLNCTVSYNVSSKGIERQHCKRHVSL 228


>ref|XP_006477512.1| PREDICTED: uncharacterized protein LOC102619888 [Citrus sinensis]
          Length = 224

 Score =  176 bits (446), Expect = 1e-41
 Identities = 91/216 (42%), Positives = 144/216 (66%), Gaps = 4/216 (1%)
 Frame = +1

Query: 37  AMAERREQEKPLSPFTPPLAVD--KRTPFAAEFISRRRYRCLKCCGCCAALFLILAVTIL 210
           A A   EQ KPL+P    L  D   +     +   + R RCL+C GC  AL L+ AV I+
Sbjct: 2   ANANNSEQVKPLAPAEYHLRSDYEDQAMSGGKLTLQHRRRCLQCFGCVTALLLV-AVVII 60

Query: 211 ILMITVLHIKDPTLKLNSVKINGLDLLSNTTNNSA--QNLTLVADISIKNPNTASFKFDA 384
           I++ TV H+K+P++++NSV I  L+ L+N++  S+  +N+TL+AD+S+KNPN  SFK+  
Sbjct: 61  IVIATVFHVKNPSIRMNSVTIQRLEFLNNSSVRSSGDENVTLLADVSVKNPNYNSFKYGN 120

Query: 385 STTDVFYDGTVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLST 564
           +TT ++Y G VVGE   P+ +A ARRT RMNV+VD+  E+ + V   +SD+++R+L +S+
Sbjct: 121 TTTSIYYGGVVVGEGHIPQAQAKARRTLRMNVTVDLNPEKFLTVPSLKSDILSRSLNMSS 180

Query: 565 STSLRGVVKIADVVKRSFVVKMNCTVSVNLGSEVIQ 672
            T + G +K+  V+K+S V+KMNC++  N+ S+ +Q
Sbjct: 181 YTRIDGKIKLIKVLKKSVVLKMNCSIIYNISSQAVQ 216


>ref|XP_006439451.1| hypothetical protein CICLE_v10023484mg [Citrus clementina]
           gi|557541713|gb|ESR52691.1| hypothetical protein
           CICLE_v10023484mg [Citrus clementina]
          Length = 226

 Score =  176 bits (445), Expect = 2e-41
 Identities = 94/226 (41%), Positives = 145/226 (64%), Gaps = 4/226 (1%)
 Frame = +1

Query: 37  AMAERREQEKPLSPFTPPLAVDKRTPFAA--EFISRRRYRCLKCCGCCAALFLILAVTIL 210
           A A   EQ KPL+P    L  D   P  +  +   + R RCL+C GC  AL L+ AV I+
Sbjct: 2   ANANNSEQVKPLAPAEYHLRSDYEDPAMSGGKLTLQHRRRCLQCFGCVTALLLV-AVVII 60

Query: 211 ILMITVLHIKDPTLKLNSVKINGLDLLSNTTNNSA--QNLTLVADISIKNPNTASFKFDA 384
           I++ TV H+K+P++++NSV I  L+ L+N +  S+  +N+TL+AD+S+KNPN  SFK+  
Sbjct: 61  IVIATVFHVKNPSIRMNSVTIQRLEFLNNGSVRSSGDENVTLLADVSVKNPNYNSFKYGN 120

Query: 385 STTDVFYDGTVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLST 564
           +TT ++Y G VVGE   P+G+A ARRT RMNV+V +  E+ + V    SD+++R+L +S+
Sbjct: 121 TTTSIYYGGVVVGEGHIPQGQAKARRTLRMNVTVHLNPEKFLTVPSLRSDILSRSLNMSS 180

Query: 565 STSLRGVVKIADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRRDVAL 702
            T + G +K+  + K+S V+KMNC++  N+ S+ +Q   CR    L
Sbjct: 181 YTRIDGKIKLLKIFKKSVVLKMNCSIIYNISSQAVQ-QECRGQALL 225


>ref|XP_002509874.1| conserved hypothetical protein [Ricinus communis]
           gi|223549773|gb|EEF51261.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 222

 Score =  174 bits (442), Expect = 4e-41
 Identities = 91/214 (42%), Positives = 135/214 (63%), Gaps = 3/214 (1%)
 Frame = +1

Query: 40  MAERREQEKPLSPFTPPLAVDKRTPFAAEFISRRRYR---CLKCCGCCAALFLILAVTIL 210
           MA+  +Q KPL+P       D+    ++   ++  +R   C+KC GCC A  LI+AVTIL
Sbjct: 1   MADTDQQAKPLAPAAFQSRSDEEAAASSSITTQFNFRHRNCIKCFGCCTAFLLIIAVTIL 60

Query: 211 ILMITVLHIKDPTLKLNSVKINGLDLLSNTTNNSAQNLTLVADISIKNPNTASFKFDAST 390
           IL  TV H+K+P +K+N + +  L+L  + +  +  N+TL  DIS+KNPN A F+F+  T
Sbjct: 61  ILFFTVFHVKNPVIKMNEITLLQLELNKDGSLRNGTNVTLELDISVKNPNVAPFRFNNFT 120

Query: 391 TDVFYDGTVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLSTST 570
           T V Y G  VGE R P G A ARRT  MNV+VD++ E+++ V     D+ + NLT+++ST
Sbjct: 121 TTVLYGGNNVGEARTPSGTAKARRTVHMNVTVDLIPEKILQVPGLLQDVSSGNLTMNSST 180

Query: 571 SLRGVVKIADVVKRSFVVKMNCTVSVNLGSEVIQ 672
            + G VKI  +VK+  VV++NC+V+ N  S+ IQ
Sbjct: 181 VIGGKVKILKIVKKYLVVEVNCSVTYNFSSKEIQ 214


>ref|XP_002299505.1| hypothetical protein POPTR_0001s09990g [Populus trichocarpa]
           gi|222846763|gb|EEE84310.1| hypothetical protein
           POPTR_0001s09990g [Populus trichocarpa]
          Length = 219

 Score =  174 bits (441), Expect = 5e-41
 Identities = 90/222 (40%), Positives = 142/222 (63%), Gaps = 1/222 (0%)
 Frame = +1

Query: 40  MAERREQEKPLSPFTPPLAVDKRTPFAAEFISRRRYRCLKCCGCCAALFLILAVTILILM 219
           MAE  EQ KPL+P    +   +         + RR  C+KCCGC  A+ LILA T+++L+
Sbjct: 1   MAET-EQAKPLAPAALRIRSVEEENMPTLSKTHRR-NCIKCCGCITAILLILATTVVVLV 58

Query: 220 ITVLHIKDPTLKLNSVKINGLDLLSNTTNNSAQNLTLVADISIKNPNTASFKFDASTTDV 399
           +TV  + DP +K+N V +  L+L + T      N+TL+AD+S+KNPN A+FKF   TT V
Sbjct: 59  LTVFQVDDPVIKMNKVSVQRLELANGTLRTDV-NVTLLADVSVKNPNAAAFKFKNGTTTV 117

Query: 400 FYDGTVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIA-RNLTLSTSTSL 576
           +  G +VGE   P G+A ARRT  MNV+VD++ ++++ V    SD+++ R LT+S++T +
Sbjct: 118 YCGGVMVGEANTPPGKAKARRTLHMNVTVDLIPKKLLSVPSLMSDVVSVRKLTMSSNTVI 177

Query: 577 RGVVKIADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRRDVAL 702
            G V+I  ++K+  VV++NCT++ N  S+ IQ  NC+  +++
Sbjct: 178 GGKVRILQIIKKYLVVRVNCTMTYNFTSQAIQGGNCKPHLSM 219


>ref|XP_006352263.1| PREDICTED: uncharacterized protein LOC102590666 [Solanum tuberosum]
          Length = 235

 Score =  174 bits (440), Expect = 7e-41
 Identities = 91/226 (40%), Positives = 142/226 (62%), Gaps = 9/226 (3%)
 Frame = +1

Query: 52  REQEKPLSPFTPPLAVDKR-----TPFAAEFIS---RRRYRCLKCCGCCAALFLILAVTI 207
           R+Q +PL+P +  +  +       T  ++  +S   RRR RC+KCCGCC    +I+ + I
Sbjct: 11  RDQVRPLAPSSHRIHTENEEGVNYTSTSSIELSKKQRRRKRCIKCCGCCGITTIIIGIVI 70

Query: 208 LILMITVLHIKDPTLKLNSVKINGLDLLSNTTN-NSAQNLTLVADISIKNPNTASFKFDA 384
           LIL +TV  +KDPT+++NS++  GL  L++++N  +  N+T+ ADISIKNPN  SFKF  
Sbjct: 71  LILALTVFKVKDPTIRMNSIRFEGLSSLTSSSNLQTNMNITVSADISIKNPNAVSFKFKP 130

Query: 385 STTDVFYDGTVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLST 564
           +T  + Y   ++ E   P G A ARRT  MNV V VMVE+++ + R   DLIA  L +S 
Sbjct: 131 ATASLIYSDRIIAEAMLPRGSARARRTFSMNVKVMVMVEKLIVIPRLTIDLIAGELPVSM 190

Query: 565 STSLRGVVKIADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRRDVAL 702
           ST++ G V +  V+K+S  ++MNC + V+L  + ++ ++C R V+L
Sbjct: 191 STNINGKVNLG-VIKKSVDIRMNCNMVVDLQRQDVKDIDCERKVSL 235


>ref|XP_002301379.1| hypothetical protein POPTR_0002s16660g [Populus trichocarpa]
           gi|222843105|gb|EEE80652.1| hypothetical protein
           POPTR_0002s16660g [Populus trichocarpa]
          Length = 217

 Score =  174 bits (440), Expect = 7e-41
 Identities = 88/216 (40%), Positives = 138/216 (63%)
 Frame = +1

Query: 49  RREQEKPLSPFTPPLAVDKRTPFAAEFISRRRYRCLKCCGCCAALFLILAVTILILMITV 228
           +REQ +PL+P       D+    A++   +R  +C+KCC    A+F+IL   I++L  TV
Sbjct: 3   KREQVRPLAPAAERRCSDEEG--ASKHHKKRSRKCVKCCVFVTAIFVILVTAIIVLRFTV 60

Query: 229 LHIKDPTLKLNSVKINGLDLLSNTTNNSAQNLTLVADISIKNPNTASFKFDASTTDVFYD 408
             +KDP + +NS  I  L+L + TT     N+TL+AD+S+KNPN ASFK+  +TT ++YD
Sbjct: 61  FRVKDPVITMNSFTITKLELSNGTTPKPGVNITLIADVSVKNPNVASFKYSNTTTTLYYD 120

Query: 409 GTVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLSTSTSLRGVV 588
             +VGE R   G A ARRT RMNV+VD++ +R++      +D+ +  L+++T T + G +
Sbjct: 121 RQIVGEARNGPGHARARRTMRMNVTVDIIPDRIMSNPNLNADMSSGILSMTTYTRVPGRM 180

Query: 589 KIADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRRDV 696
           KI  +VKR+ VVKM C++++N+ S+ IQ   C+R V
Sbjct: 181 KIV-IVKRNIVVKMTCSITLNITSQQIQTQQCKRKV 215


>ref|XP_007210919.1| hypothetical protein PRUPE_ppa021244mg [Prunus persica]
           gi|462406654|gb|EMJ12118.1| hypothetical protein
           PRUPE_ppa021244mg [Prunus persica]
          Length = 219

 Score =  172 bits (437), Expect = 1e-40
 Identities = 95/218 (43%), Positives = 141/218 (64%), Gaps = 1/218 (0%)
 Frame = +1

Query: 52  REQEKPLSPFTP-PLAVDKRTPFAAEFISRRRYRCLKCCGCCAALFLILAVTILILMITV 228
           +EQ KPL+P     L  D+   F +  I   + + + CCGC +ALFLI+AVT ++L  TV
Sbjct: 4   KEQGKPLAPANSYHLRSDEEEVFVSSHIKLCQRKYVMCCGCVSALFLIIAVTAIVLGFTV 63

Query: 229 LHIKDPTLKLNSVKINGLDLLSNTTNNSAQNLTLVADISIKNPNTASFKFDASTTDVFYD 408
            H+K P +K+N V I  L++ +N    S  N+TL+AD+SIKNPN ASFK+  +TT V+Y 
Sbjct: 64  FHVKGPRIKMNDVTIQQLEV-ANGALRSDTNVTLLADVSIKNPNVASFKYGNTTTRVYYS 122

Query: 409 GTVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLSTSTSLRGVV 588
           GT VG+ R P G A ARRT RMNV+VD++   +  V  F  ++ +  LT+ST T + G V
Sbjct: 123 GTEVGQGRTPAGVAKARRTMRMNVTVDIVPGEISAVPGFIKEVASGKLTVSTYTRIEGKV 182

Query: 589 KIADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRRDVAL 702
           KI  +V ++ VV++NC+++ N  S+ I+  +C+R V+L
Sbjct: 183 KIL-MVNKNVVVELNCSMTYNFASKGIEGEDCKRRVSL 219


>ref|XP_007040372.1| Late embryogenesis abundant hydroxyproline-rich glycoprotein
           family, putative [Theobroma cacao]
           gi|508777617|gb|EOY24873.1| Late embryogenesis abundant
           hydroxyproline-rich glycoprotein family, putative
           [Theobroma cacao]
          Length = 219

 Score =  172 bits (436), Expect = 2e-40
 Identities = 91/216 (42%), Positives = 136/216 (62%), Gaps = 1/216 (0%)
 Frame = +1

Query: 52  REQEKPLSPFTPPLAVDKRTPFAAEF-ISRRRYRCLKCCGCCAALFLILAVTILILMITV 228
           REQ KPL+P       D     + +  + RRRY  ++CCGC AAL LI AV IL+L  TV
Sbjct: 4   REQVKPLAPAAFQTRSDDEEALSKQLKLKRRRY--IQCCGCVAALLLIQAVVILVLFFTV 61

Query: 229 LHIKDPTLKLNSVKINGLDLLSNTTNNSAQNLTLVADISIKNPNTASFKFDASTTDVFYD 408
             I+DP +++NSV I  L+   N +  +  N+TL+AD+S+KNPN A+FKF+ STT ++Y 
Sbjct: 62  FRIQDPMIRMNSVTIQRLEFFQNGSLRTDVNVTLLADVSVKNPNVAAFKFNNSTTLIYYG 121

Query: 409 GTVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLSTSTSLRGVV 588
           G VVGE    +G+A ARRT R NV+VD++ E+++ V    SD  ++ L +S+ T + G V
Sbjct: 122 GRVVGEGHHLQGKAKARRTLRRNVTVDIIPEKILAVPSLMSDFASQALNISSYTRISGRV 181

Query: 589 KIADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRRDV 696
           +I + +K+  VVK NCT++  L  +     +CR ++
Sbjct: 182 RILNFIKKKVVVKFNCTMTYRLSGQEFHGESCRPEL 217


>ref|XP_003548621.1| PREDICTED: uncharacterized protein LOC100799820 [Glycine max]
          Length = 219

 Score =  171 bits (433), Expect = 4e-40
 Identities = 86/218 (39%), Positives = 138/218 (63%), Gaps = 2/218 (0%)
 Frame = +1

Query: 55  EQEKPLSPFTPPLAVDK--RTPFAAEFISRRRYRCLKCCGCCAALFLILAVTILILMITV 228
           EQ +PL+P     + D+   TP      ++   + +K C C     L++A+ I++L+ TV
Sbjct: 5   EQARPLAPSIERQSSDEDNTTPHPQ---TQGHKKLIKRCACPLISLLLIAIVIIVLIFTV 61

Query: 229 LHIKDPTLKLNSVKINGLDLLSNTTNNSAQNLTLVADISIKNPNTASFKFDASTTDVFYD 408
             +KDP + +NS+KI  L L++  +     N++LVAD+S+KNPN ASF++  +TT ++Y 
Sbjct: 62  FRVKDPVITMNSIKITKLQLVNTMSQQPGANMSLVADVSVKNPNVASFRYSNTTTSLYYH 121

Query: 409 GTVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLSTSTSLRGVV 588
           G +VGE R P G A ARRT RMNV++DV+  R++    F +DL +  LT+S+ + + G V
Sbjct: 122 GVIVGEARGPPGRAKARRTLRMNVTIDVITARVISSPDFVTDLGSGLLTMSSFSRVPGQV 181

Query: 589 KIADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRRDVAL 702
           KI +++KR  VVKMNCT + N+ ++ I+  +C+R V L
Sbjct: 182 KILNLIKRHVVVKMNCTTTFNISTQAIKEQSCKRKVKL 219


>gb|ACU19833.1| unknown [Glycine max]
          Length = 219

 Score =  171 bits (433), Expect = 4e-40
 Identities = 85/216 (39%), Positives = 137/216 (63%)
 Frame = +1

Query: 55  EQEKPLSPFTPPLAVDKRTPFAAEFISRRRYRCLKCCGCCAALFLILAVTILILMITVLH 234
           EQ +PL+P     + D+    A    ++   + +K C C     L++A+ I++L+ TV  
Sbjct: 5   EQARPLAPSIERQSSDEDNT-APHPQTQGHKKLIKRCACPLISLLLIAIVIIVLIFTVFR 63

Query: 235 IKDPTLKLNSVKINGLDLLSNTTNNSAQNLTLVADISIKNPNTASFKFDASTTDVFYDGT 414
           +KDP + +NS+KI  L L++  +     N++LVAD+S+KNPN ASF++  +TT ++Y G 
Sbjct: 64  VKDPVITMNSIKITKLQLVNTMSQQPGANMSLVADVSVKNPNVASFRYSNTTTSLYYHGV 123

Query: 415 VVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLSTSTSLRGVVKI 594
           +VGE R P G A ARRT RMNV++DV+  R++    F +DL +  LT+S+ + + G VKI
Sbjct: 124 IVGEARGPPGRAKARRTLRMNVTIDVIAARVISSPDFVTDLGSGLLTMSSFSRVPGQVKI 183

Query: 595 ADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRRDVAL 702
            +++KR  VVKMNCT + N+ ++ I+  +C+R V L
Sbjct: 184 LNLIKRHVVVKMNCTTTFNISTQAIKEQSCKRKVKL 219


>gb|EXB52691.1| hypothetical protein L484_022468 [Morus notabilis]
          Length = 229

 Score =  170 bits (431), Expect = 7e-40
 Identities = 87/213 (40%), Positives = 142/213 (66%), Gaps = 1/213 (0%)
 Frame = +1

Query: 55  EQEKPLSPFTP-PLAVDKRTPFAAEFISRRRYRCLKCCGCCAALFLILAVTILILMITVL 231
           +Q +PL+P    P + D+     ++ I  R+   +K CGC AA  LILAV I+IL+ TV 
Sbjct: 5   DQARPLAPAANRPSSDDEEVALHSKKIRHRKL--IKYCGCLAAFVLILAVAIIILIFTVF 62

Query: 232 HIKDPTLKLNSVKINGLDLLSNTTNNSAQNLTLVADISIKNPNTASFKFDASTTDVFYDG 411
            IK+P +K+N + ++ L L++N T+    N++L AD+S+KNPN ASF++  +TT ++YDG
Sbjct: 63  KIKEPVIKMNGITVSNLALVNNRTSEG--NMSLTADVSVKNPNAASFRYSNTTTALYYDG 120

Query: 412 TVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLSTSTSLRGVVK 591
            ++GE R P G+A ARRT+RMN++VD+++++++       D+ +  + +S+ + + G VK
Sbjct: 121 KMIGEARGPPGQARARRTKRMNITVDIIMDQVMTSPNLLGDVGSGLVEMSSYSRIPGRVK 180

Query: 592 IADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRR 690
           I +V+KR  VVKMNCT +VN+ S+ I    C+R
Sbjct: 181 ILNVIKRHVVVKMNCTFTVNITSKSIVEQKCKR 213


>ref|XP_007219590.1| hypothetical protein PRUPE_ppa023497mg [Prunus persica]
           gi|462416052|gb|EMJ20789.1| hypothetical protein
           PRUPE_ppa023497mg [Prunus persica]
          Length = 313

 Score =  167 bits (423), Expect = 6e-39
 Identities = 90/218 (41%), Positives = 140/218 (64%), Gaps = 5/218 (2%)
 Frame = +1

Query: 52  REQEKPLSPFTPPLAVDKRTPFAAEFISRRRYRCLK----CCGCCAALFLILAVTILILM 219
           +EQ +PL+P     A + ++  A E     +   LK    CCG   AL LILAV I+IL 
Sbjct: 4   KEQVRPLAP-----AANGQSSDADEAALHSKKFGLKKFIYCCGGITALLLILAVVIIILA 58

Query: 220 ITVLHIKDPTLKLNSVKINGLDLLS-NTTNNSAQNLTLVADISIKNPNTASFKFDASTTD 396
            TV  +K+P +K+N V +  L+L++ NTT     N++L AD+S+KNPN ASF+++ +TT 
Sbjct: 59  FTVFRLKEPKIKMNKVTVTRLELINDNTTPKPGSNISLTADVSVKNPNAASFRYNNTTTT 118

Query: 397 VFYDGTVVGEVRAPEGEAAARRTRRMNVSVDVMVERMVGVSRFESDLIARNLTLSTSTSL 576
           ++Y G VVGE     G+A ARRT RMN++VDV+ +R+    ++ +D+ +  LT+S+ + +
Sbjct: 119 LYYHGVVVGEAHGSPGKAKARRTMRMNITVDVITDRLTSNPKWGADVGSGLLTMSSYSRI 178

Query: 577 RGVVKIADVVKRSFVVKMNCTVSVNLGSEVIQVVNCRR 690
            G V + +++KR  VVKMNCT++VN+ S+ IQ   C+R
Sbjct: 179 PGRVNMWNIIKRHVVVKMNCTMTVNISSQAIQEQKCKR 216


Top