BLASTX nr result

ID: Mentha27_contig00005936 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00005936
         (1252 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU25168.1| hypothetical protein MIMGU_mgv1a018620mg [Mimulus...   227   9e-57
ref|XP_002513626.1| conserved hypothetical protein [Ricinus comm...   166   2e-38
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...   162   2e-37
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...   161   6e-37
gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus...   160   1e-36
ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom...   160   1e-36
gb|EYU25166.1| hypothetical protein MIMGU_mgv1a014071mg [Mimulus...   159   2e-36
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...   157   9e-36
gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus...   157   1e-35
ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302...   148   4e-33
ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293...   147   9e-33
ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...   144   1e-31
ref|XP_006362675.1| PREDICTED: uncharacterized protein LOC102578...   140   1e-30
ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-r...   139   3e-30
ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295...   137   7e-30
ref|XP_007210712.1| hypothetical protein PRUPE_ppa022983mg [Prun...   137   7e-30
ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303...   132   2e-28
emb|CBI22611.3| unnamed protein product [Vitis vinifera]              132   2e-28
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...   132   3e-28
ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294...   130   1e-27

>gb|EYU25168.1| hypothetical protein MIMGU_mgv1a018620mg [Mimulus guttatus]
          Length = 214

 Score =  227 bits (578), Expect = 9e-57
 Identities = 119/220 (54%), Positives = 152/220 (69%)
 Frame = +3

Query: 159 MAEEREATGIKAYTAPAQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAVI 338
           MAE     G KAY +P     YGRVDEEVA VA ++++R+KR+KCF+Y+A F+V Q+ + 
Sbjct: 1   MAENNHQAGEKAYASP-----YGRVDEEVASVAQKNEKRKKRVKCFTYVAVFIVIQSVIF 55

Query: 339 LIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNG 518
           +IF LTIMK RTPKF +RSA F GAF+V+     PSFNI M+A+L ++N NFG+YKYQN 
Sbjct: 56  MIFGLTIMKVRTPKFHVRSATF-GAFEVSTLDTNPSFNINMIADLSVRNRNFGQYKYQNS 114

Query: 519 TVEFYYGGTKVGEAFIPRANAKARSTRKFNVTVDLSSAGVPAGELSAQFGAAPGVIPLTS 698
           TVEF++ GTKVGEA I R+ A ARSTR+F  TVDLSSAGVP   L+ +F     +IPLTS
Sbjct: 115 TVEFFFRGTKVGEARIVRSRANARSTRRFLATVDLSSAGVPTEVLANEF-RTHALIPLTS 173

Query: 699 RSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLRELSC 818
           RS + GKVE              CTM+I + S+QL  +SC
Sbjct: 174 RSTLRGKVEIMKLMKKNKSTNMNCTMEIMISSKQLGNISC 213


>ref|XP_002513626.1| conserved hypothetical protein [Ricinus communis]
           gi|223547534|gb|EEF49029.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 217

 Score =  166 bits (420), Expect = 2e-38
 Identities = 89/224 (39%), Positives = 132/224 (58%), Gaps = 4/224 (1%)
 Frame = +3

Query: 159 MAEEREATGIKAYTAPAQINPYGRVDEE---VAEVAARDQRRRKRIKCFSYLAAFVVFQT 329
           MAE+ +A        P   +   R DEE         ++ R++KR+KC +++ AF +FQT
Sbjct: 1   MAEKEQAP------TPLVADGQTRSDEESGTAGTAQTKELRKKKRMKCIAFVVAFTIFQT 54

Query: 330 AVILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKY 509
            +IL+F  T+++F+ PKFR+RSA+FD  F V    AAPSFN+ M  + G+KN NFG +KY
Sbjct: 55  GIILLFVFTVLRFKDPKFRVRSASFDDTFHVGTDAAAPSFNLTMNTQFGVKNTNFGHFKY 114

Query: 510 QNGTVEFYYGGTKVGEAFIPRANAKARSTRKFNVTVDLSSAGVPAG-ELSAQFGAAPGVI 686
           +  TV F Y GT VG   + +A A+ARSTRKF+  V L +  +P G ELS+   +  G I
Sbjct: 115 ETSTVTFEYRGTVVGLVNVDKARARARSTRKFDAIVVLRTDRLPDGFELSSDISS--GKI 172

Query: 687 PLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLRELSC 818
           PL+S S + G++               CTM +D+Q+R L+++ C
Sbjct: 173 PLSSSSRLDGEIHLMKVIKKKKSAEMNCTMNVDIQTRTLQDIVC 216


>ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721844|gb|EOY13741.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 259

 Score =  162 bits (411), Expect = 2e-37
 Identities = 81/189 (42%), Positives = 126/189 (66%), Gaps = 2/189 (1%)
 Frame = +3

Query: 258 ARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTA 437
           +++ +R+KR+KC +Y+AAFV+FQTA+IL+FALT+M+ + PKFR+RS   D   D+  + +
Sbjct: 13  SKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVD---DLTFNNS 69

Query: 438 APSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIP--RANAKARSTRKFNV 611
           +PSFN++ +A++ +KN NFG YK++N TV F Y G++VGEA +   RA A+ARST+K NV
Sbjct: 70  SPSFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNV 129

Query: 612 TVDLSSAGVPAGELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQ 791
           T+DL+S GV A +         G + LTS+S + GKV               CTM +++ 
Sbjct: 130 TMDLNSNGV-ANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLA 188

Query: 792 SRQLRELSC 818
            + +R++ C
Sbjct: 189 QKLVRDIKC 197


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score =  161 bits (407), Expect = 6e-37
 Identities = 82/190 (43%), Positives = 124/190 (65%), Gaps = 2/190 (1%)
 Frame = +3

Query: 255 AARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRLRSAAFDGAFDVAAST 434
           +A + +R+KR+K F+Y AAFVVFQT VIL+F+LT+M+ + PKFR+RS   +   D+A ++
Sbjct: 15  SAAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVE---DIAYTS 71

Query: 435 AA--PSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIPRANAKARSTRKFN 608
               PSFN++  AE+ +KN NFG +K+ N T+ F YGG +VGEAF+ +  AKARST+K N
Sbjct: 72  TPNPPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMN 131

Query: 609 VTVDLSSAGVPAGELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDV 788
           VTVDL+S  +PA    A    + G + LT+ + ++GKV               CTM +++
Sbjct: 132 VTVDLNSNNIPANSNLAS-DISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNL 190

Query: 789 QSRQLRELSC 818
            SR ++++ C
Sbjct: 191 ASRAIQDIKC 200


>gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus guttatus]
          Length = 213

 Score =  160 bits (405), Expect = 1e-36
 Identities = 90/204 (44%), Positives = 125/204 (61%), Gaps = 3/204 (1%)
 Frame = +3

Query: 216 NPYGRVDEEVAEVA--ARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRL 389
           N +GR D E    A  AR+QR++KR KCF Y+A FV+FQ  VI IF++T+MK RTPKFR+
Sbjct: 13  NGHGRSDAEAGAAAHDAREQRKKKRTKCFIYIALFVIFQLGVIAIFSVTVMKIRTPKFRI 72

Query: 390 RSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIP 569
           RSA        A +  +PSF+  + AE  +KNANFGRYKY+N TV F+Y GT VG+ F+ 
Sbjct: 73  RSAHLTTFH--AGTPGSPSFSGTVNAEFSVKNANFGRYKYRNTTVGFFYKGTPVGQVFVR 130

Query: 570 RANAKARSTRKFNVTVDLSSAGVPAG-ELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXX 746
            + A  RST+KF V VDL+ A      +L++   A  GV+ +TS++ + G+VE       
Sbjct: 131 DSRAGWRSTKKFRVVVDLNLANAQGNPQLASDLNA--GVVQITSQARMAGRVELIFVMKK 188

Query: 747 XXXXXXXCTMQIDVQSRQLRELSC 818
                  C M+I   ++Q+R L C
Sbjct: 189 NKSTDMNCNMEIVTATQQIRNLVC 212


>ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
           gi|508776114|gb|EOY23370.1| Uncharacterized protein
           TCM_015287 [Theobroma cacao]
          Length = 214

 Score =  160 bits (405), Expect = 1e-36
 Identities = 88/216 (40%), Positives = 126/216 (58%), Gaps = 1/216 (0%)
 Frame = +3

Query: 174 EATGIKAYTAPAQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAVILIFAL 353
           EA     Y      N + R DEE     +++ +++KR+KC  Y+  F VFQT +IL+FAL
Sbjct: 2   EAKSQSPYPLVPAANGHERSDEESVAAHSKELKKKKRMKCLLYIVLFAVFQTGIILLFAL 61

Query: 354 TIMKFRTPKFRLRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNGTVEFY 533
           T+M+ R PKFR+RS +F   F+V    A+PSF+++M  +  +KN NFG +KY+ G V F 
Sbjct: 62  TVMRIRNPKFRVRSGSFT-TFNVGTE-ASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFA 119

Query: 534 YGGTKVGEAFIPRANAKARSTRKFNVTVDLSSAGVP-AGELSAQFGAAPGVIPLTSRSAV 710
           Y GT VG A I +A A+ARST+K +V V+LSS G+P   EL     A  GV+ LTS S +
Sbjct: 120 YRGTPVGRATIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISA--GVLTLTSSSKL 177

Query: 711 TGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLRELSC 818
            GK+               CTM + + +R +R + C
Sbjct: 178 DGKIHLMKVIKKKKSTQMNCTMDVAIDTRTVRNIIC 213


>gb|EYU25166.1| hypothetical protein MIMGU_mgv1a014071mg [Mimulus guttatus]
          Length = 202

 Score =  159 bits (402), Expect = 2e-36
 Identities = 92/205 (44%), Positives = 129/205 (62%), Gaps = 1/205 (0%)
 Frame = +3

Query: 207 AQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFR 386
           A  N +GR D E A  AA + R++ R KCF Y+A FV+FQ  VI IF+LT+MK RTPKFR
Sbjct: 2   APANGHGRSDAE-AGGAATEPRKKNRTKCFLYIALFVIFQIGVITIFSLTVMKIRTPKFR 60

Query: 387 LRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFI 566
           +RSA     F+ A + A+PSF+  + AE  +KNANFGRYKY+N TV+F+Y GT VG+  +
Sbjct: 61  IRSAHLTN-FN-AGTPASPSFSATVNAEFTVKNANFGRYKYRNTTVDFFYRGTPVGQVLV 118

Query: 567 PRANAKARSTRKFNVTVDLSSAGVPAG-ELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXX 743
             + A  RST+KFNV V+LS     A  +L++   A  GV+ ++S++ + G+VE      
Sbjct: 119 RDSRAGWRSTKKFNVAVNLSLTNAQANPQLASDLNA--GVVQISSQARMRGRVELIFVMK 176

Query: 744 XXXXXXXXCTMQIDVQSRQLRELSC 818
                   CTM+I   ++QLR + C
Sbjct: 177 KNKSTDMNCTMEIVTATQQLRNILC 201


>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777615|gb|EOY24871.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 215

 Score =  157 bits (397), Expect = 9e-36
 Identities = 81/204 (39%), Positives = 124/204 (60%)
 Frame = +3

Query: 207 AQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFR 386
           A  N + R DEE A + +++ +R+KRIK   Y+AAF VFQT VILIFALT+M+ + PK R
Sbjct: 12  APANGHPRSDEESASLQSKELKRKKRIKYAVYIAAFAVFQTVVILIFALTVMRVKNPKVR 71

Query: 387 LRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFI 566
           +     +   + + + AA SFN+R + ++ +KN NFG YK+ N T+ F Y G  VGEA I
Sbjct: 72  IGKVTVE-TMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFLYDGVMVGEAII 130

Query: 567 PRANAKARSTRKFNVTVDLSSAGVPAGELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXX 746
           P+A A+ARST+K +VTV+++S+ + +         +  V+ L S++ + GKVE       
Sbjct: 131 PKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKLKGKVELMKVMKK 190

Query: 747 XXXXXXXCTMQIDVQSRQLRELSC 818
                  CT+  +V +R L++L C
Sbjct: 191 KKSPEMNCTLIFNVSTRSLQDLKC 214


>gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus guttatus]
          Length = 214

 Score =  157 bits (396), Expect = 1e-35
 Identities = 84/205 (40%), Positives = 125/205 (60%), Gaps = 1/205 (0%)
 Frame = +3

Query: 207 AQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFR 386
           A  N +GR D E    AA +  +RKR +C  Y+    + Q AV+++F+LT+MK R P+FR
Sbjct: 13  APANDHGRSDTEAGGAAASELHKRKRTQCLIYIGLLAIIQIAVVIVFSLTVMKIRNPRFR 72

Query: 387 LRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFI 566
           +RSA     F+ A + A+P+F  ++ AE  +KNANFGRYKY + TV+F Y GT+VGE F+
Sbjct: 73  IRSAHLTN-FN-AGTPASPAFTGKLNAEFSVKNANFGRYKYMDTTVDFVYRGTRVGEVFV 130

Query: 567 PRANAKARSTRKFNVTVDLSSAGVPAG-ELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXX 743
             + A  R+T+KFNV VDLS A   A  +L++   A  GV+P++S + ++G VE      
Sbjct: 131 RESRAGWRTTKKFNVAVDLSLANARANPQLASDLNA--GVVPISSEARMSGSVELLFVLK 188

Query: 744 XXXXXXXXCTMQIDVQSRQLRELSC 818
                   CTM+I   ++Q+R + C
Sbjct: 189 KNRSTGLNCTMEIVTATQQIRNILC 213


>ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca
           subsp. vesca]
          Length = 222

 Score =  148 bits (374), Expect = 4e-33
 Identities = 83/226 (36%), Positives = 132/226 (58%), Gaps = 6/226 (2%)
 Frame = +3

Query: 159 MAEEREATGIKAYTAPAQINPYGRVDEEVAEVA---ARDQRRRKRIKCFSYLAAFVVFQT 329
           MAE +EA     Y        Y R D+E A  A   A + R +KR++C  Y++ F VFQ 
Sbjct: 1   MAENKEAAATSPYPLMPSAPSYMRSDQEAAASAPPSAEELRHKKRMRCLLYVSIFAVFQV 60

Query: 330 AVILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKY 509
            VI +FALT+MK ++PKFR+R+A+  G F+V  S + PSFN+ M    G+KN NFG ++Y
Sbjct: 61  VVITVFALTVMKIKSPKFRVRTASITG-FEV-GSASNPSFNLEMDVHFGVKNTNFGHFEY 118

Query: 510 QNGTVEFYYGGTKVGEAFIPRANAKARSTRKFNV-TVDLSSAGVPAGELSAQFGA--APG 680
           ++G V F Y   ++G+  +     +ARSTRK +V +VDL+S G+PA   +++ G+  + G
Sbjct: 119 EDGIVVFTYRDVRIGQTNVEEERVRARSTRKVDVSSVDLTSRGLPA---NSRLGSDISTG 175

Query: 681 VIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLRELSC 818
           +IP+T  S + GK+               CTM++ + ++ ++ + C
Sbjct: 176 IIPITISSKLDGKIHLMKIIKKKKSAQMNCTMEVVLATKSVQNVVC 221


>ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  147 bits (371), Expect = 9e-33
 Identities = 79/204 (38%), Positives = 115/204 (56%)
 Frame = +3

Query: 207 AQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFR 386
           A  N Y R D E   ++  + +R+KRIKCF+Y+  F+VFQ AV+ +F LTIMK +TPK R
Sbjct: 12  APSNGYTRSDGE--SLSEDELKRKKRIKCFAYIGIFIVFQIAVMTVFGLTIMKVKTPKVR 69

Query: 387 LRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFI 566
           L ++      D  +S  APSF+     ++ +KN N+G YK+  G V F Y G  VG   +
Sbjct: 70  LGTSTLT---DFTSSDTAPSFDTTFNTQIRVKNTNWGPYKFDQGVVTFMYQGMPVGTVVV 126

Query: 567 PRANAKARSTRKFNVTVDLSSAGVPAGELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXX 746
           P+  A  R T+K NV V L++A +P+   +     + GV+ LTS + +TGKVE       
Sbjct: 127 PKGKAGMRGTKKINVNVRLNTAALPSSSSTLSTELSGGVLTLTSEAKLTGKVELMLIMKK 186

Query: 747 XXXXXXXCTMQIDVQSRQLRELSC 818
                  CT+QIDV  + ++ L C
Sbjct: 187 KKSASMNCTIQIDVSGKTVKSLEC 210


>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 185

 Score =  144 bits (362), Expect = 1e-31
 Identities = 71/186 (38%), Positives = 117/186 (62%)
 Frame = +3

Query: 261 RDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAA 440
           + + +R   KC +Y+A FVVFQTA+ILIFALT+M+ + PK R  +   +       ++++
Sbjct: 2   KGEGKRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFS--TGNSSS 59

Query: 441 PSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIPRANAKARSTRKFNVTVD 620
           P F++R++A++ +KN NFG +KY+N ++   YGG  VGEA I +A A+AR T+KF+VT+D
Sbjct: 60  PFFDMRLMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTID 119

Query: 621 LSSAGVPAGELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQ 800
           +SS+ +     +     A GV+PL+S + ++GKV               CTM I++ +R 
Sbjct: 120 ISSSKLSTNS-NLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRT 178

Query: 801 LRELSC 818
           +++L C
Sbjct: 179 VQDLKC 184


>ref|XP_006362675.1| PREDICTED: uncharacterized protein LOC102578608 [Solanum tuberosum]
          Length = 204

 Score =  140 bits (352), Expect = 1e-30
 Identities = 85/219 (38%), Positives = 116/219 (52%)
 Frame = +3

Query: 162 AEEREATGIKAYTAPAQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAVIL 341
           AEE +      +  PA+  P      E+        RR+KR K   Y+A F+VFQ AV+L
Sbjct: 3   AEEEQQLQTNGHAKPAEETPNSTQSNEL--------RRKKRNKILVYVALFIVFQIAVLL 54

Query: 342 IFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNGT 521
            F+L IMK RTPKF +RSA FD        T   SFNI M AEL +KNANFG Y Y+N T
Sbjct: 55  FFSLYIMKIRTPKFSVRSATFD-----LMVTENASFNITMNAELSVKNANFGPYNYKNST 109

Query: 522 VEFYYGGTKVGEAFIPRANAKARSTRKFNVTVDLSSAGVPAGELSAQFGAAPGVIPLTSR 701
           + FYY    +GEAF+ +  A  +S++KFNV V+LSS      E   +     G + LTS+
Sbjct: 110 IYFYYNDVSIGEAFVYQGKAGFKSSKKFNVIVNLSSK-----ESKLRNDLNSGTLILTSK 164

Query: 702 SAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLRELSC 818
           S + GKV+              C + I +  + +R++ C
Sbjct: 165 SKLEGKVKLIFFMKKKKSTEMNCAIIIGLAGKVVRDIQC 203


>ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777613|gb|EOY24869.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 226

 Score =  139 bits (350), Expect = 3e-30
 Identities = 74/178 (41%), Positives = 112/178 (62%), Gaps = 1/178 (0%)
 Frame = +3

Query: 267 QRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPS 446
           +R     KC +Y+AAFVVFQTA+IL+FALT+M+ R+PK R  +   +    V +S+  PS
Sbjct: 4   RREGSNAKCLAYVAAFVVFQTAIILLFALTVMRIRSPKVRFGAVTVESFSTVNSSS--PS 61

Query: 447 FNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIPRANAKARSTRKFNVTVDLS 626
           F+++++A++ +KN NFG +KY+N TV   YGG  VGEA I +  A+AR T+KFN+ VD+S
Sbjct: 62  FDMKLMAQVAVKNTNFGHFKYENSTVTILYGGMPVGEAAIFKGRARARQTKKFNINVDIS 121

Query: 627 SAGVPA-GELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSR 797
           S+ + +   L     A  GV+PL+S++ + GKV               CTM I++ +R
Sbjct: 122 SSRLSSNSNLGNDINA--GVLPLSSQAKLKGKVHLMKVIKKKKSGEMSCTMGINLATR 177


>ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  137 bits (346), Expect = 7e-30
 Identities = 84/222 (37%), Positives = 124/222 (55%), Gaps = 2/222 (0%)
 Frame = +3

Query: 159 MAEEREATGIKAYTAP-AQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAV 335
           MAE+ + T     T P A  N Y R D E   ++  + +R+KRIKCF+Y+  F+VFQ A+
Sbjct: 1   MAEKSQKTH---QTYPLASENGYTRSDGE--SLSEDELKRKKRIKCFAYIGIFIVFQMAI 55

Query: 336 ILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQN 515
             +F LT++K +TPK RL ++      DV +ST   SF+     ++ +KN N+G YK+  
Sbjct: 56  GAVFGLTVLKVKTPKVRLGTSTLS---DVTSSTT--SFSSTFNTQIRVKNTNWGPYKFDQ 110

Query: 516 GTVEFYYGGTKVGEAFIPRANAKARSTRKFNVTVDLSSAGVPAGE-LSAQFGAAPGVIPL 692
           G V F Y G  VG   +P+  A  R T+K NV V L++A +P+   LS++     GV+ L
Sbjct: 111 GVVTFMYQGAPVGTVVVPKGKAGMRGTKKINVNVSLNTAALPSSSTLSSELSG--GVLTL 168

Query: 693 TSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLRELSC 818
           TS + +TGKVE              CT+QIDV  + ++ L C
Sbjct: 169 TSEAKLTGKVELMLIMKKKKSASMNCTIQIDVSGKTVKSLEC 210


>ref|XP_007210712.1| hypothetical protein PRUPE_ppa022983mg [Prunus persica]
           gi|462406447|gb|EMJ11911.1| hypothetical protein
           PRUPE_ppa022983mg [Prunus persica]
          Length = 209

 Score =  137 bits (346), Expect = 7e-30
 Identities = 75/210 (35%), Positives = 116/210 (55%), Gaps = 2/210 (0%)
 Frame = +3

Query: 195 YTAPAQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRT 374
           Y     ++ + R DEE   + + + +R+KRIK + Y+  F+V Q  V+ +F LT+MK +T
Sbjct: 5   YNHREPVHGHPRRDEESTALQSEELKRQKRIKMYKYIVIFIVVQLIVLPVFGLTVMKVKT 64

Query: 375 PKFRLRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVG 554
           PKFRL +        V ++   PSF      ++ +KN N+G YK+  GTV F Y G  VG
Sbjct: 65  PKFRLGNIKVQNLSSVPST---PSFEASFATQIRVKNTNWGPYKFDAGTVTFMYKGVTVG 121

Query: 555 EAFIPRANAKARSTRKFNVTVDLSSAGVPAGELSAQFGA--APGVIPLTSRSAVTGKVEX 728
           +  +P++ AK RST+K +VTV L+S G+P+   S+  G     GV+ L+S+  +TGKV  
Sbjct: 122 QVVVPKSKAKMRSTKKIDVTVSLNSYGLPS---SSNLGTELKSGVLTLSSKGKLTGKVVL 178

Query: 729 XXXXXXXXXXXXXCTMQIDVQSRQLRELSC 818
                        CTM  D+ ++ L+ L C
Sbjct: 179 MLMMKKRKSATMDCTMTFDLSTKTLKTLQC 208


>ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca
           subsp. vesca]
          Length = 213

 Score =  132 bits (333), Expect = 2e-28
 Identities = 67/183 (36%), Positives = 107/183 (58%)
 Frame = +3

Query: 270 RRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPSF 449
           R++KRIKC  Y+A F VFQ  VI +FALT+MK ++PKFR++S          +++A PS 
Sbjct: 37  RKKKRIKCLIYIAVFAVFQIIVITVFALTVMKIKSPKFRIKSITVQDL--TTSNSANPSL 94

Query: 450 NIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIPRANAKARSTRKFNVTVDLSS 629
           ++  VAE+ +KN NFGRYKY   ++ F Y GT+VG+A +P+A A+ ++TRK     ++ S
Sbjct: 95  SMSFVAEVSVKNPNFGRYKYDQTSISFIYEGTQVGDAVVPKATARTKATRK-----EIVS 149

Query: 630 AGVPAGELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLRE 809
             V     +     + G + L++ S + GKV               CTM + + S+Q+++
Sbjct: 150 GAVKTVNSNLASDISAGSVTLSTYSKINGKVYLMNMIKKKKSAEMKCTMVVHLSSKQVQD 209

Query: 810 LSC 818
           + C
Sbjct: 210 IKC 212


>emb|CBI22611.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  132 bits (333), Expect = 2e-28
 Identities = 68/190 (35%), Positives = 114/190 (60%)
 Frame = +3

Query: 249 EVAARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRLRSAAFDGAFDVAA 428
           +V + + RR K  +  +YL+AF +F+T VI++  +T+M+ R+PKFR R+ + +   +  +
Sbjct: 109 DVESEELRRMKCTRYIAYLSAFALFETIVIMVCVVTLMRIRSPKFRFRAVSIEN-LNYTS 167

Query: 429 STAAPSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIPRANAKARSTRKFN 608
            T +PSFNIR  A++ +KN NFG +K++N T+   Y G  VG+A I +A A+ARST+K N
Sbjct: 168 DTTSPSFNIRFNAKVAVKNTNFGHFKFKNSTITLAYRGDHVGDAKISKARARARSTKKMN 227

Query: 609 VTVDLSSAGVPAGELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDV 788
           VTVD++S  V +    A      G + LT +  + GKV               CT++I++
Sbjct: 228 VTVDVTSNNVSSNSNLAS-DINSGFLTLTGQGKLNGKVHLMKVFKKKKSPQMNCTIKINL 286

Query: 789 QSRQLRELSC 818
           +++ ++E  C
Sbjct: 287 ENKVIQEWKC 296


>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776108|gb|EOY23364.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 191

 Score =  132 bits (332), Expect = 3e-28
 Identities = 70/184 (38%), Positives = 111/184 (60%), Gaps = 1/184 (0%)
 Frame = +3

Query: 270 RRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPSF 449
           RR++ IKC +Y+ A V+ QT +IL+F + +M+ R PK RL     +   ++ +S+++PSF
Sbjct: 10  RRKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVEN-LNLNSSSSSPSF 68

Query: 450 NIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIPRANAKARSTRKFNVTVDLSS 629
           ++ + A++ +KN NFG +K+QN T+   Y GT VGEA I +A A+ARST K NVTV +SS
Sbjct: 69  SMNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSVSS 128

Query: 630 AGVPAGE-LSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLR 806
             +     LS+  G+  G I L+S + + GK+               CTM++   S+Q++
Sbjct: 129 DKMSRNSALSSDVGS--GTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQ 186

Query: 807 ELSC 818
            L C
Sbjct: 187 NLMC 190


>ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294764 [Fragaria vesca
           subsp. vesca]
          Length = 182

 Score =  130 bits (327), Expect = 1e-27
 Identities = 71/177 (40%), Positives = 105/177 (59%)
 Frame = +3

Query: 288 KCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPSFNIRMVA 467
           KC +Y+A F+VFQ  VI IFALT+MK + PK R ++A     F+  +STAA SF+  +V 
Sbjct: 8   KCLAYVAIFIVFQIIVITIFALTVMKIKGPKVRFQTATVSN-FNSDSSTAA-SFSGDLVT 65

Query: 468 ELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIPRANAKARSTRKFNVTVDLSSAGVPAG 647
           +  +KN NFG +KY N TV   Y G  +G A +P   AKARSTR+ ++T+ + S+ + +G
Sbjct: 66  KFAVKNTNFGHFKYPNSTVSILYEGQVIGTAAVPSQKAKARSTRRTDITISIDSSKL-SG 124

Query: 648 ELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLRELSC 818
             +       GV+PLTS S + GKVE              CTM +++++R + +L C
Sbjct: 125 TTNLTTAIGAGVVPLTSESTLKGKVEVMKIIKKNKSGKMSCTMLLNLKTRTVDDLKC 181

