BLASTX nr result
ID: Mentha27_contig00005936
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha27_contig00005936 (1252 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU25168.1| hypothetical protein MIMGU_mgv1a018620mg [Mimulus... 227 9e-57 ref|XP_002513626.1| conserved hypothetical protein [Ricinus comm... 166 2e-38 ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r... 162 2e-37 ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r... 161 6e-37 gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus... 160 1e-36 ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom... 160 1e-36 gb|EYU25166.1| hypothetical protein MIMGU_mgv1a014071mg [Mimulus... 159 2e-36 ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r... 157 9e-36 gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus... 157 1e-35 ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302... 148 4e-33 ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293... 147 9e-33 ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r... 144 1e-31 ref|XP_006362675.1| PREDICTED: uncharacterized protein LOC102578... 140 1e-30 ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-r... 139 3e-30 ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295... 137 7e-30 ref|XP_007210712.1| hypothetical protein PRUPE_ppa022983mg [Prun... 137 7e-30 ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303... 132 2e-28 emb|CBI22611.3| unnamed protein product [Vitis vinifera] 132 2e-28 ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r... 132 3e-28 ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294... 130 1e-27 >gb|EYU25168.1| hypothetical protein MIMGU_mgv1a018620mg [Mimulus guttatus] Length = 214 Score = 227 bits (578), Expect = 9e-57 Identities = 119/220 (54%), Positives = 152/220 (69%) Frame = +3 Query: 159 MAEEREATGIKAYTAPAQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAVI 338 MAE G KAY +P YGRVDEEVA VA ++++R+KR+KCF+Y+A F+V Q+ + Sbjct: 1 MAENNHQAGEKAYASP-----YGRVDEEVASVAQKNEKRKKRVKCFTYVAVFIVIQSVIF 55 Query: 339 LIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNG 518 +IF LTIMK RTPKF +RSA F GAF+V+ PSFNI M+A+L ++N NFG+YKYQN Sbjct: 56 MIFGLTIMKVRTPKFHVRSATF-GAFEVSTLDTNPSFNINMIADLSVRNRNFGQYKYQNS 114 Query: 519 TVEFYYGGTKVGEAFIPRANAKARSTRKFNVTVDLSSAGVPAGELSAQFGAAPGVIPLTS 698 TVEF++ GTKVGEA I R+ A ARSTR+F TVDLSSAGVP L+ +F +IPLTS Sbjct: 115 TVEFFFRGTKVGEARIVRSRANARSTRRFLATVDLSSAGVPTEVLANEF-RTHALIPLTS 173 Query: 699 RSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLRELSC 818 RS + GKVE CTM+I + S+QL +SC Sbjct: 174 RSTLRGKVEIMKLMKKNKSTNMNCTMEIMISSKQLGNISC 213 >ref|XP_002513626.1| conserved hypothetical protein [Ricinus communis] gi|223547534|gb|EEF49029.1| conserved hypothetical protein [Ricinus communis] Length = 217 Score = 166 bits (420), Expect = 2e-38 Identities = 89/224 (39%), Positives = 132/224 (58%), Gaps = 4/224 (1%) Frame = +3 Query: 159 MAEEREATGIKAYTAPAQINPYGRVDEE---VAEVAARDQRRRKRIKCFSYLAAFVVFQT 329 MAE+ +A P + R DEE ++ R++KR+KC +++ AF +FQT Sbjct: 1 MAEKEQAP------TPLVADGQTRSDEESGTAGTAQTKELRKKKRMKCIAFVVAFTIFQT 54 Query: 330 AVILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKY 509 +IL+F T+++F+ PKFR+RSA+FD F V AAPSFN+ M + G+KN NFG +KY Sbjct: 55 GIILLFVFTVLRFKDPKFRVRSASFDDTFHVGTDAAAPSFNLTMNTQFGVKNTNFGHFKY 114 Query: 510 QNGTVEFYYGGTKVGEAFIPRANAKARSTRKFNVTVDLSSAGVPAG-ELSAQFGAAPGVI 686 + TV F Y GT VG + +A A+ARSTRKF+ V L + +P G ELS+ + G I Sbjct: 115 ETSTVTFEYRGTVVGLVNVDKARARARSTRKFDAIVVLRTDRLPDGFELSSDISS--GKI 172 Query: 687 PLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLRELSC 818 PL+S S + G++ CTM +D+Q+R L+++ C Sbjct: 173 PLSSSSRLDGEIHLMKVIKKKKSAEMNCTMNVDIQTRTLQDIVC 216 >ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508721844|gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 259 Score = 162 bits (411), Expect = 2e-37 Identities = 81/189 (42%), Positives = 126/189 (66%), Gaps = 2/189 (1%) Frame = +3 Query: 258 ARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTA 437 +++ +R+KR+KC +Y+AAFV+FQTA+IL+FALT+M+ + PKFR+RS D D+ + + Sbjct: 13 SKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVD---DLTFNNS 69 Query: 438 APSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIP--RANAKARSTRKFNV 611 +PSFN++ +A++ +KN NFG YK++N TV F Y G++VGEA + RA A+ARST+K NV Sbjct: 70 SPSFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNV 129 Query: 612 TVDLSSAGVPAGELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQ 791 T+DL+S GV A + G + LTS+S + GKV CTM +++ Sbjct: 130 TMDLNSNGV-ANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLA 188 Query: 792 SRQLRELSC 818 + +R++ C Sbjct: 189 QKLVRDIKC 197 >ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508776113|gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 201 Score = 161 bits (407), Expect = 6e-37 Identities = 82/190 (43%), Positives = 124/190 (65%), Gaps = 2/190 (1%) Frame = +3 Query: 255 AARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRLRSAAFDGAFDVAAST 434 +A + +R+KR+K F+Y AAFVVFQT VIL+F+LT+M+ + PKFR+RS + D+A ++ Sbjct: 15 SAAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVE---DIAYTS 71 Query: 435 AA--PSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIPRANAKARSTRKFN 608 PSFN++ AE+ +KN NFG +K+ N T+ F YGG +VGEAF+ + AKARST+K N Sbjct: 72 TPNPPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMN 131 Query: 609 VTVDLSSAGVPAGELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDV 788 VTVDL+S +PA A + G + LT+ + ++GKV CTM +++ Sbjct: 132 VTVDLNSNNIPANSNLAS-DISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNL 190 Query: 789 QSRQLRELSC 818 SR ++++ C Sbjct: 191 ASRAIQDIKC 200 >gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus guttatus] Length = 213 Score = 160 bits (405), Expect = 1e-36 Identities = 90/204 (44%), Positives = 125/204 (61%), Gaps = 3/204 (1%) Frame = +3 Query: 216 NPYGRVDEEVAEVA--ARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRL 389 N +GR D E A AR+QR++KR KCF Y+A FV+FQ VI IF++T+MK RTPKFR+ Sbjct: 13 NGHGRSDAEAGAAAHDAREQRKKKRTKCFIYIALFVIFQLGVIAIFSVTVMKIRTPKFRI 72 Query: 390 RSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIP 569 RSA A + +PSF+ + AE +KNANFGRYKY+N TV F+Y GT VG+ F+ Sbjct: 73 RSAHLTTFH--AGTPGSPSFSGTVNAEFSVKNANFGRYKYRNTTVGFFYKGTPVGQVFVR 130 Query: 570 RANAKARSTRKFNVTVDLSSAGVPAG-ELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXX 746 + A RST+KF V VDL+ A +L++ A GV+ +TS++ + G+VE Sbjct: 131 DSRAGWRSTKKFRVVVDLNLANAQGNPQLASDLNA--GVVQITSQARMAGRVELIFVMKK 188 Query: 747 XXXXXXXCTMQIDVQSRQLRELSC 818 C M+I ++Q+R L C Sbjct: 189 NKSTDMNCNMEIVTATQQIRNLVC 212 >ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao] gi|508776114|gb|EOY23370.1| Uncharacterized protein TCM_015287 [Theobroma cacao] Length = 214 Score = 160 bits (405), Expect = 1e-36 Identities = 88/216 (40%), Positives = 126/216 (58%), Gaps = 1/216 (0%) Frame = +3 Query: 174 EATGIKAYTAPAQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAVILIFAL 353 EA Y N + R DEE +++ +++KR+KC Y+ F VFQT +IL+FAL Sbjct: 2 EAKSQSPYPLVPAANGHERSDEESVAAHSKELKKKKRMKCLLYIVLFAVFQTGIILLFAL 61 Query: 354 TIMKFRTPKFRLRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNGTVEFY 533 T+M+ R PKFR+RS +F F+V A+PSF+++M + +KN NFG +KY+ G V F Sbjct: 62 TVMRIRNPKFRVRSGSFT-TFNVGTE-ASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFA 119 Query: 534 YGGTKVGEAFIPRANAKARSTRKFNVTVDLSSAGVP-AGELSAQFGAAPGVIPLTSRSAV 710 Y GT VG A I +A A+ARST+K +V V+LSS G+P EL A GV+ LTS S + Sbjct: 120 YRGTPVGRATIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISA--GVLTLTSSSKL 177 Query: 711 TGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLRELSC 818 GK+ CTM + + +R +R + C Sbjct: 178 DGKIHLMKVIKKKKSTQMNCTMDVAIDTRTVRNIIC 213 >gb|EYU25166.1| hypothetical protein MIMGU_mgv1a014071mg [Mimulus guttatus] Length = 202 Score = 159 bits (402), Expect = 2e-36 Identities = 92/205 (44%), Positives = 129/205 (62%), Gaps = 1/205 (0%) Frame = +3 Query: 207 AQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFR 386 A N +GR D E A AA + R++ R KCF Y+A FV+FQ VI IF+LT+MK RTPKFR Sbjct: 2 APANGHGRSDAE-AGGAATEPRKKNRTKCFLYIALFVIFQIGVITIFSLTVMKIRTPKFR 60 Query: 387 LRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFI 566 +RSA F+ A + A+PSF+ + AE +KNANFGRYKY+N TV+F+Y GT VG+ + Sbjct: 61 IRSAHLTN-FN-AGTPASPSFSATVNAEFTVKNANFGRYKYRNTTVDFFYRGTPVGQVLV 118 Query: 567 PRANAKARSTRKFNVTVDLSSAGVPAG-ELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXX 743 + A RST+KFNV V+LS A +L++ A GV+ ++S++ + G+VE Sbjct: 119 RDSRAGWRSTKKFNVAVNLSLTNAQANPQLASDLNA--GVVQISSQARMRGRVELIFVMK 176 Query: 744 XXXXXXXXCTMQIDVQSRQLRELSC 818 CTM+I ++QLR + C Sbjct: 177 KNKSTDMNCTMEIVTATQQLRNILC 201 >ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508777615|gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 215 Score = 157 bits (397), Expect = 9e-36 Identities = 81/204 (39%), Positives = 124/204 (60%) Frame = +3 Query: 207 AQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFR 386 A N + R DEE A + +++ +R+KRIK Y+AAF VFQT VILIFALT+M+ + PK R Sbjct: 12 APANGHPRSDEESASLQSKELKRKKRIKYAVYIAAFAVFQTVVILIFALTVMRVKNPKVR 71 Query: 387 LRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFI 566 + + + + + AA SFN+R + ++ +KN NFG YK+ N T+ F Y G VGEA I Sbjct: 72 IGKVTVE-TMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFLYDGVMVGEAII 130 Query: 567 PRANAKARSTRKFNVTVDLSSAGVPAGELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXX 746 P+A A+ARST+K +VTV+++S+ + + + V+ L S++ + GKVE Sbjct: 131 PKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKLKGKVELMKVMKK 190 Query: 747 XXXXXXXCTMQIDVQSRQLRELSC 818 CT+ +V +R L++L C Sbjct: 191 KKSPEMNCTLIFNVSTRSLQDLKC 214 >gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus guttatus] Length = 214 Score = 157 bits (396), Expect = 1e-35 Identities = 84/205 (40%), Positives = 125/205 (60%), Gaps = 1/205 (0%) Frame = +3 Query: 207 AQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFR 386 A N +GR D E AA + +RKR +C Y+ + Q AV+++F+LT+MK R P+FR Sbjct: 13 APANDHGRSDTEAGGAAASELHKRKRTQCLIYIGLLAIIQIAVVIVFSLTVMKIRNPRFR 72 Query: 387 LRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFI 566 +RSA F+ A + A+P+F ++ AE +KNANFGRYKY + TV+F Y GT+VGE F+ Sbjct: 73 IRSAHLTN-FN-AGTPASPAFTGKLNAEFSVKNANFGRYKYMDTTVDFVYRGTRVGEVFV 130 Query: 567 PRANAKARSTRKFNVTVDLSSAGVPAG-ELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXX 743 + A R+T+KFNV VDLS A A +L++ A GV+P++S + ++G VE Sbjct: 131 RESRAGWRTTKKFNVAVDLSLANARANPQLASDLNA--GVVPISSEARMSGSVELLFVLK 188 Query: 744 XXXXXXXXCTMQIDVQSRQLRELSC 818 CTM+I ++Q+R + C Sbjct: 189 KNRSTGLNCTMEIVTATQQIRNILC 213 >ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca subsp. vesca] Length = 222 Score = 148 bits (374), Expect = 4e-33 Identities = 83/226 (36%), Positives = 132/226 (58%), Gaps = 6/226 (2%) Frame = +3 Query: 159 MAEEREATGIKAYTAPAQINPYGRVDEEVAEVA---ARDQRRRKRIKCFSYLAAFVVFQT 329 MAE +EA Y Y R D+E A A A + R +KR++C Y++ F VFQ Sbjct: 1 MAENKEAAATSPYPLMPSAPSYMRSDQEAAASAPPSAEELRHKKRMRCLLYVSIFAVFQV 60 Query: 330 AVILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKY 509 VI +FALT+MK ++PKFR+R+A+ G F+V S + PSFN+ M G+KN NFG ++Y Sbjct: 61 VVITVFALTVMKIKSPKFRVRTASITG-FEV-GSASNPSFNLEMDVHFGVKNTNFGHFEY 118 Query: 510 QNGTVEFYYGGTKVGEAFIPRANAKARSTRKFNV-TVDLSSAGVPAGELSAQFGA--APG 680 ++G V F Y ++G+ + +ARSTRK +V +VDL+S G+PA +++ G+ + G Sbjct: 119 EDGIVVFTYRDVRIGQTNVEEERVRARSTRKVDVSSVDLTSRGLPA---NSRLGSDISTG 175 Query: 681 VIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLRELSC 818 +IP+T S + GK+ CTM++ + ++ ++ + C Sbjct: 176 IIPITISSKLDGKIHLMKIIKKKKSAQMNCTMEVVLATKSVQNVVC 221 >ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca subsp. vesca] Length = 211 Score = 147 bits (371), Expect = 9e-33 Identities = 79/204 (38%), Positives = 115/204 (56%) Frame = +3 Query: 207 AQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFR 386 A N Y R D E ++ + +R+KRIKCF+Y+ F+VFQ AV+ +F LTIMK +TPK R Sbjct: 12 APSNGYTRSDGE--SLSEDELKRKKRIKCFAYIGIFIVFQIAVMTVFGLTIMKVKTPKVR 69 Query: 387 LRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFI 566 L ++ D +S APSF+ ++ +KN N+G YK+ G V F Y G VG + Sbjct: 70 LGTSTLT---DFTSSDTAPSFDTTFNTQIRVKNTNWGPYKFDQGVVTFMYQGMPVGTVVV 126 Query: 567 PRANAKARSTRKFNVTVDLSSAGVPAGELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXX 746 P+ A R T+K NV V L++A +P+ + + GV+ LTS + +TGKVE Sbjct: 127 PKGKAGMRGTKKINVNVRLNTAALPSSSSTLSTELSGGVLTLTSEAKLTGKVELMLIMKK 186 Query: 747 XXXXXXXCTMQIDVQSRQLRELSC 818 CT+QIDV + ++ L C Sbjct: 187 KKSASMNCTIQIDVSGKTVKSLEC 210 >ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] Length = 185 Score = 144 bits (362), Expect = 1e-31 Identities = 71/186 (38%), Positives = 117/186 (62%) Frame = +3 Query: 261 RDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAA 440 + + +R KC +Y+A FVVFQTA+ILIFALT+M+ + PK R + + ++++ Sbjct: 2 KGEGKRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFS--TGNSSS 59 Query: 441 PSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIPRANAKARSTRKFNVTVD 620 P F++R++A++ +KN NFG +KY+N ++ YGG VGEA I +A A+AR T+KF+VT+D Sbjct: 60 PFFDMRLMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTID 119 Query: 621 LSSAGVPAGELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQ 800 +SS+ + + A GV+PL+S + ++GKV CTM I++ +R Sbjct: 120 ISSSKLSTNS-NLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRT 178 Query: 801 LRELSC 818 +++L C Sbjct: 179 VQDLKC 184 >ref|XP_006362675.1| PREDICTED: uncharacterized protein LOC102578608 [Solanum tuberosum] Length = 204 Score = 140 bits (352), Expect = 1e-30 Identities = 85/219 (38%), Positives = 116/219 (52%) Frame = +3 Query: 162 AEEREATGIKAYTAPAQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAVIL 341 AEE + + PA+ P E+ RR+KR K Y+A F+VFQ AV+L Sbjct: 3 AEEEQQLQTNGHAKPAEETPNSTQSNEL--------RRKKRNKILVYVALFIVFQIAVLL 54 Query: 342 IFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNGT 521 F+L IMK RTPKF +RSA FD T SFNI M AEL +KNANFG Y Y+N T Sbjct: 55 FFSLYIMKIRTPKFSVRSATFD-----LMVTENASFNITMNAELSVKNANFGPYNYKNST 109 Query: 522 VEFYYGGTKVGEAFIPRANAKARSTRKFNVTVDLSSAGVPAGELSAQFGAAPGVIPLTSR 701 + FYY +GEAF+ + A +S++KFNV V+LSS E + G + LTS+ Sbjct: 110 IYFYYNDVSIGEAFVYQGKAGFKSSKKFNVIVNLSSK-----ESKLRNDLNSGTLILTSK 164 Query: 702 SAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLRELSC 818 S + GKV+ C + I + + +R++ C Sbjct: 165 SKLEGKVKLIFFMKKKKSTEMNCAIIIGLAGKVVRDIQC 203 >ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] gi|508777613|gb|EOY24869.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] Length = 226 Score = 139 bits (350), Expect = 3e-30 Identities = 74/178 (41%), Positives = 112/178 (62%), Gaps = 1/178 (0%) Frame = +3 Query: 267 QRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPS 446 +R KC +Y+AAFVVFQTA+IL+FALT+M+ R+PK R + + V +S+ PS Sbjct: 4 RREGSNAKCLAYVAAFVVFQTAIILLFALTVMRIRSPKVRFGAVTVESFSTVNSSS--PS 61 Query: 447 FNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIPRANAKARSTRKFNVTVDLS 626 F+++++A++ +KN NFG +KY+N TV YGG VGEA I + A+AR T+KFN+ VD+S Sbjct: 62 FDMKLMAQVAVKNTNFGHFKYENSTVTILYGGMPVGEAAIFKGRARARQTKKFNINVDIS 121 Query: 627 SAGVPA-GELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSR 797 S+ + + L A GV+PL+S++ + GKV CTM I++ +R Sbjct: 122 SSRLSSNSNLGNDINA--GVLPLSSQAKLKGKVHLMKVIKKKKSGEMSCTMGINLATR 177 >ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca subsp. vesca] Length = 211 Score = 137 bits (346), Expect = 7e-30 Identities = 84/222 (37%), Positives = 124/222 (55%), Gaps = 2/222 (0%) Frame = +3 Query: 159 MAEEREATGIKAYTAP-AQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAV 335 MAE+ + T T P A N Y R D E ++ + +R+KRIKCF+Y+ F+VFQ A+ Sbjct: 1 MAEKSQKTH---QTYPLASENGYTRSDGE--SLSEDELKRKKRIKCFAYIGIFIVFQMAI 55 Query: 336 ILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQN 515 +F LT++K +TPK RL ++ DV +ST SF+ ++ +KN N+G YK+ Sbjct: 56 GAVFGLTVLKVKTPKVRLGTSTLS---DVTSSTT--SFSSTFNTQIRVKNTNWGPYKFDQ 110 Query: 516 GTVEFYYGGTKVGEAFIPRANAKARSTRKFNVTVDLSSAGVPAGE-LSAQFGAAPGVIPL 692 G V F Y G VG +P+ A R T+K NV V L++A +P+ LS++ GV+ L Sbjct: 111 GVVTFMYQGAPVGTVVVPKGKAGMRGTKKINVNVSLNTAALPSSSTLSSELSG--GVLTL 168 Query: 693 TSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLRELSC 818 TS + +TGKVE CT+QIDV + ++ L C Sbjct: 169 TSEAKLTGKVELMLIMKKKKSASMNCTIQIDVSGKTVKSLEC 210 >ref|XP_007210712.1| hypothetical protein PRUPE_ppa022983mg [Prunus persica] gi|462406447|gb|EMJ11911.1| hypothetical protein PRUPE_ppa022983mg [Prunus persica] Length = 209 Score = 137 bits (346), Expect = 7e-30 Identities = 75/210 (35%), Positives = 116/210 (55%), Gaps = 2/210 (0%) Frame = +3 Query: 195 YTAPAQINPYGRVDEEVAEVAARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRT 374 Y ++ + R DEE + + + +R+KRIK + Y+ F+V Q V+ +F LT+MK +T Sbjct: 5 YNHREPVHGHPRRDEESTALQSEELKRQKRIKMYKYIVIFIVVQLIVLPVFGLTVMKVKT 64 Query: 375 PKFRLRSAAFDGAFDVAASTAAPSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVG 554 PKFRL + V ++ PSF ++ +KN N+G YK+ GTV F Y G VG Sbjct: 65 PKFRLGNIKVQNLSSVPST---PSFEASFATQIRVKNTNWGPYKFDAGTVTFMYKGVTVG 121 Query: 555 EAFIPRANAKARSTRKFNVTVDLSSAGVPAGELSAQFGA--APGVIPLTSRSAVTGKVEX 728 + +P++ AK RST+K +VTV L+S G+P+ S+ G GV+ L+S+ +TGKV Sbjct: 122 QVVVPKSKAKMRSTKKIDVTVSLNSYGLPS---SSNLGTELKSGVLTLSSKGKLTGKVVL 178 Query: 729 XXXXXXXXXXXXXCTMQIDVQSRQLRELSC 818 CTM D+ ++ L+ L C Sbjct: 179 MLMMKKRKSATMDCTMTFDLSTKTLKTLQC 208 >ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca subsp. vesca] Length = 213 Score = 132 bits (333), Expect = 2e-28 Identities = 67/183 (36%), Positives = 107/183 (58%) Frame = +3 Query: 270 RRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPSF 449 R++KRIKC Y+A F VFQ VI +FALT+MK ++PKFR++S +++A PS Sbjct: 37 RKKKRIKCLIYIAVFAVFQIIVITVFALTVMKIKSPKFRIKSITVQDL--TTSNSANPSL 94 Query: 450 NIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIPRANAKARSTRKFNVTVDLSS 629 ++ VAE+ +KN NFGRYKY ++ F Y GT+VG+A +P+A A+ ++TRK ++ S Sbjct: 95 SMSFVAEVSVKNPNFGRYKYDQTSISFIYEGTQVGDAVVPKATARTKATRK-----EIVS 149 Query: 630 AGVPAGELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLRE 809 V + + G + L++ S + GKV CTM + + S+Q+++ Sbjct: 150 GAVKTVNSNLASDISAGSVTLSTYSKINGKVYLMNMIKKKKSAEMKCTMVVHLSSKQVQD 209 Query: 810 LSC 818 + C Sbjct: 210 IKC 212 >emb|CBI22611.3| unnamed protein product [Vitis vinifera] Length = 297 Score = 132 bits (333), Expect = 2e-28 Identities = 68/190 (35%), Positives = 114/190 (60%) Frame = +3 Query: 249 EVAARDQRRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRLRSAAFDGAFDVAA 428 +V + + RR K + +YL+AF +F+T VI++ +T+M+ R+PKFR R+ + + + + Sbjct: 109 DVESEELRRMKCTRYIAYLSAFALFETIVIMVCVVTLMRIRSPKFRFRAVSIEN-LNYTS 167 Query: 429 STAAPSFNIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIPRANAKARSTRKFN 608 T +PSFNIR A++ +KN NFG +K++N T+ Y G VG+A I +A A+ARST+K N Sbjct: 168 DTTSPSFNIRFNAKVAVKNTNFGHFKFKNSTITLAYRGDHVGDAKISKARARARSTKKMN 227 Query: 609 VTVDLSSAGVPAGELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDV 788 VTVD++S V + A G + LT + + GKV CT++I++ Sbjct: 228 VTVDVTSNNVSSNSNLAS-DINSGFLTLTGQGKLNGKVHLMKVFKKKKSPQMNCTIKINL 286 Query: 789 QSRQLRELSC 818 +++ ++E C Sbjct: 287 ENKVIQEWKC 296 >ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508776108|gb|EOY23364.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 191 Score = 132 bits (332), Expect = 3e-28 Identities = 70/184 (38%), Positives = 111/184 (60%), Gaps = 1/184 (0%) Frame = +3 Query: 270 RRRKRIKCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPSF 449 RR++ IKC +Y+ A V+ QT +IL+F + +M+ R PK RL + ++ +S+++PSF Sbjct: 10 RRKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVEN-LNLNSSSSSPSF 68 Query: 450 NIRMVAELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIPRANAKARSTRKFNVTVDLSS 629 ++ + A++ +KN NFG +K+QN T+ Y GT VGEA I +A A+ARST K NVTV +SS Sbjct: 69 SMNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSVSS 128 Query: 630 AGVPAGE-LSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLR 806 + LS+ G+ G I L+S + + GK+ CTM++ S+Q++ Sbjct: 129 DKMSRNSALSSDVGS--GTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQ 186 Query: 807 ELSC 818 L C Sbjct: 187 NLMC 190 >ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294764 [Fragaria vesca subsp. vesca] Length = 182 Score = 130 bits (327), Expect = 1e-27 Identities = 71/177 (40%), Positives = 105/177 (59%) Frame = +3 Query: 288 KCFSYLAAFVVFQTAVILIFALTIMKFRTPKFRLRSAAFDGAFDVAASTAAPSFNIRMVA 467 KC +Y+A F+VFQ VI IFALT+MK + PK R ++A F+ +STAA SF+ +V Sbjct: 8 KCLAYVAIFIVFQIIVITIFALTVMKIKGPKVRFQTATVSN-FNSDSSTAA-SFSGDLVT 65 Query: 468 ELGLKNANFGRYKYQNGTVEFYYGGTKVGEAFIPRANAKARSTRKFNVTVDLSSAGVPAG 647 + +KN NFG +KY N TV Y G +G A +P AKARSTR+ ++T+ + S+ + +G Sbjct: 66 KFAVKNTNFGHFKYPNSTVSILYEGQVIGTAAVPSQKAKARSTRRTDITISIDSSKL-SG 124 Query: 648 ELSAQFGAAPGVIPLTSRSAVTGKVEXXXXXXXXXXXXXXCTMQIDVQSRQLRELSC 818 + GV+PLTS S + GKVE CTM +++++R + +L C Sbjct: 125 TTNLTTAIGAGVVPLTSESTLKGKVEVMKIIKKNKSGKMSCTMLLNLKTRTVDDLKC 181