BLASTX nr result
ID: Catharanthus22_contig00015043
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00015043
         (841 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578...  150   5e-34
ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citr...  144   4e-32
ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620...  143   9e-32
gb|EMJ11860.1| hypothetical protein PRUPE_ppa022176mg [Prunus pe...  137   5e-30
ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241...  137   6e-30
gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis]    131   3e-28
ref|XP_002509872.1| conserved hypothetical protein [Ricinus comm...  129   2e-27
ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, part...  126   8e-27
gb|EOY24872.1| Late embryogenesis abundant hydroxyproline-rich g...  122   1e-25
ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297...  115   3e-23
gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich g...  111   4e-22
gb|EOY23364.1| Late embryogenesis abundant hydroxyproline-rich g...  110   8e-22
ref|XP_004308331.1| PREDICTED: uncharacterized protein LOC101303...  110   8e-22
gb|EOY24870.1| Late embryogenesis abundant hydroxyproline-rich g...  109   1e-21
gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich g...  107   5e-21
gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich g...  106   1e-20
gb|EOY13742.1| Late embryogenesis abundant hydroxyproline-rich g...  104   3e-20
emb|CBI22611.3| unnamed protein product [Vitis vinifera]             104   4e-20
gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich g...
103 6e-20 gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis] 102 1e-19 >ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578735 [Solanum tuberosum] Length = 223 Score = 150 bits (379), Expect = 5e-34 Identities = 84/223 (37%), Positives = 133/223 (59%), Gaps = 9/223 (4%) Frame = -3 Query: 734 MAEDHHTIPLAPPRIYPRSDEE---SSAMPIPKHINSRRRQKSSKCXXXXXXXXXXXXXX 564 MA+D H IPLAPPR YP+SD+ S ++ + N+ + +KS KC Sbjct: 1 MAQDSHIIPLAPPRAYPKSDQGLNLSKSINYYNNHNNNKNRKSGKCFVYFLSTIVILSII 60 Query: 563 XXXXXXXXLRFNSPRVKLESVEIKNLDYT----SDSLNMSMVAELTIKNKNFGRLKLQNS 396 RF SP +L+ + ++NL ++ S S NM+M E+ + N NFG++ Q+S Sbjct: 61 MLIFSMVFFRFKSPSFELDHINVQNLRFSNSTNSSSFNMNMGGEIIVDNDNFGQINYQDS 120 Query: 395 S-AIVLYGNSTIGSGTINGGNVIKGRETQRMNFTIQVKAN-GLGRENSNFSSEISSGLLK 222 S ++ LY N TIG +N G V + R+++R+ ++Q++ N L N SS+I+S +LK Sbjct: 121 SMSVFLYDNVTIGIANVNVGRV-EARKSKRIGISLQLRTNYQLNYSYGNLSSDINSRMLK 179 Query: 221 LSSYAKLRGEIRVLKKIIRRRTAILNCTMNLKLSSQEIQDLRC 93 L+S+ + RG+++ +K I + +T+I+NCTMNL L+SQ IQDL C Sbjct: 180 LTSFGEFRGKVKAMKIISKHKTSIMNCTMNLNLTSQAIQDLLC 222 >ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citrus clementina] gi|557541714|gb|ESR52692.1| hypothetical protein CICLE_v10023929mg [Citrus clementina] Length = 214 Score = 144 bits (363), Expect = 4e-32 Identities = 90/219 (41%), Positives = 129/219 (58%), Gaps = 5/219 (2%) Frame = -3 Query: 734 MAEDHHTIPLAPPRI-YPRSDEESSAMPIPKHINSRRRQKSSKCXXXXXXXXXXXXXXXX 558 MAE++ IPLAPPR YPRSD+E + P I S+R KSSKC Sbjct: 1 MAEENPKIPLAPPRNEYPRSDQEYA----PAVIESQR--KSSKCLVYVLVTIVTVSAALL 54 Query: 557 XXXXXXLRFNSPRVKLESVEIKNLDY---TSDSLNMSMVAELTIKNKNFGRLKLQNSSAI 387 LR N+P V+LESV +KNL + TS S N+++V ELTI N+N+G + +N S Sbjct: 55 ISASIFLRPNTPEVQLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGS 114 Query: 386 VLYGNSTIGSGTINGGNVIKGRETQRMNFTIQVKANGLGR-ENSNFSSEISSGLLKLSSY 210 V YG+ T+G I G V + RE +R+N T+ V G +N N SS+ +SG++KL+SY Sbjct: 115 VFYGSVTVGDVKIRDGRV-EAREVKRINVTVDVDVRSNGNLDNQNLSSDRNSGIVKLNSY 173 Query: 209 
AKLRGEIRVLKKIIRRRTAILNCTMNLKLSSQEIQDLRC 93 AKL G + + + + +T L+C+MNL L+ + ++DL C Sbjct: 174 AKLHGNVNLFNVLKKTKTPELDCSMNLVLARRAVEDLVC 212 >ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620163 [Citrus sinensis] Length = 214 Score = 143 bits (360), Expect = 9e-32 Identities = 89/219 (40%), Positives = 128/219 (58%), Gaps = 5/219 (2%) Frame = -3 Query: 734 MAEDHHTIPLAPPRI-YPRSDEESSAMPIPKHINSRRRQKSSKCXXXXXXXXXXXXXXXX 558 MAE++ PLAPPR YPRSD+E + P I S+R KSSKC Sbjct: 1 MAEENPKFPLAPPRNEYPRSDQEYA----PAVIESQR--KSSKCLVYVLVTIVTVSAALL 54 Query: 557 XXXXXXLRFNSPRVKLESVEIKNLDY---TSDSLNMSMVAELTIKNKNFGRLKLQNSSAI 387 LR N+P V+LESV +KNL + TS S N+++V ELTI N+N+G + +N S Sbjct: 55 ISASIFLRPNTPEVQLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGS 114 Query: 386 VLYGNSTIGSGTINGGNVIKGRETQRMNFTIQVKANGLGR-ENSNFSSEISSGLLKLSSY 210 V YG+ T+G I G V + RE +R+N T+ V G +N N S+I+SG++KL+SY Sbjct: 115 VFYGSVTVGDVKIRDGRV-EAREVKRINVTVDVDVRSNGNLDNQNLRSDINSGIVKLNSY 173 Query: 209 AKLRGEIRVLKKIIRRRTAILNCTMNLKLSSQEIQDLRC 93 AKL G + + + + +T L+C+MNL L+ + ++DL C Sbjct: 174 AKLHGNVSLFNVLKKTKTPELDCSMNLVLARRAVEDLVC 212 >gb|EMJ11860.1| hypothetical protein PRUPE_ppa022176mg [Prunus persica] Length = 213 Score = 137 bits (345), Expect = 5e-30 Identities = 86/219 (39%), Positives = 124/219 (56%), Gaps = 5/219 (2%) Frame = -3 Query: 734 MAEDHHTI-PLAPPRIYPRSDEESSAMPIPKHINSRRRQKSSKCXXXXXXXXXXXXXXXX 558 MAE + PLAP R++ RSDEE+ + RR++S+KC Sbjct: 1 MAEQESQVWPLAPSRLHRRSDEENPTF------RAIRRERSNKCFVYVFAAIVLQSIFIL 54 Query: 557 XXXXXXLRFNSPRVKLESVEIKNLDYTSD---SLNMSMVAELTIKNKNFGRLKLQNSSAI 387 LR SP L SV +K+L +T+ SLN ++V EL IKNKNFG K + SSA Sbjct: 55 VFALVVLRVKSPGFNLSSVSVKSLKHTTSPTSSLNATLVTELAIKNKNFGEYKFEGSSAS 114 Query: 386 VLYGNSTIGSGTINGGNVIKGRETQRMNFTIQVKANGLGRENSN-FSSEISSGLLKLSSY 210 + YG +G I G V K R T+R++ +I V++N L +E N F E++SG LK+SSY Sbjct: 115 LWYGGFKVGEAKIGKGRV-KARGTRRVSLSIDVRSNRLPQEAKNGFEGEMNSGYLKISSY 173 Query: 209 AKLRGEIRVLKKIIRRRTAILNCTMNLKLSSQEIQDLRC 93 AKL G++ ++K + +R+T NCTM + L S+ ++DL C Sbjct: 174 
AKLTGKVNLMKIMKKRKTIDTNCTMVVVLKSRTVKDLFC 212 >ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241699 [Vitis vinifera] Length = 213 Score = 137 bits (344), Expect = 6e-30 Identities = 82/217 (37%), Positives = 127/217 (58%), Gaps = 3/217 (1%) Frame = -3 Query: 734 MAEDHHTIPLAPPRIYPRSDEESSAMPIPKHINSRRRQKSSKCXXXXXXXXXXXXXXXXX 555 M ED+ PLAP R++ +SDEE K S+ ++SSKC Sbjct: 1 MPEDNQFQPLAPARLHGKSDEEFGVF---KPRASKPPRRSSKCPVYVLAGLVTLAAIALV 57 Query: 554 XXXXXLRFNSPRVKLESVEIKNLDY---TSDSLNMSMVAELTIKNKNFGRLKLQNSSAIV 384 LR +P V+L+SV +KNL + S S N+++ AE++++NKNFG +N +A V Sbjct: 58 FALAVLRVEAPDVELKSVAVKNLTHGTSPSPSFNVTLTAEVSVQNKNFGAFNFENGTATV 117 Query: 383 LYGNSTIGSGTINGGNVIKGRETQRMNFTIQVKANGLGRENSNFSSEISSGLLKLSSYAK 204 LY +G + +V + R+T+RMN T+ V+++ L + N SS+ISSG + L++YA+ Sbjct: 118 LYEGMVVGDEEFSKAHV-ESRKTKRMNVTLDVRSDRLWNDK-NLSSDISSGSVNLTTYAQ 175 Query: 203 LRGEIRVLKKIIRRRTAILNCTMNLKLSSQEIQDLRC 93 + G++RV+K + RR TA +NC+M L L+S IQDL C Sbjct: 176 VTGKVRVMKVVRRRTTARMNCSMTLNLTSSSIQDLVC 212 >gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis] Length = 213 Score = 131 bits (329), Expect = 3e-28 Identities = 73/216 (33%), Positives = 123/216 (56%), Gaps = 4/216 (1%) Frame = -3 Query: 728 EDHHTIPLAPPRIYPRSDEESSAMPIPKHINSRRRQKSSKCXXXXXXXXXXXXXXXXXXX 549 ++ + PLAP R++ RSDEE+ A + R+++++KC Sbjct: 4 QESQSWPLAPMRVHQRSDEENPAF------KALRKERTNKCFVYIFAGIVILGAILLIFA 57 Query: 548 XXXLRFNSPRVKLESVEIKNLDYTSD---SLNMSMVAELTIKNKNFGRLKL-QNSSAIVL 381 LR SP +KL+SV +K+LDY++ SLN +++A + IKN NFG + N+SA+ L Sbjct: 58 LIVLRSKSPEIKLKSVTVKSLDYSTSPWPSLNATLIATVAIKNPNFGPYRFGSNNSAVFL 117 Query: 380 YGNSTIGSGTINGGNVIKGRETQRMNFTIQVKANGLGRENSNFSSEISSGLLKLSSYAKL 201 YG +G I G + T+R+N T++++ + L + ++N ++SSG++ LSSY K Sbjct: 118 YGGGKLGEQRIRQGKAT-AKATKRVNVTVEIRTSRLPQGSNNLGGDLSSGMVNLSSYCKF 176 Query: 200 RGEIRVLKKIIRRRTAILNCTMNLKLSSQEIQDLRC 93 G + ++K R+TA +NC M L L ++ I++LRC Sbjct: 177 TGRVHLIKIFENRKTAEMNCAMTLVLKTKMIKNLRC 212 >ref|XP_002509872.1| conserved hypothetical protein [Ricinus communis] 
gi|223549771|gb|EEF51259.1| conserved hypothetical protein [Ricinus communis] Length = 221 Score = 129 bits (323), Expect = 2e-27 Identities = 78/225 (34%), Positives = 122/225 (54%), Gaps = 11/225 (4%) Frame = -3 Query: 734 MAEDHHTIPLAPPRIYPRSDEESSAMPIPKHINSRRRQKSSKCXXXXXXXXXXXXXXXXX 555 M ED+ +PLAP PRSDEE +A+ N R +++SSKC Sbjct: 1 MVEDNQIVPLAPAETNPRSDEEFAAVKP----NLRLQERSSKCLVYVLAGIVILSAVILV 56 Query: 554 XXXXXLRFNSPRVKLESVEIKNLDYTSDS-----------LNMSMVAELTIKNKNFGRLK 408 LR +P +L V +K+L+Y + S NM++ +EL I+N NFG K Sbjct: 57 FALVVLRPVNPNAELSFVRLKDLNYAAGSGGNGNNVSLPAFNMTLESELKIENSNFGEFK 116 Query: 407 LQNSSAIVLYGNSTIGSGTINGGNVIKGRETQRMNFTIQVKANGLGRENSNFSSEISSGL 228 N+SA V YG +G + G V R+T RMN ++V+++ ++ +S+I+SG+ Sbjct: 117 YDNTSARVFYGGMAVGEAILREGRV-SARDTLRMNVKVEVRSHKYIYNGTDLTSDINSGI 175 Query: 227 LKLSSYAKLRGEIRVLKKIIRRRTAILNCTMNLKLSSQEIQDLRC 93 LKL+S+AK G + +L+ +RR+A ++C+ +L L S+ IQDL C Sbjct: 176 LKLNSHAKFSGRVNLLQIAKKRRSASMDCSFSLDLRSRSIQDLVC 220 >ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, partial [Populus trichocarpa] gi|550346930|gb|ERP65340.1| hypothetical protein POPTR_0001s09980g, partial [Populus trichocarpa] Length = 173 Score = 126 bits (317), Expect = 8e-27 Identities = 65/148 (43%), Positives = 101/148 (68%), Gaps = 3/148 (2%) Frame = -3 Query: 527 SPRVKLESVEIKNLDYTSD---SLNMSMVAELTIKNKNFGRLKLQNSSAIVLYGNSTIGS 357 +PRVKL SV +++L Y ++ S NM++ AE+++KN NF R K +N+S+ LY +G Sbjct: 28 TPRVKLSSVAVEHLSYGNNPIPSFNMTLAAEVSVKNSNFVRFKFENTSSSALYKGMVVGE 87 Query: 356 GTINGGNVIKGRETQRMNFTIQVKANGLGRENSNFSSEISSGLLKLSSYAKLRGEIRVLK 177 + G V R+T+RMN +++ + G E N SS+I+SG+LK++SYA L+G++R L Sbjct: 88 AKLRSGRV-GARKTRRMNIVVKIGSPGSLSEAKNLSSDINSGMLKMNSYATLKGDVR-LF 145 Query: 176 KIIRRRTAILNCTMNLKLSSQEIQDLRC 93 I++ RTA+++C MNL LSS+ IQDL C Sbjct: 146 GIVKNRTAVMSCGMNLNLSSRSIQDLEC 173 >gb|EOY24872.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 213 Score = 122 bits (307), Expect = 1e-25 Identities = 77/217 (35%), Positives = 
116/217 (53%), Gaps = 3/217 (1%) Frame = -3 Query: 734 MAEDHHTIPLAPPRIYPRSDEESSAMPIPKHINSRRRQKSSKCXXXXXXXXXXXXXXXXX 555 M ED PLAP YPRSD E + K S+R++KSSKC Sbjct: 1 MQEDPQAKPLAPVEYYPRSDMEFGGI---KPTASQRKEKSSKCLVYVLVGMVIQGAVLLI 57 Query: 554 XXXXXLRFNSPRVKLESVEIKNLDY---TSDSLNMSMVAELTIKNKNFGRLKLQNSSAIV 384 LR +P V++ SV ++NL Y ++ S N+++V E+T++N NFG K +N++ V Sbjct: 58 FASIVLRARTPDVEIVSVTVRNLKYGNSSAPSFNLTLVTEVTVENSNFGDFKFENTTGTV 117 Query: 383 LYGNSTIGSGTINGGNVIKGRETQRMNFTIQVKANGLGRENSNFSSEISSGLLKLSSYAK 204 G+ +G I G + R T+R+N ++ V + L + N S ISSGLL+L+S+ K Sbjct: 118 WCGSVVVGKMKIPTGRA-QARATERLNVSVDVSSLPLP-DTKNVSCNISSGLLELNSHVK 175 Query: 203 LRGEIRVLKKIIRRRTAILNCTMNLKLSSQEIQDLRC 93 L G++ ++ + RRR +NC M L L+ Q QD C Sbjct: 176 LSGKVSIMNFMKRRRHPEMNCFMTLNLTGQTKQDFPC 212 >ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297644 [Fragaria vesca subsp. vesca] Length = 210 Score = 115 bits (287), Expect = 3e-23 Identities = 70/217 (32%), Positives = 119/217 (54%), Gaps = 3/217 (1%) Frame = -3 Query: 734 MAEDHHTI-PLAPPRIYPRSDEESSAMPIPKHINSRRRQKSSKCXXXXXXXXXXXXXXXX 558 MA+ I PLAP +++ RS+E + I RR++S+KC Sbjct: 1 MADQESQIWPLAPGKLHQRSEENPTFKAI-------RRERSNKCFVYVFSGIVFFCVTVL 53 Query: 557 XXXXXXLRFNSPRVKLESVEIKNLDYTSD--SLNMSMVAELTIKNKNFGRLKLQNSSAIV 384 LR SP ++L SV +K+L YTS S N+S+ ++++KN NFG + ++ Sbjct: 54 VFALLVLRVKSPEIRLRSVTVKSLKYTSSPPSFNVSLSGQMSVKNPNFGDYEFVPTTVSF 113 Query: 383 LYGNSTIGSGTINGGNVIKGRETQRMNFTIQVKANGLGRENSNFSSEISSGLLKLSSYAK 204 LY +GS + G + K ++T+R++F + +++N L + S+I+SG+LKL+ K Sbjct: 114 LYSRGAVGSTKVAKG-LAKVKKTERLSFGVDLRSNKLPEGANTLKSDINSGMLKLTGTGK 172 Query: 203 LRGEIRVLKKIIRRRTAILNCTMNLKLSSQEIQDLRC 93 + G++ + K I +R+T ++CTM L L S+ I+DL C Sbjct: 173 VSGKVTLWKIINKRKTGKMDCTMTLVLKSKTIKDLVC 209 >gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 201 Score = 111 bits (277), Expect = 4e-22 Identities = 55/152 (36%), Positives = 95/152 (62%), Gaps = 4/152 (2%) Frame = -3 Query: 536 
RFNSPRVKLESVEIKNLDYTSD----SLNMSMVAELTIKNKNFGRLKLQNSSAIVLYGNS 369 R +P+ ++ S+ ++++ YTS S NM AE+ +KN NFG K N++ YG Sbjct: 51 RIKNPKFRVRSITVEDIAYTSTPNPPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGV 110 Query: 368 TIGSGTINGGNVIKGRETQRMNFTIQVKANGLGRENSNFSSEISSGLLKLSSYAKLRGEI 189 +G + G K R T++MN T+ + +N + NSN +S+ISSG L L+++ KL G++ Sbjct: 111 QVGEAFVAKGRA-KARSTKKMNVTVDLNSNNIPA-NSNLASDISSGFLTLTTHTKLSGKV 168 Query: 188 RVLKKIIRRRTAILNCTMNLKLSSQEIQDLRC 93 ++K I ++++A +NCTM + L+S+ IQD++C Sbjct: 169 HLMKLIKKKKSAQMNCTMTVNLASRAIQDIKC 200 >gb|EOY23364.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 191 Score = 110 bits (274), Expect = 8e-22 Identities = 61/183 (33%), Positives = 103/183 (56%), Gaps = 4/183 (2%) Frame = -3 Query: 629 RRQKSSKCXXXXXXXXXXXXXXXXXXXXXXLRFNSPRVKLESVEIKNLDYTSDS----LN 462 RR+++ KC +R +P+V+L V ++NL+ S S + Sbjct: 10 RRKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFS 69 Query: 461 MSMVAELTIKNKNFGRLKLQNSSAIVLYGNSTIGSGTINGGNVIKGRETQRMNFTIQVKA 282 M++ A++T+KN NFG K QNS+ + Y + +G TI + R T ++N T+ V + Sbjct: 70 MNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARA-RARSTTKLNVTVSVSS 128 Query: 281 NGLGRENSNFSSEISSGLLKLSSYAKLRGEIRVLKKIIRRRTAILNCTMNLKLSSQEIQD 102 + + R NS SS++ SG + LSS+AKL G+I + K ++++A +NCTM + SS++IQ+ Sbjct: 129 DKMSR-NSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQN 187 Query: 101 LRC 93 L C Sbjct: 188 LMC 190 >ref|XP_004308331.1| PREDICTED: uncharacterized protein LOC101303168 [Fragaria vesca subsp. 
vesca] Length = 215 Score = 110 bits (274), Expect = 8e-22 Identities = 55/152 (36%), Positives = 98/152 (64%), Gaps = 4/152 (2%) Frame = -3 Query: 536 RFNSPRVKLESVEIKNLDY----TSDSLNMSMVAELTIKNKNFGRLKLQNSSAIVLYGNS 369 RF P +KL+S ++NL+ T ++NMS+ E+ IKN+N+G K S+ ++ YG Sbjct: 71 RFKDPNIKLDSTIVENLNVGLVSTPSTINMSLSQEILIKNQNWGGFKYDESAVVISYGGV 130 Query: 368 TIGSGTINGGNVIKGRETQRMNFTIQVKANGLGRENSNFSSEISSGLLKLSSYAKLRGEI 189 T+G GTI+ G+ IK R+++ ++ ++VK +G ++ISSG+L L SY K+ G++ Sbjct: 131 TVGQGTISKGS-IKLRKSKMVSVVVEVKVEEVG-------NDISSGVLGLKSYTKISGKV 182 Query: 188 RVLKKIIRRRTAILNCTMNLKLSSQEIQDLRC 93 ++ + +RRT +NC++N+ L++++IQD C Sbjct: 183 SMVGMVKKRRTGEMNCSLNISLANKKIQDFNC 214 >gb|EOY24870.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 188 Score = 109 bits (273), Expect = 1e-21 Identities = 54/151 (35%), Positives = 96/151 (63%), Gaps = 3/151 (1%) Frame = -3 Query: 536 RFNSPRVKLESVEIKNLDYTSDSL---NMSMVAELTIKNKNFGRLKLQNSSAIVLYGNST 366 R +P +L SV +++L+Y + + NM ++ E+ +KNKNFG + N++A V +G+ Sbjct: 39 RIKTPSARLRSVTVQSLNYNASGVPHFNMRLIMEIAVKNKNFGHFRFDNTTANVTFGSVM 98 Query: 365 IGSGTINGGNVIKGRETQRMNFTIQVKANGLGRENSNFSSEISSGLLKLSSYAKLRGEIR 186 +G G I + R+T+RMN T+ V ++ + E+ +++SSG L L+ A+LRG++ Sbjct: 99 VGDGEIVKSRA-RARKTKRMNVTVDVSSSAVSDEDE-LRTKLSSGTLTLTGVARLRGKVT 156 Query: 185 VLKKIIRRRTAILNCTMNLKLSSQEIQDLRC 93 ++K + +R+TA +NCTM + L+S +QDL C Sbjct: 157 LMKLMKKRKTAEMNCTMTVNLNSHAVQDLDC 187 >gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 215 Score = 107 bits (267), Expect = 5e-21 Identities = 61/219 (27%), Positives = 116/219 (52%), Gaps = 5/219 (2%) Frame = -3 Query: 734 MAE-DHHTIPLAPPRIYPRSDEESSAMPIPKHINSRRRQKSSKCXXXXXXXXXXXXXXXX 558 MAE D PLAP +PRSDEES+++ +R+K K Sbjct: 1 MAEKDQQVHPLAPANGHPRSDEESASL----QSKELKRKKRIKYAVYIAAFAVFQTVVIL 56 Query: 557 XXXXXXLRFNSPRVKLESVEIKNLDYTSD----SLNMSMVAELTIKNKNFGRLKLQNSSA 390 +R +P+V++ V ++ ++ ++ S N+ + ++T+KN NFG K N++ Sbjct: 57 
IFALTVMRVKNPKVRIGKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATM 116 Query: 389 IVLYGNSTIGSGTINGGNVIKGRETQRMNFTIQVKANGLGRENSNFSSEISSGLLKLSSY 210 LY +G I + R T++++ T++V ++ L + SE+SS +L L+S Sbjct: 117 SFLYDGVMVGEAIIPKARA-RARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQ 175 Query: 209 AKLRGEIRVLKKIIRRRTAILNCTMNLKLSSQEIQDLRC 93 AKL+G++ ++K + ++++ +NCT+ +S++ +QDL+C Sbjct: 176 AKLKGKVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKC 214 >gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] Length = 185 Score = 106 bits (264), Expect = 1e-20 Identities = 52/151 (34%), Positives = 97/151 (64%), Gaps = 3/151 (1%) Frame = -3 Query: 536 RFNSPRVKLESVEIKNLDYTSDS---LNMSMVAELTIKNKNFGRLKLQNSSAIVLYGNST 366 R +P+V+ +V ++N + S +M ++A++T+KN NFG K +NSS +LYG Sbjct: 36 RIKNPKVRFGAVTVENFSTGNSSSPFFDMRLMAQVTVKNTNFGHFKYENSSIRILYGGMP 95 Query: 365 IGSGTINGGNVIKGRETQRMNFTIQVKANGLGRENSNFSSEISSGLLKLSSYAKLRGEIR 186 +G TI + R+T++ + TI + ++ L NSN ++I+SG+L LSS AKL G++ Sbjct: 96 VGEATIVKARA-RARQTKKFDVTIDISSSKLST-NSNLGNDIASGVLPLSSEAKLSGKVH 153 Query: 185 VLKKIIRRRTAILNCTMNLKLSSQEIQDLRC 93 ++K I +++++ ++CTM + + ++ +QDL+C Sbjct: 154 LMKVIKKKKSSEMSCTMGINIGTRTVQDLKC 184 >gb|EOY13742.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 192 Score = 104 bits (260), Expect = 3e-20 Identities = 57/153 (37%), Positives = 94/153 (61%), Gaps = 5/153 (3%) Frame = -3 Query: 536 RFNSPRVKLESVEIKNLDYTSDSLNMS----MVAELTIKNKNFGRLKLQNSSAIVLYGNS 369 R +P+V+L V ++NL +S S + S + A++++KN NFG K +NS+ + Y S Sbjct: 40 RIRNPKVRLGGVTVENLRASSSSSSPSFSTKLNAQVSVKNTNFGHFKFKNSTLTISYNGS 99 Query: 368 TIGSGTINGGNVIKGRETQRMNFTIQVKANG-LGRENSNFSSEISSGLLKLSSYAKLRGE 192 +G TI G + + R T++ N TI V +N + R + SS+I SG + LSS+AKL G+ Sbjct: 100 PVGKATIVEG-LARARSTKKFNVTILVSSNNKISRNSDQLSSDIESGTINLSSHAKLEGK 158 Query: 191 IRVLKKIIRRRTAILNCTMNLKLSSQEIQDLRC 93 I + K ++++A +NCTM++ S ++IQ L C Sbjct: 159 IHLFKIFKKKKSAEMNCTMDVNTSLKQIQKLTC 191 >emb|CBI22611.3| unnamed protein product [Vitis 
vinifera] Length = 297 Score = 104 bits (259), Expect = 4e-20 Identities = 52/152 (34%), Positives = 91/152 (59%), Gaps = 4/152 (2%) Frame = -3 Query: 536 RFNSPRVKLESVEIKNLDYTSD----SLNMSMVAELTIKNKNFGRLKLQNSSAIVLYGNS 369 R SP+ + +V I+NL+YTSD S N+ A++ +KN NFG K +NS+ + Y Sbjct: 147 RIRSPKFRFRAVSIENLNYTSDTTSPSFNIRFNAKVAVKNTNFGHFKFKNSTITLAYRGD 206 Query: 368 TIGSGTINGGNVIKGRETQRMNFTIQVKANGLGRENSNFSSEISSGLLKLSSYAKLRGEI 189 +G I+ + R T++MN T+ V +N + NSN +S+I+SG L L+ KL G++ Sbjct: 207 HVGDAKISKARA-RARSTKKMNVTVDVTSNNVS-SNSNLASDINSGFLTLTGQGKLNGKV 264 Query: 188 RVLKKIIRRRTAILNCTMNLKLSSQEIQDLRC 93 ++K ++++ +NCT+ + L ++ IQ+ +C Sbjct: 265 HLMKVFKKKKSPQMNCTIKINLENKVIQEWKC 296 >gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 259 Score = 103 bits (258), Expect = 6e-20 Identities = 52/182 (28%), Positives = 101/182 (55%), Gaps = 3/182 (1%) Frame = -3 Query: 629 RRQKSSKCXXXXXXXXXXXXXXXXXXXXXXLRFNSPRVKLESVEIKNLDY--TSDSLNMS 456 +R+K KC +R +P+ ++ SV + +L + +S S NM Sbjct: 17 KRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSSPSFNMK 76 Query: 455 MVAELTIKNKNFGRLKLQNSSAIVLYGNSTIGSGTINGGNV-IKGRETQRMNFTIQVKAN 279 +A++T+KN NFG K +NS+ Y S +G + G + R T++MN T+ + +N Sbjct: 77 FIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVTMDLNSN 136 Query: 278 GLGRENSNFSSEISSGLLKLSSYAKLRGEIRVLKKIIRRRTAILNCTMNLKLSSQEIQDL 99 G+ + S+ S+++SG L L+S + L G++ ++K I ++++ +NCTM + L+ + ++D+ Sbjct: 137 GVAND-SDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQKLVRDI 195 Query: 98 RC 93 +C Sbjct: 196 KC 197 >gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis] Length = 213 Score = 102 bits (255), Expect = 1e-19 Identities = 51/142 (35%), Positives = 85/142 (59%) Frame = -3 Query: 518 VKLESVEIKNLDYTSDSLNMSMVAELTIKNKNFGRLKLQNSSAIVLYGNSTIGSGTINGG 339 V +E + I N D S SL+M +E+ +KN NFG K SS +Y + +G ++ G Sbjct: 74 VAIEDLTISNSDTNSPSLSMKFDSEIGVKNTNFGEFKFDESSITFVYKGTEVGDASVEKG 133 Query: 338 
NVIKGRETQRMNFTIQVKANGLGRENSNFSSEISSGLLKLSSYAKLRGEIRVLKKIIRRR 159 K R T++MN T +V AN SN ++++ SG L L+S +KL G++ ++K I +++ Sbjct: 134 KA-KARSTKKMNVTAEVNAN------SNLANDVRSGFLTLTSQSKLNGKVHLMKVIKKKK 186 Query: 158 TAILNCTMNLKLSSQEIQDLRC 93 TA +NCT+ + L ++ +QD +C Sbjct: 187 TAEMNCTITINLENKVVQDFKC 208