BLASTX nr result
ID: Rehmannia23_contig00028136
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia23_contig00028136 (869 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579... 143 9e-32 gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich g... 121 4e-25 ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295... 113 8e-23 ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295... 108 3e-21 ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296... 106 1e-20 gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich g... 105 2e-20 ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293... 102 1e-19 ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296... 102 2e-19 gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich g... 101 4e-19 ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303... 100 9e-19 gb|EOY23370.1| Uncharacterized protein TCM_015287 [Theobroma cacao] 98 3e-18 gb|EOY24872.1| Late embryogenesis abundant hydroxyproline-rich g... 98 5e-18 emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera] 98 5e-18 gb|EOY24870.1| Late embryogenesis abundant hydroxyproline-rich g... 96 2e-17 ref|XP_002513626.1| conserved hypothetical protein [Ricinus comm... 96 2e-17 ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302... 95 3e-17 gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis] 95 4e-17 gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich g... 94 6e-17 emb|CBI22611.3| unnamed protein product [Vitis vinifera] 94 6e-17 ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295... 94 8e-17 >ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579067 [Solanum tuberosum] Length = 197 Score = 143 bits (360), Expect = 9e-32 Identities = 80/199 (40%), Positives = 121/199 (60%), Gaps = 4/199 (2%) Frame = +2 Query: 65 QGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXXX 244 Q YPLAPS I+PRSD E+ATNN FQS +Q +KKK L Sbjct: 5 QKYPLAPSNIMPRSDAEFATNN-FQSNNQRRKKK----LRSTFLLTIFLTGIILLFCFTF 59 Query: 245 XXXRTPRVRLDDITVTSDASTGNVRFTGRVSVRNRNFGRYEFDSSLATIRSGN-TNMGQF 421 ++P++R+++I +T+D G + F+ +V +RNRNF RY +DS+L TI + T +G+F Sbjct: 60 LRIKSPKIRIENIRITNDGD-GRINFSAQVFLRNRNFWRYGYDSTLGTINTAEGTTIGRF 118 Query: 422 VIHDARARARSTRRIYVIADLRVPGA---NSSVLDLNIEARLRGKVRLIRVIRRNRSADM 592 VI D R RST+ IYV+ + +P S +L + EA++RGKV++ RV R ++ D+ Sbjct: 119 VIPDGEVRRRSTKTIYVMENFILPSRLNNTSGILPVISEAKIRGKVKVFRVFRWKKNVDL 178 Query: 593 NCTMRINLSTNVVQNLRCE 649 +CTM INL+ + +Q+L C+ Sbjct: 179 SCTMSINLTISAIQDLDCQ 197 >gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 215 Score = 121 bits (303), Expect = 4e-25 Identities = 78/222 (35%), Positives = 125/222 (56%), Gaps = 18/222 (8%) Frame = +2 Query: 38 MAEKYQQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXX 217 MAEK QQ +PLAP+ PRSDEE A+ QS++ +K+KKR+K Sbjct: 1 MAEKDQQV---HPLAPANGHPRSDEESAS---LQSKE-LKRKKRIKYAVYIAAFAVFQTV 53 Query: 218 XXXXXXXXXXXXRTPRVRLDDITV-------TSDASTGNVRFTGRVSVRNRNFGRYEFDS 376 + P+VR+ +TV T A++ N+RF +V+V+N NFG Y+FD+ Sbjct: 54 VILIFALTVMRVKNPKVRIGKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDN 113 Query: 377 SLATIRSGNTNMGQFVIHDARARARSTRRIYVIADLRVPGA-----------NSSVLDLN 523 + + +G+ +I ARARARST+++ V ++ +SSVL LN Sbjct: 114 ATMSFLYDGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLN 173 Query: 524 IEARLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649 +A+L+GKV L++V+++ +S +MNCT+ N+ST +Q+L+C+ Sbjct: 174 SQAKLKGKVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKCK 215 >ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295333 [Fragaria vesca subsp. vesca] Length = 200 Score = 113 bits (283), Expect = 8e-23 Identities = 71/209 (33%), Positives = 111/209 (53%), Gaps = 5/209 (2%) Frame = +2 Query: 38 MAEKYQQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXX 217 MA+K+QQ YPLAPS RSD E S+D++K+KKR+KC Sbjct: 1 MADKHQQV---YPLAPSNGYTRSDGESL------SEDELKRKKRIKCFAYIGIFIVFQMA 51 Query: 218 XXXXXXXXXXXXRTPRVRLDDIT----VTSDASTGNVRFTGRVSVRNRNFGRYEFDSSLA 385 +TP+VRLD + VTS ++ + F ++ V+N N+G Y+FD + Sbjct: 52 VGAVFGLTVLKVKTPKVRLDTTSTLSGVTSSTTSFSSTFNTQIRVKNTNWGPYKFDEGVV 111 Query: 386 TIRSGNTNMGQFVIHDARARARSTRRIYVIADLRVPGANSS-VLDLNIEARLRGKVRLIR 562 T + T +G F + +A R T++I L NSS L L EA+L GKV L+ Sbjct: 112 TFKYQGTPVGTFTVPKGKAGMRGTKKIDASVSLNTAALNSSGELTLTSEAKLTGKVTLMF 171 Query: 563 VIRRNRSADMNCTMRINLSTNVVQNLRCE 649 ++++ +SA MNCT++I++S V+++ C+ Sbjct: 172 IMKKKKSASMNCTIQIDVSGQTVKSVVCK 200 >ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca subsp. vesca] Length = 211 Score = 108 bits (269), Expect = 3e-21 Identities = 71/217 (32%), Positives = 107/217 (49%), Gaps = 13/217 (5%) Frame = +2 Query: 38 MAEKYQQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXX 217 MAEK Q+ Q YPLA RSD E S+D++K+KKR+KC Sbjct: 1 MAEKSQKTHQTYPLASENGYTRSDGESL------SEDELKRKKRIKCFAYIGIFIVFQMA 54 Query: 218 XXXXXXXXXXXXRTPRVRLDDIT---VTSDASTGNVRFTGRVSVRNRNFGRYEFDSSLAT 388 +TP+VRL T VTS ++ + F ++ V+N N+G Y+FD + T Sbjct: 55 IGAVFGLTVLKVKTPKVRLGTSTLSDVTSSTTSFSSTFNTQIRVKNTNWGPYKFDQGVVT 114 Query: 389 IRSGNTNMGQFVIHDARARARSTRRIYVIADLRVPGANSS----------VLDLNIEARL 538 +G V+ +A R T++I V L SS VL L EA+L Sbjct: 115 FMYQGAPVGTVVVPKGKAGMRGTKKINVNVSLNTAALPSSSTLSSELSGGVLTLTSEAKL 174 Query: 539 RGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649 GKV L+ ++++ +SA MNCT++I++S V++L C+ Sbjct: 175 TGKVELMLIMKKKKSASMNCTIQIDVSGKTVKSLECK 211 >ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca subsp. vesca] Length = 211 Score = 106 bits (264), Expect = 1e-20 Identities = 74/219 (33%), Positives = 113/219 (51%), Gaps = 15/219 (6%) Frame = +2 Query: 38 MAEKYQQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXX 217 MAEK Q YPLAP+ RSD E + S+D++K++KR K Sbjct: 1 MAEKTNQ---AYPLAPANGYTRSDGE-----SLVSEDELKRQKRRKLFMYIGIFIVVQII 52 Query: 218 XXXXXXXXXXXXRTPRVRLDDITVTSDASTG-----NVRFTGRVSVRNRNFGRYEFDSSL 382 +TP+VRL I V S S + FT ++ V+N N+G Y+FD+S Sbjct: 53 VMTVFGLTVMKVKTPKVRLGGINVQSLNSVPATPSFDTSFTTQIRVKNTNWGPYKFDAST 112 Query: 383 ATIRSGNTNMGQFVIHDARARARSTRRIYVIADLR---VPGA-------NSSVLDLNIEA 532 AT +GQ I ++AR RST++I V L +P + NS +L L +A Sbjct: 113 ATFMYQGVAVGQVSIPKSKARMRSTKKISVSVILNTNALPSSSTIGTELNSGILTLTSQA 172 Query: 533 RLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649 +L GKV L+ ++++ +SA M+CT+ +LST V++L+C+ Sbjct: 173 KLTGKVELMLIMKKKKSATMDCTIAFDLSTKTVKSLQCK 211 >gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 259 Score = 105 bits (263), Expect = 2e-20 Identities = 59/187 (31%), Positives = 101/187 (54%), Gaps = 17/187 (9%) Frame = +2 Query: 137 QSQDQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXXXXXXRTPRVRL-----DDITVTSDA 301 + ++K+KKRMKCL + P+ R+ DD+T + + Sbjct: 11 EQSKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSS 70 Query: 302 STGNVRFTGRVSVRNRNFGRYEFDSSLATIRSGNTNMGQFVIHD--ARARARSTRRIYVI 475 + N++F +V+V+N NFG Y+F++S T + +G+ ++ ARARARST+++ V Sbjct: 71 PSFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVT 130 Query: 476 ADLRVPGA----------NSSVLDLNIEARLRGKVRLIRVIRRNRSADMNCTMRINLSTN 625 DL G NS L L ++ L GKV L++VI++ +S +MNCTM +NL+ Sbjct: 131 MDLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQK 190 Query: 626 VVQNLRC 646 +V++++C Sbjct: 191 LVRDIKC 197 >ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca subsp. vesca] Length = 211 Score = 102 bits (255), Expect = 1e-19 Identities = 67/211 (31%), Positives = 102/211 (48%), Gaps = 16/211 (7%) Frame = +2 Query: 65 QGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXXX 244 Q YPLAPS RSD E S+D++K+KKR+KC Sbjct: 7 QSYPLAPSNGYTRSDGESL------SEDELKRKKRIKCFAYIGIFIVFQIAVMTVFGLTI 60 Query: 245 XXXRTPRVRLD-----DITVTSDASTGNVRFTGRVSVRNRNFGRYEFDSSLATIRSGNTN 409 +TP+VRL D T + A + + F ++ V+N N+G Y+FD + T Sbjct: 61 MKVKTPKVRLGTSTLTDFTSSDTAPSFDTTFNTQIRVKNTNWGPYKFDQGVVTFMYQGMP 120 Query: 410 MGQFVIHDARARARSTRRIYVIADLRVPGANSS-----------VLDLNIEARLRGKVRL 556 +G V+ +A R T++I V L SS VL L EA+L GKV L Sbjct: 121 VGTVVVPKGKAGMRGTKKINVNVRLNTAALPSSSSTLSTELSGGVLTLTSEAKLTGKVEL 180 Query: 557 IRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649 + ++++ +SA MNCT++I++S V++L C+ Sbjct: 181 MLIMKKKKSASMNCTIQIDVSGKTVKSLECK 211 >ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca subsp. vesca] Length = 211 Score = 102 bits (253), Expect = 2e-19 Identities = 68/219 (31%), Positives = 109/219 (49%), Gaps = 15/219 (6%) Frame = +2 Query: 38 MAEKYQQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXX 217 MAEK Q YPLAP+ RSD E + S+D++K++KR++ Sbjct: 1 MAEKTHQ---AYPLAPANGYTRSDGE-----SLVSKDELKRRKRIRLFTYIGIFIVFQII 52 Query: 218 XXXXXXXXXXXXRTPRVRLDDITVTSDASTG-----NVRFTGRVSVRNRNFGRYEFDSSL 382 +TP+VRL +I V S + FT ++ V+N N+G Y+FD+S Sbjct: 53 VMTVFGLTVMKVKTPKVRLGEINVQDFNSVPATPSFDTTFTTQIRVKNTNWGPYKFDAST 112 Query: 383 ATIRSGNTNMGQFVIHDARARARSTRRIYVIADLRVPGA----------NSSVLDLNIEA 532 T +GQ + +A RST+++ V L G NS VL LN +A Sbjct: 113 VTFMYQGVAVGQVTVPKGKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQA 172 Query: 533 RLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649 +L GKV L+ ++++ +S+ M+C + +LST V++L+C+ Sbjct: 173 KLSGKVELMLIMKKKKSSTMDCMIGFDLSTKTVKSLQCK 211 >gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 201 Score = 101 bits (251), Expect = 4e-19 Identities = 58/200 (29%), Positives = 102/200 (51%), Gaps = 18/200 (9%) Frame = +2 Query: 104 SDEEYATNN-NFQSQDQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXXXXXXRTPRVRLDD 280 +++ Y N + +S ++K+KKRMK + P+ R+ Sbjct: 2 AEQNYQQKNIDMESAAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRS 61 Query: 281 ITVTSDASTG-------NVRFTGRVSVRNRNFGRYEFDSSLATIRSGNTNMGQFVIHDAR 439 ITV A T N++F V+V+N NFG ++FD++ + G +G+ + R Sbjct: 62 ITVEDIAYTSTPNPPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGR 121 Query: 440 ARARSTRRIYVIADLR---VPG-------ANSSVLDLNIEARLRGKVRLIRVIRRNRSAD 589 A+ARST+++ V DL +P +S L L +L GKV L+++I++ +SA Sbjct: 122 AKARSTKKMNVTVDLNSNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQ 181 Query: 590 MNCTMRINLSTNVVQNLRCE 649 MNCTM +NL++ +Q+++C+ Sbjct: 182 MNCTMTVNLASRAIQDIKCQ 201 >ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca subsp. vesca] Length = 213 Score = 100 bits (248), Expect = 9e-19 Identities = 64/216 (29%), Positives = 110/216 (50%), Gaps = 13/216 (6%) Frame = +2 Query: 38 MAEKYQQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXX 217 MAE+ Q+ P A + RSD E ++ S +++KKKR+KCL Sbjct: 1 MAERNQEAYPFAPYANGQAMARSDAE---SSRAHSDHELRKKKRIKCLIYIAVFAVFQII 57 Query: 218 XXXXXXXXXXXXRTPRVRLDDITVTSDASTGN-------VRFTGRVSVRNRNFGRYEFDS 376 ++P+ R+ ITV D +T N + F VSV+N NFGRY++D Sbjct: 58 VITVFALTVMKIKSPKFRIKSITV-QDLTTSNSANPSLSMSFVAEVSVKNPNFGRYKYDQ 116 Query: 377 SLATIRSGNTNMGQFVIHDARARARSTRRIYVIADLRVPGAN------SSVLDLNIEARL 538 + + T +G V+ A AR ++TR+ V ++ +N + + L+ +++ Sbjct: 117 TSISFIYEGTQVGDAVVPKATARTKATRKEIVSGAVKTVNSNLASDISAGSVTLSTYSKI 176 Query: 539 RGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRC 646 GKV L+ +I++ +SA+M CTM ++LS+ VQ+++C Sbjct: 177 NGKVYLMNMIKKKKSAEMKCTMVVHLSSKQVQDIKC 212 >gb|EOY23370.1| Uncharacterized protein TCM_015287 [Theobroma cacao] Length = 214 Score = 98.2 bits (243), Expect = 3e-18 Identities = 65/210 (30%), Positives = 106/210 (50%), Gaps = 17/210 (8%) Frame = +2 Query: 71 YPLAPSTIV-PRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXXXX 247 YPL P+ RSDEE ++ ++KKKKRMKCL Sbjct: 9 YPLVPAANGHERSDEESVAAHS----KELKKKKRMKCLLYIVLFAVFQTGIILLFALTVM 64 Query: 248 XXRTPRVRLDD-----ITVTSDASTG-NVRFTGRVSVRNRNFGRYEFDSSLATIRSGNTN 409 R P+ R+ V ++AS +++ + +V+N NFG ++++ L T T Sbjct: 65 RIRNPKFRVRSGSFTTFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTP 124 Query: 410 MGQFVIHDARARARSTRRIYVIADLR---VPGAN-------SSVLDLNIEARLRGKVRLI 559 +G+ I ARARARST+++ V+ +L +P N + VL L ++L GK+ L+ Sbjct: 125 VGRATIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLM 184 Query: 560 RVIRRNRSADMNCTMRINLSTNVVQNLRCE 649 +VI++ +S MNCTM + + T V+N+ C+ Sbjct: 185 KVIKKKKSTQMNCTMDVAIDTRTVRNIICK 214 >gb|EOY24872.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 213 Score = 97.8 bits (242), Expect = 5e-18 Identities = 67/215 (31%), Positives = 101/215 (46%), Gaps = 16/215 (7%) Frame = +2 Query: 53 QQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXXXXXXX 232 Q+ Q PLAP PRSD E+ SQ +K+K KCL Sbjct: 2 QEDPQAKPLAPVEYYPRSDMEFGGIKPTASQ---RKEKSSKCLVYVLVGMVIQGAVLLIF 58 Query: 233 XXXXXXXRTPRVRLDDITV------TSDASTGNVRFTGRVSVRNRNFGRYEFDSSLATIR 394 RTP V + +TV S A + N+ V+V N NFG ++F+++ T+ Sbjct: 59 ASIVLRARTPDVEIVSVTVRNLKYGNSSAPSFNLTLVTEVTVENSNFGDFKFENTTGTVW 118 Query: 395 SGNTNMGQFVIHDARARARSTRRIYVIAD---LRVPGA-------NSSVLDLNIEARLRG 544 G+ +G+ I RA+AR+T R+ V D L +P +S +L+LN +L G Sbjct: 119 CGSVVVGKMKIPTGRAQARATERLNVSVDVSSLPLPDTKNVSCNISSGLLELNSHVKLSG 178 Query: 545 KVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649 KV ++ ++R R +MNC M +NL+ Q+ CE Sbjct: 179 KVSIMNFMKRRRHPEMNCFMTLNLTGQTKQDFPCE 213 >emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera] Length = 186 Score = 97.8 bits (242), Expect = 5e-18 Identities = 58/179 (32%), Positives = 98/179 (54%), Gaps = 13/179 (7%) Frame = +2 Query: 152 MKKKKRMKCLXXXXXXXXXXXXXXXXXXXXXXXXRTPRVRLDDITVTSDASTGN---VRF 322 +++KK +KCL R P+VR+ I+V + + N + Sbjct: 8 VRRKKSLKCLAYVAAFVVFQTGIILLFVLLVLKIRDPKVRIASISVENQHFSTNSFSMDL 67 Query: 323 TGRVSVRNRNFGRYEFDSSLATIRSGNTNMGQFVIHDARARARSTRRIYV---IADLRVP 493 RV+V+N NFG ++FD+S ATI T +G+ I ARAR+RST+R + I+ +V Sbjct: 68 KARVTVKNTNFGHFKFDNSTATISYFGTAVGEATILKARARSRSTKRFNITVPISSSKVN 127 Query: 494 G-------ANSSVLDLNIEARLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649 NS VL+L+ A+L GK+ L ++ ++ +SA+M+CTM ++ +T+ ++NL C+ Sbjct: 128 NHRQLRRDLNSGVLNLSSTAKLSGKIHLFKIFKKKKSAEMSCTMELHTNTSSIENLSCK 186 >gb|EOY24870.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 188 Score = 95.9 bits (237), Expect = 2e-17 Identities = 57/184 (30%), Positives = 93/184 (50%), Gaps = 16/184 (8%) Frame = +2 Query: 146 DQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXXXXXXRTPRVRLDDITV------TSDAST 307 ++ K+ + MKC +TP RL +TV S Sbjct: 5 EKYKRMQNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASGVPH 64 Query: 308 GNVRFTGRVSVRNRNFGRYEFDSSLATIRSGNTNMGQFVIHDARARARSTRRIYVIADLR 487 N+R ++V+N+NFG + FD++ A + G+ +G I +RARAR T+R+ V D+ Sbjct: 65 FNMRLIMEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVS 124 Query: 488 VPGAN----------SSVLDLNIEARLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQN 637 + S L L ARLRGKV L++++++ ++A+MNCTM +NL+++ VQ+ Sbjct: 125 SSAVSDEDELRTKLSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQD 184 Query: 638 LRCE 649 L CE Sbjct: 185 LDCE 188 >ref|XP_002513626.1| conserved hypothetical protein [Ricinus communis] gi|223547534|gb|EEF49029.1| conserved hypothetical protein [Ricinus communis] Length = 217 Score = 95.5 bits (236), Expect = 2e-17 Identities = 68/226 (30%), Positives = 111/226 (49%), Gaps = 22/226 (9%) Frame = +2 Query: 38 MAEKYQQQAQGYPLAPSTIVP----RSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXX 205 MAEK Q AP+ +V RSDEE T Q+++ ++KKKRMKC+ Sbjct: 1 MAEKEQ--------APTPLVADGQTRSDEESGTAGTAQTKE-LRKKKRMKCIAFVVAFTI 51 Query: 206 XXXXXXXXXXXXXXXXRTPRVRL------DDITVTSDASTG--NVRFTGRVSVRNRNFGR 361 + P+ R+ D V +DA+ N+ + V+N NFG Sbjct: 52 FQTGIILLFVFTVLRFKDPKFRVRSASFDDTFHVGTDAAAPSFNLTMNTQFGVKNTNFGH 111 Query: 362 YEFDSSLATIRSGNTNMGQFVIHDARARARSTRRIYVIADLRVP----------GANSSV 511 +++++S T T +G + ARARARSTR+ I LR +S Sbjct: 112 FKYETSTVTFEYRGTVVGLVNVDKARARARSTRKFDAIVVLRTDRLPDGFELSSDISSGK 171 Query: 512 LDLNIEARLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649 + L+ +RL G++ L++VI++ +SA+MNCTM +++ T +Q++ C+ Sbjct: 172 IPLSSSSRLDGEIHLMKVIKKKKSAEMNCTMNVDIQTRTLQDIVCK 217 >ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca subsp. vesca] Length = 222 Score = 95.1 bits (235), Expect = 3e-17 Identities = 71/223 (31%), Positives = 110/223 (49%), Gaps = 19/223 (8%) Frame = +2 Query: 38 MAEKYQQQAQG-YPLAPST-IVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXX 211 MAE + A YPL PS RSD+E A + S ++++ KKRM+CL Sbjct: 1 MAENKEAAATSPYPLMPSAPSYMRSDQEAAASAP-PSAEELRHKKRMRCLLYVSIFAVFQ 59 Query: 212 XXXXXXXXXXXXXXRTP--RVRLDDITVTSDASTGNVRFTGRVSV----RNRNFGRYEFD 373 ++P RVR IT S N F + V +N NFG +E++ Sbjct: 60 VVVITVFALTVMKIKSPKFRVRTASITGFEVGSASNPSFNLEMDVHFGVKNTNFGHFEYE 119 Query: 374 SSLATIRSGNTNMGQFVIHDARARARSTRRIYVIA-DLRVPG--ANS--------SVLDL 520 + + +GQ + + R RARSTR++ V + DL G ANS ++ + Sbjct: 120 DGIVVFTYRDVRIGQTNVEEERVRARSTRKVDVSSVDLTSRGLPANSRLGSDISTGIIPI 179 Query: 521 NIEARLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649 I ++L GK+ L+++I++ +SA MNCTM + L+T VQN+ C+ Sbjct: 180 TISSKLDGKIHLMKIIKKKKSAQMNCTMEVVLATKSVQNVVCK 222 >gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis] Length = 212 Score = 94.7 bits (234), Expect = 4e-17 Identities = 65/220 (29%), Positives = 113/220 (51%), Gaps = 17/220 (7%) Frame = +2 Query: 38 MAEKYQQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXXXXX 217 MAE+YQQ YPLAP+ PRSDEE ++N +++ +K++KR+K Sbjct: 1 MAERYQQV---YPLAPANGHPRSDEE---SSNLDAKE-LKRRKRIKLAIYAFIFTASQII 53 Query: 218 XXXXXXXXXXXXRTPRVRLDDI-------TVTSDASTGNVRFTGRVSVRNRNFGRYEFDS 376 ++P++RL D T + + ++ FT ++ V+N N+G Y+FD+ Sbjct: 54 VTLVFVLVVMRVKSPKLRLSDKFEFQTIETNSGSKPSFDISFTTQLRVKNTNWGPYKFDN 113 Query: 377 SLATIRSGNTNMGQFVIHDARARARSTRRIYVIADLRVPGANSS----------VLDLNI 526 + A +GQ VI +A RST+++ V L ++ +L L Sbjct: 114 TTAAFAYEGETVGQVVIPKGKAGMRSTKKVPVSVSLSSSQLKNNTNLGSELSGGILTLRC 173 Query: 527 EARLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRC 646 A++ GKV+L+ ++++ +SA+MNCT+ I++ V NL+C Sbjct: 174 TAKMTGKVKLMLIMKKKKSANMNCTINIHVKEKTV-NLKC 212 >gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] Length = 185 Score = 94.0 bits (232), Expect = 6e-17 Identities = 55/149 (36%), Positives = 88/149 (59%), Gaps = 17/149 (11%) Frame = +2 Query: 254 RTPRVRLDDITVTSDASTGN-------VRFTGRVSVRNRNFGRYEFDSSLATIRSGNTNM 412 + P+VR +TV + STGN +R +V+V+N NFG +++++S I G + Sbjct: 38 KNPKVRFGAVTV-ENFSTGNSSSPFFDMRLMAQVTVKNTNFGHFKYENSSIRILYGGMPV 96 Query: 413 GQFVIHDARARARSTRRIYVIADLRVPGAN----------SSVLDLNIEARLRGKVRLIR 562 G+ I ARARAR T++ V D+ + S VL L+ EA+L GKV L++ Sbjct: 97 GEATIVKARARARQTKKFDVTIDISSSKLSTNSNLGNDIASGVLPLSSEAKLSGKVHLMK 156 Query: 563 VIRRNRSADMNCTMRINLSTNVVQNLRCE 649 VI++ +S++M+CTM IN+ T VQ+L+C+ Sbjct: 157 VIKKKKSSEMSCTMGINIGTRTVQDLKCK 185 >emb|CBI22611.3| unnamed protein product [Vitis vinifera] Length = 297 Score = 94.0 bits (232), Expect = 6e-17 Identities = 65/232 (28%), Positives = 114/232 (49%), Gaps = 17/232 (7%) Frame = +2 Query: 2 FLSSNKQKIYKTMAEKYQQQAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCL 181 FL+ ++ K K MA+K QQ +P+ P+ ++D E +++++ K + + Sbjct: 78 FLNFSRAKA-KKMAQKKQQV---HPIEPTGGPAKTDVE---------SEELRRMKCTRYI 124 Query: 182 XXXXXXXXXXXXXXXXXXXXXXXXRTPRVR-----LDDITVTSDASTG--NVRFTGRVSV 340 R+P+ R ++++ TSD ++ N+RF +V+V Sbjct: 125 AYLSAFALFETIVIMVCVVTLMRIRSPKFRFRAVSIENLNYTSDTTSPSFNIRFNAKVAV 184 Query: 341 RNRNFGRYEFDSSLATIRSGNTNMGQFVIHDARARARSTRRIYVIADLR----------V 490 +N NFG ++F +S T+ ++G I ARARARST+++ V D+ Sbjct: 185 KNTNFGHFKFKNSTITLAYRGDHVGDAKISKARARARSTKKMNVTVDVTSNNVSSNSNLA 244 Query: 491 PGANSSVLDLNIEARLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRC 646 NS L L + +L GKV L++V ++ +S MNCT++INL V+Q +C Sbjct: 245 SDINSGFLTLTGQGKLNGKVHLMKVFKKKKSPQMNCTIKINLENKVIQEWKC 296 >ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295630 [Fragaria vesca subsp. vesca] Length = 212 Score = 93.6 bits (231), Expect = 8e-17 Identities = 63/222 (28%), Positives = 108/222 (48%), Gaps = 18/222 (8%) Frame = +2 Query: 38 MAEKYQQ---QAQGYPLAPSTIVPRSDEEYATNNNFQSQDQMKKKKRMKCLXXXXXXXXX 208 MAEK++ +GY + DE+ T FQS++++K++KR+K Sbjct: 1 MAEKFKHALASVKGY-------ATKKDEQLPT---FQSEEELKRQKRIKLFTYIGIFIGF 50 Query: 209 XXXXXXXXXXXXXXXRTPRVRL-----DDITVTSDASTGNVRFTGRVSVRNRNFGRYEFD 373 +TP+VRL ++ + + + F ++ ++N N+G Y+FD Sbjct: 51 QIIVMTVFGLTVMKVKTPKVRLGATNVQNLNFVPTSPSFDTTFATQIRIKNTNWGPYKFD 110 Query: 374 SSLATIRSGNTNMGQFVIHDARARARSTRRIYVIADLR---VPGAN-------SSVLDLN 523 + AT +GQ ++A RST++I L +P + S VL L Sbjct: 111 AGTATFMYQGVAVGQVSFPKSKAGMRSTKKINAEVSLNSNEIPSTSNLGSELSSGVLTLT 170 Query: 524 IEARLRGKVRLIRVIRRNRSADMNCTMRINLSTNVVQNLRCE 649 EA+L GKV L+ ++++ +SA MNCTM+++LST +Q L C+ Sbjct: 171 SEAKLTGKVELMLIMKKKKSATMNCTMKLDLSTKTIQALECK 212