BLASTX nr result
ID: Mentha27_contig00015644
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha27_contig00015644 (1493 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citr... 199 3e-48 ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-... 198 5e-48 ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun... 192 4e-46 ref|XP_002865912.1| hydroxyproline-rich glycoprotein family prot... 190 1e-45 ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein... 189 3e-45 ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr... 189 3e-45 ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm... 188 5e-45 ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260... 188 6e-45 ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264... 188 6e-45 emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera] 188 6e-45 ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Caps... 186 2e-44 ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583... 186 2e-44 ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun... 184 1e-43 ref|XP_002867602.1| hydroxyproline-rich glycoprotein family prot... 181 6e-43 ref|XP_006401825.1| hypothetical protein EUTSA_v10013563mg [Eutr... 180 2e-42 ref|XP_006283737.1| hypothetical protein CARUB_v10004810mg [Caps... 179 4e-42 ref|NP_194292.2| hydroxyproline-rich glycoprotein family protein... 177 1e-41 emb|CAA18164.1| putative protein [Arabidopsis thaliana] gi|72694... 172 5e-40 ref|XP_007038760.1| Hydroxyproline-rich glycoprotein family prot... 163 2e-37 ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu... 162 5e-37 >ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citrus clementina] gi|557523850|gb|ESR35217.1| hypothetical protein CICLE_v10004813mg [Citrus clementina] Length = 500 Score = 199 bits (505), Expect = 3e-48 Identities = 150/501 (29%), Positives = 193/501 (38%), Gaps = 150/501 (29%) Frame = +1 Query: 97 MRSVHDSXXXXXXXXXXXXXXXXXXQPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVI 276 M SVHDS +P+ +QKRRWG WS+YWCFGS+K SKRI H +++ Sbjct: 1 MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLV 60 Query: 277 SQETVNGVSTSTCYAQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 456 + V G + Q Sbjct: 61 PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLNSLSVN 120 Query: 457 XELT---AQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFA 627 A +F IGPYA ETQLV+PPVFS+FTT+PS+A TPPPE+VQ+TTPSSPEVPFA Sbjct: 121 AYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFA 180 Query: 628 QLLSSSLAQKWRN-------------------------------------SGAPSPFYDK 696 QLL+SSL + RN SG SPF D+ Sbjct: 181 QLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDR 240 Query: 697 RADIDLPMVEAPKFVGYEHFMNYKW----------------------------------- 771 +D APK +G+EHF KW Sbjct: 241 HPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSR 300 Query: 772 -------------GSRLGSGALTPNGKEPPSQECNILENNQNFEVVESEN---------- 882 GSRLGSG+LTP+G P S++ + NQ EV N Sbjct: 301 LGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRD-GFVRENQISEVASLANSDNGTKSDEH 359 Query: 883 --NHRVSFELRGEDIPISIMKETTKGKDLATEVALSFQTQTSVRSD-------------- 1014 +HRVSFEL GE++ + ++ + E + +R D Sbjct: 360 IIDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCP 419 Query: 1015 -------------DGRD-------RTASFGSSKDFNFNNT----------------NDEV 1086 DG + R+ + GS K+FNF+NT N+ V Sbjct: 420 EESSNRMPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWANENV 479 Query: 1087 AIELGPQKNWNFFPMLQSGGS 1149 E P NW FFPMLQS S Sbjct: 480 GKESKPSNNWTFFPMLQSEAS 500 >ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-like [Citrus sinensis] Length = 500 Score = 198 bits (504), Expect = 5e-48 Identities = 150/501 (29%), Positives = 193/501 (38%), Gaps = 150/501 (29%) Frame = +1 Query: 97 MRSVHDSXXXXXXXXXXXXXXXXXXQPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVI 276 M SVHDS +P+ +QKRRWG WS+YWCFGS+K SKRI H +++ Sbjct: 1 MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLL 60 Query: 277 SQETVNGVSTSTCYAQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 456 + V G + Q Sbjct: 61 PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPSSATQSPAGLLSLNSLSVN 120 Query: 457 XELT---AQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFA 627 A +F IGPYA ETQLV+PPVFS+FTT+PS+A TPPPE+VQ+TTPSSPEVPFA Sbjct: 121 AYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFA 180 Query: 628 QLLSSSLAQKWRN-------------------------------------SGAPSPFYDK 696 QLL+SSL + RN SG SPF D+ Sbjct: 181 QLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDR 240 Query: 697 RADIDLPMVEAPKFVGYEHFMNYKW----------------------------------- 771 +D APK +G+EHF KW Sbjct: 241 HPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSR 300 Query: 772 -------------GSRLGSGALTPNGKEPPSQECNILENNQNFEVVESEN---------- 882 GSRLGSG+LTP+G P S++ + NQ EV N Sbjct: 301 LGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRD-GFVRENQISEVASLANSDNGTKSDEH 359 Query: 883 --NHRVSFELRGEDIPISIMKETTKGKDLATEVALSFQTQTSVRSD-------------- 1014 +HRVSFEL GE++ + ++ + E + +R D Sbjct: 360 IIDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCP 419 Query: 1015 -------------DGRD-------RTASFGSSKDFNFNNT----------------NDEV 1086 DG + R+ + GS K+FNF+NT N+ V Sbjct: 420 EESSNRMPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWANENV 479 Query: 1087 AIELGPQKNWNFFPMLQSGGS 1149 E P NW FFPMLQS S Sbjct: 480 GKESKPSNNWTFFPMLQSEAS 500 >ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] gi|462415503|gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] Length = 499 Score = 192 bits (487), Expect = 4e-46 Identities = 157/503 (31%), Positives = 203/503 (40%), Gaps = 154/503 (30%) Frame = +1 Query: 97 MRSVHDSXXXXXXXXXXXXXXXXXXQPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVI 276 MRSV+ S QP+ V KRRWG WS+YWCFG +K+ KRIGH +++ Sbjct: 1 MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFGPHKN-KRIGHAVLV 59 Query: 277 SQ--------ETVNGVSTSTCYAQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 432 + ++ +TST Sbjct: 60 PEPVVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLSAN 119 Query: 433 XXXXXXXXXELTAQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSP 612 A IF IGPYA ETQLVSPPVFS+F T+PS+AP TPPPE+VQ+TTPSSP Sbjct: 120 AYSPGGP-----ASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSP 174 Query: 613 EVPFAQLLSSSLAQKWR-------------------------------------NSGAPS 681 EVPFAQLL+SSL + R NSG S Sbjct: 175 EVPFAQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSS 234 Query: 682 PFYDKRADIDLPMVEAPKFVGYEHFMNYKW------------------------------ 771 PF D+ ++ M EAPK G++HF KW Sbjct: 235 PFPDRHPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGN 294 Query: 772 ------------------GSRLGSGALTPNGKEPPSQECNILEN-----------NQNFE 864 GSRLGSG LTP+G P S++ +LEN + Sbjct: 295 ELGSRLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQ 354 Query: 865 VVESENNHRVSFELRGEDIPISIMKETT--------KGKDLATEV-----ALSFQT---- 993 VE+ +HRVSFEL GED+ + + K +A+E ALS + Sbjct: 355 TVETVFDHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHC 414 Query: 994 -----QTSVR-----SDDGRD------RTASFGSSKDFNFNNTNDEV------------- 1086 ++S R S +G D R+ + GS+KDFNF+NT EV Sbjct: 415 EFSVEESSSRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDNTKAEVPNKPNIGSEWWAN 474 Query: 1087 ----AIELGPQKNWNFFPMLQSG 1143 A E P +W FFP+LQ G Sbjct: 475 KNVAAKESKPCNDWTFFPILQPG 497 >ref|XP_002865912.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. lyrata] gi|297311747|gb|EFH42171.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. lyrata] Length = 437 Score = 190 bits (483), Expect = 1e-45 Identities = 147/439 (33%), Positives = 196/439 (44%), Gaps = 88/439 (20%) Frame = +1 Query: 97 MRSVHDSXXXXXXXXXXXXXXXXXXQPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVI 276 MR+V++S QPS VQK RWG WS+Y CFG+ K++KRIG+ +++ Sbjct: 1 MRNVNNSVETVNAAATAIVTAESRVQPSSVQKGRWGKCWSLYSCFGTQKNNKRIGNAVLV 60 Query: 277 SQETVNGVSTSTCYAQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 456 + +GV T Sbjct: 61 PEPVASGVPVVTVQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPGGQLSLTSNTFS 120 Query: 457 XELTAQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPE-AVQMTTPSSPEVPFAQL 633 + +F +GPYA+ETQ V+PPVFS+F T+PS+AP TPPPE +V +TTPSSPEVPFAQL Sbjct: 121 PKEPQSVFTVGPYANETQPVTPPVFSAFVTEPSTAPYTPPPESSVHITTPSSPEVPFAQL 180 Query: 634 LSSSLA-----------QKW-----------------------------RNSGAPSPFYD 693 L+SSL QK+ NSG SP+ Sbjct: 181 LTSSLELTRRNSSSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPG 240 Query: 694 KRADIDLPMVEAPKFVGYEHFMNYKWGSRLG--------------SGALTPNGKE----- 816 K ++ + E PKF+G+EHF KWGSR G SGALTPNG E Sbjct: 241 KSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGLASGALTPNGLEIISGN 300 Query: 817 -PPSQECNILENNQNFEVVESEN----------NHRVSFELRGEDIPISIMKETTKGKD- 960 PS L +NQ EV N +HRVSFEL GED+ + + + D Sbjct: 301 LTPSNTTWPL-HNQISEVASLANSDHGSEVIVADHRVSFELTGEDVARCLASKLNRSHDR 359 Query: 961 ------LATEVALSFQTQTSV--RSDDGRD--------RTASFGSSKDFNFNNTNDEVAI 1092 + TE + S + ++ RS D ++S GSSK+F F+NT DE I Sbjct: 360 MNNNDRIETEESSSTDLRRNMEKRSADRETEQQRIQKLNSSSIGSSKEFKFDNTKDE-NI 418 Query: 1093 ELGPQKNWNFFPMLQSGGS 1149 E +W+FFP L+SG S Sbjct: 419 EKVAGNSWSFFPGLRSGVS 437 >ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|10177409|dbj|BAB10540.1| unnamed protein product [Arabidopsis thaliana] gi|40823427|gb|AAR92282.1| At5g52430 [Arabidopsis thaliana] gi|56381929|gb|AAV85683.1| At5g52430 [Arabidopsis thaliana] gi|110738650|dbj|BAF01250.1| hypothetical protein [Arabidopsis thaliana] gi|332008830|gb|AED96213.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 438 Score = 189 bits (480), Expect = 3e-45 Identities = 141/416 (33%), Positives = 186/416 (44%), Gaps = 90/416 (21%) Frame = +1 Query: 172 QPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNGVSTSTCYAQXXXXXXXXX 351 QPS QK RWG WS+Y CFG+ K++KRIG+ +++ + +GV T Sbjct: 27 QPSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPEPVTSGVPVVTVQNSATSTTVVLP 86 Query: 352 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELTAQIFLIGPYADETQLVSPPVF 531 + +F +GPYA+ETQ V+PPVF Sbjct: 87 FIAPPSSPASFLQSDPSSVSHSPVGPLSLTSNTFSPKEPQSVFTVGPYANETQPVTPPVF 146 Query: 532 SSFTTQPSSAPLTPPPEA-VQMTTPSSPEVPFAQLLSSSLAQKWR--------------- 663 S+F T+PS+AP TPPPE+ V +TTPSSPEVPFAQLL+SSL R Sbjct: 147 SAFITEPSTAPYTPPPESSVHITTPSSPEVPFAQLLTSSLELTRRDSTSGMNQKFSSSHY 206 Query: 664 -------------------------NSGAPSPFYDKRADIDLPMVEAPKFVGYEHFMNYK 768 NSG SP+ K ++ + E PKF+G+EHF K Sbjct: 207 EFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPGKSPMVEFRIGEPPKFLGFEHFTARK 266 Query: 769 WGSRLG--------------SGALTPNGKEPPSQECNILEN-------NQNFEVVESEN- 882 WGSR G SGALTPNG E S N+ N NQ EV N Sbjct: 267 WGSRFGSGSITPVGHGSGLASGALTPNGPEIVSG--NLTPNNTTWPLQNQISEVASLANS 324 Query: 883 ---------NHRVSFELRGEDIPISIMKETTKGKD-------LATEVALSFQTQTSVRSD 1014 +HRVSFEL GED+ + + + D + TE + S + ++ Sbjct: 325 DHGSEVMVADHRVSFELTGEDVARCLASKLNRSHDRMNNNDRIETEESSSTDIRRNIEKR 384 Query: 1015 DGRDR-----------TASFGSSKDFNFNNTNDEVAIELGPQKNWNFFPMLQSGGS 1149 G DR ++S GSSK+F F+NT DE IE +W+FFP L+SG S Sbjct: 385 SG-DRENEQHRIQKLSSSSIGSSKEFKFDNTKDE-NIEKVAGNSWSFFPGLRSGVS 438 >ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|222858882|gb|EEE96429.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 507 Score = 189 bits (480), Expect = 3e-45 Identities = 150/473 (31%), Positives = 194/473 (41%), Gaps = 153/473 (32%) Frame = +1 Query: 178 SPVQKRRWGGWWSMYWCFGSY---KHSKRIGHTLVISQETVNGVSTSTCYAQXXXXXXXX 348 S VQKRRWGG WS+YWCFGS+ K+SKRIGH +++ + V G +S+ Q Sbjct: 31 SSVQKRRWGGCWSLYWCFGSHGSHKNSKRIGHAVLVPEPEVPGAVSSSTENQTQSTPILL 90 Query: 349 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELT---AQIFLIGPYADETQLVS 519 A IF IGPYA ETQLV+ Sbjct: 91 PFIAPPSSPASFLQSDPPSSTQSPAGLLSLTSLSANAYSPRGPASIFAIGPYAHETQLVT 150 Query: 520 PPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSLAQKWR------------ 663 PPVFS+FTT+PS+AP TPPPE+VQ+TTPSSPEVPFAQLL+SSL + R Sbjct: 151 PPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGPNQKFSLSH 210 Query: 664 -------------------------NSGAPSPFYDKRADIDLPMVEAPKFVGYEHFMNYK 768 NSG SPF D+ ++ M EAPK +G+EHF K Sbjct: 211 YEFQSYHLYPGSPGGQIISPGSAISNSGTSSPFPDRHPMLEFRMGEAPKLLGFEHFSTRK 270 Query: 769 WGSRLGSGALTPNGKEP------------------------------------------- 819 WGSRLGSG+LTP+ Sbjct: 271 WGSRLGSGSLTPDATPDGMGLSRLGSGTVTPDGMGLSRLCSGTATPDGAGLRSRLGSGTL 330 Query: 820 ------PSQECNILENNQNFEV---VESEN---------NHRVSFELRGEDI-------- 921 P+ + L NQ EV SEN +HRVSFEL GE++ Sbjct: 331 TPDCFVPASQIGFLLENQISEVASLTNSENGSKTEENVVHHRVSFELSGEEVARCLEIKS 390 Query: 922 ----------PISIMKE-TTKGKDLAT---------EVALSFQTQTSVRSDDG----RDR 1029 P M E +G LA E + + S +++ + R Sbjct: 391 VASTRTFPEYPQDTMPEDPVRGDRLAMNGERCLQNGEASSEMPEKNSEETEEDHVYRKHR 450 Query: 1030 TASFGSSKDFNFNNTNDEVA-----------------IELGPQKNWNFFPMLQ 1137 + + GS K+FNF+N+ EV+ E P +W FFP+LQ Sbjct: 451 SITLGSIKEFNFDNSKGEVSDKPAISSEWWANETIAGKEARPANSWTFFPLLQ 503 >ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis] gi|223547583|gb|EEF49078.1| conserved hypothetical protein [Ricinus communis] Length = 510 Score = 188 bits (478), Expect = 5e-45 Identities = 150/469 (31%), Positives = 198/469 (42%), Gaps = 147/469 (31%) Frame = +1 Query: 172 QPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNGVSTSTCYAQXXXXXXXXX 351 QP+ VQKRRWGG WS+YWCFGS+K +KRIGH ++ + V G ++ Q Sbjct: 40 QPTTVQKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPEVQGAVVTSAENQSQSTAITVP 98 Query: 352 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELT---AQIFLIGPYADETQLVSP 522 A IF IGPYA ETQLV+P Sbjct: 99 FIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLVTP 158 Query: 523 PVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSLAQKWR------------- 663 P FS+FTT+PS+AP TPPPE+VQ+TTPSSPEVPFAQLL+SSL + R Sbjct: 159 PAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALSHY 218 Query: 664 ------------------------NSGAPSPFYDKRADIDLPMVEAPKFVGYEHFMNYKW 771 NSG SPF D+ ++ M EAPK +G+EHF KW Sbjct: 219 EFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPILEFRMGEAPKLLGFEHFTTRKW 278 Query: 772 GSRLGSGALTPNG----------------------------------------------- 810 GSRLGSG +TP+G Sbjct: 279 GSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGSGSLTPD 338 Query: 811 -KEPPSQECNILEN--NQNFEVVESEN---------NHRVSFELRGEDI----------- 921 P S++ LEN ++ + SEN +HRVSFEL GE++ Sbjct: 339 AVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLESKSLAS 398 Query: 922 --------PISIMKETTK-GKDLATEVALSFQTQTSVRSDD------------GRDRTAS 1038 P S+ ++ K GK L T+ L +TS + + + R+ + Sbjct: 399 CRAFSECPPDSMAEDQIKSGKMLMTDENLP-TGETSGETPEKPSGEMEEEHCYRKHRSIT 457 Query: 1039 FGSSKDFNFNNT---------------NDEVA-IELGPQKNWNFFPMLQ 1137 GS K+FNF+N+ N+ +A E P NW FFP+LQ Sbjct: 458 LGSIKEFNFDNSKEVPDKPSINSEWWANETIAGKEARPANNWTFFPLLQ 506 >ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum lycopersicum] Length = 470 Score = 188 bits (477), Expect = 6e-45 Identities = 134/344 (38%), Positives = 165/344 (47%), Gaps = 116/344 (33%) Frame = +1 Query: 466 TAQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSS 645 TA IF IGPYA ETQLVSPPVFS+FTT+PS+A TPPPE V MTTP SPEVPFAQLL+SS Sbjct: 127 TASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSS 186 Query: 646 LAQKWR------------------------------------NSGAPSPFYDKRADIDLP 717 LA+ R NSG SPF K I+ Sbjct: 187 LARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFR 246 Query: 718 MVEAPKFVGYEHFMNYKWGSR----------------------------LGSGALTPNGK 813 E PKF+GYEHF KWGSR LGSG +TPNG Sbjct: 247 KGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGG 306 Query: 814 EPPSQECNILEN-----------NQNFEVVESENNHRVSFELRGEDIPISIMKE------ 942 EPPS++ +LEN + E+ E+ +HRVSFEL ED+P KE Sbjct: 307 EPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKEPVMSHS 366 Query: 943 -TTKGKDLATEVALSFQTQTSV-----------RSDDGRD------RTASFGSSKDFNFN 1068 T D++ +A ++ +S+ S+ G D R +FGSSKDF+F+ Sbjct: 367 QPTLPMDVSNLLASEMRSGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGSSKDFDFD 426 Query: 1069 N----------------TNDEVAI-ELGPQKNWNFFPMLQSGGS 1149 N T+D+ A+ E G Q NW FFP+LQ G S Sbjct: 427 NVKIEVLEKDSIDCEWWTSDKAAVKESGIQNNWTFFPVLQPGVS 470 Score = 70.1 bits (170), Expect = 2e-09 Identities = 27/42 (64%), Positives = 33/42 (78%) Frame = +1 Query: 172 QPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNG 297 QPS VQKRRWG WS+YWCFGS+KHSKRIGH +++ + G Sbjct: 26 QPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPVAPG 67 >ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera] Length = 448 Score = 188 bits (477), Expect = 6e-45 Identities = 127/325 (39%), Positives = 162/325 (49%), Gaps = 100/325 (30%) Frame = +1 Query: 469 AQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSL 648 A +F IGPYA ETQLVSPPVFS+F T+PS+AP TPPPE+VQ+TTPSSPEVPFAQLL+SSL Sbjct: 128 ASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSL 187 Query: 649 AQKWR----------------------------------NSGAPSPFYDKRADIDLPMVE 726 + R NSG SPF D+R P+VE Sbjct: 188 DRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISPISNSGTSSPFPDRR-----PIVE 242 Query: 727 APKFVGYEHFMNYKWGSRLGSGALTPNGKEPPSQECNILENNQNFEVVESEN-------- 882 APK +G+EHF +WGSRLGSG+LTP+G P S++ +LE NQ EV N Sbjct: 243 APKLLGFEHFSTRRWGSRLGSGSLTPDGAGPASRDSFLLE-NQISEVASLANSESGSQNG 301 Query: 883 ----NHRVSFELRGEDIPISIMK------ETTKG--KDLATE------------------ 972 +HRVSFEL GED+ + + K ET + +D+ E Sbjct: 302 ETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCC 361 Query: 973 ---VALSFQTQTSVRSDDGRDRTA-------SFGSSKDFNFNNTNDEVAIE--------- 1095 V + + + S +G + GS K+FNF+NT EV+ + Sbjct: 362 EFCVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWW 421 Query: 1096 ---------LGPQKNWNFFPMLQSG 1143 GPQ NW FFP+LQ G Sbjct: 422 VNEKVVGKGTGPQTNWTFFPLLQPG 446 Score = 66.2 bits (160), Expect = 4e-08 Identities = 30/67 (44%), Positives = 40/67 (59%) Frame = +1 Query: 97 MRSVHDSXXXXXXXXXXXXXXXXXXQPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVI 276 MRSV++S QP+ VQKRRWG S+YWCFGS++HSKRIGH +++ Sbjct: 1 MRSVNNSVETINAAATAIVSAESRVQPTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLV 60 Query: 277 SQETVNG 297 + V G Sbjct: 61 PEPMVPG 67 >emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera] Length = 385 Score = 188 bits (477), Expect = 6e-45 Identities = 127/325 (39%), Positives = 162/325 (49%), Gaps = 100/325 (30%) Frame = +1 Query: 469 AQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSL 648 A +F IGPYA ETQLVSPPVFS+F T+PS+AP TPPPE+VQ+TTPSSPEVPFAQLL+SSL Sbjct: 65 ASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSL 124 Query: 649 AQKWR----------------------------------NSGAPSPFYDKRADIDLPMVE 726 + R NSG SPF D+R P+VE Sbjct: 125 DRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISPISNSGTSSPFPDRR-----PIVE 179 Query: 727 APKFVGYEHFMNYKWGSRLGSGALTPNGKEPPSQECNILENNQNFEVVESEN-------- 882 APK +G+EHF +WGSRLGSG+LTP+G P S++ +LE NQ EV N Sbjct: 180 APKLLGFEHFSTRRWGSRLGSGSLTPDGAGPASRDSFLLE-NQISEVASLANSESGSQNG 238 Query: 883 ----NHRVSFELRGEDIPISIMK------ETTKG--KDLATE------------------ 972 +HRVSFEL GED+ + + K ET + +D+ E Sbjct: 239 ETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCC 298 Query: 973 ---VALSFQTQTSVRSDDGRDRTA-------SFGSSKDFNFNNTNDEVAIE--------- 1095 V + + + S +G + GS K+FNF+NT EV+ + Sbjct: 299 EFCVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWW 358 Query: 1096 ---------LGPQKNWNFFPMLQSG 1143 GPQ NW FFP+LQ G Sbjct: 359 VNEKVVGKGTGPQTNWTFFPLLQPG 383 >ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Capsella rubella] gi|482549191|gb|EOA13385.1| hypothetical protein CARUB_v10026425mg [Capsella rubella] Length = 437 Score = 186 bits (473), Expect = 2e-44 Identities = 142/441 (32%), Positives = 191/441 (43%), Gaps = 90/441 (20%) Frame = +1 Query: 97 MRSVHDSXXXXXXXXXXXXXXXXXXQPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVI 276 MR+V++S QPS VQKRRW WS+Y CFGS K++KRIG+ +++ Sbjct: 1 MRNVNNSVETVNAAATAIITAESRVQPSSVQKRRWAKCWSLYSCFGSQKNNKRIGNAVLV 60 Query: 277 SQETVNGVSTSTCYAQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 456 + +GV T Sbjct: 61 PEPVASGVPVVTVQNSATSTTVVLPFIAPPSSPASFLPSDPSSVSHSPVGPLSLTSNTFS 120 Query: 457 XELTAQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLL 636 + +F +GPYA+ETQ V+PPVFS+F T+PS+AP TPPPE+ TPSSPEVPFAQLL Sbjct: 121 PKEPQSVFTVGPYANETQPVTPPVFSAFITEPSTAPYTPPPES--SVTPSSPEVPFAQLL 178 Query: 637 SSSLAQKWR---------------------------------------NSGAPSPFYDKR 699 +SSL R NSG SP+ K Sbjct: 179 TSSLELTRRDSSGINQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPGKS 238 Query: 700 ADIDLPMVEAPKFVGYEHFMNYKWGSRLGSGALTP--------------------NGKEP 819 ++ + E PKF+G+EHF KWGSR GSG++TP +G Sbjct: 239 PMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGMASGALTPNAPEIISGNLT 298 Query: 820 PSQECNILENNQNFEVVESEN----------NHRVSFELRGEDIPISIMKETTKGKD--- 960 PS L+ NQ EV N +HRVSFEL GED+ + + + D Sbjct: 299 PSNTTWPLQ-NQISEVASLANSDHGSEVIVADHRVSFELTGEDVARCLASKLNRSHDRMN 357 Query: 961 ----LATEVAL--------SFQTQTSVRSDDGRDR------TASFGSSKDFNFNNTNDEV 1086 +ATE + SFQ S + + + ++S GSSK+F F+NT DE Sbjct: 358 NNDRIATEESSSTDRGRRNSFQKIESTENRETEQQRIQKLSSSSIGSSKEFKFDNTKDE- 416 Query: 1087 AIELGPQKNWNFFPMLQSGGS 1149 IE +W+FFP L+SG S Sbjct: 417 NIEKVAGNSWSFFPGLRSGVS 437 >ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum] Length = 470 Score = 186 bits (472), Expect = 2e-44 Identities = 133/344 (38%), Positives = 161/344 (46%), Gaps = 116/344 (33%) Frame = +1 Query: 466 TAQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSS 645 TA IF IGPYA ETQLVSPPVFS+FTT+PS+A TPPPE V MTTP SPEVPFAQLL+SS Sbjct: 127 TASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPFAQLLTSS 186 Query: 646 LAQKWR------------------------------------NSGAPSPFYDKRADIDLP 717 LA+ R NSG SPF K I+ Sbjct: 187 LARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFR 246 Query: 718 MVEAPKFVGYEHFMNYKWGSR----------------------------LGSGALTPNGK 813 E PKF+GYEHF KWGSR LGSG +TPNG Sbjct: 247 KGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGG 306 Query: 814 EPPSQECNILE-----------NNQNFEVVESENNHRVSFELRGEDIPISIMKE------ 942 EPPS++ +LE ++ E+ E +HRVSFEL GED+P KE Sbjct: 307 EPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREKEPVMSHS 366 Query: 943 -TTKGKDLATEVALSFQTQTSV-----------RSDDGRD------RTASFGSSKDFNFN 1068 T D++ +A ++ +S+ S+ G D R +FGSSKDF+F+ Sbjct: 367 QQTLPMDVSNLLANEMKSGSSMAEEKTYGSPRKASESGEDQCHRKHRNITFGSSKDFDFD 426 Query: 1069 NTNDEV-----------------AIELGPQKNWNFFPMLQSGGS 1149 N EV E G Q NW FFP+LQ G S Sbjct: 427 NVKIEVLEKDSIDCEWWTSDKAAGKESGIQNNWTFFPVLQPGVS 470 Score = 70.1 bits (170), Expect = 2e-09 Identities = 27/42 (64%), Positives = 33/42 (78%) Frame = +1 Query: 172 QPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNG 297 QPS VQKRRWG WS+YWCFGS+KHSKRIGH +++ + G Sbjct: 26 QPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPAAPG 67 >ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica] gi|462404864|gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica] Length = 455 Score = 184 bits (466), Expect = 1e-43 Identities = 137/420 (32%), Positives = 178/420 (42%), Gaps = 98/420 (23%) Frame = +1 Query: 184 VQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNGVSTSTCYAQXXXXXXXXXXXXX 363 VQKRRWG WWSMYWCFG +H KRIGH +++ + T G Sbjct: 37 VQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPETTDRGGDAPRAENPIQTPSIVLPFVAP 96 Query: 364 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELTAQIFLIGPYADETQLVSPPVFSSFT 543 IF IGPYA ETQLVSPPVFS+FT Sbjct: 97 PSSPASFLQSEPPSATQSPAGFFSLTASMYSPSGPTSIFAIGPYAHETQLVSPPVFSTFT 156 Query: 544 TQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSS--------------------------- 642 T+PS+AP TPPPE+V +TTPSSPEVPFAQLL Sbjct: 157 TEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQLYPGS 216 Query: 643 ------SLAQKWRNSGAPSPFYDKRAD------IDLPMVEAPKFVGYEHFMNYKWGSRLG 786 S + SG SPF D ++ + PK + + WGSRLG Sbjct: 217 PVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGDPPKLLNLDILSTRDWGSRLG 276 Query: 787 SGALTPNGKEPPSQECNILENNQNFEVV---ESEN---------NHRVSFELRGEDI--- 921 SG++TP+G + S + L Q EVV S N NHRVSFEL E++ Sbjct: 277 SGSVTPDGAKSTSSD-GFLLKPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEEVIRC 335 Query: 922 ----PISIMK---------ETTKGKDLATEVALSFQTQTSVRSDDG-------------- 1020 P+++ + E + K+ ++V S S+D Sbjct: 336 VEKKPVALAEAVSTSLEDTEKAQSKEDPSKVVSSSICPVGETSNDAAEKAVADGEEAQLH 395 Query: 1021 -RDRTASFGSSKDFNFNN---------------TNDEV-AIELGPQKNWNFFPMLQSGGS 1149 + R+ + GS K+FNF+N N++V A E GP KNW+FFPM+Q G S Sbjct: 396 PKQRSITLGSVKEFNFDNPDGGDSGNSIGSDWWANEKVDAKENGPTKNWSFFPMMQPGVS 455 >ref|XP_002867602.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. lyrata] gi|297313438|gb|EFH43861.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. lyrata] Length = 421 Score = 181 bits (460), Expect = 6e-43 Identities = 137/412 (33%), Positives = 184/412 (44%), Gaps = 88/412 (21%) Frame = +1 Query: 172 QPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNGVSTSTCYAQXXXXXXXXX 351 QPS V K+ WG WWS+Y CFGS K++KRIGH +++ + +G + + Q Sbjct: 22 QPSSVHKK-WGSWWSLYLCFGSKKNNKRIGHAVLVPEPAASGAAVAP--VQNSSSNSTSM 78 Query: 352 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELTAQIFLIGPYADETQLVSPPVF 531 F IGPYA ETQ V+PPVF Sbjct: 79 FMPFIAPPSSPASFLPSGPPSVSHTPDPGLLCSLTVNEPPSAFTIGPYAHETQPVTPPVF 138 Query: 532 SSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSLAQKWRN--------------- 666 S+FTT+PS+AP TPPPE +PSSPEVPFAQLL+SSL + RN Sbjct: 139 SAFTTEPSTAPFTPPPE-----SPSSPEVPFAQLLTSSLEKARRNIGGGMHHKFSAAHYE 193 Query: 667 -------------------SGAPSPFYDKRADIDLPMVEAPKFVGYEHFMNYKW------ 771 SG SP+ K + I+ + E PKF+G+EHF KW Sbjct: 194 FKSHQVYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGS 253 Query: 772 --------GSRLGSGALTPNGKEPPSQECNILENNQNFEVVESENN-------------- 885 GSRLGSGALTP+G P E ++L+ +Q EV N+ Sbjct: 254 GSITPAGQGSRLGSGALTPDGLTP--LEGSLLD-SQITEVASLANSDHGSSRHNDEAAVV 310 Query: 886 -HRVSFELRGEDIPISIMK--------ETTKGKDLATEVALSFQTQTSVRSDDGRD-RTA 1035 HRVSFEL GED+ + E G+ L +T S+ + R+ Sbjct: 311 PHRVSFELTGEDVARCLASKLNRSGSHEKASGEHLRPN---GCKTSGETESEQSQKLRSF 367 Query: 1036 SFGSSKDFNFNNTNDEVAIEL----------------GPQKNWNFFPMLQSG 1143 S GSSK+F F+NTN+E+ ++ P+ +W FFP+L+SG Sbjct: 368 STGSSKEFKFDNTNEEMIEKVRSEWWANEKVAGKGDHSPRNSWTFFPVLRSG 419 >ref|XP_006401825.1| hypothetical protein EUTSA_v10013563mg [Eutrema salsugineum] gi|557102915|gb|ESQ43278.1| hypothetical protein EUTSA_v10013563mg [Eutrema salsugineum] Length = 440 Score = 180 bits (456), Expect = 2e-42 Identities = 144/441 (32%), Positives = 188/441 (42%), Gaps = 90/441 (20%) Frame = +1 Query: 97 MRSVHDSXXXXXXXXXXXXXXXXXXQPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVI 276 MR+V++S QPS V KRRW WS+ CFGS K++KRIG+ +++ Sbjct: 1 MRNVNNSVETVNAAATAIVTAESRVQPSSVPKRRWRNCWSLNSCFGSQKNNKRIGNAMLV 60 Query: 277 SQETV--NGVSTSTCYAQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 450 E V G T Sbjct: 61 VPEPVATGGAPVVTVQNSATSSSIVLPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTSNT 120 Query: 451 XXXELTAQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPE-AVQMTTPSSPEVPFA 627 +F IGPYA+ETQ V+PPVFS+F T+PS+AP TPPPE +V +TTPSSPEVPFA Sbjct: 121 FSTTEPQSVFTIGPYANETQPVTPPVFSAFITEPSTAPFTPPPESSVHITTPSSPEVPFA 180 Query: 628 QLLSSSLA----------QKW-----------------------------RNSGAPSPFY 690 QLL+SSL QK+ NSG SP+ Sbjct: 181 QLLTSSLELTRRNSSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYP 240 Query: 691 DKRADIDLPMVEAPKFVGYEHFMNYKWGSRL----------GSGALTPNGKEPPSQECNI 840 K ++ + E PKF+G+EHF KWGSR GSGALTPNG S+ Sbjct: 241 GKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGALTPNGPGMVSESLTP 300 Query: 841 LENN--------QNFEVVESEN----------NHRVSFELRGEDIPISIMKETTKGKDLA 966 NN Q EV N +HRVSFEL GED+ + + + D Sbjct: 301 NNNNNTTWPLTSQVSEVASLANSDHGSEVVAADHRVSFELTGEDVARCLASKLNRSHDRM 360 Query: 967 T---------EVALSFQTQTSVRSDDGRDR-----------TASFGSSKDFNFNNTNDEV 1086 ++SFQ + + DR ++S GSSK+F F+NT +E Sbjct: 361 NNDERVETDERRSISFQKRENNVERVSGDREIEQQRIHKLSSSSIGSSKEFKFDNTKEE- 419 Query: 1087 AIELGPQKNWNFFPMLQSGGS 1149 IE +W+FFP L+SG S Sbjct: 420 NIEKVAGNSWSFFPGLRSGVS 440 >ref|XP_006283737.1| hypothetical protein CARUB_v10004810mg [Capsella rubella] gi|482552442|gb|EOA16635.1| hypothetical protein CARUB_v10004810mg [Capsella rubella] Length = 444 Score = 179 bits (453), Expect = 4e-42 Identities = 134/424 (31%), Positives = 180/424 (42%), Gaps = 99/424 (23%) Frame = +1 Query: 172 QPSP--VQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNGVSTSTCYAQXXXXXXX 345 QPS + K++WG WWS+YWCFGS K++KRIGH ++ + +GV+ + Sbjct: 27 QPSSSLLHKKKWGSWWSLYWCFGSKKNNKRIGHAVLAPEPAASGVAVAPVQNSSSSNSTS 86 Query: 346 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELTAQIFLIGPYADETQLVSPP 525 F IGPYA ETQ V+PP Sbjct: 87 IFMPFIAPPSSPASFLPSGPPSVSHTPDPCRLRCSLLVNEPPSAFAIGPYAHETQPVTPP 146 Query: 526 VFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSLAQKWRN------------- 666 VFS+FTT+PS+AP TPPPE +PSSPEVPFAQLL+SSL + RN Sbjct: 147 VFSAFTTEPSTAPFTPPPE-----SPSSPEVPFAQLLTSSLERARRNSSGGMNHKFSAAH 201 Query: 667 ---------------------SGAPSPFYDKRADIDLPMVEAPKFVGYEHFMNYKW---- 771 SG SP+ K + I+ + E PKF+G+EHF KW Sbjct: 202 YEFKSHQVYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRF 261 Query: 772 ----------GSRLGSGALTPNGKEPPSQEC---------NILENNQNFEVVESENN--- 885 GSRLGSGALTP+G + + L ++Q EV N+ Sbjct: 262 GSGSITPAGQGSRLGSGALTPDGGGGMGSKIASGALTPLEDSLLDSQVSEVASLANSDHG 321 Query: 886 ------------HRVSFELRGEDIPISIMK--------ETTKGKDLATEVALSFQTQTSV 1005 HRVSFEL GED+ + E G+ L +T Sbjct: 322 SSRHNDEAVVVAHRVSFELTGEDVARCLASKLNRSGSHERASGEHLRPN---GCKTSGET 378 Query: 1006 RSDDGRD-RTASFGSSKDFNFNNTNDEVAIEL----------------GPQKNWNFFPML 1134 S+ + R+ S GSSK+F F+NT +E ++ P +W FFP+L Sbjct: 379 ESEQSQKLRSFSLGSSKEFKFDNTEEETIEKVRSEWWANEKVAGKGDHSPANSWTFFPVL 438 Query: 1135 QSGG 1146 +S G Sbjct: 439 RSSG 442 >ref|NP_194292.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|26449762|dbj|BAC42004.1| unknown protein [Arabidopsis thaliana] gi|28951011|gb|AAO63429.1| At4g25620 [Arabidopsis thaliana] gi|332659684|gb|AEE85084.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 449 Score = 177 bits (448), Expect = 1e-41 Identities = 138/431 (32%), Positives = 186/431 (43%), Gaps = 107/431 (24%) Frame = +1 Query: 172 QPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNGVSTSTCYAQXXXXXXXXX 351 QPS VQK+R G WWS+YWCFGS K++KRIGH +++ + +G + + Q Sbjct: 27 QPSSVQKKR-GSWWSLYWCFGSKKNNKRIGHAVLVPEPAASGAAVAP--VQNSSSNSTSI 83 Query: 352 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELTAQIFLIGPYADETQLVSPPVF 531 F IGPYA ETQ V+PPVF Sbjct: 84 FMPFIAPPSSPASFLPSGPPSASHTPDPGLLCSLTVNEPPSAFTIGPYAHETQPVTPPVF 143 Query: 532 SSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSLAQKWRN--------------- 666 S+FTT+PS+AP TPPPE +PSSPEVPFAQLL+SSL + RN Sbjct: 144 SAFTTEPSTAPFTPPPE-----SPSSPEVPFAQLLTSSLERARRNSGGGMNQKFSAAHYE 198 Query: 667 -------------------SGAPSPFYDKRADIDLPMVEAPKFVGYEHFMNYKW------ 771 SG SP+ K + I+ + E PKF+G+EHF KW Sbjct: 199 FKSCQVYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGS 258 Query: 772 --------GSRLGSGALTPNGKE-------PPSQECNI-------------LENNQNFEV 867 GSRLGSGALTP+G + P E I L ++Q EV Sbjct: 259 GSITPAGQGSRLGSGALTPDGSKLTSGVVTPNGAETVIRMSYGNLTPLEGSLLDSQISEV 318 Query: 868 VESENN---------------HRVSFELRGEDIPISIMK--------ETTKGKDLATEVA 978 N+ HRVSFEL GED+ + E G+ L Sbjct: 319 ASLANSDHGSSRHNDEALVVPHRVSFELTGEDVARCLASKLNRSGSHEKASGEHLRPNCC 378 Query: 979 LSFQTQTSVRSDDGRDRTASFGSSKDFNFNNTNDEVAIEL----------------GPQK 1110 + S +S + R+ S GS+K+F F++TN+E+ ++ P+ Sbjct: 379 KTSGETESEQSQ--KLRSFSTGSNKEFKFDSTNEEMIEKIRSEWWANEKVAGKGDHSPRN 436 Query: 1111 NWNFFPMLQSG 1143 +W FFP+L+SG Sbjct: 437 SWTFFPVLRSG 447 >emb|CAA18164.1| putative protein [Arabidopsis thaliana] gi|7269412|emb|CAB81372.1| putative protein [Arabidopsis thaliana] Length = 424 Score = 172 bits (435), Expect = 5e-40 Identities = 133/426 (31%), Positives = 182/426 (42%), Gaps = 107/426 (25%) Frame = +1 Query: 187 QKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNGVSTSTCYAQXXXXXXXXXXXXXX 366 QK++ G WWS+YWCFGS K++KRIGH +++ + +G + + Q Sbjct: 6 QKKKRGSWWSLYWCFGSKKNNKRIGHAVLVPEPAASGAAVAP--VQNSSSNSTSIFMPFI 63 Query: 367 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELTAQIFLIGPYADETQLVSPPVFSSFTT 546 F IGPYA ETQ V+PPVFS+FTT Sbjct: 64 APPSSPASFLPSGPPSASHTPDPGLLCSLTVNEPPSAFTIGPYAHETQPVTPPVFSAFTT 123 Query: 547 QPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSLAQKWRN-------------------- 666 +PS+AP TPPPE +PSSPEVPFAQLL+SSL + RN Sbjct: 124 EPSTAPFTPPPE-----SPSSPEVPFAQLLTSSLERARRNSGGGMNQKFSAAHYEFKSCQ 178 Query: 667 --------------SGAPSPFYDKRADIDLPMVEAPKFVGYEHFMNYKW----------- 771 SG SP+ K + I+ + E PKF+G+EHF KW Sbjct: 179 VYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITP 238 Query: 772 ---GSRLGSGALTPNGKE-------PPSQECNI-------------LENNQNFEVVESEN 882 GSRLGSGALTP+G + P E I L ++Q EV N Sbjct: 239 AGQGSRLGSGALTPDGSKLTSGVVTPNGAETVIRMSYGNLTPLEGSLLDSQISEVASLAN 298 Query: 883 N---------------HRVSFELRGEDIPISIMK--------ETTKGKDLATEVALSFQT 993 + HRVSFEL GED+ + E G+ L + Sbjct: 299 SDHGSSRHNDEALVVPHRVSFELTGEDVARCLASKLNRSGSHEKASGEHLRPNCCKTSGE 358 Query: 994 QTSVRSDDGRDRTASFGSSKDFNFNNTNDEVAIEL----------------GPQKNWNFF 1125 S +S + R+ S GS+K+F F++TN+E+ ++ P+ +W FF Sbjct: 359 TESEQSQ--KLRSFSTGSNKEFKFDSTNEEMIEKIRSEWWANEKVAGKGDHSPRNSWTFF 416 Query: 1126 PMLQSG 1143 P+L+SG Sbjct: 417 PVLRSG 422 >ref|XP_007038760.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao] gi|508776005|gb|EOY23261.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao] Length = 540 Score = 163 bits (412), Expect = 2e-37 Identities = 120/324 (37%), Positives = 156/324 (48%), Gaps = 99/324 (30%) Frame = +1 Query: 469 AQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSL 648 A IF IGPYA ETQLV+PPVFS+ T +PS+AP TPPPE++Q+TTPSSPEVPFAQLL+SSL Sbjct: 216 ASIFAIGPYAHETQLVTPPVFSALTPEPSTAPFTPPPESIQLTTPSSPEVPFAQLLASSL 275 Query: 649 AQKWR----NSGAPSPFYDKRADIDLPMVEAPKFVGYEHFMNYKW--------------- 771 R NSG SPF D+R ++ M EAPK +G+E+ KW Sbjct: 276 ESARRKAISNSGTSSPFPDRRPILEFHMGEAPKLLGFENLTTRKWCSRLGSGSLTPDGLG 335 Query: 772 -GSRLGS----------------GALTPNGKEPPSQECNILENNQNFEVV---------- 870 GSRLGS G+LTP+G PPS++ L +Q EV Sbjct: 336 RGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPPSRD-GFLLGSQISEVALLTNQANGPK 394 Query: 871 --ESENNHRVSFELRGEDI----------PISIMKETTKG-------------KDL--AT 969 E+ +HRVSFEL GED+ P + E K KDL + Sbjct: 395 NDETIVDHRVSFELSGEDVARCLESKSLLPSRTVSEYPKDLVAEGRIERDGIKKDLESSC 454 Query: 970 EVALSFQTQTSVRSDDG---------RDRTASFGSSKDFNFNNTNDEVA----------- 1089 E+ + + +V G + R+ + GS K+FNF+NT E + Sbjct: 455 ELFIRETSNETVEKASGKAEEEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWA 514 Query: 1090 ------IELGPQKNWNFFPMLQSG 1143 E P +W FFPM + G Sbjct: 515 NEKFARKEARPGNSWTFFPMFRPG 538 >ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] gi|550346901|gb|EEE82832.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] Length = 453 Score = 162 bits (409), Expect = 5e-37 Identities = 133/419 (31%), Positives = 172/419 (41%), Gaps = 97/419 (23%) Frame = +1 Query: 184 VQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETV--NGVSTSTCYAQXXXXXXXXXXX 357 VQKRRWG WS+Y CFG KH K+IGH ++ + + NG S Q Sbjct: 37 VQKRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAAP 96 Query: 358 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEL-TAQIFLIGPYADETQLVSPPVFS 534 A IF IGPYA ETQLVSPPVFS Sbjct: 97 PSSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFS 156 Query: 535 SFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSLA------------QKWR----- 663 +FTT+PS+AP TPPPE+V +TTPSSPEVPFAQ L SL Q ++ Sbjct: 157 TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFPFDFQSYQFHPGS 216 Query: 664 -------------NSGAPSPFYDKRADI------DLPMVEAPKFVGYEHFMNYKWGSRLG 786 SG SPF D + + + E PK + + +WGS G Sbjct: 217 PVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQG 276 Query: 787 SGALTPNGKEPPSQECNILENNQNFEVVESEN-----------NHRVSFELRGED----- 918 SGALTP S N L + Q +V NHRVSFEL ED Sbjct: 277 SGALTPESVRRGSP--NFLLHRQFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCV 334 Query: 919 ----------IPISIMKET-TKGKDLATEVALSFQTQTSVRSDDG--------------- 1020 +P + T K + + E SF+ + V S+D Sbjct: 335 EEKPAFSIKTVPEYVENGTQAKEEKNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHR 394 Query: 1021 RDRTASFGSSKDFNFNNTND----------------EVAIELGPQKNWNFFPMLQSGGS 1149 + ++ + GS K+FNF+N ++ + E KNW+FFPM+QSG S Sbjct: 395 KQQSITLGSVKEFNFDNADEGDSRKPSSSNWWANGSVIGKEGETTKNWSFFPMVQSGVS 453