BLASTX nr result
ID: Mentha25_contig00024423
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00024423 (1488 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...  202  4e-49
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...  196  2e-47
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...  187  1e-44
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...  176  3e-41
gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]    173  2e-40
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...  170  2e-39
ref|XP_002865912.1| hydroxyproline-rich glycoprotein family prot...  165  6e-38
ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein...  162  5e-37
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...  161  6e-37
ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun...  160  1e-36
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...  160  2e-36
ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Caps...  159  2e-36
ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr...  159  4e-36
ref|XP_002867602.1| hydroxyproline-rich glycoprotein family prot...  155  6e-35
ref|XP_006401825.1| hypothetical protein EUTSA_v10013563mg [Eutr...  154  1e-34
ref|XP_006283737.1| hypothetical protein CARUB_v10004810mg [Caps...  154  1e-34
ref|NP_194292.2| hydroxyproline-rich glycoprotein family protein...  149  2e-33
emb|CAA18164.1| putative protein [Arabidopsis thaliana] gi|72694...  145  3e-32
ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494...
145 5e-32 ref|XP_007143454.1| hypothetical protein PHAVU_007G073100g [Phas... 144 8e-32 >ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum lycopersicum] Length = 470 Score = 202 bits (513), Expect = 4e-49 Identities = 148/447 (33%), Positives = 187/447 (41%), Gaps = 134/447 (29%) Frame = -1 Query: 1209 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 1030 QP VQKRRWG WS+YWCFGS+KHSKRIGH + + + G + + + PN ++T Sbjct: 26 QPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPVAPGPAVPVT--ENPNHSATIV 83 Query: 1029 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-------IFLKGPYADETQLV 871 +PF + IF GPYA ETQLV Sbjct: 84 IPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSINAYSPGGTASIFAIGPYAHETQLV 143 Query: 870 SPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWR----------- 724 SPPVFS+FTT+PS+A FTPPPE V MTTP SPE PFAQLL+SSLA+ R Sbjct: 144 SPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLS 203 Query: 723 -------------------------NTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYK 619 N+ SP K ++ R E P+F+GYEHF K Sbjct: 204 QYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRK 263 Query: 618 WGS------------------------------NSSALTPNGKEPPSQECDILEN----- 544 WGS S +TPNG EPPS++ +LEN Sbjct: 264 WGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLENQISEV 323 Query: 543 -----------------NHRVSFELRGEDIPTSIVK-------GTTKGKDLATEVALSFR 436 +HRVSFEL ED+P+ K T D++ +A R Sbjct: 324 ASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKEPVMSHSQPTLPMDVSNLLASEMR 383 Query: 435 TQTSV-----------RSNDGRDD-----RTTSFGSSKXXXXXXXXDEV----------- 337 + +S+ S G D+ R +FGSSK EV Sbjct: 384 SGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGSSKDFDFDNVKIEVLEKDSIDCEWW 443 Query: 336 -----GIKELGPQKNWNFFPMLQSGGS 271 +KE G Q NW FFP+LQ G S Sbjct: 444 TSDKAAVKESGIQNNWTFFPVLQPGVS 470 >ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum] Length = 470 Score = 196 bits (499), Expect = 2e-47 Identities = 147/447 (32%), Positives = 185/447 (41%), Gaps = 134/447 (29%) Frame = -1 Query: 1209 
QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 1030 QP VQKRRWG WS+YWCFGS+KHSKRIGH + + + G + + + PN ++T Sbjct: 26 QPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPAAPGPAVPVT--ENPNHSATIV 83 Query: 1029 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-------IFLKGPYADETQLV 871 +PF + IF GPYA ETQLV Sbjct: 84 IPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSLSINAYSPGGTASIFAIGPYAHETQLV 143 Query: 870 SPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWR----------- 724 SPPVFS+FTT+PS+A FTPPPE V MTTP SPE PFAQLL+SSLA+ R Sbjct: 144 SPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLS 203 Query: 723 -------------------------NTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYK 619 N+ SP K ++ R E P+F+GYEHF K Sbjct: 204 QYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRK 263 Query: 618 WGSN------------------------------SSALTPNGKEPPSQECDILEN----- 544 WGS S +TPNG EPPS++ +LE Sbjct: 264 WGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLEYQISEV 323 Query: 543 -----------------NHRVSFELRGEDIPTSIVKGT-------TKGKDLATEVALSFR 436 +HRVSFEL GED+P+ K T D++ +A + Sbjct: 324 ASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREKEPVMSHSQQTLPMDVSNLLANEMK 383 Query: 435 TQTSVR-----------SNDGRDD-----RTTSFGSSKXXXXXXXXDEV----------- 337 + +S+ S G D R +FGSSK EV Sbjct: 384 SGSSMAEEKTYGSPRKASESGEDQCHRKHRNITFGSSKDFDFDNVKIEVLEKDSIDCEWW 443 Query: 336 -----GIKELGPQKNWNFFPMLQSGGS 271 KE G Q NW FFP+LQ G S Sbjct: 444 TSDKAAGKESGIQNNWTFFPVLQPGVS 470 >ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera] Length = 448 Score = 187 bits (474), Expect = 1e-44 Identities = 144/453 (31%), Positives = 190/453 (41%), Gaps = 117/453 (25%) Frame = -1 Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 1105 MRSV++S VQP VQKRRWG S+YWCFGS++HSKRIGH + + Sbjct: 1 MRSVNNSVETINAAATAIVSAESRVQPTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLV 60 Query: 1104 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 925 + + G S + N +++ LPF Sbjct: 61 PEPMVPGAVAPAS--ENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPAGFLSLTALS 118 Query: 924 
XXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAP 766 + +F GPYA ETQLVSPPVFS+F T+PS+APFTPPPESVQ+TTPSSPE P Sbjct: 119 VNAYSPSGPASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVP 178 Query: 765 FAQLLSSSLAQKWRNT--------------------EVP--------------SPLFDKR 688 FAQLL+SSL + RN+ E P SP D+R Sbjct: 179 FAQLLTSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISPISNSGTSSPFPDRR 238 Query: 687 ANVDLRLVEAPEFVGYEHFMNYKWGS--NSSALTPNGKEPPSQECDILEN---------- 544 +VEAP+ +G+EHF +WGS S +LTP+G P S++ +LEN Sbjct: 239 P-----IVEAPKLLGFEHFSTRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLAN 293 Query: 543 ------------NHRVSFELRGEDIPTSIVKGTTKG--------KDLATE---------- 454 +HRVSFEL GED+ + K +D+ E Sbjct: 294 SESGSQNGETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGI 353 Query: 453 -----------VALSFRTQTSVRSNDGRDDR------TTSFGSSKXXXXXXXXDEVGIKE 325 V + + + S +G +++ GS K EV K Sbjct: 354 SESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKP 413 Query: 324 -----------------LGPQKNWNFFPMLQSG 277 GPQ NW FFP+LQ G Sbjct: 414 NIIGSEWWVNEKVVGKGTGPQTNWTFFPLLQPG 446 >ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 485 Score = 176 bits (445), Expect = 3e-41 Identities = 146/483 (30%), Positives = 190/483 (39%), Gaps = 149/483 (30%) Frame = -1 Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 1105 MRSV+DS VQP VQK+RWG W +YWCFGS K+SKRIGH + + Sbjct: 1 MRSVNDSVETVNAAATAIVSADSRVQPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLV 60 Query: 1104 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 925 + + G S ST+ NPT LPF Sbjct: 61 PEPVVPGASVSTA-ENVSNPTGII-LPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLS 118 Query: 924 XXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAP 766 + IF GPYA ETQLV+PPVFS+ TT+PS+APFTPPPESVQ+TTPSSPE P Sbjct: 119 VNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVP 178 Query: 765 
FAQLLSSSLAQKWRNTEV-------------------------------------PSPLF 697 FAQLL+SSL + RN+ + SP Sbjct: 179 FAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFP 238 Query: 696 DKRANVDLRLVEAPEFVGYEHFMNYKWGS------------------------------- 610 D+R ++ R+ EAP+ +G+E+F KWGS Sbjct: 239 DRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLG 298 Query: 609 ---NSSALTPNGKEPPSQ----------ECDILEN------------NHRVSFELRGEDI 505 S +LTP+G P S+ E +L N +HRVSFEL GED+ Sbjct: 299 SRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDV 358 Query: 504 ------------------PTSIV-------KGTTKGKDLATEVALSFRTQTSVRSNDGRD 400 P +V G K + + E+ + + +V G Sbjct: 359 APCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEA 418 Query: 399 D--------RTTSFGSSKXXXXXXXXDE----------------VGIKELGPQKNWNFFP 292 + R+ + GS K E V KE P +W FFP Sbjct: 419 EEEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFP 478 Query: 291 MLQ 283 MLQ Sbjct: 479 MLQ 481 >gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis] Length = 521 Score = 173 bits (438), Expect = 2e-40 Identities = 106/281 (37%), Positives = 134/281 (47%), Gaps = 47/281 (16%) Frame = -1 Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 1105 MR+V++S QP AV KRRWG WS+YWCFGS+K+SKRIGH + + Sbjct: 1 MRTVNNSVETINAAATAIVSAEARAQPAAVPKRRWGSCWSLYWCFGSHKNSKRIGHAVLV 60 Query: 1104 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 925 + + G + Q P+ + LPF Sbjct: 61 PEPVLPGAAAPAPENQAPS--TAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLS 118 Query: 924 XXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAP 766 + IF GPYA ETQLVSPPVFS+FTT+PS+APFTPPPESVQ+TTPSSPE P Sbjct: 119 INAYSPGGPTSIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVP 178 Query: 765 FAQLLSSSLAQKWRNTE--------------------------------------VPSPL 700 FAQLL+SSL + RN+ SP Sbjct: 179 FAQLLTSSLDRTRRNSSGANQKFSLSHCEFQPYQLYPGSPGGNLISPGSVVSNSGTSSPF 238 Query: 699 FDKRANVDLRLVEAPEFVGYEHFMNYKWGS--NSSALTPNG 583 DK + R+ EAP +G+EHF +KWGS S +LTP+G Sbjct: 239 
PDKHPILGFRMGEAPRLLGFEHFTTWKWGSRLGSGSLTPDG 279 >ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 489 Score = 170 bits (430), Expect = 2e-39 Identities = 146/487 (29%), Positives = 190/487 (39%), Gaps = 153/487 (31%) Frame = -1 Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQ----KRRWGEWWSMYWCFGSYKHSKRIGH 1117 MRSV+DS VQP VQ K+RWG W +YWCFGS K+SKRIGH Sbjct: 1 MRSVNDSVETVNAAATAIVSADSRVQPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGH 60 Query: 1116 TLAISQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXX 937 + + + + G S ST+ NPT LPF Sbjct: 61 AVLVPEPVVPGASVSTA-ENVSNPTGII-LPFIAPPSSPASFLQSDPPSATQSPAGLLSL 118 Query: 936 XXXXXXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSS 778 + IF GPYA ETQLV+PPVFS+ TT+PS+APFTPPPESVQ+TTPSS Sbjct: 119 TSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSS 178 Query: 777 PEAPFAQLLSSSLAQKWRNTEV-------------------------------------P 709 PE PFAQLL+SSL + RN+ + Sbjct: 179 PEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTS 238 Query: 708 SPLFDKRANVDLRLVEAPEFVGYEHFMNYKWGS--------------------------- 610 SP D+R ++ R+ EAP+ +G+E+F KWGS Sbjct: 239 SPFPDRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDG 298 Query: 609 -------NSSALTPNGKEPPSQ----------ECDILEN------------NHRVSFELR 517 S +LTP+G P S+ E +L N +HRVSFEL Sbjct: 299 MGLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELS 358 Query: 516 GEDI------------------PTSIV-------KGTTKGKDLATEVALSFRTQTSVRSN 412 GED+ P +V G K + + E+ + + +V Sbjct: 359 GEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKA 418 Query: 411 DGRDD--------RTTSFGSSKXXXXXXXXDE----------------VGIKELGPQKNW 304 G + R+ + GS K E V KE P +W Sbjct: 419 SGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSW 478 Query: 303 NFFPMLQ 283 FFPMLQ Sbjct: 479 TFFPMLQ 485 >ref|XP_002865912.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. 
lyrata] gi|297311747|gb|EFH42171.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. lyrata] Length = 437 Score = 165 bits (417), Expect = 6e-38 Identities = 139/441 (31%), Positives = 186/441 (42%), Gaps = 103/441 (23%) Frame = -1 Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 1105 MR+V++S VQP +VQK RWG+ WS+Y CFG+ K++KRIG+ + + Sbjct: 1 MRNVNNSVETVNAAATAIVTAESRVQPSSVQKGRWGKCWSLYSCFGTQKNNKRIGNAVLV 60 Query: 1104 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 925 + +GV T Q ++T LPF Sbjct: 61 PEPVASGVPVVT--VQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPGGQLSLTSNT 118 Query: 924 XXPH----IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPE-SVQMTTPSSPEAPFA 760 P +F GPYA+ETQ V+PPVFS+F T+PS+AP+TPPPE SV +TTPSSPE PFA Sbjct: 119 FSPKEPQSVFTVGPYANETQPVTPPVFSAFVTEPSTAPYTPPPESSVHITTPSSPEVPFA 178 Query: 759 QLLSSSLAQKWRNTE----------------------------------------VPSPL 700 QLL+SSL RN+ SP Sbjct: 179 QLLTSSLELTRRNSSSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPY 238 Query: 699 FDKRANVDLRLVEAPEFVGYEHFMNYKWGSN----------------SSALTPNGKE--- 577 K V+ R+ E P+F+G+EHF KWGS S ALTPNG E Sbjct: 239 PGKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGLASGALTPNGLEIIS 298 Query: 576 ---PPSQE--------------------CDILENNHRVSFELRGEDIPTSIVKGTTKGKD 466 PS +++ +HRVSFEL GED+ + + D Sbjct: 299 GNLTPSNTTWPLHNQISEVASLANSDHGSEVIVADHRVSFELTGEDVARCLASKLNRSHD 358 Query: 465 -------LATEVALS--FRTQTSVRSNDGRDDR-------TTSFGSSKXXXXXXXXDEVG 334 + TE + S R RS D ++ ++S GSSK DE Sbjct: 359 RMNNNDRIETEESSSTDLRRNMEKRSADRETEQQRIQKLNSSSIGSSKEFKFDNTKDENI 418 Query: 333 IKELGPQKNWNFFPMLQSGGS 271 K G +W+FFP L+SG S Sbjct: 419 EKVAG--NSWSFFPGLRSGVS 437 >ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|10177409|dbj|BAB10540.1| unnamed protein product [Arabidopsis thaliana] gi|40823427|gb|AAR92282.1| At5g52430 [Arabidopsis thaliana] gi|56381929|gb|AAV85683.1| At5g52430 [Arabidopsis thaliana] gi|110738650|dbj|BAF01250.1| hypothetical protein [Arabidopsis thaliana] 
gi|332008830|gb|AED96213.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 438 Score = 162 bits (409), Expect = 5e-37 Identities = 133/416 (31%), Positives = 179/416 (43%), Gaps = 103/416 (24%) Frame = -1 Query: 1209 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 1030 QP + QK RWG+ WS+Y CFG+ K++KRIG+ + + + +GV T Q ++T Sbjct: 27 QPSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPEPVTSGVPVVT--VQNSATSTTVV 84 Query: 1029 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH----IFLKGPYADETQLVSPP 862 LPF P +F GPYA+ETQ V+PP Sbjct: 85 LPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTSNTFSPKEPQSVFTVGPYANETQPVTPP 144 Query: 861 VFSSFTTQPSSAPFTPPPE-SVQMTTPSSPEAPFAQLLSSSLA-----------QKW--- 727 VFS+F T+PS+AP+TPPPE SV +TTPSSPE PFAQLL+SSL QK+ Sbjct: 145 VFSAFITEPSTAPYTPPPESSVHITTPSSPEVPFAQLLTSSLELTRRDSTSGMNQKFSSS 204 Query: 726 --------------------------RNTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMN 625 N+ SP K V+ R+ E P+F+G+EHF Sbjct: 205 HYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPGKSPMVEFRIGEPPKFLGFEHFTA 264 Query: 624 YKWGSN----------------SSALTPNGKEPPS------------------------- 568 KWGS S ALTPNG E S Sbjct: 265 RKWGSRFGSGSITPVGHGSGLASGALTPNGPEIVSGNLTPNNTTWPLQNQISEVASLANS 324 Query: 567 -QECDILENNHRVSFELRGEDIPTSIVKGTTKGKD-------LATEVALS--FRTQTSVR 418 +++ +HRVSFEL GED+ + + D + TE + S R R Sbjct: 325 DHGSEVMVADHRVSFELTGEDVARCLASKLNRSHDRMNNNDRIETEESSSTDIRRNIEKR 384 Query: 417 SNDGRDDR-------TTSFGSSKXXXXXXXXDEVGIKELGPQKNWNFFPMLQSGGS 271 S D +++ ++S GSSK DE K G +W+FFP L+SG S Sbjct: 385 SGDRENEQHRIQKLSSSSIGSSKEFKFDNTKDENIEKVAG--NSWSFFPGLRSGVS 438 >ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica] gi|462404864|gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica] Length = 455 Score = 161 bits (408), Expect = 6e-37 Identities = 127/421 (30%), Positives = 172/421 (40%), Gaps = 112/421 (26%) Frame = -1 Query: 1197 VQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTTLPFX 1018 VQKRRWG WWSMYWCFG +H KRIGH + + + T G A+ P T + LPF Sbjct: 37 
VQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPETTDRGGDAPR--AENPIQTPSIVLPFV 94 Query: 1017 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH----IFLKGPYADETQLVSPPVFSS 850 P IF GPYA ETQLVSPPVFS+ Sbjct: 95 APPSSPASFLQSEPPSATQSPAGFFSLTASMYSPSGPTSIFAIGPYAHETQLVSPPVFST 154 Query: 849 FTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLL--------------------------- 751 FTT+PS+APFTPPPESV +TTPSSPE PFAQLL Sbjct: 155 FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQLYP 214 Query: 750 ----------SSSLAQKWRNTEVPSPLFDKRAN--VDLRLVEAPEFVGYEHFMNYKWGS- 610 SS ++ ++ P F R + ++ R + P+ + + WGS Sbjct: 215 GSPVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGDPPKLLNLDILSTRDWGSR 274 Query: 609 -NSSALTPNGKEPPSQECDILEN----------------------NHRVSFELRGEDI-- 505 S ++TP+G + S + +L+ NHRVSFEL E++ Sbjct: 275 LGSGSVTPDGAKSTSSDGFLLKPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEEVIR 334 Query: 504 -----PTSIVKGTT---------KGKDLATEVALSFRTQTSVRSNDGRD----------- 400 P ++ + + + K+ ++V S SND + Sbjct: 335 CVEKKPVALAEAVSTSLEDTEKAQSKEDPSKVVSSSICPVGETSNDAAEKAVADGEEAQL 394 Query: 399 ---DRTTSFGSSK---------------XXXXXXXXDEVGIKELGPQKNWNFFPMLQSGG 274 R+ + GS K ++V KE GP KNW+FFPM+Q G Sbjct: 395 HPKQRSITLGSVKEFNFDNPDGGDSGNSIGSDWWANEKVDAKENGPTKNWSFFPMMQPGV 454 Query: 273 S 271 S Sbjct: 455 S 455 >ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] gi|462415503|gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] Length = 499 Score = 160 bits (406), Expect = 1e-36 Identities = 138/502 (27%), Positives = 183/502 (36%), Gaps = 166/502 (33%) Frame = -1 Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 1105 MRSV+ S QP V KRRWG WS+YWCFG +K+ KRIGH + + Sbjct: 1 MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFGPHKN-KRIGHAVLV 59 Query: 1104 SQETINGVSTSTSYAQKPNPTSTTTL--PFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 931 + + G + S N T++T + PF Sbjct: 60 PEPVVPGAAVSAI----DNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKS 115 Query: 930 XXXXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPE 772 + IF GPYA ETQLVSPPVFS+F T+PS+APFTPPPESVQ+TTPSSPE Sbjct: 116 
LSANAYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPE 175 Query: 771 APFAQLLSSSLAQKWR-------------------------------------NTEVPSP 703 PFAQLL+SSL + R N+ SP Sbjct: 176 VPFAQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSP 235 Query: 702 LFDKRANVDLRLVEAPEFVGYEHFMNYKWGS----------------------------- 610 D+ ++ R+ EAP+ G++HF KWGS Sbjct: 236 FPDRHPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNE 295 Query: 609 ---------------------NSSALTPNGKEPPSQECDILEN----------------- 544 S LTP+G P S++ +LEN Sbjct: 296 LGSRLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQT 355 Query: 543 -----NHRVSFELRGEDIPTSIVKGTTKGKDLAT----EVALSFRTQTSVRSNDG----- 406 +HRVSFEL GED+ + A+ +A + ++ S+D Sbjct: 356 VETVFDHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCE 415 Query: 405 -----------------------RDDRTTSFGSSKXXXXXXXXDE--------------- 340 R R+ + GS+K E Sbjct: 416 FSVEESSSRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDNTKAEVPNKPNIGSEWWANK 475 Query: 339 -VGIKELGPQKNWNFFPMLQSG 277 V KE P +W FFP+LQ G Sbjct: 476 NVAAKESKPCNDWTFFPILQPG 497 >ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis] gi|223547583|gb|EEF49078.1| conserved hypothetical protein [Ricinus communis] Length = 510 Score = 160 bits (404), Expect = 2e-36 Identities = 96/255 (37%), Positives = 127/255 (49%), Gaps = 46/255 (18%) Frame = -1 Query: 1209 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 1030 QP VQKRRWG WS+YWCFGS+K +KRIGH + + + G ++ A+ + ++ T Sbjct: 40 QPTTVQKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPEVQGAVVTS--AENQSQSTAIT 96 Query: 1029 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-------IFLKGPYADETQLV 871 +PF + IF GPYA ETQLV Sbjct: 97 VPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLV 156 Query: 870 SPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWR----------- 724 +PP FS+FTT+PS+APFTPPPESVQ+TTPSSPE PFAQLL+SSL + R Sbjct: 157 TPPAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALS 216 Query: 723 --------------------------NTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNY 622 N+ SP D+ ++ R+ EAP+ +G+EHF 
Sbjct: 217 HYEFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPILEFRMGEAPKLLGFEHFTTR 276 Query: 621 KWGS--NSSALTPNG 583 KWGS S +TP+G Sbjct: 277 KWGSRLGSGTVTPDG 291 >ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Capsella rubella] gi|482549191|gb|EOA13385.1| hypothetical protein CARUB_v10026425mg [Capsella rubella] Length = 437 Score = 159 bits (403), Expect = 2e-36 Identities = 140/443 (31%), Positives = 185/443 (41%), Gaps = 105/443 (23%) Frame = -1 Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 1105 MR+V++S VQP +VQKRRW + WS+Y CFGS K++KRIG+ + + Sbjct: 1 MRNVNNSVETVNAAATAIITAESRVQPSSVQKRRWAKCWSLYSCFGSQKNNKRIGNAVLV 60 Query: 1104 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 925 + +GV T Q ++T LPF Sbjct: 61 PEPVASGVPVVT--VQNSATSTTVVLPFIAPPSSPASFLPSDPSSVSHSPVGPLSLTSNT 118 Query: 924 XXPH----IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQ 757 P +F GPYA+ETQ V+PPVFS+F T+PS+AP+TPPPES TPSSPE PFAQ Sbjct: 119 FSPKEPQSVFTVGPYANETQPVTPPVFSAFITEPSTAPYTPPPES--SVTPSSPEVPFAQ 176 Query: 756 LLSSSLA----------QKW-----------------------------RNTEVPSPLFD 694 LL+SSL QK+ N+ SP Sbjct: 177 LLTSSLELTRRDSSGINQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPG 236 Query: 693 KRANVDLRLVEAPEFVGYEHFMNYKWGSN----------------SSALTPNGKE----- 577 K V+ R+ E P+F+G+EHF KWGS S ALTPN E Sbjct: 237 KSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGMASGALTPNAPEIISGN 296 Query: 576 -PPSQECDILEN--------------------NHRVSFELRGEDIPTSIVKGTTKGKD-- 466 PS L+N +HRVSFEL GED+ + + D Sbjct: 297 LTPSNTTWPLQNQISEVASLANSDHGSEVIVADHRVSFELTGEDVARCLASKLNRSHDRM 356 Query: 465 -----LATEVAL--------SFRTQTSVRSNDGRDDR-----TTSFGSSKXXXXXXXXDE 340 +ATE + SF+ S + + R ++S GSSK DE Sbjct: 357 NNNDRIATEESSSTDRGRRNSFQKIESTENRETEQQRIQKLSSSSIGSSKEFKFDNTKDE 416 Query: 339 VGIKELGPQKNWNFFPMLQSGGS 271 K G +W+FFP L+SG S Sbjct: 417 NIEKVAG--NSWSFFPGLRSGVS 437 >ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|222858882|gb|EEE96429.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] 
Length = 507 Score = 159 bits (401), Expect = 4e-36 Identities = 100/255 (39%), Positives = 128/255 (50%), Gaps = 50/255 (19%) Frame = -1 Query: 1200 AVQKRRWGEWWSMYWCFGSY---KHSKRIGHTLAISQETING-VSTSTSYAQKPNPTSTT 1033 +VQKRRWG WS+YWCFGS+ K+SKRIGH + + + + G VS+ST + P Sbjct: 32 SVQKRRWGGCWSLYWCFGSHGSHKNSKRIGHAVLVPEPEVPGAVSSSTENQTQSTPI--- 88 Query: 1032 TLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-------IFLKGPYADETQL 874 LPF + IF GPYA ETQL Sbjct: 89 LLPFIAPPSSPASFLQSDPPSSTQSPAGLLSLTSLSANAYSPRGPASIFAIGPYAHETQL 148 Query: 873 VSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWR---------- 724 V+PPVFS+FTT+PS+APFTPPPESVQ+TTPSSPE PFAQLL+SSL + R Sbjct: 149 VTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGPNQKFSL 208 Query: 723 ---------------------------NTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMN 625 N+ SP D+ ++ R+ EAP+ +G+EHF Sbjct: 209 SHYEFQSYHLYPGSPGGQIISPGSAISNSGTSSPFPDRHPMLEFRMGEAPKLLGFEHFST 268 Query: 624 YKWGS--NSSALTPN 586 KWGS S +LTP+ Sbjct: 269 RKWGSRLGSGSLTPD 283 >ref|XP_002867602.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. lyrata] gi|297313438|gb|EFH43861.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. 
lyrata] Length = 421 Score = 155 bits (391), Expect = 6e-35 Identities = 125/407 (30%), Positives = 168/407 (41%), Gaps = 96/407 (23%) Frame = -1 Query: 1209 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 1030 QP +V K+ WG WWS+Y CFGS K++KRIGH + + + +G + + N TS Sbjct: 22 QPSSVHKK-WGSWWSLYLCFGSKKNNKRIGHAVLVPEPAASGAAVAPVQNSSSNSTSMFM 80 Query: 1029 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPHIFLKGPYADETQLVSPPVFSS 850 P F GPYA ETQ V+PPVFS+ Sbjct: 81 PFIAPPSSPASFLPSGPPSVSHTPDPGLLCSLTVNEPPSAFTIGPYAHETQPVTPPVFSA 140 Query: 849 FTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRN----------------- 721 FTT+PS+APFTPPPES PSSPE PFAQLL+SSL + RN Sbjct: 141 FTTEPSTAPFTPPPES-----PSSPEVPFAQLLTSSLEKARRNIGGGMHHKFSAAHYEFK 195 Query: 720 -----------------TEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYKWGS------ 610 + SP K + ++ R+ E P+F+G+EHF KWGS Sbjct: 196 SHQVYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGS 255 Query: 609 ----------NSSALTPNGKEP------PSQECDI--LENN---------------HRVS 529 S ALTP+G P SQ ++ L N+ HRVS Sbjct: 256 ITPAGQGSRLGSGALTPDGLTPLEGSLLDSQITEVASLANSDHGSSRHNDEAAVVPHRVS 315 Query: 528 FELRGEDIPTSIVKGTTK--------GKDLATEVALSFRTQTSVRSNDGRDDRTTSFGSS 373 FEL GED+ + + G+ L +T S + R+ S GSS Sbjct: 316 FELTGEDVARCLASKLNRSGSHEKASGEHLRPN---GCKTSGETESEQSQKLRSFSTGSS 372 Query: 372 KXXXXXXXXDEV---------------GIKELGPQKNWNFFPMLQSG 277 K +E+ G + P+ +W FFP+L+SG Sbjct: 373 KEFKFDNTNEEMIEKVRSEWWANEKVAGKGDHSPRNSWTFFPVLRSG 419 >ref|XP_006401825.1| hypothetical protein EUTSA_v10013563mg [Eutrema salsugineum] gi|557102915|gb|ESQ43278.1| hypothetical protein EUTSA_v10013563mg [Eutrema salsugineum] Length = 440 Score = 154 bits (388), Expect = 1e-34 Identities = 132/442 (29%), Positives = 172/442 (38%), Gaps = 104/442 (23%) Frame = -1 Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 1105 MR+V++S VQP +V KRRW WS+ CFGS K++KRIG+ + + Sbjct: 1 MRNVNNSVETVNAAATAIVTAESRVQPSSVPKRRWRNCWSLNSCFGSQKNNKRIGNAMLV 60 Query: 1104 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 925 E + Q 
+S+ LPF Sbjct: 61 VPEPVATGGAPVVTVQNSATSSSIVLPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTSNT 120 Query: 924 XXP----HIFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPE-SVQMTTPSSPEAPFA 760 +F GPYA+ETQ V+PPVFS+F T+PS+APFTPPPE SV +TTPSSPE PFA Sbjct: 121 FSTTEPQSVFTIGPYANETQPVTPPVFSAFITEPSTAPFTPPPESSVHITTPSSPEVPFA 180 Query: 759 QLLSSSLAQKWRNTE---------------------------------------VPSPLF 697 QLL+SSL RN+ SP Sbjct: 181 QLLTSSLELTRRNSSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYP 240 Query: 696 DKRANVDLRLVEAPEFVGYEHFMNYKWGS------------NSSALTPNGKEPPSQECDI 553 K V+ R+ E P+F+G+EHF KWGS S ALTPNG S+ Sbjct: 241 GKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGALTPNGPGMVSESLTP 300 Query: 552 LENN-----------------------------HRVSFELRGEDIPTSIVKGTTKGKDLA 460 NN HRVSFEL GED+ + + D Sbjct: 301 NNNNNTTWPLTSQVSEVASLANSDHGSEVVAADHRVSFELTGEDVARCLASKLNRSHDRM 360 Query: 459 T---EVALSFRTQTSVRSNDGRDDR----------------TTSFGSSKXXXXXXXXDEV 337 V R S + + +R ++S GSSK +E Sbjct: 361 NNDERVETDERRSISFQKRENNVERVSGDREIEQQRIHKLSSSSIGSSKEFKFDNTKEEN 420 Query: 336 GIKELGPQKNWNFFPMLQSGGS 271 K G +W+FFP L+SG S Sbjct: 421 IEKVAG--NSWSFFPGLRSGVS 440 >ref|XP_006283737.1| hypothetical protein CARUB_v10004810mg [Capsella rubella] gi|482552442|gb|EOA16635.1| hypothetical protein CARUB_v10004810mg [Capsella rubella] Length = 444 Score = 154 bits (388), Expect = 1e-34 Identities = 127/450 (28%), Positives = 176/450 (39%), Gaps = 113/450 (25%) Frame = -1 Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPA---VQKRRWGEWWSMYWCFGSYKHSKRIGHT 1114 MRSV++S +Q P+ + K++WG WWS+YWCFGS K++KRIGH Sbjct: 1 MRSVNNSVDTVTAAASAIVSADSRLQQPSSSLLHKKKWGSWWSLYWCFGSKKNNKRIGHA 60 Query: 1113 LAISQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 934 + + +GV+ + + +++ +PF Sbjct: 61 VLAPEPAASGVAVAPVQNSSSSNSTSIFMPFIAPPSSPASFLPSGPPSVSHTPDPCRLRC 120 Query: 933 XXXXXP--HIFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFA 760 F GPYA ETQ V+PPVFS+FTT+PS+APFTPPPES PSSPE PFA Sbjct: 121 SLLVNEPPSAFAIGPYAHETQPVTPPVFSAFTTEPSTAPFTPPPES-----PSSPEVPFA 175 Query: 759 
QLLSSSLAQKWRNTE----------------------------------VPSPLFDKRAN 682 QLL+SSL + RN+ SP K + Sbjct: 176 QLLTSSLERARRNSSGGMNHKFSAAHYEFKSHQVYPGSPGGNLISPGSGTSSPYPGKCSI 235 Query: 681 VDLRLVEAPEFVGYEHFMNYKWGS----------------NSSALTPNG----------- 583 ++ R+ E P+F+G+EHF KWGS S ALTP+G Sbjct: 236 IEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAGQGSRLGSGALTPDGGGGMGSKIASG 295 Query: 582 ---------KEPPSQECDILENN---------------HRVSFELRGEDIPTSIVKGTTK 475 + E L N+ HRVSFEL GED+ + + Sbjct: 296 ALTPLEDSLLDSQVSEVASLANSDHGSSRHNDEAVVVAHRVSFELTGEDVARCLASKLNR 355 Query: 474 --------GKDLATEVALSFRTQTSVRSNDGRDDRTTSFGSSKXXXXXXXXDE------- 340 G+ L +T S + R+ S GSSK +E Sbjct: 356 SGSHERASGEHLRPN---GCKTSGETESEQSQKLRSFSLGSSKEFKFDNTEEETIEKVRS 412 Query: 339 --------VGIKELGPQKNWNFFPMLQSGG 274 G + P +W FFP+L+S G Sbjct: 413 EWWANEKVAGKGDHSPANSWTFFPVLRSSG 442 >ref|NP_194292.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|26449762|dbj|BAC42004.1| unknown protein [Arabidopsis thaliana] gi|28951011|gb|AAO63429.1| At4g25620 [Arabidopsis thaliana] gi|332659684|gb|AEE85084.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 449 Score = 149 bits (377), Expect = 2e-33 Identities = 125/430 (29%), Positives = 168/430 (39%), Gaps = 119/430 (27%) Frame = -1 Query: 1209 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 1030 QP +VQK+R G WWS+YWCFGS K++KRIGH + + + +G + + N TS Sbjct: 27 QPSSVQKKR-GSWWSLYWCFGSKKNNKRIGHAVLVPEPAASGAAVAPVQNSSSNSTSIFM 85 Query: 1029 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPHIFLKGPYADETQLVSPPVFSS 850 P F GPYA ETQ V+PPVFS+ Sbjct: 86 PFIAPPSSPASFLPSGPPSASHTPDPGLLCSLTVNEPPSAFTIGPYAHETQPVTPPVFSA 145 Query: 849 FTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRN----------------- 721 FTT+PS+APFTPPPES PSSPE PFAQLL+SSL + RN Sbjct: 146 FTTEPSTAPFTPPPES-----PSSPEVPFAQLLTSSLERARRNSGGGMNQKFSAAHYEFK 200 Query: 720 -----------------TEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYKWGS------ 610 + SP K + ++ R+ E P+F+G+EHF KWGS Sbjct: 201 
SCQVYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGS 260 Query: 609 ----------NSSALTPNGKEPPS-------------------------------QECDI 553 S ALTP+G + S E Sbjct: 261 ITPAGQGSRLGSGALTPDGSKLTSGVVTPNGAETVIRMSYGNLTPLEGSLLDSQISEVAS 320 Query: 552 LENN---------------HRVSFELRGEDIPTSIVKGTTK--------GKDLATEVALS 442 L N+ HRVSFEL GED+ + + G+ L Sbjct: 321 LANSDHGSSRHNDEALVVPHRVSFELTGEDVARCLASKLNRSGSHEKASGEHLRPNCC-- 378 Query: 441 FRTQTSVRSNDGRDDRTTSFGSSKXXXXXXXXDEV---------------GIKELGPQKN 307 +T S + R+ S GS+K +E+ G + P+ + Sbjct: 379 -KTSGETESEQSQKLRSFSTGSNKEFKFDSTNEEMIEKIRSEWWANEKVAGKGDHSPRNS 437 Query: 306 WNFFPMLQSG 277 W FFP+L+SG Sbjct: 438 WTFFPVLRSG 447 >emb|CAA18164.1| putative protein [Arabidopsis thaliana] gi|7269412|emb|CAB81372.1| putative protein [Arabidopsis thaliana] Length = 424 Score = 145 bits (367), Expect = 3e-32 Identities = 121/425 (28%), Positives = 164/425 (38%), Gaps = 119/425 (28%) Frame = -1 Query: 1194 QKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTTLPFXX 1015 QK++ G WWS+YWCFGS K++KRIGH + + + +G + + N TS Sbjct: 6 QKKKRGSWWSLYWCFGSKKNNKRIGHAVLVPEPAASGAAVAPVQNSSSNSTSIFMPFIAP 65 Query: 1014 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPHIFLKGPYADETQLVSPPVFSSFTTQP 835 P F GPYA ETQ V+PPVFS+FTT+P Sbjct: 66 PSSPASFLPSGPPSASHTPDPGLLCSLTVNEPPSAFTIGPYAHETQPVTPPVFSAFTTEP 125 Query: 834 SSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRN---------------------- 721 S+APFTPPPES PSSPE PFAQLL+SSL + RN Sbjct: 126 STAPFTPPPES-----PSSPEVPFAQLLTSSLERARRNSGGGMNQKFSAAHYEFKSCQVY 180 Query: 720 ------------TEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYKWGS----------- 610 + SP K + ++ R+ E P+F+G+EHF KWGS Sbjct: 181 PGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAG 240 Query: 609 -----NSSALTPNGKEPPS-------------------------------QECDILENN- 541 S ALTP+G + S E L N+ Sbjct: 241 QGSRLGSGALTPDGSKLTSGVVTPNGAETVIRMSYGNLTPLEGSLLDSQISEVASLANSD 300 Query: 540 --------------HRVSFELRGEDIPTSIVKGTTK--------GKDLATEVALSFRTQT 427 HRVSFEL GED+ + + G+ L +T Sbjct: 301 
HGSSRHNDEALVVPHRVSFELTGEDVARCLASKLNRSGSHEKASGEHLRPNCC---KTSG 357 Query: 426 SVRSNDGRDDRTTSFGSSKXXXXXXXXDEV---------------GIKELGPQKNWNFFP 292 S + R+ S GS+K +E+ G + P+ +W FFP Sbjct: 358 ETESEQSQKLRSFSTGSNKEFKFDSTNEEMIEKIRSEWWANEKVAGKGDHSPRNSWTFFP 417 Query: 291 MLQSG 277 +L+SG Sbjct: 418 VLRSG 422 >ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494240 [Cicer arietinum] Length = 492 Score = 145 bits (366), Expect = 5e-32 Identities = 92/253 (36%), Positives = 122/253 (48%), Gaps = 44/253 (17%) Frame = -1 Query: 1209 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 1030 QP K+RWG +S+ CFGS+K SKRIGH + + + V + S PNP++ Sbjct: 27 QPSTSPKKRWGSCFSLSSCFGSHKSSKRIGHAVLVPEPVAPIVPVAHS---APNPSTVIV 83 Query: 1029 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH------IFLKGPYADETQLVS 868 +PF IF GPYA ETQLVS Sbjct: 84 MPFIAPPSSPASFLQSDPPSSTHSPAAGLLSPSVNAAYSSSGSASIFTIGPYAYETQLVS 143 Query: 867 PPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRN----------- 721 PPVFS+FTT+PS+A FTPPPESVQMTTPSSPE PFAQLL+SSL + +N Sbjct: 144 PPVFSNFTTEPSTASFTPPPESVQMTTPSSPEVPFAQLLASSLDRARKNNGSHKFALYNY 203 Query: 720 -------------------------TEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYKW 616 + +P D+R++++L E P+ +G+EHF +W Sbjct: 204 EFQPYQQYPGSPGAQLVSPGSVISTSGTSTPFPDRRSSLELSRGETPKILGFEHFSTRRW 263 Query: 615 GS--NSSALTPNG 583 S S +LTP+G Sbjct: 264 NSRIGSGSLTPDG 276 >ref|XP_007143454.1| hypothetical protein PHAVU_007G073100g [Phaseolus vulgaris] gi|561016644|gb|ESW15448.1| hypothetical protein PHAVU_007G073100g [Phaseolus vulgaris] Length = 479 Score = 144 bits (364), Expect = 8e-32 Identities = 106/333 (31%), Positives = 139/333 (41%), Gaps = 98/333 (29%) Frame = -1 Query: 1209 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQ--ETINGVSTSTSYAQKPNPTST 1036 QP K+RWG WS+YWCFG +K+SKRIG+ + + + E + + + A PNP++ Sbjct: 23 QPATSPKKRWGSCWSLYWCFGPHKNSKRIGNAVLVPEPVEPAGQIGSHLATAA-PNPSTA 81 Query: 1035 TTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-----IFLKGPYADETQLV 871 +PF IF GPY ETQLV Sbjct: 82 
VAMPFIVPPSSPASFLESDSSSATQSPVGLFSLSSLNANASCGPASIFAIGPYTYETQLV 141 Query: 870 SPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSS----------------- 742 SPPVFS+FTT+PS+APFTPPPESVQ+TTPSSPE PFAQLL+SS Sbjct: 142 SPPVFSNFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRDCKDKGTNQRFALS 201 Query: 741 -----LAQKWRNTEVP---------------SPLFDKRANVDLRLVEAPEFVGYEHFMNY 622 L Q++ + P +P D ++ EA +G+EHF + Sbjct: 202 NYEFQLYQQYPGSPGPQLISPASIISTSGSSTPFPDTHPLLEFHKGEASNLLGFEHFSTH 261 Query: 621 KWG--------------------------------SNSSALTPNGKEPPSQ--------- 565 KW S+S LTP G P ++ Sbjct: 262 KWNSRLGSGSLTPDSTGQGSGLGSGSLTPNAVKLVSSSGCLTPEGVAPTARNGIYVGKQT 321 Query: 564 -ECDILEN------------NHRVSFELRGEDI 505 E L N +HRVSFEL GED+ Sbjct: 322 SELTPLANSENECQPNAALVDHRVSFELTGEDV 354