BLASTX nr result
ID: Mentha22_contig00026187
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00026187
         (1095 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   197   8e-48
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   183   1e-43
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   174   6e-41
gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]     173   1e-40
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   168   3e-39
ref|XP_002865912.1| hydroxyproline-rich glycoprotein family prot...   162   2e-37
ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun...   160   1e-36
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   160   1e-36
ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein...   159   1e-36
ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr...   159   3e-36
ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Caps...   157   7e-36
ref|XP_006283737.1| hypothetical protein CARUB_v10004810mg [Caps...   152   2e-34
ref|XP_002867602.1| hydroxyproline-rich glycoprotein family prot...   152   2e-34
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   152   3e-34
ref|XP_006401825.1| hypothetical protein EUTSA_v10013563mg [Eutr...   151   4e-34
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   147   8e-33
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   146   2e-32
ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494...   145   3e-32
ref|NP_194292.2| hydroxyproline-rich glycoprotein family protein...
145 3e-32 ref|XP_007143454.1| hypothetical protein PHAVU_007G073100g [Phas... 144 5e-32 >ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum lycopersicum] Length = 470 Score = 197 bits (500), Expect = 8e-48 Identities = 138/404 (34%), Positives = 176/404 (43%), Gaps = 118/404 (29%) Frame = -2 Query: 863 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 684 QP VQKRRWG WS+YWCFGS+KHSKRIGH + + + G + + + PN ++T Sbjct: 26 QPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPVAPGPAVPVT--ENPNHSATIV 83 Query: 683 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-------IFLKGPYADETQLV 525 +PF + IF GPYA ETQLV Sbjct: 84 IPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSINAYSPGGTASIFAIGPYAHETQLV 143 Query: 524 SPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWR----------- 378 SPPVFS+FTT+PS+A FTPPPE V MTTP SPE PFAQLL+SSLA+ R Sbjct: 144 SPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLS 203 Query: 377 -------------------------NTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYK 273 N+ SP K ++ R E P+F+GYEHF K Sbjct: 204 QYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRK 263 Query: 272 WGS------------------------------NSSALTPNGKEPPSQECDILEN----- 198 WGS S +TPNG EPPS++ +LEN Sbjct: 264 WGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLENQISEV 323 Query: 197 -----------------NHRVSFELRGEDIPTSIVK-------GTTKGKDLATEVALSFR 90 +HRVSFEL ED+P+ K T D++ +A R Sbjct: 324 ASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKEPVMSHSQPTLPMDVSNLLASEMR 383 Query: 89 TQTSV-----------RSNDGRDD-----RTTSFGSSKDFDFNN 6 + +S+ S G D+ R +FGSSKDFDF+N Sbjct: 384 SGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGSSKDFDFDN 427 >ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera] Length = 448 Score = 183 bits (465), Expect = 1e-43 Identities = 122/331 (36%), Positives = 158/331 (47%), Gaps = 65/331 (19%) Frame = -2 Query: 938 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 759 MRSV++S VQP VQKRRWG S+YWCFGS++HSKRIGH + + Sbjct: 1 MRSVNNSVETINAAATAIVSAESRVQPTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLV 60 Query: 758 
SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 579 + + G S + N +++ LPF Sbjct: 61 PEPMVPGAVAPAS--ENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPAGFLSLTALS 118 Query: 578 XXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAP 420 + +F GPYA ETQLVSPPVFS+F T+PS+APFTPPPESVQ+TTPSSPE P Sbjct: 119 VNAYSPSGPASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVP 178 Query: 419 FAQLLSSSLAQKWRNT--------------------EVP--------------SPLFDKR 342 FAQLL+SSL + RN+ E P SP D+R Sbjct: 179 FAQLLTSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISPISNSGTSSPFPDRR 238 Query: 341 ANVDLRLVEAPEFVGYEHFMNYKWGS--NSSALTPNGKEPPSQECDILEN---------- 198 +VEAP+ +G+EHF +WGS S +LTP+G P S++ +LEN Sbjct: 239 P-----IVEAPKLLGFEHFSTRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLAN 293 Query: 197 ------------NHRVSFELRGEDIPTSIVK 141 +HRVSFEL GED+ + K Sbjct: 294 SESGSQNGETVIDHRVSFELAGEDVAVCVEK 324 >ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 485 Score = 174 bits (441), Expect = 6e-41 Identities = 137/444 (30%), Positives = 183/444 (41%), Gaps = 133/444 (29%) Frame = -2 Query: 938 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 759 MRSV+DS VQP VQK+RWG W +YWCFGS K+SKRIGH + + Sbjct: 1 MRSVNDSVETVNAAATAIVSADSRVQPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLV 60 Query: 758 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 579 + + G S ST+ NPT LPF Sbjct: 61 PEPVVPGASVSTA-ENVSNPTGII-LPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLS 118 Query: 578 XXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAP 420 + IF GPYA ETQLV+PPVFS+ TT+PS+APFTPPPESVQ+TTPSSPE P Sbjct: 119 VNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVP 178 Query: 419 FAQLLSSSLAQKWRNTEV-------------------------------------PSPLF 351 FAQLL+SSL + RN+ + SP Sbjct: 179 FAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFP 238 Query: 350 DKRANVDLRLVEAPEFVGYEHFMNYKWGS------------------------------- 264 
D+R ++ R+ EAP+ +G+E+F KWGS Sbjct: 239 DRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLG 298 Query: 263 ---NSSALTPNGKEPPSQ----------ECDILEN------------NHRVSFELRGEDI 159 S +LTP+G P S+ E +L N +HRVSFEL GED+ Sbjct: 299 SRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDV 358 Query: 158 ------------------PTSIV-------KGTTKGKDLATEVALSFRTQTSVRSNDGRD 54 P +V G K + + E+ + + +V G Sbjct: 359 APCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEA 418 Query: 53 D--------RTTSFGSSKDFDFNN 6 + R+ + GS K+F+F+N Sbjct: 419 EEEHSYQKHRSVTLGSIKEFNFDN 442 >gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis] Length = 521 Score = 173 bits (438), Expect = 1e-40 Identities = 106/281 (37%), Positives = 134/281 (47%), Gaps = 47/281 (16%) Frame = -2 Query: 938 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 759 MR+V++S QP AV KRRWG WS+YWCFGS+K+SKRIGH + + Sbjct: 1 MRTVNNSVETINAAATAIVSAEARAQPAAVPKRRWGSCWSLYWCFGSHKNSKRIGHAVLV 60 Query: 758 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 579 + + G + Q P+ + LPF Sbjct: 61 PEPVLPGAAAPAPENQAPS--TAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLS 118 Query: 578 XXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAP 420 + IF GPYA ETQLVSPPVFS+FTT+PS+APFTPPPESVQ+TTPSSPE P Sbjct: 119 INAYSPGGPTSIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVP 178 Query: 419 FAQLLSSSLAQKWRNTE--------------------------------------VPSPL 354 FAQLL+SSL + RN+ SP Sbjct: 179 FAQLLTSSLDRTRRNSSGANQKFSLSHCEFQPYQLYPGSPGGNLISPGSVVSNSGTSSPF 238 Query: 353 FDKRANVDLRLVEAPEFVGYEHFMNYKWGS--NSSALTPNG 237 DK + R+ EAP +G+EHF +KWGS S +LTP+G Sbjct: 239 PDKHPILGFRMGEAPRLLGFEHFTTWKWGSRLGSGSLTPDG 279 >ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 489 Score = 168 bits (426), Expect = 3e-39 Identities = 137/448 (30%), Positives = 183/448 (40%), Gaps = 137/448 (30%) Frame = -2 Query: 
938 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQ----KRRWGEWWSMYWCFGSYKHSKRIGH 771 MRSV+DS VQP VQ K+RWG W +YWCFGS K+SKRIGH Sbjct: 1 MRSVNDSVETVNAAATAIVSADSRVQPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGH 60 Query: 770 TLAISQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXX 591 + + + + G S ST+ NPT LPF Sbjct: 61 AVLVPEPVVPGASVSTA-ENVSNPTGII-LPFIAPPSSPASFLQSDPPSATQSPAGLLSL 118 Query: 590 XXXXXXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSS 432 + IF GPYA ETQLV+PPVFS+ TT+PS+APFTPPPESVQ+TTPSS Sbjct: 119 TSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSS 178 Query: 431 PEAPFAQLLSSSLAQKWRNTEV-------------------------------------P 363 PE PFAQLL+SSL + RN+ + Sbjct: 179 PEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTS 238 Query: 362 SPLFDKRANVDLRLVEAPEFVGYEHFMNYKWGS--------------------------- 264 SP D+R ++ R+ EAP+ +G+E+F KWGS Sbjct: 239 SPFPDRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDG 298 Query: 263 -------NSSALTPNGKEPPSQ----------ECDILEN------------NHRVSFELR 171 S +LTP+G P S+ E +L N +HRVSFEL Sbjct: 299 MGLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELS 358 Query: 170 GEDI------------------PTSIV-------KGTTKGKDLATEVALSFRTQTSVRSN 66 GED+ P +V G K + + E+ + + +V Sbjct: 359 GEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKA 418 Query: 65 DGRDD--------RTTSFGSSKDFDFNN 6 G + R+ + GS K+F+F+N Sbjct: 419 SGEAEEEHSYQKHRSVTLGSIKEFNFDN 446 >ref|XP_002865912.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. lyrata] gi|297311747|gb|EFH42171.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. 
lyrata] Length = 437 Score = 162 bits (411), Expect = 2e-37 Identities = 130/414 (31%), Positives = 176/414 (42%), Gaps = 103/414 (24%) Frame = -2 Query: 938 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 759 MR+V++S VQP +VQK RWG+ WS+Y CFG+ K++KRIG+ + + Sbjct: 1 MRNVNNSVETVNAAATAIVTAESRVQPSSVQKGRWGKCWSLYSCFGTQKNNKRIGNAVLV 60 Query: 758 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 579 + +GV T Q ++T LPF Sbjct: 61 PEPVASGVPVVT--VQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPGGQLSLTSNT 118 Query: 578 XXPH----IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPE-SVQMTTPSSPEAPFA 414 P +F GPYA+ETQ V+PPVFS+F T+PS+AP+TPPPE SV +TTPSSPE PFA Sbjct: 119 FSPKEPQSVFTVGPYANETQPVTPPVFSAFVTEPSTAPYTPPPESSVHITTPSSPEVPFA 178 Query: 413 QLLSSSLAQKWRNTE----------------------------------------VPSPL 354 QLL+SSL RN+ SP Sbjct: 179 QLLTSSLELTRRNSSSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPY 238 Query: 353 FDKRANVDLRLVEAPEFVGYEHFMNYKWGSN----------------SSALTPNGKE--- 231 K V+ R+ E P+F+G+EHF KWGS S ALTPNG E Sbjct: 239 PGKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGLASGALTPNGLEIIS 298 Query: 230 ---PPSQE--------------------CDILENNHRVSFELRGEDIPTSIVKGTTKGKD 120 PS +++ +HRVSFEL GED+ + + D Sbjct: 299 GNLTPSNTTWPLHNQISEVASLANSDHGSEVIVADHRVSFELTGEDVARCLASKLNRSHD 358 Query: 119 -------LATEVALS--FRTQTSVRSNDGRDDR-------TTSFGSSKDFDFNN 6 + TE + S R RS D ++ ++S GSSK+F F+N Sbjct: 359 RMNNNDRIETEESSSTDLRRNMEKRSADRETEQQRIQKLNSSSIGSSKEFKFDN 412 >ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] gi|462415503|gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] Length = 499 Score = 160 bits (404), Expect = 1e-36 Identities = 130/461 (28%), Positives = 175/461 (37%), Gaps = 150/461 (32%) Frame = -2 Query: 938 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 759 MRSV+ S QP V KRRWG WS+YWCFG +K+ KRIGH + + Sbjct: 1 MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFGPHKN-KRIGHAVLV 59 Query: 758 SQETINGVSTSTSYAQKPNPTSTTTL--PFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 585 + + G + S 
N T++T + PF Sbjct: 60 PEPVVPGAAVSAI----DNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKS 115 Query: 584 XXXXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPE 426 + IF GPYA ETQLVSPPVFS+F T+PS+APFTPPPESVQ+TTPSSPE Sbjct: 116 LSANAYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPE 175 Query: 425 APFAQLLSSSLAQKWR-------------------------------------NTEVPSP 357 PFAQLL+SSL + R N+ SP Sbjct: 176 VPFAQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSP 235 Query: 356 LFDKRANVDLRLVEAPEFVGYEHFMNYKWGS----------------------------- 264 D+ ++ R+ EAP+ G++HF KWGS Sbjct: 236 FPDRHPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNE 295 Query: 263 ---------------------NSSALTPNGKEPPSQECDILEN----------------- 198 S LTP+G P S++ +LEN Sbjct: 296 LGSRLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQT 355 Query: 197 -----NHRVSFELRGEDIPTSIVKGTTKGKDLAT----EVALSFRTQTSVRSNDG----- 60 +HRVSFEL GED+ + A+ +A + ++ S+D Sbjct: 356 VETVFDHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCE 415 Query: 59 -----------------------RDDRTTSFGSSKDFDFNN 6 R R+ + GS+KDF+F+N Sbjct: 416 FSVEESSSRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDN 456 >ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis] gi|223547583|gb|EEF49078.1| conserved hypothetical protein [Ricinus communis] Length = 510 Score = 160 bits (404), Expect = 1e-36 Identities = 96/255 (37%), Positives = 127/255 (49%), Gaps = 46/255 (18%) Frame = -2 Query: 863 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 684 QP VQKRRWG WS+YWCFGS+K +KRIGH + + + G ++ A+ + ++ T Sbjct: 40 QPTTVQKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPEVQGAVVTS--AENQSQSTAIT 96 Query: 683 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-------IFLKGPYADETQLV 525 +PF + IF GPYA ETQLV Sbjct: 97 VPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLV 156 Query: 524 SPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWR----------- 378 +PP FS+FTT+PS+APFTPPPESVQ+TTPSSPE PFAQLL+SSL + R Sbjct: 157 TPPAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALS 216 Query: 377 
--------------------------NTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNY 276 N+ SP D+ ++ R+ EAP+ +G+EHF Sbjct: 217 HYEFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPILEFRMGEAPKLLGFEHFTTR 276 Query: 275 KWGS--NSSALTPNG 237 KWGS S +TP+G Sbjct: 277 KWGSRLGSGTVTPDG 291 >ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|10177409|dbj|BAB10540.1| unnamed protein product [Arabidopsis thaliana] gi|40823427|gb|AAR92282.1| At5g52430 [Arabidopsis thaliana] gi|56381929|gb|AAV85683.1| At5g52430 [Arabidopsis thaliana] gi|110738650|dbj|BAF01250.1| hypothetical protein [Arabidopsis thaliana] gi|332008830|gb|AED96213.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 438 Score = 159 bits (403), Expect = 1e-36 Identities = 124/389 (31%), Positives = 169/389 (43%), Gaps = 103/389 (26%) Frame = -2 Query: 863 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 684 QP + QK RWG+ WS+Y CFG+ K++KRIG+ + + + +GV T Q ++T Sbjct: 27 QPSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPEPVTSGVPVVT--VQNSATSTTVV 84 Query: 683 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH----IFLKGPYADETQLVSPP 516 LPF P +F GPYA+ETQ V+PP Sbjct: 85 LPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTSNTFSPKEPQSVFTVGPYANETQPVTPP 144 Query: 515 VFSSFTTQPSSAPFTPPPE-SVQMTTPSSPEAPFAQLLSSSLA-----------QKW--- 381 VFS+F T+PS+AP+TPPPE SV +TTPSSPE PFAQLL+SSL QK+ Sbjct: 145 VFSAFITEPSTAPYTPPPESSVHITTPSSPEVPFAQLLTSSLELTRRDSTSGMNQKFSSS 204 Query: 380 --------------------------RNTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMN 279 N+ SP K V+ R+ E P+F+G+EHF Sbjct: 205 HYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPGKSPMVEFRIGEPPKFLGFEHFTA 264 Query: 278 YKWGSN----------------SSALTPNGKEPPS------------------------- 222 KWGS S ALTPNG E S Sbjct: 265 RKWGSRFGSGSITPVGHGSGLASGALTPNGPEIVSGNLTPNNTTWPLQNQISEVASLANS 324 Query: 221 -QECDILENNHRVSFELRGEDIPTSIVKGTTKGKD-------LATEVALS--FRTQTSVR 72 +++ +HRVSFEL GED+ + + D + TE + S R R Sbjct: 325 DHGSEVMVADHRVSFELTGEDVARCLASKLNRSHDRMNNNDRIETEESSSTDIRRNIEKR 384 Query: 71 SNDGRDDR-------TTSFGSSKDFDFNN 6 S D +++ 
++S GSSK+F F+N Sbjct: 385 SGDRENEQHRIQKLSSSSIGSSKEFKFDN 413 >ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|222858882|gb|EEE96429.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 507 Score = 159 bits (401), Expect = 3e-36 Identities = 100/255 (39%), Positives = 128/255 (50%), Gaps = 50/255 (19%) Frame = -2 Query: 854 AVQKRRWGEWWSMYWCFGSY---KHSKRIGHTLAISQETING-VSTSTSYAQKPNPTSTT 687 +VQKRRWG WS+YWCFGS+ K+SKRIGH + + + + G VS+ST + P Sbjct: 32 SVQKRRWGGCWSLYWCFGSHGSHKNSKRIGHAVLVPEPEVPGAVSSSTENQTQSTPI--- 88 Query: 686 TLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-------IFLKGPYADETQL 528 LPF + IF GPYA ETQL Sbjct: 89 LLPFIAPPSSPASFLQSDPPSSTQSPAGLLSLTSLSANAYSPRGPASIFAIGPYAHETQL 148 Query: 527 VSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWR---------- 378 V+PPVFS+FTT+PS+APFTPPPESVQ+TTPSSPE PFAQLL+SSL + R Sbjct: 149 VTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGPNQKFSL 208 Query: 377 ---------------------------NTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMN 279 N+ SP D+ ++ R+ EAP+ +G+EHF Sbjct: 209 SHYEFQSYHLYPGSPGGQIISPGSAISNSGTSSPFPDRHPMLEFRMGEAPKLLGFEHFST 268 Query: 278 YKWGS--NSSALTPN 240 KWGS S +LTP+ Sbjct: 269 RKWGSRLGSGSLTPD 283 >ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Capsella rubella] gi|482549191|gb|EOA13385.1| hypothetical protein CARUB_v10026425mg [Capsella rubella] Length = 437 Score = 157 bits (397), Expect = 7e-36 Identities = 131/416 (31%), Positives = 175/416 (42%), Gaps = 105/416 (25%) Frame = -2 Query: 938 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 759 MR+V++S VQP +VQKRRW + WS+Y CFGS K++KRIG+ + + Sbjct: 1 MRNVNNSVETVNAAATAIITAESRVQPSSVQKRRWAKCWSLYSCFGSQKNNKRIGNAVLV 60 Query: 758 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 579 + +GV T Q ++T LPF Sbjct: 61 PEPVASGVPVVT--VQNSATSTTVVLPFIAPPSSPASFLPSDPSSVSHSPVGPLSLTSNT 118 Query: 578 XXPH----IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQ 411 P +F GPYA+ETQ V+PPVFS+F T+PS+AP+TPPPES TPSSPE PFAQ Sbjct: 119 
FSPKEPQSVFTVGPYANETQPVTPPVFSAFITEPSTAPYTPPPES--SVTPSSPEVPFAQ 176 Query: 410 LLSSSLA----------QKW-----------------------------RNTEVPSPLFD 348 LL+SSL QK+ N+ SP Sbjct: 177 LLTSSLELTRRDSSGINQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPG 236 Query: 347 KRANVDLRLVEAPEFVGYEHFMNYKWGSN----------------SSALTPNGKE----- 231 K V+ R+ E P+F+G+EHF KWGS S ALTPN E Sbjct: 237 KSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGMASGALTPNAPEIISGN 296 Query: 230 -PPSQECDILEN--------------------NHRVSFELRGEDIPTSIVKGTTKGKD-- 120 PS L+N +HRVSFEL GED+ + + D Sbjct: 297 LTPSNTTWPLQNQISEVASLANSDHGSEVIVADHRVSFELTGEDVARCLASKLNRSHDRM 356 Query: 119 -----LATEVAL--------SFRTQTSVRSNDGRDDR-----TTSFGSSKDFDFNN 6 +ATE + SF+ S + + R ++S GSSK+F F+N Sbjct: 357 NNNDRIATEESSSTDRGRRNSFQKIESTENRETEQQRIQKLSSSSIGSSKEFKFDN 412 >ref|XP_006283737.1| hypothetical protein CARUB_v10004810mg [Capsella rubella] gi|482552442|gb|EOA16635.1| hypothetical protein CARUB_v10004810mg [Capsella rubella] Length = 444 Score = 152 bits (385), Expect = 2e-34 Identities = 120/409 (29%), Positives = 166/409 (40%), Gaps = 98/409 (23%) Frame = -2 Query: 938 MRSVHDSXXXXXXXXXXXXXXXXXVQPPA---VQKRRWGEWWSMYWCFGSYKHSKRIGHT 768 MRSV++S +Q P+ + K++WG WWS+YWCFGS K++KRIGH Sbjct: 1 MRSVNNSVDTVTAAASAIVSADSRLQQPSSSLLHKKKWGSWWSLYWCFGSKKNNKRIGHA 60 Query: 767 LAISQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 588 + + +GV+ + + +++ +PF Sbjct: 61 VLAPEPAASGVAVAPVQNSSSSNSTSIFMPFIAPPSSPASFLPSGPPSVSHTPDPCRLRC 120 Query: 587 XXXXXP--HIFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFA 414 F GPYA ETQ V+PPVFS+FTT+PS+APFTPPPES PSSPE PFA Sbjct: 121 SLLVNEPPSAFAIGPYAHETQPVTPPVFSAFTTEPSTAPFTPPPES-----PSSPEVPFA 175 Query: 413 QLLSSSLAQKWRNTE----------------------------------VPSPLFDKRAN 336 QLL+SSL + RN+ SP K + Sbjct: 176 QLLTSSLERARRNSSGGMNHKFSAAHYEFKSHQVYPGSPGGNLISPGSGTSSPYPGKCSI 235 Query: 335 VDLRLVEAPEFVGYEHFMNYKWGS----------------NSSALTPNG----------- 237 ++ R+ E P+F+G+EHF KWGS S ALTP+G Sbjct: 236 IEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAGQGSRLGSGALTPDGGGGMGSKIASG 295 
Query: 236 ---------KEPPSQECDILENN---------------HRVSFELRGEDIPTSIVKGTTK 129 + E L N+ HRVSFEL GED+ + + Sbjct: 296 ALTPLEDSLLDSQVSEVASLANSDHGSSRHNDEAVVVAHRVSFELTGEDVARCLASKLNR 355 Query: 128 --------GKDLATEVALSFRTQTSVRSNDGRDDRTTSFGSSKDFDFNN 6 G+ L +T S + R+ S GSSK+F F+N Sbjct: 356 SGSHERASGEHLRPN---GCKTSGETESEQSQKLRSFSLGSSKEFKFDN 401 >ref|XP_002867602.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. lyrata] gi|297313438|gb|EFH43861.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. lyrata] Length = 421 Score = 152 bits (385), Expect = 2e-34 Identities = 118/367 (32%), Positives = 156/367 (42%), Gaps = 81/367 (22%) Frame = -2 Query: 863 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 684 QP +V K+ WG WWS+Y CFGS K++KRIGH + + + +G + + N TS Sbjct: 22 QPSSVHKK-WGSWWSLYLCFGSKKNNKRIGHAVLVPEPAASGAAVAPVQNSSSNSTSMFM 80 Query: 683 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPHIFLKGPYADETQLVSPPVFSS 504 P F GPYA ETQ V+PPVFS+ Sbjct: 81 PFIAPPSSPASFLPSGPPSVSHTPDPGLLCSLTVNEPPSAFTIGPYAHETQPVTPPVFSA 140 Query: 503 FTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRN----------------- 375 FTT+PS+APFTPPPES PSSPE PFAQLL+SSL + RN Sbjct: 141 FTTEPSTAPFTPPPES-----PSSPEVPFAQLLTSSLEKARRNIGGGMHHKFSAAHYEFK 195 Query: 374 -----------------TEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYKWGS------ 264 + SP K + ++ R+ E P+F+G+EHF KWGS Sbjct: 196 SHQVYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGS 255 Query: 263 ----------NSSALTPNGKEP------PSQECDI--LENN---------------HRVS 183 S ALTP+G P SQ ++ L N+ HRVS Sbjct: 256 ITPAGQGSRLGSGALTPDGLTPLEGSLLDSQITEVASLANSDHGSSRHNDEAAVVPHRVS 315 Query: 182 FELRGEDIPTSIVKGTTK--------GKDLATEVALSFRTQTSVRSNDGRDDRTTSFGSS 27 FEL GED+ + + G+ L +T S + R+ S GSS Sbjct: 316 FELTGEDVARCLASKLNRSGSHEKASGEHLRPN---GCKTSGETESEQSQKLRSFSTGSS 372 Query: 26 KDFDFNN 6 K+F F+N Sbjct: 373 KEFKFDN 379 >ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica] gi|462404864|gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg 
[Prunus persica] Length = 455 Score = 152 bits (383), Expect = 3e-34 Identities = 115/379 (30%), Positives = 159/379 (41%), Gaps = 97/379 (25%) Frame = -2 Query: 851 VQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTTLPFX 672 VQKRRWG WWSMYWCFG +H KRIGH + + + T G A+ P T + LPF Sbjct: 37 VQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPETTDRGGDAPR--AENPIQTPSIVLPFV 94 Query: 671 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH----IFLKGPYADETQLVSPPVFSS 504 P IF GPYA ETQLVSPPVFS+ Sbjct: 95 APPSSPASFLQSEPPSATQSPAGFFSLTASMYSPSGPTSIFAIGPYAHETQLVSPPVFST 154 Query: 503 FTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLL--------------------------- 405 FTT+PS+APFTPPPESV +TTPSSPE PFAQLL Sbjct: 155 FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQLYP 214 Query: 404 ----------SSSLAQKWRNTEVPSPLFDKRAN--VDLRLVEAPEFVGYEHFMNYKWGS- 264 SS ++ ++ P F R + ++ R + P+ + + WGS Sbjct: 215 GSPVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGDPPKLLNLDILSTRDWGSR 274 Query: 263 -NSSALTPNGKEPPSQECDILEN----------------------NHRVSFELRGEDI-- 159 S ++TP+G + S + +L+ NHRVSFEL E++ Sbjct: 275 LGSGSVTPDGAKSTSSDGFLLKPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEEVIR 334 Query: 158 -----PTSIVKGTT---------KGKDLATEVALSFRTQTSVRSNDGRD----------- 54 P ++ + + + K+ ++V S SND + Sbjct: 335 CVEKKPVALAEAVSTSLEDTEKAQSKEDPSKVVSSSICPVGETSNDAAEKAVADGEEAQL 394 Query: 53 ---DRTTSFGSSKDFDFNN 6 R+ + GS K+F+F+N Sbjct: 395 HPKQRSITLGSVKEFNFDN 413 >ref|XP_006401825.1| hypothetical protein EUTSA_v10013563mg [Eutrema salsugineum] gi|557102915|gb|ESQ43278.1| hypothetical protein EUTSA_v10013563mg [Eutrema salsugineum] Length = 440 Score = 151 bits (382), Expect = 4e-34 Identities = 124/415 (29%), Positives = 162/415 (39%), Gaps = 104/415 (25%) Frame = -2 Query: 938 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 759 MR+V++S VQP +V KRRW WS+ CFGS K++KRIG+ + + Sbjct: 1 MRNVNNSVETVNAAATAIVTAESRVQPSSVPKRRWRNCWSLNSCFGSQKNNKRIGNAMLV 60 Query: 758 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 579 E + Q +S+ LPF Sbjct: 61 
VPEPVATGGAPVVTVQNSATSSSIVLPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTSNT 120 Query: 578 XXP----HIFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPE-SVQMTTPSSPEAPFA 414 +F GPYA+ETQ V+PPVFS+F T+PS+APFTPPPE SV +TTPSSPE PFA Sbjct: 121 FSTTEPQSVFTIGPYANETQPVTPPVFSAFITEPSTAPFTPPPESSVHITTPSSPEVPFA 180 Query: 413 QLLSSSLAQKWRNTE---------------------------------------VPSPLF 351 QLL+SSL RN+ SP Sbjct: 181 QLLTSSLELTRRNSSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYP 240 Query: 350 DKRANVDLRLVEAPEFVGYEHFMNYKWGS------------NSSALTPNGKEPPSQECDI 207 K V+ R+ E P+F+G+EHF KWGS S ALTPNG S+ Sbjct: 241 GKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGALTPNGPGMVSESLTP 300 Query: 206 LENN-----------------------------HRVSFELRGEDIPTSIVKGTTKGKDLA 114 NN HRVSFEL GED+ + + D Sbjct: 301 NNNNNTTWPLTSQVSEVASLANSDHGSEVVAADHRVSFELTGEDVARCLASKLNRSHDRM 360 Query: 113 T---EVALSFRTQTSVRSNDGRDDR----------------TTSFGSSKDFDFNN 6 V R S + + +R ++S GSSK+F F+N Sbjct: 361 NNDERVETDERRSISFQKRENNVERVSGDREIEQQRIHKLSSSSIGSSKEFKFDN 415 >ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum] Length = 443 Score = 147 bits (371), Expect = 8e-33 Identities = 113/373 (30%), Positives = 153/373 (41%), Gaps = 90/373 (24%) Frame = -2 Query: 854 AVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVS--TSTSYAQKPNPTSTTTL 681 ++QKRRWG WSMYWCFGS K +KRIGH + I + T +G +S + +Q P+ Sbjct: 36 SIQKRRWGGCWSMYWCFGSQKQTKRIGHAVFIPETTASGADRPSSNTSSQAPSIVLPFIA 95 Query: 680 PFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPHIFLKGPYADETQLVSPPVFSSF 501 P IF GPYA ETQLVSPPVFS+F Sbjct: 96 PPSSPASFLPSEPPSATHSPVGSKCLSMSTYSPSGPASIFAIGPYAHETQLVSPPVFSAF 155 Query: 500 TTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRNTEVP-------------- 363 TT+PS+APFTPPPESV +TTPSSPE PFA+LL + P Sbjct: 156 TTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQPG 215 Query: 362 -------------------SPLFDKRANVDLRLVEAPEFVGYEHFMNYKWGS--NSSALT 246 SP D+ P+F+ E ++WGS S LT Sbjct: 216 SPVSNLISPGSAISVSGTSSPFLDREYTPG-----RPQFLNLEKIAPHEWGSRQGSGTLT 270 Query: 245 
PNGKEPPSQE----------------------CDILENNHRVSFELRGEDI-------PT 153 P P + D+ +HRVSFE+ ED+ PT Sbjct: 271 PEAVNPKYHDNFLLNYQNSGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPT 330 Query: 152 SIV-----------KGTTKGKDLAT----------EVALSFRTQTSVRSNDG---RDDRT 45 ++ + T + ++LA E + +S DG + R+ Sbjct: 331 MMMRTGSVSLQDTERSTKRQENLAEMSNGHDHGGHEPSREIHEGSSTDGEDGQRQQKHRS 390 Query: 44 TSFGSSKDFDFNN 6 + GSSK+F+F+N Sbjct: 391 ITLGSSKEFNFDN 403 >ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum lycopersicum] Length = 443 Score = 146 bits (368), Expect = 2e-32 Identities = 111/368 (30%), Positives = 154/368 (41%), Gaps = 85/368 (23%) Frame = -2 Query: 854 AVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVS--TSTSYAQKPNPTSTTTL 681 ++QKRRWG WSMYWCFGS K +KRIGH + I + T + +S + +Q P+ Sbjct: 36 SIQKRRWGSCWSMYWCFGSQKQTKRIGHAVFIPETTASAADRPSSNTSSQAPSIVLPFIA 95 Query: 680 PFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPHIFLKGPYADETQLVSPPVFSSF 501 P IF GPYA ETQLVSPPVFS+F Sbjct: 96 PPSSPASFLPSEPPSATHSPVGSKCLSMSTYSPSGPASIFAIGPYAHETQLVSPPVFSAF 155 Query: 500 TTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRNTEVP-------------- 363 TT+PS+APFTPPPESV +TTPSSPE PFA+LL + P Sbjct: 156 TTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQPG 215 Query: 362 ---SPLFDKRANVDLRLVEA-----------PEFVGYEHFMNYKWGS--NSSALTPNGKE 231 S L + + + + P+F+ E ++WGS S LTP Sbjct: 216 SPVSNLISPGSAISVSGTSSPFLEREYTPGRPQFLNLEKIAPHEWGSRQGSGTLTPEAVN 275 Query: 230 PPSQEC----------------------DILENNHRVSFELRGEDI-------PTSIV-- 144 P + D+ +HRVSFE+ ED+ PT ++ Sbjct: 276 PKYHDSFLLNYQNTGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTMMMRT 335 Query: 143 ---------KGTTKGKDLAT----------EVALSFRTQTSVRSNDG---RDDRTTSFGS 30 + T + ++LA E + +S DG + R+ + GS Sbjct: 336 GSVSLQDTERSTKRQENLAEMSNAHDHSGHEPSREIHEGSSTDGEDGQRQQKHRSITLGS 395 Query: 29 SKDFDFNN 6 SK+F+F+N Sbjct: 396 SKEFNFDN 403 >ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494240 [Cicer arietinum] Length = 492 Score = 145 bits (366), Expect = 3e-32 Identities = 92/253 (36%), Positives = 122/253 (48%), 
Gaps = 44/253 (17%) Frame = -2 Query: 863 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 684 QP K+RWG +S+ CFGS+K SKRIGH + + + V + S PNP++ Sbjct: 27 QPSTSPKKRWGSCFSLSSCFGSHKSSKRIGHAVLVPEPVAPIVPVAHS---APNPSTVIV 83 Query: 683 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH------IFLKGPYADETQLVS 522 +PF IF GPYA ETQLVS Sbjct: 84 MPFIAPPSSPASFLQSDPPSSTHSPAAGLLSPSVNAAYSSSGSASIFTIGPYAYETQLVS 143 Query: 521 PPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRN----------- 375 PPVFS+FTT+PS+A FTPPPESVQMTTPSSPE PFAQLL+SSL + +N Sbjct: 144 PPVFSNFTTEPSTASFTPPPESVQMTTPSSPEVPFAQLLASSLDRARKNNGSHKFALYNY 203 Query: 374 -------------------------TEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYKW 270 + +P D+R++++L E P+ +G+EHF +W Sbjct: 204 EFQPYQQYPGSPGAQLVSPGSVISTSGTSTPFPDRRSSLELSRGETPKILGFEHFSTRRW 263 Query: 269 GS--NSSALTPNG 237 S S +LTP+G Sbjct: 264 NSRIGSGSLTPDG 276 >ref|NP_194292.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|26449762|dbj|BAC42004.1| unknown protein [Arabidopsis thaliana] gi|28951011|gb|AAO63429.1| At4g25620 [Arabidopsis thaliana] gi|332659684|gb|AEE85084.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 449 Score = 145 bits (366), Expect = 3e-32 Identities = 117/390 (30%), Positives = 156/390 (40%), Gaps = 104/390 (26%) Frame = -2 Query: 863 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 684 QP +VQK+R G WWS+YWCFGS K++KRIGH + + + +G + + N TS Sbjct: 27 QPSSVQKKR-GSWWSLYWCFGSKKNNKRIGHAVLVPEPAASGAAVAPVQNSSSNSTSIFM 85 Query: 683 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPHIFLKGPYADETQLVSPPVFSS 504 P F GPYA ETQ V+PPVFS+ Sbjct: 86 PFIAPPSSPASFLPSGPPSASHTPDPGLLCSLTVNEPPSAFTIGPYAHETQPVTPPVFSA 145 Query: 503 FTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRN----------------- 375 FTT+PS+APFTPPPES PSSPE PFAQLL+SSL + RN Sbjct: 146 FTTEPSTAPFTPPPES-----PSSPEVPFAQLLTSSLERARRNSGGGMNQKFSAAHYEFK 200 Query: 374 -----------------TEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYKWGS------ 264 + SP K + ++ R+ E P+F+G+EHF KWGS Sbjct: 201 
SCQVYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGS 260 Query: 263 ----------NSSALTPNGKEPPS-------------------------------QECDI 207 S ALTP+G + S E Sbjct: 261 ITPAGQGSRLGSGALTPDGSKLTSGVVTPNGAETVIRMSYGNLTPLEGSLLDSQISEVAS 320 Query: 206 LENN---------------HRVSFELRGEDIPTSIVKGTTK--------GKDLATEVALS 96 L N+ HRVSFEL GED+ + + G+ L Sbjct: 321 LANSDHGSSRHNDEALVVPHRVSFELTGEDVARCLASKLNRSGSHEKASGEHLRPNCC-- 378 Query: 95 FRTQTSVRSNDGRDDRTTSFGSSKDFDFNN 6 +T S + R+ S GS+K+F F++ Sbjct: 379 -KTSGETESEQSQKLRSFSTGSNKEFKFDS 407 >ref|XP_007143454.1| hypothetical protein PHAVU_007G073100g [Phaseolus vulgaris] gi|561016644|gb|ESW15448.1| hypothetical protein PHAVU_007G073100g [Phaseolus vulgaris] Length = 479 Score = 144 bits (364), Expect = 5e-32 Identities = 106/333 (31%), Positives = 139/333 (41%), Gaps = 98/333 (29%) Frame = -2 Query: 863 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQ--ETINGVSTSTSYAQKPNPTST 690 QP K+RWG WS+YWCFG +K+SKRIG+ + + + E + + + A PNP++ Sbjct: 23 QPATSPKKRWGSCWSLYWCFGPHKNSKRIGNAVLVPEPVEPAGQIGSHLATAA-PNPSTA 81 Query: 689 TTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-----IFLKGPYADETQLV 525 +PF IF GPY ETQLV Sbjct: 82 VAMPFIVPPSSPASFLESDSSSATQSPVGLFSLSSLNANASCGPASIFAIGPYTYETQLV 141 Query: 524 SPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSS----------------- 396 SPPVFS+FTT+PS+APFTPPPESVQ+TTPSSPE PFAQLL+SS Sbjct: 142 SPPVFSNFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRDCKDKGTNQRFALS 201 Query: 395 -----LAQKWRNTEVP---------------SPLFDKRANVDLRLVEAPEFVGYEHFMNY 276 L Q++ + P +P D ++ EA +G+EHF + Sbjct: 202 NYEFQLYQQYPGSPGPQLISPASIISTSGSSTPFPDTHPLLEFHKGEASNLLGFEHFSTH 261 Query: 275 KWG--------------------------------SNSSALTPNGKEPPSQ--------- 219 KW S+S LTP G P ++ Sbjct: 262 KWNSRLGSGSLTPDSTGQGSGLGSGSLTPNAVKLVSSSGCLTPEGVAPTARNGIYVGKQT 321 Query: 218 -ECDILEN------------NHRVSFELRGEDI 159 E L N +HRVSFEL GED+ Sbjct: 322 SELTPLANSENECQPNAALVDHRVSFELTGEDV 354