BLASTX nr result
ID: Ziziphus21_contig00014099
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Ziziphus21_contig00014099 (1632 letters)

Database: ./nr
77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

ref|XP_010106355.1| hypothetical protein L484_000896 [Morus nota...   287   2e-74
ref|XP_010664420.1| PREDICTED: RNA-binding protein FUS isoform X...   234   2e-58
ref|XP_009376205.1| PREDICTED: proline-rich protein 2 [Pyrus x b...   231   1e-57
ref|XP_002283770.1| PREDICTED: RNA-binding protein FUS isoform X...   230   3e-57
ref|XP_008219246.1| PREDICTED: collagen alpha-1(III) chain [Prun...   216   5e-53
ref|XP_007018241.1| Hydroxyproline-rich glycoprotein family prot...   207   3e-50
ref|XP_006472450.1| PREDICTED: RNA-binding protein FUS-like [Cit...   204   2e-49
ref|XP_011017017.1| PREDICTED: uncharacterized protein LOC105120...   202   5e-49
ref|XP_008442005.1| PREDICTED: uncharacterized protein LOC103486...   200   3e-48
ref|XP_002514052.1| conserved hypothetical protein [Ricinus comm...   197   3e-47
ref|XP_007224174.1| hypothetical protein PRUPE_ppa016470mg [Prun...   192   6e-46
ref|XP_009607447.1| PREDICTED: uncharacterized protein LOC104101...   191   1e-45
ref|XP_012068030.1| PREDICTED: uncharacterized protein LOC105630...   189   5e-45
ref|XP_009607446.1| PREDICTED: uncharacterized protein LOC104101...   187   3e-44
ref|XP_008459151.1| PREDICTED: uncharacterized protein LOC103498...   185   9e-44
ref|XP_012068029.1| PREDICTED: uncharacterized protein LOC105630...   185   1e-43
gb|KHG05960.1| Epidermal growth factor receptor [Gossypium arbor...   176   5e-41
ref|XP_007018242.1| Hydroxyproline-rich glycoprotein family prot...   176   5e-41
ref|XP_002306529.2| hydroxyproline-rich glycoprotein [Populus tr...   176   7e-41
ref|XP_006575356.1| PREDICTED: vegetative cell wall protein gp1-...   174   2e-40

>ref|XP_010106355.1| hypothetical protein L484_000896 [Morus notabilis] gi|587967407|gb|EXC52457.1| hypothetical protein L484_000896 [Morus notabilis] Length = 346 Score = 287 bits (734), Expect = 2e-74 Identities = 180/362 (49%), Positives = 218/362 (60%), Gaps = 12/362 (3%) Frame = -1 Query: 1374 MEESEKRRERLKAMRMEAACSE--NSNDLASPMPS-LSNPLAETSKAMH--DGYCATSRF 1210 MEESEKRRERL+AMR EAA + N+ A MP LSNPL ETS A + TSRF Sbjct: 1 MEESEKRRERLRAMRHEAAAQSVNSDNNEAPAMPCYLSNPLVETSAAAPPPEQSHGTSRF 60 Query: 1209 GFYTDPMAAFSADKKRNNAGNNQISADYSTSPIAGGSSKMGFSSPLPGPRNPGMTSPGAY 1030 FYTDPMAAFSA+K+RNN ++ IS+ + T P GS + SP GPR GM+ A+ Sbjct: 61 DFYTDPMAAFSANKRRNNT-SDPISSHHVTPPANSGSPMLRSPSPFSGPRYAGMSP--AH 117 Query: 1029 QFQSNYTPN-QMYQARGFGLNSSFPRSPMGIHRP-TMHQGNPDAWNGSGGAAGY-NFSPN 859 QFQSNY+PN +MYQ +GFG + +G+ RP MHQGN D G G AAGY NF N Sbjct: 118 QFQSNYSPNPRMYQPQGFGHDPISQSGELGMSRPFNMHQGNMDPSIGPGSAAGYYNFPSN 177 Query: 858 PSRECNFPSPRFGPTGSPCFNTGQGRANWPNQXXXXXXXXXXXXXXXXXXXGDHWRGGR- 682 R FPSPR GPTGS FN GQGRA+W N G W GG Sbjct: 178 QPRGSRFPSPRIGPTGS-FFNAGQGRAHWHNHSPNPGLGRGGSPSPSLGRGGGRWHGGST 236 Query: 681 SPASGQRGGWGNFSPXXXXXXXXXXXSHARPSAMDRPLGPEKYYDNAMVEDPWEFLEPVI 502 SP SG+RGG G S R MDR LGPE++YD +M+ED W+FLEPV+ Sbjct: 237 SPGSGRRGGRGPGSA-------------GRHFTMDRQLGPERFYDESMIEDAWKFLEPVV 283 Query: 501 WKGVDDPLNSLRTPESSRSSIS---GTKRAKTYEVPGRSNNQPSLAEYLAASLNEADNDS 331 W+ VD L+SL TP+SS+S I+ G K+AK + +S +QPSLAEYLAAS +EA+ D Sbjct: 284 WREVDASLSSLSTPDSSKSWITRSLGAKKAKVSDSTSKSGSQPSLAEYLAASFDEANKDE 343 Query: 330 SS 325 SS Sbjct: 344 SS 345 >ref|XP_010664420.1| PREDICTED: RNA-binding protein FUS isoform X2 [Vitis vinifera] Length = 346 Score = 234 bits (597), Expect = 2e-58 Identities = 159/363 (43%), Positives = 199/363 (54%), Gaps = 14/363 (3%) Frame = -1 Query: 1374 MEESEKRRERLKAMRMEAACSENSNDL-ASPMPS-LSNPLAETSKAM--HDGYCATSRFG 1207 
MEESEKRRERLKAMRMEAA ++ S+ + S MP LSNPL E S + + C T RF Sbjct: 1 MEESEKRRERLKAMRMEAAQTKVSDTVDTSAMPGYLSNPLVEGSATLPVQEDSCVTPRFD 60 Query: 1206 FYTDPMAAFSADKKRNNAGNNQISADYST-SPIAGGSSKMG-FSSPLPGPRNPGMTSPGA 1033 FYTDPM+AFS++K+R+ GN QI DY T S +G ++ M SS L GPRN MT Sbjct: 61 FYTDPMSAFSSNKRRSKVGN-QIQQDYLTPSSNSGYTATMARMSSSLSGPRNCEMTPSPN 119 Query: 1032 YQFQSNYTPNQ-MYQARGFGLNSSFPRSPMGIHRP-TMHQGNPDAWNGSGGAAGYNFSPN 859 FQ N++P Q + QA+G +S RSP+ + P HQG P WNGS G Y N Sbjct: 120 PPFQPNFSPGQGINQAQGLYHSSGPYRSPIEMASPFPAHQGTPGVWNGSNGMPRYGVPSN 179 Query: 858 PSRECNFPSPRFGPTGSPCFNTGQGRANWPNQXXXXXXXXXXXXXXXXXXXGDHWRGGRS 679 R NFPSP F P GSP F +G+GR +W N G S Sbjct: 180 SPRGGNFPSPGFRPVGSPSFRSGRGRGHWFNNSPSPVSGRG---------------GSSS 224 Query: 678 PASGQ-RGGW--GNFSPXXXXXXXXXXXSHARPSAMDRPLGPEKYYDNAMVEDPWEFLEP 508 P SG+ R GW + SP HA SA DR PE +Y+ +MVEDPW+FL+P Sbjct: 225 PNSGRGRSGWFGNSMSPGSGRGRGRGLGFHAHVSAQDR---PELFYNKSMVEDPWKFLKP 281 Query: 507 VIW---KGVDDPLNSLRTPESSRSSISGTKRAKTYEVPGRSNNQPSLAEYLAASLNEADN 337 VIW K + N+ +P+S K+ + E S++Q SLAEYLAAS NEA N Sbjct: 282 VIWSREKALGKMGNASDSPKSWLPKSINMKKTRVSEATNESSSQQSLAEYLAASFNEAVN 341 Query: 336 DSS 328 D+S Sbjct: 342 DAS 344 >ref|XP_009376205.1| PREDICTED: proline-rich protein 2 [Pyrus x bretschneideri] Length = 358 Score = 231 bits (589), Expect = 1e-57 Identities = 173/389 (44%), Positives = 209/389 (53%), Gaps = 39/389 (10%) Frame = -1 Query: 1374 MEESEKRRERLKAMRMEAACSENSNDLA-SPMPS-LSNPLAETSKAMH--DGYCATSRFG 1207 M+ESEKR+ERL+AMR+EA +E S A S +P LSNPLAE + A+ + CA RF Sbjct: 1 MDESEKRKERLRAMRIEAEETEASLKAATSAVPVYLSNPLAEDTTAIPVPEEPCAPFRFD 60 Query: 1206 FYTDPMAAFSADKKRNNAGNNQISADYSTSPIAGGSSKMGFSSPLPG-PRNPGMTSPGAY 1030 FY+DPMAAFS+D KR G+ QI+ + GG SPL G PRNP MT+ A+ Sbjct: 61 FYSDPMAAFSSDNKRIKVGD-QIAQENFRHSNTGGFPGARLPSPLSGGPRNPQMTASPAH 119 Query: 1029 QFQSNYTPNQ-MYQARGFGLNSSFPRSPMGIHRP-TMHQGN-PDAWNGSG----GAAGYN 871 QFQ +Y+P+Q MYQA+G N S RSP+G+ RP MH GN P+ WNG+ + GY Sbjct: 120 QFQRSYSPDQRMYQAQGSYQNFSPQRSPVGMERPFPMHHGNRPEVWNGAEFRPPASPGYG 179 Query: 870 
------FSPNPSRECNFP-SPRFGPTGSPCF----------------NTGQGRANWPNQX 760 F P S P SPRF P GSP F N GQGR +W + Sbjct: 180 PQGSPRFRPQGSPGFRPPGSPRFQPPGSPGFRPPTSPGFRPPGSPGSNIGQGRGHWFSHT 239 Query: 759 XXXXXXXXXXXXXXXXXXGDHWRGGRSPASGQ-RGGWGNFSPXXXXXXXXXXXSHARPSA 583 G SP S RGGW SH R A Sbjct: 240 PRPQSVHG---------------GSPSPGSSSGRGGWRG--------------SHGR--A 268 Query: 582 MDRPLGPEKYYDNAMVEDPWEFLEPVIWKGVDDPLNSLRTPESSRSSI---SGTKRAKTY 412 MDR LGPE++Y+ MVEDPW+FLEPVIWKGVD PL L T SS+ SI S T A Sbjct: 269 MDRQLGPERFYNATMVEDPWKFLEPVIWKGVDTPLRCLNTHGSSKLSIGRSSSTNNASIS 328 Query: 411 EVPGRSNNQPSLAEYLAASLNEADNDSSS 325 E +S QPSLAE+LAASLNEA +D+ S Sbjct: 329 EALNKSMPQPSLAEFLAASLNEAVDDAPS 357 >ref|XP_002283770.1| PREDICTED: RNA-binding protein FUS isoform X1 [Vitis vinifera] gi|731428688|ref|XP_010664419.1| PREDICTED: RNA-binding protein FUS isoform X1 [Vitis vinifera] gi|302142075|emb|CBI19278.3| unnamed protein product [Vitis vinifera] Length = 347 Score = 230 bits (586), Expect = 3e-57 Identities = 158/364 (43%), Positives = 198/364 (54%), Gaps = 15/364 (4%) Frame = -1 Query: 1374 MEESEKRRERLKAMRMEAACSENSNDL-ASPMPS-LSNPLAETSKAM--HDGYCATSRFG 1207 MEESEKRRERLKAMRMEAA ++ S+ + S MP LSNPL E S + + C T RF Sbjct: 1 MEESEKRRERLKAMRMEAAQTKVSDTVDTSAMPGYLSNPLVEGSATLPVQEDSCVTPRFD 60 Query: 1206 FYTDPMAAFSADKKRNNAGNNQISADYST-SPIAGGSSKMG--FSSPLPGPRNPGMTSPG 1036 FYTDPM+AFS++K+R+ GN QI DY T S +G ++ M SS GPRN MT Sbjct: 61 FYTDPMSAFSSNKRRSKVGN-QIQQDYLTPSSNSGYTATMARMSSSLSAGPRNCEMTPSP 119 Query: 1035 AYQFQSNYTPNQ-MYQARGFGLNSSFPRSPMGIHRP-TMHQGNPDAWNGSGGAAGYNFSP 862 FQ N++P Q + QA+G +S RSP+ + P HQG P WNGS G Y Sbjct: 120 NPPFQPNFSPGQGINQAQGLYHSSGPYRSPIEMASPFPAHQGTPGVWNGSNGMPRYGVPS 179 Query: 861 NPSRECNFPSPRFGPTGSPCFNTGQGRANWPNQXXXXXXXXXXXXXXXXXXXGDHWRGGR 682 N R NFPSP F P GSP F +G+GR +W N G Sbjct: 180 NSPRGGNFPSPGFRPVGSPSFRSGRGRGHWFNNSPSPVSGRG---------------GSS 224 Query: 681 SPASGQ-RGGW--GNFSPXXXXXXXXXXXSHARPSAMDRPLGPEKYYDNAMVEDPWEFLE 511 SP SG+ R GW + SP HA SA DR PE +Y+ +MVEDPW+FL+ Sbjct: 225 
SPNSGRGRSGWFGNSMSPGSGRGRGRGLGFHAHVSAQDR---PELFYNKSMVEDPWKFLK 281 Query: 510 PVIW---KGVDDPLNSLRTPESSRSSISGTKRAKTYEVPGRSNNQPSLAEYLAASLNEAD 340 PVIW K + N+ +P+S K+ + E S++Q SLAEYLAAS NEA Sbjct: 282 PVIWSREKALGKMGNASDSPKSWLPKSINMKKTRVSEATNESSSQQSLAEYLAASFNEAV 341 Query: 339 NDSS 328 ND+S Sbjct: 342 NDAS 345 >ref|XP_008219246.1| PREDICTED: collagen alpha-1(III) chain [Prunus mume] Length = 428 Score = 216 bits (550), Expect = 5e-53 Identities = 167/439 (38%), Positives = 211/439 (48%), Gaps = 91/439 (20%) Frame = -1 Query: 1374 MEESEKRRERLKAMRMEAACSENSNDLA-SPMPS-LSNPLAETSKAM--HDGYCATSRFG 1207 M+ESEKR+ERL+AMR EA +E S+ + S +P LSNPLAE S A+ H+ CA SRF Sbjct: 1 MDESEKRKERLRAMRTEAEEAEASHSVTTSAVPGYLSNPLAEDSAAIPVHEEPCAPSRFD 60 Query: 1206 FYTDPMAAFSADKKRNNAGNNQISADYSTS-----PIA--GGSSKMGFSSPLPG-PRNPG 1051 FYTDPMAAFS+D KR GN +++ S P+A GGS SSPL G PRNP Sbjct: 61 FYTDPMAAFSSDTKRVKVGNQIAPSNFGRSNTGGSPMARTGGSPMARHSSPLSGGPRNPE 120 Query: 1050 MTSPGAYQFQSNYTPNQ-MYQAR-----GFG----------------------------- 976 MT+P ++QFQSNY+P+Q MYQ + FG Sbjct: 121 MTAPPSHQFQSNYSPDQRMYQVQQGFCQNFGPQRNPIGIVRPFPMHHGNPPEVWNGAEGA 180 Query: 975 LNSSFPRSP----------------MGIHRPTMHQGNPDAWNGSG----------GAAGY 874 N SFP P +G P P +G G G+ G+ Sbjct: 181 ANYSFPSDPSRECRFPGPGFRPPGSLGFRPPGSPVLGPQGSSGFGPPGSPGFRPPGSPGF 240 Query: 873 N------FSPNPSRECNFP---------SPRFGPTGSPCFNTGQGRANWPNQXXXXXXXX 739 F P S P SP F P GSP N+GQGR +W + Sbjct: 241 RPPGSPGFGPQGSSVFGPPGSPGFRPPASPGFRPLGSPGSNSGQGRGHWRSNSPSPRSVH 300 Query: 738 XXXXXXXXXXXGDHWRGGRSPASGQRGGWGNFSPXXXXXXXXXXXSHARPSAMDRPLGPE 559 +P SG+RGG G S H R S M++ LGPE Sbjct: 301 GGNTSPSSSSGRGGGHWSTNPGSGRRGGRGLGS-------------HGR-STMEKQLGPE 346 Query: 558 KYYDNAMVEDPWEFLEPVIWKGVDDPLNSLRTPESSRSSI---SGTKRAKTYEVPGRSNN 388 +YY+++MVEDPW+FL+PVIWKGVD P+ +P SS+ I S TK A E +S + Sbjct: 347 RYYNDSMVEDPWKFLKPVIWKGVDTPMKRYYSPGSSKPPIEKSSSTKDASISEGSNKSTS 406 Query: 387 QPSLAEYLAASLNEADNDS 331 QPSLAEYLAAS N+A D+ Sbjct: 407 QPSLAEYLAASFNDAVKDT 425 >ref|XP_007018241.1| Hydroxyproline-rich glycoprotein family 
protein, putative isoform 1 [Theobroma cacao] gi|508723569|gb|EOY15466.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 368 Score = 207 bits (526), Expect = 3e-50 Identities = 150/386 (38%), Positives = 197/386 (51%), Gaps = 36/386 (9%) Frame = -1 Query: 1374 MEESEKRRERLKAMRMEAACSENSNDLASP-MPS-LSNPLAETSK--AMHDGYCATSRFG 1207 M+ESEKR+ERLKAMR+EAA SE N++A+P +P LSNPL+ETS A+ + +C+T RF Sbjct: 1 MDESEKRKERLKAMRLEAAQSEVPNNVATPSVPGHLSNPLSETSSTAAVQEDFCSTPRFD 60 Query: 1206 FYTDPMAAFSADKKRNNAGNNQISADYSTSPIAGGSSKMGFSSPLPGPRNPGMTSPGAYQ 1027 +YTDPMAAFSA+KKR A +NQ + +Y T P G S PGPRN M P + Sbjct: 61 YYTDPMAAFSANKKRGKA-DNQSTQNYFTPPTTSGWPVARVSPSHPGPRNYDMNPPVRHM 119 Query: 1026 FQSNYTPNQ-MYQARGFGLNSSFPRSPMGIHRPTMHQGNPDAWNGSGGAAGYNFSP---- 862 QS Y+ +Q MY +G N + RSP+ MH GN DAWNGS Y S Sbjct: 120 -QSQYSLDQRMYHQQGPHSNFAAHRSPITRSPSHMHHGNSDAWNGSQAFGNYYSSASDGS 178 Query: 861 ----------------------NPSRECNFPSPRFGPTGSPCFNTGQGRANWPNQXXXXX 748 N SR N P+P F P P G+GR P Q Sbjct: 179 PGGMFGTPLMHPGTTPRFWNPSNASRYSNSPTPGFSPADIPY---GRGR---PQQFGNYP 232 Query: 747 XXXXXXXXXXXXXXGDHWRGGRSPASGQ-RGGWGNFSPXXXXXXXXXXXSHARPSAMDRP 571 G S G+ RG G+ + H SA +R Sbjct: 233 LPSPGHGGSL----------GLSSGRGRGRGYGGSITHGIGRSGGRGLGFHGHSSASNRM 282 Query: 570 LGPEKYYDNAMVEDPWEFLEPVIWKGVDDPLNSLRTPESSRS----SISGTKRAKTYEVP 403 +GPE +YD +M+EDPW+ L+PV+W+ + ++SL P+SS S SIS K+ K E Sbjct: 283 MGPESFYDESMLEDPWQHLKPVLWRRREAGMDSLSNPDSSNSWFPKSIS-AKKVKVSEAS 341 Query: 402 GRSNNQPSLAEYLAASLNEADNDSSS 325 + N+Q SLAEYLAAS N+A D+ + Sbjct: 342 NKFNSQLSLAEYLAASFNKAVEDTKN 367 >ref|XP_006472450.1| PREDICTED: RNA-binding protein FUS-like [Citrus sinensis] Length = 379 Score = 204 bits (518), Expect = 2e-49 Identities = 156/398 (39%), Positives = 197/398 (49%), Gaps = 52/398 (13%) Frame = -1 Query: 1374 MEESEKRRERLKAMRMEAACSENSNDLAS-PMPS-LSNPLAETSKA--MHDGYCATSRFG 1207 MEESEKR+ERLKAMR EAA +E + + + P+PS LSNPL E S A + + A SRFG Sbjct: 1 MEESEKRKERLKAMRAEAAQAEVCSSVETFPVPSSLSNPLFEDSAAQPIQEQPFAGSRFG 60 Query: 1206 
FYTDPMAAFSADKKRNNAGNNQISADYSTSPIAGGSSKMGFSSPLPGPRNPGMTSPGAYQ 1027 FYTDP+AAFSA+KKR NN DYS P + SS PRN GM +Q Sbjct: 61 FYTDPVAAFSANKKRGQHDNNT-RQDYSMPPSISAPAMARPSSFFSEPRNSGMIPSPGHQ 119 Query: 1026 FQSNYTPNQ-MYQARGFGLNSSFP------RSPMGIHRPT-------------------- 928 Q++ + +Q MYQA+ N+ P SP+ IH+ T Sbjct: 120 LQASSSFDQRMYQAQS-PYNNPHPYRGPRGASPLPIHQGTPGAWSGLQATTSHYSPTIYG 178 Query: 927 -------------MHQGNPDAWNGSGGAAGYNFSPNPSRECNFPSPRFGPTGSPCFNTGQ 787 +HQG P++WNGSGG A YN S SP FGP SP F GQ Sbjct: 179 QRSPRGMASPFTGIHQGTPESWNGSGGTARYNSPSTASGGGQIFSPGFGPVRSPTFGYGQ 238 Query: 786 GRANWPNQXXXXXXXXXXXXXXXXXXXGDHWRGGR-SPASGQ-RGGW--GNFSPXXXXXX 619 GR W + RGG P+SG+ RG W G+ SP Sbjct: 239 GRPQWQGRSPSPGSG----------------RGGSPGPSSGRGRGRWYGGSVSPGLGCSG 282 Query: 618 XXXXXSHARPSAMDRPLGPEKYYDNAMVEDPWEFLEPVIWKGVDDPLNSLRTPESSRS-- 445 H+R D GPE +YD +M EDPW+ LEP++WK + ++P SS S Sbjct: 283 GRGRGPHSRGFGGDGKQGPECFYDKSMDEDPWQELEPLVWKS-----RNFKSPGSSNSWF 337 Query: 444 --SISGTKRAKTYEVPGRSNNQPSLAEYLAASLNEADN 337 SIS K+ + E +S++QPSLAEYLAAS NEA N Sbjct: 338 PKSIS-MKKPRVSEASRQSSSQPSLAEYLAASFNEATN 374 >ref|XP_011017017.1| PREDICTED: uncharacterized protein LOC105120493 isoform X2 [Populus euphratica] Length = 333 Score = 202 bits (515), Expect = 5e-49 Identities = 150/372 (40%), Positives = 192/372 (51%), Gaps = 21/372 (5%) Frame = -1 Query: 1374 MEESEKRRERLKAMRMEAACSE---NSNDLASPMPSL-SNPLAE---TSKAMHDGYCATS 1216 ME++EKR ERLKAMR A+ N N S +P L +NPL E T A+ + AT Sbjct: 1 MEDAEKRSERLKAMRAVASAQAETCNDNVETSAVPGLLANPLLENAATRPALEESR-ATP 59 Query: 1215 RFGFYTDPMAAFSADKKRNNAGNNQISADYSTSPIAGGSSKMGFSSPLPGPRNPGMTSPG 1036 RF FYTDP AAFSA++KR N S +P SS FSSP PG RNP +T Sbjct: 60 RFDFYTDPSAAFSANRKRTATAN---LVARSFTPPNNISSMPQFSSPRPGQRNPEVTPSS 116 Query: 1035 AYQFQSNYTPNQ-MYQARGFGLNSSFPRSPMGIHRP-TMHQGNPDAWNGSGGAAGYNFSP 862 AYQ QSNY+PNQ MY +G N++F R+P RP TM+QG P+ WNG GG A Y S Sbjct: 117 AYQMQSNYSPNQRMYPGQGPYHNAAFYRTPSNFARPFTMNQGTPEMWNGPGGPASYQ-SY 175 Query: 861 NPSRECNFP------SPRFGPTG-SPCFNTGQGRANWPNQXXXXXXXXXXXXXXXXXXXG 703 P R + P +P FGP G 
SP +G Sbjct: 176 TPYRGISRPYPIHQGNPGFGPVGSSPSPVSGY---------------------------- 207 Query: 702 DHWRGGRSPASGQRG-GWGNFSPXXXXXXXXXXXSHARPSAMDRPLGPEKYYDNAMVEDP 526 G SPAS RG G+ + S +R A + PE +YDN+MVEDP Sbjct: 208 -----GGSPASSGRGRGYWDSSSGLGQSGGRGRGFRSRGFAPNETQEPECFYDNSMVEDP 262 Query: 525 WEFLEPVIWKGVDDPLNSLRTPESSRS----SISGTKRAKTYEVPGRSNNQPSLAEYLAA 358 W+ L PV+W+G+DDP N+L P SS S SIS K+ + E +S + +LAEYL+A Sbjct: 263 WQHLTPVLWRGLDDPGNNLNGPVSSNSWLPKSIS-VKKTRISESSNKSTSGQTLAEYLSA 321 Query: 357 SLNEADNDSSSI 322 + EA ND+ ++ Sbjct: 322 AFTEATNDAPNV 333 >ref|XP_008442005.1| PREDICTED: uncharacterized protein LOC103486001 [Cucumis melo] gi|659082738|ref|XP_008442006.1| PREDICTED: uncharacterized protein LOC103486001 [Cucumis melo] gi|659082740|ref|XP_008442007.1| PREDICTED: uncharacterized protein LOC103486001 [Cucumis melo] gi|659082742|ref|XP_008442009.1| PREDICTED: uncharacterized protein LOC103486001 [Cucumis melo] Length = 331 Score = 200 bits (509), Expect = 3e-48 Identities = 137/352 (38%), Positives = 184/352 (52%), Gaps = 5/352 (1%) Frame = -1 Query: 1374 MEESEKRRERLKAMRMEAACSENSNDLASPMPS-LSNPLAETSKAMHDGY--CATSRFGF 1204 MEESEKRRERL+AMRMEAA ++ +N + + +P+ LSNPL E+S M C T RF + Sbjct: 1 MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMMGQLAPCTTPRFDY 60 Query: 1203 YTDPMAAFSADKKRNNAGNNQISADYSTSPIAGGSSKMGFSSPLPGPRNPGMTSPGAYQF 1024 YT+PMAAFS KK+ N +S ++ P +S F PG RNP M+S +QF Sbjct: 61 YTNPMAAFSTSKKKGKIENQLVSDNFV--PYHHNTSSPTF----PGLRNPEMSSASTHQF 114 Query: 1023 QSNYTPNQMYQARGFGLNSSFPRSPMGIHRP-TMHQGNPDAWNGSGGAAGYNFSPNPSRE 847 +M+ ARG + SP G+ RP + QG+P W GS + +P RE Sbjct: 115 HQCSPDRRMFYARGDS-EAGGHGSP-GMPRPYAVDQGDPHMWRGSKRPFVNQYPTHPPRE 172 Query: 846 CNFPSPRFGPTGSPCFNTGQGRANWPNQXXXXXXXXXXXXXXXXXXXGDHWRGGRSPASG 667 N PS P G+ N Q RAN+ + + G SP G Sbjct: 173 MNSPSHVSRPRGNSYTNPTQDRANYRSSSPNPG-----------------FLGSFSPGRG 215 Query: 666 QRGGWGNFSPXXXXXXXXXXXSHARPSAMDRPLGPEKYYDNAMVEDPWEFLEPVIWKGVD 487 G GN +P SH R S++D+ GPE++Y+ +M+EDPW+ L+P IW + Sbjct: 216 
SHGHHGNMTPSPRFGYGRGTGSHGRHSSLDKSPGPEQFYNVSMLEDPWKVLQPCIWTTIA 275 Query: 486 DPLNSLRTPESSRSSISGTKRAKTYE-VPGRSNNQPSLAEYLAASLNEADND 334 NS ES S+ GTK+A+ + GRSN+QPSLAEYLAAS EA D Sbjct: 276 PSSNSTEPSESWISTKFGTKKARVSDSSSGRSNSQPSLAEYLAASFKEAIED 327 >ref|XP_002514052.1| conserved hypothetical protein [Ricinus communis] gi|223547138|gb|EEF48635.1| conserved hypothetical protein [Ricinus communis] Length = 412 Score = 197 bits (500), Expect = 3e-47 Identities = 142/345 (41%), Positives = 168/345 (48%), Gaps = 10/345 (2%) Frame = -1 Query: 1326 EAACSENSNDLASPMPSLSNPLAETSKAM--HDGYCATSRFGFYTDPMAAFSADKKRNNA 1153 EA CS + A L+NPL E+ + AT RF FYT+PMAAFSADK+ A Sbjct: 98 EAGCSSHVQTSAVS-GFLTNPLLESPATFPAKEESSATPRFDFYTNPMAAFSADKRI--A 154 Query: 1152 GNNQISADYSTSPIAGGSSKMGFSSPLPGPRNPGMTSPGAYQFQSNYTPNQMYQARGFGL 973 NQ + Y P G FSSP+PGP NPGMT YQ QSNY PNQ +G Sbjct: 155 SINQPAPRYFIPPSNNGPMPW-FSSPVPGPGNPGMTPSPVYQMQSNYLPNQRTHQQG-PY 212 Query: 972 NSSFP-RSPMGIHRPTMHQGNPDAWNGSGGAAGYNFSPNPSRECNFP----SPRFGPTGS 808 NS+ P RSP P MHQG PDAWNG GG A +P R C +P +P F P GS Sbjct: 213 NSAVPYRSPRAGPFP-MHQGTPDAWNGPGGIAAA--APYRGRMCPYPIHESNPGFQPAGS 269 Query: 807 PCFNTGQGRANWPNQXXXXXXXXXXXXXXXXXXXGDHWRGGRSPASGQ-RGGWGNFSPXX 631 P FN GQGR W GG S SG+ +G W S Sbjct: 270 PSFNYGQGRPPWSGNSPSPRSV----------------HGGSSTYSGRGQGQWHGSSRGQ 313 Query: 630 XXXXXXXXXSHARPSAMDRPLGPEKYYDNAMVEDPWEFLEPVIWK--GVDDPLNSLRTPE 457 H+R A GPE +Y+ +MVEDPW+ LEPV+WK GV NS Sbjct: 314 ISGQSGRRGFHSRGPAPGEAFGPESFYEKSMVEDPWKQLEPVVWKMLGVPGSSNSWLPKS 373 Query: 456 SSRSSISGTKRAKTYEVPGRSNNQPSLAEYLAASLNEADNDSSSI 322 SR K+ + E SN++ SLAEYLAAS NEA D S+ Sbjct: 374 ISR------KKPRPSESSNNSNSKQSLAEYLAASFNEAVKDGPSV 412 >ref|XP_007224174.1| hypothetical protein PRUPE_ppa016470mg [Prunus persica] gi|462421110|gb|EMJ25373.1| hypothetical protein PRUPE_ppa016470mg [Prunus persica] Length = 398 Score = 192 bits (489), Expect = 6e-46 Identities = 153/410 (37%), Positives = 186/410 (45%), Gaps = 62/410 (15%) Frame = -1 Query: 1374 
MEESEKRRERLKAMRMEAACSENSNDLA-SPMPS-LSNPLAETSKAM--HDGYCATSRFG 1207 M+ESEKR+ERL+AMR EA +E S+ + S +P LSNPLAE S A+ H CA SRF Sbjct: 1 MDESEKRKERLRAMRTEAEEAEASHSVTTSAVPGYLSNPLAEDSAALPVHKEPCAPSRFD 60 Query: 1206 FYTDPMAAFSADKKRNNAGNNQISADYSTSPIAGGSSKMGFSSPLP-------------- 1069 FYTDPMAAFS+D KR GN QI+ P GGS SSPL Sbjct: 61 FYTDPMAAFSSDTKRVKVGN-QIAPSNFGRPNTGGSPMARLSSPLSDKRMYRVQQGFCQN 119 Query: 1068 -GP-RNP-------------------GMTSPGAYQFQSNYTPNQMYQARGF--------- 979 GP RNP G Y F S+ + + GF Sbjct: 120 FGPQRNPIGIARPFPMHHGNPPEVWNGAEGAANYSFPSDPSRECRFPGPGFRPPGSPGFR 179 Query: 978 ----------GLNSSFPRSPMGIHRPTMHQGNPDAWNGSGGAAGYNFSPNPSRECNFP-S 832 G + P G P P G G F P S P S Sbjct: 180 PPGSPGLGPQGSSGFGPPGSPGFRPPGSPGFRPPGSPGFGPQGSSGFGPPGSPGFRPPAS 239 Query: 831 PRFGPTGSPCFNTGQGRANWPNQXXXXXXXXXXXXXXXXXXXGDHWRGGRSPASGQRGGW 652 P F P GSP N+GQGR +W + SP SG+RGG Sbjct: 240 PGFRPLGSPGSNSGQGRGHWRSNSPSPHSVHGGNTSPSSSSGRGGGHWSTSPGSGRRGGR 299 Query: 651 GNFSPXXXXXXXXXXXSHARPSAMDRPLGPEKYYDNAMVEDPWEFLEPVIWKGVDDPLNS 472 G S H R S M++ LGPE+YY+++MVEDPW+FL+PVIWKGVD P+ Sbjct: 300 GLGS-------------HGR-STMEKQLGPERYYNDSMVEDPWKFLKPVIWKGVDTPMKR 345 Query: 471 LRTPESSRSSI---SGTKRAKTYEVPGRSNNQPSLAEYLAASLNEADNDS 331 +P SS+ I S TK A E +S +QPSLAEYLAAS N+A D+ Sbjct: 346 FYSPGSSKPPIENSSSTKDAIISEGSNKSTSQPSLAEYLAASFNDAVKDT 395 >ref|XP_009607447.1| PREDICTED: uncharacterized protein LOC104101664 isoform X2 [Nicotiana tomentosiformis] Length = 340 Score = 191 bits (486), Expect = 1e-45 Identities = 141/360 (39%), Positives = 177/360 (49%), Gaps = 12/360 (3%) Frame = -1 Query: 1371 EESEKRRERLKAMRMEAACSENSNDLASPMPSLSNPLAETSKAMHDGYCATSRFGFYTDP 1192 EESEKR+ERLKAMRMEA+ N N+ + + LSNPL E+ + +CA RF +YTDP Sbjct: 3 EESEKRKERLKAMRMEASECGNYNETENQLQGLSNPLVESPSGQAE-FCAAPRFDYYTDP 61 Query: 1191 MAAFSADKKRNNAGNNQISADYSTSPIAGGSSKMGFSSPLPGPRNPGMTSPGAYQFQSNY 1012 MAAFSA+KKRNN A Y+ P PRNP SP Y Q NY Sbjct: 62 MAAFSANKKRNNVSPQVSQACYTP----------------PRPRNP--QSP-IYTAQDNY 102 Query: 1011 TPNQMYQARGFG-----LNSSFPRSPMGIHRPTMHQGNPDAWNGSGGAAGYNFSPNPSRE 
847 + +Q Q++G L + SP G T + +P+AW S G F PN S Sbjct: 103 SLDQRSQSQGVHHTFNPLGNPGQNSPFG----TPQRSSPNAWGSSFGTPNNYFPPNSSIG 158 Query: 846 CNFPSPRFGPTGSPCFNTGQGRANWPNQXXXXXXXXXXXXXXXXXXXGDHWRGGRSPASG 667 NF SP G P F+ GQGR N P H RG S Sbjct: 159 GNFASPGIHRGGRPGFHYGQGRGN-PGSGYRGSPSQGSGYRGSPYQGPGH-RGSPYQGSA 216 Query: 666 Q-RGGW--GNFSPXXXXXXXXXXXSHARPSAMDRPLGPEKYYDNAMVEDPWEFLEPVIWK 496 Q R W + SP SH SA RP + YY+ +MVEDPW+ ++PVIWK Sbjct: 217 QGRSQWMGNSSSPVSVQRGRRGLGSHGCTSAESRP---DLYYNKSMVEDPWKEMKPVIWK 273 Query: 495 GVDDPLNSLRTPESSRSS----ISGTKRAKTYEVPGRSNNQPSLAEYLAASLNEADNDSS 328 ++ P N+L TPES +SS K+AK + P +S Q SLAEYL+AS NEA + S Sbjct: 274 PLNAPSNNLDTPESEKSSWLPKSISAKKAKIPDAPLKSTPQQSLAEYLSASFNEAAGNES 333 >ref|XP_012068030.1| PREDICTED: uncharacterized protein LOC105630718 isoform X2 [Jatropha curcas] gi|643734820|gb|KDP41490.1| hypothetical protein JCGZ_15897 [Jatropha curcas] Length = 341 Score = 189 bits (481), Expect = 5e-45 Identities = 137/359 (38%), Positives = 180/359 (50%), Gaps = 10/359 (2%) Frame = -1 Query: 1371 EESEKRRERLKAMRMEAACSENSNDLASP---MPSLSNPLAETSKAMHDGYCATSRFGFY 1201 E+SE+RRERLKAMR AA +E S+ + + + L+NPL E+ + + AT RF FY Sbjct: 3 EDSERRRERLKAMRTVAAQAEASSHVQTSSGYIGFLANPLLESPELTQEPSHATPRFDFY 62 Query: 1200 TDPMAAFSADKKRNNAGNNQISADYSTSPIAGGSSKMGFSSPLPGPRNPGMTSPGAYQFQ 1021 TDPMAAF ++KKR+ G NQ Y T P SS FSSP PGPRNP MT + Q Q Sbjct: 63 TDPMAAFYSNKKRSGVG-NQAPQGYLTPPSDSASSMSQFSSPHPGPRNPDMTPFPSNQMQ 121 Query: 1020 SNYTPNQMYQARGFGLNSSFPRSPMGIHRPTMHQGNPDAWNGSGGAAGY-NFSPNPSREC 844 NY+P Q+ NS P + MHQG P A GS G A Y N +P+ Sbjct: 122 HNYSPYQIMDQTQVAYNSIPPCTSPRAGPFPMHQGMPYAQGGSSGVAYYHNNAPHRGMTS 181 Query: 843 NF----PSPRFGPTGSPCFNTGQGRANWPN-QXXXXXXXXXXXXXXXXXXXGDHWRG-GR 682 + +P F P G+ FN GQGR P W G R Sbjct: 182 QYHVRSRNPNFQPEGNHSFNYGQGRPLSPRIGNNPYFGSGRGGSSTHSGRGQGQWHGSSR 241 Query: 681 SPASGQRGGWGNFSPXXXXXXXXXXXSHARPSAMDRPLGPEKYYDNAMVEDPWEFLEPVI 502 S SG+ GG G ++ D L E +YD +MVEDPW+ LEPV+ Sbjct: 242 SQVSGRNGGRGR-------------GFYSHGIGSDAALRAESFYDKSMVEDPWQRLEPVL 288 Query: 501 
WKGVDDPLNSLRTPESSRSSISGTKRAKTYEVPGRSNNQPSLAEYLAASLNEADNDSSS 325 WKG+D +S P+S + K+ + E +S++Q +LAEYLAA+ NE+ ND+ S Sbjct: 289 WKGLDGSSDSW-LPKS-----ASMKKPRVSESSNKSSSQ-NLAEYLAAAFNESVNDAPS 340 >ref|XP_009607446.1| PREDICTED: uncharacterized protein LOC104101664 isoform X1 [Nicotiana tomentosiformis] Length = 350 Score = 187 bits (474), Expect = 3e-44 Identities = 140/370 (37%), Positives = 177/370 (47%), Gaps = 22/370 (5%) Frame = -1 Query: 1371 EESEKRRERLKAMRMEAACSENSNDLASPMPSLSNPLAETSKAMHDGYCATSRFGFYTDP 1192 EESEKR+ERLKAMRMEA+ N N+ + + LSNPL E+ + +CA RF +YTDP Sbjct: 3 EESEKRKERLKAMRMEASECGNYNETENQLQGLSNPLVESPSGQAE-FCAAPRFDYYTDP 61 Query: 1191 MAAFSADKKRNNAGNNQISADYSTSPIAGGSSKMGFSSPLPGPRNPGMTSPGAYQFQSNY 1012 MAAFSA+KKRNN A Y+ P PRNP SP Y Q NY Sbjct: 62 MAAFSANKKRNNVSPQVSQACYTP----------------PRPRNP--QSP-IYTAQDNY 102 Query: 1011 TPNQMYQARGFG-----LNSSFPRSPMGIHRPTMHQGNPDAWNGSGGAAGYNFSPNPSRE 847 + +Q Q++G L + SP G T + +P+AW S G F PN S Sbjct: 103 SLDQRSQSQGVHHTFNPLGNPGQNSPFG----TPQRSSPNAWGSSFGTPNNYFPPNSSIG 158 Query: 846 CNFPSPRFGPTGSPCFNTGQGRANWPNQXXXXXXXXXXXXXXXXXXXGDHWRGGRSPASG 667 NF SP G P F+ GQGR N P + RG G Sbjct: 159 GNFASPGIHRGGRPGFHYGQGRGN-PGSGYRGSPSQGSGYRGSPNQGSGY-RGSPYQGPG 216 Query: 666 QRGG-----------W--GNFSPXXXXXXXXXXXSHARPSAMDRPLGPEKYYDNAMVEDP 526 RG W + SP SH SA RP + YY+ +MVEDP Sbjct: 217 HRGSPYQGSAQGRSQWMGNSSSPVSVQRGRRGLGSHGCTSAESRP---DLYYNKSMVEDP 273 Query: 525 WEFLEPVIWKGVDDPLNSLRTPESSRSS----ISGTKRAKTYEVPGRSNNQPSLAEYLAA 358 W+ ++PVIWK ++ P N+L TPES +SS K+AK + P +S Q SLAEYL+A Sbjct: 274 WKEMKPVIWKPLNAPSNNLDTPESEKSSWLPKSISAKKAKIPDAPLKSTPQQSLAEYLSA 333 Query: 357 SLNEADNDSS 328 S NEA + S Sbjct: 334 SFNEAAGNES 343 >ref|XP_008459151.1| PREDICTED: uncharacterized protein LOC103498353 isoform X1 [Cucumis melo] gi|659118498|ref|XP_008459152.1| PREDICTED: uncharacterized protein LOC103498353 isoform X1 [Cucumis melo] Length = 335 Score = 185 bits (470), Expect = 9e-44 Identities = 135/356 (37%), Positives = 183/356 (51%), Gaps = 8/356 (2%) Frame = -1 Query: 1374 
MEESEKRRERLKAMRMEAACSENSNDLASPMPS-LSNPLAETSKAMHDGY--CATSRFGF 1204 MEESEKRRERL+AMRMEAA ++ N + + +P+ LSNPL E+S M C RF + Sbjct: 1 MEESEKRRERLRAMRMEAAQADVVNYIETSLPNHLSNPLVESSATMVGQLAPCTAPRFDY 60 Query: 1203 YTDPMAAFSADKKRNNAGNNQISADYSTSPIAGGSSKMGFSSP-LPGPRNPGMTSPGAYQ 1027 YT+PMAAFS KK+ N +S + P +S + P PG RNP M+ +Q Sbjct: 61 YTNPMAAFSTSKKKGKIENQPVSDTFV--PYHHNTSSTTYLPPTFPGLRNPEMSPSSTHQ 118 Query: 1026 FQSNYTPNQM-YQARGFGLNSSFPRSPMGIHRP-TMHQGNPDAWNGSGGAAGYNFSPNPS 853 F Y+P+Q + ARG + SP G+ RP ++QG+P W G F +P Sbjct: 119 FHQ-YSPDQRTFYARGDS-EAGGHGSP-GMPRPYAVNQGDPHMWRGPRRPFVNQFPTHPP 175 Query: 852 RECNFPSPRFGPTGSPCFNTGQGRANWPNQXXXXXXXXXXXXXXXXXXXGDHWRGGRSPA 673 RE N S GP G+ N Q RA + + + G SP Sbjct: 176 REMNSSSHVSGPRGNSYTNPTQDRAKYRSSSPNPG-----------------FHGSLSPG 218 Query: 672 SGQRGGWGNFSPXXXXXXXXXXXSHARPSAMDRPLGPEKYYDNAMVEDPWEFLEPVIWKG 493 G G GN +P H R S +D+ GPE++Y+ +M+EDPW+ L+P IW Sbjct: 219 RGSHGHHGNMTPSPRFGYGRGTGFHGRHSLLDKS-GPEQFYNVSMLEDPWKVLQPCIWTT 277 Query: 492 VDDPLNSLRTPESSRSSISGTKRAKTYE-VPGRSNN-QPSLAEYLAASLNEADNDS 331 +D NS + P S S GTK+A+ + GRS++ QPSLAEYLAAS EA D+ Sbjct: 278 IDSSSNSAK-PSESWISKFGTKKARVSDSSSGRSSSQQPSLAEYLAASFKEAIEDA 332 >ref|XP_012068029.1| PREDICTED: uncharacterized protein LOC105630718 isoform X1 [Jatropha curcas] Length = 342 Score = 185 bits (469), Expect = 1e-43 Identities = 137/360 (38%), Positives = 180/360 (50%), Gaps = 11/360 (3%) Frame = -1 Query: 1371 EESEKRRERLKAMRMEAACSENSNDLASP---MPSLSNPLAETSKAMHDGYCATSRFGFY 1201 E+SE+RRERLKAMR AA +E S+ + + + L+NPL E+ + + AT RF FY Sbjct: 3 EDSERRRERLKAMRTVAAQAEASSHVQTSSGYIGFLANPLLESPELTQEPSHATPRFDFY 62 Query: 1200 TDPMAAFSADKKRNNAGNNQISADYSTSPIAGGSSKMGFSSPLP-GPRNPGMTSPGAYQF 1024 TDPMAAF ++KKR+ G NQ Y T P SS FSSP P GPRNP MT + Q Sbjct: 63 TDPMAAFYSNKKRSGVG-NQAPQGYLTPPSDSASSMSQFSSPHPAGPRNPDMTPFPSNQM 121 Query: 1023 QSNYTPNQMYQARGFGLNSSFPRSPMGIHRPTMHQGNPDAWNGSGGAAGY-NFSPNPSRE 847 Q NY+P Q+ NS P + MHQG P A GS G A Y N +P+ Sbjct: 122 QHNYSPYQIMDQTQVAYNSIPPCTSPRAGPFPMHQGMPYAQGGSSGVAYYHNNAPHRGMT 181 Query: 846 
CNF----PSPRFGPTGSPCFNTGQGRANWPN-QXXXXXXXXXXXXXXXXXXXGDHWRG-G 685 + +P F P G+ FN GQGR P W G Sbjct: 182 SQYHVRSRNPNFQPEGNHSFNYGQGRPLSPRIGNNPYFGSGRGGSSTHSGRGQGQWHGSS 241 Query: 684 RSPASGQRGGWGNFSPXXXXXXXXXXXSHARPSAMDRPLGPEKYYDNAMVEDPWEFLEPV 505 RS SG+ GG G ++ D L E +YD +MVEDPW+ LEPV Sbjct: 242 RSQVSGRNGGRGR-------------GFYSHGIGSDAALRAESFYDKSMVEDPWQRLEPV 288 Query: 504 IWKGVDDPLNSLRTPESSRSSISGTKRAKTYEVPGRSNNQPSLAEYLAASLNEADNDSSS 325 +WKG+D +S P+S + K+ + E +S++Q +LAEYLAA+ NE+ ND+ S Sbjct: 289 LWKGLDGSSDSW-LPKS-----ASMKKPRVSESSNKSSSQ-NLAEYLAAAFNESVNDAPS 341 >gb|KHG05960.1| Epidermal growth factor receptor [Gossypium arboreum] Length = 332 Score = 176 bits (446), Expect = 5e-41 Identities = 134/374 (35%), Positives = 178/374 (47%), Gaps = 30/374 (8%) Frame = -1 Query: 1374 MEESEKRRERLKAMRMEAACSENSNDL-ASPMP-SLSNPLAETSKAM--HDGYCATSRFG 1207 M+ESEKR+ERLKAMRMEAA +E S+++ +S MP SLSNPL ETS ++ D +C RF Sbjct: 1 MDESEKRKERLKAMRMEAANAEVSDNVQSSAMPGSLSNPLIETSSSLTAQDDFCRAPRFD 60 Query: 1206 FYTDPMAAFSADKKRNNAGNNQISADYSTSPIAGGSSKMGFSSPLPGPRNPGMTSPGAYQ 1027 +YTDPMAAFS +KKR+ N S GPRN G P +Q Sbjct: 61 YYTDPMAAFSGNKKRDYVHNRAPSDS--------------------GPRNTGRGLP-VHQ 99 Query: 1026 FQSNYTPNQMYQARGFGLNSSFPRSPMGIHRPTMHQGNPDAWNGSGGAAGYNF----SP- 862 QS++ P++ G+ P SP MHQG DAWNG YNF SP Sbjct: 100 MQSHFAPDR-------GVYKQGPYSPRLRSPSLMHQGQSDAWNGPQATEHYNFVSDGSPR 152 Query: 861 -----------------NPSRECNF---PSPRFGPTGSPCFNTGQGRAN-WPNQXXXXXX 745 NPS ++ P+P F P FN G R + Sbjct: 153 GMFGGPPQHPGTFHRVWNPSNTSSYGKLPNPGFSPADGRSFNYGAARPQMFGRNPILDQR 212 Query: 744 XXXXXXXXXXXXXGDHWRGGRSPASGQRGGWGNFSPXXXXXXXXXXXSHARPSAMDRPLG 565 G +RG P G+ G G H SA ++ LG Sbjct: 213 PGSSPSFSPGRGRGPGYRGSSGPGLGRSAGRGQ-------------GFHGHSSASNKMLG 259 Query: 564 PEKYYDNAMVEDPWEFLEPVIWKGVDDPLNSLRTPESSRSSISGTKRAKTYEVPGRSNNQ 385 PE Y+D +M++DPW+ L+P+ W+ + ++SL P +S S SG KRAK E ++++ Sbjct: 260 PECYFDESMLKDPWQHLKPIPWRRQEAGMDSLGAPGTSNS--SGIKRAKVSE----ASSK 313 Query: 384 PSLAEYLAASLNEA 343 SLAEYLAAS N+A Sbjct: 314 QSLAEYLAASFNKA 327 >ref|XP_007018242.1| 
Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] gi|508723570|gb|EOY15467.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] Length = 345 Score = 176 bits (446), Expect = 5e-41 Identities = 140/386 (36%), Positives = 184/386 (47%), Gaps = 36/386 (9%) Frame = -1 Query: 1374 MEESEKRRERLKAMRMEAACSENSNDLASP-MPS-LSNPLAETSK--AMHDGYCATSRFG 1207 M+ESEKR+ERLKAMR+EAA SE N++A+P +P LSNPL+ETS A+ + +C+T RF Sbjct: 1 MDESEKRKERLKAMRLEAAQSEVPNNVATPSVPGHLSNPLSETSSTAAVQEDFCSTPRFD 60 Query: 1206 FYTDPMAAFSADKKRNNAGNNQISADYSTSPIAGGSSKMGFSSPLPGPRNPGMTSPGAYQ 1027 +YTDPMAA S P+A S PGPRN M P + Sbjct: 61 YYTDPMAATSG------------------WPVARVSPSH------PGPRNYDMNPPVRHM 96 Query: 1026 FQSNYTPNQ-MYQARGFGLNSSFPRSPMGIHRPTMHQGNPDAWNGSGGAAGYNFSP---- 862 QS Y+ +Q MY +G N + RSP+ MH GN DAWNGS Y S Sbjct: 97 -QSQYSLDQRMYHQQGPHSNFAAHRSPITRSPSHMHHGNSDAWNGSQAFGNYYSSASDGS 155 Query: 861 ----------------------NPSRECNFPSPRFGPTGSPCFNTGQGRANWPNQXXXXX 748 N SR N P+P F P P G+GR P Q Sbjct: 156 PGGMFGTPLMHPGTTPRFWNPSNASRYSNSPTPGFSPADIPY---GRGR---PQQFGNYP 209 Query: 747 XXXXXXXXXXXXXXGDHWRGGRSPASGQ-RGGWGNFSPXXXXXXXXXXXSHARPSAMDRP 571 G S G+ RG G+ + H SA +R Sbjct: 210 LPSPGHGGSL----------GLSSGRGRGRGYGGSITHGIGRSGGRGLGFHGHSSASNRM 259 Query: 570 LGPEKYYDNAMVEDPWEFLEPVIWKGVDDPLNSLRTPESSRS----SISGTKRAKTYEVP 403 +GPE +YD +M+EDPW+ L+PV+W+ + ++SL P+SS S SIS K+ K E Sbjct: 260 MGPESFYDESMLEDPWQHLKPVLWRRREAGMDSLSNPDSSNSWFPKSIS-AKKVKVSEAS 318 Query: 402 GRSNNQPSLAEYLAASLNEADNDSSS 325 + N+Q SLAEYLAAS N+A D+ + Sbjct: 319 NKFNSQLSLAEYLAASFNKAVEDTKN 344 >ref|XP_002306529.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|550339341|gb|EEE93525.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 331 Score = 176 bits (445), Expect = 7e-41 Identities = 144/367 (39%), Positives = 183/367 (49%), Gaps = 28/367 (7%) Frame = -1 Query: 1374 MEESEKRRERLKAMR-MEAACSENSNDLASPMPS--LSNPLAETSKAM--HDGYCATSRF 1210 ME+SEKRRERLKAMR + AA +E SN++ + P L+ PL T + 
A RF Sbjct: 1 MEDSEKRRERLKAMRSIAAAQAETSNNVETSAPPGLLAYPLLGTPATLLAQGESSAIPRF 60 Query: 1209 GFYTDPMAAFSADKKRNNAGNNQISADYSTSPIAGGSSKMGFSSPLPGPRNPGMTSPGAY 1030 FYTDP AAFSA++K A NQ + Y TSP + SS SSP PG RN +T P AY Sbjct: 61 DFYTDPSAAFSANRK--GAAGNQAARGYFTSP-SNNSSVPQLSSPHPGQRNLEVTPPHAY 117 Query: 1029 QFQ----------SNYTPNQ-MYQARGFGLNSSFPRSPMGIHRP-TMHQGNP-DAWNGSG 889 Q Q SN+ PNQ MY+ +G N++ RSP G P M+QG P + W+G G Sbjct: 118 QMQNSYPHANQMQSNHLPNQRMYRGQGPYHNAASYRSPRGFSCPFPMNQGAPPEMWSGPG 177 Query: 888 GAAGYNFSPNPSRECNFP------SPRFGPTGS-PCFNTGQGRANWPNQXXXXXXXXXXX 730 A Y FS + P +P FGP GS P +G G + +Q Sbjct: 178 FPASY-FSSTVHGGLSSPYPICQGNPGFGPVGSSPSPVSGYGGSPAISQTGQG------- 229 Query: 729 XXXXXXXXGDHWRGGRSPASGQRGGWGNFSPXXXXXXXXXXXSHARPSAMDRPLGPEKYY 550 HW S GQ GG G H+R A + GPE +Y Sbjct: 230 ----------HWHS--SSGFGQSGGRGR-------------GFHSRGFAPNEAQGPECFY 264 Query: 549 DNAMVEDPWEFLEPVIWKGVDDPLNSLRTPESSRSSIS---GTKRAKTYEVPGRSNNQPS 379 DN+MVEDPW+ LEPV+W G+DD N+L P SS S + K++ E +S + S Sbjct: 265 DNSMVEDPWQHLEPVLWSGLDDWGNNLNGPGSSNSLLPKSISMKKSSVAESSNKSTSGVS 324 Query: 378 LAEYLAA 358 LAEYLAA Sbjct: 325 LAEYLAA 331 >ref|XP_006575356.1| PREDICTED: vegetative cell wall protein gp1-like [Glycine max] gi|734389136|gb|KHN26144.1| hypothetical protein glysoja_019468 [Glycine soja] gi|947124264|gb|KRH72470.1| hypothetical protein GLYMA_02G215100 [Glycine max] Length = 343 Score = 174 bits (441), Expect = 2e-40 Identities = 132/384 (34%), Positives = 176/384 (45%), Gaps = 33/384 (8%) Frame = -1 Query: 1374 MEESEKRRERLKAMRMEAACSENSNDL-ASPMPS-LSNPLAETSKAM--HDGYCATSRFG 1207 ME+SE+R++RLK MR++A +E S S +P LSNPL E M D A RF Sbjct: 1 MEDSEQRKKRLKQMRVQADQAEVSGGREGSVVPGFLSNPLIEAPSTMPSRDTSYAAPRFD 60 Query: 1206 FYTDPMAAFSADKKRNNAGNNQISADYSTSPIAGGSSKMGFSSPLPGPRNPGMT------ 1045 +YTDPM+AFS+ KRNNA ++ S GG +SSP P +NP MT Sbjct: 61 YYTDPMSAFSS--KRNNASTQAAPDNFPPSKF-GGPPMAQYSSPHPESKNPQMTPHPIQA 117 Query: 1044 SPGAYQFQSNYTPNQMYQARGFGLNSSFPRSPMGIHRPTMHQGNPDAWNGSGGAAGYNFS 865 SP AY+ NP W+G GG A YNF Sbjct: 118 
SPAAYR-------------------------------------NP-VWSGPGGPAHYNFP 139 Query: 864 PNPSRECNFPSPRFGPTGSPCFNTGQGRANWPNQXXXXXXXXXXXXXXXXXXXGDH---- 697 +PS +PSPRF P+G P +NT QG A+ P+ Sbjct: 140 LHPSSGGTYPSPRFEPSGGPLYNTAQGIAHQPSYSPNPPYPGYVNSPRPSYSPNPSPGYS 199 Query: 696 --------------WRGGRSPASGQ-RGGWGNF-SPXXXXXXXXXXXSHARPSAMDRPLG 565 +R SP G+ RG W N SP H S + G Sbjct: 200 NCPMPSYSPNPSPGYRNSPSPGQGRGRGFWRNTGSPVSGWGSGQGPNFHGHRSNENTVHG 259 Query: 564 PEKYYDNAMVEDPWEFLEPVIWKGVDDPLNSLRTPESSR---SSISGTKRAKTYEVPGRS 394 P+++Y +MVEDPWE LEP+IWK D LN+ R P +S+ S S TK + +S Sbjct: 260 PDRFYKRSMVEDPWEHLEPIIWKANDGYLNTSRVPLNSQPWISKASSTKGEGSSAASVKS 319 Query: 393 NNQPSLAEYLAASLNEADNDSSSI 322 +++PSLAEYLA++ NEA ND+ ++ Sbjct: 320 SSEPSLAEYLASAFNEAANDAENV 343
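
Note on reading the alignments above: every hit is reported with Frame = -1, so the query coordinates count down along the reverse strand (1374 to 1210 in the first block, which is 165 nt, or 55 codons, matching the 55 non-gap residues on that Query line). The per-hit summary fields (Length, Score, Expect, Identities, Gaps, Frame) can be recovered from the text even when its line breaks are lost. The following is only a minimal sketch under that assumption; HIT_RE, parse_hits, and query_span_codons are hypothetical names introduced here for illustration and are not part of BLAST or any library.

```python
import re

# Hypothetical pattern (an assumption, not a BLAST-provided format spec):
# matches one legacy BLASTX hit header, tolerating collapsed line breaks.
HIT_RE = re.compile(
    r">(?P<acc>\S+).*?"
    r"Length = (?P<length>\d+)\s+"
    r"Score = (?P<bits>[\d.]+) bits \((?P<raw>\d+)\), "
    r"Expect = (?P<evalue>\S+)\s+"
    r"Identities = (?P<ident>\d+)/(?P<aln>\d+) \(\d+%\), "
    r"Positives = (?P<pos>\d+)/\d+ \(\d+%\)"
    r"(?:, Gaps = (?P<gaps>\d+)/\d+ \(\d+%\))?\s+"
    r"Frame = (?P<frame>[+-]\d)",
    re.S,
)


def parse_hits(report):
    """Return one dict of summary statistics per hit header found in report."""
    hits = []
    for m in HIT_RE.finditer(report):
        ident, aln = int(m["ident"]), int(m["aln"])
        ev = m["evalue"]
        # Legacy BLAST can print very small E-values as "e-105"; prefix a 1.
        evalue = float("1" + ev if ev.startswith("e") else ev)
        hits.append({
            "accession": m["acc"],
            "subject_length": int(m["length"]),
            "bit_score": float(m["bits"]),
            "evalue": evalue,
            "identities": ident,
            "alignment_length": aln,
            # Recomputed from the counts; the report itself prints a whole percent.
            "percent_identity": round(100.0 * ident / aln, 1),
            "gaps": int(m["gaps"] or 0),
            "frame": int(m["frame"]),
        })
    return hits


def query_span_codons(q_start, q_end):
    """Codons covered by a translated query segment on either strand."""
    span = abs(q_start - q_end) + 1
    if span % 3:
        raise ValueError("translated segment length is not a multiple of 3")
    return span // 3


# Header of the first hit, copied from the report (second defline omitted).
SAMPLE = (
    ">ref|XP_010106355.1| hypothetical protein L484_000896 [Morus notabilis] "
    "Length = 346 Score = 287 bits (734), Expect = 2e-74 "
    "Identities = 180/362 (49%), Positives = 218/362 (60%), Gaps = 12/362 (3%) "
    "Frame = -1 Query: 1374 MEESEKRRERLKAMRMEAACSE"
)

if __name__ == "__main__":
    for hit in parse_hits(SAMPLE):
        print(hit["accession"], hit["bit_score"], hit["evalue"], hit["frame"])
    print(query_span_codons(1374, 1210))
```

The pattern only reads the header fields; it does not attempt to rebuild the Query/Sbjct alignment columns, which cannot be realigned reliably once the spacing of the match line has been collapsed.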