BLASTX nr result
ID: Papaver27_contig00029297
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Papaver27_contig00029297 (583 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002283770.1| PREDICTED: uncharacterized protein LOC100243... 81 2e-13 ref|XP_006472450.1| PREDICTED: RNA-binding protein FUS-like [Cit... 78 2e-12 ref|XP_006433813.1| hypothetical protein CICLE_v10001506mg [Citr... 75 1e-11 gb|EXC52457.1| hypothetical protein L484_000896 [Morus notabilis] 74 3e-11 ref|XP_002306529.2| hydroxyproline-rich glycoprotein [Populus tr... 71 2e-10 ref|XP_007018241.1| Hydroxyproline-rich glycoprotein family prot... 70 5e-10 ref|XP_007224174.1| hypothetical protein PRUPE_ppa016470mg [Prun... 69 9e-10 ref|XP_002514052.1| conserved hypothetical protein [Ricinus comm... 65 1e-08 ref|XP_006366423.1| PREDICTED: heterogeneous nuclear ribonucleop... 64 3e-08 ref|XP_006413421.1| hypothetical protein EUTSA_v10025760mg [Eutr... 63 7e-08 ref|XP_004252719.1| PREDICTED: uncharacterized protein LOC101249... 61 2e-07 ref|XP_004301384.1| PREDICTED: uncharacterized protein LOC101291... 60 3e-07 ref|XP_002869714.1| hydroxyproline-rich glycoprotein family prot... 60 3e-07 ref|XP_007018242.1| Hydroxyproline-rich glycoprotein family prot... 60 4e-07 gb|AAK76684.1| unknown protein [Arabidopsis thaliana] 59 1e-06 ref|NP_567704.1| hydroxyproline-rich glycoprotein family protein... 59 1e-06 ref|XP_006596379.1| PREDICTED: serine/threonine-protein phosphat... 57 4e-06 ref|XP_006386460.1| hypothetical protein POPTR_0002s11210g [Popu... 57 4e-06 ref|NP_001046932.1| Os02g0510300 [Oryza sativa Japonica Group] g... 57 4e-06 ref|XP_006575356.1| PREDICTED: vegetative cell wall protein gp1-... 57 5e-06 >ref|XP_002283770.1| PREDICTED: uncharacterized protein LOC100243767 [Vitis vinifera] gi|302142075|emb|CBI19278.3| unnamed protein product [Vitis vinifera] Length = 347 Score = 81.3 bits (199), Expect = 2e-13 Identities = 70/225 (31%), Positives = 95/225 (42%), Gaps = 33/225 (14%) Frame = +3 Query: 3 ANSEASHSVGTSMPTINLANPLVETSS---VPQRPTAARRFDYYTDPMAAFSGNKRTN-- 167 A ++ S +V TS L+NPLVE S+ V + RFD+YTDPM+AFS NKR + Sbjct: 19 AQTKVSDTVDTSAMPGYLSNPLVEGSATLPVQEDSCVTPRFDFYTDPMSAFSSNKRRSKV 78 Query: 168 ------NHMYQAXXXXXXXXXXXXXQSYPSGPRNFYPIPPPSHQLQRAFEANP------- 308 +++ + S +GPRN P P+ Q F Sbjct: 79 GNQIQQDYLTPSSNSGYTATMARMSSSLSAGPRNCEMTPSPNPPFQPNFSPGQGINQAQG 138 Query: 309 -HHSPNSWRSPV--APPFYGNRGPPPEAWNRPSGTGGYGYPSNSLXXXXXXXXXXXXXXX 479 +HS +RSP+ A PF ++G P WN +G YG PSNS Sbjct: 139 LYHSSGPYRSPIEMASPFPAHQG-TPGVWNGSNGMPRYGVPSNS----------PRGGNF 187 Query: 480 PRPSVNPY------PGSGSSYTFSNSPNHGTGLRG------GRGR 578 P P P G G + F+NSP+ +G G GRGR Sbjct: 188 PSPGFRPVGSPSFRSGRGRGHWFNNSPSPVSGRGGSSSPNSGRGR 232 >ref|XP_006472450.1| PREDICTED: RNA-binding protein FUS-like [Citrus sinensis] Length = 379 Score = 77.8 bits (190), Expect = 2e-12 Identities = 73/268 (27%), Positives = 98/268 (36%), Gaps = 76/268 (28%) Frame = +3 Query: 3 ANSEASHSVGTSMPTINLANPLVETSS---VPQRPTAARRFDYYTDPMAAFSGNKRTNNH 173 A +E SV T +L+NPL E S+ + ++P A RF +YTDP+AAFS NK+ H Sbjct: 19 AQAEVCSSVETFPVPSSLSNPLFEDSAAQPIQEQPFAGSRFGFYTDPVAAFSANKKRGQH 78 Query: 174 -----MYQAXXXXXXXXXXXXXQSYPSGPRNFYPIPPPSHQL-------QRAFEA-NPHH 314 + S+ S PRN IP P HQL QR ++A +P++ Sbjct: 79 DNNTRQDYSMPPSISAPAMARPSSFFSEPRNSGMIPSPGHQLQASSSFDQRMYQAQSPYN 138 Query: 315 SPNSWRSP-------------------------VAPPFYGNRGP-------------PPE 380 +P+ +R P +P YG R P PE Sbjct: 139 NPHPYRGPRGASPLPIHQGTPGAWSGLQATTSHYSPTIYGQRSPRGMASPFTGIHQGTPE 198 Query: 381 AWNRPSGTGGYGYPSNSLXXXXXXXXXXXXXXXP-------RP---------------SV 494 +WN GT Y PS + P RP S Sbjct: 199 SWNGSGGTARYNSPSTASGGGQIFSPGFGPVRSPTFGYGQGRPQWQGRSPSPGSGRGGSP 258 Query: 495 NPYPGSGSSYTFSNSPNHGTGLRGGRGR 578 P G G + S + G G GGRGR Sbjct: 259 GPSSGRGRGRWYGGSVSPGLGCSGGRGR 286 >ref|XP_006433813.1| hypothetical protein CICLE_v10001506mg [Citrus clementina] gi|557535935|gb|ESR47053.1| hypothetical protein CICLE_v10001506mg [Citrus clementina] Length = 379 Score = 75.1 bits (183), Expect = 1e-11 Identities = 71/268 (26%), Positives = 97/268 (36%), Gaps = 76/268 (28%) Frame = +3 Query: 3 ANSEASHSVGTSMPTINLANPLVETSS---VPQRPTAARRFDYYTDPMAAFSGNKRTNNH 173 A +E SV T +L+NPL E S+ + ++P RF +YTDP+AAFS NK H Sbjct: 19 AQAEVCSSVETFPVPSSLSNPLFEDSAAQPIQEQPFTGSRFGFYTDPVAAFSANKNRGQH 78 Query: 174 -----MYQAXXXXXXXXXXXXXQSYPSGPRNFYPIPPPSHQL-------QRAFEA-NPHH 314 + S+ S PRN IP P HQL QR +++ +P++ Sbjct: 79 DNNTRQNYSMPPSISAPAMARPSSFFSEPRNSGMIPSPGHQLQASSSFDQRMYQSQSPYN 138 Query: 315 SPNSWRSP-------------------------VAPPFYGNRGP-------------PPE 380 +P+ +R P +P YG R P PE Sbjct: 139 NPHPYRGPRGASPLPIYQGTPEAWSRLQATTIHYSPTIYGQRSPRGMASPFTGIHQGTPE 198 Query: 381 AWNRPSGTGGYGYPSNSLXXXXXXXXXXXXXXXP-------RP---------------SV 494 +WN GT Y PS + P RP S Sbjct: 199 SWNGSGGTARYNSPSTASGGGQIFSPSFGPVRSPTFGYGQGRPQWQGRSPSPGSGRGGSP 258 Query: 495 NPYPGSGSSYTFSNSPNHGTGLRGGRGR 578 P G G + +S + G G GGRGR Sbjct: 259 GPSSGRGRGRWYGSSVSPGLGCSGGRGR 286 >gb|EXC52457.1| hypothetical protein L484_000896 [Morus notabilis] Length = 346 Score = 73.9 bits (180), Expect = 3e-11 Identities = 63/195 (32%), Positives = 82/195 (42%), Gaps = 23/195 (11%) Frame = +3 Query: 54 LANPLVETSSV---PQRPTAARRFDYYTDPMAAFSGNKRTNN-----HMYQAXXXXXXXX 209 L+NPLVETS+ P++ RFD+YTDPMAAFS NKR NN + Sbjct: 37 LSNPLVETSAAAPPPEQSHGTSRFDFYTDPMAAFSANKRRNNTSDPISSHHVTPPANSGS 96 Query: 210 XXXXXQSYPSGPRNFYPIPPPSHQLQRAFEANPH--------HSP--NSWRSPVAPPFYG 359 S SGPR Y P+HQ Q + NP H P S ++ PF Sbjct: 97 PMLRSPSPFSGPR--YAGMSPAHQFQSNYSPNPRMYQPQGFGHDPISQSGELGMSRPFNM 154 Query: 360 NRGPPPEAWNRPSGTGGYGYPSNSLXXXXXXXXXXXXXXXPRPSVNP-----YPGSGSSY 524 ++G + S G Y +PSN P P + P G G ++ Sbjct: 155 HQGNMDPSIGPGSAAGYYNFPSNQ----------PRGSRFPSPRIGPTGSFFNAGQGRAH 204 Query: 525 TFSNSPNHGTGLRGG 569 ++SPN G G RGG Sbjct: 205 WHNHSPNPGLG-RGG 218 >ref|XP_002306529.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|550339341|gb|EEE93525.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 331 Score = 70.9 bits (172), Expect = 2e-10 Identities = 71/230 (30%), Positives = 96/230 (41%), Gaps = 38/230 (16%) Frame = +3 Query: 3 ANSEASHSVGTSMPTINLANPLVETSSV---PQRPTAARRFDYYTDPMAAFSGNKR--TN 167 A +E S++V TS P LA PL+ T + +A RFD+YTDP AAFS N++ Sbjct: 20 AQAETSNNVETSAPPGLLAYPLLGTPATLLAQGESSAIPRFDFYTDPSAAFSANRKGAAG 79 Query: 168 NHMYQAXXXXXXXXXXXXXQSYP-SGPRNFYPIPPPSHQLQRAF-EAN------------ 305 N + S P G RN PP ++Q+Q ++ AN Sbjct: 80 NQAARGYFTSPSNNSSVPQLSSPHPGQRNLEVTPPHAYQMQNSYPHANQMQSNHLPNQRM 139 Query: 306 -----PHHSPNSWRSP--VAPPFYGNRGPPPEAWNRPSGTGGY------GYPSNSLXXXX 446 P+H+ S+RSP + PF N+G PPE W+ P Y G S+ Sbjct: 140 YRGQGPYHNAASYRSPRGFSCPFPMNQGAPPEMWSGPGFPASYFSSTVHGGLSSPYPICQ 199 Query: 447 XXXXXXXXXXXPRPSVNPYPGS------GSSYTFSNSPNHGTGLRGGRGR 578 P P V+ Y GS G + S+S G G GGRGR Sbjct: 200 GNPGFGPVGSSPSP-VSGYGGSPAISQTGQGHWHSSS---GFGQSGGRGR 245 >ref|XP_007018241.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] gi|508723569|gb|EOY15466.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 368 Score = 69.7 bits (169), Expect = 5e-10 Identities = 75/233 (32%), Positives = 94/233 (40%), Gaps = 41/233 (17%) Frame = +3 Query: 3 ANSEASHSVGTSMPTINLANPLVETSS---VPQRPTAARRFDYYTDPMAAFSGNK---RT 164 A SE ++V T +L+NPL ETSS V + + RFDYYTDPMAAFS NK + Sbjct: 19 AQSEVPNNVATPSVPGHLSNPLSETSSTAAVQEDFCSTPRFDYYTDPMAAFSANKKRGKA 78 Query: 165 NNHMYQAXXXXXXXXXXXXXQSYPS--GPRNFYPIPPPSHQL------QRAF-EANPHHS 317 +N Q + PS GPRN+ PP H QR + + PH + Sbjct: 79 DNQSTQNYFTPPTTSGWPVARVSPSHPGPRNYDMNPPVRHMQSQYSLDQRMYHQQGPHSN 138 Query: 318 PNSWRSPVA-PPFYGNRGPPPEAWNRPSGTGGY------GYPSNSLXXXXXXXXXXXXXX 476 + RSP+ P + + G +AWN G Y G P Sbjct: 139 FAAHRSPITRSPSHMHHG-NSDAWNGSQAFGNYYSSASDGSPGGMFGTPLMHPGTTPRFW 197 Query: 477 XP----RPSVNPYP---------GSGSSYTFSN----SPNHG--TGLRGGRGR 578 P R S +P P G G F N SP HG GL GRGR Sbjct: 198 NPSNASRYSNSPTPGFSPADIPYGRGRPQQFGNYPLPSPGHGGSLGLSSGRGR 250 >ref|XP_007224174.1| hypothetical protein PRUPE_ppa016470mg [Prunus persica] gi|462421110|gb|EMJ25373.1| hypothetical protein PRUPE_ppa016470mg [Prunus persica] Length = 398 Score = 68.9 bits (167), Expect = 9e-10 Identities = 52/146 (35%), Positives = 69/146 (47%), Gaps = 6/146 (4%) Frame = +3 Query: 9 SEASHSVGTSMPTINLANPLVETSS---VPQRPTAARRFDYYTDPMAAFSGN-KRTNNHM 176 +EASHSV TS L+NPL E S+ V + P A RFD+YTDPMAAFS + KR Sbjct: 21 AEASHSVTTSAVPGYLSNPLAEDSAALPVHKEPCAPSRFDFYTDPMAAFSSDTKRVKVGN 80 Query: 177 YQAXXXXXXXXXXXXXQSYPSGPRNFYPIPPPSHQLQRAFEAN--PHHSPNSWRSPVAPP 350 A + S P + +++Q+ F N P +P +A P Sbjct: 81 QIAPSNFGRPNTGGSPMARLSSPLS----DKRMYRVQQGFCQNFGPQRNPIG----IARP 132 Query: 351 FYGNRGPPPEAWNRPSGTGGYGYPSN 428 F + G PPE WN G Y +PS+ Sbjct: 133 FPMHHGNPPEVWNGAEGAANYSFPSD 158 >ref|XP_002514052.1| conserved hypothetical protein [Ricinus communis] gi|223547138|gb|EEF48635.1| conserved hypothetical protein [Ricinus communis] Length = 412 Score = 65.1 bits (157), Expect = 1e-08 Identities = 50/146 (34%), Positives = 69/146 (47%), Gaps = 13/146 (8%) Frame = +3 Query: 3 ANSEASHSVGTSMPTINLANPLVE---TSSVPQRPTAARRFDYYTDPMAAFSGNKRTNNH 173 A + S V TS + L NPL+E T + +A RFD+YT+PMAAFS +KR + Sbjct: 97 AEAGCSSHVQTSAVSGFLTNPLLESPATFPAKEESSATPRFDFYTNPMAAFSADKRIASI 156 Query: 174 MYQAXXXXXXXXXXXXXQSYPS---GPRNFYPIPPPSHQL-------QRAFEANPHHSPN 323 A + S GP N P P +Q+ QR + P++S Sbjct: 157 NQPAPRYFIPPSNNGPMPWFSSPVPGPGNPGMTPSPVYQMQSNYLPNQRTHQQGPYNSAV 216 Query: 324 SWRSPVAPPFYGNRGPPPEAWNRPSG 401 +RSP A PF ++G P+AWN P G Sbjct: 217 PYRSPRAGPFPMHQG-TPDAWNGPGG 241 >ref|XP_006366423.1| PREDICTED: heterogeneous nuclear ribonucleoproteins A2/B1-like [Solanum tuberosum] Length = 348 Score = 63.9 bits (154), Expect = 3e-08 Identities = 61/190 (32%), Positives = 79/190 (41%), Gaps = 4/190 (2%) Frame = +3 Query: 9 SEASHSVGTSMPTINLANPLVETSSVPQRPTAARRFDYYTDPMAAFSGNKRTNNHMY--- 179 +EA G + P ++ + VE+ ++P RP RFDYYTDPMAAFS NKR+NN + Sbjct: 25 NEAVGHGGLTNPLTDVPSGNVESYAMP-RP----RFDYYTDPMAAFSANKRSNNQPHVSP 79 Query: 180 QAXXXXXXXXXXXXXQSYPSGPRNFYPIPPPSHQLQRAFEANPHHSPNSWRSPVAPPFYG 359 Q QS PR Y + Q + NP +P SP P G Sbjct: 80 QISQQCYTPPRATNPQSPICTPRGNYSV----DQRSQGVHYNPLGNPGQ-NSPFGTPQRG 134 Query: 360 NRGPPPEAWNRPSGTGGYGYPSNSLXXXXXXXXXXXXXXXPRPSVNPYPGSGSSYT-FSN 536 + P AWN GT P NS RP + GSG + + Sbjct: 135 S----PSAWNNSFGTPNNYLPPNS--SMGGNFASPGIHQGGRPGFHYGQGSGQPGSGYGG 188 Query: 537 SPNHGTGLRG 566 SP G+G RG Sbjct: 189 SPYQGSGYRG 198 >ref|XP_006413421.1| hypothetical protein EUTSA_v10025760mg [Eutrema salsugineum] gi|557114591|gb|ESQ54874.1| hypothetical protein EUTSA_v10025760mg [Eutrema salsugineum] Length = 317 Score = 62.8 bits (151), Expect = 7e-08 Identities = 61/222 (27%), Positives = 79/222 (35%), Gaps = 31/222 (13%) Frame = +3 Query: 3 ANSEASHSVGTSMPTINLANPLVETSSVPQRPTAARRFDYYTDPMAAFSGNKRTNNHMYQ 182 A + S TSM T +L+NPL ETS+ Q RFDYYTDPM+A+S KR Q Sbjct: 20 AQDDGSTGPETSMNTSHLSNPLAETSTHQQESYETPRFDYYTDPMSAYSSFKRNKTPKQQ 79 Query: 183 AXXXXXXXXXXXXXQSYPSGP----------RNFYPIPPPS------HQLQRAFEANPHH 314 PS P N P H + R + Sbjct: 80 HISSPSCQISPPVPPFPPSVPGSLGCDYQAHANHAGFQGPQYEGDNLHTVPRGMAPSHRG 139 Query: 315 SPNSWRSPVAPPFYGNRGPP---PEAW-----------NRPSGTGGYGYPSNSLXXXXXX 452 SP +W + PP + GPP P + NR +G G Y Y + Sbjct: 140 SPVAWNNNFRPPPVNHLGPPQWVPRPFPFSQESPDMGNNRFNGRGSYNYTA--------- 190 Query: 453 XXXXXXXXXPRPSVNPYPGSGSSYTFSNSPNHGTGL-RGGRG 575 P Y S++ + P+ G G RGGRG Sbjct: 191 -----------PQYPNYGRQNSNWVGNTYPSSGRGRGRGGRG 221 >ref|XP_004252719.1| PREDICTED: uncharacterized protein LOC101249715 [Solanum lycopersicum] Length = 348 Score = 61.2 bits (147), Expect = 2e-07 Identities = 59/188 (31%), Positives = 78/188 (41%), Gaps = 2/188 (1%) Frame = +3 Query: 9 SEASHSVGTSMPTINLANPLVETSSVPQRPTAARRFDYYTDPMAAFSGNKRTNNHMYQAX 188 +EA G + P ++ + VE+ ++P RP RFDYYTDPMAAFS NKR+NN + + Sbjct: 25 NEAVGYGGLTNPLTDVPSGNVESYAMP-RP----RFDYYTDPMAAFSANKRSNNQPHVSP 79 Query: 189 XXXXXXXXXXXXQSYP-SGPRNFYPIPPPSHQLQRAFEANPHHSPNSWRSPVAPPFYGNR 365 P PR Y + S + F NP +P SP P G+ Sbjct: 80 QVSQQCYTRATNPQSPICTPRGNYSVDQRSQGVHHTF--NPLGNPGQ-NSPFGIPQRGS- 135 Query: 366 GPPPEAWNRPSGTGGYGYPSNSLXXXXXXXXXXXXXXXPRPSVNPYPGSGSSYT-FSNSP 542 P AWN T P NS RP + GSG + + SP Sbjct: 136 ---PSAWNNSFDTPKNYLPPNS--SMGGNFASPGIQRGGRPGFHYGQGSGQPGSGYGGSP 190 Query: 543 NHGTGLRG 566 G+G RG Sbjct: 191 YQGSGYRG 198 >ref|XP_004301384.1| PREDICTED: uncharacterized protein LOC101291633 [Fragaria vesca subsp. vesca] Length = 425 Score = 60.5 bits (145), Expect = 3e-07 Identities = 56/201 (27%), Positives = 78/201 (38%), Gaps = 12/201 (5%) Frame = +3 Query: 9 SEASHSVGTSMPTINLANPLVETSSVPQRPTAARRFDYYTDPMAAFSGNK---RTNNHMY 179 +EASH+ T+ L+NPL + + + P A RF +YTDPMA FS + +T +H Sbjct: 18 AEASHNDTTADVPGYLSNPLADGNVAQEEPCAPSRFGFYTDPMAGFSADTKRCKTGDHFA 77 Query: 180 QAXXXXXXXXXXXXXQSYP--SGPRNFYPIPPPSHQLQRAFEANPHHSPNSW---RSPVA 344 + P SG +PPP H Q + + ++ RSP A Sbjct: 78 SNSFKHSDAGGLPVPRLPPPLSGRPMNPEMPPPPHLFQSNYSPDQRMYQQNFAPQRSPAA 137 Query: 345 --PPFYGNRGPPPEAWNRPSGTGGYGYPSNSLXXXXXXXXXXXXXXXP--RPSVNPYPGS 512 PF + G PE W Y + S+ P RP +P G Sbjct: 138 MVRPFAMHHGNLPELWTGAECPASYNFSSDPSIESRSTGPRFRPPGSPGYRPPGSPGFGP 197 Query: 513 GSSYTFSNSPNHGTGLRGGRG 575 S F S + G GL G G Sbjct: 198 SGSPGFGPSGSPGFGLPGSPG 218 >ref|XP_002869714.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. lyrata] gi|297315550|gb|EFH45973.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. lyrata] Length = 320 Score = 60.5 bits (145), Expect = 3e-07 Identities = 58/214 (27%), Positives = 81/214 (37%), Gaps = 23/214 (10%) Frame = +3 Query: 6 NSEASHSVGTSMPTINLANPLVETSSVPQRPTAARRFDYYTDPMAAFSGNKRTNNHMYQA 185 N +++ TSM T +L+NPL ETS+ Q RFDYYTDPM+A+S K+ Q Sbjct: 24 NDDSTTDPETSMNTGHLSNPLAETSTQHQDSFETSRFDYYTDPMSAYSSFKKIKTPKQQ- 82 Query: 186 XXXXXXXXXXXXXQSYPSGPRNFYPIPPPSHQLQRAFEANPHH----------------- 314 Q+ P F P PP L ++A+ +H Sbjct: 83 ------YISSPSHQASSPVPPQFPPSVPPG-SLGSEYQAHTNHGGFQAAHYEPRGMSHLS 135 Query: 315 -----SPNSWRSPVAPPFYGNRGPPPEAW-NRPSGTGGYGYPSNSLXXXXXXXXXXXXXX 476 SP SW + PP + GPP W RP + + Sbjct: 136 PPYRGSPASWNNNFRPPPVNHPGPP--QWVPRP-----FPFSQEIPNMGNNRFGDRGSYN 188 Query: 477 XPRPSVNPYPGSGSSYTFSNSPNHGTGLRGGRGR 578 P + Y +++ + PN G G GGRGR Sbjct: 189 NTAPHFSNYGRQNANWVGNTYPNSGRG--GGRGR 220 >ref|XP_007018242.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] gi|508723570|gb|EOY15467.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] Length = 345 Score = 60.1 bits (144), Expect = 4e-07 Identities = 71/228 (31%), Positives = 88/228 (38%), Gaps = 36/228 (15%) Frame = +3 Query: 3 ANSEASHSVGTSMPTINLANPLVETSS---VPQRPTAARRFDYYTDPMAAFSGNKRTNNH 173 A SE ++V T +L+NPL ETSS V + + RFDYYTDPMAA SG Sbjct: 19 AQSEVPNNVATPSVPGHLSNPLSETSSTAAVQEDFCSTPRFDYYTDPMAATSG------- 71 Query: 174 MYQAXXXXXXXXXXXXXQSYPSGPRNFYPIPPPSHQL------QRAF-EANPHHSPNSWR 332 S+P GPRN+ PP H QR + + PH + + R Sbjct: 72 ----------WPVARVSPSHP-GPRNYDMNPPVRHMQSQYSLDQRMYHQQGPHSNFAAHR 120 Query: 333 SPVA-PPFYGNRGPPPEAWNRPSGTGGY------GYPSNSLXXXXXXXXXXXXXXXP--- 482 SP+ P + + G +AWN G Y G P P Sbjct: 121 SPITRSPSHMHHG-NSDAWNGSQAFGNYYSSASDGSPGGMFGTPLMHPGTTPRFWNPSNA 179 Query: 483 -RPSVNPYP---------GSGSSYTFSN----SPNHG--TGLRGGRGR 578 R S +P P G G F N SP HG GL GRGR Sbjct: 180 SRYSNSPTPGFSPADIPYGRGRPQQFGNYPLPSPGHGGSLGLSSGRGR 227 >gb|AAK76684.1| unknown protein [Arabidopsis thaliana] Length = 319 Score = 58.9 bits (141), Expect = 1e-06 Identities = 58/223 (26%), Positives = 81/223 (36%), Gaps = 33/223 (14%) Frame = +3 Query: 6 NSEASHSVGTSMPTINLANPLVETSSVPQRPTAARRFDYYTDPMAAFSGNKRTNNHMYQA 185 + +A+ TSM T +L+NPL ETS+ Q +RFDYYTDPMAA+S K+ Q Sbjct: 23 DDDATTGTETSMSTGHLSNPLAETSNHQQDSFETQRFDYYTDPMAAYSSFKKNKTPKQQ- 81 Query: 186 XXXXXXXXXXXXXQSYPSGPRNFYPIPPPSHQLQRAFEANPHHS---------------- 317 Q P F P PP L ++A +H Sbjct: 82 ------YISSPSHQGSSPVPPQFPPSVPPG-SLCSEYQAQTNHGGFHAAHYEPRGMAHLS 134 Query: 318 ------PNSWRSPVAPPFYGNRGPP-------PEAWNRP----SGTGGYGYPSNSLXXXX 446 P W + PP + GPP P + P + GG G +N+ Sbjct: 135 PSHRGPPAGWNNNFRPPPVNHSGPPQWVPRPFPFSQEMPNMGNNRFGGRGSYNNT----- 189 Query: 447 XXXXXXXXXXXPRPSVNPYPGSGSSYTFSNSPNHGTGLRGGRG 575 P + Y +++ + PN G G GRG Sbjct: 190 ------------PPQFSNYGRQNANWGGNTHPNSGRGRSRGRG 220 >ref|NP_567704.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|334186879|ref|NP_001190822.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|4220540|emb|CAA23013.1| hypothetical protein [Arabidopsis thaliana] gi|7269300|emb|CAB79360.1| hypothetical protein [Arabidopsis thaliana] gi|23296316|gb|AAN13039.1| unknown protein [Arabidopsis thaliana] gi|332659515|gb|AEE84915.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|332659517|gb|AEE84917.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 319 Score = 58.5 bits (140), Expect = 1e-06 Identities = 58/223 (26%), Positives = 81/223 (36%), Gaps = 33/223 (14%) Frame = +3 Query: 6 NSEASHSVGTSMPTINLANPLVETSSVPQRPTAARRFDYYTDPMAAFSGNKRTNNHMYQA 185 + +A+ TSM T +L+NPL ETS+ Q +RFDYYTDPMAA+S K+ Q Sbjct: 23 DDDATTGTETSMSTGHLSNPLAETSNHQQDSFETQRFDYYTDPMAAYSSFKKNKTPKQQ- 81 Query: 186 XXXXXXXXXXXXXQSYPSGPRNFYPIPPPSHQLQRAFEANPHHS---------------- 317 Q P F P PP L ++A +H Sbjct: 82 ------YISSPSHQGSSPVPPQFPPSVPPG-SLCSEYQAQTNHGGFHAAHYEPRGMAHLS 134 Query: 318 ------PNSWRSPVAPPFYGNRGPP-------PEAWNRP----SGTGGYGYPSNSLXXXX 446 P W + PP + GPP P + P + GG G +N+ Sbjct: 135 PSHRGPPAGWNNNFRPPPVNHSGPPQWVPRPFPFSQEMPNMGNNRFGGRGSYNNT----- 189 Query: 447 XXXXXXXXXXXPRPSVNPYPGSGSSYTFSNSPNHGTGLRGGRG 575 P + Y +++ + PN G G GRG Sbjct: 190 ------------PPQFSNYGRQNANWGGNTYPNSGRGRSRGRG 220 >ref|XP_006596379.1| PREDICTED: serine/threonine-protein phosphatase 1 regulatory subunit 10-like isoform X4 [Glycine max] Length = 360 Score = 57.0 bits (136), Expect = 4e-06 Identities = 66/233 (28%), Positives = 82/233 (35%), Gaps = 44/233 (18%) Frame = +3 Query: 9 SEASHSVGTSMPTINLANPLVETSSV---PQRPTAARRFDYYTDPMAAFSGNKR------ 161 +E S V S+ L+NPL+E S + AA RFDYYTDPM+AFS N+R Sbjct: 21 AEVSGGVEGSVVPGFLSNPLMEAPSTMPSQDKSYAAPRFDYYTDPMSAFSSNRRNIASIQ 80 Query: 162 --------------------------TNNHMYQAXXXXXXXXXXXXXQSYPSGPRNF-YP 260 TN S P GP ++ +P Sbjct: 81 AALDNFPPSKFGGSPMAQYSSPHPESTNPQRTPHPIQASPAAYRKPIWSGPGGPAHYNFP 140 Query: 261 IPPPSHQLQRAFEANPHHSP--NSWRSPVAPPFYG------NRGPPPEAWNRPSGTGGYG 416 I P S + P P NS + PP Y G P P+ + GYG Sbjct: 141 IHPSSGGTYPSPRFEPSGGPLYNSGQGIAHPPSYSPNPPYPGYGDSPRPSYNPNPSPGYG 200 Query: 417 YPSNSLXXXXXXXXXXXXXXXPRPSVNPYPGSGSSYTFSNSPNHGTGLRGGRG 575 NS PRPS P P G + NSP+ G G GRG Sbjct: 201 ---NSPRPSYNPNPSPGYGNSPRPSYRPNPSPG----YRNSPSPGQG--RGRG 244 >ref|XP_006386460.1| hypothetical protein POPTR_0002s11210g [Populus trichocarpa] gi|550344772|gb|ERP64257.1| hypothetical protein POPTR_0002s11210g [Populus trichocarpa] Length = 254 Score = 57.0 bits (136), Expect = 4e-06 Identities = 49/156 (31%), Positives = 68/156 (43%), Gaps = 26/156 (16%) Frame = +3 Query: 24 SVGTSMPTINLANPLVE---TSSVPQRPTAARRFDYYTDPMAAFSGNKR---TNNHMYQA 185 +V TS LANPL+E T + +A RFD+YTDP AAFS +++ T N + + Sbjct: 28 NVETSAVPGLLANPLLENPATQPALEELSATPRFDFYTDPSAAFSSDRKRTATANQVARG 87 Query: 186 XXXXXXXXXXXXXQSYPSGPRNFYPIP----------PPSHQLQRAFEAN--------PH 311 S G RN P P++Q+Q + N P+ Sbjct: 88 FRPPNNISSMPQFSSPHPGQRNPEVTPSSAYQMQNNYSPANQMQSNYSPNQRMYPGQGPY 147 Query: 312 HSPNSWRSP--VAPPFYGNRGPPPEAWNRPSGTGGY 413 H+ +R+P A PF N+G PE WN P G Y Sbjct: 148 HNAAFYRTPSNFARPFTMNQG-TPEMWNGPGGPASY 182 >ref|NP_001046932.1| Os02g0510300 [Oryza sativa Japonica Group] gi|48716468|dbj|BAD23074.1| unknown protein [Oryza sativa Japonica Group] gi|48716976|dbj|BAD23669.1| unknown protein [Oryza sativa Japonica Group] gi|113536463|dbj|BAF08846.1| Os02g0510300 [Oryza sativa Japonica Group] gi|215695493|dbj|BAG90684.1| unnamed protein product [Oryza sativa Japonica Group] Length = 320 Score = 57.0 bits (136), Expect = 4e-06 Identities = 52/185 (28%), Positives = 72/185 (38%), Gaps = 5/185 (2%) Frame = +3 Query: 39 MPTINLANPLVETSSVPQRPTAARRFDYYTDPMAAFSGNKRTNNHMYQAXXXXXXXXXXX 218 +P +L + T++ RP RFDYYT+P AAF+ + ++ A Sbjct: 46 LPVRDLMDASATTTAAAPRPPP--RFDYYTNPAAAFASSSAASHKRKVAEPPPPGSGNYG 103 Query: 219 XXQSYPSGPRNFYPIPPPSHQLQRAFEANPHHSPNSWRSPVA--PPFYGNRGPPPEA--- 383 YP P + PPP H R +P SP WRSP+ P G RGPPP A Sbjct: 104 --SGYPP-PHQHHMAPPPIHTPSRLSHDSPGGSP--WRSPMQFQAPMSGYRGPPPGAPPP 158 Query: 384 WNRPSGTGGYGYPSNSLXXXXXXXXXXXXXXXPRPSVNPYPGSGSSYTFSNSPNHGTGLR 563 W+ SG P ++ P + P+P S ++ G Sbjct: 159 WSPHSGVPPPWNPHSA---------------PPSQGLYPHPPSYGPRNYNPGQGGGRMNY 203 Query: 564 GGRGR 578 G RGR Sbjct: 204 GPRGR 208 >ref|XP_006575356.1| PREDICTED: vegetative cell wall protein gp1-like [Glycine max] Length = 343 Score = 56.6 bits (135), Expect = 5e-06 Identities = 56/174 (32%), Positives = 74/174 (42%), Gaps = 6/174 (3%) Frame = +3 Query: 54 LANPLVET-SSVPQRPT--AARRFDYYTDPMAAFSGNKRTNNHMYQAXXXXXXXXXXXXX 224 L+NPL+E S++P R T AA RFDYYTDPM+AFS + NN QA Sbjct: 36 LSNPLIEAPSTMPSRDTSYAAPRFDYYTDPMSAFSSKR--NNASTQA------APDNFPP 87 Query: 225 QSYPSGPRNFYPIPPPSHQLQRAFEANPHHSPNSWRSPVAPPFYGNRGPPPEAWNRPSGT 404 + P Y P P + + SP ++R+PV + G GP + + Sbjct: 88 SKFGGPPMAQYSSPHPESKNPQMTPHPIQASPAAYRNPV---WSGPGGPAHYNFPLHPSS 144 Query: 405 GGYGYPSNSLXXXXXXXXXXXXXXXPRPSVN---PYPGSGSSYTFSNSPNHGTG 557 GG YPS +PS + PYPG +S S SPN G Sbjct: 145 GG-TYPSPRFEPSGGPLYNTAQGIAHQPSYSPNPPYPGYVNSPRPSYSPNPSPG 197