BLASTX nr result
ID: Atropa21_contig00023749
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00023749
         (616 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu...   164   1e-38
ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247...   163   3e-38
ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [...    69   1e-09
gb|EOY32160.1| Hydroxyproline-rich glycoprotein family protein, ...    66   9e-09
gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, ...    66   9e-09
ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215...    62   2e-07
ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253...    61   2e-07
ref|XP_006442253.1| hypothetical protein CICLE_v10019766mg [Citr...    61   3e-07

>ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum]
          Length = 480

 Score = 164 bits (416), Expect = 1e-38
 Identities = 103/198 (52%), Positives = 111/198 (56%), Gaps = 7/198 (3%)
 Frame = -3

Query: 575 MHRSLSKLSHCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTDFPNFGFSPGKPESD 396
           M RSLSKLSHC                                  DFPNFGFSPGK  S+
Sbjct: 1   MRRSLSKLSHCSIGRPTTASPAYAFSTFSGGGGGGRGRGRGS---DFPNFGFSPGKSASE 57

Query: 395 DSKPES----VPPGIGHGRGRGKXXXXXXXXXXXXXI-DNPNPNMNAGRGRGGIGSFSXX 231
           DSKPES     P G GHGRGRGK             + DNPNP   AGRGRGGIG FS
Sbjct: 58  DSKPESSTPTTPSGTGHGRGRGKPLPSSPIVPSFYSVVDNPNPP--AGRGRGGIGPFSPP 115

Query: 230 XXXXXXXXXPTP--RKPIFFAKEEETADSNTNSSDALKHREDPNLPSSIISVLCGAGRGK 57
                         RKPIFFAKEEETADSN++SSDA   R+D NL SS+ISVL GAGRGK
Sbjct: 116 QPQQQQQQQQQQPLRKPIFFAKEEETADSNSSSSDAPTPRDDSNLSSSVISVLTGAGRGK 175

Query: 56  PMNTPSHVSKKPKEENRH 3
           P+ T S VS+KPKEENRH
Sbjct: 176 PLQTASPVSEKPKEENRH 193

>ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247662 isoform 1 [Solanum lycopersicum]
 gi|460368563|ref|XP_004230135.1| PREDICTED: uncharacterized protein LOC101247662 isoform 2 [Solanum lycopersicum]
          Length = 473

 Score = 163 bits (413), Expect = 3e-38
 Identities = 101/195 (51%), Positives = 110/195 (56%), Gaps = 4/195 (2%)
 Frame = -3

Query: 575 MHRSLSKLSHCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTDFPNFGFSPGKPESD 396
           M RSLSKLSHC                                  D PNFGFSPGK  S+
Sbjct: 1   MRRSLSKLSHCSIGRPITASSGSAFSTFSGGGGGGRGRGRGS---DSPNFGFSPGKSASE 57

Query: 395 DSKPES----VPPGIGHGRGRGKXXXXXXXXXXXXXIDNPNPNMNAGRGRGGIGSFSXXX 228
           DSKPES     P G GHGRGRGK               + NPN  AGRGRGGIG FS
Sbjct: 58  DSKPESSTPATPSGTGHGRGRGKPLPSSPIVPSFHSFVD-NPNTPAGRGRGGIGPFSPPP 116

Query: 227 XXXXXXXXPTPRKPIFFAKEEETADSNTNSSDALKHREDPNLPSSIISVLCGAGRGKPMN 48
                   P  RKPIFFAKEEET DSN++SS+A K R+D NLPSS+ISVL GAGRGKP+
Sbjct: 117 QPQQQQQQPL-RKPIFFAKEEETTDSNSSSSNAPKPRDDSNLPSSVISVLTGAGRGKPLQ 175

Query: 47  TPSHVSKKPKEENRH 3
           T S VS+KPKEENRH
Sbjct: 176 TASSVSEKPKEENRH 190

>ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [Glycine max]
 gi|571476117|ref|XP_006586864.1| PREDICTED: la-related protein 1 isoform X2 [Glycine max]
          Length = 481

 Score = 68.6 bits (166), Expect = 1e-09
 Identities = 52/153 (33%), Positives = 70/153 (45%), Gaps = 8/153 (5%)
 Frame = -3

Query: 437 FPNFGFSPGKPESDDSKPES----VPPGIGHGRGRGKXXXXXXXXXXXXXIDNPNPNMNA 270
           F N   +P +P S +SK ++    +PPG G G GRGK             I + N    A
Sbjct: 50  FNNNERAPVEPNSSESKSDTTEPPIPPGSGLGHGRGKPMPPSGLPSFSSFISSIN-QPPA 108

Query: 269 GRGRGGIGSFSXXXXXXXXXXXPTPRKPIFFAKEEETADSNTNS----SDALKHREDPNL 102
           GRGRG     +             P+KPIFF +E+  + + +N       ++ H  D  L
Sbjct: 109 GRGRG----TAPHPQHDLQPPDSGPKKPIFFKREDSVSPTASNDFLPPKRSVDHAHDNKL 164

Query: 101 PSSIISVLCGAGRGKPMNTPSHVSKKPKEENRH 3
           P SI  VL G GRGK M  P  +  +  EENRH
Sbjct: 165 PGSIPGVLSGLGRGKSMKQPD-LETQVTEENRH 196

>gb|EOY32160.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao]
          Length = 403

 Score = 65.9 bits (159), Expect = 9e-09
 Identities = 56/158 (35%), Positives = 75/158 (47%), Gaps = 13/158 (8%)
 Frame = -3

Query: 437 FPNFGFSPGKPESDDSK---PESVPPGIGHGRGRGKXXXXXXXXXXXXXIDNPNPN---- 279
           F +F   PGK  S DS     ES P G+GHGRGRG                +P P+
Sbjct: 55  FIDFTPPPGKSGSGDSNRDSAESPPAGVGHGRGRG-----------GPLSSDPIPHPFSS 103

Query: 278 --MNAGRGRGGIGSFSXXXXXXXXXXXPTPRKPIFFAK--EEETADSNTNSSDALKHRED 111
                G GRG + S S              ++PIF  K  E+ET  S   +++ ++  E
Sbjct: 104 FVSQTGSGRGRVTSES---VPPPPPPPAQAKQPIFIKKKDEDETESSAKAAAEPIQSSE- 159

Query: 110 PNLPSSI--ISVLCGAGRGKPMNTPSHVSKKPKEENRH 3
           P  P +I  +SVL GAGRGKP+  P   S++ +EENRH
Sbjct: 160 PIFPPNILPVSVLSGAGRGKPVKQPEPASRR-QEENRH 196

>gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao]
          Length = 474

 Score = 65.9 bits (159), Expect = 9e-09
 Identities = 56/158 (35%), Positives = 75/158 (47%), Gaps = 13/158 (8%)
 Frame = -3

Query: 437 FPNFGFSPGKPESDDSK---PESVPPGIGHGRGRGKXXXXXXXXXXXXXIDNPNPN---- 279
           F +F   PGK  S DS     ES P G+GHGRGRG                +P P+
Sbjct: 55  FIDFTPPPGKSGSGDSNRDSAESPPAGVGHGRGRG-----------GPLSSDPIPHPFSS 103

Query: 278 --MNAGRGRGGIGSFSXXXXXXXXXXXPTPRKPIFFAK--EEETADSNTNSSDALKHRED 111
                G GRG + S S              ++PIF  K  E+ET  S   +++ ++  E
Sbjct: 104 FVSQTGSGRGRVTSES---VPPPPPPPAQAKQPIFIKKKDEDETESSAKAAAEPIQSSE- 159

Query: 110 PNLPSSI--ISVLCGAGRGKPMNTPSHVSKKPKEENRH 3
           P  P +I  +SVL GAGRGKP+  P   S++ +EENRH
Sbjct: 160 PIFPPNILPVSVLSGAGRGKPVKQPEPASRR-QEENRH 196

>ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215545 [Cucumis sativus]
 gi|449502143|ref|XP_004161555.1| PREDICTED: uncharacterized protein LOC101224016 [Cucumis sativus]
          Length = 478

 Score = 61.6 bits (148), Expect = 2e-07
 Identities = 53/156 (33%), Positives = 72/156 (46%), Gaps = 11/156 (7%)
 Frame = -3

Query: 437 FPN--FGFSPGKPE---SDDSKPESVP----PGIGHGRGRGKXXXXXXXXXXXXXIDNPN 285
           FP+  F F+P  P    S+ SK E +     PG+GHGRG+                 +
Sbjct: 46  FPSGPFDFTPPVPNQEHSNASKQEPIDSRPTPGLGHGRGK-PTPSSPLRPSFSSFSPSVR 104

Query: 284 PNMNAGRGRGGIGSFSXXXXXXXXXXXPTPRKPIFFAKEEETADSNTNSSDALKHRE--D 111
           P+ + GRGRG     +             P+KP+FF+K     DS  ++S    HR   +
Sbjct: 105 PS-SVGRGRGD----ASPSIRSPPEPDSEPKKPVFFSKNN-AGDSAASTSLGGLHRVSGE 158

Query: 110 PNLPSSIISVLCGAGRGKPMNTPSHVSKKPKEENRH 3
            NLP S+ S   G GRGKPM  P     +PK+ENRH
Sbjct: 159 RNLPESLHSEFSGVGRGKPMKQPV-PEDQPKQENRH 193

>ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253300 [Vitis vinifera]
          Length = 482

 Score = 61.2 bits (147), Expect = 2e-07
 Identities = 53/154 (34%), Positives = 70/154 (45%), Gaps = 12/154 (7%)
 Frame = -3

Query: 428 FGFSPGKPE--------SDDSKPESVPPGIGHGRGRGKXXXXXXXXXXXXXIDNPNPNMN 273
           F F+ G PE        + +S     P G+GHGRG+                 +   +
Sbjct: 45  FDFASGAPEKTEPTADPNSESSESPFPLGLGHGRGKPPSQPSAPTLPSF----SSFASTG 100

Query: 272 AGRGRGGIGSFSXXXXXXXXXXXPTPRKPIFFAKEEETADSNTNSSDAL--KHREDPNLP 99
            GRGRG + +               P+KPIFF+K E+ ADS       L     E+ NLP
Sbjct: 101 IGRGRGRL-TAHPTDSVPQQSPDFAPKKPIFFSK-EDAADSAPKPQSQLGTTPPEENNLP 158

Query: 98  SSIISVLC-GAGRGKPM-NTPSHVSKKPKEENRH 3
            SI+S L  GAGRG+P+  TP+     PKEENRH
Sbjct: 159 VSILSALSGGAGRGQPLKQTPA----PPKEENRH 188

>ref|XP_006442253.1| hypothetical protein CICLE_v10019766mg [Citrus clementina]
 gi|557544515|gb|ESR55493.1| hypothetical protein CICLE_v10019766mg [Citrus clementina]
          Length = 511

 Score = 60.8 bits (146), Expect = 3e-07
 Identities = 52/152 (34%), Positives = 62/152 (40%), Gaps = 13/152 (8%)
 Frame = -3

Query: 419 SPGKPESDDSKPESVP----PGIGHGRGRGKXXXXXXXXXXXXXIDNPNPNMNAGRGRGG 252
           +PG+P S+ SKP+S P    P  G G GRG+                      AGRGR
Sbjct: 104 APGQPASE-SKPDSPPQPQAPPSGSGHGRGQPSAAPSPSISSFSSFLTAVKSGAGRGRVS 162

Query: 251 IGSFSXXXXXXXXXXXPTPRKPIFFAKEEETADSNTNSSDALKHREDPNLPSSIISVLCG 72
             S               P KP  F   E   DS   S        +PNLPSSIIS L G
Sbjct: 163 FAS----DPNESPRPDAQPAKPRTFTPNESATDSTQPS--------EPNLPSSIISTLPG 210

Query: 71  AGRGKPMNTPSHVSKK---------PKEENRH 3
           AGRGK + T     ++         P+EENRH
Sbjct: 211 AGRGKTVVTQQQQQQQHQRQQPGPPPQEENRH 242