BLASTX nr result
ID: Cnidium21_contig00042901
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00042901
         (511 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

emb|CAN66820.1| hypothetical protein VITISV_003496 [Vitis vinifera]   115   3e-24
ref|XP_002321853.1| predicted protein [Populus trichocarpa] gi|2...   113   1e-23
ref|XP_002510900.1| hypothetical protein RCOM_1498790 [Ricinus c...   107   1e-21
gb|ACU21406.1| unknown [Glycine max]                                   97   2e-18
ref|NP_199981.2| hydroxyproline-rich glycoprotein family protein...    84   2e-14

>emb|CAN66820.1| hypothetical protein VITISV_003496 [Vitis vinifera]
          Length = 341

 Score = 115 bits (289), Expect = 3e-24
 Identities = 71/173 (41%), Positives = 94/173 (54%), Gaps = 30/173 (17%)
 Frame = -2

Query: 462 SNRHRIKQPVSVPFLWEVRPGLPKKDWKPNTSITQ------------------------A 355
           SNR +I+QP SVPFLWE +PG+PKKDWKP  +
Sbjct: 8   SNRKQIRQPPSVPFLWEEKPGIPKKDWKPEVTAVNPPPPPPPPPPPPPPPPPPPPPPPPP 67

Query: 354 DPFALPPVKLIASVPFKWEEMPGKPLPYF---PQAMPKAVXXXXXXXXPSMLGDSRSPAP 184
            P   PP+KLIAS+PF WEE PGKPLP+F   P      +         S L D+
Sbjct: 68  PPPPPPPIKLIASIPFTWEEKPGKPLPFFSGTPHDDSLLLFPPKKLVCCSSLSDAD---- 123

Query: 183 TVYSQNYCDNSDDE---MYDSYLDAWGFEFDEESISSAPSLLANRMLPMLAIT 34
              S++Y D+ DDE   +++S  +A+GFE D +S SSAPSLLANR++  +AI+
Sbjct: 124 ---SKDYEDDGDDEHDGIFESDFEAFGFETD-DSFSSAPSLLANRLMSTVAIS 172

>ref|XP_002321853.1| predicted protein [Populus trichocarpa]
 gi|222868849|gb|EEF05980.1| predicted protein [Populus trichocarpa]
          Length = 333

 Score = 113 bits (283), Expect = 1e-23
 Identities = 70/159 (44%), Positives = 86/159 (54%), Gaps = 6/159 (3%)
 Frame = -2

Query: 462 SNRHRIKQPVSVPFLWEVRPGLPKKDWKPNTSITQADPFALPPVKLIASVPFKWEEMPGK 283
           S +  I+QP SVPFLWEVRPG+ K+DWKP  S     P  LPPVKLIASVPF WEE PGK
Sbjct: 29  SRKKHIRQPPSVPFLWEVRPGVAKRDWKPEVS--SVTPVQLPPVKLIASVPFNWEEKPGK 86

Query: 282 PLPYFPQA------MPKAVXXXXXXXXPSMLGDSRSPAPTVYSQNYCDNSDDEMYDSYLD 121
           PL  F Q+       P+A             GD         S       +  M++S L+
Sbjct: 87  PLSCFSQSPESAFITPQANLLALPWHVTCSQGDDNHKQEDGDSGEENFGDEQVMFNSDLE 146

Query: 120 AWGFEFDEESISSAPSLLANRMLPMLAITNGDSGQLQAP 4
           ++ FE D ES SSA SLLAN M+  +AI+     Q  +P
Sbjct: 147 SFSFETD-ESFSSAQSLLANCMVSSVAISTAVPVQTTSP 184

>ref|XP_002510900.1| hypothetical protein RCOM_1498790 [Ricinus communis]
 gi|223550015|gb|EEF51502.1| hypothetical protein RCOM_1498790 [Ricinus communis]
          Length = 278

 Score = 107 bits (266), Expect = 1e-21
 Identities = 72/166 (43%), Positives = 91/166 (54%), Gaps = 20/166 (12%)
 Frame = -2

Query: 468 QGSNRHRIKQPVSVPFLWEVRPGLPKKDWKPNTS--ITQADPFALPPVKLIASVPFKWEE 295
           + S R  I+QP  VPFLWE RPG+ KKDWKP  S   T A P   PPVKLIASVPF WEE
Sbjct: 8   EASKRKHIRQPPFVPFLWEERPGIAKKDWKPVVSSVTTLALP---PPVKLIASVPFNWEE 64

Query: 294 MPGKPLPYF---PQAMPKAVXXXXXXXXPSMLGDSRSPAPTVYSQNYCD-----NSDDEM 139
            PGKPLP F   P   P A               +  P+P +Y Q  CD     N +
Sbjct: 65  KPGKPLPCFSQPPMESPPATL-------------NSLPSPPMYYQR-CDDCEFNNENRAG 110

Query: 138 YDSY---------LDAWGFEFD-EESISSAPSLLANRMLPMLAITN 31
           +D+Y         LD   F F+ ++S+SSAPSLLAN ++  +A+++
Sbjct: 111 HDNYGEKEEGIFDLDIESFSFETDDSLSSAPSLLANCLVSSVAVSD 156

>gb|ACU21406.1| unknown [Glycine max]
          Length = 222

 Score = 96.7 bits (239), Expect = 2e-18
 Identities = 58/152 (38%), Positives = 80/152 (52%), Gaps = 8/152 (5%)
 Frame = -2

Query: 465 GSNRHRIKQPVSVPFLWEVRPGLPKKDWKPNTSITQADPFALPPVKLIASVPFKWEEMPG 286
           G  +H +++P SVPF+WEV+PG+PKKDWKP             P+KLIASVPF WEE PG
Sbjct: 5   GKKKH-VREPPSVPFIWEVKPGIPKKDWKPEPE----PEVPKTPLKLIASVPFVWEEKPG 59

Query: 285 KPLPYF-------PQAMPKAVXXXXXXXXPSMLGDSRSPAPTVYSQNYCDNSDDEMYDSY 127
           KPLP F       P+ +   V            G           +    +SD+E   +
Sbjct: 60  KPLPNFSVDHPVPPKPLLIHVASSSAFSFACNFGHDHDK-----DKGSLSSSDNESITTL 114

Query: 126 -LDAWGFEFDEESISSAPSLLANRMLPMLAIT 34
            L+A+ F+ DE  +SS PSLLAN ++P   ++
Sbjct: 115 DLEAFSFDEDESFVSSVPSLLANCLVPSAKVS 146

>ref|NP_199981.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana]
 gi|332008731|gb|AED96114.1| hydroxyproline-rich glycoprotein family protein
 [Arabidopsis thaliana]
          Length = 343

 Score = 83.6 bits (205), Expect = 2e-14
 Identities = 37/66 (56%), Positives = 47/66 (71%), Gaps = 5/66 (7%)
 Frame = -2

Query: 456 RHRIKQPVSVPFLWEVRPGLPKKDWKPNTSITQADPFALP-----PVKLIASVPFKWEEM 292
           R +++QP SVPF+WE RPG PKK+W+P+ +     P  LP     PVKL+ SVPF+WEE
Sbjct: 13  RKQLRQPPSVPFIWEERPGFPKKNWQPSLATFVPSPPPLPPPIPVPVKLVTSVPFRWEET 72

Query: 291 PGKPLP 274
           PGKPLP
Sbjct: 73  PGKPLP 78