BLASTX nr result
ID: Dioscorea21_contig00037141
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00037141
         (324 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003547059.1| PREDICTED: uncharacterized protein LOC100805...   127   7e-28
ref|XP_003532977.1| PREDICTED: uncharacterized protein LOC100791...   127   7e-28
gb|EIE25862.1| hypothetical protein COCSUDRAFT_40112 [Coccomyxa ...   113   2e-23
gb|EFN57090.1| hypothetical protein CHLNCDRAFT_143874 [Chlorella...   108   3e-22
ref|XP_001689575.1| separase, cell cycle protease [Chlamydomonas...   108   5e-22

>ref|XP_003547059.1| PREDICTED: uncharacterized protein LOC100805306
           [Glycine max]
          Length = 2185

 Score =  127 bits (320), Expect = 7e-28
 Identities = 66/153 (43%), Positives = 83/153 (54%), Gaps = 45/153 (29%)
 Frame = +1

Query: 1    IENLDRCAATLLMGCSSGSLVIKGQYTPEGPPLSYLLAGCPAIIANLWDVLSNDINRYCK 180
            I+ LD+CAATLLMGCSSGSL + GQY P+G PLSYLLAG PAI+ NLW+V   DI+R+ K
Sbjct: 2008 IQKLDKCAATLLMGCSSGSLTLPGQYAPQGIPLSYLLAGSPAIVGNLWEVTDKDIDRFGK 2067

Query: 181  VLLDAWLRDASQ---------------------------------------------TDY 225
             +LDAWL++ S                                               +
Sbjct: 2068 AMLDAWLKERSDMPTECLQCNLLSEEFEAMNLKGCKGRAKRKAPRKKLLELAESESPKNC 2127

Query: 226  GEELRFASLMGKARDACRFPFLTGAAPVCYGVP 324
            G   +  + MG+AR+ C  PFLTGA+PVCYGVP
Sbjct: 2128 GHRRKIGAFMGQAREVCTLPFLTGASPVCYGVP 2160

>ref|XP_003532977.1| PREDICTED: uncharacterized protein LOC100791010
           [Glycine max]
          Length = 2142

 Score =  127 bits (320), Expect = 7e-28
 Identities = 66/153 (43%), Positives = 83/153 (54%), Gaps = 45/153 (29%)
 Frame = +1

Query: 1    IENLDRCAATLLMGCSSGSLVIKGQYTPEGPPLSYLLAGCPAIIANLWDVLSNDINRYCK 180
            I+ LD+CAATLLMGCSSGSL + GQY P+G PLSYLLAG PAI+ NLW+V   DI+R+ K
Sbjct: 1981 IQKLDKCAATLLMGCSSGSLTLPGQYAPQGIPLSYLLAGSPAIVGNLWEVTDKDIDRFGK 2040

Query: 181  VLLDAWLRDASQ---------------------------------------------TDY 225
             +LDAWL++ S                                               +
Sbjct: 2041 AMLDAWLKERSDMPTECLQCNLLSEEFEAMNLKGCKGRAKRKAPRKKLLELAESESPKNC 2100

Query: 226  GEELRFASLMGKARDACRFPFLTGAAPVCYGVP 324
            G   +  + MG+AR+ C  PFLTGA+PVCYGVP
Sbjct: 2101 GHRRKIGAFMGQAREVCTLPFLTGASPVCYGVP 2133

>gb|EIE25862.1| hypothetical protein COCSUDRAFT_40112 [Coccomyxa
           subellipsoidea C-169]
          Length = 2026

 Score =  113 bits (282), Expect = 2e-23
 Identities = 56/115 (48%), Positives = 69/115 (60%), Gaps = 7/115 (6%)
 Frame = +1

Query: 1    IENLDRCAATLLMGCSSGSLVIKGQYTPEGPPLSYLLAGCPAIIANLWDVLSNDINRYCK 180
            +  L RC+A+LLMGCSSG L   G Y P GP L+YLLAGCP  +ANLWDV   DI+R+
Sbjct: 1906 LRRLPRCSASLLMGCSSGRLCRAGAYDPSGPVLAYLLAGCPTAVANLWDVTDRDIDRFAM 1965

Query: 181  VLLDAWL-RDASQTDYGEE------LRFASLMGKARDACRFPFLTGAAPVCYGVP 324
             LL+ WL  DA       E      +  +  +  +R  CR P L GAAPVCYG+P
Sbjct: 1966 ALLEKWLPADAESASAPNEDGKAGSMCISGSVAVSRSVCRLPHLIGAAPVCYGIP 2020

>gb|EFN57090.1| hypothetical protein CHLNCDRAFT_143874 [Chlorella
           variabilis]
          Length = 2177

 Score =  108 bits (271), Expect = 3e-22
 Identities = 55/126 (43%), Positives = 72/126 (57%), Gaps = 18/126 (14%)
 Frame = +1

Query: 1    IENLDRCAATLLMGCSSGSLVIKGQYTPEGPPLSYLLAGCPAIIANLWDVLSNDINRYCK 180
            + +L+RC+A LLMGCSSG L  +  Y P G  L+YLLAGCPA +ANLWDV   DI+R+ +
Sbjct: 2043 LRSLERCSAALLMGCSSGRLRAQQHYEPIGAVLAYLLAGCPAAVANLWDVTDKDIDRFSQ 2102

Query: 181  VLLDAWLRDA------------------SQTDYGEELRFASLMGKARDACRFPFLTGAAP 306
             LL AW+  A                  S +D        + +  +R AC+ P L GAAP
Sbjct: 2103 ALLTAWISGASGGSGGGSSGGDSGVDSGSSSDGNSGSDMCAAVAASRAACKLPHLVGAAP 2162

Query: 307  VCYGVP 324
            VCYG+P
Sbjct: 2163 VCYGIP 2168

>ref|XP_001689575.1| separase, cell cycle protease [Chlamydomonas
           reinhardtii]
 gi|158283563|gb|EDP09313.1| separase, cell cycle protease [Chlamydomonas
           reinhardtii]
          Length = 2337

 Score =  108 bits (270), Expect = 5e-22
 Identities = 57/125 (45%), Positives = 71/125 (56%), Gaps = 17/125 (13%)
 Frame = +1

Query: 1    IENLDRCAATLLMGCSSGSLVIKGQYTPEGPPLSYLLAGCPAIIANLWDVLSNDINRYCK 180
            +  L RCAA +LMGCSSG L + G Y P G  ++Y +AG PA++ANLWDV   DI+RYC+
Sbjct: 2210 LRKLQRCAAAVLMGCSSGRLRLHGAYDPAGAVVAYAVAGSPAVVANLWDVTDRDIDRYCQ 2269

Query: 181  VLLDAWL-----------RDASQTDYGEELRFASLMG------KARDACRFPFLTGAAPV 309
             LL  WL             A Q D  +E +     G       +R ACR P L GAAPV
Sbjct: 2270 ALLRNWLGCADPQAAAAAAAAGQEDEEQEAQPVGWAGLGQAVVSSRGACRLPHLIGAAPV 2329

Query: 310  CYGVP 324
            CYG+P
Sbjct: 2330 CYGLP 2334