BLASTX nr result
ID: Cephaelis21_contig00000903
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00000903
         (2304 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003534756.1| PREDICTED: uncharacterized protein LOC100781...   385   e-104
ref|XP_002304112.1| predicted protein [Populus trichocarpa] gi|2...   377   e-102
ref|XP_002519384.1| conserved hypothetical protein [Ricinus comm...   374   e-101
gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indi...   361   5e-97
gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japo...   342   3e-91

>ref|XP_003534756.1| PREDICTED: uncharacterized protein LOC100781827 [Glycine max]
          Length = 426

 Score = 385 bits (989), Expect = e-104
 Identities = 228/451 (50%), Positives = 287/451 (63%), Gaps = 39/451 (8%)
 Frame = -2

Query: 2207 VELPLGKAAATFSLEKAVCSHGLFMMAPNFWDPQSKTLQRPLRLSHNSYD---PDHQTSV 2037
            +ELP + F LE+AVCSHGLFMM PN WDP SKTL RPLR S +S+ H S+
Sbjct: 1    MELP-----SPFQLEQAVCSHGLFMMPPNHWDPLSKTLIRPLRSSPSSFLVSLSQHSQSL 55

Query: 2036 AVRISQPSHSSHSLQVQVFGTDXXXXXXXXXXXXQVSRMLRLSEEDNKIVREFQEIHGVA 1857
            AVR+ H++H+L Q QVSRMLR SE + K VREF+ +H V
Sbjct: 56   AVRV----HATHALSPQ----------QQNHITAQVSRMLRFSEAEEKAVREFRSLHVVD 101

Query: 1856 K-ERGF-GRVFRSPTLFEDMIKCVLLCNCQWSRTLSMARALCELQWELQHPLSSTSCAKL 1683
            R F GRVFRSPTLFEDM+KC+LLCNCQW RTLSMA+ALCELQ ELQ+ + C
Sbjct: 102  HPNRSFSGRVFRSPTLFEDMVKCILLCNCQWPRTLSMAQALCELQLELQN---GSPCTIA 158

Query: 1682 YSGTNNSSPIVDSKHFIPNTPDKKEAKRKL----GVL---------NYAIDHMSNCSQ-- 1548
            SG + +S+ FIP TP KE +R G+ N IDH+ S
Sbjct: 159  VSGNSKG----ESEGFIPKTPASKETRRNKVSTKGMFCKKKLELDGNLQIDHVVASSSTA 214

Query: 1547 ---------ENEELK-------YSNGTRADSQHGIGNFPSPKELASVNASYLAKRCNLGY 1416
            ++EEL+ +SNG S+ GNFPSP ELA+++ S+LAKRC LGY
Sbjct: 215  TTLLTTDNGDSEELRSHDSCHEFSNGNEYFSR--TGNFPSPSELANLDESFLAKRCGLGY 272

Query: 1415 RASRILKLAQLVEDGGIQLRELEEACRTPTLCNYDKLAEQLKVIDGFGPFTCANVLVCVG 1236
            RA I++LA+ + +G IQL +LEE + +L NY +L +QLK I G+GPFT ANVL+C+G
Sbjct: 273  RAGYIIELARAIVEGKIQLGQLEELSKDASLSNYKQLDDQLKQIRGYGPFTRANVLMCLG 332

Query: 1235 FYHAIPSDSETIRHLKQVHARHTTTKTVHGDLEIIYGKYAPFQFLAYWSEVWHFYEEWFG 1056
            +YH IP+DSET+RHLKQVH+R+TT+KT+ +LE IYGKY P+QFLA+WSEVW FYE FG
Sbjct: 333  YYHVIPTDSETVRHLKQVHSRYTTSKTIERELEEIYGKYEPYQFLAFWSEVWDFYETRFG 392

Query: 1055 KLSEMAPSSYKLITAANMKP---KKKAQERK 972
            KL+EM S YKLITA NM+ K+K RK
Sbjct: 393  KLNEMHSSDYKLITACNMRSTTNKRKRPSRK 423

>ref|XP_002304112.1| predicted protein [Populus trichocarpa]
 gi|222841544|gb|EEE79091.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score = 377 bits (969), Expect = e-102
 Identities = 220/445 (49%), Positives = 268/445 (60%), Gaps = 55/445 (12%)
 Frame = -2

Query: 2261 MEACETAGIVDGNRGSMVVELPLGKAAATFSLEKAVCSHGLFMMAPNFWDPQSKTLQRPL 2082
            ME +T G + S+V E+PLG AA TF+LEKAVCSHGLFMM+PN WDP S T RPL
Sbjct: 1    MENTKTDGKEEEEEESVVFEIPLGDAAETFNLEKAVCSHGLFMMSPNHWDPLSLTFSRPL 60

Query: 2081 RLSHNSYDPDHQT---SVAVRISQPSHSSHSLQVQVFGTDXXXXXXXXXXXXQVSRMLRL 1911
            RLS + DP T S+ V IS P H SL V+V+GT QV RMLRL
Sbjct: 61   RLSLSDSDPQVSTPTTSLFVSISHPPHLPRSLSVRVYGTRCLSPKHQESLVAQVVRMLRL 120

Query: 1910 SEEDNKIVREFQEIHGVAKER-------GFG-RVFRSPTLFEDMIKCVLLCNCQWSRTLS 1755
            SE D + REF++I A GFG RVFRSPTLFEDM+KC+LLCNCQW RTLS
Sbjct: 121  SETDERNAREFRKIAEAAAAEENNSWLTGFGGRVFRSPTLFEDMVKCILLCNCQWPRTLS 180

Query: 1754 MARALCELQWELQHPLSSTSCAKLYSGTNNSSPIVDSKHFIPNTPDKKEAKRKLGV---- 1587
            MARALCELQ ELQ S A+ + T + + +FIPNT KE+KR +
Sbjct: 181  MARALCELQCELQCKSSGVFVAQAVNATVKNKCNDTAHNFIPNTSAGKESKRNIRASKVT 240

Query: 1586 -----------------LNYAIDHMSNCSQENEELKYSNGTRADSQHG------------ 1494
            N D + E ++ + R S+HG
Sbjct: 241  KNLASKIVETETLLEADANLKTDSAHIGRETLESVENDSCARCSSRHGSDSWAPDSLQSQ 300

Query: 1493 ----------IGNFPSPKELASVNASYLAKRCNLGYRASRILKLAQLVEDGGIQLRELEE 1344
            I NFPSP+ELA+++ S+LAKRCNLGYRA RI+KLAQ + +G I LRE+EE
Sbjct: 301  HGIQPGVNKMICNFPSPRELANLDESFLAKRCNLGYRAIRIIKLAQSIVEGRIPLREVEE 360

Query: 1343 ACRTPTLCN-YDKLAEQLKVIDGFGPFTCANVLVCVGFYHAIPSDSETIRHLKQVHARHT 1167
            C + Y+KLA+Q + IDGFGPFTCANVL+C+GFYH IP+DSET+RHLKQVHA+ +
Sbjct: 361  DCANGASSSCYNKLADQFRQIDGFGPFTCANVLMCMGFYHIIPTDSETVRHLKQVHAKKS 420

Query: 1166 TTKTVHGDLEIIYGKYAPFQFLAYW 1092
            T +TV D+E IYGKYAPFQFLAYW
Sbjct: 421  TIQTVQRDVEEIYGKYAPFQFLAYW 445

>ref|XP_002519384.1| conserved hypothetical protein [Ricinus communis]
 gi|223541451|gb|EEF43001.1| conserved hypothetical protein [Ricinus communis]
          Length = 458

 Score = 374 bits (961), Expect = e-101
 Identities = 222/465 (47%), Positives = 274/465 (58%), Gaps = 50/465 (10%)
 Frame = -2

Query: 2192 GKAAATFSLEKAVCSHGLFMMAPNFWDPQSKTLQRPLRLSHNSYDPDHQTSVAVRISQPS 2013
            G+AA TF LEK VCSHGLFM++PN WDP S+T RPLRL+ D S+ V ISQ
Sbjct: 16   GEAADTFDLEKTVCSHGLFMLSPNHWDPLSRTFSRPLRLND-----DTDNSLMVSISQ-- 68

Query: 2012 HSSHSLQVQVFGTDXXXXXXXXXXXXQVSRMLRLSEEDNKIVREFQEIHGVAKERGF--- 1842
            H S SL V+V+G Q+ RMLRLS+ D REF++I +
Sbjct: 69   HLSKSLLVRVYGNRSLSPKHQESLLVQIVRMLRLSDMDEFNAREFRKIVSAFEGEECPLI 128

Query: 1841 ----GRVFRSPTLFEDMIKCVLLCNCQWSRTLSMARALCELQWELQHPLSSTSCAKLYSG 1674
            GRV RSPTLFEDM+KC+LLCNCQWSRTLSMA ALC+ Q EL A
Sbjct: 129  GDFGGRVLRSPTLFEDMVKCILLCNCQWSRTLSMADALCKFQIELHSQSPQQKHA----- 183

Query: 1673 TNNSSPIVDSKHFIPNTPDKKEAKRKLGVLNYAIDHMS---------------------N 1557
            HFIPNTP KKE KRK+ + + M N
Sbjct: 184  ---------FNHFIPNTPVKKEPKRKIRLSKVPTESMDLEAADTCLTTDDSQMKISNSLN 234

Query: 1556 CSQEN--EELK--------YSNGTRADS--------QH----GIGNFPSPKELASVNASY 1443
            C + + LK YS G A S QH GNFPSP+ELA+++ +
Sbjct: 235  CVDDGSFDNLKSCQGSNTFYSTGPYATSDIQSHLVTQHCAKKTTGNFPSPRELANLDERF 294

Query: 1442 LAKRCNLGYRASRILKLAQLVEDGGIQLRELEEACRTPTLCNYDKLAEQLKVIDGFGPFT 1263
            LAKRC LGYRA RI+KLAQ + +G I LRE E+ +L Y KL +QL+ I+GFGPFT
Sbjct: 295  LAKRCGLGYRAGRIIKLAQGIVEGRIPLREFEQVSNGGSLSTYSKLTDQLREIEGFGPFT 354

Query: 1262 CANVLVCVGFYHAIPSDSETIRHLKQVHARHTTTKTVHGDLEIIYGKYAPFQFLAYWSEV 1083
            ANVL+C+GFYH IP+DSET+RH KQVHA+++T KTV + E IY K+APFQFL YW+E+
Sbjct: 355  RANVLMCMGFYHVIPTDSETVRHFKQVHAKNSTIKTVQSEAEEIYRKFAPFQFLVYWAEL 414

Query: 1082 WHFYEEWFGKLSEMAPSSYKLITAANMKPKKKAQERKRTKASRPE 948
            WHFYE+ FGKLSEM S+YKLITA+N++ K + KR K SR +
Sbjct: 415  WHFYEQRFGKLSEMPCSNYKLITASNLR-NKGHHKAKRAKISRAD 458

>gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indica Group]
          Length = 463

 Score = 361 bits (926), Expect = 5e-97
 Identities = 208/444 (46%), Positives = 275/444 (61%), Gaps = 41/444 (9%)
 Frame = -2

Query: 2207 VELPLGKA-----AATFSLEKAVCSHGLFMMAPNFWDPQSKTLQRPLRLSHNSYDPDHQT 2043
            +ELPLG A AA F LE AVCSHGLFMMAPN WDP S+ L RPLRL+ D
Sbjct: 21   LELPLGGAPPYPGAAPFDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLAS-----DRAA 75

Query: 2042 SVAVRISQ-PSHSSHSLQVQVFGT--DXXXXXXXXXXXXQVSRMLRLSEEDNKIVREFQE 1872
            SVAVR+S+ P+ S +L V V G D QV RMLRL EED + EFQ
Sbjct: 76   SVAVRVSRHPARPSDALLVSVLGAPGDALSPPDQTSILEQVRRMLRLDEEDGRAAAEFQA 135

Query: 1871 IHGVAKERGFGRVFRSPTLFEDMIKCVLLCNCQWSRTLSMARALCELQWELQHPLSSTS- 1695
            +H VA+E GFGR+FRSPTLFEDM+KC+LLCNCQW+RTLSM+ ALCELQ EL+ S+ +
Sbjct: 136  MHAVAREAGFGRIFRSPTLFEDMVKCILLCNCQWTRTLSMSTALCELQLELRSSSSTENF 195

Query: 1694 ---------CAKLYSGTNNSSPIVDSK------------HFIPNTPDKKEAKRKLGVLNY 1578
            C + S N +++K + +T + + + + +
Sbjct: 196  QSRTPPIRECKRKRSNKRNVRVKLETKFNEDKLVCLEDPNLATDTANLQTYENSFNLPSA 255

Query: 1577 A--IDHMSNCSQENEELKYSNGTRADSQHGIGNFPSPKELASVNASYLAKRCNLGYRASR 1404
            A + S S ++ ELK N + G +FP+P+ELA+++ +LAKRCNLGYRA R
Sbjct: 256  ASGTGNTSEVSLDHSELKLRNEPCLEDCGG--DFPTPEELANLDEDFLAKRCNLGYRARR 313

Query: 1403 ILKLAQLVEDGGIQLRELEEACR---------TPTLCNYDKLAEQLKVIDGFGPFTCANV 1251
            I+ LA+ + +G I L++LEE + + T YD+L E+L I GFGPFT ANV
Sbjct: 314  IVMLARSIVEGKICLQKLEEIRKMSVPTVEGLSTTPSTYDRLNEELSTISGFGPFTRANV 373

Query: 1250 LVCVGFYHAIPSDSETIRHLKQVHARHTTTKTVHGDLEIIYGKYAPFQFLAYWSEVWHFY 1071
            L+C+GF+H IP+D+ETIRHLKQ H R +T +V +L+ IYGKYAPFQFLAYW E+W FY
Sbjct: 374  LMCMGFFHMIPADTETIRHLKQFHKRASTISSVQKELDNIYGKYAPFQFLAYWCELWGFY 433

Query: 1070 EEWFGKLSEMAPSSYKLITAANMK 999
            + FGK+S+M P +Y+L TA+ +K
Sbjct: 434  NKQFGKISDMEPINYRLFTASKLK 457

>gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japonica Group]
          Length = 442

 Score = 342 bits (876), Expect = 3e-91
 Identities = 203/432 (46%), Positives = 262/432 (60%), Gaps = 29/432 (6%)
 Frame = -2

Query: 2207 VELPLGKA-----AATFSLEKAVCSHGLFMMAPNFWDPQSKTLQRPLRLSHNSYDPDHQT 2043
            +ELPLG A AA F LE AVCSHGLFMMAPN WDP S+ L RPLRL+ D
Sbjct: 21   LELPLGGAPPYPGAAPFDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLAS-----DRAA 75

Query: 2042 SVAVRISQ-PSHSSHSLQVQVFGT---DXXXXXXXXXXXXQVSRMLRLSEEDNKIVREFQ 1875
            SVAVR+S+ P+ S +L V V G D QV RMLRL EED + V EFQ
Sbjct: 76   SVAVRVSRHPARPSDALLVSVLGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQ 135

Query: 1874 EIHGVAKERGFGRVFRSPTLFEDMIKCVLLCNCQWSRTLSMARALCELQWELQHPLSSTS 1695
            +H VA+E GFGR+FRSPTLFEDMIKC+LLCNCQW+RTLSM+ ALCELQ EL+ S+ +
Sbjct: 136  AMHAVAREVGFGRIFRSPTLFEDMIKCILLCNCQWTRTLSMSTALCELQLELRSSSSTEN 195

Query: 1694 ----------CAKLYSGTNN-----SSPIVDSKHFIPNTPD--KKEAKRKLGVLNYAIDH 1566
            C + S N + + K P+ A L L + +
Sbjct: 196  FQSRTPPIRECKRKRSNKRNVRVKLETKFNEDKMVCLEDPNLATNTANENLFSLPSSANE 255

Query: 1565 MSNCSQ---ENEELKYSNGTRADSQHGIGNFPSPKELASVNASYLAKRCNLGYRASRILK 1395
            N S+ ++ ELK + G+FP+P+ELA+++ +LAKRCNLGYRA RI+
Sbjct: 256  TGNTSEVSLDHSELKLRYELCLEDCG--GDFPTPEELANLDEDFLAKRCNLGYRARRIVM 313

Query: 1394 LAQLVEDGGIQLRELEEACRTPTLCNYDKLAEQLKVIDGFGPFTCANVLVCVGFYHAIPS 1215
            LA+ + +G I L++LEE + L E+L I G PF NVL+C+GF+H IP+
Sbjct: 314  LARSIVEGKICLQKLEEIRKI--------LIEELSTISGIWPFHSCNVLMCMGFFHMIPA 365

Query: 1214 DSETIRHLKQVHARHTTTKTVHGDLEIIYGKYAPFQFLAYWSEVWHFYEEWFGKLSEMAP 1035
            D+ETIRHLKQ H R +T +V +L+ IYGKYAPFQFLAYW E+W FY + FG +S+M P
Sbjct: 366  DTETIRHLKQFHKRASTISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGIISDMEP 425

Query: 1034 SSYKLITAANMK 999
            +Y+L TA+ +K
Sbjct: 426  INYRLFTASKLK 437