BLASTX nr result

ID: Cephaelis21_contig00002275 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00002275
         (1959 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor,...   592   e-166
dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]     536   e-150
ref|XP_002884082.1| aspartyl protease family protein [Arabidopsi...   525   e-146
ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis tha...   523   e-146
ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein ...   506   e-140

>ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223525945|gb|EEF28342.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 533

 Score =  592 bits (1526), Expect = e-166
 Identities = 288/442 (65%), Positives = 347/442 (78%), Gaps = 2/442 (0%)
 Frame = -1

Query: 1593 FGTFGFDIHHRYSHPVKGFLHPDDDLPEKGTTDYFATMARRDHLIRARHLADTTTTTPVV 1414
            FGTFGFD+HHRYS PVKG L  DD LPEKG+  Y+A+MA RD LI  R L    T+TP+ 
Sbjct: 38   FGTFGFDLHHRYSDPVKGMLSVDD-LPEKGSLHYYASMAHRDILIHGRKLVSDNTSTPLT 96

Query: 1413 FADGNETVRFNSLGFLYYANVSVGTPDLWFLVALDTGSDLFWLPCNCRSS-CVQKLFTNT 1237
            F  GNET RF+SLGFL+YANVS+GTP L +LVALDTGSDLFWLPC+C +S CVQ L   +
Sbjct: 97   FFSGNETYRFSSLGFLHYANVSIGTPSLSYLVALDTGSDLFWLPCDCTNSGCVQGLQFPS 156

Query: 1236 GRKIDLNMYSPNASSTSMVVPCNSTTCSAQRKCAVSRNACAYQIQYLSNGTSSTGVLIED 1057
            G +ID N+Y PNASSTS  +PCN+T CS Q +C  +++ C YQ+QYLSNGTSSTGVL+ED
Sbjct: 157  GEQIDFNIYRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCPYQVQYLSNGTSSTGVLVED 216

Query: 1056 VLHLATDDSQQK-VDAPITLGCGILQTGGFLSGAAPNGLFGLGMDNISVPSTLAEKGLAA 880
            +LHL TDD+Q + +DA I  GCG +QTG FL GAAPNGLFGLGM NISVPSTLA +G  +
Sbjct: 217  LLHLTTDDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMTNISVPSTLAREGYTS 276

Query: 879  NSFSMCFSRGGLGRIVFGDKGSSDQGETPFNLYQPHPTYNVSITQITVAENVTSVDFTAI 700
            NSFSMCF R G+GRI FGD GSS QGETPFNL Q HPTYNVSIT+I V      ++F+AI
Sbjct: 277  NSFSMCFGRDGIGRISFGDTGSSGQGETPFNLRQLHPTYNVSITKINVGGRDADLEFSAI 336

Query: 699  FDSGTSFTYLNDPAYTIITQNFNSQVHEPRYQSESQLPFDYCYMLSPNQTSVELPDLNLT 520
            FDSGTSFTYLNDPAYT+I+++FN    E RY S S +PF+YCY +S NQT++E+P +NL 
Sbjct: 337  FDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYEMSSNQTNLEIPTVNLV 396

Query: 519  MKGGDQFFVTNPVEIVGVTGGGAVYCLAVLKSGDVNIIGQNFMSGYRIVFNRENMALGWK 340
            M+GG QF VT+P+ IV + GG ++YCLA++KSGDVNIIGQNFM+GYRIVFNRE   LGWK
Sbjct: 397  MQGGSQFNVTDPIVIVILQGGASIYCLAIVKSGDVNIIGQNFMTGYRIVFNRERNVLGWK 456

Query: 339  ASDCNDAVQANTSTFPINHQSP 274
            ASDC D    +T+TFP++  SP
Sbjct: 457  ASDCYD--DMDTTTFPVDPISP 476


>dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
          Length = 515

 Score =  536 bits (1382), Expect = e-150
 Identities = 262/443 (59%), Positives = 332/443 (74%), Gaps = 1/443 (0%)
 Frame = -1

Query: 1602 CEAFGTFGFDIHHRYSHPVKGFLHPDDDLPEKGTTDYFATMARRDHLIRARHLADTTTTT 1423
            CE  G FGF+ HHR+S  V G L P D LP + ++ Y+  MA RD LIR R LA    + 
Sbjct: 27   CEGLGEFGFEFHHRFSDQVVGVL-PGDGLPNRDSSKYYRVMAHRDRLIRGRRLASEDQSL 85

Query: 1422 PVVFADGNETVRFNSLGFLYYANVSVGTPDLWFLVALDTGSDLFWLPCNCRSSCVQKLFT 1243
             V FADGNET+R N+LGFL+YANV+VGTP  WFLVALDTGSDLFWLPC+C ++CV++L  
Sbjct: 86   -VTFADGNETIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCDCSTNCVRELKA 144

Query: 1242 NTGRKIDLNMYSPNASSTSMVVPCNSTTCSAQRKCAVSRNACAYQIQYLSNGTSSTGVLI 1063
              G  +DLN+YSPNASSTS  VPCNST C+   +CA   + C YQI+YLSNGTSSTGVL+
Sbjct: 145  PGGSSLDLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCPYQIRYLSNGTSSTGVLV 204

Query: 1062 EDVLHLATDDSQQK-VDAPITLGCGILQTGGFLSGAAPNGLFGLGMDNISVPSTLAEKGL 886
            EDVLHL + +   K + A ITLGCG++QTG F  GAAPNGLFGLG+++ISVPS LA++G+
Sbjct: 205  EDVLHLVSMEKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGI 264

Query: 885  AANSFSMCFSRGGLGRIVFGDKGSSDQGETPFNLYQPHPTYNVSITQITVAENVTSVDFT 706
            AANSFSMCF   G GRI FGDKGS DQ ETP N+ QPHPTYNV++TQI+V  N   ++F 
Sbjct: 265  AANSFSMCFGDDGAGRISFGDKGSVDQRETPLNIRQPHPTYNVTVTQISVGGNTGDLEFD 324

Query: 705  AIFDSGTSFTYLNDPAYTIITQNFNSQVHEPRYQSESQLPFDYCYMLSPNQTSVELPDLN 526
            A+FD+GTSFTYL D  YT+I+++FNS   + RYQ++S+LPF+YCY +SPN+ S E PD+N
Sbjct: 325  AVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEYCYAVSPNKKSFEYPDVN 384

Query: 525  LTMKGGDQFFVTNPVEIVGVTGGGAVYCLAVLKSGDVNIIGQNFMSGYRIVFNRENMALG 346
            LTMKGG  + V +P+ +V +     VYCLA++KS D++IIGQNFM+GYR+VF+RE + LG
Sbjct: 385  LTMKGGSSYPVYHPLIVVPIE-DTVVYCLAIMKSEDISIIGQNFMTGYRVVFDREKLILG 443

Query: 345  WKASDCNDAVQANTSTFPINHQS 277
            WK SDC+   + +  T P N  S
Sbjct: 444  WKESDCSTG-ETSARTQPSNRSS 465


>ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297329922|gb|EFH60341.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score =  525 bits (1353), Expect = e-146
 Identities = 259/446 (58%), Positives = 333/446 (74%), Gaps = 2/446 (0%)
 Frame = -1

Query: 1608 QSCEAFGTFGFDIHHRYSHPVKGFLHPDDDLPEKGTTDYFATMARRDHLIRARHLADTTT 1429
            + CE FG FGF+ HHR+S  V G L P D LP + ++ Y+  MA RD LIR R LA+   
Sbjct: 25   ERCEGFGEFGFEFHHRFSDQVVGVL-PGDGLPNRDSSKYYRVMAHRDRLIRGRRLANEDQ 83

Query: 1428 TTPVVFADGNETVRFNSLGFLYYANVSVGTPDLWFLVALDTGSDLFWLPCNCRSSCVQKL 1249
            +  V F+DGNET+R ++LGFL+YANV+VGTP  WFLVALDTGSDLFWLPC+C ++CV++L
Sbjct: 84   SL-VTFSDGNETIRVDALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCDC-TNCVREL 141

Query: 1248 FTNTGRKIDLNMYSPNASSTSMVVPCNSTTCSAQRKCAVSRNACAYQIQYLSNGTSSTGV 1069
                G  +DLN+YSPNASSTS  VPCNST C+   +CA   + C YQI+YLSNGTSSTGV
Sbjct: 142  KAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESNCPYQIRYLSNGTSSTGV 201

Query: 1068 LIEDVLHLATDDSQQK-VDAPITLGCGILQTGGFLSGAAPNGLFGLGMDNISVPSTLAEK 892
            L+EDVLHL ++D   K + A +TLGCG +QTG F  GAAPNGLFGLG+++ISVPS LA++
Sbjct: 202  LVEDVLHLVSNDKSSKAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKE 261

Query: 891  GLAANSFSMCFSRGGLGRIVFGDKGSSDQGETPFNLYQPHPTYNVSITQITVAENVTSVD 712
            G+AANSFSMCF   G GRI FGDKGS DQ ETP N+ QPHPTYN+++T+I+V  N   ++
Sbjct: 262  GIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVEGNTGDLE 321

Query: 711  FTAIFDSGTSFTYLNDPAYTIITQNFNSQVHEPRYQ-SESQLPFDYCYMLSPNQTSVELP 535
            F A+FDSGTSFTYL D AYT+I+++FNS   + RYQ ++S+LPF+YCY LSPN+ S + P
Sbjct: 322  FDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYP 381

Query: 534  DLNLTMKGGDQFFVTNPVEIVGVTGGGAVYCLAVLKSGDVNIIGQNFMSGYRIVFNRENM 355
             +NLTMKGG  + V +P+ ++ +     VYCLA+LK  D++IIGQNFM+GYR+VF+RE +
Sbjct: 382  AVNLTMKGGSSYPVYHPLVVIPMKDTD-VYCLAILKIEDISIIGQNFMTGYRVVFDREKL 440

Query: 354  ALGWKASDCNDAVQANTSTFPINHQS 277
             LGWK SDC    + +  T P N  S
Sbjct: 441  ILGWKESDCYTG-ETSARTLPSNRSS 465


>ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
            gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid
            DNA-binding protein [Arabidopsis thaliana]
            gi|22655368|gb|AAM98276.1| At2g17760/At2g17760
            [Arabidopsis thaliana] gi|330251585|gb|AEC06679.1|
            aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 513

 Score =  523 bits (1346), Expect = e-146
 Identities = 257/444 (57%), Positives = 331/444 (74%), Gaps = 2/444 (0%)
 Frame = -1

Query: 1602 CEAFGTFGFDIHHRYSHPVKGFLHPDDDLPEKGTTDYFATMARRDHLIRARHLADTTTTT 1423
            CE FG FGF+ HHR+S  V G L P D LP + ++ Y+  MA RD LIR R LA+   + 
Sbjct: 27   CEGFGEFGFEFHHRFSDQVVGVL-PGDGLPNRDSSKYYRVMAHRDRLIRGRRLANEDQSL 85

Query: 1422 PVVFADGNETVRFNSLGFLYYANVSVGTPDLWFLVALDTGSDLFWLPCNCRSSCVQKLFT 1243
             V F+DGNETVR ++LGFL+YANV+VGTP  WF+VALDTGSDLFWLPC+C ++CV++L  
Sbjct: 86   -VTFSDGNETVRVDALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDC-TNCVRELKA 143

Query: 1242 NTGRKIDLNMYSPNASSTSMVVPCNSTTCSAQRKCAVSRNACAYQIQYLSNGTSSTGVLI 1063
              G  +DLN+YSPNASSTS  VPCNST C+   +CA   + C YQI+YLSNGTSSTGVL+
Sbjct: 144  PGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLV 203

Query: 1062 EDVLHLATDDSQQK-VDAPITLGCGILQTGGFLSGAAPNGLFGLGMDNISVPSTLAEKGL 886
            EDVLHL ++D   K + A +T GCG +QTG F  GAAPNGLFGLG+++ISVPS LA++G+
Sbjct: 204  EDVLHLVSNDKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGI 263

Query: 885  AANSFSMCFSRGGLGRIVFGDKGSSDQGETPFNLYQPHPTYNVSITQITVAENVTSVDFT 706
            AANSFSMCF   G GRI FGDKGS DQ ETP N+ QPHPTYN+++T+I+V  N   ++F 
Sbjct: 264  AANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFD 323

Query: 705  AIFDSGTSFTYLNDPAYTIITQNFNSQVHEPRYQ-SESQLPFDYCYMLSPNQTSVELPDL 529
            A+FDSGTSFTYL D AYT+I+++FNS   + RYQ ++S+LPF+YCY LSPN+ S + P +
Sbjct: 324  AVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAV 383

Query: 528  NLTMKGGDQFFVTNPVEIVGVTGGGAVYCLAVLKSGDVNIIGQNFMSGYRIVFNRENMAL 349
            NLTMKGG  + V +P+ ++ +     VYCLA++K  D++IIGQNFM+GYR+VF+RE + L
Sbjct: 384  NLTMKGGSSYPVYHPLVVIPMKDTD-VYCLAIMKIEDISIIGQNFMTGYRVVFDREKLIL 442

Query: 348  GWKASDCNDAVQANTSTFPINHQS 277
            GWK SDC    + +  T P N  S
Sbjct: 443  GWKESDCYTG-ETSARTLPSNRSS 465


>ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 508

 Score =  506 bits (1302), Expect = e-140
 Identities = 267/448 (59%), Positives = 317/448 (70%), Gaps = 4/448 (0%)
 Frame = -1

Query: 1608 QSCEAFGTFGFDIHHRYSHPVKGFLHPDDDLPEKGTTDYFATMARRDHLIRARHLADTTT 1429
            QSC A  +FGFDIHHR+S PVK  L   D LP+KGT  Y+  MA RD + R R LA    
Sbjct: 22   QSCHALHSFGFDIHHRFSDPVKEILGVHD-LPDKGTRQYYVAMAHRDRIFRGRRLA-AGY 79

Query: 1428 TTPVVFADGNETVRFNSLGFLYYANVSVGTPDLWFLVALDTGSDLFWLPCNCRSSCVQKL 1249
             +P+ F   NET +  + GFL++ANVSVGTP L FLVALDTGSDLFWLPCNC + CV  +
Sbjct: 80   HSPLTFIPSNETYQIEAFGFLHFANVSVGTPPLSFLVALDTGSDLFWLPCNC-TKCVHGI 138

Query: 1248 FTNTGRKIDLNMYSPNASSTSMVVPCNSTTCSAQRKCAVSRNACAYQIQYLSNGTSSTGV 1069
              + G KI  N+Y    SSTS  V CNS+ C  QR+C  S   C Y++ YLSNGTS+TG 
Sbjct: 139  GLSNGEKIAFNIYDLKGSSTSQPVLCNSSLCELQRQCPSSDTICPYEVNYLSNGTSTTGF 198

Query: 1068 LIEDVLHLATDDSQQK-VDAPITLGCGILQTGGFLSGAAPNGLFGLGMDNISVPSTLAEK 892
            L+EDVLHL TDD + K  D  IT GCG +QTG FL GAAPNGLFGLGM N SVPS LA++
Sbjct: 199  LVEDVLHLITDDDKTKDADTRITFGCGQVQTGAFLDGAAPNGLFGLGMSNESVPSILAKE 258

Query: 891  GLAANSFSMCFSRGGLGRIVFGDKGSSDQGETPFNLYQPHPTYNVSITQITVAENVTSVD 712
            GL +NSFSMCF   GLGRI FGD  S  QG+TPFNL   HPTYN+++TQI V E V  ++
Sbjct: 259  GLTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPFNLRALHPTYNITVTQIIVGEKVDDLE 318

Query: 711  FTAIFDSGTSFTYLNDPAYTIITQNFNSQVHEPRY--QSESQLPFDYCYMLSPNQTSVEL 538
            F AIFDSGTSFTYLNDPAY  IT +FNS++   R+   S ++LPF+YCY LSPNQT VEL
Sbjct: 319  FHAIFDSGTSFTYLNDPAYKQITNSFNSEIKLQRHSTSSSNELPFEYCYELSPNQT-VEL 377

Query: 537  PDLNLTMKGGDQFFVTNPVEIVGVTGGGA-VYCLAVLKSGDVNIIGQNFMSGYRIVFNRE 361
              +NLTMKGGD + VT+P  IV V+G G  + CL VLKS +VNIIGQNFM+GYRIVF+RE
Sbjct: 378  -SINLTMKGGDNYLVTDP--IVTVSGEGINLLCLGVLKSNNVNIIGQNFMTGYRIVFDRE 434

Query: 360  NMALGWKASDCNDAVQANTSTFPINHQS 277
            NM LGW+ S+C D      ST PIN  +
Sbjct: 435  NMILGWRESNCYD---DELSTLPINRSN 459


Top