BLASTX nr result

ID: Cimicifuga21_contig00008074

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cimicifuga21_contig00008074
         (1411 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002273997.2| PREDICTED: LOW QUALITY PROTEIN: glyoxysomal ...   469   e-129
ref|XP_002305124.1| predicted protein [Populus trichocarpa] gi|2...   444   e-122
ref|XP_002329829.1| predicted protein [Populus trichocarpa] gi|2...   414   e-113
ref|XP_002509448.1| trypsin domain-containing protein, putative ...   408   e-111
emb|CAN59793.1| hypothetical protein VITISV_001901 [Vitis vinifera]   395   e-107

>ref|XP_002273997.2| PREDICTED: LOW QUALITY PROTEIN: glyoxysomal processing protease,
            glyoxysomal-like [Vitis vinifera]
          Length = 753

 Score =  469 bits (1206), Expect = e-129
 Identities = 267/475 (56%), Positives = 319/475 (67%), Gaps = 6/475 (1%)
 Frame = +1

Query: 1    LIPGAQIDVMIETKMVQGNNLNKRNEGTLPWIPSKLLALVDVPASSLALQSIIEASGGSP 180
            LI G QIDVM+E      NN  + ++    W+P +LLALVDVPA SLA+QSIIEAS GS 
Sbjct: 103  LIHGVQIDVMVEE-----NNSEEIDKKAPHWLPVQLLALVDVPAFSLAVQSIIEASSGSR 157

Query: 181  ERGSWEVGWSLALLNNGRQSQKDGLQTQVGRDNRYPLENDSNSRQSESRKASSVSLSTIR 360
            E+G W+VGWSLA       +  D +QTQV       L    +    +S   S +  ST R
Sbjct: 158  EQG-WDVGWSLASYTGDSHTLVDAIQTQVS------LAXFLHFMVGDSSHPSLMGKSTAR 210

Query: 361  IAFLGVPSVTTEGLPNINISLSSKRGDLLLAMGSPFGVLSPVHFFNSIAVGSIANFCPSS 540
            IA LGV S+ ++ LPNI IS S+KRGDLLLAMGSPFGVLSPVHFFNSI+VGSIAN    S
Sbjct: 211  IALLGVSSINSKDLPNIAISPSNKRGDLLLAMGSPFGVLSPVHFFNSISVGSIANCYTPS 270

Query: 541  FYSSSLLMADIRGLPGMEGGPVFNEHAHLIGILNRPLRQRAGGAEVQLVIPWEAIALAQS 720
                SLLMADIR LPGMEGGPVFNEHA LIGIL RPLRQ+ GGAE+QLVIPWEAIA A  
Sbjct: 271  PSRRSLLMADIRCLPGMEGGPVFNEHAQLIGILTRPLRQKTGGAEIQLVIPWEAIATACC 330

Query: 721  GLLQNGAPKTGVI--YKKDDIHAVGNMCSSNSPESD---SSVGKNQDSHHLTALRIEKAM 885
             LLQ      G +  Y + +++AVG     +  +SD   +S+ +  D        IEKAM
Sbjct: 331  DLLQKEVQNEGEMKHYNRGNLNAVGKKYLFSGHDSDGPFNSMHQQPDCCSPPLSLIEKAM 390

Query: 886  ASVALVTIDEEAWASGVILNNHGLILTNAHLLEPWRFGKTDVVGGDMTTSGPLP-VTFQN 1062
            AS+ LVTID+  WASGV+LN+ GLILTNAHLLEPWRFGKT   GG       +P +  + 
Sbjct: 391  ASICLVTIDDGVWASGVVLNSQGLILTNAHLLEPWRFGKTVARGGRCGAEPEIPFIPSEE 450

Query: 1063 GMSMWQEGSKDQKERQSLLPNTVRNSDQSPGDEPTEYKVSSFYRSYKRIRVRLDHMNPWI 1242
             +    EG+   ++ Q LLP T++ +  S  D    YK SS YR ++ IR+RLDH +P I
Sbjct: 451  SVYCRDEGTYSHQKSQDLLPKTLKIAGSSVMDGHGGYKSSSTYRGHRNIRIRLDHTDPRI 510

Query: 1243 WCDARVVYISKGPLDIALLQLESFPEHVCPIIPDFTCPSSGSKAYVIGHGLFGPR 1407
            WCDARVVY+SKGPLDIALLQLE  P  +CPII DF CPS+GSKAYVIGHGLFGPR
Sbjct: 511  WCDARVVYVSKGPLDIALLQLEFVPGQLCPIIMDFACPSAGSKAYVIGHGLFGPR 565


>ref|XP_002305124.1| predicted protein [Populus trichocarpa] gi|222848088|gb|EEE85635.1|
            predicted protein [Populus trichocarpa]
          Length = 752

 Score =  444 bits (1142), Expect = e-122
 Identities = 257/474 (54%), Positives = 315/474 (66%), Gaps = 5/474 (1%)
 Frame = +1

Query: 1    LIPGAQIDVMIETKMVQGNNLNKR-NEGTLPWIPSKLLALVDVPASSLALQSIIEASGGS 177
            LIPGAQIDVM E K    N  +   ++GT  W+ ++++ LVDVP SSLALQS++EAS GS
Sbjct: 101  LIPGAQIDVMAEGKSDLRNGADGGLDKGTSHWLRAQVIRLVDVPLSSLALQSLVEASSGS 160

Query: 178  PERGSWEVGWSLALLNNGRQSQKDGLQTQVGRDNRYPLENDSNSRQSESRKASSVSLSTI 357
               G WEVGWSLA   NG QS  D +QTQ    N    E+   +R+ ES   S +  ST 
Sbjct: 161  MNHG-WEVGWSLASPENGSQSFMDVVQTQTEHGNASIAESQRRARE-ESSNPSIMGKSTT 218

Query: 358  RIAFLGVPSVTTEGLPNINISLSSKRGDLLLAMGSPFGVLSPVHFFNSIAVGSIANFCPS 537
            R+A LGV  +  + LPN  IS SS+RGD LLA+GSPFGVLSPVHFFNS++VGSIAN  P 
Sbjct: 219  RVAILGV-FLHLKDLPNFEISASSRRGDFLLAVGSPFGVLSPVHFFNSLSVGSIANCYPP 277

Query: 538  SFYSSSLLMADIRGLPGMEGGPVFNEHAHLIGILNRPLRQRAGGAEVQLVIPWEAIALAQ 717
                 SLLMADIR LPGMEG PVF E+++ IGIL RPLRQ++ GAE+QLVIPWEAIALA 
Sbjct: 278  RSSDISLLMADIRCLPGMEGSPVFCENSNFIGILIRPLRQKSSGAEIQLVIPWEAIALAC 337

Query: 718  SGLL----QNGAPKTGVIYKKDDIHAVGNMCSSNSPESDSSVGKNQDSHHLTALRIEKAM 885
            S LL    QN   + G+   K++++AVGN  SS+S        ++  S+  +   +EKAM
Sbjct: 338  SDLLLKEPQNA--EKGIHINKENLNAVGNAYSSSSDGPFPLKHEHHISYCSSPPPVEKAM 395

Query: 886  ASVALVTIDEEAWASGVILNNHGLILTNAHLLEPWRFGKTDVVGGDMTTSGPLPVTFQNG 1065
            AS+ L+TIDE  WASGV+LN+ GLILTNAHLLEPWRFGKT V GG+  T    P      
Sbjct: 396  ASICLITIDELVWASGVLLNDQGLILTNAHLLEPWRFGKTTVNGGEDGTKLQDPFIPPEE 455

Query: 1066 MSMWQEGSKDQKERQSLLPNTVRNSDQSPGDEPTEYKVSSFYRSYKRIRVRLDHMNPWIW 1245
               + E    +K  Q L P T+   + S  DE   YK+S  Y+    IRVRLDH +PWIW
Sbjct: 456  FPRYSEVDGHEK-TQRLPPKTLNIMNSSVADESKGYKLSLSYKGPMNIRVRLDHADPWIW 514

Query: 1246 CDARVVYISKGPLDIALLQLESFPEHVCPIIPDFTCPSSGSKAYVIGHGLFGPR 1407
            CDA+VV++ KGPLD+ALLQLE  P+ + P   DF C S GSKAYVIGHGLFGPR
Sbjct: 515  CDAKVVHVCKGPLDVALLQLEHVPDQLFPTKVDFECSSLGSKAYVIGHGLFGPR 568


>ref|XP_002329829.1| predicted protein [Populus trichocarpa] gi|222870891|gb|EEF08022.1|
            predicted protein [Populus trichocarpa]
          Length = 716

 Score =  414 bits (1064), Expect = e-113
 Identities = 244/478 (51%), Positives = 304/478 (63%), Gaps = 9/478 (1%)
 Frame = +1

Query: 1    LIPGAQIDVMIETKMVQGNNLNKRNEGTLP-----WIPSKLLALVDVPASSLALQSIIEA 165
            LIPGA +DVM+E K+     L K  +G L      W+ ++L+ LVDVP SSLALQS++EA
Sbjct: 101  LIPGAHVDVMVEGKL----GLRKDEDGVLDKGAPCWLSAQLIRLVDVPVSSLALQSLVEA 156

Query: 166  SGGSPERGSWEVGWSLALLNNGRQSQKDGLQTQVGRDNRYPLENDSNSRQSESRKASSVS 345
            S GS + G WEVGWSLA   +G Q   D   T+ G  N   +E+  ++R   S  +    
Sbjct: 157  SSGSMDHG-WEVGWSLASHESGPQPFMD---TEHG--NASTVESHRHARGGSSNPSIMGR 210

Query: 346  LSTIRIAFLGVPSVTTEGLPNINISLSSKRGDLLLAMGSPFGVLSPVHFFNSIAVGSIAN 525
            L+T R+A LGV  +  + LPN  I  S KRGD LLA+GSPFG+LSPVHFFNS++VGSIAN
Sbjct: 211  LTT-RVAILGV-FLHLKDLPNFKILASRKRGDFLLAVGSPFGILSPVHFFNSLSVGSIAN 268

Query: 526  FCPSSFYSSSLLMADIRGLPGMEGGPVFNEHAHLIGILNRPLRQRAGGAEVQLVIPWEAI 705
              P      SLLMAD R LPGMEG PVF E++  IGIL RPLRQ++ GAE+QLVIPWEAI
Sbjct: 269  CYPPRSSDISLLMADFRCLPGMEGSPVFGENSDFIGILIRPLRQKSTGAEIQLVIPWEAI 328

Query: 706  ALAQSGLL----QNGAPKTGVIYKKDDIHAVGNMCSSNSPESDSSVGKNQDSHHLTALRI 873
            A A S LL    QN   + G+ + K++++A                  + +SH  + L +
Sbjct: 329  ATACSDLLLKEPQNA--EKGIHFNKENLNA------------------HHNSHRPSPLPV 368

Query: 874  EKAMASVALVTIDEEAWASGVILNNHGLILTNAHLLEPWRFGKTDVVGGDMTTSGPLPVT 1053
            EKAMAS+ L+TIDE  WASGV+LN+ GLILTNAHLLEPWRFGKT V G +  T       
Sbjct: 369  EKAMASICLITIDEAVWASGVLLNDQGLILTNAHLLEPWRFGKTTVNGREDGTKSEDLFF 428

Query: 1054 FQNGMSMWQEGSKDQKERQSLLPNTVRNSDQSPGDEPTEYKVSSFYRSYKRIRVRLDHMN 1233
                 S + E    +K  Q L P T+   D    DE   YK+S  Y+  + IRVRLDH +
Sbjct: 429  PPKEFSRYSEVDGYRKS-QRLPPKTMNIVDSLVADERKGYKLSLSYKGSRNIRVRLDHAD 487

Query: 1234 PWIWCDARVVYISKGPLDIALLQLESFPEHVCPIIPDFTCPSSGSKAYVIGHGLFGPR 1407
            PWIWCDA+VVY+ KGPLD+ALLQLE  P+ +CP   DF  PS GSKAY+IGHGLFGPR
Sbjct: 488  PWIWCDAKVVYVCKGPLDVALLQLEHVPDQLCPTKVDFKSPSLGSKAYIIGHGLFGPR 545


>ref|XP_002509448.1| trypsin domain-containing protein, putative [Ricinus communis]
            gi|223549347|gb|EEF50835.1| trypsin domain-containing
            protein, putative [Ricinus communis]
          Length = 729

 Score =  408 bits (1049), Expect = e-111
 Identities = 234/451 (51%), Positives = 300/451 (66%), Gaps = 6/451 (1%)
 Frame = +1

Query: 73   NEGTLPWIPSKLLALVDVPASSLALQSIIEASGGSPERGSWEVGWSLALLNNGRQSQKDG 252
            ++GT  W  ++L+ LVDV  SSLALQS++E+S GS + G WE+GWSLA  +NG ++  D 
Sbjct: 113  DKGTSYWHTARLIRLVDVAESSLALQSLVESSLGSLDHG-WEIGWSLASHDNGHRNSMDV 171

Query: 253  LQTQVGRDNRYPLENDSNSRQSESRKASSVSLSTIRIAFLGVPSVTTEGLPNINISLSSK 432
            +QTQV           S ++  ES   + VS ++ RIA LGV S+  + LP I IS S  
Sbjct: 172  IQTQV-----------SKAQVGESGNPTLVSKTSTRIALLGV-SLNLKDLPIITISPSII 219

Query: 433  RGDLLLAMGSPFGVLSPVHFFNSIAVGSIANFCPSSFYSSSLLMADIRGLPGMEGGPVFN 612
            RGD LL +GSPFGVLSPVHFFNS+++GS+AN  P+   + SL+MADIR LPGMEG P F 
Sbjct: 220  RGDSLLTVGSPFGVLSPVHFFNSLSMGSVANCYPARSSNVSLVMADIRCLPGMEGAPAFG 279

Query: 613  EHAHLIGILNRPLRQRAGGAEVQLVIPWEAIALAQSGLL----QNGAPKTGVIYKKDDIH 780
            E    IGIL RPLRQ++ GAE+QLVIPWEAIA A   LL    QN   + G+   K++++
Sbjct: 280  ECGDFIGILTRPLRQKSTGAEIQLVIPWEAIATACGDLLLKEPQNA--EEGIAINKENLN 337

Query: 781  AVGNMCSSNSPESDSSVGKNQDSHHLTALRIEKAMASVALVTIDEEAWASGVILNNHGLI 960
            AV N  S  S    S   ++ +SH  + L +EK MASV L+TIDE  WASGV+LN+ GL+
Sbjct: 338  AVENAYSHESDGPFSYKYEHFNSHCSSTLPVEKVMASVCLITIDEGIWASGVLLNDQGLV 397

Query: 961  LTNAHLLEPWRFGKTDVVGG-DMTTSGPLPVTFQNGMSMWQEGSKDQKERQSLLP-NTVR 1134
            LTNAHLLEPWRFGKT + GG + T SG L +  +   S+    S     R S +P N  +
Sbjct: 398  LTNAHLLEPWRFGKTTINGGRNRTKSGALFLPPEG--SVIPGHSNVDSYRGSQMPLNKAK 455

Query: 1135 NSDQSPGDEPTEYKVSSFYRSYKRIRVRLDHMNPWIWCDARVVYISKGPLDIALLQLESF 1314
              D S  D+    ++S  Y  ++ IRVRLDH NPWIWCDA+V+Y+SKGPLD+ALLQLE  
Sbjct: 456  IMDSSVFDQTKGDQLSLSYSGHRNIRVRLDHFNPWIWCDAKVIYVSKGPLDVALLQLEYV 515

Query: 1315 PEHVCPIIPDFTCPSSGSKAYVIGHGLFGPR 1407
            P+ +CPI  D+ CP  GSKAYVIGHGLFGPR
Sbjct: 516  PDQLCPIKADYACPILGSKAYVIGHGLFGPR 546


>emb|CAN59793.1| hypothetical protein VITISV_001901 [Vitis vinifera]
          Length = 840

 Score =  395 bits (1016), Expect = e-107
 Identities = 237/499 (47%), Positives = 294/499 (58%), Gaps = 30/499 (6%)
 Frame = +1

Query: 1    LIPGAQIDVMIETKMVQGNNLNKRNEGTLPWIPSKLLALVDVPASSLALQSIIEASGGSP 180
            LI G QIDVM+E      NN  + ++    W+P +LLALVDVPA SLA+QSIIEAS GS 
Sbjct: 179  LIHGVQIDVMVEE-----NNSEEIDKKAPHWLPVQLLALVDVPAFSLAVQSIIEASSGSR 233

Query: 181  ERGSWEVGWSLALLNNGRQSQKDGLQTQ------------------------VGRDNRYP 288
            E+G W+VGWSLA       +  D +QTQ                        V  + +  
Sbjct: 234  EQG-WDVGWSLASYTGDSHTLVDAIQTQRTNQSFLAARQLYCKSTFVNEGKKVDCNAKSS 292

Query: 289  LENDSNSRQSESRKASSVSLSTIRIAFLGVPSVTTEGLPNINISLSSKRGDLLLAMGSPF 468
            +E   +    +S   S +  ST RIA LGV S+ ++ LPNI IS S+KRGDLLLAMGSPF
Sbjct: 293  IEGQRHFMVGDSSHPSLMGKSTARIALLGVSSINSKDLPNIAISPSNKRGDLLLAMGSPF 352

Query: 469  GVLSPVHFFNSIAVGSIANFCPSSFYSSSLLMADIRGLPGMEGGPVFNEHAHLIGILNRP 648
            GVLSPVHFFN  ++  +      S  +  L         GMEGGPVFNEHA LIGIL RP
Sbjct: 353  GVLSPVHFFNRSSLVHLVLLDSDSILTLYL--------SGMEGGPVFNEHAQLIGILTRP 404

Query: 649  LRQRAGGAEVQLVIPWEAIALAQSGLLQNGAPKTGVI--YKKDDIHAVGNMCSSNSPESD 822
            LRQ+ GGAE+QLVIPWEAI  A   LLQ      G +  Y + +++AVG     +  +SD
Sbjct: 405  LRQKTGGAEIQLVIPWEAIXTACCDLLQKEVQNEGEMKHYNRGNLNAVGKKYLFSGHDSD 464

Query: 823  ---SSVGKNQDSHHLTALRIEKAMASVALVTIDEEAWASGVILNNHGLILTNAHLLEPWR 993
               +S+ +  D        IEKAMAS+ LVTID+  WASGV+LN+ GLILTNAHLLEPWR
Sbjct: 465  GPFNSMHQQPDCCSPPLSLIEKAMASICLVTIDDGVWASGVVLNSQGLILTNAHLLEPWR 524

Query: 994  FGKTDVVGGDMTTSGPLP-VTFQNGMSMWQEGSKDQKERQSLLPNTVRNSDQSPGDEPTE 1170
            FGKT   GG       +P +  +  +    EG+   ++        + +           
Sbjct: 525  FGKTVARGGRCGAEPEIPFIPSEESVYCRDEGTYSHQKSPGFATKNIEDC---------- 574

Query: 1171 YKVSSFYRSYKRIRVRLDHMNPWIWCDARVVYISKGPLDIALLQLESFPEHVCPIIPDFT 1350
                   R ++ IR+RLDH +P IWCDARVVY+SKGPLDIALLQLE  P  +CPII DF 
Sbjct: 575  -------RGHRNIRIRLDHTDPRIWCDARVVYVSKGPLDIALLQLEFVPGQLCPIIMDFA 627

Query: 1351 CPSSGSKAYVIGHGLFGPR 1407
            CPS+GSKAYVIGHGLFGPR
Sbjct: 628  CPSAGSKAYVIGHGLFGPR 646

