BLASTX nr result

ID: Angelica22_contig00021849 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00021849
         (1331 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002533377.1| cysteine protease, putative [Ricinus communi...   511   e-142
ref|NP_567010.5| Papain family cysteine protease [Arabidopsis th...   507   e-141
ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vit...   502   e-140
ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arab...   501   e-139
ref|XP_002316398.1| predicted protein [Populus trichocarpa] gi|2...   501   e-139

>ref|XP_002533377.1| cysteine protease, putative [Ricinus communis]
            gi|223526784|gb|EEF29008.1| cysteine protease, putative
            [Ricinus communis]
          Length = 381

 Score =  511 bits (1315), Expect = e-142
 Identities = 245/367 (66%), Positives = 287/367 (78%), Gaps = 3/367 (0%)
 Frame = -3

Query: 1299 PLLTYALFGVLLTYAPTI--STTSNSNIRQVIDR-DFTGNNNNLIGTATERHFVSFMKKY 1129
            PL   A   + LT + T   +T  +  I QV D    T +N   +GT TE +F  FM KY
Sbjct: 15   PLAILAFTTLTLTTSATSGDATLQDPTILQVTDDPSVTLSNRKFLGTNTEENFKMFMIKY 74

Query: 1128 GKEYSTREEYMHRLGIFAKNMMWAAEHQALDPTAVHGVTQFSDLSEEEFETRFXXXXXXX 949
             KEY TREEYMHRLG+FAKN++ AAEHQ LDPTAVHG+T F DL+EEEFE  +       
Sbjct: 75   DKEYDTREEYMHRLGVFAKNLIRAAEHQVLDPTAVHGITPFMDLTEEEFERMYTGVVGGG 134

Query: 948  XXXXXXXGEAPVVDGKGLPESFDWREKGAVTPVKMQGSCGSCWAFSTTGSLEGANFIATG 769
                        ++  GLP SFDWR+KGAVT VKMQG+CGSCWAFSTTG++EGANFIATG
Sbjct: 135  AVGAEGVTATSFLETAGLPSSFDWRKKGAVTDVKMQGACGSCWAFSTTGAIEGANFIATG 194

Query: 768  KLTSLSEQQLVDCDHTCDAKDKSSCNDGCSGGLMTNAYEYLIKAGGIEEEDAYPYTGKRG 589
            KL +LSEQQLVDCD  CD K+K++C+DGC GGLMTNAY YLI+AGG+E+E +YPYTGK G
Sbjct: 195  KLLNLSEQQLVDCDRVCDIKEKTACDDGCGGGLMTNAYRYLIEAGGLEDEISYPYTGKPG 254

Query: 588  DCKFDPEKIAVRVTNFTNIPADEEQIAAHLVHHGPLAVGLNAVFMQTYIGGVSCPLICGK 409
             CKFD +KIAVRV NFT+IP DE QIAAHLVHHGPLA+GLNAVFMQTYIGGVSCPLICGK
Sbjct: 255  KCKFDEKKIAVRVVNFTSIPIDENQIAAHLVHHGPLAIGLNAVFMQTYIGGVSCPLICGK 314

Query: 408  RFLNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGENWGEQGYYRLCRGHNMCGMSSMVS 229
            +++NHGVLLVGYGAKGFSILRLG KPYWIIKNSWG+ WGE+GYYR+C+G+ MCGM  MVS
Sbjct: 315  KWINHGVLLVGYGAKGFSILRLGYKPYWIIKNSWGKRWGEEGYYRICKGYGMCGMDRMVS 374

Query: 228  AVMTKIS 208
            AV+T++S
Sbjct: 375  AVVTQVS 381


>ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
            gi|17979125|gb|AAL49820.1| putative cysteine proteinase
            [Arabidopsis thaliana] gi|332645795|gb|AEE79316.1| Papain
            family cysteine protease [Arabidopsis thaliana]
          Length = 367

 Score =  507 bits (1305), Expect = e-141
 Identities = 244/359 (67%), Positives = 286/359 (79%), Gaps = 1/359 (0%)
 Frame = -3

Query: 1281 LFGVLLTYAPTISTTSNSNIRQVIDRDFTGNNNNLIGTATERHFVSFMKKYGKEYSTREE 1102
            L   ++ +   +++  +  IRQV   D      NL+GT TE  F  FM  YGK YSTREE
Sbjct: 9    LITCIILFCHVVASVEDLTIRQVT-ADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREE 67

Query: 1101 YMHRLGIFAKNMMWAAEHQALDPTAVHGVTQFSDLSEEEFETRFXXXXXXXXXXXXXXG- 925
            Y+HRLGIFAKN++ AAEHQ +DP+AVHGVTQFSDL+EEEF+  +              G 
Sbjct: 68   YIHRLGIFAKNVLKAAEHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGSRGGTVGA 127

Query: 924  EAPVVDGKGLPESFDWREKGAVTPVKMQGSCGSCWAFSTTGSLEGANFIATGKLTSLSEQ 745
            EAP+V+  GLPE FDWREKG VT VK QG+CGSCWAFSTTG+ EGA+F++TGKL SLSEQ
Sbjct: 128  EAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQ 187

Query: 744  QLVDCDHTCDAKDKSSCNDGCSGGLMTNAYEYLIKAGGIEEEDAYPYTGKRGDCKFDPEK 565
            QLVDCD  CD KDK +C++GC GGLMTNAYEYL++AGG+EEE +YPYTGKRG CKFDPEK
Sbjct: 188  QLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPEK 247

Query: 564  IAVRVTNFTNIPADEEQIAAHLVHHGPLAVGLNAVFMQTYIGGVSCPLICGKRFLNHGVL 385
            +AVRV NFT IP DE QIAA+LV HGPLAVGLNAVFMQTYIGGVSCPLIC KR +NHGVL
Sbjct: 248  VAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRNVNHGVL 307

Query: 384  LVGYGAKGFSILRLGNKPYWIIKNSWGENWGEQGYYRLCRGHNMCGMSSMVSAVMTKIS 208
            LVGYG+KGFSILRL NKPYWIIKNSWG+ WGE GYY+LCRGH++CG++SMVSAV T++S
Sbjct: 308  LVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVATQVS 366


>ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
          Length = 375

 Score =  502 bits (1293), Expect = e-140
 Identities = 240/349 (68%), Positives = 282/349 (80%), Gaps = 4/349 (1%)
 Frame = -3

Query: 1242 TTSNSNIRQVIDRDFTGNNNNLIGT----ATERHFVSFMKKYGKEYSTREEYMHRLGIFA 1075
            T  + NI QV D    G+++   G      TE+ F  FM+KYGKEYS+REEY+HRLGIFA
Sbjct: 31   TPWDPNIVQVTD----GHSHRKFGVDGVLGTEKEFRMFMEKYGKEYSSREEYVHRLGIFA 86

Query: 1074 KNMMWAAEHQALDPTAVHGVTQFSDLSEEEFETRFXXXXXXXXXXXXXXGEAPVVDGKGL 895
            KNM+ AAEHQALDPTA+HGVT FSDLSEEEFE  F                A  ++  GL
Sbjct: 87   KNMVRAAEHQALDPTALHGVTPFSDLSEEEFERMFTGVVGRPHMKGGVAETAAALEVDGL 146

Query: 894  PESFDWREKGAVTPVKMQGSCGSCWAFSTTGSLEGANFIATGKLTSLSEQQLVDCDHTCD 715
            PESFDWREKGAVT VKMQG+CGSCWAFSTTG++EGA+FI+T KL +LSEQQLVDCDH CD
Sbjct: 147  PESFDWREKGAVTEVKMQGTCGSCWAFSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMCD 206

Query: 714  AKDKSSCNDGCSGGLMTNAYEYLIKAGGIEEEDAYPYTGKRGDCKFDPEKIAVRVTNFTN 535
             +DK++C+ GC GGLMTNAY+YLI+AGG+EEE +YPYTGK G+CKF P+++AVRV NFT 
Sbjct: 207  IRDKTACDSGCEGGLMTNAYKYLIEAGGLEEESSYPYTGKHGECKFKPDRVAVRVVNFTE 266

Query: 534  IPADEEQIAAHLVHHGPLAVGLNAVFMQTYIGGVSCPLICGKRFLNHGVLLVGYGAKGFS 355
            +P +E QIAA+LV HGPLAVGLNA+FMQTYIGGVSCPLIC KR++NHGVLLVGYGAKG+S
Sbjct: 267  VPINENQIAANLVCHGPLAVGLNAIFMQTYIGGVSCPLICPKRWINHGVLLVGYGAKGYS 326

Query: 354  ILRLGNKPYWIIKNSWGENWGEQGYYRLCRGHNMCGMSSMVSAVMTKIS 208
            ILR G KPYWIIKNSWG+ WGE GYYRLCRGH MCGM++MVSAV+T+ S
Sbjct: 327  ILRFGYKPYWIIKNSWGKRWGEHGYYRLCRGHGMCGMNTMVSAVVTQTS 375


>ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
            lyrata] gi|297322116|gb|EFH52537.1| hypothetical protein
            ARALYDRAFT_485911 [Arabidopsis lyrata subsp. lyrata]
          Length = 368

 Score =  501 bits (1291), Expect = e-139
 Identities = 244/360 (67%), Positives = 286/360 (79%), Gaps = 2/360 (0%)
 Frame = -3

Query: 1281 LFGVLLTYAPTISTTSNSNIRQVIDRDFTGNNNNLIGTATERHFVSFMKKYGKEYSTREE 1102
            L   ++ +   +++  +  IRQV   D      NL+GT TE  F  FM  YGK YSTREE
Sbjct: 9    LITCIIFFCHVVASVEDLTIRQVT-ADERRVRPNLLGTHTESKFRVFMSDYGKNYSTREE 67

Query: 1101 YMHRLGIFAKNMMWAAEHQALDPTAVHGVTQFSDLSEEEFETRFXXXXXXXXXXXXXXG- 925
            Y+HRLGIFAKN++ AAEHQ +DPTAVHGVTQFSDL+EEEF+  +              G 
Sbjct: 68   YIHRLGIFAKNVLKAAEHQMMDPTAVHGVTQFSDLTEEEFKRMYTGVADVGGSRGHAVGA 127

Query: 924  EAPVVDGKGLPESFDWREKGAVTPVKMQGSCGSCWAFSTTGSLEGANFIATGKLTSLSEQ 745
            EAP+V+  GLPE FDWREKG VT VK QG+CGSCWAFSTTG+ EGA+F++TGKL SLSEQ
Sbjct: 128  EAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQ 187

Query: 744  QLVDCDHT-CDAKDKSSCNDGCSGGLMTNAYEYLIKAGGIEEEDAYPYTGKRGDCKFDPE 568
            QLVDCD   CD KDK +C++GC GGLMTNAYEYL++AGG+EEE +YPYTGKRG CKFDPE
Sbjct: 188  QLVDCDQAVCDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPE 247

Query: 567  KIAVRVTNFTNIPADEEQIAAHLVHHGPLAVGLNAVFMQTYIGGVSCPLICGKRFLNHGV 388
            K+AVRV NFT IP DE+QIAA+LV  GPLAVGLNAVFMQTYIGGVSCPLIC KR +NHGV
Sbjct: 248  KVAVRVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQTYIGGVSCPLICSKRKVNHGV 307

Query: 387  LLVGYGAKGFSILRLGNKPYWIIKNSWGENWGEQGYYRLCRGHNMCGMSSMVSAVMTKIS 208
            LLVGYG+KGFSILRL NKPYWIIKNSWG+ WGE GYY+LCRGH++CG++SMVSAV T++S
Sbjct: 308  LLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVATQVS 367


>ref|XP_002316398.1| predicted protein [Populus trichocarpa] gi|222865438|gb|EEF02569.1|
            predicted protein [Populus trichocarpa]
          Length = 327

 Score =  501 bits (1291), Expect = e-139
 Identities = 235/328 (71%), Positives = 272/328 (82%)
 Frame = -3

Query: 1191 NNNNLIGTATERHFVSFMKKYGKEYSTREEYMHRLGIFAKNMMWAAEHQALDPTAVHGVT 1012
            N  NL+GT  E  F  F+K++ KEY+TREEY+HR GIF KN++ A EHQALDPTA+HGVT
Sbjct: 3    NGLNLLGT--EEKFKMFIKEHNKEYATREEYVHRFGIFGKNLIRAVEHQALDPTAIHGVT 60

Query: 1011 QFSDLSEEEFETRFXXXXXXXXXXXXXXGEAPVVDGKGLPESFDWREKGAVTPVKMQGSC 832
             F DL+EEEFE R               G    +D  GLP+SFDWREKGAVT VK+QGSC
Sbjct: 61   PFMDLTEEEFE-RMYAGVLGGGTVPVEKGSVSFMDASGLPDSFDWREKGAVTDVKIQGSC 119

Query: 831  GSCWAFSTTGSLEGANFIATGKLTSLSEQQLVDCDHTCDAKDKSSCNDGCSGGLMTNAYE 652
            GSCWAFSTTGS+EGANFIATGKL +LSEQQLVDCD  CD  DK+SC+DGC GGLMTNAY 
Sbjct: 120  GSCWAFSTTGSVEGANFIATGKLLNLSEQQLVDCDRVCDKTDKASCDDGCGGGLMTNAYR 179

Query: 651  YLIKAGGIEEEDAYPYTGKRGDCKFDPEKIAVRVTNFTNIPADEEQIAAHLVHHGPLAVG 472
            YLI+AGG++EE +YPYTGK G+CKFDPEKIAV+V NFT+I  DE QIAA+LVHHGPLA+G
Sbjct: 180  YLIEAGGLQEESSYPYTGKSGECKFDPEKIAVKVANFTSIAVDENQIAANLVHHGPLAIG 239

Query: 471  LNAVFMQTYIGGVSCPLICGKRFLNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGENWG 292
            LNA+FMQTYIGGVSCPLICGK++LNHGVLLVGYGA+G+SILR G KPYWIIKNSWG +WG
Sbjct: 240  LNAIFMQTYIGGVSCPLICGKKWLNHGVLLVGYGARGYSILRFGYKPYWIIKNSWGNHWG 299

Query: 291  EQGYYRLCRGHNMCGMSSMVSAVMTKIS 208
            E+GYYRLCRGH MCGM+ MVSAV+TK++
Sbjct: 300  EKGYYRLCRGHGMCGMNKMVSAVVTKVA 327


Top