BLASTX nr result

ID: Cephaelis21_contig00003787 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00003787
         (1420 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002518705.1| cysteine protease, putative [Ricinus communi...   441   e-177
ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [V...   443   e-173
ref|XP_002313136.1| predicted protein [Populus trichocarpa] gi|2...   437   e-172
ref|XP_002298740.1| predicted protein [Populus trichocarpa] gi|2...   431   e-172
gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]            431   e-169

>ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
            gi|223542086|gb|EEF43630.1| cysteine protease, putative
            [Ricinus communis]
          Length = 471

 Score =  441 bits (1133), Expect(2) = e-177
 Identities = 203/289 (70%), Positives = 229/289 (79%), Gaps = 6/289 (2%)
 Frame = +2

Query: 572  GSCWAFSTIAAVEGINQIATGELITLSEQELVDCDTAYNQGCNGGLMDYAFDFIVKNGGI 751
            GSCWAFST+ AVEGINQI TGELI+LSEQELVDCD +YNQGCNGGLMDYAF+FI+ NGGI
Sbjct: 160  GSCWAFSTVGAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGI 219

Query: 752  DTEEDYTYTARDDMCDQYRKNARVVSIDGYEDVHGNDEKSLKKALAHQPISVAIEAGGRA 931
            DTEEDY Y A D++CD  RKNA+VV+IDGYEDV  NDE SLKKA+AHQP+SVAIEAGGRA
Sbjct: 220  DTEEDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRA 279

Query: 932  FQLYHSGVFTGRCGTQLDHGVVAVGYGTENGVDYWIVRNSWGPSWGENGYIRLERNLANT 1111
            FQLY SGVFTGRCGT+LDHGVVAVGYGTENGV+YWIVRNSWG +WGE+GYIR+ERN+ANT
Sbjct: 280  FQLYKSGVFTGRCGTELDHGVVAVGYGTENGVNYWIVRNSWGSAWGESGYIRMERNVANT 339

Query: 1112 TTGKCGIAIEPSY------XXXXXXXXXXXXXXXXXXXXXXXXCDDYFSCPEGSTCCCIY 1273
             TGKCGIAI+PSY                              CDDYFSCP+G+TCCCIY
Sbjct: 340  KTGKCGIAIQPSYPTKKGANPPNPGPSPPSPVNPPPPVSPSTVCDDYFSCPDGNTCCCIY 399

Query: 1274 QYGSYCFGWGCCPLEAATXXXXXXXXXXXXYPICDVDAGTCQMSKDNPL 1420
            +Y  YCFGWGCCPLE+AT            YP+CD+ AGTC++SKDNPL
Sbjct: 400  EYSGYCFGWGCCPLESATCCDDHNSCCPHEYPVCDLKAGTCRLSKDNPL 448



 Score =  208 bits (529), Expect(2) = e-177
 Identities = 96/142 (67%), Positives = 117/142 (82%), Gaps = 1/142 (0%)
 Frame = +1

Query: 97  DMSIIDYNINHGGVYPTG-EHEVKALYESWLVKHGKAYNALGEREKRFEIFKDNIQFINE 273
           DMSI+DYNI HG  YP   + +V+ +YE WLV+HGKAYNALGE+EKRFEIFKDN++FI+E
Sbjct: 25  DMSIVDYNIKHGTKYPLRTDSQVRRMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDE 84

Query: 274 HNAQDGTYKLGLNRFADLTNEEYRSLFVSGRMDRAGRLTSPRSNDRYAFRTGDDLPESVD 453
           HN+ D +YK+GLNRFADLTNEEY+++F+  +M+R  R    RS  RY F+ GDDLPE+VD
Sbjct: 85  HNSVDRSYKVGLNRFADLTNEEYKAMFLGTKMERKNRFLGTRS-QRYLFKDGDDLPENVD 143

Query: 454 WRAKGAVPPVKDQGQCGSCWAF 519
           WR KGAV PVKDQGQCGSCWAF
Sbjct: 144 WREKGAVVPVKDQGQCGSCWAF 165


>ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 469

 Score =  443 bits (1140), Expect(2) = e-173
 Identities = 206/283 (72%), Positives = 226/283 (79%)
 Frame = +2

Query: 572  GSCWAFSTIAAVEGINQIATGELITLSEQELVDCDTAYNQGCNGGLMDYAFDFIVKNGGI 751
            GSCWAFSTIAAVEGINQIATG+LI+LSEQELVDCD +YNQGCNGGLMDYAF+FI+ NGGI
Sbjct: 164  GSCWAFSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGI 223

Query: 752  DTEEDYTYTARDDMCDQYRKNARVVSIDGYEDVHGNDEKSLKKALAHQPISVAIEAGGRA 931
            D+EEDY Y A D  CD  RKNARVVSIDGYEDV  NDE+SLKKA+A+QP+SVAIEAGGRA
Sbjct: 224  DSEEDYPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRA 283

Query: 932  FQLYHSGVFTGRCGTQLDHGVVAVGYGTENGVDYWIVRNSWGPSWGENGYIRLERNLANT 1111
            FQLY SGVFTG+CGTQLDHGVVAVGYGTEN VDYWIVRNSWGP+WGE+GYI+LERNLA T
Sbjct: 284  FQLYQSGVFTGQCGTQLDHGVVAVGYGTENSVDYWIVRNSWGPNWGESGYIKLERNLAGT 343

Query: 1112 TTGKCGIAIEPSYXXXXXXXXXXXXXXXXXXXXXXXXCDDYFSCPEGSTCCCIYQYGSYC 1291
             TGKCGIAIEPSY                        CD+Y++CPE STCCCIY+Y  +C
Sbjct: 344  ETGKCGIAIEPSYPIKNGQNPPNPGPSPPSPSKPSVVCDEYYTCPEESTCCCIYEYAGFC 403

Query: 1292 FGWGCCPLEAATXXXXXXXXXXXXYPICDVDAGTCQMSKDNPL 1420
            F WGCCPLE AT            YP+CDVDAGTCQMSK NPL
Sbjct: 404  FEWGCCPLEGATCCDDHYSCCPHEYPVCDVDAGTCQMSKGNPL 446



 Score =  192 bits (487), Expect(2) = e-173
 Identities = 93/141 (65%), Positives = 110/141 (78%)
 Frame = +1

Query: 97  DMSIIDYNINHGGVYPTGEHEVKALYESWLVKHGKAYNALGEREKRFEIFKDNIQFINEH 276
           DMSII Y      +    + EV A+YE+WLVKHGK+YNALGERE+RFEIFKDN++FI EH
Sbjct: 32  DMSIISYGDR---LEKRTDAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEH 88

Query: 277 NAQDGTYKLGLNRFADLTNEEYRSLFVSGRMDRAGRLTSPRSNDRYAFRTGDDLPESVDW 456
           NA + TYK+GLNRFADLTNEEYRS ++  R +    L + R +DRY+FR G+DLPESVDW
Sbjct: 89  NAVNRTYKVGLNRFADLTNEEYRSRYLGRRDETRRGLRASRVSDRYSFRAGEDLPESVDW 148

Query: 457 RAKGAVPPVKDQGQCGSCWAF 519
           R KGAV PVKDQG CGSCWAF
Sbjct: 149 REKGAVVPVKDQGNCGSCWAF 169


>ref|XP_002313136.1| predicted protein [Populus trichocarpa] gi|222849544|gb|EEE87091.1|
            predicted protein [Populus trichocarpa]
          Length = 477

 Score =  437 bits (1125), Expect(2) = e-172
 Identities = 203/286 (70%), Positives = 225/286 (78%), Gaps = 4/286 (1%)
 Frame = +2

Query: 572  GSCWAFSTIAAVEGINQIATGELITLSEQELVDCDTAYNQGCNGGLMDYAFDFIVKNGGI 751
            GSCWAFST+ AVEGINQI TG L +LSEQELVDCD  YNQGCNGGLMDYAF+FI+KNGGI
Sbjct: 160  GSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKVYNQGCNGGLMDYAFEFIMKNGGI 219

Query: 752  DTEEDYTYTARDDMCDQYRKNARVVSIDGYEDVHGNDEKSLKKALAHQPISVAIEAGGRA 931
            DTEEDY Y A D MCD  RKNARVV+IDGYEDV  NDEKSL+KA+A+QP+SVAIEAGGRA
Sbjct: 220  DTEEDYPYKAVDSMCDPNRKNARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRA 279

Query: 932  FQLYHSGVFTGRCGTQLDHGVVAVGYGTENGVDYWIVRNSWGPSWGENGYIRLERNLANT 1111
            FQLY SGVFTG CGTQLDHGVVAVGYGTENGVDYW+VRNSWGP+WGENGYIR+ERN+A+T
Sbjct: 280  FQLYQSGVFTGSCGTQLDHGVVAVGYGTENGVDYWVVRNSWGPAWGENGYIRMERNVAST 339

Query: 1112 TTGKCGIAIEPSY----XXXXXXXXXXXXXXXXXXXXXXXXCDDYFSCPEGSTCCCIYQY 1279
             TGKCGIA+E SY                            CDDY+SCP GSTCCCIY Y
Sbjct: 340  ETGKCGIAMEASYPTKKGANPPNPGPSPPSPVNPSPPPSSECDDYYSCPAGSTCCCIYPY 399

Query: 1280 GSYCFGWGCCPLEAATXXXXXXXXXXXXYPICDVDAGTCQMSKDNP 1417
            G YCFGWGCCPLE+AT            YP+CD++AGTC+MSK+NP
Sbjct: 400  GDYCFGWGCCPLESATCCDDHNSCCPHEYPVCDLEAGTCRMSKNNP 445



 Score =  196 bits (499), Expect(2) = e-172
 Identities = 93/142 (65%), Positives = 111/142 (78%), Gaps = 1/142 (0%)
 Frame = +1

Query: 97  DMSIIDYNINHGGVYPTGEHEVKALYESWLVKHGKAYNALGEREKRFEIFKDNIQFINEH 276
           DMSIIDYN+ HG V    E E   LYE WLVK+GKAYNALGE+E+RFEIFKDN++F+++H
Sbjct: 24  DMSIIDYNLKHGQVPERTEAETLRLYEMWLVKYGKAYNALGEKERRFEIFKDNLKFVDQH 83

Query: 277 NAQDG-TYKLGLNRFADLTNEEYRSLFVSGRMDRAGRLTSPRSNDRYAFRTGDDLPESVD 453
           N+    +YKLGLN+FADL+NEEYR+ ++  RMD   RL     + RY F+ GDDLPESVD
Sbjct: 84  NSVGNPSYKLGLNKFADLSNEEYRAAYLGTRMDGKRRLLGGPKSARYLFKDGDDLPESVD 143

Query: 454 WRAKGAVPPVKDQGQCGSCWAF 519
           WR KGAV PVKDQGQCGSCWAF
Sbjct: 144 WREKGAVAPVKDQGQCGSCWAF 165


>ref|XP_002298740.1| predicted protein [Populus trichocarpa] gi|222845998|gb|EEE83545.1|
            predicted protein [Populus trichocarpa]
          Length = 455

 Score =  431 bits (1109), Expect(2) = e-172
 Identities = 201/288 (69%), Positives = 222/288 (77%), Gaps = 6/288 (2%)
 Frame = +2

Query: 572  GSCWAFSTIAAVEGINQIATGELITLSEQELVDCDTAYNQGCNGGLMDYAFDFIVKNGGI 751
            GSCWAFST+ AVEGINQI TG L +LSEQELVDCD  YN GCNGGLMDYAFDFI++NGGI
Sbjct: 136  GSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFDFIIENGGI 195

Query: 752  DTEEDYTYTARDDMCDQYRKNARVVSIDGYEDVHGNDEKSLKKALAHQPISVAIEAGGRA 931
            DTEEDY Y A D MCD  RKNARVV+IDGYEDV  NDEKSLKKA+A+QP+SVAIEAGGR 
Sbjct: 196  DTEEDYPYKAIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAIEAGGRG 255

Query: 932  FQLYHSGVFTGRCGTQLDHGVVAVGYGTENGVDYWIVRNSWGPSWGENGYIRLERNLANT 1111
            FQLY SGVFTG CGTQLDHGVV VGYGTE+GVDYWIVRNSWGP+WGENGYIR+ER++A+T
Sbjct: 256  FQLYQSGVFTGSCGTQLDHGVVTVGYGTEHGVDYWIVRNSWGPAWGENGYIRMERDVAST 315

Query: 1112 TTGKCGIAIEPSY------XXXXXXXXXXXXXXXXXXXXXXXXCDDYFSCPEGSTCCCIY 1273
             TGKCGIA+E SY                              CDDY+SCP GSTCCCIY
Sbjct: 316  ETGKCGIAMEASYPTKKSANPPNPGPSPPSPVNPPPPEKPSSECDDYYSCPAGSTCCCIY 375

Query: 1274 QYGSYCFGWGCCPLEAATXXXXXXXXXXXXYPICDVDAGTCQMSKDNP 1417
            QYG YCFGWGCCPLE+AT            YP+CD++AGTC+MSK NP
Sbjct: 376  QYGDYCFGWGCCPLESATCCDDHNSCCPHEYPVCDLEAGTCRMSKSNP 423



 Score =  202 bits (515), Expect(2) = e-172
 Identities = 94/141 (66%), Positives = 114/141 (80%), Gaps = 1/141 (0%)
 Frame = +1

Query: 100 MSIIDYNINHGGVYPTGEHEVKALYESWLVKHGKAYNALGEREKRFEIFKDNIQFINEHN 279
           MSIIDYNI HG V    E E + +YE WLVKHG+AYNALGE+E+RFEIFKDN++FI+EHN
Sbjct: 1   MSIIDYNIKHGQVPERTEAETRRIYEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHN 60

Query: 280 AQDG-TYKLGLNRFADLTNEEYRSLFVSGRMDRAGRLTSPRSNDRYAFRTGDDLPESVDW 456
           +    +YKLGLN+FADL+N+EYRS+++  RMD  GRL     ++RY F+ GDDLPE+VDW
Sbjct: 61  SVGNPSYKLGLNKFADLSNDEYRSVYLGTRMDGKGRLLGGPKSERYLFKEGDDLPETVDW 120

Query: 457 RAKGAVPPVKDQGQCGSCWAF 519
           R KGAV PVKDQGQCGSCWAF
Sbjct: 121 REKGAVAPVKDQGQCGSCWAF 141


>gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
          Length = 463

 Score =  431 bits (1109), Expect(2) = e-169
 Identities = 201/283 (71%), Positives = 219/283 (77%)
 Frame = +2

Query: 572  GSCWAFSTIAAVEGINQIATGELITLSEQELVDCDTAYNQGCNGGLMDYAFDFIVKNGGI 751
            GSCWAFSTI+AVEGINQI TGELI+LSEQELVDCD +YN GCNGGLMDY F FI+ NGGI
Sbjct: 156  GSCWAFSTISAVEGINQIVTGELISLSEQELVDCDKSYNMGCNGGLMDYGFQFIINNGGI 215

Query: 752  DTEEDYTYTARDDMCDQYRKNARVVSIDGYEDVHGNDEKSLKKALAHQPISVAIEAGGRA 931
            DTEEDY Y A D  CDQ+RKNARVVSI+GYEDV  +DE SLKKA+A+QP+SVAIEAGGRA
Sbjct: 216  DTEEDYPYRAVDGTCDQFRKNARVVSINGYEDVPEDDENSLKKAVANQPVSVAIEAGGRA 275

Query: 932  FQLYHSGVFTGRCGTQLDHGVVAVGYGTENGVDYWIVRNSWGPSWGENGYIRLERNLANT 1111
            FQLY SGVFTG CGT LDHGVVAVGYGTENGVDYW VRNSWGP WGENGYI+LERN+ N 
Sbjct: 276  FQLYESGVFTGHCGTNLDHGVVAVGYGTENGVDYWTVRNSWGPKWGENGYIKLERNI-NA 334

Query: 1112 TTGKCGIAIEPSYXXXXXXXXXXXXXXXXXXXXXXXXCDDYFSCPEGSTCCCIYQYGSYC 1291
            T+GKCGIA   SY                        CDDY+SCPEGSTCCC+YQYG +C
Sbjct: 335  TSGKCGIASMASYPTKTGSNPPNPGPSPPTPVNPPTVCDDYYSCPEGSTCCCVYQYGDFC 394

Query: 1292 FGWGCCPLEAATXXXXXXXXXXXXYPICDVDAGTCQMSKDNPL 1420
             GWGCCPLE+AT            YPICD+D GTC MSKDNPL
Sbjct: 395  IGWGCCPLESATCCDDHSSCCPHEYPICDLDGGTCLMSKDNPL 437



 Score =  192 bits (488), Expect(2) = e-169
 Identities = 92/141 (65%), Positives = 109/141 (77%)
 Frame = +1

Query: 97  DMSIIDYNINHGGVYPTGEHEVKALYESWLVKHGKAYNALGEREKRFEIFKDNIQFINEH 276
           DMSII Y+  H       + E  A+YE WL  HGKAYNA+GE+E+RFEIFKDN++F++EH
Sbjct: 24  DMSIISYDQTHPP--QRTDAEAMAIYEKWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEH 81

Query: 277 NAQDGTYKLGLNRFADLTNEEYRSLFVSGRMDRAGRLTSPRSNDRYAFRTGDDLPESVDW 456
           NA  G+Y++GLNRFADLTNEEYRS+F+ G M+   R  S +S DRYAFR GD LP SVDW
Sbjct: 82  NAVAGSYRVGLNRFADLTNEEYRSMFLGGNMEMKERSASTKS-DRYAFRAGDKLPGSVDW 140

Query: 457 RAKGAVPPVKDQGQCGSCWAF 519
           R KGAV PVKDQGQCGSCWAF
Sbjct: 141 REKGAVSPVKDQGQCGSCWAF 161

