BLASTX nr result

ID: Lithospermum22_contig00021415

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum22_contig00021415
         (1439 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]            582   e-163
dbj|BAD29954.1| cysteine protease [Daucus carota]                     570   e-160
dbj|BAD29956.1| cysteine protease [Daucus carota]                     570   e-160
ref|XP_002518705.1| cysteine protease, putative [Ricinus communi...   567   e-159
ref|XP_002313136.1| predicted protein [Populus trichocarpa] gi|2...   567   e-159

>gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
          Length = 463

 Score =  582 bits (1499), Expect = e-163
 Identities = 267/357 (74%), Positives = 301/357 (84%), Gaps = 2/357 (0%)
 Frame = -1

Query: 1439 NRFADLTNEEYRNMYVGGRMNRKK--VGTKNQIYGFEAGDELPEAVDWRQKGAVAPVKDQ 1266
            NRFADLTNEEYR+M++GG M  K+    TK+  Y F AGD+LP +VDWR+KGAV+PVKDQ
Sbjct: 93   NRFADLTNEEYRSMFLGGNMEMKERSASTKSDRYAFRAGDKLPGSVDWREKGAVSPVKDQ 152

Query: 1265 GQCGSCWAFSTVGAVEGINQIVTGDLITLSEQELVDCDKSYNQGCNGGLMDYAFDFIIKN 1086
            GQCGSCWAFST+ AVEGINQIVTG+LI+LSEQELVDCDKSYN GCNGGLMDY F FII N
Sbjct: 153  GQCGSCWAFSTISAVEGINQIVTGELISLSEQELVDCDKSYNMGCNGGLMDYGFQFIINN 212

Query: 1085 GGIDTEEDYPYHASDGTCDQYRKNARVVSIDGFEDVPENDERSLQKAVAHQPVSVAIEAG 906
            GGIDTEEDYPY A DGTCDQ+RKNARVVSI+G+EDVPE+DE SL+KAVA+QPVSVAIEAG
Sbjct: 213  GGIDTEEDYPYRAVDGTCDQFRKNARVVSINGYEDVPEDDENSLKKAVANQPVSVAIEAG 272

Query: 905  GRDFQLYESGVFTGSCGTQLDHGVIAVGYGTENGKDYWIVRNSWGPEWGEHGYIKLERNV 726
            GR FQLYESGVFTG CGT LDHGV+AVGYGTENG DYW VRNSWGP+WGE+GYIKLERN+
Sbjct: 273  GRAFQLYESGVFTGHCGTNLDHGVVAVGYGTENGVDYWTVRNSWGPKWGENGYIKLERNI 332

Query: 725  ANITTGKCGIAIEASYPTKTGQNXXXXXXXXXXXXXXXSVCDDYFTCPERSTCCCMFQFG 546
             N T+GKCGIA  ASYPTKTG N               +VCDDY++CPE STCCC++Q+G
Sbjct: 333  -NATSGKCGIASMASYPTKTGSNPPNPGPSPPTPVNPPTVCDDYYSCPEGSTCCCVYQYG 391

Query: 545  DFCFAWGCCPAESAVCCDDHYSCCPHDYPICDEERETCLMSKDSPMQVKALKRHPAK 375
            DFC  WGCCP ESA CCDDH SCCPH+YPICD +  TCLMSKD+P+ VKALKR PA+
Sbjct: 392  DFCIGWGCCPLESATCCDDHSSCCPHEYPICDLDGGTCLMSKDNPLGVKALKRGPAR 448


>dbj|BAD29954.1| cysteine protease [Daucus carota]
          Length = 474

 Score =  570 bits (1470), Expect = e-160
 Identities = 253/358 (70%), Positives = 303/358 (84%), Gaps = 4/358 (1%)
 Frame = -1

Query: 1439 NRFADLTNEEYRNMYVGGRMNRKKV----GTKNQIYGFEAGDELPEAVDWRQKGAVAPVK 1272
            N+FADLTN+EYR++Y+ G+M +++     G ++  + FE GD LPE+VDWR +GAVAPVK
Sbjct: 107  NKFADLTNDEYRSLYLSGKMMKRERKNEDGFRSDRFVFEDGDHLPESVDWRDRGAVAPVK 166

Query: 1271 DQGQCGSCWAFSTVGAVEGINQIVTGDLITLSEQELVDCDKSYNQGCNGGLMDYAFDFII 1092
            DQGQCGSCWAFSTVGAVEGIN+IVTG+LI+LSEQELVDCD  YNQGCNGGLMDYAF+FI+
Sbjct: 167  DQGQCGSCWAFSTVGAVEGINKIVTGELISLSEQELVDCDNGYNQGCNGGLMDYAFEFIV 226

Query: 1091 KNGGIDTEEDYPYHASDGTCDQYRKNARVVSIDGFEDVPENDERSLQKAVAHQPVSVAIE 912
            KNGGIDTE+DYPY   DG CDQ RKNA+VV+I+G+EDVP NDE+SL+KAVAHQPVSVAIE
Sbjct: 227  KNGGIDTEDDYPYKGVDGLCDQNRKNAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIE 286

Query: 911  AGGRDFQLYESGVFTGSCGTQLDHGVIAVGYGTENGKDYWIVRNSWGPEWGEHGYIKLER 732
            AGGR FQLYESGVFTG CGT+LDHGV+AVGYG+ENGKDYWIVRNSWGP+WGE GYI+LER
Sbjct: 287  AGGRAFQLYESGVFTGQCGTELDHGVVAVGYGSENGKDYWIVRNSWGPDWGESGYIRLER 346

Query: 731  NVANITTGKCGIAIEASYPTKTGQNXXXXXXXXXXXXXXXSVCDDYFTCPERSTCCCMFQ 552
            NVA+ +TGKCGIA++ASYPTKTG N               +VCDDY++CPE +TCCC+++
Sbjct: 347  NVASTSTGKCGIAMQASYPTKTGDNPPKPGPSPPSPVKPQTVCDDYYSCPESTTCCCLYE 406

Query: 551  FGDFCFAWGCCPAESAVCCDDHYSCCPHDYPICDEERETCLMSKDSPMQVKALKRHPA 378
             G +CF WGCCP  SA CCDDHYSCCP ++P+CD +  TCLMSKD+P+ VKAL+R PA
Sbjct: 407  IGQYCFGWGCCPLASATCCDDHYSCCPQEFPVCDLDAGTCLMSKDNPIGVKALERRPA 464


>dbj|BAD29956.1| cysteine protease [Daucus carota]
          Length = 423

 Score =  570 bits (1469), Expect = e-160
 Identities = 251/359 (69%), Positives = 301/359 (83%)
 Frame = -1

Query: 1439 NRFADLTNEEYRNMYVGGRMNRKKVGTKNQIYGFEAGDELPEAVDWRQKGAVAPVKDQGQ 1260
            N+FADL+NEEY++M++GGRM R + G ++  + +  GDELP++VDWR+KGAVAPVKDQGQ
Sbjct: 54   NKFADLSNEEYKSMFLGGRMVRDRKGFESDRFKYGVGDELPQSVDWREKGAVAPVKDQGQ 113

Query: 1259 CGSCWAFSTVGAVEGINQIVTGDLITLSEQELVDCDKSYNQGCNGGLMDYAFDFIIKNGG 1080
            CGSCWAFSTV AVEGINQI TGDLI+LSEQELVDCDK +NQGCNGG MDYAF+FI+KNGG
Sbjct: 114  CGSCWAFSTVAAVEGINQIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKNGG 173

Query: 1079 IDTEEDYPYHASDGTCDQYRKNARVVSIDGFEDVPENDERSLQKAVAHQPVSVAIEAGGR 900
            IDTE+DYPY   DG CDQ RKNA+VV+I+GFEDVP+NDE+SL+KAVAHQPVSVAIEAGGR
Sbjct: 174  IDTEDDYPYKGVDGQCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGR 233

Query: 899  DFQLYESGVFTGSCGTQLDHGVIAVGYGTENGKDYWIVRNSWGPEWGEHGYIKLERNVAN 720
             FQLYESG+F G CGT LDHGV+AVGYGTE+GKDYWIVRNSWGP WGE+GYI+LERNVA+
Sbjct: 234  AFQLYESGIFNGLCGTDLDHGVVAVGYGTEDGKDYWIVRNSWGPNWGENGYIRLERNVAS 293

Query: 719  ITTGKCGIAIEASYPTKTGQNXXXXXXXXXXXXXXXSVCDDYFTCPERSTCCCMFQFGDF 540
              TGKCGIA++ SYPTKTG N               SVCDDY+TCP  +TCCC++++G +
Sbjct: 294  TNTGKCGIAMQPSYPTKTGVNPPKPGPSPPSPVKPQSVCDDYYTCPASTTCCCVYEYGKY 353

Query: 539  CFAWGCCPAESAVCCDDHYSCCPHDYPICDEERETCLMSKDSPMQVKALKRHPAKLMWS 363
            CF WGCCP E+A CCDDH SCCP +YP+CD   +TC +SK+SP+ +KALKR PA+  W+
Sbjct: 354  CFGWGCCPLEAATCCDDHSSCCPQEYPVCDINAQTCRLSKNSPIGIKALKRSPARPNWT 412


>ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
            gi|223542086|gb|EEF43630.1| cysteine protease, putative
            [Ricinus communis]
          Length = 471

 Score =  567 bits (1461), Expect = e-159
 Identities = 263/375 (70%), Positives = 305/375 (81%), Gaps = 8/375 (2%)
 Frame = -1

Query: 1439 NRFADLTNEEYRNMYVGGRMNRKK--VGTKNQIYGFEAGDELPEAVDWRQKGAVAPVKDQ 1266
            NRFADLTNEEY+ M++G +M RK   +GT++Q Y F+ GD+LPE VDWR+KGAV PVKDQ
Sbjct: 97   NRFADLTNEEYKAMFLGTKMERKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQ 156

Query: 1265 GQCGSCWAFSTVGAVEGINQIVTGDLITLSEQELVDCDKSYNQGCNGGLMDYAFDFIIKN 1086
            GQCGSCWAFSTVGAVEGINQIVTG+LI+LSEQELVDCDKSYNQGCNGGLMDYAF+FII N
Sbjct: 157  GQCGSCWAFSTVGAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINN 216

Query: 1085 GGIDTEEDYPYHASDGTCDQYRKNARVVSIDGFEDVPENDERSLQKAVAHQPVSVAIEAG 906
            GGIDTEEDYPY ASD  CD  RKNA+VV+IDG+EDVPENDE SL+KAVAHQPVSVAIEAG
Sbjct: 217  GGIDTEEDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAG 276

Query: 905  GRDFQLYESGVFTGSCGTQLDHGVIAVGYGTENGKDYWIVRNSWGPEWGEHGYIKLERNV 726
            GR FQLY+SGVFTG CGT+LDHGV+AVGYGTENG +YWIVRNSWG  WGE GYI++ERNV
Sbjct: 277  GRAFQLYKSGVFTGRCGTELDHGVVAVGYGTENGVNYWIVRNSWGSAWGESGYIRMERNV 336

Query: 725  ANITTGKCGIAIEASYPTKTGQN------XXXXXXXXXXXXXXXSVCDDYFTCPERSTCC 564
            AN  TGKCGIAI+ SYPTK G N                     +VCDDYF+CP+ +TCC
Sbjct: 337  ANTKTGKCGIAIQPSYPTKKGANPPNPGPSPPSPVNPPPPVSPSTVCDDYFSCPDGNTCC 396

Query: 563  CMFQFGDFCFAWGCCPAESAVCCDDHYSCCPHDYPICDEERETCLMSKDSPMQVKALKRH 384
            C++++  +CF WGCCP ESA CCDDH SCCPH+YP+CD +  TC +SKD+P+ VKAL+R 
Sbjct: 397  CIYEYSGYCFGWGCCPLESATCCDDHNSCCPHEYPVCDLKAGTCRLSKDNPLGVKALRRG 456

Query: 383  PAKLMWSSLVGGKIS 339
            PAK   + L G  I+
Sbjct: 457  PAKRTHTHLNGINIA 471


>ref|XP_002313136.1| predicted protein [Populus trichocarpa] gi|222849544|gb|EEE87091.1|
            predicted protein [Populus trichocarpa]
          Length = 477

 Score =  567 bits (1461), Expect = e-159
 Identities = 264/372 (70%), Positives = 305/372 (81%), Gaps = 7/372 (1%)
 Frame = -1

Query: 1439 NRFADLTNEEYRNMYVGGRMNRKKV---GTKNQIYGFEAGDELPEAVDWRQKGAVAPVKD 1269
            N+FADL+NEEYR  Y+G RM+ K+    G K+  Y F+ GD+LPE+VDWR+KGAVAPVKD
Sbjct: 96   NKFADLSNEEYRAAYLGTRMDGKRRLLGGPKSARYLFKDGDDLPESVDWREKGAVAPVKD 155

Query: 1268 QGQCGSCWAFSTVGAVEGINQIVTGDLITLSEQELVDCDKSYNQGCNGGLMDYAFDFIIK 1089
            QGQCGSCWAFSTVGAVEGINQIVTG+L +LSEQELVDCDK YNQGCNGGLMDYAF+FI+K
Sbjct: 156  QGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKVYNQGCNGGLMDYAFEFIMK 215

Query: 1088 NGGIDTEEDYPYHASDGTCDQYRKNARVVSIDGFEDVPENDERSLQKAVAHQPVSVAIEA 909
            NGGIDTEEDYPY A D  CD  RKNARVV+IDG+EDVP+NDE+SL+KAVA+QPVSVAIEA
Sbjct: 216  NGGIDTEEDYPYKAVDSMCDPNRKNARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEA 275

Query: 908  GGRDFQLYESGVFTGSCGTQLDHGVIAVGYGTENGKDYWIVRNSWGPEWGEHGYIKLERN 729
            GGR FQLY+SGVFTGSCGTQLDHGV+AVGYGTENG DYW+VRNSWGP WGE+GYI++ERN
Sbjct: 276  GGRAFQLYQSGVFTGSCGTQLDHGVVAVGYGTENGVDYWVVRNSWGPAWGENGYIRMERN 335

Query: 728  VANITTGKCGIAIEASYPTKTGQN----XXXXXXXXXXXXXXXSVCDDYFTCPERSTCCC 561
            VA+  TGKCGIA+EASYPTK G N                   S CDDY++CP  STCCC
Sbjct: 336  VASTETGKCGIAMEASYPTKKGANPPNPGPSPPSPVNPSPPPSSECDDYYSCPAGSTCCC 395

Query: 560  MFQFGDFCFAWGCCPAESAVCCDDHYSCCPHDYPICDEERETCLMSKDSPMQVKALKRHP 381
            ++ +GD+CF WGCCP ESA CCDDH SCCPH+YP+CD E  TC MSK++P  VKAL R P
Sbjct: 396  IYPYGDYCFGWGCCPLESATCCDDHNSCCPHEYPVCDLEAGTCRMSKNNPFGVKALTRAP 455

Query: 380  AKLMWSSLVGGK 345
            A++  S  +GGK
Sbjct: 456  ARIAQSHQLGGK 467

