BLASTX nr result

ID: Angelica22_contig00022046 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00022046
         (1526 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          612   e-173
ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|2...   602   e-170
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   581   e-163
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   579   e-163
ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine...   571   e-160

>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  612 bits (1578), Expect = e-173
 Identities = 293/454 (64%), Positives = 349/454 (76%), Gaps = 4/454 (0%)
 Frame = -3

Query: 1518 NTHLSLTMTWLWSFLVPTLLLCIPCIYSSTTSD---LFEAWCITHGKTYSSQQEKLHRFK 1348
            N++ +L + +L S+L          ++SS++S+   LFE WC  HGKTY+SQ+EKL R K
Sbjct: 2    NSNCALFVAFLLSYLF---------LFSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLK 52

Query: 1347 IFEENYMYVTKHNMNNDMKANSSLSYTLSIDNAFADLTHQEFKASRLGLSSNGIIRMNLG 1168
            +F++NY +VT+HN      +  + SYTLS+ NAFADLTH EFKASRLGLSS     +N+ 
Sbjct: 53   VFQDNYDFVTEHN------SQGNSSYTLSL-NAFADLTHHEFKASRLGLSSAASASLNVD 105

Query: 1167 GSSKDSDD-VTSVPASLDWRDKGAVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLTSL 991
             S++   D V  VPAS+DWR  GAVT VKDQG+CGACWSFSATGAIEGINKIVTGSL SL
Sbjct: 106  RSNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSL 165

Query: 990  SEQELVDCDRSYNDGCEGGLMDYAYQFVVKNKGIDTEDDYPYQSRDTTCNKNKMNRHVVT 811
            SEQELVDCD+SYN+GCEGG+MDYA+QFV+ N GIDTE+DYPYQ RD +CNK K+ RHVVT
Sbjct: 166  SEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVT 225

Query: 810  IDGYIDVRENDEKQLLAAVAAQPVSVGICGSERNFQLYSKGIFNGPCSTLLNHAVLIVGY 631
            IDGY+DV +N+EK+LL AVA QPVSVGICGSER FQLYSKGIF GPCST L+HAVLIVGY
Sbjct: 226  IDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGY 285

Query: 630  GSENGVDYWIVKNSWGKEWGMNGYMHMQRNSGNAQGICGINMMASYXXXXXXXXXXXXXX 451
            GSENGVDYWIVKNSWG  WGM+GYMHMQRNSG+++G+CGINM+ASY              
Sbjct: 286  GSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPP 345

Query: 450  XXTKCSLLTSCSEGETCCCARSLFGICLSWKCCELNSAVCCDDHRHCCPQDYPICDTKRN 271
              T+C L T C EGETCCC   +FGICLSWKCCEL+SAVCC D RHCCP+DYP+CDT RN
Sbjct: 346  GPTRCDLFTHCGEGETCCCVHHIFGICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRN 405

Query: 270  MCLKQTGNYTLVKEFKNKKSFGKLGGWTSLLGEW 169
            +CLK  GN T +++F    S GK   W+SLL  W
Sbjct: 406  ICLKHYGNATRIEKFAKNSSSGKFRSWSSLLEGW 439


>ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|222857137|gb|EEE94684.1|
            predicted protein [Populus trichocarpa]
          Length = 436

 Score =  602 bits (1553), Expect = e-170
 Identities = 294/446 (65%), Positives = 340/446 (76%), Gaps = 3/446 (0%)
 Frame = -3

Query: 1497 MTWLWSFLVPTLLLCI--PCIYSSTTSDLFEAWCITHGKTYSSQQEKLHRFKIFEENYMY 1324
            M +L+ F + TLL+ +  P   SS  S LFE WC  HGK+Y+SQ+E+ HR K+FE+NY +
Sbjct: 1    MNFLYIFAL-TLLISVLSPSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDF 59

Query: 1323 VTKHNMNNDMKANSSLSYTLSIDNAFADLTHQEFKASRLGLSSNGIIRMNLGGSSKDSDD 1144
            VTKHN     K NSS S  L   NAFADLTH EFK SRLGLS+     +NL   + +   
Sbjct: 60   VTKHNS----KGNSSYSLAL---NAFADLTHHEFKTSRLGLSA---APLNLAHRNLEITG 109

Query: 1143 VTS-VPASLDWRDKGAVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLTSLSEQELVDC 967
            V   +PAS+DWR+KG VTNVKDQGSCGACWSFSATGAIEGINKIVTGSL SLSEQEL++C
Sbjct: 110  VVGDIPASIDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIEC 169

Query: 966  DRSYNDGCEGGLMDYAYQFVVKNKGIDTEDDYPYQSRDTTCNKNKMNRHVVTIDGYIDVR 787
            D+SYNDGC GGLMDYA+QFV+ N GIDTE+DYPY++RD TCNK++M R VVTID Y+DV 
Sbjct: 170  DKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVP 229

Query: 786  ENDEKQLLAAVAAQPVSVGICGSERNFQLYSKGIFNGPCSTLLNHAVLIVGYGSENGVDY 607
            EN+EKQLL AVAAQPVSVGICGSER FQ+YSKGIF GPCST L+HAVLIVGYGSENGVDY
Sbjct: 230  ENNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDY 289

Query: 606  WIVKNSWGKEWGMNGYMHMQRNSGNAQGICGINMMASYXXXXXXXXXXXXXXXXTKCSLL 427
            WIVKNSWG  WGM GYMHMQRNSGN+QG+CGINM+ASY                TKC+LL
Sbjct: 290  WIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLL 349

Query: 426  TSCSEGETCCCARSLFGICLSWKCCELNSAVCCDDHRHCCPQDYPICDTKRNMCLKQTGN 247
            T C+ GETCCCAR  FGIC+SWKCC L+SAVCC D  HCCP DYP+CDT +NMC K+ GN
Sbjct: 350  TYCAAGETCCCARKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGN 409

Query: 246  YTLVKEFKNKKSFGKLGGWTSLLGEW 169
             T ++  + K S GK G W SL   W
Sbjct: 410  ATRMEAIEGKTS-GKFGSWISLPEAW 434


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  581 bits (1497), Expect = e-163
 Identities = 277/393 (70%), Positives = 313/393 (79%)
 Frame = -3

Query: 1437 SSTTSDLFEAWCITHGKTYSSQQEKLHRFKIFEENYMYVTKHNMNNDMKANSSLSYTLSI 1258
            SS  S LFE+W   HGKTY+S+++KL+RFKIFEENY +V KHN      +  + SYTLS+
Sbjct: 25   SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHN------SQGNSSYTLSL 78

Query: 1257 DNAFADLTHQEFKASRLGLSSNGIIRMNLGGSSKDSDDVTSVPASLDWRDKGAVTNVKDQ 1078
             NAFADLTH EFKASRLGLS+          +    D V  VP S+DWR KGAV+ VKDQ
Sbjct: 79   -NAFADLTHHEFKASRLGLSAFSTSGKLSRRNFPLHDFVGDVPISIDWRKKGAVSQVKDQ 137

Query: 1077 GSCGACWSFSATGAIEGINKIVTGSLTSLSEQELVDCDRSYNDGCEGGLMDYAYQFVVKN 898
            G+CGACWSFSATGAIEGINKIVTGSL SLSEQELVDCDRSYN+GCEGGLMDYAYQFV++N
Sbjct: 138  GNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIEN 197

Query: 897  KGIDTEDDYPYQSRDTTCNKNKMNRHVVTIDGYIDVRENDEKQLLAAVAAQPVSVGICGS 718
             GIDTE+DYPYQ+R+ TCNK K+ RHVVTIDGY DV +N+EK+LL AVAAQPVSVGICGS
Sbjct: 198  NGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGS 257

Query: 717  ERNFQLYSKGIFNGPCSTLLNHAVLIVGYGSENGVDYWIVKNSWGKEWGMNGYMHMQRNS 538
            ER FQLYSKGIF GPCST L+HAVLIVGYGSENGVDYWIVKNSWG  WG+NGYM+M RNS
Sbjct: 258  ERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNS 317

Query: 537  GNAQGICGINMMASYXXXXXXXXXXXXXXXXTKCSLLTSCSEGETCCCARSLFGICLSWK 358
            GN+QG+CGINM+AS+                TKC L T C EGETCCC R +FG+C SWK
Sbjct: 318  GNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWK 377

Query: 357  CCELNSAVCCDDHRHCCPQDYPICDTKRNMCLK 259
            CCEL+SAVCC D  HCCP DYP+CDTKRNMCLK
Sbjct: 378  CCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  579 bits (1493), Expect = e-163
 Identities = 283/436 (64%), Positives = 330/436 (75%)
 Frame = -3

Query: 1491 WLWSFLVPTLLLCIPCIYSSTTSDLFEAWCITHGKTYSSQQEKLHRFKIFEENYMYVTKH 1312
            + + FL   LLL  P   +S  S+LFE WC  HGK+YSS +EKL+R  +F +NY +VT H
Sbjct: 4    YAFHFLTLFLLLFRPLSATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHH 63

Query: 1311 NMNNDMKANSSLSYTLSIDNAFADLTHQEFKASRLGLSSNGIIRMNLGGSSKDSDDVTSV 1132
            N N D     + SYTLS+ N++ADLTH EFK SRLG S    +R       ++      V
Sbjct: 64   N-NLD-----NSSYTLSL-NSYADLTHHEFKVSRLGFSP--ALRNFRPVLPQEPSLPRDV 114

Query: 1131 PASLDWRDKGAVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLTSLSEQELVDCDRSYN 952
            P SLDWR KGAVT VKDQGSCGACWSFSATGA+EGIN+I+TGSL SLSEQEL+DCDRSYN
Sbjct: 115  PDSLDWRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYN 174

Query: 951  DGCEGGLMDYAYQFVVKNKGIDTEDDYPYQSRDTTCNKNKMNRHVVTIDGYIDVRENDEK 772
             GC GGLMDYAYQFV+ N GIDTE+DYPYQ+RD +C K+K+ R+VVTIDGY D+  NDE 
Sbjct: 175  SGCGGGLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEG 234

Query: 771  QLLAAVAAQPVSVGICGSERNFQLYSKGIFNGPCSTLLNHAVLIVGYGSENGVDYWIVKN 592
            +LL AVAAQPVSVGICGSER FQLYSKGIF+GPCST L+HAVLIVGYGSENGVDYWIVKN
Sbjct: 235  KLLQAVAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKN 294

Query: 591  SWGKEWGMNGYMHMQRNSGNAQGICGINMMASYXXXXXXXXXXXXXXXXTKCSLLTSCSE 412
            SWGK WGM+GYMHMQRNSGN++G+CGIN +ASY                TKCS+LTSC+ 
Sbjct: 295  SWGKSWGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAA 354

Query: 411  GETCCCARSLFGICLSWKCCELNSAVCCDDHRHCCPQDYPICDTKRNMCLKQTGNYTLVK 232
            GETCCCA+   G+CLSWKCC L+SAVCC D RHCCP DYPICDT RN+CLKQT N T  +
Sbjct: 355  GETCCCAKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTE 414

Query: 231  EFKNKKSFGKLGGWTS 184
              +N+ S G  G W+S
Sbjct: 415  ILENRSSSGSSGTWSS 430


>ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  571 bits (1472), Expect = e-160
 Identities = 275/435 (63%), Positives = 326/435 (74%)
 Frame = -3

Query: 1488 LWSFLVPTLLLCIPCIYSSTTSDLFEAWCITHGKTYSSQQEKLHRFKIFEENYMYVTKHN 1309
            L  FL   LL  +  + +S TS+LFE WC  H KTYSS++EKL+R K+FE+NY +V +HN
Sbjct: 9    LLQFLSLILLFTLFFLSASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHN 68

Query: 1308 MNNDMKANSSLSYTLSIDNAFADLTHQEFKASRLGLSSNGIIRMNLGGSSKDSDDVTSVP 1129
             N +   N+S SYTLS+ NAFADLTH EFK +RLGL    ++R      ++ S D+  +P
Sbjct: 69   QNANNNNNNS-SYTLSL-NAFADLTHHEFKTTRLGLPLT-LLRFKRP-QNQQSRDLLHIP 124

Query: 1128 ASLDWRDKGAVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLTSLSEQELVDCDRSYND 949
            + +DWR  GAVT VKDQ SCGACW+FSATGAIEGINKIVTGSL SLSEQEL+DCD SYN 
Sbjct: 125  SQIDWRQSGAVTPVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNS 184

Query: 948  GCEGGLMDYAYQFVVKNKGIDTEDDYPYQSRDTTCNKNKMNRHVVTIDGYIDVRENDEKQ 769
            GC GGLMD+AYQFV+ NKGIDTEDDYPYQ+R  +C+K+K+ R  VTI+ Y+DV  ++E +
Sbjct: 185  GCGGGLMDFAYQFVIDNKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEE-E 243

Query: 768  LLAAVAAQPVSVGICGSERNFQLYSKGIFNGPCSTLLNHAVLIVGYGSENGVDYWIVKNS 589
            +L AVA+QPVSVGICGSER FQLYSKGIF GPCST L+HAVLIVGYGSENGVDYWIVKNS
Sbjct: 244  ILKAVASQPVSVGICGSEREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNS 303

Query: 588  WGKEWGMNGYMHMQRNSGNAQGICGINMMASYXXXXXXXXXXXXXXXXTKCSLLTSCSEG 409
            WGK WGMNGY+HM RNSGN++GICGIN +ASY                 +C+L T CSEG
Sbjct: 304  WGKYWGMNGYIHMIRNSGNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEG 363

Query: 408  ETCCCARSLFGICLSWKCCELNSAVCCDDHRHCCPQDYPICDTKRNMCLKQTGNYTLVKE 229
            ETCCCA+S  GIC SWKCC L SAVCC D RHCCPQDYPICDT+R  CLK+T N T    
Sbjct: 364  ETCCCAKSFLGICFSWKCCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTIT 423

Query: 228  FKNKKSFGKLGGWTS 184
             +N+    K  GW S
Sbjct: 424  SENQDFSHKSRGWKS 438


Top