BLASTX nr result

ID: Angelica23_contig00014038 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00014038
         (1660 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          596   e-168
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   576   e-162
ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|2...   572   e-160
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   567   e-159
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   566   e-159

>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  596 bits (1536), Expect = e-168
 Identities = 282/440 (64%), Positives = 343/440 (77%), Gaps = 6/440 (1%)
 Frame = +1

Query: 223  LFVLLLHTPCCIYSSTTYD---LFQTWCASYGKTYNSDQEKLSRFKIFEENYAYITQHNN 393
            LFV  L +   ++SS++ +   LF+TWC  +GKTY S +EKL R K+F++NY ++T+HN+
Sbjct: 7    LFVAFLLSYLFLFSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNS 66

Query: 394  KLAANSSLSYTLSLNAFADLSHQEFKASRLGLSAGGTNKLIRMNRGSSIIKGSSGVRNIP 573
            +     + SYTLSLNAFADL+H EFKASRLGLS+  +  L  ++R +  I     V ++P
Sbjct: 67   Q----GNSSYTLSLNAFADLTHHEFKASRLGLSSAASASL-NVDRSNRQIPDF--VADVP 119

Query: 574  SSLDWRDKGAVTNVKDQGSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCDRSYNS 753
            +S+DWR  GAVT VKDQG+CGACWSFSATGAIEGIN+IVTGSL SLSEQELVDCD+SYN+
Sbjct: 120  ASVDWRKNGAVTQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNN 179

Query: 754  GCEGGLMDYAYQFVIDNNGIDTEDDYPYQSRETKCNKNKLNRHVVTIDGYVDVRENDEER 933
            GCEGG+MDYA+QFVIDN+GIDTE+DYPYQ R+  CNK KL RHVVTIDGYVDV +N+E+ 
Sbjct: 180  GCEGGIMDYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKE 239

Query: 934  LLEAVAAQPVSVGICGSERNFQLYSKGIFSGPCSTSLDHAVLIVGYGTENGVDYWILKNS 1113
            LL+AVA QPVSVGICGSER FQLYSKGIF+GPCSTSLDHAVLIVGYG+ENGVDYWI+KNS
Sbjct: 240  LLKAVANQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNS 299

Query: 1114 WGTQWGMNGYMHMQRNNGNSQGICGINMMXXXXXXXXXXXXXXXXXXXXKCSLLFSCSEG 1293
            WG+ WGM+GYMHMQRN+G+S+G+CGINM+                    +C L   C EG
Sbjct: 300  WGSYWGMDGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEG 359

Query: 1294 ETCCCSWSLFGLCLSWKCCELDSAVCCDDHRHCCPQDYPICDTKRNLCLKKTGNYTLVKE 1473
            ETCCC   +FG+CLSWKCCELDSAVCC D RHCCP+DYP+CDT RN+CLK  GN T +++
Sbjct: 360  ETCCCVHHIFGICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEK 419

Query: 1474 FGNKRSSG---NWSSLLRDW 1524
            F    SSG   +WSSLL  W
Sbjct: 420  FAKNSSSGKFRSWSSLLEGW 439


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  576 bits (1485), Expect = e-162
 Identities = 280/419 (66%), Positives = 322/419 (76%), Gaps = 4/419 (0%)
 Frame = +1

Query: 199  MNQLRSWLLFVLLLHTPCCIYSSTTYD---LFQTWCASYGKTYNSDQEKLSRFKIFEENY 369
            MN L +  L  LL         S++ D   LF++W   +GKTY S ++KL RFKIFEENY
Sbjct: 1    MNFLSALFLITLLFFNLSISSFSSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENY 60

Query: 370  AYITQHNNKLAANSSLSYTLSLNAFADLSHQEFKASRLGLSAGGTN-KLIRMNRGSSIIK 546
             ++ +HN++     + SYTLSLNAFADL+H EFKASRLGLSA  T+ KL R N       
Sbjct: 61   EFVKKHNSQ----GNSSYTLSLNAFADLTHHEFKASRLGLSAFSTSGKLSRRNFPLHDFV 116

Query: 547  GSSGVRNIPSSLDWRDKGAVTNVKDQGSCGACWSFSATGAIEGINQIVTGSLTSLSEQEL 726
            G     ++P S+DWR KGAV+ VKDQG+CGACWSFSATGAIEGIN+IVTGSL SLSEQEL
Sbjct: 117  G-----DVPISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQEL 171

Query: 727  VDCDRSYNSGCEGGLMDYAYQFVIDNNGIDTEDDYPYQSRETKCNKNKLNRHVVTIDGYV 906
            VDCDRSYN+GCEGGLMDYAYQFVI+NNGIDTE+DYPYQ+RE  CNK KL RHVVTIDGY 
Sbjct: 172  VDCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYT 231

Query: 907  DVRENDEERLLEAVAAQPVSVGICGSERNFQLYSKGIFSGPCSTSLDHAVLIVGYGTENG 1086
            DV +N+E+ LL+AVAAQPVSVGICGSER FQLYSKGIF+GPCSTSLDHAVLIVGYG+ENG
Sbjct: 232  DVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENG 291

Query: 1087 VDYWILKNSWGTQWGMNGYMHMQRNNGNSQGICGINMMXXXXXXXXXXXXXXXXXXXXKC 1266
            VDYWI+KNSWGT WG+NGYM+M RN+GNSQG+CGINM+                    KC
Sbjct: 292  VDYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKC 351

Query: 1267 SLLFSCSEGETCCCSWSLFGLCLSWKCCELDSAVCCDDHRHCCPQDYPICDTKRNLCLK 1443
             L   C EGETCCC+  +FGLC SWKCCELDSAVCC D  HCCP DYP+CDTKRN+CLK
Sbjct: 352  DLFTRCGEGETCCCTRRIFGLCFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410


>ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|222857137|gb|EEE94684.1|
            predicted protein [Populus trichocarpa]
          Length = 436

 Score =  572 bits (1474), Expect = e-160
 Identities = 271/444 (61%), Positives = 332/444 (74%), Gaps = 2/444 (0%)
 Frame = +1

Query: 199  MNQLRSWLLFVLLLHTPCCIYSSTTYDLFQTWCASYGKTYNSDQEKLSRFKIFEENYAYI 378
            MN L  + L +L+        SS    LF+TWC  +GK+Y S +E+  R K+FE+NY ++
Sbjct: 1    MNFLYIFALTLLISVLSPSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFV 60

Query: 379  TQHNNKLAANSSLSYTLSLNAFADLSHQEFKASRLGLSAGGTNKLIRMNRGSSIIKGSSG 558
            T+HN+K     + SY+L+LNAFADL+H EFK SRLGLSA   N   R    + +      
Sbjct: 61   TKHNSK----GNSSYSLALNAFADLTHHEFKTSRLGLSAAPLNLAHRNLEITGV------ 110

Query: 559  VRNIPSSLDWRDKGAVTNVKDQGSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCD 738
            V +IP+S+DWR+KG VTNVKDQGSCGACWSFSATGAIEGIN+IVTGSL SLSEQEL++CD
Sbjct: 111  VGDIPASIDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECD 170

Query: 739  RSYNSGCEGGLMDYAYQFVIDNNGIDTEDDYPYQSRETKCNKNKLNRHVVTIDGYVDVRE 918
            +SYN GC GGLMDYA+QFVI+N+GIDTE+DYPY++R+  CNK+++ R VVTID YVDV E
Sbjct: 171  KSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPE 230

Query: 919  NDEERLLEAVAAQPVSVGICGSERNFQLYSKGIFSGPCSTSLDHAVLIVGYGTENGVDYW 1098
            N+E++LL+AVAAQPVSVGICGSER FQ+YSKGIF+GPCSTSLDHAVLIVGYG+ENGVDYW
Sbjct: 231  NNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYW 290

Query: 1099 ILKNSWGTQWGMNGYMHMQRNNGNSQGICGINMMXXXXXXXXXXXXXXXXXXXXKCSLLF 1278
            I+KNSWGT WGM GYMHMQRN+GNSQG+CGINM+                    KC+LL 
Sbjct: 291  IVKNSWGTGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLT 350

Query: 1279 SCSEGETCCCSWSLFGLCLSWKCCELDSAVCCDDHRHCCPQDYPICDTKRNLCLKKTGNY 1458
             C+ GETCCC+   FG+C+SWKCC LDSAVCC D  HCCP DYP+CDT +N+C K+ GN 
Sbjct: 351  YCAAGETCCCARKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNA 410

Query: 1459 TLVKEFGNKRSS--GNWSSLLRDW 1524
            T ++    K S   G+W SL   W
Sbjct: 411  TRMEAIEGKTSGKFGSWISLPEAW 434


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  567 bits (1462), Expect = e-159
 Identities = 276/434 (63%), Positives = 324/434 (74%), Gaps = 3/434 (0%)
 Frame = +1

Query: 217  WLLFVLLLHTPCCIYSSTTYDLFQTWCASYGKTYNSDQEKLSRFKIFEENYAYITQHNNK 396
            +L   LLL  P    S+ + +LF+ WC  +GK+Y+S +EKL R  +F +NY ++T HNN 
Sbjct: 8    FLTLFLLLFRPLSATSNVS-ELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNL 66

Query: 397  LAANSSLSYTLSLNAFADLSHQEFKASRLGLSAGGTNKLIRMNRGSSIIKGSSGVRNIPS 576
                 + SYTLSLN++ADL+H EFK SRLG S    N    + +  S+       R++P 
Sbjct: 67   ----DNSSYTLSLNSYADLTHHEFKVSRLGFSPALRNFRPVLPQEPSL------PRDVPD 116

Query: 577  SLDWRDKGAVTNVKDQGSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCDRSYNSG 756
            SLDWR KGAVT VKDQGSCGACWSFSATGA+EGINQI+TGSL SLSEQEL+DCDRSYNSG
Sbjct: 117  SLDWRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSG 176

Query: 757  CEGGLMDYAYQFVIDNNGIDTEDDYPYQSRETKCNKNKLNRHVVTIDGYVDVRENDEERL 936
            C GGLMDYAYQFVI N+GIDTE+DYPYQ+R+  C K+KL R+VVTIDGY D+  NDE +L
Sbjct: 177  CGGGLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKL 236

Query: 937  LEAVAAQPVSVGICGSERNFQLYSKGIFSGPCSTSLDHAVLIVGYGTENGVDYWILKNSW 1116
            L+AVAAQPVSVGICGSER FQLYSKGIFSGPCSTSLDHAVLIVGYG+ENGVDYWI+KNSW
Sbjct: 237  LQAVAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSW 296

Query: 1117 GTQWGMNGYMHMQRNNGNSQGICGINMMXXXXXXXXXXXXXXXXXXXXKCSLLFSCSEGE 1296
            G  WGM+GYMHMQRN+GNS+G+CGIN +                    KCS+L SC+ GE
Sbjct: 297  GKSWGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGE 356

Query: 1297 TCCCSWSLFGLCLSWKCCELDSAVCCDDHRHCCPQDYPICDTKRNLCLKKTGNYTLVKEF 1476
            TCCC+    GLCLSWKCC L SAVCC D RHCCP DYPICDT RNLCLK+T N T  +  
Sbjct: 357  TCCCAKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEIL 416

Query: 1477 GNKRSSGN---WSS 1509
             N+ SSG+   WSS
Sbjct: 417  ENRSSSGSSGTWSS 430


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
            lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
            ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  567 bits (1460), Expect = e-159
 Identities = 271/425 (63%), Positives = 325/425 (76%), Gaps = 2/425 (0%)
 Frame = +1

Query: 226  FVLLLHTPCCIYSSTTYDLFQTWCASYGKTYNSDQEKLSRFKIFEENYAYITQHNNKLAA 405
            F+LL+ +P    S    +LF  WC  +GKTY S++E+  R +IF++N+ ++TQHN  L  
Sbjct: 15   FLLLVSSPSS--SDDISELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHN--LIT 70

Query: 406  NSSLSYTLSLNAFADLSHQEFKASRLGLSAGGTNKLIRMNRGSSIIKGSSGVRNIPSSLD 585
            N++  Y+LSLNAFADL+H EFKASRLGLS   ++ LI  ++G S+     G   +P S+D
Sbjct: 71   NAT--YSLSLNAFADLTHHEFKASRLGLSVSASS-LIMASKGQSL----GGNAKVPDSVD 123

Query: 586  WRDKGAVTNVKDQGSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCDRSYNSGCEG 765
            WR KGAVTNVKDQGSCGACWSFSATGA+EGINQIVTG L SLSEQEL+DCD+SYN+GC G
Sbjct: 124  WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 183

Query: 766  GLMDYAYQFVIDNNGIDTEDDYPYQSRETKCNKNKLNRHVVTIDGYVDVRENDEERLLEA 945
            GLMDYA++FVI N+GIDTE DYPYQ R+  C K+KL + VVTID Y  V+ NDE+ L EA
Sbjct: 184  GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREA 243

Query: 946  VAAQPVSVGICGSERNFQLYSK--GIFSGPCSTSLDHAVLIVGYGTENGVDYWILKNSWG 1119
            VAAQPVSVGICGSER FQLYS+  GIFSGPCSTSLDHAVLIVGYG++NGVDYWI+KNSWG
Sbjct: 244  VAAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWG 303

Query: 1120 TQWGMNGYMHMQRNNGNSQGICGINMMXXXXXXXXXXXXXXXXXXXXKCSLLFSCSEGET 1299
              WGM+G+MHMQRN GNS+GICGINM+                    KC+L   CS GET
Sbjct: 304  KSWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGET 363

Query: 1300 CCCSWSLFGLCLSWKCCELDSAVCCDDHRHCCPQDYPICDTKRNLCLKKTGNYTLVKEFG 1479
            CCC+ +LFGLC SWKCCE++SAVCC D RHCCP DYP+CDT R+LCLKKTGN+T +K F 
Sbjct: 364  CCCARNLFGLCFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFW 423

Query: 1480 NKRSS 1494
             K SS
Sbjct: 424  KKDSS 428


Top