BLASTX nr result

ID: Angelica22_contig00024522

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00024522
         (1757 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]   488   e-135
ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2...   487   e-135
emb|CBI24128.3| unnamed protein product [Vitis vinifera]              478   e-132
ref|NP_187876.2| aspartyl protease family protein [Arabidopsis t...   382   e-103
gb|AAL49921.1| unknown protein [Arabidopsis thaliana]                 382   e-103

>emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score =  488 bits (1255), Expect = e-135
 Identities = 253/452 (55%), Positives = 321/452 (71%), Gaps = 9/452 (1%)
 Frame = +1

Query: 301  VIEMPKT----LQAMLRNDANRQATFLRKQSVIKKHQRLQSISKIRRELVEETKNNVSR- 465
            V+  PKT    L+ ++ +D+ RQ   L K         L+     RR+  E   ++  R 
Sbjct: 13   VMGRPKTQLQRLKELVHSDSVRQLMILHK---------LRGGQIPRRKAKEVLSSSSGRG 63

Query: 466  LNVAVDMPMHSGADFGIGQYLVAFKAGTPSQKLKLIVDTGSDLTWIKCRYSCENDPNKCS 645
             + A+++PMH  AD+GIGQY VAFK GTPSQK  L+ DTGSDLTW+ C+Y C +    CS
Sbjct: 64   SDDAIEVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRS--RNCS 121

Query: 646  GISDEDSKVE-VFEADDSSSFNTIPCSSEMCKVDLSNLFSLAMCPFPFTPCGFDYRYSDG 822
                   + + VF A+ SSSF TIPC ++MCK++L +LFSL  CP P TPCG+DYRYSDG
Sbjct: 122  NRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDG 181

Query: 823  SSAIGFFANETVTLGLTNGRKAKLDKILMGCSQSSQGPSFRGQVAPDGVLGLGFSKSSFA 1002
            S+A+GFFANETVT+ L  GRK KL  +L+GCS+S QG SF+   A DGV+GLG+SK SFA
Sbjct: 182  STALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQ---AADGVMGLGYSKYSFA 238

Query: 1003 IRATDAFGGKFSYCLPDHLSPKNVSSHLTF-SSPSYQNAATSKIYT--ILGAIGTFYAVN 1173
            I+A + FGGKFSYCL DHLS KNVS++LTF SS S +    +  YT  +LG + +FYAVN
Sbjct: 239  IKAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVN 298

Query: 1174 IRGISVGDELLNIPGEIWDVNGVGGAILDTGSSLTLLALPAYKPIVAALTASLNKFQRLD 1353
            + GIS+G  +L IP E+WDV G GG ILD+GSSLT L  PAY+P++AAL  SL KF++++
Sbjct: 299  MMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVE 358

Query: 1354 IKIRQLEYCFNATGFSELLDMPRLIFHFQDGTRYEPPVKNYVIDAADGVKCLGLVPAVWP 1533
            + I  LEYCFN+TGF E L +PRL+FHF DG  +EPPVK+YVI AADGV+CLG V   WP
Sbjct: 359  MDIGPLEYCFNSTGFEESL-VPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWP 417

Query: 1534 GVSVIGNIMQQNHVWEYDLVNGKLGFAPSSCT 1629
            G SV+GNIMQQNH+WE+DL   KLGFAPSSCT
Sbjct: 418  GTSVVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449


>ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score =  487 bits (1253), Expect = e-135
 Identities = 253/452 (55%), Positives = 321/452 (71%), Gaps = 9/452 (1%)
 Frame = +1

Query: 301  VIEMPKT----LQAMLRNDANRQATFLRKQSVIKKHQRLQSISKIRRELVEETKNNVSR- 465
            V+  PKT    L+ ++ +D+ RQ   L K         L+     RR+  E   ++  R 
Sbjct: 13   VMGRPKTQLQRLKELVHSDSVRQLMILHK---------LRGGQIPRRKAKEVLSSSSGRG 63

Query: 466  LNVAVDMPMHSGADFGIGQYLVAFKAGTPSQKLKLIVDTGSDLTWIKCRYSCENDPNKCS 645
             + A+++PMH  AD+GIGQY VAFK GTPSQK  L+ DTGSDLTW+ C+Y C +    CS
Sbjct: 64   SDDAIEVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRS--RNCS 121

Query: 646  GISDEDSKVE-VFEADDSSSFNTIPCSSEMCKVDLSNLFSLAMCPFPFTPCGFDYRYSDG 822
                   + + VF A+ SSSF TIPC ++MCK++L +LFSL  CP P TPCG+DYRYSDG
Sbjct: 122  NRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDG 181

Query: 823  SSAIGFFANETVTLGLTNGRKAKLDKILMGCSQSSQGPSFRGQVAPDGVLGLGFSKSSFA 1002
            S+A+GFFANETVT+ L  GRK KL  +L+GCS+S QG SF+   A DGV+GLG+SK SFA
Sbjct: 182  STALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQ---AADGVMGLGYSKYSFA 238

Query: 1003 IRATDAFGGKFSYCLPDHLSPKNVSSHLTF-SSPSYQNAATSKIYT--ILGAIGTFYAVN 1173
            I+A + FGGKFSYCL DHLS KNVS++LTF SS S +    +  YT  +LG + +FYAVN
Sbjct: 239  IKAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVN 298

Query: 1174 IRGISVGDELLNIPGEIWDVNGVGGAILDTGSSLTLLALPAYKPIVAALTASLNKFQRLD 1353
            + GIS+G  +L IP E+WDV G GG ILD+GSSLT L  PAY+P++AAL  SL KF++++
Sbjct: 299  MMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVE 358

Query: 1354 IKIRQLEYCFNATGFSELLDMPRLIFHFQDGTRYEPPVKNYVIDAADGVKCLGLVPAVWP 1533
            + I  LEYCFN+TGF E L +PRL+FHF DG  +EPPVK+YVI AADGV+CLG V   WP
Sbjct: 359  MDIGPLEYCFNSTGFEESL-VPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWP 417

Query: 1534 GVSVIGNIMQQNHVWEYDLVNGKLGFAPSSCT 1629
            G SV+GNIMQQNH+WE+DL   KLGFAPSSCT
Sbjct: 418  GTSVVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449


>emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score =  478 bits (1229), Expect = e-132
 Identities = 236/384 (61%), Positives = 290/384 (75%), Gaps = 4/384 (1%)
 Frame = +1

Query: 490  MHSGADFGIGQYLVAFKAGTPSQKLKLIVDTGSDLTWIKCRYSCENDPNKCSGISDEDSK 669
            MH  AD+GIGQY VAFK GTPSQK  L+ DTGSDLTW+ C+Y C +    CS       +
Sbjct: 1    MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRS--RNCSNRKARRIR 58

Query: 670  VE-VFEADDSSSFNTIPCSSEMCKVDLSNLFSLAMCPFPFTPCGFDYRYSDGSSAIGFFA 846
             + VF A+ SSSF TIPC ++MCK++L +LFSL  CP P TPCG+DYRYSDGS+A+GFFA
Sbjct: 59   HKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFA 118

Query: 847  NETVTLGLTNGRKAKLDKILMGCSQSSQGPSFRGQVAPDGVLGLGFSKSSFAIRATDAFG 1026
            NETVT+ L  GRK KL  +L+GCS+S QG SF+   A DGV+GLG+SK SFAI+A + FG
Sbjct: 119  NETVTVELKEGRKMKLHNVLIGCSESFQGQSFQ---AADGVMGLGYSKYSFAIKAAEKFG 175

Query: 1027 GKFSYCLPDHLSPKNVSSHLTF-SSPSYQNAATSKIYT--ILGAIGTFYAVNIRGISVGD 1197
            GKFSYCL DHLS KNVS++LTF SS S +    +  YT  +LG + +FYAVN+ GIS+G 
Sbjct: 176  GKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGG 235

Query: 1198 ELLNIPGEIWDVNGVGGAILDTGSSLTLLALPAYKPIVAALTASLNKFQRLDIKIRQLEY 1377
             +L IP E+WDV G GG ILD+GSSLT L  PAY+P++AAL  SL KF+++++ I  LEY
Sbjct: 236  AMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEY 295

Query: 1378 CFNATGFSELLDMPRLIFHFQDGTRYEPPVKNYVIDAADGVKCLGLVPAVWPGVSVIGNI 1557
            CFN+TGF E L +PRL+FHF DG  +EPPVK+YVI AADGV+CLG V   WPG SV+GNI
Sbjct: 296  CFNSTGFEESL-VPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNI 354

Query: 1558 MQQNHVWEYDLVNGKLGFAPSSCT 1629
            MQQNH+WE+DL   KLGFAPSSCT
Sbjct: 355  MQQNHLWEFDLGLKKLGFAPSSCT 378


>ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
            gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA
            binding protein-like [Arabidopsis thaliana]
            gi|332641715|gb|AEE75236.1| aspartyl protease family
            protein [Arabidopsis thaliana]
          Length = 461

 Score =  382 bits (982), Expect = e-103
 Identities = 196/403 (48%), Positives = 258/403 (64%), Gaps = 1/403 (0%)
 Frame = +1

Query: 424  RRELVEETKNNVSRLNVAVDMPMHSGADFGIGQYLVAFKAGTPSQKLKLIVDTGSDLTWI 603
            R  L+   +N+     V V M + SG D+G  QY    + GTP++K +++VDTGS+LTW+
Sbjct: 77   RHSLISRKRNST----VGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWV 132

Query: 604  KCRYSCENDPNKCSGISDEDSKVEVFEADDSSSFNTIPCSSEMCKVDLSNLFSLAMCPFP 783
             CRY      N+            VF AD+S SF T+ C ++ CKVDL NLFSL  CP P
Sbjct: 133  NCRYRARGKDNR-----------RVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTP 181

Query: 784  FTPCGFDYRYSDGSSAIGFFANETVTLGLTNGRKAKLDKILMGCSQSSQGPSFRGQVAPD 963
             TPC +DYRY+DGS+A G FA ET+T+GLTNGR A+L   L+GCS S  G SF+G    D
Sbjct: 182  STPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGA---D 238

Query: 964  GVLGLGFSKSSFAIRATDAFGGKFSYCLPDHLSPKNVSSHLTFSSPSYQNAATSKIYTI- 1140
            GVLGL FS  SF   AT  +G KFSYCL DHLS KNVS++L F S      A  +   + 
Sbjct: 239  GVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLD 298

Query: 1141 LGAIGTFYAVNIRGISVGDELLNIPGEIWDVNGVGGAILDTGSSLTLLALPAYKPIVAAL 1320
            L  I  FYA+N+ GIS+G ++L+IP ++WD    GG ILD+G+SLTLLA  AYK +V  L
Sbjct: 299  LTRIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGL 358

Query: 1321 TASLNKFQRLDIKIRQLEYCFNATGFSELLDMPRLIFHFQDGTRYEPPVKNYVIDAADGV 1500
               L + +R+  +   +EYCF+ T    +  +P+L FH + G R+EP  K+Y++DAA GV
Sbjct: 359  ARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGV 418

Query: 1501 KCLGLVPAVWPGVSVIGNIMQQNHVWEYDLVNGKLGFAPSSCT 1629
            KCLG V A  P  +VIGNIMQQN++WE+DL+   L FAPS+CT
Sbjct: 419  KCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSACT 461


>gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score =  382 bits (982), Expect = e-103
 Identities = 196/403 (48%), Positives = 258/403 (64%), Gaps = 1/403 (0%)
 Frame = +1

Query: 424  RRELVEETKNNVSRLNVAVDMPMHSGADFGIGQYLVAFKAGTPSQKLKLIVDTGSDLTWI 603
            R  L+   +N+     V V M + SG D+G  QY    + GTP++K +++VDTGS+LTW+
Sbjct: 55   RHSLISRKRNST----VGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWV 110

Query: 604  KCRYSCENDPNKCSGISDEDSKVEVFEADDSSSFNTIPCSSEMCKVDLSNLFSLAMCPFP 783
             CRY      N+            VF AD+S SF T+ C ++ CKVDL NLFSL  CP P
Sbjct: 111  NCRYRARGKDNR-----------RVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTP 159

Query: 784  FTPCGFDYRYSDGSSAIGFFANETVTLGLTNGRKAKLDKILMGCSQSSQGPSFRGQVAPD 963
             TPC +DYRY+DGS+A G FA ET+T+GLTNGR A+L   L+GCS S  G SF+G    D
Sbjct: 160  STPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGA---D 216

Query: 964  GVLGLGFSKSSFAIRATDAFGGKFSYCLPDHLSPKNVSSHLTFSSPSYQNAATSKIYTI- 1140
            GVLGL FS  SF   AT  +G KFSYCL DHLS KNVS++L F S      A  +   + 
Sbjct: 217  GVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLD 276

Query: 1141 LGAIGTFYAVNIRGISVGDELLNIPGEIWDVNGVGGAILDTGSSLTLLALPAYKPIVAAL 1320
            L  I  FYA+N+ GIS+G ++L+IP ++WD    GG ILD+G+SLTLLA  AYK +V  L
Sbjct: 277  LTRIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGL 336

Query: 1321 TASLNKFQRLDIKIRQLEYCFNATGFSELLDMPRLIFHFQDGTRYEPPVKNYVIDAADGV 1500
               L + +R+  +   +EYCF+ T    +  +P+L FH + G R+EP  K+Y++DAA GV
Sbjct: 337  ARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGV 396

Query: 1501 KCLGLVPAVWPGVSVIGNIMQQNHVWEYDLVNGKLGFAPSSCT 1629
            KCLG V A  P  +VIGNIMQQN++WE+DL+   L FAPS+CT
Sbjct: 397  KCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSACT 439

