BLASTX nr result

ID: Angelica22_contig00028340 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00028340
         (1504 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002322693.1| predicted protein [Populus trichocarpa] gi|2...   649   0.0  
ref|NP_567055.1| glycoside hydrolase family 28 protein / polygal...   610   e-172
emb|CAB41176.1| putative protein [Arabidopsis thaliana]               610   e-172
ref|XP_002878172.1| hypothetical protein ARALYDRAFT_324272 [Arab...   608   e-172
ref|XP_004161486.1| PREDICTED: exo-poly-alpha-D-galacturonosidas...   585   e-165

>ref|XP_002322693.1| predicted protein [Populus trichocarpa] gi|222867323|gb|EEF04454.1|
            predicted protein [Populus trichocarpa]
          Length = 445

 Score =  649 bits (1674), Expect = 0.0
 Identities = 308/443 (69%), Positives = 356/443 (80%), Gaps = 4/443 (0%)
 Frame = -2

Query: 1386 KIQLPQST-LTLSVRDYGAVGDGVHYDTPSIQSAVNDCHSLG---GCHVIFPPGNYLTAT 1219
            +IQLP  T  TLSV D+GA+GDG+HYDT +IQS +N C +      CHV FPPG YLTAT
Sbjct: 2    RIQLPHPTPTTLSVTDFGAIGDGIHYDTEAIQSTINSCPTTPPTKACHVNFPPGIYLTAT 61

Query: 1218 IHLKSDVILEIQKNAAILGGTKQTDYPKENDRWYVVVAEDAXXXXXXXXXXXXXXGFEFV 1039
            IHLKS+V+L IQ+ A +LGGTK  DYPKE +RWYVV+AE+A              G +FV
Sbjct: 62   IHLKSNVVLNIQEGATLLGGTKLEDYPKEFNRWYVVLAENASDVGITGGGVVDGQGLKFV 121

Query: 1038 SRFDDKKNVMVSWNKTGDCLGDECRPRLVGFLRSRNVRVWDVNLLQPAYWCLHLVQCDNT 859
             RF+++KNVMVSWN TG CLGDECRPRLVGF+   NV+VW+V L +PAYWCLH+VQC NT
Sbjct: 122  KRFNERKNVMVSWNSTGACLGDECRPRLVGFIGCTNVKVWNVRLSEPAYWCLHIVQCLNT 181

Query: 858  SIHDVSIYGDFNTPXXXXXXXXXXXNTFITRCNINTGDDAICPKTYLGPLYNLTATDCWI 679
             I DVSIYGDFN+P           NT ITRC+I+TGDDAICPKTY GP+YNLTATDCWI
Sbjct: 182  HISDVSIYGDFNSPNNDGIDIEDSNNTLITRCHIDTGDDAICPKTYTGPIYNLTATDCWI 241

Query: 678  RTKSSAIKFGSASWFDFKGLVFNNITIVESHRGLGLQIRDGGNVSDVTFSNINITTRYYD 499
            RTKSSAIK GSASWF+FKGLVF+NITIV+SHRGLGLQIRDGGNVSD+TFSNINI+TRYYD
Sbjct: 242  RTKSSAIKLGSASWFEFKGLVFDNITIVDSHRGLGLQIRDGGNVSDITFSNINISTRYYD 301

Query: 498  PSWWGRAEPIYVTTCPREETSKSGSISNILFVNITATSENGVFLSGCKGGILSNLKFENV 319
            PSWWGRAEPIYVTTCPR  +SK GSISN+ F+NIT  SENGVFLSG KGG+LSNL+F N+
Sbjct: 302  PSWWGRAEPIYVTTCPRHSSSKEGSISNLQFINITTNSENGVFLSGSKGGLLSNLRFINM 361

Query: 318  NLTYRRWTNYMDGLVDYRPGCQGLVNHSSAGFLMEHIDGLEAVNVNMKWSEDHLGRWNTP 139
            NLT+RRWT Y  GLVDYRPGCQGLVNHS+AG +MEHI+G E  NVNM+WS+     W+ P
Sbjct: 362  NLTFRRWTTYPGGLVDYRPGCQGLVNHSAAGIIMEHIEGFEVENVNMRWSDYQNEPWDNP 421

Query: 138  LSFKPSTVNNISLINFHSGLFKQ 70
            L F+PSTVNNIS +NFHS L+KQ
Sbjct: 422  LDFRPSTVNNISFLNFHSALYKQ 444


>ref|NP_567055.1| glycoside hydrolase family 28 protein / polygalacturonase (pectinase)
            family protein [Arabidopsis thaliana]
            gi|332646180|gb|AEE79701.1| glycoside hydrolase family 28
            protein / polygalacturonase (pectinase) family protein
            [Arabidopsis thaliana]
          Length = 490

 Score =  610 bits (1574), Expect = e-172
 Identities = 293/451 (64%), Positives = 353/451 (78%), Gaps = 10/451 (2%)
 Frame = -2

Query: 1398 SSSPKIQLPQSTLTLSVRDYGAVGDGVHYDTPSIQSAVNDCHS-----LGGCHVIFPPGN 1234
            +S  KIQLP  +LTLSV D+GA GDG++YDT +IQS ++ C+         C V+FP GN
Sbjct: 21   TSYSKIQLPGDSLTLSVTDFGATGDGINYDTSAIQSTIDACNRHYTSFSSICRVVFPSGN 80

Query: 1233 YLTATIHLKSDVILEIQKNAAILGGTKQTDY-PKENDR-WYVVVAEDAXXXXXXXXXXXX 1060
            YLTA +HL+S VIL++ +NA +LGG +  DY P E    WYVVVA +A            
Sbjct: 81   YLTAKLHLRSGVILDVTENAVLLGGPRIEDYYPAETSSDWYVVVANNATDVGITGGGAID 140

Query: 1059 XXGFEFVSRFDDKKNVMVSWNKTGDCLGDECRPRLVGFLRSRNVRVWDVNLLQPAYWCLH 880
              G +FV RFD+KKNVMVSWN+TG CLGDECRPRLVGF+ S NV +W++ L +PAYWCLH
Sbjct: 141  GQGSKFVVRFDEKKNVMVSWNQTGACLGDECRPRLVGFVDSINVEIWNITLREPAYWCLH 200

Query: 879  LVQCDNTSIHDVSIYGDFNTPXXXXXXXXXXXNTFITRCNINTGDDAICPKTYLGPLYNL 700
            +V+C+NTS+HDVSI GDFNTP           NT ITRC+I+TGDDAICPKTY GPLYNL
Sbjct: 201  IVRCENTSVHDVSILGDFNTPNNDGIDIEDSNNTVITRCHIDTGDDAICPKTYTGPLYNL 260

Query: 699  TATDCWIRTKSSAIKFGSASWFDFKGLVFNNITIVESHRGLGLQIRDGGNVSDVTFSNIN 520
            TATDCWIRTKSSAIK GSASWFDFKGLVF+NITI ESHRGLG+QIRDGGNVSDVTFSNIN
Sbjct: 261  TATDCWIRTKSSAIKLGSASWFDFKGLVFDNITIFESHRGLGMQIRDGGNVSDVTFSNIN 320

Query: 519  ITTRYYDPSWWGRAEPIYVTTCPREETSKSGSISNILFVNITATSENGVFLSGCKGGILS 340
            I+TRYYDPSWWGRAEPIY+TTCPR+ ++K GSISN+LFVNIT  SENGVFLSG   G+LS
Sbjct: 321  ISTRYYDPSWWGRAEPIYITTCPRDSSAKEGSISNLLFVNITIDSENGVFLSGSPNGLLS 380

Query: 339  NLKFENVNLTYRRWTNYMDGLVDYRPGCQGLVNH-SSAGFLMEHIDGLEAVNVNMKWSED 163
            ++KF+N+NLT+RRW+NY  GLVDYRPGCQGLVNH +++G +MEH++G    NV++KWS+D
Sbjct: 381  DIKFKNMNLTFRRWSNYSAGLVDYRPGCQGLVNHRATSGIIMEHVNGFRVENVDLKWSDD 440

Query: 162  H--LGRWNTPLSFKPSTVNNISLINFHSGLF 76
                  WN PL F+PSTVNN+S + F SGL+
Sbjct: 441  DDVNAAWNVPLEFRPSTVNNVSFVGFTSGLY 471


>emb|CAB41176.1| putative protein [Arabidopsis thaliana]
          Length = 614

 Score =  610 bits (1574), Expect = e-172
 Identities = 293/451 (64%), Positives = 353/451 (78%), Gaps = 10/451 (2%)
 Frame = -2

Query: 1398 SSSPKIQLPQSTLTLSVRDYGAVGDGVHYDTPSIQSAVNDCHS-----LGGCHVIFPPGN 1234
            +S  KIQLP  +LTLSV D+GA GDG++YDT +IQS ++ C+         C V+FP GN
Sbjct: 21   TSYSKIQLPGDSLTLSVTDFGATGDGINYDTSAIQSTIDACNRHYTSFSSICRVVFPSGN 80

Query: 1233 YLTATIHLKSDVILEIQKNAAILGGTKQTDY-PKENDR-WYVVVAEDAXXXXXXXXXXXX 1060
            YLTA +HL+S VIL++ +NA +LGG +  DY P E    WYVVVA +A            
Sbjct: 81   YLTAKLHLRSGVILDVTENAVLLGGPRIEDYYPAETSSDWYVVVANNATDVGITGGGAID 140

Query: 1059 XXGFEFVSRFDDKKNVMVSWNKTGDCLGDECRPRLVGFLRSRNVRVWDVNLLQPAYWCLH 880
              G +FV RFD+KKNVMVSWN+TG CLGDECRPRLVGF+ S NV +W++ L +PAYWCLH
Sbjct: 141  GQGSKFVVRFDEKKNVMVSWNQTGACLGDECRPRLVGFVDSINVEIWNITLREPAYWCLH 200

Query: 879  LVQCDNTSIHDVSIYGDFNTPXXXXXXXXXXXNTFITRCNINTGDDAICPKTYLGPLYNL 700
            +V+C+NTS+HDVSI GDFNTP           NT ITRC+I+TGDDAICPKTY GPLYNL
Sbjct: 201  IVRCENTSVHDVSILGDFNTPNNDGIDIEDSNNTVITRCHIDTGDDAICPKTYTGPLYNL 260

Query: 699  TATDCWIRTKSSAIKFGSASWFDFKGLVFNNITIVESHRGLGLQIRDGGNVSDVTFSNIN 520
            TATDCWIRTKSSAIK GSASWFDFKGLVF+NITI ESHRGLG+QIRDGGNVSDVTFSNIN
Sbjct: 261  TATDCWIRTKSSAIKLGSASWFDFKGLVFDNITIFESHRGLGMQIRDGGNVSDVTFSNIN 320

Query: 519  ITTRYYDPSWWGRAEPIYVTTCPREETSKSGSISNILFVNITATSENGVFLSGCKGGILS 340
            I+TRYYDPSWWGRAEPIY+TTCPR+ ++K GSISN+LFVNIT  SENGVFLSG   G+LS
Sbjct: 321  ISTRYYDPSWWGRAEPIYITTCPRDSSAKEGSISNLLFVNITIDSENGVFLSGSPNGLLS 380

Query: 339  NLKFENVNLTYRRWTNYMDGLVDYRPGCQGLVNH-SSAGFLMEHIDGLEAVNVNMKWSED 163
            ++KF+N+NLT+RRW+NY  GLVDYRPGCQGLVNH +++G +MEH++G    NV++KWS+D
Sbjct: 381  DIKFKNMNLTFRRWSNYSAGLVDYRPGCQGLVNHRATSGIIMEHVNGFRVENVDLKWSDD 440

Query: 162  H--LGRWNTPLSFKPSTVNNISLINFHSGLF 76
                  WN PL F+PSTVNN+S + F SGL+
Sbjct: 441  DDVNAAWNVPLEFRPSTVNNVSFVGFTSGLY 471


>ref|XP_002878172.1| hypothetical protein ARALYDRAFT_324272 [Arabidopsis lyrata subsp.
            lyrata] gi|297324010|gb|EFH54431.1| hypothetical protein
            ARALYDRAFT_324272 [Arabidopsis lyrata subsp. lyrata]
          Length = 589

 Score =  608 bits (1569), Expect = e-172
 Identities = 290/444 (65%), Positives = 348/444 (78%), Gaps = 8/444 (1%)
 Frame = -2

Query: 1398 SSSPKIQLPQSTLTLSVRDYGAVGDGVHYDTPSIQSAVNDCH-----SLGGCHVIFPPGN 1234
            +S  KIQLP  +L LSV D+GA GDG++YDT ++QS ++ C+     S   C V FP GN
Sbjct: 20   TSYSKIQLPGDSLALSVTDFGATGDGINYDTSAVQSTIDACNRHYTSSSSICRVTFPSGN 79

Query: 1233 YLTATIHLKSDVILEIQKNAAILGGTKQTDY-PKENDR-WYVVVAEDAXXXXXXXXXXXX 1060
            YLTA +HL+S V+L++ +NA +LGG +  DY P E    WYVVVA +A            
Sbjct: 80   YLTAKLHLRSGVVLDVTENAVLLGGPRIEDYYPAETSSDWYVVVANNATDVGITGGGAID 139

Query: 1059 XXGFEFVSRFDDKKNVMVSWNKTGDCLGDECRPRLVGFLRSRNVRVWDVNLLQPAYWCLH 880
              G +FV RFD+KKNVMVSWN+TG CLGDECRPRLVGF+ SRNV +W++ L +PAYWCLH
Sbjct: 140  GQGSKFVVRFDEKKNVMVSWNQTGACLGDECRPRLVGFVDSRNVEIWNITLREPAYWCLH 199

Query: 879  LVQCDNTSIHDVSIYGDFNTPXXXXXXXXXXXNTFITRCNINTGDDAICPKTYLGPLYNL 700
            +V+C+NTS+HDVSI GDFNTP           NT ITRC+I+TGDDAICPKTY GPLYNL
Sbjct: 200  IVRCENTSVHDVSILGDFNTPNNDGIDIEDSNNTVITRCHIDTGDDAICPKTYTGPLYNL 259

Query: 699  TATDCWIRTKSSAIKFGSASWFDFKGLVFNNITIVESHRGLGLQIRDGGNVSDVTFSNIN 520
            TATDCWIRTKSSAIK GSASWFDFKGLVF+NITI ESHRGLG+QIRDGGNVSD+TFSNIN
Sbjct: 260  TATDCWIRTKSSAIKLGSASWFDFKGLVFDNITIFESHRGLGMQIRDGGNVSDITFSNIN 319

Query: 519  ITTRYYDPSWWGRAEPIYVTTCPREETSKSGSISNILFVNITATSENGVFLSGCKGGILS 340
            I+TRYYDPSWWGRAEPIY+TTCPR+ ++K GSISN+LFVNIT  SENGVFLSG   G+LS
Sbjct: 320  ISTRYYDPSWWGRAEPIYITTCPRDSSAKEGSISNLLFVNITINSENGVFLSGSPNGLLS 379

Query: 339  NLKFENVNLTYRRWTNYMDGLVDYRPGCQGLVNHS-SAGFLMEHIDGLEAVNVNMKWSED 163
            ++KF+N+NLT RRW+NY  GLVDYRPGC+GLVNHS +AG +MEH++G    NV++KWS+D
Sbjct: 380  DIKFKNMNLTVRRWSNYSAGLVDYRPGCRGLVNHSATAGIIMEHVNGFSIENVDLKWSDD 439

Query: 162  HLGRWNTPLSFKPSTVNNISLINF 91
                WN PL F+PSTVNN+SL  F
Sbjct: 440  LNSGWNVPLEFRPSTVNNVSLFEF 463


>ref|XP_004161486.1| PREDICTED: exo-poly-alpha-D-galacturonosidase-like [Cucumis sativus]
          Length = 463

 Score =  585 bits (1509), Expect = e-165
 Identities = 276/440 (62%), Positives = 334/440 (75%), Gaps = 1/440 (0%)
 Frame = -2

Query: 1398 SSSPKIQLPQSTLTLSVRDYGAVGDGVHYDTPSIQSAVNDCHSLGGCHVIFPPGNYLTAT 1219
            S+ P I+L + + + SV D+GA+GDG+HYDT +IQSA+N C +   C+V FPPG YLTAT
Sbjct: 20   SAIPSIRLLRRSTSFSVTDFGAIGDGLHYDTTAIQSAINSCPAPSRCYVTFPPGTYLTAT 79

Query: 1218 IHLKSDVILEIQKNAAILGGTKQTDYPKENDRWYVVVAEDAXXXXXXXXXXXXXXGFEFV 1039
            I L+S V+L+IQ  A +L GTK  DYP ++ RW+ VVAE+A              G +FV
Sbjct: 80   IWLRSGVVLDIQPGATVLAGTKMEDYPADSSRWFAVVAENASDVGISGGGTVDGQGLKFV 139

Query: 1038 SRFDDKKNVMVSWNKTGDCLGDECRPRLVGFLRSRNVRVWDVNLLQPAYWCLHLVQCDNT 859
             +FD +KNVMVSWNKTG C GDECRP LVGF+ S  VRV +V+  QPA+WCLHLV+C+NT
Sbjct: 140  EKFDKRKNVMVSWNKTGACYGDECRPDLVGFIGSNKVRVSNVSFNQPAHWCLHLVRCENT 199

Query: 858  SIHDVSIYGDFNTPXXXXXXXXXXXNTFITRCNINTGDDAICPKTYLGPLYNLTATDCWI 679
             I DVSIYGDF+TP           NT ITRC I+TGDDAICPK+  GP++NLTAT+CWI
Sbjct: 200  VIEDVSIYGDFDTPNNDGIDIEDSNNTLITRCRIDTGDDAICPKSSNGPVFNLTATNCWI 259

Query: 678  RTKSSAIKFGSASWFDFKGLVFNNITIVESHRGLGLQIRDGGNVSDVTFSNINITTRYYD 499
            RTKSSAIK GSASWF+F  ++F+N+TIV+SHRGL  Q+RDGG+ +D+TFSNINITTRYYD
Sbjct: 260  RTKSSAIKLGSASWFNFTRMLFDNLTIVDSHRGLAFQLRDGGSANDITFSNINITTRYYD 319

Query: 498  PSWWGRAEPIYVTTCPREETSKSGSISNILFVNITATSENGVFLSGCKGGILSNLKFENV 319
            PSWWGRAEPIYVTTCPR+  SK GSISNI F+NITATSENGVFLSG K G+LSNL+F NV
Sbjct: 320  PSWWGRAEPIYVTTCPRDPGSKEGSISNIRFINITATSENGVFLSGSKSGVLSNLRFTNV 379

Query: 318  NLTYRRWTNYMDGLVDYRPGCQGLVNHSSAGFLMEHIDGLEAVNVNMKWSEDHLG-RWNT 142
             L Y+RWT Y  G+ DYRPGCQG V H  AG +MEHI+GL   NV+M W + +   +WN 
Sbjct: 380  KLRYKRWTKYGGGIADYRPGCQGFVKHGMAGMIMEHIEGLNLENVDMHWFDTNGSLQWNN 439

Query: 141  PLSFKPSTVNNISLINFHSG 82
            PL F+PSTVNNIS  NFHSG
Sbjct: 440  PLDFRPSTVNNISFFNFHSG 459


Top