BLASTX nr result

ID: Angelica23_contig00012135 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00012135
         (1540 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002322693.1| predicted protein [Populus trichocarpa] gi|2...   650   0.0  
ref|NP_567055.1| glycoside hydrolase family 28 protein / polygal...   611   e-172
emb|CAB41176.1| putative protein [Arabidopsis thaliana]               611   e-172
ref|XP_002878172.1| hypothetical protein ARALYDRAFT_324272 [Arab...   609   e-172
ref|XP_004161486.1| PREDICTED: exo-poly-alpha-D-galacturonosidas...   587   e-165

>ref|XP_002322693.1| predicted protein [Populus trichocarpa] gi|222867323|gb|EEF04454.1|
            predicted protein [Populus trichocarpa]
          Length = 445

 Score =  650 bits (1678), Expect = 0.0
 Identities = 308/443 (69%), Positives = 357/443 (80%), Gaps = 4/443 (0%)
 Frame = -3

Query: 1424 KIQLPQST-LTLSVRDYGAVGDGVHYDTPSIQSAVNDCHSLG---GCHVIFPPGNYLTAT 1257
            +IQLP  T  TLSV D+GA+GDG+HYDT +IQS +N C +      CHV FPPG YLTAT
Sbjct: 2    RIQLPHPTPTTLSVTDFGAIGDGIHYDTEAIQSTINSCPTTPPTKACHVNFPPGIYLTAT 61

Query: 1256 IHLKSDVILEVQKNATILGGTKQTDYPKENDRWYVVVAEDAXXXXXXXXXXXXXXGFEFV 1077
            IHLKS+V+L +Q+ AT+LGGTK  DYPKE +RWYVV+AE+A              G +FV
Sbjct: 62   IHLKSNVVLNIQEGATLLGGTKLEDYPKEFNRWYVVLAENASDVGITGGGVVDGQGLKFV 121

Query: 1076 SRFDDKKNVMVSWNKTGDCLGDECRPRLVGFLRSRNVRVWDVNLLQPAYWCLHLVQCDNT 897
             RF+++KNVMVSWN TG CLGDECRPRLVGF+   NV+VW+V L +PAYWCLH+VQC NT
Sbjct: 122  KRFNERKNVMVSWNSTGACLGDECRPRLVGFIGCTNVKVWNVRLSEPAYWCLHIVQCLNT 181

Query: 896  SIHDVSIYGDFNTPXXXXXXXXXXXNTFITRCNINTGDDAICPKTYLGPLYNLTATDCWI 717
             I DVSIYGDFN+P           NT ITRC+I+TGDDAICPKTY GP+YNLTATDCWI
Sbjct: 182  HISDVSIYGDFNSPNNDGIDIEDSNNTLITRCHIDTGDDAICPKTYTGPIYNLTATDCWI 241

Query: 716  RTKSSAIKFGSASWFDFKGLVFNNITIVESHRGLGLQIRDGGNVSDVTFSNINITTRYYD 537
            RTKSSAIK GSASWF+FKGLVF+NITIV+SHRGLGLQIRDGGNVSD+TFSNINI+TRYYD
Sbjct: 242  RTKSSAIKLGSASWFEFKGLVFDNITIVDSHRGLGLQIRDGGNVSDITFSNINISTRYYD 301

Query: 536  PSWWGRAEPIYVTTCPREETSKSGSISNILFVNITATSENGVFLSGCKGGILSNLKFENV 357
            PSWWGRAEPIYVTTCPR  +SK GSISN+ F+NIT  SENGVFLSG KGG+LSNL+F N+
Sbjct: 302  PSWWGRAEPIYVTTCPRHSSSKEGSISNLQFINITTNSENGVFLSGSKGGLLSNLRFINM 361

Query: 356  NLTYRRWTNYMDGLVDYRPGCQGLVNHSSAGFLMEHIDGLEAVNVNMKWSEDHLGRWNTP 177
            NLT+RRWT Y  GLVDYRPGCQGLVNHS+AG +MEHI+G E  NVNM+WS+     W+ P
Sbjct: 362  NLTFRRWTTYPGGLVDYRPGCQGLVNHSAAGIIMEHIEGFEVENVNMRWSDYQNEPWDNP 421

Query: 176  LSFKPSTVNNISLINFHSGLFKQ 108
            L F+PSTVNNIS +NFHS L+KQ
Sbjct: 422  LDFRPSTVNNISFLNFHSALYKQ 444


>ref|NP_567055.1| glycoside hydrolase family 28 protein / polygalacturonase (pectinase)
            family protein [Arabidopsis thaliana]
            gi|332646180|gb|AEE79701.1| glycoside hydrolase family 28
            protein / polygalacturonase (pectinase) family protein
            [Arabidopsis thaliana]
          Length = 490

 Score =  611 bits (1575), Expect = e-172
 Identities = 294/451 (65%), Positives = 353/451 (78%), Gaps = 10/451 (2%)
 Frame = -3

Query: 1436 SSSPKIQLPQSTLTLSVRDYGAVGDGVHYDTPSIQSAVNDCHS-----LGGCHVIFPPGN 1272
            +S  KIQLP  +LTLSV D+GA GDG++YDT +IQS ++ C+         C V+FP GN
Sbjct: 21   TSYSKIQLPGDSLTLSVTDFGATGDGINYDTSAIQSTIDACNRHYTSFSSICRVVFPSGN 80

Query: 1271 YLTATIHLKSDVILEVQKNATILGGTKQTDY-PKENDR-WYVVVAEDAXXXXXXXXXXXX 1098
            YLTA +HL+S VIL+V +NA +LGG +  DY P E    WYVVVA +A            
Sbjct: 81   YLTAKLHLRSGVILDVTENAVLLGGPRIEDYYPAETSSDWYVVVANNATDVGITGGGAID 140

Query: 1097 XXGFEFVSRFDDKKNVMVSWNKTGDCLGDECRPRLVGFLRSRNVRVWDVNLLQPAYWCLH 918
              G +FV RFD+KKNVMVSWN+TG CLGDECRPRLVGF+ S NV +W++ L +PAYWCLH
Sbjct: 141  GQGSKFVVRFDEKKNVMVSWNQTGACLGDECRPRLVGFVDSINVEIWNITLREPAYWCLH 200

Query: 917  LVQCDNTSIHDVSIYGDFNTPXXXXXXXXXXXNTFITRCNINTGDDAICPKTYLGPLYNL 738
            +V+C+NTS+HDVSI GDFNTP           NT ITRC+I+TGDDAICPKTY GPLYNL
Sbjct: 201  IVRCENTSVHDVSILGDFNTPNNDGIDIEDSNNTVITRCHIDTGDDAICPKTYTGPLYNL 260

Query: 737  TATDCWIRTKSSAIKFGSASWFDFKGLVFNNITIVESHRGLGLQIRDGGNVSDVTFSNIN 558
            TATDCWIRTKSSAIK GSASWFDFKGLVF+NITI ESHRGLG+QIRDGGNVSDVTFSNIN
Sbjct: 261  TATDCWIRTKSSAIKLGSASWFDFKGLVFDNITIFESHRGLGMQIRDGGNVSDVTFSNIN 320

Query: 557  ITTRYYDPSWWGRAEPIYVTTCPREETSKSGSISNILFVNITATSENGVFLSGCKGGILS 378
            I+TRYYDPSWWGRAEPIY+TTCPR+ ++K GSISN+LFVNIT  SENGVFLSG   G+LS
Sbjct: 321  ISTRYYDPSWWGRAEPIYITTCPRDSSAKEGSISNLLFVNITIDSENGVFLSGSPNGLLS 380

Query: 377  NLKFENVNLTYRRWTNYMDGLVDYRPGCQGLVNH-SSAGFLMEHIDGLEAVNVNMKWSED 201
            ++KF+N+NLT+RRW+NY  GLVDYRPGCQGLVNH +++G +MEH++G    NV++KWS+D
Sbjct: 381  DIKFKNMNLTFRRWSNYSAGLVDYRPGCQGLVNHRATSGIIMEHVNGFRVENVDLKWSDD 440

Query: 200  H--LGRWNTPLSFKPSTVNNISLINFHSGLF 114
                  WN PL F+PSTVNN+S + F SGL+
Sbjct: 441  DDVNAAWNVPLEFRPSTVNNVSFVGFTSGLY 471


>emb|CAB41176.1| putative protein [Arabidopsis thaliana]
          Length = 614

 Score =  611 bits (1575), Expect = e-172
 Identities = 294/451 (65%), Positives = 353/451 (78%), Gaps = 10/451 (2%)
 Frame = -3

Query: 1436 SSSPKIQLPQSTLTLSVRDYGAVGDGVHYDTPSIQSAVNDCHS-----LGGCHVIFPPGN 1272
            +S  KIQLP  +LTLSV D+GA GDG++YDT +IQS ++ C+         C V+FP GN
Sbjct: 21   TSYSKIQLPGDSLTLSVTDFGATGDGINYDTSAIQSTIDACNRHYTSFSSICRVVFPSGN 80

Query: 1271 YLTATIHLKSDVILEVQKNATILGGTKQTDY-PKENDR-WYVVVAEDAXXXXXXXXXXXX 1098
            YLTA +HL+S VIL+V +NA +LGG +  DY P E    WYVVVA +A            
Sbjct: 81   YLTAKLHLRSGVILDVTENAVLLGGPRIEDYYPAETSSDWYVVVANNATDVGITGGGAID 140

Query: 1097 XXGFEFVSRFDDKKNVMVSWNKTGDCLGDECRPRLVGFLRSRNVRVWDVNLLQPAYWCLH 918
              G +FV RFD+KKNVMVSWN+TG CLGDECRPRLVGF+ S NV +W++ L +PAYWCLH
Sbjct: 141  GQGSKFVVRFDEKKNVMVSWNQTGACLGDECRPRLVGFVDSINVEIWNITLREPAYWCLH 200

Query: 917  LVQCDNTSIHDVSIYGDFNTPXXXXXXXXXXXNTFITRCNINTGDDAICPKTYLGPLYNL 738
            +V+C+NTS+HDVSI GDFNTP           NT ITRC+I+TGDDAICPKTY GPLYNL
Sbjct: 201  IVRCENTSVHDVSILGDFNTPNNDGIDIEDSNNTVITRCHIDTGDDAICPKTYTGPLYNL 260

Query: 737  TATDCWIRTKSSAIKFGSASWFDFKGLVFNNITIVESHRGLGLQIRDGGNVSDVTFSNIN 558
            TATDCWIRTKSSAIK GSASWFDFKGLVF+NITI ESHRGLG+QIRDGGNVSDVTFSNIN
Sbjct: 261  TATDCWIRTKSSAIKLGSASWFDFKGLVFDNITIFESHRGLGMQIRDGGNVSDVTFSNIN 320

Query: 557  ITTRYYDPSWWGRAEPIYVTTCPREETSKSGSISNILFVNITATSENGVFLSGCKGGILS 378
            I+TRYYDPSWWGRAEPIY+TTCPR+ ++K GSISN+LFVNIT  SENGVFLSG   G+LS
Sbjct: 321  ISTRYYDPSWWGRAEPIYITTCPRDSSAKEGSISNLLFVNITIDSENGVFLSGSPNGLLS 380

Query: 377  NLKFENVNLTYRRWTNYMDGLVDYRPGCQGLVNH-SSAGFLMEHIDGLEAVNVNMKWSED 201
            ++KF+N+NLT+RRW+NY  GLVDYRPGCQGLVNH +++G +MEH++G    NV++KWS+D
Sbjct: 381  DIKFKNMNLTFRRWSNYSAGLVDYRPGCQGLVNHRATSGIIMEHVNGFRVENVDLKWSDD 440

Query: 200  H--LGRWNTPLSFKPSTVNNISLINFHSGLF 114
                  WN PL F+PSTVNN+S + F SGL+
Sbjct: 441  DDVNAAWNVPLEFRPSTVNNVSFVGFTSGLY 471


>ref|XP_002878172.1| hypothetical protein ARALYDRAFT_324272 [Arabidopsis lyrata subsp.
            lyrata] gi|297324010|gb|EFH54431.1| hypothetical protein
            ARALYDRAFT_324272 [Arabidopsis lyrata subsp. lyrata]
          Length = 589

 Score =  609 bits (1570), Expect = e-172
 Identities = 291/444 (65%), Positives = 348/444 (78%), Gaps = 8/444 (1%)
 Frame = -3

Query: 1436 SSSPKIQLPQSTLTLSVRDYGAVGDGVHYDTPSIQSAVNDCH-----SLGGCHVIFPPGN 1272
            +S  KIQLP  +L LSV D+GA GDG++YDT ++QS ++ C+     S   C V FP GN
Sbjct: 20   TSYSKIQLPGDSLALSVTDFGATGDGINYDTSAVQSTIDACNRHYTSSSSICRVTFPSGN 79

Query: 1271 YLTATIHLKSDVILEVQKNATILGGTKQTDY-PKENDR-WYVVVAEDAXXXXXXXXXXXX 1098
            YLTA +HL+S V+L+V +NA +LGG +  DY P E    WYVVVA +A            
Sbjct: 80   YLTAKLHLRSGVVLDVTENAVLLGGPRIEDYYPAETSSDWYVVVANNATDVGITGGGAID 139

Query: 1097 XXGFEFVSRFDDKKNVMVSWNKTGDCLGDECRPRLVGFLRSRNVRVWDVNLLQPAYWCLH 918
              G +FV RFD+KKNVMVSWN+TG CLGDECRPRLVGF+ SRNV +W++ L +PAYWCLH
Sbjct: 140  GQGSKFVVRFDEKKNVMVSWNQTGACLGDECRPRLVGFVDSRNVEIWNITLREPAYWCLH 199

Query: 917  LVQCDNTSIHDVSIYGDFNTPXXXXXXXXXXXNTFITRCNINTGDDAICPKTYLGPLYNL 738
            +V+C+NTS+HDVSI GDFNTP           NT ITRC+I+TGDDAICPKTY GPLYNL
Sbjct: 200  IVRCENTSVHDVSILGDFNTPNNDGIDIEDSNNTVITRCHIDTGDDAICPKTYTGPLYNL 259

Query: 737  TATDCWIRTKSSAIKFGSASWFDFKGLVFNNITIVESHRGLGLQIRDGGNVSDVTFSNIN 558
            TATDCWIRTKSSAIK GSASWFDFKGLVF+NITI ESHRGLG+QIRDGGNVSD+TFSNIN
Sbjct: 260  TATDCWIRTKSSAIKLGSASWFDFKGLVFDNITIFESHRGLGMQIRDGGNVSDITFSNIN 319

Query: 557  ITTRYYDPSWWGRAEPIYVTTCPREETSKSGSISNILFVNITATSENGVFLSGCKGGILS 378
            I+TRYYDPSWWGRAEPIY+TTCPR+ ++K GSISN+LFVNIT  SENGVFLSG   G+LS
Sbjct: 320  ISTRYYDPSWWGRAEPIYITTCPRDSSAKEGSISNLLFVNITINSENGVFLSGSPNGLLS 379

Query: 377  NLKFENVNLTYRRWTNYMDGLVDYRPGCQGLVNHS-SAGFLMEHIDGLEAVNVNMKWSED 201
            ++KF+N+NLT RRW+NY  GLVDYRPGC+GLVNHS +AG +MEH++G    NV++KWS+D
Sbjct: 380  DIKFKNMNLTVRRWSNYSAGLVDYRPGCRGLVNHSATAGIIMEHVNGFSIENVDLKWSDD 439

Query: 200  HLGRWNTPLSFKPSTVNNISLINF 129
                WN PL F+PSTVNN+SL  F
Sbjct: 440  LNSGWNVPLEFRPSTVNNVSLFEF 463


>ref|XP_004161486.1| PREDICTED: exo-poly-alpha-D-galacturonosidase-like [Cucumis sativus]
          Length = 463

 Score =  587 bits (1513), Expect = e-165
 Identities = 276/440 (62%), Positives = 335/440 (76%), Gaps = 1/440 (0%)
 Frame = -3

Query: 1436 SSSPKIQLPQSTLTLSVRDYGAVGDGVHYDTPSIQSAVNDCHSLGGCHVIFPPGNYLTAT 1257
            S+ P I+L + + + SV D+GA+GDG+HYDT +IQSA+N C +   C+V FPPG YLTAT
Sbjct: 20   SAIPSIRLLRRSTSFSVTDFGAIGDGLHYDTTAIQSAINSCPAPSRCYVTFPPGTYLTAT 79

Query: 1256 IHLKSDVILEVQKNATILGGTKQTDYPKENDRWYVVVAEDAXXXXXXXXXXXXXXGFEFV 1077
            I L+S V+L++Q  AT+L GTK  DYP ++ RW+ VVAE+A              G +FV
Sbjct: 80   IWLRSGVVLDIQPGATVLAGTKMEDYPADSSRWFAVVAENASDVGISGGGTVDGQGLKFV 139

Query: 1076 SRFDDKKNVMVSWNKTGDCLGDECRPRLVGFLRSRNVRVWDVNLLQPAYWCLHLVQCDNT 897
             +FD +KNVMVSWNKTG C GDECRP LVGF+ S  VRV +V+  QPA+WCLHLV+C+NT
Sbjct: 140  EKFDKRKNVMVSWNKTGACYGDECRPDLVGFIGSNKVRVSNVSFNQPAHWCLHLVRCENT 199

Query: 896  SIHDVSIYGDFNTPXXXXXXXXXXXNTFITRCNINTGDDAICPKTYLGPLYNLTATDCWI 717
             I DVSIYGDF+TP           NT ITRC I+TGDDAICPK+  GP++NLTAT+CWI
Sbjct: 200  VIEDVSIYGDFDTPNNDGIDIEDSNNTLITRCRIDTGDDAICPKSSNGPVFNLTATNCWI 259

Query: 716  RTKSSAIKFGSASWFDFKGLVFNNITIVESHRGLGLQIRDGGNVSDVTFSNINITTRYYD 537
            RTKSSAIK GSASWF+F  ++F+N+TIV+SHRGL  Q+RDGG+ +D+TFSNINITTRYYD
Sbjct: 260  RTKSSAIKLGSASWFNFTRMLFDNLTIVDSHRGLAFQLRDGGSANDITFSNINITTRYYD 319

Query: 536  PSWWGRAEPIYVTTCPREETSKSGSISNILFVNITATSENGVFLSGCKGGILSNLKFENV 357
            PSWWGRAEPIYVTTCPR+  SK GSISNI F+NITATSENGVFLSG K G+LSNL+F NV
Sbjct: 320  PSWWGRAEPIYVTTCPRDPGSKEGSISNIRFINITATSENGVFLSGSKSGVLSNLRFTNV 379

Query: 356  NLTYRRWTNYMDGLVDYRPGCQGLVNHSSAGFLMEHIDGLEAVNVNMKWSEDHLG-RWNT 180
             L Y+RWT Y  G+ DYRPGCQG V H  AG +MEHI+GL   NV+M W + +   +WN 
Sbjct: 380  KLRYKRWTKYGGGIADYRPGCQGFVKHGMAGMIMEHIEGLNLENVDMHWFDTNGSLQWNN 439

Query: 179  PLSFKPSTVNNISLINFHSG 120
            PL F+PSTVNNIS  NFHSG
Sbjct: 440  PLDFRPSTVNNISFFNFHSG 459


Top