BLASTX nr result
ID: Angelica23_contig00012135
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica23_contig00012135 (1540 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002322693.1| predicted protein [Populus trichocarpa] gi|2... 650 0.0 ref|NP_567055.1| glycoside hydrolase family 28 protein / polygal... 611 e-172 emb|CAB41176.1| putative protein [Arabidopsis thaliana] 611 e-172 ref|XP_002878172.1| hypothetical protein ARALYDRAFT_324272 [Arab... 609 e-172 ref|XP_004161486.1| PREDICTED: exo-poly-alpha-D-galacturonosidas... 587 e-165 >ref|XP_002322693.1| predicted protein [Populus trichocarpa] gi|222867323|gb|EEF04454.1| predicted protein [Populus trichocarpa] Length = 445 Score = 650 bits (1678), Expect = 0.0 Identities = 308/443 (69%), Positives = 357/443 (80%), Gaps = 4/443 (0%) Frame = -3 Query: 1424 KIQLPQST-LTLSVRDYGAVGDGVHYDTPSIQSAVNDCHSLG---GCHVIFPPGNYLTAT 1257 +IQLP T TLSV D+GA+GDG+HYDT +IQS +N C + CHV FPPG YLTAT Sbjct: 2 RIQLPHPTPTTLSVTDFGAIGDGIHYDTEAIQSTINSCPTTPPTKACHVNFPPGIYLTAT 61 Query: 1256 IHLKSDVILEVQKNATILGGTKQTDYPKENDRWYVVVAEDAXXXXXXXXXXXXXXGFEFV 1077 IHLKS+V+L +Q+ AT+LGGTK DYPKE +RWYVV+AE+A G +FV Sbjct: 62 IHLKSNVVLNIQEGATLLGGTKLEDYPKEFNRWYVVLAENASDVGITGGGVVDGQGLKFV 121 Query: 1076 SRFDDKKNVMVSWNKTGDCLGDECRPRLVGFLRSRNVRVWDVNLLQPAYWCLHLVQCDNT 897 RF+++KNVMVSWN TG CLGDECRPRLVGF+ NV+VW+V L +PAYWCLH+VQC NT Sbjct: 122 KRFNERKNVMVSWNSTGACLGDECRPRLVGFIGCTNVKVWNVRLSEPAYWCLHIVQCLNT 181 Query: 896 SIHDVSIYGDFNTPXXXXXXXXXXXNTFITRCNINTGDDAICPKTYLGPLYNLTATDCWI 717 I DVSIYGDFN+P NT ITRC+I+TGDDAICPKTY GP+YNLTATDCWI Sbjct: 182 HISDVSIYGDFNSPNNDGIDIEDSNNTLITRCHIDTGDDAICPKTYTGPIYNLTATDCWI 241 Query: 716 RTKSSAIKFGSASWFDFKGLVFNNITIVESHRGLGLQIRDGGNVSDVTFSNINITTRYYD 537 RTKSSAIK GSASWF+FKGLVF+NITIV+SHRGLGLQIRDGGNVSD+TFSNINI+TRYYD Sbjct: 242 RTKSSAIKLGSASWFEFKGLVFDNITIVDSHRGLGLQIRDGGNVSDITFSNINISTRYYD 301 Query: 536 PSWWGRAEPIYVTTCPREETSKSGSISNILFVNITATSENGVFLSGCKGGILSNLKFENV 357 PSWWGRAEPIYVTTCPR +SK GSISN+ F+NIT SENGVFLSG KGG+LSNL+F N+ Sbjct: 302 PSWWGRAEPIYVTTCPRHSSSKEGSISNLQFINITTNSENGVFLSGSKGGLLSNLRFINM 361 Query: 356 NLTYRRWTNYMDGLVDYRPGCQGLVNHSSAGFLMEHIDGLEAVNVNMKWSEDHLGRWNTP 177 NLT+RRWT Y GLVDYRPGCQGLVNHS+AG +MEHI+G E NVNM+WS+ W+ P Sbjct: 362 NLTFRRWTTYPGGLVDYRPGCQGLVNHSAAGIIMEHIEGFEVENVNMRWSDYQNEPWDNP 421 Query: 176 LSFKPSTVNNISLINFHSGLFKQ 108 L F+PSTVNNIS +NFHS L+KQ Sbjct: 422 LDFRPSTVNNISFLNFHSALYKQ 444 >ref|NP_567055.1| glycoside hydrolase family 28 protein / polygalacturonase (pectinase) family protein [Arabidopsis thaliana] gi|332646180|gb|AEE79701.1| glycoside hydrolase family 28 protein / polygalacturonase (pectinase) family protein [Arabidopsis thaliana] Length = 490 Score = 611 bits (1575), Expect = e-172 Identities = 294/451 (65%), Positives = 353/451 (78%), Gaps = 10/451 (2%) Frame = -3 Query: 1436 SSSPKIQLPQSTLTLSVRDYGAVGDGVHYDTPSIQSAVNDCHS-----LGGCHVIFPPGN 1272 +S KIQLP +LTLSV D+GA GDG++YDT +IQS ++ C+ C V+FP GN Sbjct: 21 TSYSKIQLPGDSLTLSVTDFGATGDGINYDTSAIQSTIDACNRHYTSFSSICRVVFPSGN 80 Query: 1271 YLTATIHLKSDVILEVQKNATILGGTKQTDY-PKENDR-WYVVVAEDAXXXXXXXXXXXX 1098 YLTA +HL+S VIL+V +NA +LGG + DY P E WYVVVA +A Sbjct: 81 YLTAKLHLRSGVILDVTENAVLLGGPRIEDYYPAETSSDWYVVVANNATDVGITGGGAID 140 Query: 1097 XXGFEFVSRFDDKKNVMVSWNKTGDCLGDECRPRLVGFLRSRNVRVWDVNLLQPAYWCLH 918 G +FV RFD+KKNVMVSWN+TG CLGDECRPRLVGF+ S NV +W++ L +PAYWCLH Sbjct: 141 GQGSKFVVRFDEKKNVMVSWNQTGACLGDECRPRLVGFVDSINVEIWNITLREPAYWCLH 200 Query: 917 LVQCDNTSIHDVSIYGDFNTPXXXXXXXXXXXNTFITRCNINTGDDAICPKTYLGPLYNL 738 +V+C+NTS+HDVSI GDFNTP NT ITRC+I+TGDDAICPKTY GPLYNL Sbjct: 201 IVRCENTSVHDVSILGDFNTPNNDGIDIEDSNNTVITRCHIDTGDDAICPKTYTGPLYNL 260 Query: 737 TATDCWIRTKSSAIKFGSASWFDFKGLVFNNITIVESHRGLGLQIRDGGNVSDVTFSNIN 558 TATDCWIRTKSSAIK GSASWFDFKGLVF+NITI ESHRGLG+QIRDGGNVSDVTFSNIN Sbjct: 261 TATDCWIRTKSSAIKLGSASWFDFKGLVFDNITIFESHRGLGMQIRDGGNVSDVTFSNIN 320 Query: 557 ITTRYYDPSWWGRAEPIYVTTCPREETSKSGSISNILFVNITATSENGVFLSGCKGGILS 378 I+TRYYDPSWWGRAEPIY+TTCPR+ ++K GSISN+LFVNIT SENGVFLSG G+LS Sbjct: 321 ISTRYYDPSWWGRAEPIYITTCPRDSSAKEGSISNLLFVNITIDSENGVFLSGSPNGLLS 380 Query: 377 NLKFENVNLTYRRWTNYMDGLVDYRPGCQGLVNH-SSAGFLMEHIDGLEAVNVNMKWSED 201 ++KF+N+NLT+RRW+NY GLVDYRPGCQGLVNH +++G +MEH++G NV++KWS+D Sbjct: 381 DIKFKNMNLTFRRWSNYSAGLVDYRPGCQGLVNHRATSGIIMEHVNGFRVENVDLKWSDD 440 Query: 200 H--LGRWNTPLSFKPSTVNNISLINFHSGLF 114 WN PL F+PSTVNN+S + F SGL+ Sbjct: 441 DDVNAAWNVPLEFRPSTVNNVSFVGFTSGLY 471 >emb|CAB41176.1| putative protein [Arabidopsis thaliana] Length = 614 Score = 611 bits (1575), Expect = e-172 Identities = 294/451 (65%), Positives = 353/451 (78%), Gaps = 10/451 (2%) Frame = -3 Query: 1436 SSSPKIQLPQSTLTLSVRDYGAVGDGVHYDTPSIQSAVNDCHS-----LGGCHVIFPPGN 1272 +S KIQLP +LTLSV D+GA GDG++YDT +IQS ++ C+ C V+FP GN Sbjct: 21 TSYSKIQLPGDSLTLSVTDFGATGDGINYDTSAIQSTIDACNRHYTSFSSICRVVFPSGN 80 Query: 1271 YLTATIHLKSDVILEVQKNATILGGTKQTDY-PKENDR-WYVVVAEDAXXXXXXXXXXXX 1098 YLTA +HL+S VIL+V +NA +LGG + DY P E WYVVVA +A Sbjct: 81 YLTAKLHLRSGVILDVTENAVLLGGPRIEDYYPAETSSDWYVVVANNATDVGITGGGAID 140 Query: 1097 XXGFEFVSRFDDKKNVMVSWNKTGDCLGDECRPRLVGFLRSRNVRVWDVNLLQPAYWCLH 918 G +FV RFD+KKNVMVSWN+TG CLGDECRPRLVGF+ S NV +W++ L +PAYWCLH Sbjct: 141 GQGSKFVVRFDEKKNVMVSWNQTGACLGDECRPRLVGFVDSINVEIWNITLREPAYWCLH 200 Query: 917 LVQCDNTSIHDVSIYGDFNTPXXXXXXXXXXXNTFITRCNINTGDDAICPKTYLGPLYNL 738 +V+C+NTS+HDVSI GDFNTP NT ITRC+I+TGDDAICPKTY GPLYNL Sbjct: 201 IVRCENTSVHDVSILGDFNTPNNDGIDIEDSNNTVITRCHIDTGDDAICPKTYTGPLYNL 260 Query: 737 TATDCWIRTKSSAIKFGSASWFDFKGLVFNNITIVESHRGLGLQIRDGGNVSDVTFSNIN 558 TATDCWIRTKSSAIK GSASWFDFKGLVF+NITI ESHRGLG+QIRDGGNVSDVTFSNIN Sbjct: 261 TATDCWIRTKSSAIKLGSASWFDFKGLVFDNITIFESHRGLGMQIRDGGNVSDVTFSNIN 320 Query: 557 ITTRYYDPSWWGRAEPIYVTTCPREETSKSGSISNILFVNITATSENGVFLSGCKGGILS 378 I+TRYYDPSWWGRAEPIY+TTCPR+ ++K GSISN+LFVNIT SENGVFLSG G+LS Sbjct: 321 ISTRYYDPSWWGRAEPIYITTCPRDSSAKEGSISNLLFVNITIDSENGVFLSGSPNGLLS 380 Query: 377 NLKFENVNLTYRRWTNYMDGLVDYRPGCQGLVNH-SSAGFLMEHIDGLEAVNVNMKWSED 201 ++KF+N+NLT+RRW+NY GLVDYRPGCQGLVNH +++G +MEH++G NV++KWS+D Sbjct: 381 DIKFKNMNLTFRRWSNYSAGLVDYRPGCQGLVNHRATSGIIMEHVNGFRVENVDLKWSDD 440 Query: 200 H--LGRWNTPLSFKPSTVNNISLINFHSGLF 114 WN PL F+PSTVNN+S + F SGL+ Sbjct: 441 DDVNAAWNVPLEFRPSTVNNVSFVGFTSGLY 471 >ref|XP_002878172.1| hypothetical protein ARALYDRAFT_324272 [Arabidopsis lyrata subsp. lyrata] gi|297324010|gb|EFH54431.1| hypothetical protein ARALYDRAFT_324272 [Arabidopsis lyrata subsp. lyrata] Length = 589 Score = 609 bits (1570), Expect = e-172 Identities = 291/444 (65%), Positives = 348/444 (78%), Gaps = 8/444 (1%) Frame = -3 Query: 1436 SSSPKIQLPQSTLTLSVRDYGAVGDGVHYDTPSIQSAVNDCH-----SLGGCHVIFPPGN 1272 +S KIQLP +L LSV D+GA GDG++YDT ++QS ++ C+ S C V FP GN Sbjct: 20 TSYSKIQLPGDSLALSVTDFGATGDGINYDTSAVQSTIDACNRHYTSSSSICRVTFPSGN 79 Query: 1271 YLTATIHLKSDVILEVQKNATILGGTKQTDY-PKENDR-WYVVVAEDAXXXXXXXXXXXX 1098 YLTA +HL+S V+L+V +NA +LGG + DY P E WYVVVA +A Sbjct: 80 YLTAKLHLRSGVVLDVTENAVLLGGPRIEDYYPAETSSDWYVVVANNATDVGITGGGAID 139 Query: 1097 XXGFEFVSRFDDKKNVMVSWNKTGDCLGDECRPRLVGFLRSRNVRVWDVNLLQPAYWCLH 918 G +FV RFD+KKNVMVSWN+TG CLGDECRPRLVGF+ SRNV +W++ L +PAYWCLH Sbjct: 140 GQGSKFVVRFDEKKNVMVSWNQTGACLGDECRPRLVGFVDSRNVEIWNITLREPAYWCLH 199 Query: 917 LVQCDNTSIHDVSIYGDFNTPXXXXXXXXXXXNTFITRCNINTGDDAICPKTYLGPLYNL 738 +V+C+NTS+HDVSI GDFNTP NT ITRC+I+TGDDAICPKTY GPLYNL Sbjct: 200 IVRCENTSVHDVSILGDFNTPNNDGIDIEDSNNTVITRCHIDTGDDAICPKTYTGPLYNL 259 Query: 737 TATDCWIRTKSSAIKFGSASWFDFKGLVFNNITIVESHRGLGLQIRDGGNVSDVTFSNIN 558 TATDCWIRTKSSAIK GSASWFDFKGLVF+NITI ESHRGLG+QIRDGGNVSD+TFSNIN Sbjct: 260 TATDCWIRTKSSAIKLGSASWFDFKGLVFDNITIFESHRGLGMQIRDGGNVSDITFSNIN 319 Query: 557 ITTRYYDPSWWGRAEPIYVTTCPREETSKSGSISNILFVNITATSENGVFLSGCKGGILS 378 I+TRYYDPSWWGRAEPIY+TTCPR+ ++K GSISN+LFVNIT SENGVFLSG G+LS Sbjct: 320 ISTRYYDPSWWGRAEPIYITTCPRDSSAKEGSISNLLFVNITINSENGVFLSGSPNGLLS 379 Query: 377 NLKFENVNLTYRRWTNYMDGLVDYRPGCQGLVNHS-SAGFLMEHIDGLEAVNVNMKWSED 201 ++KF+N+NLT RRW+NY GLVDYRPGC+GLVNHS +AG +MEH++G NV++KWS+D Sbjct: 380 DIKFKNMNLTVRRWSNYSAGLVDYRPGCRGLVNHSATAGIIMEHVNGFSIENVDLKWSDD 439 Query: 200 HLGRWNTPLSFKPSTVNNISLINF 129 WN PL F+PSTVNN+SL F Sbjct: 440 LNSGWNVPLEFRPSTVNNVSLFEF 463 >ref|XP_004161486.1| PREDICTED: exo-poly-alpha-D-galacturonosidase-like [Cucumis sativus] Length = 463 Score = 587 bits (1513), Expect = e-165 Identities = 276/440 (62%), Positives = 335/440 (76%), Gaps = 1/440 (0%) Frame = -3 Query: 1436 SSSPKIQLPQSTLTLSVRDYGAVGDGVHYDTPSIQSAVNDCHSLGGCHVIFPPGNYLTAT 1257 S+ P I+L + + + SV D+GA+GDG+HYDT +IQSA+N C + C+V FPPG YLTAT Sbjct: 20 SAIPSIRLLRRSTSFSVTDFGAIGDGLHYDTTAIQSAINSCPAPSRCYVTFPPGTYLTAT 79 Query: 1256 IHLKSDVILEVQKNATILGGTKQTDYPKENDRWYVVVAEDAXXXXXXXXXXXXXXGFEFV 1077 I L+S V+L++Q AT+L GTK DYP ++ RW+ VVAE+A G +FV Sbjct: 80 IWLRSGVVLDIQPGATVLAGTKMEDYPADSSRWFAVVAENASDVGISGGGTVDGQGLKFV 139 Query: 1076 SRFDDKKNVMVSWNKTGDCLGDECRPRLVGFLRSRNVRVWDVNLLQPAYWCLHLVQCDNT 897 +FD +KNVMVSWNKTG C GDECRP LVGF+ S VRV +V+ QPA+WCLHLV+C+NT Sbjct: 140 EKFDKRKNVMVSWNKTGACYGDECRPDLVGFIGSNKVRVSNVSFNQPAHWCLHLVRCENT 199 Query: 896 SIHDVSIYGDFNTPXXXXXXXXXXXNTFITRCNINTGDDAICPKTYLGPLYNLTATDCWI 717 I DVSIYGDF+TP NT ITRC I+TGDDAICPK+ GP++NLTAT+CWI Sbjct: 200 VIEDVSIYGDFDTPNNDGIDIEDSNNTLITRCRIDTGDDAICPKSSNGPVFNLTATNCWI 259 Query: 716 RTKSSAIKFGSASWFDFKGLVFNNITIVESHRGLGLQIRDGGNVSDVTFSNINITTRYYD 537 RTKSSAIK GSASWF+F ++F+N+TIV+SHRGL Q+RDGG+ +D+TFSNINITTRYYD Sbjct: 260 RTKSSAIKLGSASWFNFTRMLFDNLTIVDSHRGLAFQLRDGGSANDITFSNINITTRYYD 319 Query: 536 PSWWGRAEPIYVTTCPREETSKSGSISNILFVNITATSENGVFLSGCKGGILSNLKFENV 357 PSWWGRAEPIYVTTCPR+ SK GSISNI F+NITATSENGVFLSG K G+LSNL+F NV Sbjct: 320 PSWWGRAEPIYVTTCPRDPGSKEGSISNIRFINITATSENGVFLSGSKSGVLSNLRFTNV 379 Query: 356 NLTYRRWTNYMDGLVDYRPGCQGLVNHSSAGFLMEHIDGLEAVNVNMKWSEDHLG-RWNT 180 L Y+RWT Y G+ DYRPGCQG V H AG +MEHI+GL NV+M W + + +WN Sbjct: 380 KLRYKRWTKYGGGIADYRPGCQGFVKHGMAGMIMEHIEGLNLENVDMHWFDTNGSLQWNN 439 Query: 179 PLSFKPSTVNNISLINFHSG 120 PL F+PSTVNNIS NFHSG Sbjct: 440 PLDFRPSTVNNISFFNFHSG 459