BLASTX nr result

ID: Scutellaria23_contig00008184 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria23_contig00008184
         (1537 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002322693.1| predicted protein [Populus trichocarpa] gi|2...   678   0.0  
ref|XP_002878172.1| hypothetical protein ARALYDRAFT_324272 [Arab...   649   0.0  
ref|NP_567055.1| glycoside hydrolase family 28 protein / polygal...   648   0.0  
emb|CAB41176.1| putative protein [Arabidopsis thaliana]               648   0.0  
ref|XP_003554187.1| PREDICTED: exo-poly-alpha-D-galacturonosidas...   634   e-179

>ref|XP_002322693.1| predicted protein [Populus trichocarpa] gi|222867323|gb|EEF04454.1|
            predicted protein [Populus trichocarpa]
          Length = 445

 Score =  678 bits (1750), Expect = 0.0
 Identities = 318/435 (73%), Positives = 364/435 (83%)
 Frame = -3

Query: 1379 SLSVADYGAAGDGHQYDTVSIQSTIDDCASASARHHRPCEVKFPPGKYLTATLRLKSGVL 1200
            +LSV D+GA GDG  YDT +IQSTI+ C +      + C V FPPG YLTAT+ LKS V+
Sbjct: 12   TLSVTDFGAIGDGIHYDTEAIQSTINSCPTTPPT--KACHVNFPPGIYLTATIHLKSNVV 69

Query: 1199 LNLSRNATILGGTSLDDYPEKQEEWYVVVAENAXXXXXXXXXXXXXXGLKFVKRFDEKKN 1020
            LN+   AT+LGGT L+DYP++   WYVV+AENA              GLKFVKRF+E+KN
Sbjct: 70   LNIQEGATLLGGTKLEDYPKEFNRWYVVLAENASDVGITGGGVVDGQGLKFVKRFNERKN 129

Query: 1019 VMVSWNQTGACLGDECRPRLVGFIGCTNVRIWDINFNQPAYWCLHIVRCQNTSIHDVSIY 840
            VMVSWN TGACLGDECRPRLVGFIGCTNV++W++  ++PAYWCLHIV+C NT I DVSIY
Sbjct: 130  VMVSWNSTGACLGDECRPRLVGFIGCTNVKVWNVRLSEPAYWCLHIVQCLNTHISDVSIY 189

Query: 839  GDFNTPNNDGIDIEDSNNTLITRCTIDTGDDAICPKTYTGPIYNLTATNCWIKTKSSAIK 660
            GDFN+PNNDGIDIEDSNNTLITRC IDTGDDAICPKTYTGPIYNLTAT+CWI+TKSSAIK
Sbjct: 190  GDFNSPNNDGIDIEDSNNTLITRCHIDTGDDAICPKTYTGPIYNLTATDCWIRTKSSAIK 249

Query: 659  LGSASWYDFIGLVFDNITIVESHRGLGFQIRDGGNVSDVTFSNINISTRYYDESWWGRAE 480
            LGSASW++F GLVFDNITIV+SHRGLG QIRDGGNVSD+TFSNINISTRYYD SWWGRAE
Sbjct: 250  LGSASWFEFKGLVFDNITIVDSHRGLGLQIRDGGNVSDITFSNINISTRYYDPSWWGRAE 309

Query: 479  PIYITTCPRDCNSKAGSISNLQFINITATSENGVCLSGSEGGILRNLKFYNVNLTYRRWT 300
            PIY+TTCPR  +SK GSISNLQFINIT  SENGV LSGS+GG+L NL+F N+NLT+RRWT
Sbjct: 310  PIYVTTCPRHSSSKEGSISNLQFINITTNSENGVFLSGSKGGLLSNLRFINMNLTFRRWT 369

Query: 299  NYDDGLVDYRPGCQGLVNHSTAGFMMEHIDGLNLENVNMRWSEEKMDRWNNPLDFRPSTV 120
             Y  GLVDYRPGCQGLVNHS AG +MEHI+G  +ENVNMRWS+ + + W+NPLDFRPSTV
Sbjct: 370  TYPGGLVDYRPGCQGLVNHSAAGIIMEHIEGFEVENVNMRWSDYQNEPWDNPLDFRPSTV 429

Query: 119  NNISLLNFYSGLSQQ 75
            NNIS LNF+S L +Q
Sbjct: 430  NNISFLNFHSALYKQ 444


>ref|XP_002878172.1| hypothetical protein ARALYDRAFT_324272 [Arabidopsis lyrata subsp.
            lyrata] gi|297324010|gb|EFH54431.1| hypothetical protein
            ARALYDRAFT_324272 [Arabidopsis lyrata subsp. lyrata]
          Length = 589

 Score =  649 bits (1673), Expect = 0.0
 Identities = 316/480 (65%), Positives = 369/480 (76%), Gaps = 3/480 (0%)
 Frame = -3

Query: 1463 ILLLLVPTLIQSRSSTWPLSFPQSSEQLSLSVADYGAAGDGHQYDTVSIQSTIDDCASAS 1284
            +L+LL  +L+QSRS T         + L+LSV D+GA GDG  YDT ++QSTID C    
Sbjct: 5    LLILLFFSLVQSRSYTSYSKIQLPGDSLALSVTDFGATGDGINYDTSAVQSTIDACNRHY 64

Query: 1283 ARHHRPCEVKFPPGKYLTATLRLKSGVLLNLSRNATILGGTSLDDY--PEKQEEWYVVVA 1110
                  C V FP G YLTA L L+SGV+L+++ NA +LGG  ++DY   E   +WYVVVA
Sbjct: 65   TSSSSICRVTFPSGNYLTAKLHLRSGVVLDVTENAVLLGGPRIEDYYPAETSSDWYVVVA 124

Query: 1109 ENAXXXXXXXXXXXXXXGLKFVKRFDEKKNVMVSWNQTGACLGDECRPRLVGFIGCTNVR 930
             NA              G KFV RFDEKKNVMVSWNQTGACLGDECRPRLVGF+   NV 
Sbjct: 125  NNATDVGITGGGAIDGQGSKFVVRFDEKKNVMVSWNQTGACLGDECRPRLVGFVDSRNVE 184

Query: 929  IWDINFNQPAYWCLHIVRCQNTSIHDVSIYGDFNTPNNDGIDIEDSNNTLITRCTIDTGD 750
            IW+I   +PAYWCLHIVRC+NTS+HDVSI GDFNTPNNDGIDIEDSNNT+ITRC IDTGD
Sbjct: 185  IWNITLREPAYWCLHIVRCENTSVHDVSILGDFNTPNNDGIDIEDSNNTVITRCHIDTGD 244

Query: 749  DAICPKTYTGPIYNLTATNCWIKTKSSAIKLGSASWYDFIGLVFDNITIVESHRGLGFQI 570
            DAICPKTYTGP+YNLTAT+CWI+TKSSAIKLGSASW+DF GLVFDNITI ESHRGLG QI
Sbjct: 245  DAICPKTYTGPLYNLTATDCWIRTKSSAIKLGSASWFDFKGLVFDNITIFESHRGLGMQI 304

Query: 569  RDGGNVSDVTFSNINISTRYYDESWWGRAEPIYITTCPRDCNSKAGSISNLQFINITATS 390
            RDGGNVSD+TFSNINISTRYYD SWWGRAEPIYITTCPRD ++K GSISNL F+NIT  S
Sbjct: 305  RDGGNVSDITFSNINISTRYYDPSWWGRAEPIYITTCPRDSSAKEGSISNLLFVNITINS 364

Query: 389  ENGVCLSGSEGGILRNLKFYNVNLTYRRWTNYDDGLVDYRPGCQGLVNHS-TAGFMMEHI 213
            ENGV LSGS  G+L ++KF N+NLT RRW+NY  GLVDYRPGC+GLVNHS TAG +MEH+
Sbjct: 365  ENGVFLSGSPNGLLSDIKFKNMNLTVRRWSNYSAGLVDYRPGCRGLVNHSATAGIIMEHV 424

Query: 212  DGLNLENVNMRWSEEKMDRWNNPLDFRPSTVNNISLLNFYSGLSQQ*GRGKRCSFKGLFV 33
            +G ++ENV+++WS++    WN PL+FRPSTVNN+SL  F   +  + G  +  S K L V
Sbjct: 425  NGFSIENVDLKWSDDLNSGWNVPLEFRPSTVNNVSLFEFDYYVLSEIGITEMASTKPLIV 484


>ref|NP_567055.1| glycoside hydrolase family 28 protein / polygalacturonase (pectinase)
            family protein [Arabidopsis thaliana]
            gi|332646180|gb|AEE79701.1| glycoside hydrolase family 28
            protein / polygalacturonase (pectinase) family protein
            [Arabidopsis thaliana]
          Length = 490

 Score =  648 bits (1671), Expect = 0.0
 Identities = 315/465 (67%), Positives = 364/465 (78%), Gaps = 5/465 (1%)
 Frame = -3

Query: 1463 ILLLLVPTLIQSRSSTWPLSFPQSSEQLSLSVADYGAAGDGHQYDTVSIQSTIDDCASAS 1284
            +LLLL  +L+QSRS T         + L+LSV D+GA GDG  YDT +IQSTID C    
Sbjct: 6    LLLLLFFSLVQSRSDTSYSKIQLPGDSLTLSVTDFGATGDGINYDTSAIQSTIDACNRHY 65

Query: 1283 ARHHRPCEVKFPPGKYLTATLRLKSGVLLNLSRNATILGGTSLDDY--PEKQEEWYVVVA 1110
                  C V FP G YLTA L L+SGV+L+++ NA +LGG  ++DY   E   +WYVVVA
Sbjct: 66   TSFSSICRVVFPSGNYLTAKLHLRSGVILDVTENAVLLGGPRIEDYYPAETSSDWYVVVA 125

Query: 1109 ENAXXXXXXXXXXXXXXGLKFVKRFDEKKNVMVSWNQTGACLGDECRPRLVGFIGCTNVR 930
             NA              G KFV RFDEKKNVMVSWNQTGACLGDECRPRLVGF+   NV 
Sbjct: 126  NNATDVGITGGGAIDGQGSKFVVRFDEKKNVMVSWNQTGACLGDECRPRLVGFVDSINVE 185

Query: 929  IWDINFNQPAYWCLHIVRCQNTSIHDVSIYGDFNTPNNDGIDIEDSNNTLITRCTIDTGD 750
            IW+I   +PAYWCLHIVRC+NTS+HDVSI GDFNTPNNDGIDIEDSNNT+ITRC IDTGD
Sbjct: 186  IWNITLREPAYWCLHIVRCENTSVHDVSILGDFNTPNNDGIDIEDSNNTVITRCHIDTGD 245

Query: 749  DAICPKTYTGPIYNLTATNCWIKTKSSAIKLGSASWYDFIGLVFDNITIVESHRGLGFQI 570
            DAICPKTYTGP+YNLTAT+CWI+TKSSAIKLGSASW+DF GLVFDNITI ESHRGLG QI
Sbjct: 246  DAICPKTYTGPLYNLTATDCWIRTKSSAIKLGSASWFDFKGLVFDNITIFESHRGLGMQI 305

Query: 569  RDGGNVSDVTFSNINISTRYYDESWWGRAEPIYITTCPRDCNSKAGSISNLQFINITATS 390
            RDGGNVSDVTFSNINISTRYYD SWWGRAEPIYITTCPRD ++K GSISNL F+NIT  S
Sbjct: 306  RDGGNVSDVTFSNINISTRYYDPSWWGRAEPIYITTCPRDSSAKEGSISNLLFVNITIDS 365

Query: 389  ENGVCLSGSEGGILRNLKFYNVNLTYRRWTNYDDGLVDYRPGCQGLVNH-STAGFMMEHI 213
            ENGV LSGS  G+L ++KF N+NLT+RRW+NY  GLVDYRPGCQGLVNH +T+G +MEH+
Sbjct: 366  ENGVFLSGSPNGLLSDIKFKNMNLTFRRWSNYSAGLVDYRPGCQGLVNHRATSGIIMEHV 425

Query: 212  DGLNLENVNMRWSEEK--MDRWNNPLDFRPSTVNNISLLNFYSGL 84
            +G  +ENV+++WS++      WN PL+FRPSTVNN+S + F SGL
Sbjct: 426  NGFRVENVDLKWSDDDDVNAAWNVPLEFRPSTVNNVSFVGFTSGL 470


>emb|CAB41176.1| putative protein [Arabidopsis thaliana]
          Length = 614

 Score =  648 bits (1671), Expect = 0.0
 Identities = 315/465 (67%), Positives = 364/465 (78%), Gaps = 5/465 (1%)
 Frame = -3

Query: 1463 ILLLLVPTLIQSRSSTWPLSFPQSSEQLSLSVADYGAAGDGHQYDTVSIQSTIDDCASAS 1284
            +LLLL  +L+QSRS T         + L+LSV D+GA GDG  YDT +IQSTID C    
Sbjct: 6    LLLLLFFSLVQSRSDTSYSKIQLPGDSLTLSVTDFGATGDGINYDTSAIQSTIDACNRHY 65

Query: 1283 ARHHRPCEVKFPPGKYLTATLRLKSGVLLNLSRNATILGGTSLDDY--PEKQEEWYVVVA 1110
                  C V FP G YLTA L L+SGV+L+++ NA +LGG  ++DY   E   +WYVVVA
Sbjct: 66   TSFSSICRVVFPSGNYLTAKLHLRSGVILDVTENAVLLGGPRIEDYYPAETSSDWYVVVA 125

Query: 1109 ENAXXXXXXXXXXXXXXGLKFVKRFDEKKNVMVSWNQTGACLGDECRPRLVGFIGCTNVR 930
             NA              G KFV RFDEKKNVMVSWNQTGACLGDECRPRLVGF+   NV 
Sbjct: 126  NNATDVGITGGGAIDGQGSKFVVRFDEKKNVMVSWNQTGACLGDECRPRLVGFVDSINVE 185

Query: 929  IWDINFNQPAYWCLHIVRCQNTSIHDVSIYGDFNTPNNDGIDIEDSNNTLITRCTIDTGD 750
            IW+I   +PAYWCLHIVRC+NTS+HDVSI GDFNTPNNDGIDIEDSNNT+ITRC IDTGD
Sbjct: 186  IWNITLREPAYWCLHIVRCENTSVHDVSILGDFNTPNNDGIDIEDSNNTVITRCHIDTGD 245

Query: 749  DAICPKTYTGPIYNLTATNCWIKTKSSAIKLGSASWYDFIGLVFDNITIVESHRGLGFQI 570
            DAICPKTYTGP+YNLTAT+CWI+TKSSAIKLGSASW+DF GLVFDNITI ESHRGLG QI
Sbjct: 246  DAICPKTYTGPLYNLTATDCWIRTKSSAIKLGSASWFDFKGLVFDNITIFESHRGLGMQI 305

Query: 569  RDGGNVSDVTFSNINISTRYYDESWWGRAEPIYITTCPRDCNSKAGSISNLQFINITATS 390
            RDGGNVSDVTFSNINISTRYYD SWWGRAEPIYITTCPRD ++K GSISNL F+NIT  S
Sbjct: 306  RDGGNVSDVTFSNINISTRYYDPSWWGRAEPIYITTCPRDSSAKEGSISNLLFVNITIDS 365

Query: 389  ENGVCLSGSEGGILRNLKFYNVNLTYRRWTNYDDGLVDYRPGCQGLVNH-STAGFMMEHI 213
            ENGV LSGS  G+L ++KF N+NLT+RRW+NY  GLVDYRPGCQGLVNH +T+G +MEH+
Sbjct: 366  ENGVFLSGSPNGLLSDIKFKNMNLTFRRWSNYSAGLVDYRPGCQGLVNHRATSGIIMEHV 425

Query: 212  DGLNLENVNMRWSEEK--MDRWNNPLDFRPSTVNNISLLNFYSGL 84
            +G  +ENV+++WS++      WN PL+FRPSTVNN+S + F SGL
Sbjct: 426  NGFRVENVDLKWSDDDDVNAAWNVPLEFRPSTVNNVSFVGFTSGL 470


>ref|XP_003554187.1| PREDICTED: exo-poly-alpha-D-galacturonosidase-like [Glycine max]
          Length = 466

 Score =  634 bits (1636), Expect = e-179
 Identities = 306/461 (66%), Positives = 363/461 (78%), Gaps = 1/461 (0%)
 Frame = -3

Query: 1463 ILLLLVPTLIQSRSSTWPLSFPQSSEQLSLSVADYGAAGDGHQYDTVSIQSTIDDCASAS 1284
            ILLLL+  L+    S    S P +   ++LSVAD+GAAGDG +YDT +IQS I+ C    
Sbjct: 5    ILLLLLFFLLNP--SLLATSHPLNPIPVTLSVADFGAAGDGLRYDTEAIQSAINSCPEGD 62

Query: 1283 ARHHRPCEVKFP-PGKYLTATLRLKSGVLLNLSRNATILGGTSLDDYPEKQEEWYVVVAE 1107
                 PC V FP PGKYLTAT+ LKSGV+LN+   ATILGGT L+DYPE+   WYVVVAE
Sbjct: 63   -----PCHVTFPAPGKYLTATVFLKSGVVLNVESGATILGGTRLEDYPEESWRWYVVVAE 117

Query: 1106 NAXXXXXXXXXXXXXXGLKFVKRFDEKKNVMVSWNQTGACLGDECRPRLVGFIGCTNVRI 927
            NA                KFV R D +KNVMVSWNQTGACLGDECRPRL+GF+ C NV++
Sbjct: 118  NATDVGIRGGGAVDGQAAKFVVREDPRKNVMVSWNQTGACLGDECRPRLIGFLDCNNVQV 177

Query: 926  WDINFNQPAYWCLHIVRCQNTSIHDVSIYGDFNTPNNDGIDIEDSNNTLITRCTIDTGDD 747
             +I  NQPAYWCLH+VR  N  I D++IYGDFN PNNDGIDIEDSNNT+ITRC IDTGDD
Sbjct: 178  SNITLNQPAYWCLHLVRSNNICIQDIAIYGDFNIPNNDGIDIEDSNNTVITRCHIDTGDD 237

Query: 746  AICPKTYTGPIYNLTATNCWIKTKSSAIKLGSASWYDFIGLVFDNITIVESHRGLGFQIR 567
            AICPK+ TGP+YNLT T+CWI++KSSAIKLGSASW+DF   VFDNI IV+SHRG+GFQIR
Sbjct: 238  AICPKSSTGPVYNLTVTDCWIRSKSSAIKLGSASWFDFKHFVFDNIAIVDSHRGIGFQIR 297

Query: 566  DGGNVSDVTFSNINISTRYYDESWWGRAEPIYITTCPRDCNSKAGSISNLQFINITATSE 387
            DGGNVSD+ FSN+NISTRYYD  WWGRAEPIY+T+CPRD +SK  SISN+ FINITA SE
Sbjct: 298  DGGNVSDIVFSNMNISTRYYDSLWWGRAEPIYVTSCPRDSSSKEASISNVLFINITANSE 357

Query: 386  NGVCLSGSEGGILRNLKFYNVNLTYRRWTNYDDGLVDYRPGCQGLVNHSTAGFMMEHIDG 207
            NG+ LSGS+ G+LRNL+F ++++TYRR+TNY  GL+DYRPGCQ LV H TAG MMEHI+G
Sbjct: 358  NGIFLSGSKRGLLRNLRFIDMDITYRRFTNYAGGLLDYRPGCQELVKHRTAGIMMEHIEG 417

Query: 206  LNLENVNMRWSEEKMDRWNNPLDFRPSTVNNISLLNFYSGL 84
            L ++NV MRWS +++++WNNPL+FRPSTVNNIS LNF SGL
Sbjct: 418  LEVKNVEMRWSNDQLEQWNNPLEFRPSTVNNISFLNFNSGL 458


Top