BLASTX nr result

ID: Scutellaria23_contig00018587

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria23_contig00018587
         (1893 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002298934.1| predicted protein [Populus trichocarpa] gi|2...   696   0.0  
ref|XP_002317004.1| predicted protein [Populus trichocarpa] gi|2...   691   0.0  
ref|XP_003546570.1| PREDICTED: endoplasmic oxidoreductin-2-like ...   665   0.0  
ref|XP_002888866.1| hypothetical protein ARALYDRAFT_476357 [Arab...   664   0.0  
ref|NP_177372.1| endoplasmic oxidoreductin-1 [Arabidopsis thalia...   663   0.0  

>ref|XP_002298934.1| predicted protein [Populus trichocarpa] gi|222846192|gb|EEE83739.1|
            predicted protein [Populus trichocarpa]
          Length = 481

 Score =  696 bits (1796), Expect = 0.0
 Identities = 333/425 (78%), Positives = 366/425 (86%), Gaps = 2/425 (0%)
 Frame = -3

Query: 1603 AQDHKSCQCSQGS-RKYTGIVEDCCCDYETVDSLNGAVLHPLLQELVRTPFFRYFKVKLW 1427
            + ++KSCQCS    RKY G++EDCCCDYE+VDS+NG VLHPLLQELV TPFFRYFKVKLW
Sbjct: 56   SSNYKSCQCSSAQVRKYKGMIEDCCCDYESVDSVNGEVLHPLLQELVTTPFFRYFKVKLW 115

Query: 1426 CDCPFWPDDGMCKLRDCSVCECPDNEFPEPFKKPMQYGLSSDDLKCQEGKPQAAVDRTLD 1247
            CDCPFWPDDGMC+LRDCSVCECP+NEFPEPFKKP + GLS+DDL CQEGKPQAAVDRTLD
Sbjct: 116  CDCPFWPDDGMCRLRDCSVCECPENEFPEPFKKPFRRGLSADDLMCQEGKPQAAVDRTLD 175

Query: 1246 SKAFRGWMEVDNPWTNDDETDNSEMTYVNLQLNPERYTGYTGPSARRIWDAIYSENCPKY 1067
            S+AFRGW+  DNPWTNDDETDN E+TYVNL LNPERYTGY G SARRIWDA+YSENCPKY
Sbjct: 176  SRAFRGWIVTDNPWTNDDETDNGELTYVNLLLNPERYTGYAGSSARRIWDAVYSENCPKY 235

Query: 1066 TSGEICQEKRVLYKXXXXXXXXXXXXIAAEYLLDEAKNQWGRNLELMYDRVLRYPDRVRN 887
             SGEICQEK+VLYK            IAA+YLLDE+ N+WG+NLELMYDRVLRYPDRVRN
Sbjct: 236  ASGEICQEKKVLYKLISGLHSSISIHIAADYLLDESTNKWGQNLELMYDRVLRYPDRVRN 295

Query: 886  LYFTFMFVLRAVTKAANYLEQAEYNTGNLEEDLKAQSLVRQLLYNPKLQAACPLPFDEAK 707
            LYFTF+FVLRA+TKAA+YLEQAEY+TGN  EDLK QSLVRQLLYNPKLQAACPLPFDEAK
Sbjct: 296  LYFTFLFVLRAMTKAADYLEQAEYDTGNNTEDLKTQSLVRQLLYNPKLQAACPLPFDEAK 355

Query: 706  LWQGQSGPELKQEIQKNFRNISAVMDCVGCEKCRLWGKLQVLGLGTALKILFSVDSNNHP 527
            LWQGQSGPELKQ+IQK FRNISA+MDCVGCEKCRLWGKLQVLGLGTALKILFSVD  N P
Sbjct: 356  LWQGQSGPELKQQIQKQFRNISALMDCVGCEKCRLWGKLQVLGLGTALKILFSVDGQNQP 415

Query: 526  DTPLQLQRNEVIALVNLLNRLSESVKLVHEIGPSVEKTME-EFTSEPPTREMSLPQRAWE 350
                QLQRNEVIALVNLLNRLSESVK V E GPS+EK ME + +     +  S  QRA E
Sbjct: 416  SESPQLQRNEVIALVNLLNRLSESVKFVREQGPSIEKIMERQISDSSEPKHGSKWQRAGE 475

Query: 349  AVGRL 335
            ++ +L
Sbjct: 476  SLFQL 480


>ref|XP_002317004.1| predicted protein [Populus trichocarpa] gi|222860069|gb|EEE97616.1|
            predicted protein [Populus trichocarpa]
          Length = 470

 Score =  691 bits (1783), Expect = 0.0
 Identities = 325/405 (80%), Positives = 355/405 (87%), Gaps = 2/405 (0%)
 Frame = -3

Query: 1597 DHKSCQC--SQGSRKYTGIVEDCCCDYETVDSLNGAVLHPLLQELVRTPFFRYFKVKLWC 1424
            ++KSCQC  SQ S KY G++EDCCCDYE+VDS+NG VLHPLLQELV TPFFRYFKVKLWC
Sbjct: 58   NNKSCQCPSSQDSGKYKGVIEDCCCDYESVDSVNGEVLHPLLQELVTTPFFRYFKVKLWC 117

Query: 1423 DCPFWPDDGMCKLRDCSVCECPDNEFPEPFKKPMQYGLSSDDLKCQEGKPQAAVDRTLDS 1244
            DCPFWPDDGMC+LRDCSVCECP+NEFPEP KKP  YGL +DD+ CQEG PQAAVDRTLD 
Sbjct: 118  DCPFWPDDGMCRLRDCSVCECPENEFPEPLKKPFLYGLPADDVACQEGNPQAAVDRTLDR 177

Query: 1243 KAFRGWMEVDNPWTNDDETDNSEMTYVNLQLNPERYTGYTGPSARRIWDAIYSENCPKYT 1064
            +AF+GW+E DNPWTNDDETDN EMTYVNL LNPERYTGY GPSARRIWDA+YSENCPKY 
Sbjct: 178  RAFKGWIETDNPWTNDDETDNDEMTYVNLLLNPERYTGYVGPSARRIWDAVYSENCPKYP 237

Query: 1063 SGEICQEKRVLYKXXXXXXXXXXXXIAAEYLLDEAKNQWGRNLELMYDRVLRYPDRVRNL 884
            SGE+CQEK+VLYK            IA +YLLDE+ N+WG+N ELMYDRVLRYPDRVRNL
Sbjct: 238  SGEMCQEKKVLYKLISGLHSSISIHIAVDYLLDESTNKWGQNPELMYDRVLRYPDRVRNL 297

Query: 883  YFTFMFVLRAVTKAANYLEQAEYNTGNLEEDLKAQSLVRQLLYNPKLQAACPLPFDEAKL 704
            YFTF+FVLRAVTKAA+YLEQAEY+TGN  EDL+ QSLVRQLL+NPKLQAACPLPFDEAKL
Sbjct: 298  YFTFLFVLRAVTKAADYLEQAEYDTGNHTEDLETQSLVRQLLHNPKLQAACPLPFDEAKL 357

Query: 703  WQGQSGPELKQEIQKNFRNISAVMDCVGCEKCRLWGKLQVLGLGTALKILFSVDSNNHPD 524
            WQGQSGPELKQ+IQK FRNISA+MDCVGCEKCRLWGKLQVLGLGTALKILFSVD  N P 
Sbjct: 358  WQGQSGPELKQQIQKQFRNISALMDCVGCEKCRLWGKLQVLGLGTALKILFSVDGQNQPS 417

Query: 523  TPLQLQRNEVIALVNLLNRLSESVKLVHEIGPSVEKTMEEFTSEP 389
              LQLQRNEVIALVNLLNRLSES+K V E GPS+EKTME   S+P
Sbjct: 418  ESLQLQRNEVIALVNLLNRLSESIKYVCEQGPSIEKTMERQISDP 462


>ref|XP_003546570.1| PREDICTED: endoplasmic oxidoreductin-2-like [Glycine max]
          Length = 465

 Score =  665 bits (1715), Expect = 0.0
 Identities = 315/419 (75%), Positives = 355/419 (84%), Gaps = 1/419 (0%)
 Frame = -3

Query: 1591 KSCQCSQGSRKYTGIVEDCCCDYETVDSLNGAVLHPLLQELVRTPFFRYFKVKLWCDCPF 1412
            ++C C++G+ KY+G+VEDCCCDYETVD LN  VLHP LQELV+TPFFRYFKVKLWCDCPF
Sbjct: 47   RACPCARGTPKYSGMVEDCCCDYETVDRLNEEVLHPSLQELVKTPFFRYFKVKLWCDCPF 106

Query: 1411 WPDDGMCKLRDCSVCECPDNEFPEPFKKPMQYGLSSDDLKCQEGKPQAAVDRTLDSKAFR 1232
            WPDDGMC+LRDCSVCECP+NEFPE FKKP +  LS  DL CQEGKPQAAVDRTLDSKAFR
Sbjct: 107  WPDDGMCRLRDCSVCECPENEFPESFKKPDRR-LSMTDLVCQEGKPQAAVDRTLDSKAFR 165

Query: 1231 GWMEVDNPWTNDDETDNSEMTYVNLQLNPERYTGYTGPSARRIWDAIYSENCPKYTSGEI 1052
            GW E+DNPWTNDDETDN EMTYVNLQLNPERYTGYTGPSARRIWDA+YSENCPKY S E+
Sbjct: 166  GWTEIDNPWTNDDETDNDEMTYVNLQLNPERYTGYTGPSARRIWDAVYSENCPKYPSQEL 225

Query: 1051 CQEKRVLYKXXXXXXXXXXXXIAAEYLLDEAKNQWGRNLELMYDRVLRYPDRVRNLYFTF 872
            CQE+++LYK            IA++YLL+EA N WG+NL LMYDRVLRYPDRVRNLYFTF
Sbjct: 226  CQEEKILYKLISGLHSSISIHIASDYLLEEATNLWGQNLTLMYDRVLRYPDRVRNLYFTF 285

Query: 871  MFVLRAVTKAANYLEQAEYNTGNLEEDLKAQSLVRQLLYNPKLQAACPLPFDEAKLWQGQ 692
            +FVLRAVTKA++YLEQAEY+TGN  EDL  QSL++QLLYNPKLQAACP+PFDEA LW+GQ
Sbjct: 286  LFVLRAVTKASDYLEQAEYDTGNPNEDLTTQSLIKQLLYNPKLQAACPIPFDEANLWKGQ 345

Query: 691  SGPELKQEIQKNFRNISAVMDCVGCEKCRLWGKLQVLGLGTALKILFSVDSNNHPDTPLQ 512
            SGPELKQ+IQ+ FRNISA+MDCVGCEKCRLWGKLQVLGLGTALKILFSVD   +    LQ
Sbjct: 346  SGPELKQKIQQQFRNISALMDCVGCEKCRLWGKLQVLGLGTALKILFSVDGQENSSHTLQ 405

Query: 511  LQRNEVIALVNLLNRLSESVKLVHEIGPSVEKTMEEFTSEPPTREM-SLPQRAWEAVGR 338
            LQRNEVIAL NLLNRLSESVK VHE+GP+ E+ ME       TR + S  ++ W  V +
Sbjct: 406  LQRNEVIALTNLLNRLSESVKFVHEVGPTAERIMEGGHFSAHTRTLISSWKKIWSYVSK 464


>ref|XP_002888866.1| hypothetical protein ARALYDRAFT_476357 [Arabidopsis lyrata subsp.
            lyrata] gi|297334707|gb|EFH65125.1| hypothetical protein
            ARALYDRAFT_476357 [Arabidopsis lyrata subsp. lyrata]
          Length = 469

 Score =  664 bits (1712), Expect = 0.0
 Identities = 310/417 (74%), Positives = 357/417 (85%), Gaps = 1/417 (0%)
 Frame = -3

Query: 1597 DHKSCQCS-QGSRKYTGIVEDCCCDYETVDSLNGAVLHPLLQELVRTPFFRYFKVKLWCD 1421
            D  SC C  QG+ KY G+VEDCCCDYETVD+LN  VL+PLLQ+LV TPFFRY+KVKLWCD
Sbjct: 48   DRNSCSCLLQGTGKYKGMVEDCCCDYETVDNLNSEVLNPLLQDLVTTPFFRYYKVKLWCD 107

Query: 1420 CPFWPDDGMCKLRDCSVCECPDNEFPEPFKKPMQYGLSSDDLKCQEGKPQAAVDRTLDSK 1241
            CPFWPDDGMC+LRDCSVCECP+NEFPEPFKKP   GL SDDL CQEGKPQ AVDRT+D++
Sbjct: 108  CPFWPDDGMCRLRDCSVCECPENEFPEPFKKPFVPGLPSDDLMCQEGKPQGAVDRTIDNR 167

Query: 1240 AFRGWMEVDNPWTNDDETDNSEMTYVNLQLNPERYTGYTGPSARRIWDAIYSENCPKYTS 1061
            AFRGW+E  NPWT+DD+TD+ EMTYVNLQLNPERYTGYTGPSARRIWD+IYSENCPKY+S
Sbjct: 168  AFRGWVETKNPWTHDDDTDSGEMTYVNLQLNPERYTGYTGPSARRIWDSIYSENCPKYSS 227

Query: 1060 GEICQEKRVLYKXXXXXXXXXXXXIAAEYLLDEAKNQWGRNLELMYDRVLRYPDRVRNLY 881
            GE C EK+VLYK            IA++YLLDE+ NQWG+N+ELMYDR+LR+PDRVRN+Y
Sbjct: 228  GETCPEKKVLYKLISGLHSSISMHIASDYLLDESSNQWGQNIELMYDRILRHPDRVRNMY 287

Query: 880  FTFMFVLRAVTKAANYLEQAEYNTGNLEEDLKAQSLVRQLLYNPKLQAACPLPFDEAKLW 701
            FT++FVLRAVTKA  YLEQAEY+TGN  EDLK QSL++QLLY+PKLQ ACP+PFDEAKLW
Sbjct: 288  FTYLFVLRAVTKATAYLEQAEYDTGNHAEDLKTQSLIKQLLYSPKLQTACPVPFDEAKLW 347

Query: 700  QGQSGPELKQEIQKNFRNISAVMDCVGCEKCRLWGKLQVLGLGTALKILFSVDSNNHPDT 521
            QGQSGPELKQ+IQK FRNISA+MDCVGCEKCRLWGKLQV GLGTALKILFSV + +  D 
Sbjct: 348  QGQSGPELKQQIQKQFRNISALMDCVGCEKCRLWGKLQVQGLGTALKILFSVGNQDIGDQ 407

Query: 520  PLQLQRNEVIALVNLLNRLSESVKLVHEIGPSVEKTMEEFTSEPPTREMSLPQRAWE 350
             LQLQRNEVIALVNLLNRLSESVK+VH++GP VE+ ME+  ++   +   L +R W+
Sbjct: 408  TLQLQRNEVIALVNLLNRLSESVKMVHDMGPDVERLMEDQIAKVSAKPGRL-RRIWD 463


>ref|NP_177372.1| endoplasmic oxidoreductin-1 [Arabidopsis thaliana]
            gi|50400631|sp|Q9C7S7.1|ERO1_ARATH RecName:
            Full=Endoplasmic oxidoreductin-1; Flags: Precursor
            gi|12323665|gb|AAG51798.1|AC067754_14 disulfide bond
            formation protein, putative; 78451-75984 [Arabidopsis
            thaliana] gi|31711714|gb|AAP68213.1| At1g72280
            [Arabidopsis thaliana] gi|110743908|dbj|BAE99788.1| like
            disulfide bond formation protein [Arabidopsis thaliana]
            gi|332197177|gb|AEE35298.1| endoplasmic oxidoreductin-1
            [Arabidopsis thaliana]
          Length = 469

 Score =  663 bits (1710), Expect = 0.0
 Identities = 309/417 (74%), Positives = 358/417 (85%), Gaps = 1/417 (0%)
 Frame = -3

Query: 1597 DHKSCQCS-QGSRKYTGIVEDCCCDYETVDSLNGAVLHPLLQELVRTPFFRYFKVKLWCD 1421
            D  SC CS Q + KY G++EDCCCDYETVD+LN  VL+PLLQ+LV TPFFRY+KVKLWCD
Sbjct: 48   DRNSCSCSLQKTGKYKGMIEDCCCDYETVDNLNTEVLNPLLQDLVTTPFFRYYKVKLWCD 107

Query: 1420 CPFWPDDGMCKLRDCSVCECPDNEFPEPFKKPMQYGLSSDDLKCQEGKPQAAVDRTLDSK 1241
            CPFWPDDGMC+LRDCSVCECP+NEFPEPFKKP   GL SDDLKCQEGKPQ AVDRT+D++
Sbjct: 108  CPFWPDDGMCRLRDCSVCECPENEFPEPFKKPFVPGLPSDDLKCQEGKPQGAVDRTIDNR 167

Query: 1240 AFRGWMEVDNPWTNDDETDNSEMTYVNLQLNPERYTGYTGPSARRIWDAIYSENCPKYTS 1061
            AFRGW+E  NPWT+DD+TD+ EM+YVNLQLNPERYTGYTGPSARRIWD+IYSENCPKY+S
Sbjct: 168  AFRGWVETKNPWTHDDDTDSGEMSYVNLQLNPERYTGYTGPSARRIWDSIYSENCPKYSS 227

Query: 1060 GEICQEKRVLYKXXXXXXXXXXXXIAAEYLLDEAKNQWGRNLELMYDRVLRYPDRVRNLY 881
            GE C EK+VLYK            IAA+YLLDE++NQWG+N+ELMYDR+LR+PDRVRN+Y
Sbjct: 228  GETCPEKKVLYKLISGLHSSISMHIAADYLLDESRNQWGQNIELMYDRILRHPDRVRNMY 287

Query: 880  FTFMFVLRAVTKAANYLEQAEYNTGNLEEDLKAQSLVRQLLYNPKLQAACPLPFDEAKLW 701
            FT++FVLRAVTKA  YLEQAEY+TGN  EDLK QSL++QLLY+PKLQ ACP+PFDEAKLW
Sbjct: 288  FTYLFVLRAVTKATAYLEQAEYDTGNHAEDLKTQSLIKQLLYSPKLQTACPVPFDEAKLW 347

Query: 700  QGQSGPELKQEIQKNFRNISAVMDCVGCEKCRLWGKLQVLGLGTALKILFSVDSNNHPDT 521
            QGQSGPELKQ+IQK FRNISA+MDCVGCEKCRLWGKLQV GLGTALKILFSV + +  D 
Sbjct: 348  QGQSGPELKQQIQKQFRNISALMDCVGCEKCRLWGKLQVQGLGTALKILFSVGNQDIGDQ 407

Query: 520  PLQLQRNEVIALVNLLNRLSESVKLVHEIGPSVEKTMEEFTSEPPTREMSLPQRAWE 350
             LQLQRNEVIALVNLLNRLSESVK+VH++ P VE+ ME+  ++   +   L +R W+
Sbjct: 408  TLQLQRNEVIALVNLLNRLSESVKMVHDMSPDVERLMEDQIAKVSAKPARL-RRIWD 463

