BLASTX nr result

ID: Scutellaria23_contig00014572 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria23_contig00014572
         (2666 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002279210.1| PREDICTED: CBS domain-containing protein CBS...   551   e-154
ref|XP_002512622.1| conserved hypothetical protein [Ricinus comm...   530   e-148
ref|XP_002325465.1| predicted protein [Populus trichocarpa] gi|2...   526   e-147
ref|XP_003534522.1| PREDICTED: CBS domain-containing protein CBS...   519   e-144
ref|XP_002329005.1| predicted protein [Populus trichocarpa] gi|2...   518   e-144

>ref|XP_002279210.1| PREDICTED: CBS domain-containing protein CBSX6 [Vitis vinifera]
            gi|297735292|emb|CBI17654.3| unnamed protein product
            [Vitis vinifera]
          Length = 427

 Score =  551 bits (1421), Expect = e-154
 Identities = 285/418 (68%), Positives = 324/418 (77%)
 Frame = -2

Query: 2314 MASLFLYHVVGDLTVGKPELSEFPETETVAAAIRAIGDSTEGGIPVWKKRSQKSAFENAE 2135
            MAS+FLYHVVGDLTVGKPEL EFPETETV +AIR IG+S EG IP+WK+RS     E +E
Sbjct: 1    MASVFLYHVVGDLTVGKPELVEFPETETVESAIRVIGESAEGSIPIWKRRSHVGVMEKSE 60

Query: 2134 TRQQRFVGILNALDIVAFLAREECLADQDKAINTPVSEVVVSDSSALKVVDPGTRLIDAL 1955
             RQQRFVGILN+LDIVAFLAR+ CL DQ+KA+ TPVSEVVV ++S L+ VDP TRLIDAL
Sbjct: 61   MRQQRFVGILNSLDIVAFLARDACLVDQEKAMKTPVSEVVVPNNSLLREVDPATRLIDAL 120

Query: 1954 EMMKQGVKRLLVPKSSVWRGMSKRFSILYNGKWLKNIDPSGPGGSSKNMLANANWPTSSS 1775
            EMMK G+KRLLVPKS VW+GMSKRFSILYNGKWLKN+D S    SS N+  NAN P+SSS
Sbjct: 121  EMMKHGLKRLLVPKSVVWKGMSKRFSILYNGKWLKNLDASS---SSTNLAPNANRPSSSS 177

Query: 1774 TSSIQDKFCCLSREDVLRFXXXXXXXXXXXXXXXXXXXXAVNPNYRCIESWSPAIESTRK 1595
            T S ++KFCCLSREDV+RF                    A++PNY  IE+  PAI+ T+K
Sbjct: 178  TPSSRNKFCCLSREDVIRFVIGCLGALAPLPLSSISSLGAISPNYYSIEASFPAIQVTQK 237

Query: 1594 LPHDLCAVAVVEPTPEGQNKIIGEISSAKLWKCDYLAVAWALANLSAGQFVMGFEDNMTS 1415
            LP D  AVAVVE TP+GQ KIIGEIS+ KLWKCDYLA AWALANLSAGQFVMG EDN+TS
Sbjct: 238  LPQDPSAVAVVESTPDGQYKIIGEISACKLWKCDYLAAAWALANLSAGQFVMGVEDNVTS 297

Query: 1414 RSMPDILNSETGGETNLANGRGLNRQRTFSSRSIGFFSNSMNPNSGMGRSMYRGRSAPLI 1235
            RS+P    + TGGE N+ANG    RQR FSSRSIGFFSN  +P+ G  RSMYRGRSAPL 
Sbjct: 298  RSLPHFSVNLTGGENNMANGGASTRQRKFSSRSIGFFSNPASPSFGASRSMYRGRSAPLT 357

Query: 1234 CKATSSVAAVMAQMLSHRASHVWVTEDQNNDILVGVVGYTDIIAAVTVQPVATISENT 1061
            CK TSS+AAVMAQMLSHRA+HVWVTE ++ DILVGVVGY DI+AAV  QP   I   T
Sbjct: 358  CKVTSSLAAVMAQMLSHRATHVWVTEAESEDILVGVVGYADILAAVIKQPAPVIPSTT 415


>ref|XP_002512622.1| conserved hypothetical protein [Ricinus communis]
            gi|223548583|gb|EEF50074.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 432

 Score =  530 bits (1365), Expect = e-148
 Identities = 275/422 (65%), Positives = 326/422 (77%), Gaps = 5/422 (1%)
 Frame = -2

Query: 2314 MASLFLYHVVGDLTVGKPELSEFPETETVAAAIRAIGDSTEGGIPVWKKRSQKSAFENAE 2135
            MAS+FLYHVVGDLTVGKPE+ EF ETETV +AIRAIG+STE GIPVWK+RS  +  EN E
Sbjct: 1    MASVFLYHVVGDLTVGKPEMVEFCETETVESAIRAIGESTECGIPVWKRRSHVNMIENNE 60

Query: 2134 TRQQRFVGILNALDIVAFLAREECLADQDKAINTPVSEVVVSDSSALKVVDPGTRLIDAL 1955
             RQQRFVGILN+LDIVAFLA+ +CL DQ+KA+ TPVSEVVV D+S LK VDP TRLIDAL
Sbjct: 61   MRQQRFVGILNSLDIVAFLAKAQCLEDQEKAMKTPVSEVVVPDNSLLKQVDPATRLIDAL 120

Query: 1954 EMMKQGVKRLLVPKSSVWRGMSKRFSILYNGKWLKNIDPSGP-----GGSSKNMLANANW 1790
            EMMKQGVKRLLV K +VW+GMSKRFSILYNGKWLKN+D S       G SS N+++N + 
Sbjct: 121  EMMKQGVKRLLVSKGTVWKGMSKRFSILYNGKWLKNVDSSNSSNNLMGNSSNNLMSNISR 180

Query: 1789 PTSSSTSSIQDKFCCLSREDVLRFXXXXXXXXXXXXXXXXXXXXAVNPNYRCIESWSPAI 1610
            P+SSST+  +DKFCCLSREDV+RF                    A+N NY  +E+   AI
Sbjct: 181  PSSSSTTISRDKFCCLSREDVIRFIIGCLGALAPLPLSSISSLGAINFNYYSVEASLSAI 240

Query: 1609 ESTRKLPHDLCAVAVVEPTPEGQNKIIGEISSAKLWKCDYLAVAWALANLSAGQFVMGFE 1430
            E+T+KLP D CAVAVVEP P+G +KIIGEIS+++LWKCDYLA AWALANLS+GQFVMG E
Sbjct: 241  EATQKLPKDPCAVAVVEPMPDGHSKIIGEISASRLWKCDYLAAAWALANLSSGQFVMGVE 300

Query: 1429 DNMTSRSMPDILNSETGGETNLANGRGLNRQRTFSSRSIGFFSNSMNPNSGMGRSMYRGR 1250
            DN+T+RS+PD   + T  + N ANG G  R + FSS+SIGF  N  + + G+GRSMYRGR
Sbjct: 301  DNLTARSLPDFTVNSTVSDNNTANGGGSTRPKKFSSKSIGF--NPGSSSFGIGRSMYRGR 358

Query: 1249 SAPLICKATSSVAAVMAQMLSHRASHVWVTEDQNNDILVGVVGYTDIIAAVTVQPVATIS 1070
            SAPL CK TSS+AAVMAQMLSHRA+HVWVTE  N+++LVGVVGY DI+ AVT  P + I 
Sbjct: 359  SAPLTCKITSSLAAVMAQMLSHRATHVWVTEADNDEVLVGVVGYADILFAVTKPPASFIP 418

Query: 1069 EN 1064
             N
Sbjct: 419  IN 420


>ref|XP_002325465.1| predicted protein [Populus trichocarpa] gi|222862340|gb|EEE99846.1|
            predicted protein [Populus trichocarpa]
          Length = 422

 Score =  526 bits (1356), Expect = e-147
 Identities = 278/426 (65%), Positives = 322/426 (75%), Gaps = 2/426 (0%)
 Frame = -2

Query: 2314 MASLFLYHVVGDLTVGKPELSEFPETETVAAAIRAIGDSTEGGIPVWKKRSQKSAFENAE 2135
            M S+FLYHVVGDLTVGKPE+ EF ETETV +AIRAIG+STE GIPVWK++S     EN+E
Sbjct: 1    MPSVFLYHVVGDLTVGKPEMVEFYETETVESAIRAIGESTECGIPVWKRKSHVGMIENSE 60

Query: 2134 TRQQRFVGILNALDIVAFLAREECLADQDKAINTPVSEVVVSDSSALKVVDPGTRLIDAL 1955
            TR QRFVGILN+LDIVAFLA  ECL D+DKAI TPVS+VVV ++S LK VDP TRLIDAL
Sbjct: 61   TRLQRFVGILNSLDIVAFLASTECLEDRDKAIKTPVSQVVVPNTSLLKQVDPATRLIDAL 120

Query: 1954 EMMKQGVKRLLVPKSSVWRGMSKRFSILYNGKWLKNIDPSGPGGSSKNMLANANWPTSSS 1775
            EMMKQGV+RL+VPKS  W+GMSKRFSILYNGKWLKN D S    S+ N+  N N P+SSS
Sbjct: 121  EMMKQGVRRLIVPKSMGWKGMSKRFSILYNGKWLKNADTSN-SSSNNNLTINPNRPSSSS 179

Query: 1774 TSSIQDKFCCLSREDVLRFXXXXXXXXXXXXXXXXXXXXAVNPNYRCIESWSPAIESTRK 1595
             +S +DKFCCLSREDV+RF                    A+N NY  +E+  PAIE+TRK
Sbjct: 180  GTSNRDKFCCLSREDVIRFLIGCLGALAPLPLSSISSLGAINTNYNSLEASLPAIEATRK 239

Query: 1594 LPHDLCAVAVVEPTPEGQNKIIGEISSAKLWKCDYLAVAWALANLSAGQFVMGFEDNMTS 1415
            LP D  A+AVVEP P GQ KIIGEIS+++LWKCDYLA AWALANLSAGQFVMG EDN+TS
Sbjct: 240  LPEDPSAIAVVEPIPNGQCKIIGEISASRLWKCDYLAAAWALANLSAGQFVMGVEDNVTS 299

Query: 1414 RSMPDILNSETGGETNLANGRGLNRQRTFSSRSIGFFSNSMNPNS--GMGRSMYRGRSAP 1241
            RS+PD   +    + N A+G G  R R FSSRSIGF     NP +  G+GRS+YRGRSAP
Sbjct: 300  RSLPDFAVNSAADDDNTAHGAGSTRLRKFSSRSIGF-----NPGNSIGIGRSVYRGRSAP 354

Query: 1240 LICKATSSVAAVMAQMLSHRASHVWVTEDQNNDILVGVVGYTDIIAAVTVQPVATISENT 1061
            L CK TSS+AAVMAQMLSHRA+HVWV ED ++DILVGVVGY DI+AAVT QP +    N 
Sbjct: 355  LTCKITSSLAAVMAQMLSHRATHVWVIEDHSDDILVGVVGYADILAAVTKQPASVTHVNR 414

Query: 1060 QPS*AT 1043
              + AT
Sbjct: 415  PEAFAT 420


>ref|XP_003534522.1| PREDICTED: CBS domain-containing protein CBSX6-like [Glycine max]
          Length = 425

 Score =  519 bits (1337), Expect = e-144
 Identities = 269/417 (64%), Positives = 321/417 (76%)
 Frame = -2

Query: 2314 MASLFLYHVVGDLTVGKPELSEFPETETVAAAIRAIGDSTEGGIPVWKKRSQKSAFENAE 2135
            MAS+F+YHVVGDLTVGKPEL+EF E+ETV +AIRAIG+  EG IP+WKKRSQ    EN++
Sbjct: 1    MASVFVYHVVGDLTVGKPELAEFHESETVESAIRAIGECHEGTIPIWKKRSQLG-IENSD 59

Query: 2134 TRQQRFVGILNALDIVAFLAREECLADQDKAINTPVSEVVVSDSSALKVVDPGTRLIDAL 1955
             RQQRFVGIL++ DIVAFLA+ +CL DQDKA+ TPVSEVVV ++S L+VVDP TRLIDAL
Sbjct: 60   MRQQRFVGILSSFDIVAFLAKSQCLEDQDKALKTPVSEVVVHNNSLLRVVDPATRLIDAL 119

Query: 1954 EMMKQGVKRLLVPKSSVWRGMSKRFSILYNGKWLKNIDPSGPGGSSKNMLANANWPTSSS 1775
            +MMKQGVKRLLVPKS  W+GMSKRFS++Y GKWLKN +   PG SS N+  N N   S+S
Sbjct: 120  DMMKQGVKRLLVPKSVAWKGMSKRFSVIYYGKWLKNSE--SPGNSSNNLPLNMNRSPSTS 177

Query: 1774 TSSIQDKFCCLSREDVLRFXXXXXXXXXXXXXXXXXXXXAVNPNYRCIESWSPAIESTRK 1595
             + I+D++CCLSREDVLRF                    A+N NY  IES +PAIE+T+K
Sbjct: 178  ITPIRDRYCCLSREDVLRFIIGCLGALAPLPLTSIASLGAINSNYNYIESSTPAIEATQK 237

Query: 1594 LPHDLCAVAVVEPTPEGQNKIIGEISSAKLWKCDYLAVAWALANLSAGQFVMGFEDNMTS 1415
            LP D  AVAV+E T +GQ KIIGEIS+ KLWKCDYL+ AWALANLSAGQFVMG EDN+T 
Sbjct: 238  LPQDPSAVAVIESTSDGQCKIIGEISACKLWKCDYLSAAWALANLSAGQFVMGVEDNVTP 297

Query: 1414 RSMPDILNSETGGETNLANGRGLNRQRTFSSRSIGFFSNSMNPNSGMGRSMYRGRSAPLI 1235
            RS+P        GE +LANG G  + R FSSRS+GFFSN+ + + G  RSMYRGRSAPL 
Sbjct: 298  RSLPQFSLDLASGENDLANGGGSRKPRKFSSRSVGFFSNTASHSFG-SRSMYRGRSAPLT 356

Query: 1234 CKATSSVAAVMAQMLSHRASHVWVTEDQNNDILVGVVGYTDIIAAVTVQPVATISEN 1064
            CK TSS+AAV+AQMLSHRA+HVWVTED+N+++LVGVVGY DI+AAVT  P A I  N
Sbjct: 357  CKITSSLAAVLAQMLSHRATHVWVTEDENDEVLVGVVGYADILAAVTKPPTAFIPAN 413


>ref|XP_002329005.1| predicted protein [Populus trichocarpa] gi|222839239|gb|EEE77590.1|
            predicted protein [Populus trichocarpa]
          Length = 424

 Score =  518 bits (1334), Expect = e-144
 Identities = 275/414 (66%), Positives = 314/414 (75%), Gaps = 2/414 (0%)
 Frame = -2

Query: 2314 MASLFLYHVVGDLTVGKPELSEFPETETVAAAIRAIGDSTEGGIPVWKKRSQKSAFENAE 2135
            MAS+FLYHVVGDLTVGKPE+ EF ETETV +AIRAIG+STE GIPVWK++S  S  E +E
Sbjct: 1    MASVFLYHVVGDLTVGKPEMVEFYETETVESAIRAIGESTECGIPVWKRKSHVSMIETSE 60

Query: 2134 TRQQRFVGILNALDIVAFLAREECLADQDKAINTPVSEVVVSDSSALKVVDPGTRLIDAL 1955
             RQQRFVGILN+LDIVAFLA  ECL DQDKAI T VS+VVV ++S LK VDP TRLIDAL
Sbjct: 61   MRQQRFVGILNSLDIVAFLASTECLEDQDKAIKTSVSQVVVPNASLLKQVDPATRLIDAL 120

Query: 1954 EMMKQGVKRLLVPKSSVWRGMSKRFSILYNGKWLKNIDPSGPGGSSKNMLANANWPTSSS 1775
            EMMKQGV+RLLVPKS VW+GMSKRFS LYNGKWLKN D S    S+ N+  N N P+SSS
Sbjct: 121  EMMKQGVRRLLVPKSMVWKGMSKRFSFLYNGKWLKNADASN-NSSNNNLTINTNRPSSSS 179

Query: 1774 TSSIQDKFCCLSREDVLRFXXXXXXXXXXXXXXXXXXXXAVNPNYRCIESWSPAIESTRK 1595
             +S ++KFCCLSREDV+RF                     +NPNY  +E+  PA E+TRK
Sbjct: 180  GTSNRNKFCCLSREDVIRFLIGCLGALAPLPLSSISSLGVINPNYTSVEASLPAFEATRK 239

Query: 1594 LPHDLCAVAVVEPTPEGQNKIIGEISSAKLWKCDYLAVAWALANLSAGQFVMGFEDNMTS 1415
            L  D   VAVVEP P+GQ KIIGEIS+++LWKCDYLA AWALANLSAGQFVMG EDN T+
Sbjct: 240  LHGDPSEVAVVEPIPDGQCKIIGEISASRLWKCDYLAAAWALANLSAGQFVMGVEDNETA 299

Query: 1414 RSMPDILNSETGGETNLANGRGLNRQRTFSSRSIGFFSNSMNPNSG--MGRSMYRGRSAP 1241
            RS+ D   +   G+ + ANG G  R R FSSRSIGF     NP S   MGRSMYRGRSAP
Sbjct: 300  RSLLDFAVNSAVGDESTANGIGSTRLREFSSRSIGF-----NPGSSIRMGRSMYRGRSAP 354

Query: 1240 LICKATSSVAAVMAQMLSHRASHVWVTEDQNNDILVGVVGYTDIIAAVTVQPVA 1079
            L CK TSS+AAVMAQMLSHRA+HVWV ED ++DILVGVVGY DI+AAVT QP +
Sbjct: 355  LTCKITSSLAAVMAQMLSHRATHVWVIEDDSDDILVGVVGYADILAAVTKQPAS 408


Top