BLASTX nr result

ID: Scutellaria24_contig00007081

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria24_contig00007081
         (2079 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002272226.1| PREDICTED: pentatricopeptide repeat-containi...   886   0.0  
ref|XP_002306972.1| predicted protein [Populus trichocarpa] gi|2...   872   0.0  
ref|XP_002510663.1| pentatricopeptide repeat-containing protein,...   855   0.0  
ref|XP_002865541.1| pentatricopeptide repeat-containing protein ...   849   0.0  
ref|XP_004144287.1| PREDICTED: pentatricopeptide repeat-containi...   848   0.0  

>ref|XP_002272226.1| PREDICTED: pentatricopeptide repeat-containing protein At5g42310,
            mitochondrial [Vitis vinifera]
            gi|297745544|emb|CBI40709.3| unnamed protein product
            [Vitis vinifera]
          Length = 695

 Score =  886 bits (2290), Expect = 0.0
 Identities = 428/556 (76%), Positives = 489/556 (87%)
 Frame = +1

Query: 1    AYALVTWLQRHNLCFSYELLYSILINALGRDEKLYEAFLLSQRQSLTPLTYNALIGACAR 180
            AY+LVTWL+RHNLCFSYELLYSILI+ALGR EKLYEAFLLSQRQ+LTPLTYNALIGACAR
Sbjct: 137  AYSLVTWLERHNLCFSYELLYSILIHALGRSEKLYEAFLLSQRQTLTPLTYNALIGACAR 196

Query: 181  NNDLEKALNLMERMRRDGYQSDYVNYSLVIQSLMRNNNVDVAILEKLYGEMEADRIELDG 360
            N+DLEKALNLM RMRRDG+ SD+VNYS +IQSL R N  D ++L+K+Y E+E+D+IELDG
Sbjct: 197  NDDLEKALNLMSRMRRDGFPSDFVNYSFIIQSLTRTNKSDSSMLQKIYAEIESDKIELDG 256

Query: 361  QLFNDVIAGFTKAGDIDRALYFLGLMQGGGLTAKTSTVVSMVNELGNLGRVXXXXXXXXX 540
            QL ND+I GF K+GD++RA+ FL ++QG GL+ KT+T+V+++  LGN GR          
Sbjct: 257  QLLNDIIVGFAKSGDVNRAMSFLAMVQGNGLSPKTATLVAVITALGNAGRTEEAEAIFEE 316

Query: 541  XXXGGMRPRTRAYNALLKGYVKVGALKDAEYVVSEMEASGVSPDEQTYSLLMDAYGNAGR 720
               GG+ PRTRAYNALLKGYVK G+LKDAE +VSEME SG SPDE TYSLL+DAY NAGR
Sbjct: 317  LKEGGLMPRTRAYNALLKGYVKTGSLKDAESIVSEMERSGFSPDEHTYSLLIDAYANAGR 376

Query: 721  WESARIVLKEMEENNVKPNCYVFSRILASYRDRGEWQRSFQVLKEMKNCGVNPNLQFYNV 900
            WESARIVLKEME + V+PN YVFSRILASYRDRG+WQ+SFQVL+EM+N GV+P+  FYNV
Sbjct: 377  WESARIVLKEMEASGVRPNSYVFSRILASYRDRGKWQKSFQVLREMRNSGVSPDRHFYNV 436

Query: 901  MIDTFGKYNCLEHMMAALERMKVEGIEPDTVTWNTLIDCHCKQGHRDKAEQLFEEMQESG 1080
            MIDTFGK NCL+H +A  +RM++EG++PD VTWNTLIDCHCK GH +KAE+LFE MQESG
Sbjct: 437  MIDTFGKCNCLDHALATFDRMRMEGVQPDAVTWNTLIDCHCKSGHHNKAEELFEAMQESG 496

Query: 1081 CLPCTTTYNIMINSFGAQERWDDVKELLRKMQSQGLLPNVITYTTLVDIYGQSGRFNDAI 1260
            C PCTTTYNIMINSFG QERW+DVK LL KMQSQGLL NV+TYTTLVDIYGQSGRF DAI
Sbjct: 497  CSPCTTTYNIMINSFGEQERWEDVKTLLGKMQSQGLLANVVTYTTLVDIYGQSGRFKDAI 556

Query: 1261 ECLEAMKSAGLKPSSTMYNALINAYAQKGLSEQAVNAFRVMRGDGLKPSLLALNSLINAF 1440
            ECLE MKS GLKPSSTMYNALINAYAQ+GLSEQA+NAFRVMR DGLKPS+L LNSLINAF
Sbjct: 557  ECLEVMKSVGLKPSSTMYNALINAYAQRGLSEQAINAFRVMRADGLKPSVLVLNSLINAF 616

Query: 1441 GEDRRDAEAFAVLQYMRDNDLKPDVVTYTTLMKALIRVEKYEKVPAVFEEMLLSGCAPDR 1620
            GEDRRDAEAF+VLQYM++NDLKPDVVTYTTLMKALIRVEK++KVPAV+EEM LSGC PDR
Sbjct: 617  GEDRRDAEAFSVLQYMKENDLKPDVVTYTTLMKALIRVEKFDKVPAVYEEMTLSGCTPDR 676

Query: 1621 KARAMLRSALRYMKST 1668
            KARAMLRSALRYM+ T
Sbjct: 677  KARAMLRSALRYMERT 692


>ref|XP_002306972.1| predicted protein [Populus trichocarpa] gi|222856421|gb|EEE93968.1|
            predicted protein [Populus trichocarpa]
          Length = 709

 Score =  872 bits (2253), Expect = 0.0
 Identities = 425/559 (76%), Positives = 488/559 (87%)
 Frame = +1

Query: 1    AYALVTWLQRHNLCFSYELLYSILINALGRDEKLYEAFLLSQRQSLTPLTYNALIGACAR 180
            AYA+V WLQ+HNLCFSYELLYSILI+ALG+ EKLYEAFLLSQRQ+LTPLTYNALI ACAR
Sbjct: 151  AYAVVLWLQKHNLCFSYELLYSILIHALGQSEKLYEAFLLSQRQNLTPLTYNALISACAR 210

Query: 181  NNDLEKALNLMERMRRDGYQSDYVNYSLVIQSLMRNNNVDVAILEKLYGEMEADRIELDG 360
            NNDLEKALNL+ RMR+DGY SD+VNYSL+I+SLMR N VD AIL+KLY E+E D++ELD 
Sbjct: 211  NNDLEKALNLITRMRQDGYPSDFVNYSLIIRSLMRKNRVDSAILQKLYREIECDKLELDV 270

Query: 361  QLFNDVIAGFTKAGDIDRALYFLGLMQGGGLTAKTSTVVSMVNELGNLGRVXXXXXXXXX 540
            QL ND+I GF KAGD+ +AL FLG++QG GL+ KT+T+V+++  LGN GR          
Sbjct: 271  QLSNDIIVGFAKAGDLSKALEFLGVVQGSGLSVKTATLVAVIWALGNCGRTVEAEAIFEE 330

Query: 541  XXXGGMRPRTRAYNALLKGYVKVGALKDAEYVVSEMEASGVSPDEQTYSLLMDAYGNAGR 720
                G++PRTRAYNALL+GYVK G LKDAE+VVSEME SGVSP+EQTYS L+DAYGNAGR
Sbjct: 331  MRDNGLKPRTRAYNALLRGYVKAGLLKDAEFVVSEMERSGVSPNEQTYSFLIDAYGNAGR 390

Query: 721  WESARIVLKEMEENNVKPNCYVFSRILASYRDRGEWQRSFQVLKEMKNCGVNPNLQFYNV 900
            WESARIVLKEME +NV+PN YVFSRIL+SYRD+GEWQ+SFQVL+EM+N GV P+  FYNV
Sbjct: 391  WESARIVLKEMEASNVQPNAYVFSRILSSYRDKGEWQKSFQVLREMENSGVRPDRVFYNV 450

Query: 901  MIDTFGKYNCLEHMMAALERMKVEGIEPDTVTWNTLIDCHCKQGHRDKAEQLFEEMQESG 1080
            MIDTFGK+NCL+H MA  +RM  EGIEPDTVTWNTLIDCHC+ G  D+AE+LFEEM E G
Sbjct: 451  MIDTFGKFNCLDHAMATFDRMLSEGIEPDTVTWNTLIDCHCRAGKHDRAEELFEEMMEGG 510

Query: 1081 CLPCTTTYNIMINSFGAQERWDDVKELLRKMQSQGLLPNVITYTTLVDIYGQSGRFNDAI 1260
              PC TT+NIMINSFG QERWDDVK LL  M+SQGL+PN +TYTTL+DIYG+SGRFNDAI
Sbjct: 511  YSPCNTTFNIMINSFGDQERWDDVKNLLAHMRSQGLVPNSVTYTTLIDIYGKSGRFNDAI 570

Query: 1261 ECLEAMKSAGLKPSSTMYNALINAYAQKGLSEQAVNAFRVMRGDGLKPSLLALNSLINAF 1440
            ECL+ MK+AGLKPSSTMYNALINAYAQ+GLSEQAV+AFR MR DGLKPSLLALNSLINAF
Sbjct: 571  ECLDDMKAAGLKPSSTMYNALINAYAQRGLSEQAVSAFRAMRVDGLKPSLLALNSLINAF 630

Query: 1441 GEDRRDAEAFAVLQYMRDNDLKPDVVTYTTLMKALIRVEKYEKVPAVFEEMLLSGCAPDR 1620
            GEDRRDAEAF VLQYM++NDLKPDVVTYTTLMKALIRVEK++KVP+V+EEM+LSGC PDR
Sbjct: 631  GEDRRDAEAFTVLQYMKENDLKPDVVTYTTLMKALIRVEKFDKVPSVYEEMILSGCTPDR 690

Query: 1621 KARAMLRSALRYMKSTFNL 1677
            KARAMLRSAL+YMK T  L
Sbjct: 691  KARAMLRSALKYMKQTLEL 709


>ref|XP_002510663.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223551364|gb|EEF52850.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 695

 Score =  855 bits (2210), Expect = 0.0
 Identities = 412/559 (73%), Positives = 486/559 (86%)
 Frame = +1

Query: 1    AYALVTWLQRHNLCFSYELLYSILINALGRDEKLYEAFLLSQRQSLTPLTYNALIGACAR 180
            AYA+V+WLQ+HNLCFSYELLYSILI+ALGR EKLYEAFLLSQ+Q+L+PLTYNALI ACAR
Sbjct: 137  AYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFLLSQQQALSPLTYNALINACAR 196

Query: 181  NNDLEKALNLMERMRRDGYQSDYVNYSLVIQSLMRNNNVDVAILEKLYGEMEADRIELDG 360
            NNDLEKA+NL+ RMR+DGY SD+VNYSL+IQSL+R+N +D  IL+KLY E++ D++ELD 
Sbjct: 197  NNDLEKAINLISRMRQDGYPSDFVNYSLIIQSLVRSNRIDSPILQKLYSEIQCDKLELDV 256

Query: 361  QLFNDVIAGFTKAGDIDRALYFLGLMQGGGLTAKTSTVVSMVNELGNLGRVXXXXXXXXX 540
            QL ND+I GF KAGD ++A+ FLG++Q  GL+ +T+T++++++ LG+ GR+         
Sbjct: 257  QLSNDIIVGFAKAGDPNKAMEFLGMVQASGLSPRTATLIAVISALGDSGRIIEAEAIFEE 316

Query: 541  XXXGGMRPRTRAYNALLKGYVKVGALKDAEYVVSEMEASGVSPDEQTYSLLMDAYGNAGR 720
                G++P+TRAYN LLKGYVK G LKDAE++VSEME SGVSPDE TYSLL+DAY NAGR
Sbjct: 317  MKDNGLKPKTRAYNGLLKGYVKAGMLKDAEFIVSEMERSGVSPDECTYSLLIDAYSNAGR 376

Query: 721  WESARIVLKEMEENNVKPNCYVFSRILASYRDRGEWQRSFQVLKEMKNCGVNPNLQFYNV 900
            WESARIVLKEME NN+ PN YVFSRILASYRDRGEWQ+SFQVLKEMKN GV P+  FYNV
Sbjct: 377  WESARIVLKEMEANNIMPNSYVFSRILASYRDRGEWQKSFQVLKEMKNSGVRPDRHFYNV 436

Query: 901  MIDTFGKYNCLEHMMAALERMKVEGIEPDTVTWNTLIDCHCKQGHRDKAEQLFEEMQESG 1080
            MIDTFGK++CL+H M   ++M  EGI+PDTVTWNTLIDCHCK    ++AE+LFEEM E G
Sbjct: 437  MIDTFGKFSCLDHAMDTFDKMLSEGIQPDTVTWNTLIDCHCKAELHERAEELFEEMMEKG 496

Query: 1081 CLPCTTTYNIMINSFGAQERWDDVKELLRKMQSQGLLPNVITYTTLVDIYGQSGRFNDAI 1260
              PC TT+NIMINSFG QERWDDVK L+  M+S GLLPNV+TYTTL+DIYG+SGRF+DAI
Sbjct: 497  FSPCVTTFNIMINSFGEQERWDDVKTLMGNMRSLGLLPNVVTYTTLIDIYGKSGRFSDAI 556

Query: 1261 ECLEAMKSAGLKPSSTMYNALINAYAQKGLSEQAVNAFRVMRGDGLKPSLLALNSLINAF 1440
            ECLE MKSAGLKPSSTMYNALINAYAQKGLSEQAVNAFR+MR D LKPSLLALNSLINAF
Sbjct: 557  ECLEDMKSAGLKPSSTMYNALINAYAQKGLSEQAVNAFRLMRADSLKPSLLALNSLINAF 616

Query: 1441 GEDRRDAEAFAVLQYMRDNDLKPDVVTYTTLMKALIRVEKYEKVPAVFEEMLLSGCAPDR 1620
            GEDRRDAEAF+VL+YM++NDLKPDVVTYTTLMKALIRV+K+ KVP+V+EEM+L+GC PDR
Sbjct: 617  GEDRRDAEAFSVLKYMKENDLKPDVVTYTTLMKALIRVDKFNKVPSVYEEMILAGCTPDR 676

Query: 1621 KARAMLRSALRYMKSTFNL 1677
            KARAMLRSAL+YMK T NL
Sbjct: 677  KARAMLRSALKYMKQTLNL 695


>ref|XP_002865541.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297311376|gb|EFH41800.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 711

 Score =  849 bits (2194), Expect = 0.0
 Identities = 413/556 (74%), Positives = 478/556 (85%)
 Frame = +1

Query: 1    AYALVTWLQRHNLCFSYELLYSILINALGRDEKLYEAFLLSQRQSLTPLTYNALIGACAR 180
            AYA+V+WLQ+HNLCFSYELLYSILI+ALGR EKLYEAFLLSQ+Q+LTPLTYNALIGACAR
Sbjct: 152  AYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFLLSQKQTLTPLTYNALIGACAR 211

Query: 181  NNDLEKALNLMERMRRDGYQSDYVNYSLVIQSLMRNNNVDVAILEKLYGEMEADRIELDG 360
            NND+EKALNL+ RMR+DGYQSD+VNYSLVIQSL R N +D  +L++LY E+E D++ELD 
Sbjct: 212  NNDIEKALNLISRMRQDGYQSDFVNYSLVIQSLTRCNKIDSVMLQRLYKEIERDKLELDV 271

Query: 361  QLFNDVIAGFTKAGDIDRALYFLGLMQGGGLTAKTSTVVSMVNELGNLGRVXXXXXXXXX 540
            QL ND+I GF K+GD  RAL  LG+ Q  GL+AKT+T+VS+++ L N GR          
Sbjct: 272  QLVNDIIMGFAKSGDPSRALQLLGMAQATGLSAKTATLVSIISALANSGRTLEAEALFEE 331

Query: 541  XXXGGMRPRTRAYNALLKGYVKVGALKDAEYVVSEMEASGVSPDEQTYSLLMDAYGNAGR 720
                G++PRT+AYNALLKGYVK G LKDAE +VSEME  GVSPDE TYSLL+DAY NAGR
Sbjct: 332  LRQSGIKPRTKAYNALLKGYVKTGPLKDAELMVSEMEKRGVSPDEHTYSLLIDAYVNAGR 391

Query: 721  WESARIVLKEMEENNVKPNCYVFSRILASYRDRGEWQRSFQVLKEMKNCGVNPNLQFYNV 900
            WESARIVLKEME  +V+PN +VFSR+LA YRDRGEWQ++FQVLKEMK+ GV P+ QFYNV
Sbjct: 392  WESARIVLKEMETGDVQPNSFVFSRLLAGYRDRGEWQKTFQVLKEMKSIGVKPDRQFYNV 451

Query: 901  MIDTFGKYNCLEHMMAALERMKVEGIEPDTVTWNTLIDCHCKQGHRDKAEQLFEEMQESG 1080
            +IDTFGK+NCL+H M   +RM  EGIEPD VTWNTLIDCHCK G    AE++FE M+  G
Sbjct: 452  VIDTFGKFNCLDHAMTTFDRMLSEGIEPDRVTWNTLIDCHCKHGRHIVAEEMFEAMERRG 511

Query: 1081 CLPCTTTYNIMINSFGAQERWDDVKELLRKMQSQGLLPNVITYTTLVDIYGQSGRFNDAI 1260
            CLPC TTYNIMINS+G QERWDD+K LL KM+SQG+LPNV+T+TTLVD+YG+SGRFNDAI
Sbjct: 512  CLPCATTYNIMINSYGDQERWDDMKRLLGKMKSQGILPNVVTHTTLVDVYGKSGRFNDAI 571

Query: 1261 ECLEAMKSAGLKPSSTMYNALINAYAQKGLSEQAVNAFRVMRGDGLKPSLLALNSLINAF 1440
            ECLE MKS GLKPSSTMYNALINAYAQ+GLSEQAVNAFRVM  DGLKPSLLALNSLINAF
Sbjct: 572  ECLEEMKSVGLKPSSTMYNALINAYAQRGLSEQAVNAFRVMTSDGLKPSLLALNSLINAF 631

Query: 1441 GEDRRDAEAFAVLQYMRDNDLKPDVVTYTTLMKALIRVEKYEKVPAVFEEMLLSGCAPDR 1620
            GEDRRDAEAFAVLQYM++N +KPDVVTYTTLMKALIRV+K++KVP V+EEM++SGC PDR
Sbjct: 632  GEDRRDAEAFAVLQYMKENGVKPDVVTYTTLMKALIRVDKFQKVPGVYEEMIMSGCKPDR 691

Query: 1621 KARAMLRSALRYMKST 1668
            KAR+MLRSALRYMK T
Sbjct: 692  KARSMLRSALRYMKQT 707


>ref|XP_004144287.1| PREDICTED: pentatricopeptide repeat-containing protein At5g42310,
            mitochondrial-like [Cucumis sativus]
            gi|449489420|ref|XP_004158306.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g42310,
            mitochondrial-like [Cucumis sativus]
          Length = 720

 Score =  848 bits (2192), Expect = 0.0
 Identities = 408/558 (73%), Positives = 481/558 (86%)
 Frame = +1

Query: 4    YALVTWLQRHNLCFSYELLYSILINALGRDEKLYEAFLLSQRQSLTPLTYNALIGACARN 183
            YA+V+WLQRHNLCFSYELLYSILI+ALGR EKLYEAF+LSQ+Q+LTPLTYNALIGACARN
Sbjct: 163  YAVVSWLQRHNLCFSYELLYSILIHALGRSEKLYEAFILSQKQTLTPLTYNALIGACARN 222

Query: 184  NDLEKALNLMERMRRDGYQSDYVNYSLVIQSLMRNNNVDVAILEKLYGEMEADRIELDGQ 363
            NDLEKALNLM RMR+DG+QSD++NYSL+IQSL R N +D+ +L+KLY E+E+D+IELDG 
Sbjct: 223  NDLEKALNLMSRMRQDGFQSDFINYSLIIQSLTRTNKIDIPLLQKLYEEIESDKIELDGL 282

Query: 364  LFNDVIAGFTKAGDIDRALYFLGLMQGGGLTAKTSTVVSMVNELGNLGRVXXXXXXXXXX 543
            L ND+I GF KAGD +RALYFL ++Q  GL  KTST V++++ LGN GR           
Sbjct: 283  LLNDIILGFAKAGDPNRALYFLSMVQASGLNPKTSTFVAVISALGNHGRTEEAEAIFEEM 342

Query: 544  XXGGMRPRTRAYNALLKGYVKVGALKDAEYVVSEMEASGVSPDEQTYSLLMDAYGNAGRW 723
              GG++PR +A+NALLKGY + G+LK+AE ++SEME SG+SPDE TY LL+DAY N GRW
Sbjct: 343  KEGGLKPRIKAFNALLKGYARKGSLKEAESIISEMEKSGLSPDEHTYGLLVDAYANVGRW 402

Query: 724  ESARIVLKEMEENNVKPNCYVFSRILASYRDRGEWQRSFQVLKEMKNCGVNPNLQFYNVM 903
            ESAR +LK+ME  NV+PN ++FSRILASYRDRGEWQ++F+VL+EMKN  V P+  FYNVM
Sbjct: 403  ESARHLLKQMEARNVQPNTFIFSRILASYRDRGEWQKTFEVLREMKNSNVKPDRHFYNVM 462

Query: 904  IDTFGKYNCLEHMMAALERMKVEGIEPDTVTWNTLIDCHCKQGHRDKAEQLFEEMQESGC 1083
            IDTFGK+NCL+H M   +RM  EGIEPD VTWNTLIDCH K G+ D+A +LFEEMQE G 
Sbjct: 463  IDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHRKHGYHDRAAELFEEMQERGY 522

Query: 1084 LPCTTTYNIMINSFGAQERWDDVKELLRKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIE 1263
            LPC TTYNIMINS G QE+WD+VK LL KMQSQGLLPNV+TYTTLVDIYG SGRFNDAI+
Sbjct: 523  LPCPTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVVTYTTLVDIYGHSGRFNDAID 582

Query: 1264 CLEAMKSAGLKPSSTMYNALINAYAQKGLSEQAVNAFRVMRGDGLKPSLLALNSLINAFG 1443
            CLEAMKSAGLKPS+TMYNALINA+AQ+GLSEQAVNA+RVM  DGL+PSLLALNSLINAFG
Sbjct: 583  CLEAMKSAGLKPSATMYNALINAFAQRGLSEQAVNAYRVMISDGLRPSLLALNSLINAFG 642

Query: 1444 EDRRDAEAFAVLQYMRDNDLKPDVVTYTTLMKALIRVEKYEKVPAVFEEMLLSGCAPDRK 1623
            EDRRD EAF++LQYM++ND+KPDVVTYTTLMKALIRV+K++KVPAV+EEM+LSGC PD K
Sbjct: 643  EDRRDIEAFSILQYMKENDVKPDVVTYTTLMKALIRVDKFDKVPAVYEEMILSGCTPDGK 702

Query: 1624 ARAMLRSALRYMKSTFNL 1677
            ARAMLRSALRYMK T +L
Sbjct: 703  ARAMLRSALRYMKRTLSL 720

