BLASTX nr result

ID: Scutellaria24_contig00003699

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria24_contig00003699
         (2304 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002280516.1| PREDICTED: pentatricopeptide repeat-containi...   391   e-128
ref|XP_002515341.1| pentatricopeptide repeat-containing protein,...   380   e-122
ref|XP_003545545.1| PREDICTED: pentatricopeptide repeat-containi...   369   e-116
ref|XP_002330760.1| predicted protein [Populus trichocarpa] gi|2...   357   e-115
emb|CBI20759.3| unnamed protein product [Vitis vinifera]              348   e-115

>ref|XP_002280516.1| PREDICTED: pentatricopeptide repeat-containing protein At5g10690-like
            [Vitis vinifera]
          Length = 609

 Score =  391 bits (1005), Expect(2) = e-128
 Identities = 205/344 (59%), Positives = 254/344 (73%), Gaps = 1/344 (0%)
 Frame = -3

Query: 2050 KDKAQNNNTFDVSPDLITYTTLLQGFGQAKDTSSVEKILTEMKSCNKFGIDRVAYTAITD 1871
            K+ AQ     D+ PD ITYTTLL+GFG AKD  SV+KI+ EMKS N   +DR AYTAI D
Sbjct: 262  KENAQKTIDVDLFPDAITYTTLLKGFGHAKDLLSVQKIVMEMKSSNNLFVDRTAYTAIVD 321

Query: 1870 ALLNCGCIRGALCVFGEIIKQAGRKPVLRPKPHLFLSLMRAFAARGDYVTVKRLHERMWF 1691
            ALLNCG  +GALC+FGEIIK+AG+   LRPKPHL++S+M A AARGDY  VK LH+RM  
Sbjct: 322  ALLNCGSSKGALCMFGEIIKRAGQNFNLRPKPHLYISMMSALAARGDYNLVKSLHKRMRP 381

Query: 1690 DSAGTISSAIQAESDHLLMEAALNEGQVDLAIKKLKNVIWKWRDISWNSRGGMVAVRLEA 1511
            DSAGTIS A+Q E+D LLMEAALN+GQVD A   L N+I +W+ I W SRGGMVAVRLEA
Sbjct: 382  DSAGTISPAVQIEADQLLMEAALNDGQVDAATHHLSNIITRWKGICWRSRGGMVAVRLEA 441

Query: 1510 LMGLNRSILSPRIIPQVSLGDAIEHIMIPFAQASPLQASLRLKQVVMRFYKDSVVPIIDE 1331
            L+G  RS+ SP ++PQVS  D IE+IM+PF +A PL A+L LK+VVMRFYKDSVVP+ID+
Sbjct: 442  LLGFTRSMFSPYLLPQVSPADPIENIMMPFEEARPLLATLDLKRVVMRFYKDSVVPVIDD 501

Query: 1330 WGGCVGILHREDCDVLNMPLAKLMRHXXXXXXXXXXXXXVIDLMLDKRYKMVVIVKYDEF 1151
            WG CVG+LHREDC  L+ P++ +MR              V DL+L+KRYKMVV+VKY   
Sbjct: 502  WGSCVGLLHREDCRELDAPVSTMMRSPPPCVTTTTSIGRVADLILEKRYKMVVVVKYSNL 561

Query: 1150 HGTSV-SSVRAVGVFTYEQLGKLTKNSSSTTDEQFCLSKR*LKE 1022
            +G+S  SS+RAVGVFT EQL KL   +S    ++F + KR +++
Sbjct: 562  YGSSYSSSLRAVGVFTSEQLFKLA-IASEMPGQEFPVEKRTMQQ 604



 Score = 94.7 bits (234), Expect(2) = e-128
 Identities = 44/60 (73%), Positives = 50/60 (83%)
 Frame = -2

Query: 2303 FNLLMKGYITAGCPQAALGVHDEILRHGLNPDRLSYNTLIFACIKNENLDRAMLLFQQMK 2124
            +NLLMKGYITAG P AAL VHDEIL+ GL PDRL+YNTLIFAC+K E LD AM  F++MK
Sbjct: 203  YNLLMKGYITAGFPMAALSVHDEILQQGLKPDRLTYNTLIFACVKTEKLDTAMRYFEEMK 262


>ref|XP_002515341.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223545285|gb|EEF46790.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 600

 Score =  380 bits (976), Expect(2) = e-122
 Identities = 189/337 (56%), Positives = 243/337 (72%), Gaps = 2/337 (0%)
 Frame = -3

Query: 2050 KDKAQNNNTFDVSPDLITYTTLLQGFGQAKDTSSVEKILTEMKSCNKFGIDRVAYTAITD 1871
            KD+A+  N  ++ PD++TYTTLL+GFG AKD  SV+ I+ EMK  +   IDR  +TA+ D
Sbjct: 255  KDEARQTNNVNLYPDVVTYTTLLKGFGNAKDLGSVKMIVLEMKLYHNLFIDRTGFTAMVD 314

Query: 1870 ALLNCGCIRGALCVFGEIIKQAGRKPVLRPKPHLFLSLMRAFAARGDYVTVKRLHERMWF 1691
            ALLN G I+GALC+FGEIIK+AG  P LRPKPHL+LS+MRAFA +GDY  VK LH+R+W 
Sbjct: 315  ALLNSGSIKGALCIFGEIIKRAGVNPDLRPKPHLYLSMMRAFAVQGDYSMVKNLHKRLWP 374

Query: 1690 DSAGTISSAIQAESDHLLMEAALNEGQVDLAIKKLKNVIWKWRDISWNSRGGMVAVRLEA 1511
            DS GTIS AIQ E+DHLLMEAALN GQVD A++ L+ +I +W  ISW +RGGMVAVR+EA
Sbjct: 375  DSTGTISPAIQQEADHLLMEAALNGGQVDAALENLRKIIPRWNGISWTNRGGMVAVRIEA 434

Query: 1510 LMGLNRSILSPRIIPQVSLGDAIEHIMIPFAQASPLQASLRLKQVVMRFYKDSVVPIIDE 1331
            L+G  +SI SP ++PQ+S  + IE IM P  +A PL  +L LK+VVMRF++D VVPI+D+
Sbjct: 435  LLGFRKSIFSPYLLPQISSSEPIETIMTPLEEAQPLLGTLELKKVVMRFFRDEVVPIMDD 494

Query: 1330 WGGCVGILHREDCDVLNMPLAKLMRHXXXXXXXXXXXXXVIDLMLDKRYKMVVIVKYDEF 1151
            WG C+G+LHREDC  LN  LA +MR              V+DL+LDK+Y+MVV++KY   
Sbjct: 495  WGNCIGLLHREDCTELNATLATMMRSPPPCVTTMTSIGHVVDLILDKKYRMVVVIKYSNL 554

Query: 1150 HGTSVSSVRAVGVFTYEQLGKLTKNSSS--TTDEQFC 1046
               + SS RAVGVFT E+L KL K  S     ++ FC
Sbjct: 555  DSITYSSSRAVGVFTAEKLYKLAKPVSELFVREQGFC 591



 Score = 87.8 bits (216), Expect(2) = e-122
 Identities = 39/60 (65%), Positives = 51/60 (85%)
 Frame = -2

Query: 2303 FNLLMKGYITAGCPQAALGVHDEILRHGLNPDRLSYNTLIFACIKNENLDRAMLLFQQMK 2124
            +NLLMKGYI AGCPQAA+ + +EIL+ GL PDRL+YNTLI AC+K+++LD AM  F++MK
Sbjct: 196  YNLLMKGYINAGCPQAAVAMRNEILQLGLTPDRLTYNTLILACVKSKSLDAAMSFFEEMK 255


>ref|XP_003545545.1| PREDICTED: pentatricopeptide repeat-containing protein At5g10690-like
            [Glycine max]
          Length = 590

 Score =  369 bits (946), Expect(2) = e-116
 Identities = 178/321 (55%), Positives = 234/321 (72%)
 Frame = -3

Query: 2050 KDKAQNNNTFDVSPDLITYTTLLQGFGQAKDTSSVEKILTEMKSCNKFGIDRVAYTAITD 1871
            K KAQ  +  D+ PD++TYTT+L+GFGQ KD ++V KI+ EMKS  +  IDR AYTAI D
Sbjct: 257  KGKAQKFSNHDLFPDIVTYTTMLKGFGQTKDLATVLKIVLEMKSHRELYIDRTAYTAIID 316

Query: 1870 ALLNCGCIRGALCVFGEIIKQAGRKPVLRPKPHLFLSLMRAFAARGDYVTVKRLHERMWF 1691
            A L CG ++GALC+FGEI+KQ G  P L+PKPHL+LSLMRAFA  GDY  VK+LH+R+W 
Sbjct: 317  AFLKCGSVKGALCIFGEILKQTGLNPELKPKPHLYLSLMRAFAFLGDYYLVKKLHKRIWP 376

Query: 1690 DSAGTISSAIQAESDHLLMEAALNEGQVDLAIKKLKNVIWKWRDISWNSRGGMVAVRLEA 1511
            DSAGTI    Q E+DHLLMEAALN GQV++A+K L  ++ KW+ ISW SRGGMVA R+EA
Sbjct: 377  DSAGTILLVAQEEADHLLMEAALNAGQVNVAVKTLTEIVSKWKGISWTSRGGMVAYRIEA 436

Query: 1510 LMGLNRSILSPRIIPQVSLGDAIEHIMIPFAQASPLQASLRLKQVVMRFYKDSVVPIIDE 1331
            L+G ++S+ SP ++PQVS  + +E+ MI F    PLQ S++L++VVMRF+ ++VVPI+DE
Sbjct: 437  LLGFSKSLFSPHLLPQVSPSEPVENYMIQFEATRPLQGSIKLRKVVMRFFYEAVVPIVDE 496

Query: 1330 WGGCVGILHREDCDVLNMPLAKLMRHXXXXXXXXXXXXXVIDLMLDKRYKMVVIVKYDEF 1151
            WG C G+LHREDC  L+ PL  +MR              V+DL+L+KRY M+++V Y   
Sbjct: 497  WGSCTGLLHREDCIELDAPLTTMMRSPPPTVTTSTSIGHVVDLILEKRYPMIIVVNYRNS 556

Query: 1150 HGTSVSSVRAVGVFTYEQLGK 1088
            + T+  S RAVGVFT EQL +
Sbjct: 557  YATTPYSSRAVGVFTSEQLSR 577



 Score = 79.7 bits (195), Expect(2) = e-116
 Identities = 34/60 (56%), Positives = 47/60 (78%)
 Frame = -2

Query: 2303 FNLLMKGYITAGCPQAALGVHDEILRHGLNPDRLSYNTLIFACIKNENLDRAMLLFQQMK 2124
            +N+LMKGYI +GCP  A+ + +EILR G+ PDRL+YNTLI AC+++  LD AM  F++MK
Sbjct: 198  YNILMKGYINSGCPHTAINMLNEILRQGIMPDRLTYNTLILACVQSGKLDAAMQFFEEMK 257


>ref|XP_002330760.1| predicted protein [Populus trichocarpa] gi|222872562|gb|EEF09693.1|
            predicted protein [Populus trichocarpa]
          Length = 607

 Score =  357 bits (917), Expect(2) = e-115
 Identities = 187/347 (53%), Positives = 240/347 (69%), Gaps = 8/347 (2%)
 Frame = -3

Query: 2050 KDKAQNNNTFDVSPDLITYTTLLQGFGQAKDTSSVEKILTEMKSCNKFGIDRVAYTAITD 1871
            KDKAQN +   + PD++TYTTLLQGFG AKD  SV KI+ EMK      IDR A+TA+ D
Sbjct: 254  KDKAQNFSRDKLYPDVVTYTTLLQGFGGAKDLLSVLKIVYEMKMHRNLVIDRTAFTAMVD 313

Query: 1870 ALLNCGCIRGALCVFGEIIKQAGRKPVLRPKPHLFLSLMRAFAARGDYVTVKRLHERMWF 1691
            ALLNCG + GA+CVFGEIIK+AG  P LRPKPHL+LSLMRAFA++GDY  VK LH+R+W 
Sbjct: 314  ALLNCGSMNGAVCVFGEIIKRAGVNPKLRPKPHLYLSLMRAFASQGDYNMVKNLHKRLWP 373

Query: 1690 DSAGTISSAIQAESDHLLMEAALNEGQVDLAIKKLKNVIWKWRDISWN-------SRGGM 1532
            DS+G IS A+Q E+DHLLMEAALN+GQV++A++ L NV+ KW+ I W        + G +
Sbjct: 374  DSSGAISLALQEEADHLLMEAALNDGQVNVALENLTNVVLKWKRIPWTIVPNPQFTCGML 433

Query: 1531 VAVRLEALMGLNRSILSPRIIPQVSLGDAIEHIMIPFAQASPLQASLRLKQVVMRFYKDS 1352
            VA+R+E L+G   SI SP ++PQVS  + IE IM+P   A PL  +L LK+VVMRF+ D 
Sbjct: 434  VAMRIEVLLGFTNSIFSPYLLPQVSPSEPIESIMMPLKAAKPLLGTLHLKKVVMRFFWDQ 493

Query: 1351 VVPIIDEWGGCVGILHREDCDVLNMPLAKLMRHXXXXXXXXXXXXXVIDLMLDKRYKMVV 1172
            VVPI+D+WG CVG+LHREDC  LN PL  +MR              V+DL+L+K Y+MVV
Sbjct: 494  VVPIVDDWGSCVGLLHREDCTELNAPLMTMMRSPPPCVTTTTSIGHVVDLILEKMYRMVV 553

Query: 1171 IVKYDEFH-GTSVSSVRAVGVFTYEQLGKLTKNSSSTTDEQFCLSKR 1034
            +VKY   +  T+ S  + VGVFT EQL KL        +++  L +R
Sbjct: 554  VVKYSNLNSSTNSSGSKTVGVFTTEQLFKLVVPVQRPLEQERTLGRR 600



 Score = 87.8 bits (216), Expect(2) = e-115
 Identities = 42/60 (70%), Positives = 47/60 (78%)
 Frame = -2

Query: 2303 FNLLMKGYITAGCPQAALGVHDEILRHGLNPDRLSYNTLIFACIKNENLDRAMLLFQQMK 2124
            +NLLMKGYI+AGCPQ AL VHDEIL  GL PDRL+YNTLI AC+K   LD AM  F +MK
Sbjct: 195  YNLLMKGYISAGCPQDALPVHDEILELGLTPDRLTYNTLISACVKAGKLDAAMQFFDEMK 254


>emb|CBI20759.3| unnamed protein product [Vitis vinifera]
          Length = 570

 Score =  348 bits (894), Expect(2) = e-115
 Identities = 177/288 (61%), Positives = 215/288 (74%)
 Frame = -3

Query: 2050 KDKAQNNNTFDVSPDLITYTTLLQGFGQAKDTSSVEKILTEMKSCNKFGIDRVAYTAITD 1871
            K+ AQ     D+ PD ITYTTLL+GFG AKD  SV+KI+ EMKS N   +DR AYTAI D
Sbjct: 279  KENAQKTIDVDLFPDAITYTTLLKGFGHAKDLLSVQKIVMEMKSSNNLFVDRTAYTAIVD 338

Query: 1870 ALLNCGCIRGALCVFGEIIKQAGRKPVLRPKPHLFLSLMRAFAARGDYVTVKRLHERMWF 1691
            ALLNCG  +GALC+FGEIIK+AG+   LRPKPHL++S+M A AARGDY  VK LH+RM  
Sbjct: 339  ALLNCGSSKGALCMFGEIIKRAGQNFNLRPKPHLYISMMSALAARGDYNLVKSLHKRMRP 398

Query: 1690 DSAGTISSAIQAESDHLLMEAALNEGQVDLAIKKLKNVIWKWRDISWNSRGGMVAVRLEA 1511
            DSAGTIS A+Q E+D LLMEAALN+GQVD A   L N+I +W+ I W SRGGMVAVRLEA
Sbjct: 399  DSAGTISPAVQIEADQLLMEAALNDGQVDAATHHLSNIITRWKGICWRSRGGMVAVRLEA 458

Query: 1510 LMGLNRSILSPRIIPQVSLGDAIEHIMIPFAQASPLQASLRLKQVVMRFYKDSVVPIIDE 1331
            L+G  RS+ SP ++PQVS  D IE+IM+PF +A PL A+L LK+VVMRFYKDSVVP+ID+
Sbjct: 459  LLGFTRSMFSPYLLPQVSPADPIENIMMPFEEARPLLATLDLKRVVMRFYKDSVVPVIDD 518

Query: 1330 WGGCVGILHREDCDVLNMPLAKLMRHXXXXXXXXXXXXXVIDLMLDKR 1187
            WG CVG+LHREDC  L+ P++ +MR              V DL+L+KR
Sbjct: 519  WGSCVGLLHREDCRELDAPVSTMMRSPPPCVTTTTSIGRVADLILEKR 566



 Score = 94.7 bits (234), Expect(2) = e-115
 Identities = 44/60 (73%), Positives = 50/60 (83%)
 Frame = -2

Query: 2303 FNLLMKGYITAGCPQAALGVHDEILRHGLNPDRLSYNTLIFACIKNENLDRAMLLFQQMK 2124
            +NLLMKGYITAG P AAL VHDEIL+ GL PDRL+YNTLIFAC+K E LD AM  F++MK
Sbjct: 220  YNLLMKGYITAGFPMAALSVHDEILQQGLKPDRLTYNTLIFACVKTEKLDTAMRYFEEMK 279

