BLASTX nr result

ID: Salvia21_contig00022109 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Salvia21_contig00022109
         (1601 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_195751.1| aprataxin [Arabidopsis thaliana] gi|75335734|sp...   513   e-143
ref|XP_002873016.1| basic helix-loop-helix family protein [Arabi...   513   e-143
ref|XP_002323598.1| predicted protein [Populus trichocarpa] gi|2...   512   e-143
ref|XP_002530499.1| aprataxin, putative [Ricinus communis] gi|22...   511   e-142
emb|CBI18255.3| unnamed protein product [Vitis vinifera]              508   e-141

>ref|NP_195751.1| aprataxin [Arabidopsis thaliana] gi|75335734|sp|Q9M041.1|BH140_ARATH
            RecName: Full=Transcription factor bHLH140; AltName:
            Full=Basic helix-loop-helix protein 140; Short=AtbHLH140;
            Short=bHLH 140; AltName: Full=Transcription factor EN
            122; AltName: Full=bHLH transcription factor bHLH140
            gi|7320709|emb|CAB81914.1| putative protein [Arabidopsis
            thaliana] gi|332002943|gb|AED90326.1| aprataxin
            [Arabidopsis thaliana]
          Length = 912

 Score =  513 bits (1322), Expect = e-143
 Identities = 262/443 (59%), Positives = 320/443 (72%), Gaps = 37/443 (8%)
 Frame = -3

Query: 1485 IPTLVFPSISTADFQFDLEKASNIIIEKVEEYIGRIGKGKLVLVDLSRGSKILSLVETKA 1306
            +PTL FPSISTADFQFDLEKAS+II+EK EE++ ++G  +LVLVDLSRGSKILSLV+ KA
Sbjct: 455  VPTLAFPSISTADFQFDLEKASDIIVEKAEEFLSKLGTARLVLVDLSRGSKILSLVKAKA 514

Query: 1305 AKKNIDSNKFSTFVGDITRLRSQGGLHCNVIANTTNWRLKPGGGGVNSAIFKAAGPELEV 1126
            ++KNIDS KF TFVGDIT+LRS+GGLHCNVIAN TNWRLKPGGGGVN+AIFKAAGP+LE 
Sbjct: 515  SQKNIDSAKFFTFVGDITKLRSEGGLHCNVIANATNWRLKPGGGGVNAAIFKAAGPDLET 574

Query: 1125 ATKERAETLAPGKCVAVPLPSSSPL-IAEGVTHVIHVLGPNMNPMRPDCLKDNYSEGCKI 949
            AT+ RA TL PGK V VPLPS+ PL  AEG+THVIHVLGPNMNP RPD L ++Y++GCK 
Sbjct: 575  ATRVRANTLLPGKAVVVPLPSTCPLHNAEGITHVIHVLGPNMNPNRPDNLNNDYTKGCKT 634

Query: 948  LREAYSSLFEAFVSIVESNNK----------DQNGKRAADYIPESNKKSKGST------- 820
            LREAY+SLFE F+S+V+  +K            +G+   +   E NKK KGS        
Sbjct: 635  LREAYTSLFEGFLSVVQDQSKLPKRSSQTAVSDSGEDIKE-DSERNKKYKGSQDKAVTNN 693

Query: 819  -------------------WGTWAQALHKIAMHPDNHKDXXXXXXXXXXVIKDSYPKAQQ 697
                               W TWA ALH IAMHP+ H++          VI D YPKA++
Sbjct: 694  LESESLEDTRGSGKKMSKGWNTWALALHSIAMHPERHENVVLEYLDNIVVINDQYPKARK 753

Query: 696  HLLVIARLPGLDSIADVSTEHIPLLKEMHAVGLKWAEKFILEDKSQSFRLGYHSIPSMRQ 517
            H+LV+AR   LD + DV  E++ LL+EMH VGLKW ++F  ED S  FRLGYHS+PSMRQ
Sbjct: 754  HVLVLARQESLDGLEDVRKENLQLLQEMHNVGLKWVDRFQNEDASLIFRLGYHSVPSMRQ 813

Query: 516  LHLHMISQDFDSSHLKNKKHWLSFNSPFFLDSLDVIEGLEKGGKLSLANETCFAGPLRCH 337
            LHLH+ISQDF+S  LKNKKHW SF + FF DS+DV+E +   GK ++A+E    G LRC+
Sbjct: 814  LHLHVISQDFNSDSLKNKKHWNSFTTSFFRDSVDVLEEVNSQGKANVASEDLLKGELRCN 873

Query: 336  RCQSVNPNIPKLKNHIRSCKEPF 268
            RC+S +PNIPKLK+H+RSC   F
Sbjct: 874  RCRSAHPNIPKLKSHVRSCHSQF 896


>ref|XP_002873016.1| basic helix-loop-helix family protein [Arabidopsis lyrata subsp.
            lyrata] gi|297318853|gb|EFH49275.1| basic
            helix-loop-helix family protein [Arabidopsis lyrata
            subsp. lyrata]
          Length = 898

 Score =  513 bits (1321), Expect = e-143
 Identities = 261/443 (58%), Positives = 321/443 (72%), Gaps = 37/443 (8%)
 Frame = -3

Query: 1485 IPTLVFPSISTADFQFDLEKASNIIIEKVEEYIGRIGKGKLVLVDLSRGSKILSLVETKA 1306
            +PTL FPSISTADFQFDLEKAS+II+EK EE++ ++G  +LVLVDLS+GSKILSLV+ KA
Sbjct: 440  VPTLAFPSISTADFQFDLEKASDIIVEKAEEFLPKLGTARLVLVDLSQGSKILSLVKAKA 499

Query: 1305 AKKNIDSNKFSTFVGDITRLRSQGGLHCNVIANTTNWRLKPGGGGVNSAIFKAAGPELEV 1126
            A+KNIDS +F TFVGDIT+LRS+GGLHCNVIAN TNWRLKPGGGGVN+AIFKAAGP+LE 
Sbjct: 500  AQKNIDSARFFTFVGDITKLRSEGGLHCNVIANATNWRLKPGGGGVNAAIFKAAGPDLEA 559

Query: 1125 ATKERAETLAPGKCVAVPLPSSSPLI-AEGVTHVIHVLGPNMNPMRPDCLKDNYSEGCKI 949
            AT+ RA TL PGK   VPLPS+ PL  AEG+THVIHVLGPNMNP RPD L ++Y++GCK 
Sbjct: 560  ATRVRANTLLPGKAAVVPLPSTCPLHNAEGITHVIHVLGPNMNPNRPDNLNNDYTKGCKT 619

Query: 948  LREAYSSLFEAFVSIVESNNK----------DQNGKRAADYIPESNKKSKGST------- 820
            LREAY+SLFE F+S+V+  +K            +G+   +   E NKK KGS        
Sbjct: 620  LREAYTSLFEGFLSVVQDQSKLPKRSNQTALSDSGEDIKED-SERNKKYKGSQDKAVTNN 678

Query: 819  -------------------WGTWAQALHKIAMHPDNHKDXXXXXXXXXXVIKDSYPKAQQ 697
                               W TWA ALH IAMHP+ H++          VI D YPKA++
Sbjct: 679  LESGSLEDTRDSGKKMSKGWSTWALALHSIAMHPERHENVVLEFSDNIVVINDQYPKARK 738

Query: 696  HLLVIARLPGLDSIADVSTEHIPLLKEMHAVGLKWAEKFILEDKSQSFRLGYHSIPSMRQ 517
            H+LV+AR   LD + DV  E++ LL+EMH VGLKW ++F  ED S  FRLGYHS+PSMRQ
Sbjct: 739  HVLVLARQESLDGLEDVRKENLQLLQEMHNVGLKWVDRFQNEDASLIFRLGYHSVPSMRQ 798

Query: 516  LHLHMISQDFDSSHLKNKKHWLSFNSPFFLDSLDVIEGLEKGGKLSLANETCFAGPLRCH 337
            LHLH+ISQDFDS  LKNKKHW SF S FF DS+DV+E ++  GK ++A+E    G LRC+
Sbjct: 799  LHLHVISQDFDSDSLKNKKHWNSFTSSFFRDSVDVLEEVKSQGKANVASEDLLKGELRCN 858

Query: 336  RCQSVNPNIPKLKNHIRSCKEPF 268
            RC+S +PNIPKLK+H+R+C+  F
Sbjct: 859  RCRSAHPNIPKLKSHVRNCRSQF 881


>ref|XP_002323598.1| predicted protein [Populus trichocarpa] gi|222868228|gb|EEF05359.1|
            predicted protein [Populus trichocarpa]
          Length = 718

 Score =  512 bits (1319), Expect = e-143
 Identities = 276/507 (54%), Positives = 333/507 (65%), Gaps = 64/507 (12%)
 Frame = -3

Query: 1566 ACDALQKMREGGELDMXXXXXXXXXXSIPTLVFPSISTADFQFDLEKASNIIIEKVEEYI 1387
            +C A + ++E  +L             I TL FPSISTADFQF+ EKAS+II+EKVEE++
Sbjct: 206  SCAASKDVKESEDLAKDSVDADVSVGDITTLAFPSISTADFQFNNEKASDIIVEKVEEFV 265

Query: 1386 GRIGKGKLVLVDLSRGSKILSLVETKAAKKNIDSNKFSTFVGDITRLRSQGGLHCNVIAN 1207
             ++   + VLVDLS GSKILSLV  KAAK+NIDS KF TFVGDITRL SQGGL CN IAN
Sbjct: 266  NKLENARFVLVDLSHGSKILSLVRAKAAKRNIDSKKFFTFVGDITRLYSQGGLRCNAIAN 325

Query: 1206 TTNWRLKPGGGGVNSAIFKAAGPELEVATKERAETLAPGKCVAVPLPSSSPLIA-EGVTH 1030
              NWRLKPGGGGVN+AIF AAGP LE ATKERA++L PG  V VPLPS SPL   E V+H
Sbjct: 326  AANWRLKPGGGGVNAAIFAAAGPSLETATKERAKSLLPGHAVVVPLPSDSPLYTREEVSH 385

Query: 1029 VIHVLGPNMNPMRPDCLKDNYSEGCKILREAYSSLFEAFVSIVESNNK------------ 886
            VIHVLGPNMNP RP+ L ++Y++GC ILREAY+SLF  F+SIV S +K            
Sbjct: 386  VIHVLGPNMNPQRPNSLNNDYTKGCSILREAYTSLFTGFLSIVRSRSKLPRRIIEKLESS 445

Query: 885  ------------------DQNGKRAADYIPESNKKSKGS--------------------- 823
                              DQ  KR  D + E +KK KG+                     
Sbjct: 446  PSDLKDPSHGPRNHLTNSDQKIKRDDDCVYERSKKCKGTHDETVADISAPSSTYGKVTGD 505

Query: 822  ----------TWGTWAQALHKIAMHPDNHKDXXXXXXXXXXVIKDSYPKAQQHLLVIARL 673
                      +WG+WAQAL+ IAMHP+ HKD          V+ D YPKA +HLLV+AR 
Sbjct: 506  KSKLEGPTSKSWGSWAQALYHIAMHPEKHKDKLLEVLDDVVVLNDLYPKACKHLLVLARH 565

Query: 672  PGLDSIADVSTEHIPLLKEMHAVGLKWAEKFILEDKSQSFRLGYHSIPSMRQLHLHMISQ 493
             GLD +ADV  EH+ LL  MHAVGLKWAEKF+ ED S  FRLGYHS+PSMRQLHLH+ISQ
Sbjct: 566  EGLDCLADVHQEHLQLLMTMHAVGLKWAEKFLHEDSSMVFRLGYHSVPSMRQLHLHVISQ 625

Query: 492  DFDSSHLKNKKHWLSFNSPFFLDSLDVIEGLEKGGKLSLANETC-FAGPLRCHRCQSVNP 316
            DF+S+HLKNKKHW SFN+ FF DS+DVIE ++  GK ++ +E C  +  LRCHRC+S +P
Sbjct: 626  DFNSNHLKNKKHWNSFNTAFFRDSVDVIEEIKNHGKATIKDEDCRLSMELRCHRCRSAHP 685

Query: 315  NIPKLKNHIRSCKEPFT-CLLEAGCLI 238
            NIP+LK+HI  C+ PF   LLE G L+
Sbjct: 686  NIPRLKSHISICQAPFPHALLENGRLV 712


>ref|XP_002530499.1| aprataxin, putative [Ricinus communis] gi|223529956|gb|EEF31883.1|
            aprataxin, putative [Ricinus communis]
          Length = 749

 Score =  511 bits (1315), Expect = e-142
 Identities = 273/513 (53%), Positives = 337/513 (65%), Gaps = 63/513 (12%)
 Frame = -3

Query: 1587 AKTGSHTACDALQKMREGGELDMXXXXXXXXXXSIPTLVFPSISTADFQFDLEKASNIII 1408
            +  GS+ A  A          D+          SIPTL FPSISTADFQF  EKAS+II+
Sbjct: 228  SNVGSNIALSATTSKEVKESEDLIKGSICHDEDSIPTLAFPSISTADFQFHNEKASDIIV 287

Query: 1407 EKVEEYIGRIGKGKLVLVDLSRGSKILSLVETKAAKKNIDSNKFSTFVGDITRLRSQGGL 1228
            EKVEE++ ++G  +LVLVDLS+GSKILSLV  KAA++NI +NKF TFVGDIT+L SQGGL
Sbjct: 288  EKVEEFVKKLGNARLVLVDLSQGSKILSLVRAKAAQRNISTNKFFTFVGDITQLLSQGGL 347

Query: 1227 HCNVIANTTNWRLKPGGGGVNSAIFKAAGPELEVATKERAETLAPGKCVAVPLPSSSPLI 1048
             CNVIAN  NWRLKPGGGGVN+AI+ AAGP LEVATKE A +L PG  V VPLPS+SPL 
Sbjct: 348  RCNVIANAANWRLKPGGGGVNAAIYSAAGPALEVATKELATSLLPGHAVVVPLPSNSPLY 407

Query: 1047 -AEGVTHVIHVLGPNMNPMRPDCLKDNYSEGCKILREAYSSLFEAFVSIVES-------- 895
              EGV+H+IHVLGPNMNP RP+CL  +Y++GCKIL +AY+SLF  FVSI+++        
Sbjct: 408  HREGVSHIIHVLGPNMNPQRPNCLNGDYAKGCKILSDAYTSLFGGFVSILQNQAKSGKSR 467

Query: 894  ---------------------NNKDQNGKRAADYIPESNKKSKGS--------------- 823
                                  N DQ  KR  DY+ E +KK KGS               
Sbjct: 468  ENLVSDQSLQDMSHDIPRNILTNGDQKIKRDDDYMTEKSKKYKGSQNETRVNSTGSGCTY 527

Query: 822  ----------------TWGTWAQALHKIAMHPDNHKDXXXXXXXXXXVIKDSYPKAQQHL 691
                            +W +WAQAL+ IAM P+ HKD          V+ D YPKAQ+HL
Sbjct: 528  GKISRDNSKIDGSTSKSWNSWAQALYHIAMRPERHKDELLEISDDVVVLNDLYPKAQKHL 587

Query: 690  LVIARLPGLDSIADVSTEHIPLLKEMHAVGLKWAEKFILEDKSQSFRLGYHSIPSMRQLH 511
            LV+AR PGLD +ADV  EHI LL  MH VGLKWA++F+ ED S  FRLGYHS PSMRQLH
Sbjct: 588  LVLARYPGLDGLADVHEEHIQLLTTMHTVGLKWAKRFLHEDSSMIFRLGYHSTPSMRQLH 647

Query: 510  LHMISQDFDSSHLKNKKHWLSFNSPFFLDSLDVIEGLEKGGKLSLANETCFAG-PLRCHR 334
            LH+ISQDF+S+HLKNKKHW +FN+ FF DS+DVIE ++  GK ++ ++  +    LRCHR
Sbjct: 648  LHVISQDFNSNHLKNKKHWNTFNTAFFRDSVDVIEEVQNHGKANIKDDNSYLSMELRCHR 707

Query: 333  CQSVNPNIPKLKNHIRSCKEPF-TCLLEAGCLI 238
            C+S +PNIP+L++HI +C+ PF T LLE   L+
Sbjct: 708  CRSAHPNIPRLRSHISNCRAPFPTFLLEKDRLL 740


>emb|CBI18255.3| unnamed protein product [Vitis vinifera]
          Length = 678

 Score =  508 bits (1307), Expect = e-141
 Identities = 254/416 (61%), Positives = 315/416 (75%), Gaps = 10/416 (2%)
 Frame = -3

Query: 1485 IPTLVFPSISTADFQFDLEKASNIIIEKVEEYIGRIGKGKLVLVDLSRGSKILSLVETKA 1306
            IPTL FPSISTADFQF+ EKA++II+EKVEE++ ++   +LVLVDLS GSKILSLV  KA
Sbjct: 242  IPTLAFPSISTADFQFNHEKAADIILEKVEEFVNKVENARLVLVDLSHGSKILSLVRAKA 301

Query: 1305 AKKNIDSNKFSTFVGDITRLRSQGGLHCNVIANTTNWRLKPGGGGVNSAIFKAAGPELEV 1126
            A++NIDSNKF TFVGDITRL S+GGL CN IAN  NWRLKPGGGG N+AIF AAGPELEV
Sbjct: 302  AQRNIDSNKFFTFVGDITRLYSKGGLRCNAIANAANWRLKPGGGGANAAIFSAAGPELEV 361

Query: 1125 ATKERAETLAPGKCVAVPLPSSSPLIA-EGVTHVIHVLGPNMNPMRPDCLKDNYSEGCKI 949
             TK+RA +L PGK + VPLPS+SPL + EGVTHVIHVLGPNMN  RP+CL ++Y +G K+
Sbjct: 362  ETKKRAGSLIPGKALVVPLPSTSPLFSREGVTHVIHVLGPNMNRQRPNCLNNDYVKGSKV 421

Query: 948  LREAYSSLFEAFVSIVES-----NNKDQNGKRAADYIPESNKK---SKGSTWGTWAQALH 793
            LREAY+SLFE F SI+ +         +N +     +  +N+K   +   TWG+WAQ+L+
Sbjct: 422  LREAYTSLFEGFASIMNTQGNLLEGSSENLRSELSRVGLNNEKIGRNMTKTWGSWAQSLY 481

Query: 792  KIAMHPDNHKDXXXXXXXXXXVIKDSYPKAQQHLLVIARLPGLDSIADVSTEHIPLLKEM 613
             IAMHP+ HKD          V+ D YPKAQ+HLLV+AR  GLD +ADV  EH+ LL+ M
Sbjct: 482  HIAMHPEKHKDNLIEISDDVVVLNDLYPKAQRHLLVLARSEGLDCLADVGGEHLQLLRTM 541

Query: 612  HAVGLKWAEKFILEDKSQSFRLGYHSIPSMRQLHLHMISQDFDSSHLKNKKHWLSFNSPF 433
            HAVGLKWAEKF+ ED+   FR+GYHS PSMRQLHLH+ISQDF+S HLKNKKHW SFNS F
Sbjct: 542  HAVGLKWAEKFLCEDELLVFRIGYHSAPSMRQLHLHVISQDFNSKHLKNKKHWNSFNSAF 601

Query: 432  FLDSLDVIEGLEKGGKLSLANE-TCFAGPLRCHRCQSVNPNIPKLKNHIRSCKEPF 268
            F DS+DVIE +   G+ ++  E +  +  LRCHRC+S +PN+P+LK+HI +C+  F
Sbjct: 602  FRDSVDVIEEITNHGRATIKGEDSQLSMELRCHRCRSAHPNMPRLKSHISNCQASF 657

