BLASTX nr result
ID: Scutellaria23_contig00004104
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Scutellaria23_contig00004104 (2199 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002276933.1| PREDICTED: trihelix transcription factor GT-... 359 1e-96 ref|XP_002331882.1| predicted protein [Populus trichocarpa] gi|2... 355 4e-95 ref|XP_003536427.1| PREDICTED: trihelix transcription factor GT-... 273 1e-70 ref|NP_177814.1| putative trihelix DNA-binding protein [Arabidop... 265 3e-68 ref|XP_002887660.1| hypothetical protein ARALYDRAFT_895569 [Arab... 265 3e-68 >ref|XP_002276933.1| PREDICTED: trihelix transcription factor GT-2 [Vitis vinifera] Length = 510 Score = 359 bits (922), Expect = 1e-96 Identities = 193/432 (44%), Positives = 262/432 (60%), Gaps = 1/432 (0%) Frame = -2 Query: 1973 EEIXXXXXGNRWPRQETLALLRIRSDMDATFRDASLKGPLWEEVSRKMGELGFQRSAKKC 1794 EE GNRWPR+ETLALL+IRSDMD FRD+SLK PLWEEVSRK+GELG+ R+AKKC Sbjct: 41 EESDRNFAGNRWPREETLALLKIRSDMDVVFRDSSLKAPLWEEVSRKLGELGYHRNAKKC 100 Query: 1793 KEKFENVYKYHKRTKDGRASKSDGKTYRFFDQLEALEXXXXXXXXXXXXXXXXXXXXXXX 1614 KEKFEN++KYHKRTK+GR+++ +GK YRFF+QLEAL+ Sbjct: 101 KEKFENIFKYHKRTKEGRSNRQNGKNYRFFEQLEALD----------------NHPLMPP 144 Query: 1613 XXXXXXXXXXXXXXXXXVANPISYSPFQPPIPPQSHHFQLSHRPILXXXXXXXXXXXXXS 1434 NPI + I Q +P + Sbjct: 145 PSPVKYETSTPMAASMPQTNPIDVTNVSQGINAVPCSIQ---KPAVDCVAASTSTTSSSG 201 Query: 1433 DEDIQRRRGRKRKWKDYLERLMEDVVKRQEELQKRFLDTLEKSERDRIAREEAWRVQEMA 1254 E R+ +++ W + E+LM++V+++QE LQ++F++ +EK E+DRIAREEAW++QE+ Sbjct: 202 KESEGSRKKKRK-WGVFFEKLMKEVIEKQENLQRKFIEAIEKCEQDRIAREEAWKLQELD 260 Query: 1253 RMNREHDRLAHERSIXXXXXXXXXAFLQKVTGEKS-LQIPVSNTNNAIQKXXXXXXXXXX 1077 
R+ REH+ L ERSI AFLQK+ + +Q+P + ++ + + Sbjct: 261 RIKREHEILVQERSIAAAKDAAVLAFLQKIAEQAGPVQLPENPSSEKVFEKQD------- 313 Query: 1076 XXXXXXXXXXXXXXXXXXXXXXATTRDNGGDDLLSPSSSRWPKAEVEALIKLRTDLENRY 897 ++ G++ + SSSRWPKAEVEALI+LRT+ + +Y Sbjct: 314 --------------------------NSNGENSIQMSSSRWPKAEVEALIRLRTNFDMQY 347 Query: 896 QENGPKGPLWEEISKSMAKIGYNRSSKRCKEKWENINKYFKKVKESNKKRPQDSKTCPYF 717 QE+GPKGPLWEEIS +M KIGY RS+KRCKEKWENINKYFK+V++SNK+RP+DSKTCPYF Sbjct: 348 QESGPKGPLWEEISLAMRKIGYERSAKRCKEKWENINKYFKRVRDSNKRRPEDSKTCPYF 407 Query: 716 QQLDAIYRERAK 681 QLDA+Y+E+ K Sbjct: 408 HQLDALYKEKTK 419 >ref|XP_002331882.1| predicted protein [Populus trichocarpa] gi|222874631|gb|EEF11762.1| predicted protein [Populus trichocarpa] Length = 470 Score = 355 bits (910), Expect = 4e-95 Identities = 192/422 (45%), Positives = 251/422 (59%) Frame = -2 Query: 1946 NRWPRQETLALLRIRSDMDATFRDASLKGPLWEEVSRKMGELGFQRSAKKCKEKFENVYK 1767 NRWP+QETLALL IRSDMD FRD+ +K PLWEEVSRK+ ELG+ RSAKKCKEKFEN+YK Sbjct: 15 NRWPKQETLALLEIRSDMDVAFRDSVVKAPLWEEVSRKLNELGYNRSAKKCKEKFENIYK 74 Query: 1766 YHKRTKDGRASKSDGKTYRFFDQLEALEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1587 YH+RTK ++ + +GKTYRFF+QL+AL+ Sbjct: 75 YHRRTKGSQSGRPNGKTYRFFEQLQALDKTNALVSPTSSDKDHCLMPSASVI-------- 126 Query: 1586 XXXXXXXXVANPISYSPFQPPIPPQSHHFQLSHRPILXXXXXXXXXXXXXSDEDIQRRRG 1407 P+S+ P P QS P + S E+ + R Sbjct: 127 -----------PVSFIPNDVPCSVQS--------PRMNCTDATSTSTASTSSEESEGTRK 167 Query: 1406 RKRKWKDYLERLMEDVVKRQEELQKRFLDTLEKSERDRIAREEAWRVQEMARMNREHDRL 1227 +KR+ D+ ERLM++V+++QE LQ +FL+ +EK E++RIAREE W++QE+ R+ RE + L Sbjct: 168 KKRRLTDFFERLMKEVIEKQENLQNKFLEAIEKCEQERIAREEVWKMQELDRIKREQELL 227 Query: 1226 AHERSIXXXXXXXXXAFLQKVTGEKSLQIPVSNTNNAIQKXXXXXXXXXXXXXXXXXXXX 1047 HER+I AFLQK + + IPV +N Sbjct: 228 VHERAIAAAKDAAVLAFLQKFSEQG---IPVQLPDNPT-------VPMKFPDNQTSPALL 277 Query: 1046 XXXXXXXXXXXXATTRDNGGDDLLSPSSSRWPKAEVEALIKLRTDLENRYQENGPKGPLW 867 T ++ + ++ SSSRWPK E+E+LIK+RT LE +YQENGPKGPLW Sbjct: 278 SKNQAVPVENVVKTHENSSVESFVNMSSSRWPKEEIESLIKIRTYLEFQYQENGPKGPLW 337 Query: 866 
EEISKSMAKIGYNRSSKRCKEKWENINKYFKKVKESNKKRPQDSKTCPYFQQLDAIYRER 687 EEIS SM +GY+RS+KRCKEKWEN+NKYFK+VK+SNKKRP DSKTCPYFQQLDA+YRE+ Sbjct: 338 EEISTSMKNLGYDRSAKRCKEKWENMNKYFKRVKDSNKKRPGDSKTCPYFQQLDALYREK 397 Query: 686 AK 681 + Sbjct: 398 TR 399 Score = 95.9 bits (237), Expect = 4e-17 Identities = 42/95 (44%), Positives = 70/95 (73%) Frame = -2 Query: 968 SSSRWPKAEVEALIKLRTDLENRYQENGPKGPLWEEISKSMAKIGYNRSSKRCKEKWENI 789 +++RWPK E AL+++R+D++ ++++ K PLWEE+S+ + ++GYNRS+K+CKEK+ENI Sbjct: 13 TANRWPKQETLALLEIRSDMDVAFRDSVVKAPLWEEVSRKLNELGYNRSAKKCKEKFENI 72 Query: 788 NKYFKKVKESNKKRPQDSKTCPYFQQLDAIYRERA 684 KY ++ K S RP + KT +F+QL A+ + A Sbjct: 73 YKYHRRTKGSQSGRP-NGKTYRFFEQLQALDKTNA 106 >ref|XP_003536427.1| PREDICTED: trihelix transcription factor GT-2-like [Glycine max] Length = 667 Score = 273 bits (699), Expect = 1e-70 Identities = 139/278 (50%), Positives = 178/278 (64%), Gaps = 24/278 (8%) Frame = -2 Query: 1433 DEDIQRRRGRKRKWKDYLERLMEDVVKRQEELQKRFLDTLEKSERDRIAREEAWRVQEMA 1254 +E ++ RR RKRKWKD+ ERLM++V+++QEELQK+FL+ +EK E DRIAREEAWRVQEM Sbjct: 292 EETLEGRRKRKRKWKDFFERLMKEVIEKQEELQKKFLEAIEKREDDRIAREEAWRVQEMK 351 Query: 1253 RMNREHDRLAHERSIXXXXXXXXXAFLQKVTGEKSLQIPVSNTN---------------- 1122 R+NRE + LA ERSI +FLQK+ +++L +N N Sbjct: 352 RINREREILAQERSIAAAKDAAVMSFLQKIAEQQNLGQVSTNINLVQQPQPQLQPQPPLQ 411 Query: 1121 --------NAIQKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXATTRDNGGDDLLSPS 966 A Q + +N G++ L+PS Sbjct: 412 QQVTQPSIAAAQPPVQQPPPVVVTQPVVLPVVSQVTNMEIVKADNNSNNNNNGENFLAPS 471 Query: 965 SSRWPKAEVEALIKLRTDLENRYQENGPKGPLWEEISKSMAKIGYNRSSKRCKEKWENIN 786 SSRWPK EV+ALIKLRT ++ +YQENGPKGPLWEEIS SM K+GYNR++KRCKEKWENIN Sbjct: 472 SSRWPKVEVQALIKLRTSMDEKYQENGPKGPLWEEISASMKKLGYNRNAKRCKEKWENIN 531 Query: 785 KYFKKVKESNKKRPQDSKTCPYFQQLDAIYRERAKNHQ 672 KYFKKVKESNK+RP+DSKTCPYF QLDA+YR++ + + Sbjct: 532 KYFKKVKESNKRRPEDSKTCPYFHQLDALYRQKHRGEE 569 Score = 165 bits (418), Expect = 4e-38 Identities = 79/98 (80%), Positives = 86/98 (87%) Frame = -2 Query: 1976 
MEEIXXXXXGNRWPRQETLALLRIRSDMDATFRDASLKGPLWEEVSRKMGELGFQRSAKK 1797 +EE GNRWPRQETLALLRIRSDMD FRDAS+KGPLWEEVSRKM ELG+ RS+KK Sbjct: 63 IEEGERSFGGNRWPRQETLALLRIRSDMDVAFRDASVKGPLWEEVSRKMAELGYHRSSKK 122 Query: 1796 CKEKFENVYKYHKRTKDGRASKSDGKTYRFFDQLEALE 1683 CKEKFENVYKYHKRTK+GR+ K DGKTYRFFDQL+ALE Sbjct: 123 CKEKFENVYKYHKRTKEGRSGKQDGKTYRFFDQLQALE 160 Score = 103 bits (256), Expect = 3e-19 Identities = 50/118 (42%), Positives = 74/118 (62%), Gaps = 8/118 (6%) Frame = -2 Query: 998 DNGGDDLL--------SPSSSRWPKAEVEALIKLRTDLENRYQENGPKGPLWEEISKSMA 843 +N GDD S +RWP+ E AL+++R+D++ +++ KGPLWEE+S+ MA Sbjct: 53 NNSGDDERGRIEEGERSFGGNRWPRQETLALLRIRSDMDVAFRDASVKGPLWEEVSRKMA 112 Query: 842 KIGYNRSSKRCKEKWENINKYFKKVKESNKKRPQDSKTCPYFQQLDAIYRERAKNHQP 669 ++GY+RSSK+CKEK+EN+ KY K+ KE + QD KT +F QL A+ H P Sbjct: 113 ELGYHRSSKKCKEKFENVYKYHKRTKEGRSGK-QDGKTYRFFDQLQALENHSPTPHSP 169 Score = 92.8 bits (229), Expect = 3e-16 Identities = 41/88 (46%), Positives = 62/88 (70%), Gaps = 1/88 (1%) Frame = -2 Query: 1946 NRWPRQETLALLRIRSDMDATFRDASLKGPLWEEVSRKMGELGFQRSAKKCKEKFENVYK 1767 +RWP+ E AL+++R+ MD +++ KGPLWEE+S M +LG+ R+AK+CKEK+EN+ K Sbjct: 473 SRWPKVEVQALIKLRTSMDEKYQENGPKGPLWEEISASMKKLGYNRNAKRCKEKWENINK 532 Query: 1766 YHKRTKDGRASK-SDGKTYRFFDQLEAL 1686 Y K+ K+ + D KT +F QL+AL Sbjct: 533 YFKKVKESNKRRPEDSKTCPYFHQLDAL 560 >ref|NP_177814.1| putative trihelix DNA-binding protein [Arabidopsis thaliana] gi|12322223|gb|AAG51144.1|AC079283_1 GT-like trihelix DNA-binding protein, putative [Arabidopsis thaliana] gi|332197777|gb|AEE35898.1| putative trihelix DNA-binding protein [Arabidopsis thaliana] Length = 603 Score = 265 bits (678), Expect = 3e-68 Identities = 140/261 (53%), Positives = 173/261 (66%), Gaps = 14/261 (5%) Frame = -2 Query: 1415 RRGRKRKWKDYLERLMEDVVKRQEELQKRFLDTLEKSERDRIAREEAWRVQEMARMNREH 1236 R+ RKRKWK + ERLM+ VV +QEELQ++FL+ +EK E +R+ REE+WRVQE+AR+NREH Sbjct: 247 RKKRKRKWKVFFERLMKQVVDKQEELQRKFLEAVEKREHERLVREESWRVQEIARINREH 306 Query: 1235 
DRLAHERSIXXXXXXXXXAFLQKVTGEKSLQI----------PVSNTNNAIQKXXXXXXX 1086 + LA ERS+ AFLQK++ ++ Q P NN Q+ Sbjct: 307 EILAQERSMSAAKDAAVMAFLQKLSEKQPNQPQPQPQPQQVRPSMQLNNNNQQQPPQRSP 366 Query: 1085 XXXXXXXXXXXXXXXXXXXXXXXXXATTRDNGGDDLLSP----SSSRWPKAEVEALIKLR 918 T DNGGD ++P SSSRWPK E+EALIKLR Sbjct: 367 PPQPPAPLPQPIQAVVSTLDT-----TKTDNGGDQNMTPAASASSSRWPKVEIEALIKLR 421 Query: 917 TDLENRYQENGPKGPLWEEISKSMAKIGYNRSSKRCKEKWENINKYFKKVKESNKKRPQD 738 T+L+++YQENGPKGPLWEEIS M ++G+NR+SKRCKEKWENINKYFKKVKESNKKRP+D Sbjct: 422 TNLDSKYQENGPKGPLWEEISAGMRRLGFNRNSKRCKEKWENINKYFKKVKESNKKRPED 481 Query: 737 SKTCPYFQQLDAIYRERAKNH 675 SKTCPYF QLDA+YRER K H Sbjct: 482 SKTCPYFHQLDALYRERNKFH 502 Score = 157 bits (398), Expect = 9e-36 Identities = 74/88 (84%), Positives = 80/88 (90%) Frame = -2 Query: 1946 NRWPRQETLALLRIRSDMDATFRDASLKGPLWEEVSRKMGELGFQRSAKKCKEKFENVYK 1767 NRWPRQETLALL+IRSDM FRDAS+KGPLWEEVSRKM E G+ R+AKKCKEKFENVYK Sbjct: 60 NRWPRQETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMAEHGYIRNAKKCKEKFENVYK 119 Query: 1766 YHKRTKDGRASKSDGKTYRFFDQLEALE 1683 YHKRTK+GR KS+GKTYRFFDQLEALE Sbjct: 120 YHKRTKEGRTGKSEGKTYRFFDQLEALE 147 Score = 91.3 bits (225), Expect = 1e-15 Identities = 40/88 (45%), Positives = 63/88 (71%), Gaps = 1/88 (1%) Frame = -2 Query: 1946 NRWPRQETLALLRIRSDMDATFRDASLKGPLWEEVSRKMGELGFQRSAKKCKEKFENVYK 1767 +RWP+ E AL+++R+++D+ +++ KGPLWEE+S M LGF R++K+CKEK+EN+ K Sbjct: 407 SRWPKVEIEALIKLRTNLDSKYQENGPKGPLWEEISAGMRRLGFNRNSKRCKEKWENINK 466 Query: 1766 YHKRTKDGRASK-SDGKTYRFFDQLEAL 1686 Y K+ K+ + D KT +F QL+AL Sbjct: 467 YFKKVKESNKKRPEDSKTCPYFHQLDAL 494 Score = 90.5 bits (223), Expect = 2e-15 Identities = 40/88 (45%), Positives = 63/88 (71%) Frame = -2 Query: 962 SRWPKAEVEALIKLRTDLENRYQENGPKGPLWEEISKSMAKIGYNRSSKRCKEKWENINK 783 +RWP+ E AL+K+R+D+ +++ KGPLWEE+S+ MA+ GY R++K+CKEK+EN+ K Sbjct: 60 NRWPRQETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMAEHGYIRNAKKCKEKFENVYK 119 Query: 782 YFKKVKESNKKRPQDSKTCPYFQQLDAI 699 Y K+ KE + + KT +F QL+A+ Sbjct: 120 YHKRTKEGRTGK-SEGKTYRFFDQLEAL 146 >ref|XP_002887660.1| hypothetical protein 
ARALYDRAFT_895569 [Arabidopsis lyrata subsp. lyrata] gi|297333501|gb|EFH63919.1| hypothetical protein ARALYDRAFT_895569 [Arabidopsis lyrata subsp. lyrata] Length = 598 Score = 265 bits (678), Expect = 3e-68 Identities = 138/264 (52%), Positives = 176/264 (66%), Gaps = 17/264 (6%) Frame = -2 Query: 1415 RRGRKRKWKDYLERLMEDVVKRQEELQKRFLDTLEKSERDRIAREEAWRVQEMARMNREH 1236 R+ RKRKWK++ ERLM+ VV +QEELQ++FL+ +EK E +R+ REE+WRVQE+AR+NREH Sbjct: 239 RKKRKRKWKEFFERLMKQVVDKQEELQRKFLEAVEKREHERLVREESWRVQEIARINREH 298 Query: 1235 DRLAHERSIXXXXXXXXXAFLQKVTGEKSLQ--------------IPVSNTNNAIQKXXX 1098 + LA ERS+ AFLQK++ ++ Q + ++N NN Q Sbjct: 299 EILAQERSMSAAKDAAVMAFLQKLSEKQPNQPTAAQPQPQQVRPQMQLNNNNNQQQTPQP 358 Query: 1097 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXATTRDNGGDDLLSP---SSSRWPKAEVEALI 927 TT+ + GD ++P SSSRWPK E+EALI Sbjct: 359 SPPPPPPPLPQAIQAVVPTLD---------TTKTDNGDQNMTPASASSSRWPKVEIEALI 409 Query: 926 KLRTDLENRYQENGPKGPLWEEISKSMAKIGYNRSSKRCKEKWENINKYFKKVKESNKKR 747 KLRT+L+++YQENGPKGPLWEEIS M ++G+NR+SKRCKEKWENINKYFKKVKESNKKR Sbjct: 410 KLRTNLDSKYQENGPKGPLWEEISAGMRRLGFNRNSKRCKEKWENINKYFKKVKESNKKR 469 Query: 746 PQDSKTCPYFQQLDAIYRERAKNH 675 P+DSKTCPYF QLDA+YRER K H Sbjct: 470 PEDSKTCPYFHQLDALYRERNKFH 493 Score = 160 bits (405), Expect = 1e-36 Identities = 75/88 (85%), Positives = 81/88 (92%) Frame = -2 Query: 1946 NRWPRQETLALLRIRSDMDATFRDASLKGPLWEEVSRKMGELGFQRSAKKCKEKFENVYK 1767 NRWPRQETLALL+IRSDM FRDAS+KGPLWEEVSRKM ELG+ R+AKKCKEKFENVYK Sbjct: 55 NRWPRQETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMAELGYIRNAKKCKEKFENVYK 114 Query: 1766 YHKRTKDGRASKSDGKTYRFFDQLEALE 1683 YHKRTK+GR KS+GKTYRFFDQLEALE Sbjct: 115 YHKRTKEGRTGKSEGKTYRFFDQLEALE 142 Score = 92.8 bits (229), Expect = 3e-16 Identities = 42/99 (42%), Positives = 68/99 (68%), Gaps = 1/99 (1%) Frame = -2 Query: 962 SRWPKAEVEALIKLRTDLENRYQENGPKGPLWEEISKSMAKIGYNRSSKRCKEKWENINK 783 +RWP+ E AL+K+R+D+ +++ KGPLWEE+S+ MA++GY R++K+CKEK+EN+ K Sbjct: 55 NRWPRQETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMAELGYIRNAKKCKEKFENVYK 114 Query: 782 YFKKVKESNKKRPQDSKTCPYFQQLDAIYRERAKN-HQP 
669 Y K+ KE + + KT +F QL+A+ + + H P Sbjct: 115 YHKRTKEGRTGK-SEGKTYRFFDQLEALESQSTTSLHHP 152 Score = 91.3 bits (225), Expect = 1e-15 Identities = 40/88 (45%), Positives = 63/88 (71%), Gaps = 1/88 (1%) Frame = -2 Query: 1946 NRWPRQETLALLRIRSDMDATFRDASLKGPLWEEVSRKMGELGFQRSAKKCKEKFENVYK 1767 +RWP+ E AL+++R+++D+ +++ KGPLWEE+S M LGF R++K+CKEK+EN+ K Sbjct: 398 SRWPKVEIEALIKLRTNLDSKYQENGPKGPLWEEISAGMRRLGFNRNSKRCKEKWENINK 457 Query: 1766 YHKRTKDGRASK-SDGKTYRFFDQLEAL 1686 Y K+ K+ + D KT +F QL+AL Sbjct: 458 YFKKVKESNKKRPEDSKTCPYFHQLDAL 485
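The "Sequences producing significant alignments" summary at the top of this report lists five hits with their bit scores and E-values. A minimal sketch of how those summary lines could be read into structured records is given below. The names `Hit`, `HIT_RE`, `SUMMARY` and `parse_hits` are hypothetical helpers introduced only for illustration; they are not part of BLAST, and the regular expression assumes the `ref|ACCESSION| description score e-value` layout seen in this particular report.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    """One row of the BLAST hit summary (hypothetical container)."""
    accession: str
    description: str
    bit_score: float
    e_value: float


# Summary rows copied from the report above; BLAST truncates descriptions with "...".
SUMMARY = """\
ref|XP_002276933.1| PREDICTED: trihelix transcription factor GT-... 359 1e-96
ref|XP_002331882.1| predicted protein [Populus trichocarpa] gi|2... 355 4e-95
ref|XP_003536427.1| PREDICTED: trihelix transcription factor GT-... 273 1e-70
ref|NP_177814.1| putative trihelix DNA-binding protein [Arabidop... 265 3e-68
ref|XP_002887660.1| hypothetical protein ARALYDRAFT_895569 [Arab... 265 3e-68
"""

# Assumed row layout: "ref|<accession>| <description> <bits> <evalue>".
HIT_RE = re.compile(
    r"^ref\|(?P<acc>[^|]+)\|\s+"
    r"(?P<desc>.*?)\s+"
    r"(?P<score>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>(?:\d+(?:\.\d+)?)?e[-+]?\d+|\d+(?:\.\d+)?)\s*$"
)


def parse_hits(text: str) -> List[Hit]:
    """Return one Hit per summary row; rows that do not match are skipped."""
    hits = []
    for line in text.splitlines():
        match = HIT_RE.match(line.strip())
        if match is None:
            continue
        evalue = match.group("evalue")
        # Legacy BLAST can print very small E-values as "e-100" with no mantissa.
        if evalue.startswith("e"):
            evalue = "1" + evalue
        hits.append(
            Hit(
                accession=match.group("acc"),
                description=match.group("desc"),
                bit_score=float(match.group("score")),
                e_value=float(evalue),
            )
        )
    return hits


if __name__ == "__main__":
    for hit in parse_hits(SUMMARY):
        print(hit.accession, hit.bit_score, hit.e_value)
```

For anything beyond a quick look at the summary table, a maintained parser or a tabular BLAST output format is likely a better choice than a hand-written regular expression, since the pairwise text layout varies between BLAST versions.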