BLASTX nr result
ID: Scutellaria22_contig00005444
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria22_contig00005444
         (2153 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002276933.1| PREDICTED: trihelix transcription factor GT-...   361   5e-97
ref|XP_002331882.1| predicted protein [Populus trichocarpa] gi|2...   356   1e-95
ref|XP_003536427.1| PREDICTED: trihelix transcription factor GT-...   273   1e-70
ref|NP_177814.1| putative trihelix DNA-binding protein [Arabidop...   265   3e-68
ref|XP_002887660.1| hypothetical protein ARALYDRAFT_895569 [Arab...   265   3e-68

>ref|XP_002276933.1| PREDICTED: trihelix transcription factor GT-2 [Vitis vinifera]
          Length = 510

 Score = 361 bits (926), Expect = 5e-97
 Identities = 194/432 (44%), Positives = 262/432 (60%), Gaps = 1/432 (0%)
 Frame = -2

Query: 1933 EEIXXXXXGNRWPRQETLALLRIRSDMDATFRDASLKGPLWEEVSRKMGELGFQRSAKKC 1754
            EE GNRWPR+ETLALL+IRSDMD FRD+SLK PLWEEVSRK+GELG+ R+AKKC
Sbjct: 41   EESDRNFAGNRWPREETLALLKIRSDMDVVFRDSSLKAPLWEEVSRKLGELGYHRNAKKC 100

Query: 1753 KEKFENVYKYHKRTKDGRASKSDGKTYRFFDQLEALDXXXXXXXXXXXXXXXXXXXXXXX 1574
            KEKFEN++KYHKRTK+GR+++ +GK YRFF+QLEALD
Sbjct: 101  KEKFENIFKYHKRTKEGRSNRQNGKNYRFFEQLEALD----------------NHPLMPP 144

Query: 1573 XXXXXXXXXXXXXXXXXVANPISYSPFQPPIPPQSHHFQLSHRPILXXXXXXXXXXXXXS 1394
            NPI + I Q +P +
Sbjct: 145  PSPVKYETSTPMAASMPQTNPIDVTNVSQGINAVPCSIQ---KPAVDCVAASTSTTSSSG 201

Query: 1393 DEDIQRRRGRKRKWKDYLERLMEDVVKRQEELQKRFLDTLEKSERDRIAREEAWRVQEMA 1214
            E R+ +++ W + E+LM++V+++QE LQ++F++ +EK E+DRIAREEAW++QE+
Sbjct: 202  KESEGSRKKKRK-WGVFFEKLMKEVIEKQENLQRKFIEAIEKCEQDRIAREEAWKLQELD 260

Query: 1213 RMNREHDRLAHERSIXXXXXXXXXAFLQKVTGEKS-LQIPVSNTNNAIQKXXXXXXXXXX 1037
            R+ REH+ L ERSI AFLQK+ + +Q+P + ++ + +
Sbjct: 261  RIKREHEILVQERSIAAAKDAAVLAFLQKIAEQAGPVQLPENPSSEKVFEKQD------- 313

Query: 1036 XXXXXXXXXXXXXXXXXXXXXXATTRDNGGDDLLSPSSSRWPKAEVEALIKLRTDLENRY 857
            ++ G++ + SSSRWPKAEVEALI+LRT+ + +Y
Sbjct: 314  --------------------------NSNGENSIQMSSSRWPKAEVEALIRLRTNFDMQY 347

Query: 856  QENGPKGPLWEEISKSMAKIGYNRSSKRCKEKWENINKYFKKVKESNKKRPQDSKTCPYF 677
            QE+GPKGPLWEEIS +M KIGY RS+KRCKEKWENINKYFK+V++SNK+RP+DSKTCPYF
Sbjct: 348  QESGPKGPLWEEISLAMRKIGYERSAKRCKEKWENINKYFKRVRDSNKRRPEDSKTCPYF 407

Query: 676  QQLDAIYRERAK 641
            QLDA+Y+E+ K
Sbjct: 408  HQLDALYKEKTK 419

>ref|XP_002331882.1| predicted protein [Populus trichocarpa]
 gi|222874631|gb|EEF11762.1| predicted protein [Populus trichocarpa]
          Length = 470

 Score = 356 bits (914), Expect = 1e-95
 Identities = 193/422 (45%), Positives = 251/422 (59%)
 Frame = -2

Query: 1906 NRWPRQETLALLRIRSDMDATFRDASLKGPLWEEVSRKMGELGFQRSAKKCKEKFENVYK 1727
            NRWP+QETLALL IRSDMD FRD+ +K PLWEEVSRK+ ELG+ RSAKKCKEKFEN+YK
Sbjct: 15   NRWPKQETLALLEIRSDMDVAFRDSVVKAPLWEEVSRKLNELGYNRSAKKCKEKFENIYK 74

Query: 1726 YHKRTKDGRASKSDGKTYRFFDQLEALDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1547
            YH+RTK ++ + +GKTYRFF+QL+ALD
Sbjct: 75   YHRRTKGSQSGRPNGKTYRFFEQLQALDKTNALVSPTSSDKDHCLMPSASVI-------- 126

Query: 1546 XXXXXXXXVANPISYSPFQPPIPPQSHHFQLSHRPILXXXXXXXXXXXXXSDEDIQRRRG 1367
            P+S+ P P QS P + S E+ + R
Sbjct: 127  -----------PVSFIPNDVPCSVQS--------PRMNCTDATSTSTASTSSEESEGTRK 167

Query: 1366 RKRKWKDYLERLMEDVVKRQEELQKRFLDTLEKSERDRIAREEAWRVQEMARMNREHDRL 1187
            +KR+ D+ ERLM++V+++QE LQ +FL+ +EK E++RIAREE W++QE+ R+ RE + L
Sbjct: 168  KKRRLTDFFERLMKEVIEKQENLQNKFLEAIEKCEQERIAREEVWKMQELDRIKREQELL 227

Query: 1186 AHERSIXXXXXXXXXAFLQKVTGEKSLQIPVSNTNNAIQKXXXXXXXXXXXXXXXXXXXX 1007
            HER+I AFLQK + + IPV +N
Sbjct: 228  VHERAIAAAKDAAVLAFLQKFSEQG---IPVQLPDNPT-------VPMKFPDNQTSPALL 277

Query: 1006 XXXXXXXXXXXXATTRDNGGDDLLSPSSSRWPKAEVEALIKLRTDLENRYQENGPKGPLW 827
            T ++ + ++ SSSRWPK E+E+LIK+RT LE +YQENGPKGPLW
Sbjct: 278  SKNQAVPVENVVKTHENSSVESFVNMSSSRWPKEEIESLIKIRTYLEFQYQENGPKGPLW 337

Query: 826  EEISKSMAKIGYNRSSKRCKEKWENINKYFKKVKESNKKRPQDSKTCPYFQQLDAIYRER 647
            EEIS SM +GY+RS+KRCKEKWEN+NKYFK+VK+SNKKRP DSKTCPYFQQLDA+YRE+
Sbjct: 338  EEISTSMKNLGYDRSAKRCKEKWENMNKYFKRVKDSNKKRPGDSKTCPYFQQLDALYREK 397

Query: 646  AK 641
            +
Sbjct: 398  TR 399

 Score = 95.9 bits (237), Expect = 4e-17
 Identities = 42/95 (44%), Positives = 70/95 (73%)
 Frame = -2

Query: 928  SSSRWPKAEVEALIKLRTDLENRYQENGPKGPLWEEISKSMAKIGYNRSSKRCKEKWENI 749
            +++RWPK E AL+++R+D++ ++++ K PLWEE+S+ + ++GYNRS+K+CKEK+ENI
Sbjct: 13   TANRWPKQETLALLEIRSDMDVAFRDSVVKAPLWEEVSRKLNELGYNRSAKKCKEKFENI 72

Query: 748  NKYFKKVKESNKKRPQDSKTCPYFQQLDAIYRERA 644
            KY ++ K S RP + KT +F+QL A+ + A
Sbjct: 73   YKYHRRTKGSQSGRP-NGKTYRFFEQLQALDKTNA 106

>ref|XP_003536427.1| PREDICTED: trihelix transcription factor GT-2-like [Glycine max]
          Length = 667

 Score = 273 bits (699), Expect = 1e-70
 Identities = 139/278 (50%), Positives = 178/278 (64%), Gaps = 24/278 (8%)
 Frame = -2

Query: 1393 DEDIQRRRGRKRKWKDYLERLMEDVVKRQEELQKRFLDTLEKSERDRIAREEAWRVQEMA 1214
            +E ++ RR RKRKWKD+ ERLM++V+++QEELQK+FL+ +EK E DRIAREEAWRVQEM
Sbjct: 292  EETLEGRRKRKRKWKDFFERLMKEVIEKQEELQKKFLEAIEKREDDRIAREEAWRVQEMK 351

Query: 1213 RMNREHDRLAHERSIXXXXXXXXXAFLQKVTGEKSLQIPVSNTN---------------- 1082
            R+NRE + LA ERSI +FLQK+ +++L +N N
Sbjct: 352  RINREREILAQERSIAAAKDAAVMSFLQKIAEQQNLGQVSTNINLVQQPQPQLQPQPPLQ 411

Query: 1081 --------NAIQKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXATTRDNGGDDLLSPS 926
            A Q + +N G++ L+PS
Sbjct: 412  QQVTQPSIAAAQPPVQQPPPVVVTQPVVLPVVSQVTNMEIVKADNNSNNNNNGENFLAPS 471

Query: 925  SSRWPKAEVEALIKLRTDLENRYQENGPKGPLWEEISKSMAKIGYNRSSKRCKEKWENIN 746
            SSRWPK EV+ALIKLRT ++ +YQENGPKGPLWEEIS SM K+GYNR++KRCKEKWENIN
Sbjct: 472  SSRWPKVEVQALIKLRTSMDEKYQENGPKGPLWEEISASMKKLGYNRNAKRCKEKWENIN 531

Query: 745  KYFKKVKESNKKRPQDSKTCPYFQQLDAIYRERAKNHQ 632
            KYFKKVKESNK+RP+DSKTCPYF QLDA+YR++ + +
Sbjct: 532  KYFKKVKESNKRRPEDSKTCPYFHQLDALYRQKHRGEE 569

 Score = 164 bits (415), Expect = 9e-38
 Identities = 78/98 (79%), Positives = 86/98 (87%)
 Frame = -2

Query: 1936 MEEIXXXXXGNRWPRQETLALLRIRSDMDATFRDASLKGPLWEEVSRKMGELGFQRSAKK 1757
            +EE GNRWPRQETLALLRIRSDMD FRDAS+KGPLWEEVSRKM ELG+ RS+KK
Sbjct: 63   IEEGERSFGGNRWPRQETLALLRIRSDMDVAFRDASVKGPLWEEVSRKMAELGYHRSSKK 122

Query: 1756 CKEKFENVYKYHKRTKDGRASKSDGKTYRFFDQLEALD 1643
            CKEKFENVYKYHKRTK+GR+ K DGKTYRFFDQL+AL+
Sbjct: 123  CKEKFENVYKYHKRTKEGRSGKQDGKTYRFFDQLQALE 160

 Score = 103 bits (256), Expect = 2e-19
 Identities = 50/118 (42%), Positives = 74/118 (62%), Gaps = 8/118 (6%)
 Frame = -2

Query: 958  DNGGDDLL--------SPSSSRWPKAEVEALIKLRTDLENRYQENGPKGPLWEEISKSMA 803
            +N GDD S +RWP+ E AL+++R+D++ +++ KGPLWEE+S+ MA
Sbjct: 53   NNSGDDERGRIEEGERSFGGNRWPRQETLALLRIRSDMDVAFRDASVKGPLWEEVSRKMA 112

Query: 802  KIGYNRSSKRCKEKWENINKYFKKVKESNKKRPQDSKTCPYFQQLDAIYRERAKNHQP 629
            ++GY+RSSK+CKEK+EN+ KY K+ KE + QD KT +F QL A+ H P
Sbjct: 113  ELGYHRSSKKCKEKFENVYKYHKRTKEGRSGK-QDGKTYRFFDQLQALENHSPTPHSP 169

 Score = 92.8 bits (229), Expect = 3e-16
 Identities = 41/88 (46%), Positives = 62/88 (70%), Gaps = 1/88 (1%)
 Frame = -2

Query: 1906 NRWPRQETLALLRIRSDMDATFRDASLKGPLWEEVSRKMGELGFQRSAKKCKEKFENVYK 1727
            +RWP+ E AL+++R+ MD +++ KGPLWEE+S M +LG+ R+AK+CKEK+EN+ K
Sbjct: 473  SRWPKVEVQALIKLRTSMDEKYQENGPKGPLWEEISASMKKLGYNRNAKRCKEKWENINK 532

Query: 1726 YHKRTKDGRASK-SDGKTYRFFDQLEAL 1646
            Y K+ K+ + D KT +F QL+AL
Sbjct: 533  YFKKVKESNKRRPEDSKTCPYFHQLDAL 560

>ref|NP_177814.1| putative trihelix DNA-binding protein [Arabidopsis thaliana]
 gi|12322223|gb|AAG51144.1|AC079283_1 GT-like trihelix DNA-binding protein, putative [Arabidopsis thaliana]
 gi|332197777|gb|AEE35898.1| putative trihelix DNA-binding protein [Arabidopsis thaliana]
          Length = 603

 Score = 265 bits (678), Expect = 3e-68
 Identities = 140/261 (53%), Positives = 173/261 (66%), Gaps = 14/261 (5%)
 Frame = -2

Query: 1375 RRGRKRKWKDYLERLMEDVVKRQEELQKRFLDTLEKSERDRIAREEAWRVQEMARMNREH 1196
            R+ RKRKWK + ERLM+ VV +QEELQ++FL+ +EK E +R+ REE+WRVQE+AR+NREH
Sbjct: 247  RKKRKRKWKVFFERLMKQVVDKQEELQRKFLEAVEKREHERLVREESWRVQEIARINREH 306

Query: 1195 DRLAHERSIXXXXXXXXXAFLQKVTGEKSLQI----------PVSNTNNAIQKXXXXXXX 1046
            + LA ERS+ AFLQK++ ++ Q P NN Q+
Sbjct: 307  EILAQERSMSAAKDAAVMAFLQKLSEKQPNQPQPQPQPQQVRPSMQLNNNNQQQPPQRSP 366

Query: 1045 XXXXXXXXXXXXXXXXXXXXXXXXXATTRDNGGDDLLSP----SSSRWPKAEVEALIKLR 878
            T DNGGD ++P SSSRWPK E+EALIKLR
Sbjct: 367  PPQPPAPLPQPIQAVVSTLDT-----TKTDNGGDQNMTPAASASSSRWPKVEIEALIKLR 421

Query: 877  TDLENRYQENGPKGPLWEEISKSMAKIGYNRSSKRCKEKWENINKYFKKVKESNKKRPQD 698
            T+L+++YQENGPKGPLWEEIS M ++G+NR+SKRCKEKWENINKYFKKVKESNKKRP+D
Sbjct: 422  TNLDSKYQENGPKGPLWEEISAGMRRLGFNRNSKRCKEKWENINKYFKKVKESNKKRPED 481

Query: 697  SKTCPYFQQLDAIYRERAKNH 635
            SKTCPYF QLDA+YRER K H
Sbjct: 482  SKTCPYFHQLDALYRERNKFH 502

 Score = 156 bits (395), Expect = 2e-35
 Identities = 73/88 (82%), Positives = 80/88 (90%)
 Frame = -2

Query: 1906 NRWPRQETLALLRIRSDMDATFRDASLKGPLWEEVSRKMGELGFQRSAKKCKEKFENVYK 1727
            NRWPRQETLALL+IRSDM FRDAS+KGPLWEEVSRKM E G+ R+AKKCKEKFENVYK
Sbjct: 60   NRWPRQETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMAEHGYIRNAKKCKEKFENVYK 119

Query: 1726 YHKRTKDGRASKSDGKTYRFFDQLEALD 1643
            YHKRTK+GR KS+GKTYRFFDQLEAL+
Sbjct: 120  YHKRTKEGRTGKSEGKTYRFFDQLEALE 147

 Score = 91.3 bits (225), Expect = 1e-15
 Identities = 40/88 (45%), Positives = 63/88 (71%), Gaps = 1/88 (1%)
 Frame = -2

Query: 1906 NRWPRQETLALLRIRSDMDATFRDASLKGPLWEEVSRKMGELGFQRSAKKCKEKFENVYK 1727
            +RWP+ E AL+++R+++D+ +++ KGPLWEE+S M LGF R++K+CKEK+EN+ K
Sbjct: 407  SRWPKVEIEALIKLRTNLDSKYQENGPKGPLWEEISAGMRRLGFNRNSKRCKEKWENINK 466

Query: 1726 YHKRTKDGRASK-SDGKTYRFFDQLEAL 1646
            Y K+ K+ + D KT +F QL+AL
Sbjct: 467  YFKKVKESNKKRPEDSKTCPYFHQLDAL 494

 Score = 90.5 bits (223), Expect = 2e-15
 Identities = 40/88 (45%), Positives = 63/88 (71%)
 Frame = -2

Query: 922  SRWPKAEVEALIKLRTDLENRYQENGPKGPLWEEISKSMAKIGYNRSSKRCKEKWENINK 743
            +RWP+ E AL+K+R+D+ +++ KGPLWEE+S+ MA+ GY R++K+CKEK+EN+ K
Sbjct: 60   NRWPRQETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMAEHGYIRNAKKCKEKFENVYK 119

Query: 742  YFKKVKESNKKRPQDSKTCPYFQQLDAI 659
            Y K+ KE + + KT +F QL+A+
Sbjct: 120  YHKRTKEGRTGK-SEGKTYRFFDQLEAL 146

>ref|XP_002887660.1| hypothetical protein ARALYDRAFT_895569 [Arabidopsis lyrata subsp. lyrata]
 gi|297333501|gb|EFH63919.1| hypothetical protein ARALYDRAFT_895569 [Arabidopsis lyrata subsp. lyrata]
          Length = 598

 Score = 265 bits (678), Expect = 3e-68
 Identities = 138/264 (52%), Positives = 176/264 (66%), Gaps = 17/264 (6%)
 Frame = -2

Query: 1375 RRGRKRKWKDYLERLMEDVVKRQEELQKRFLDTLEKSERDRIAREEAWRVQEMARMNREH 1196
            R+ RKRKWK++ ERLM+ VV +QEELQ++FL+ +EK E +R+ REE+WRVQE+AR+NREH
Sbjct: 239  RKKRKRKWKEFFERLMKQVVDKQEELQRKFLEAVEKREHERLVREESWRVQEIARINREH 298

Query: 1195 DRLAHERSIXXXXXXXXXAFLQKVTGEKSLQ--------------IPVSNTNNAIQKXXX 1058
            + LA ERS+ AFLQK++ ++ Q + ++N NN Q
Sbjct: 299  EILAQERSMSAAKDAAVMAFLQKLSEKQPNQPTAAQPQPQQVRPQMQLNNNNNQQQTPQP 358

Query: 1057 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXATTRDNGGDDLLSP---SSSRWPKAEVEALI 887
            TT+ + GD ++P SSSRWPK E+EALI
Sbjct: 359  SPPPPPPPLPQAIQAVVPTLD---------TTKTDNGDQNMTPASASSSRWPKVEIEALI 409

Query: 886  KLRTDLENRYQENGPKGPLWEEISKSMAKIGYNRSSKRCKEKWENINKYFKKVKESNKKR 707
            KLRT+L+++YQENGPKGPLWEEIS M ++G+NR+SKRCKEKWENINKYFKKVKESNKKR
Sbjct: 410  KLRTNLDSKYQENGPKGPLWEEISAGMRRLGFNRNSKRCKEKWENINKYFKKVKESNKKR 469

Query: 706  PQDSKTCPYFQQLDAIYRERAKNH 635
            P+DSKTCPYF QLDA+YRER K H
Sbjct: 470  PEDSKTCPYFHQLDALYRERNKFH 493

 Score = 159 bits (402), Expect = 3e-36
 Identities = 74/88 (84%), Positives = 81/88 (92%)
 Frame = -2

Query: 1906 NRWPRQETLALLRIRSDMDATFRDASLKGPLWEEVSRKMGELGFQRSAKKCKEKFENVYK 1727
            NRWPRQETLALL+IRSDM FRDAS+KGPLWEEVSRKM ELG+ R+AKKCKEKFENVYK
Sbjct: 55   NRWPRQETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMAELGYIRNAKKCKEKFENVYK 114

Query: 1726 YHKRTKDGRASKSDGKTYRFFDQLEALD 1643
            YHKRTK+GR KS+GKTYRFFDQLEAL+
Sbjct: 115  YHKRTKEGRTGKSEGKTYRFFDQLEALE 142

 Score = 92.8 bits (229), Expect = 3e-16
 Identities = 42/99 (42%), Positives = 68/99 (68%), Gaps = 1/99 (1%)
 Frame = -2

Query: 922  SRWPKAEVEALIKLRTDLENRYQENGPKGPLWEEISKSMAKIGYNRSSKRCKEKWENINK 743
            +RWP+ E AL+K+R+D+ +++ KGPLWEE+S+ MA++GY R++K+CKEK+EN+ K
Sbjct: 55   NRWPRQETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMAELGYIRNAKKCKEKFENVYK 114

Query: 742  YFKKVKESNKKRPQDSKTCPYFQQLDAIYRERAKN-HQP 629
            Y K+ KE + + KT +F QL+A+ + + H P
Sbjct: 115  YHKRTKEGRTGK-SEGKTYRFFDQLEALESQSTTSLHHP 152

 Score = 91.3 bits (225), Expect = 1e-15
 Identities = 40/88 (45%), Positives = 63/88 (71%), Gaps = 1/88 (1%)
 Frame = -2

Query: 1906 NRWPRQETLALLRIRSDMDATFRDASLKGPLWEEVSRKMGELGFQRSAKKCKEKFENVYK 1727
            +RWP+ E AL+++R+++D+ +++ KGPLWEE+S M LGF R++K+CKEK+EN+ K
Sbjct: 398  SRWPKVEIEALIKLRTNLDSKYQENGPKGPLWEEISAGMRRLGFNRNSKRCKEKWENINK 457

Query: 1726 YHKRTKDGRASK-SDGKTYRFFDQLEAL 1646
            Y K+ K+ + D KT +F QL+AL
Sbjct: 458  YFKKVKESNKKRPEDSKTCPYFHQLDAL 485
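
The per-HSP statistics in this report ("Score = ... bits (...), Expect = ..." followed by "Identities = a/b (p%)") follow a regular pattern, so they can be pulled out with a regular expression. The following is a minimal sketch, not part of the BLAST distribution: the names HSP_RE and parse_hsps are hypothetical, the pattern assumes the legacy 2.2.x text layout seen in this record, and it does not handle variants such as "Expect(2) =". It also recomputes percent identity from the a/b fraction, since the report prints only a whole-number percentage.

```python
import re

# Hypothetical pattern for one HSP statistics block in legacy BLAST text output.
# \s+ is used between fields so the pattern tolerates both the original
# multi-line layout and a whitespace-collapsed copy of it.
HSP_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\de.+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\)"
)


def parse_hsps(text):
    """Return one dict per HSP statistics block found in text."""
    hsps = []
    for m in HSP_RE.finditer(text):
        ident = int(m.group("ident"))
        length = int(m.group("length"))
        hsps.append({
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": float(m.group("evalue")),
            "identities": ident,
            "align_length": length,
            # Recomputed from the fraction rather than read from "(44%)".
            "percent_identity": 100.0 * ident / length,
        })
    return hsps


# Statistics of the first HSP in this record (Vitis vinifera hit).
sample = (
    "Score = 361 bits (926), Expect = 5e-97 "
    "Identities = 194/432 (44%), Positives = 262/432 (60%), "
    "Gaps = 1/432 (0%) Frame = -2"
)
hsps = parse_hsps(sample)
```

For anything beyond a quick extraction, a maintained BLAST parser is likely a safer choice than a hand-written pattern.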