BLASTX nr result
ID: Papaver23_contig00007367
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Papaver23_contig00007367 (2326 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003535696.1| PREDICTED: uncharacterized protein LOC100306... 309 2e-81 ref|XP_002300247.1| predicted protein [Populus trichocarpa] gi|2... 308 4e-81 ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like [Gly... 303 1e-79 ref|NP_188582.1| homeobox protein HAT3.1 [Arabidopsis thaliana] ... 286 2e-74 dbj|BAB02476.1| homeotic protein HAT 3.1 [Arabidopsis thaliana] 286 2e-74 >ref|XP_003535696.1| PREDICTED: uncharacterized protein LOC100306715 [Glycine max] Length = 963 Score = 309 bits (791), Expect = 2e-81 Identities = 189/473 (39%), Positives = 262/473 (55%), Gaps = 13/473 (2%) Frame = +1 Query: 742 VACPTKENSLTINTKENVTGDVQCDTNS------SQTKENGLVTTNSSLEQSILDQK-DE 900 +A PT E + + + G QC+ + S+ EN + L S++++K ++ Sbjct: 144 MASPTGELTSHNDGTTDRMGTEQCELSEKTPQIGSEGLENEQKELGTELTSSVIEEKSNQ 203 Query: 901 TGVGNEQTAAIDLTLPARENGTLQCSSIKKSILDQKQESTAEDVKITSDVTLPTQENS-S 1077 + A I L P + + C +++ S L+Q ST E V T D++ EN Sbjct: 204 VSAIVTENAVIQLPEPLQHDLQKNCQTVEGSCLEQ---STVEQV--TVDLSNDKPENKCK 258 Query: 1078 PLKSSGHEQNILEQKQESGFEAVQHEAIDIASPILEDSLRIKNPSP-KQNLVEQKLEPDN 1254 PL E VQ E ++ ++ + NPS + V + L+ + Sbjct: 259 PLS-----------------ENVQSEPVESIPAVVVEGQMQSNPSQANMSSVNELLDQPS 301 Query: 1255 GDGQSETPSSKFAGHGIGDNELLETPNLSKNVEVSEPKKHRGSPNSKK-KKHAVXXXXXX 1431 GD + S+ ++ +P S++ + +G NSK KK+ + Sbjct: 302 GDAVNNISSNC-------SEKMSNSPTHSQS-------RRKGKKNSKLLKKYMLRSLGSS 347 Query: 1432 XXXXXXXXXDTSKAPDPVGV---SQNINTEVAXXXXXXXXXXXXLDNVFEKTKKRVRYLL 1602 + K P+P N + + N F + + +RYLL Sbjct: 348 DRALRSRTKEKPKEPEPTSNLVDGNNNGVKRKSGRKKKKRKEEGITNQFSRIRSHLRYLL 407 Query: 1603 NKTNYEHSLIEAYSGDGWKGLSAEKVRPEKELQRATAEILRCKLKIRDMFQHIESLCDEG 1782 N+ +YE+SLI+AYSG+GWKG S EK++PEKELQRA +EILR KLKIRD+FQ+++SLC EG Sbjct: 408 NRISYENSLIDAYSGEGWKGYSIEKLKPEKELQRAKSEILRRKLKIRDLFQNLDSLCAEG 467 Query: 1783 KFQESLFDSDGEIDSEDIFCAKCGSKELSTDNDIILCDGFCTRGFHQKCLDPPLLNEEIP 1962 KF ESLFDS GEIDSEDIFCAKC SKELST+NDIILCDG C RGFHQ CLDPP+L E+IP Sbjct: 468 KFPESLFDSAGEIDSEDIFCAKCQSKELSTNNDIILCDGVCDRGFHQLCLDPPMLTEDIP 527 Query: 1963 PGDEGWLCPACDCKADCIDLLNDNLGTDLSIEDNWEKVFPEAATTTAGNKLED 2121 PGDEGWLCP CDCK DC+DL+ND+ GT LSI D WE+VFPEAA + AGN +++ Sbjct: 528 PGDEGWLCPGCDCKDDCMDLVNDSFGTSLSISDTWERVFPEAA-SFAGNNMDN 579 >ref|XP_002300247.1| predicted protein [Populus trichocarpa] gi|222847505|gb|EEE85052.1| predicted protein [Populus trichocarpa] Length = 930 Score = 308 bits (789), Expect = 4e-81 Identities = 195/521 (37%), Positives = 282/521 (54%), Gaps = 45/521 (8%) Frame = +1 Query: 691 EQKQESDFQVVQSEAIDVACPTKEN---SLTINTKENVTGDVQCDTNSSQTKENGLVTTN 861 E++ + + +++Q+EA D ++ + + +++T D + + N Sbjct: 45 EERHKLECEIIQTEAGDNRAAVLQSCSGEVVQPSTDDLTKSPLIDLDPPPDDARSALFDN 104 Query: 862 SSLEQSI-LDQKDETG--------VGNEQTAAID--LTLPARENGTLQCSSI------KK 990 S S +DQK E G V +E + AID + L N + SS + Sbjct: 105 SPRPISTAMDQKLEPGATSVNTACVHSESSKAIDSSILLDEPRNSNTELSSCIANETSQA 164 Query: 991 SILDQKQESTAEDVKIT----SDVTLPTQENSSPLKSSGHEQNI---------LEQKQES 1131 S+ +S AED ++ S+ L + + S +SG + LE++Q+ Sbjct: 165 SLEGLANDSRAEDAGLSLVEASNSDLIDESSYSQQTTSGQTREFHSDRACCKPLEERQKP 224 Query: 1132 GFEAVQHEAIDIASPILEDSLRIKNPSPKQNLVEQKLEPDN-----GDGQSETPSSKFAG 1296 G E ++E+++I L + I+N P LV + + GD S + + Sbjct: 225 GSELAENESMEIGIG-LPSGIAIENLEPLTELVTKSCPIKHIGLPPGDDISIPANEQIRP 283 Query: 1297 HGIGDNELLETPNLSKNVEVSEPKKHRGSPNSKKK-----KHAVXXXXXXXXXXXXXXXD 1461 +++ + +L K + +G P+ K+ K + Sbjct: 284 THDKESKYPDCEHLEKLSGIVIGITSQGVPSVKRTSKLSGKKYTSSSRKSDRVLRSNSQE 343 Query: 1462 TSKAPDPVGVSQNINT--EVAXXXXXXXXXXXXLDNVFEKTKKRVRYLLNKTNYEHSLIE 1635 KAP+P S N+N+ E + + + + + R+RYLLN+ +YE SLI Sbjct: 344 KPKAPEPSNNSTNVNSTGEEKGKRRKKRRGKSIVADEYSRIRARLRYLLNRMSYEQSLIT 403 Query: 1636 AYSGDGWKGLSAEKVRPEKELQRATAEILRCKLKIRDMFQHIESLCDEGKFQESLFDSDG 1815 AYSG+GWKGLS EK++PEKELQRAT+EI+R K+KIRD+FQHI+SLC EG+F SLFDS+G Sbjct: 404 AYSGEGWKGLSLEKLKPEKELQRATSEIIRRKVKIRDLFQHIDSLCGEGRFPASLFDSEG 463 Query: 1816 EIDSEDIFCAKCGSKELSTDNDIILCDGFCTRGFHQKCLDPPLLNEEIPPGDEGWLCPAC 1995 +IDSEDIFCAKCGSK+L+ DNDIILCDG C RGFHQ CL PPLL E+IPPGDEGWLCP C Sbjct: 464 QIDSEDIFCAKCGSKDLTADNDIILCDGACDRGFHQFCLVPPLLREDIPPGDEGWLCPGC 523 Query: 1996 DCKADCIDLLNDNLGTDLSIEDNWEKVFPEAATTTAGNKLE 2118 DCK DCIDLLND+ GT++SI D W+ VFPEAA +G KL+ Sbjct: 524 DCKVDCIDLLNDSQGTNISISDRWDNVFPEAAAVASGQKLD 564 >ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like [Glycine max] Length = 820 Score = 303 bits (776), Expect = 1e-79 Identities = 187/449 (41%), Positives = 257/449 (57%), Gaps = 24/449 (5%) Frame = +1 Query: 847 LVTTNSSLEQSILDQKDETGVGNEQTAAIDLTLPARENGTLQCSSIKKSILDQKQESTAE 1026 L + N S + + ++ E +E+T I RE L + + ++D+K + Sbjct: 8 LTSHNDSTAEPMATEQCEL---SEKTPQIGSEGLEREQKEL-LTELTSFVIDEKSNQVSA 63 Query: 1027 DVKITSDVTLPT------QENSSPLKSSGHEQNILEQKQESGFEAVQHEAIDIASPILED 1188 DV S + LP ++N ++ S EQ+ +EQ ++D+++ E+ Sbjct: 64 DVTENSVIQLPAPPQHDFEKNCQTVEGSCLEQSTVEQV-----------SVDLSNDKSEN 112 Query: 1189 SLRIKNPSPKQNLVEQKLEPDNGDGQSETPSSKFAGHGIGDNELLETP---------NLS 1341 + + + + VE + DGQ + SS + NELL+ P N S Sbjct: 113 KCKPLSENVQSEPVES-IPAFVVDGQMQ--SSPAQANMSSVNELLDQPSGDVVNNITNCS 169 Query: 1342 KNVEVS---EPKKHRGSPNSK--KKKHAVXXXXXXXXXXXXXXXDTSKAPDPVG--VSQN 1500 + + S + +G NSK KKK+ + + K P+P V N Sbjct: 170 EKMSNSPSHSQSRRKGKRNSKLLKKKYMLRSLGSSGRALRSRTKEKPKEPEPTSNLVDGN 229 Query: 1501 INTEVAXXXXXXXXXXXX--LDNVFEKTKKRVRYLLNKTNYEHSLIEAYSGDGWKGLSAE 1674 N V + + F + + +RYLLN+ +YE+SLI+AYSG+GWKG S E Sbjct: 230 SNDGVKRKSGRKKKKRREEGITDQFSRIRSHLRYLLNRISYENSLIDAYSGEGWKGYSME 289 Query: 1675 KVRPEKELQRATAEILRCKLKIRDMFQHIESLCDEGKFQESLFDSDGEIDSEDIFCAKCG 1854 K++PEKELQRA +EILR KLKIRD+F++++SLC EGKF ESLFDS GEIDSEDIFCAKC Sbjct: 290 KLKPEKELQRAKSEILRRKLKIRDLFRNLDSLCAEGKFPESLFDSAGEIDSEDIFCAKCQ 349 Query: 1855 SKELSTDNDIILCDGFCTRGFHQKCLDPPLLNEEIPPGDEGWLCPACDCKADCIDLLNDN 2034 SKELST+NDIILCDG C RGFHQ CLDPPLL E+IPPGDEGWLCP CDCK DC+DL+ND+ Sbjct: 350 SKELSTNNDIILCDGVCDRGFHQLCLDPPLLTEDIPPGDEGWLCPGCDCKDDCMDLVNDS 409 Query: 2035 LGTDLSIEDNWEKVFPEAATTTAGNKLED 2121 GT LSI D WE+VFPEAA + AGN +++ Sbjct: 410 FGTSLSISDTWERVFPEAA-SFAGNNMDN 437 >ref|NP_188582.1| homeobox protein HAT3.1 [Arabidopsis thaliana] gi|148886602|sp|Q04996.3|HAT31_ARATH RecName: Full=Homeobox protein HAT3.1 gi|26449313|dbj|BAC41784.1| putative homeobox protein HAT3.1 [Arabidopsis thaliana] gi|29029042|gb|AAO64900.1| At3g19510 [Arabidopsis thaliana] gi|332642729|gb|AEE76250.1| homeobox protein HAT3.1 [Arabidopsis thaliana] Length = 723 Score = 286 bits (732), Expect = 2e-74 Identities = 128/183 (69%), Positives = 151/183 (82%) Frame = +1 Query: 1558 DNVFEKTKKRVRYLLNKTNYEHSLIEAYSGDGWKGLSAEKVRPEKELQRATAEILRCKLK 1737 D+ + + KK++RY LN+ NYE SLI+AYS +GWKG S EK+RPEKEL+RAT EILR KLK Sbjct: 174 DDEYTRIKKKLRYFLNRINYEQSLIDAYSLEGWKGSSLEKIRPEKELERATKEILRRKLK 233 Query: 1738 IRDMFQHIESLCDEGKFQESLFDSDGEIDSEDIFCAKCGSKELSTDNDIILCDGFCTRGF 1917 IRD+FQH+++LC EG ESLFD+DGEI SEDIFCAKCGSK+LS DNDIILCDGFC RGF Sbjct: 234 IRDLFQHLDTLCAEGSLPESLFDTDGEISSEDIFCAKCGSKDLSVDNDIILCDGFCDRGF 293 Query: 1918 HQKCLDPPLLNEEIPPGDEGWLCPACDCKADCIDLLNDNLGTDLSIEDNWEKVFPEAATT 2097 HQ CL+PPL E+IPP DEGWLCP CDCK D +DLLND+LGT S+ D+WEK+FPEAA Sbjct: 294 HQYCLEPPLRKEDIPPDDEGWLCPGCDCKDDSLDLLNDSLGTKFSVSDSWEKIFPEAAAA 353 Query: 2098 TAG 2106 G Sbjct: 354 LVG 356 >dbj|BAB02476.1| homeotic protein HAT 3.1 [Arabidopsis thaliana] Length = 661 Score = 286 bits (732), Expect = 2e-74 Identities = 128/183 (69%), Positives = 151/183 (82%) Frame = +1 Query: 1558 DNVFEKTKKRVRYLLNKTNYEHSLIEAYSGDGWKGLSAEKVRPEKELQRATAEILRCKLK 1737 D+ + + KK++RY LN+ NYE SLI+AYS +GWKG S EK+RPEKEL+RAT EILR KLK Sbjct: 112 DDEYTRIKKKLRYFLNRINYEQSLIDAYSLEGWKGSSLEKIRPEKELERATKEILRRKLK 171 Query: 1738 IRDMFQHIESLCDEGKFQESLFDSDGEIDSEDIFCAKCGSKELSTDNDIILCDGFCTRGF 1917 IRD+FQH+++LC EG ESLFD+DGEI SEDIFCAKCGSK+LS DNDIILCDGFC RGF Sbjct: 172 IRDLFQHLDTLCAEGSLPESLFDTDGEISSEDIFCAKCGSKDLSVDNDIILCDGFCDRGF 231 Query: 1918 HQKCLDPPLLNEEIPPGDEGWLCPACDCKADCIDLLNDNLGTDLSIEDNWEKVFPEAATT 2097 HQ CL+PPL E+IPP DEGWLCP CDCK D +DLLND+LGT S+ D+WEK+FPEAA Sbjct: 232 HQYCLEPPLRKEDIPPDDEGWLCPGCDCKDDSLDLLNDSLGTKFSVSDSWEKIFPEAAAA 291 Query: 2098 TAG 2106 G Sbjct: 292 LVG 294