BLASTX nr result
ID: Salvia21_contig00014313
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Salvia21_contig00014313
(2725 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

Sequences producing significant alignments:                          Score (bits)  E Value
emb|CBI22504.3| unnamed protein product [Vitis vinifera]                      442    e-121
ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vit...           442    e-121
ref|XP_003535696.1| PREDICTED: uncharacterized protein LOC100306...           397    e-108
ref|XP_002300247.1| predicted protein [Populus trichocarpa] gi|2...           396    e-107
ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like [Gly...           395    e-107

>emb|CBI22504.3| unnamed protein product [Vitis vinifera]
Length = 977

Score = 442 bits (1138), Expect = e-121
Identities = 263/607 (43%), Positives = 332/607 (54%), Gaps = 30/607 (4%)
Frame = +3

Query: 84 VRRNAKLEGPXXXXXXXXXXXQEKTKAPEPVENVNEGSAIGEKKKRGRKPKNMQKTTVSE 263
V+R KL QEK KA +P +N SA E+K GRK K M KTT E
Sbjct: 162 VKRKYKLRSSVSGSRVLRSRSQEKPKASQPSDNFVNASASRERK--GRKKKRMNKTTADE 219

Query: 264 FSSKKTHLRYLMHRINYEQNLIDAYSAEGWRGQSXXXXXXXXXXXXXXSRILHYKLKIRA 443
F+ + HLRYL++R++YEQNLIDAYSAEGW+GQS S I KL+IR
Sbjct: 220 FARIRKHLRYLLNRMSYEQNLIDAYSAEGWKGQSVEKLKPEKELQRASSEISRRKLQIRD 279

Query: 444 LFQSLDQSLAMGKLPESLFDAQGEIDSEDIFCAKCGSKDLTLDNDIILCDGACERGFHQF 623
LFQ LD A G+ PESLFD++G+IDSEDIFCAKC SKD++ DNDIILCDGAC+RGFHQF
Sbjct: 280 LFQHLDSLCAEGRFPESLFDSEGQIDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQF 339

Query: 624 CLEPPLLKTDIPPGDESWLCPGCDCKADCIDMLKDFQETKISIVDSWEKIFPEAAAAAXX 803
CLEPPLLK +IPP DE WLCP CDCK DC+D+L D Q TK+S++DSWEK+FPEAAAA
Sbjct: 340 CLEPPLLKEEIPPDDEGWLCPACDCKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAAAG-- 397

Query: 804 XXXXXXXXXXXXXXXXXXXXXXXXXXXEKVEGNKXXXXXXXXXXXXXXLDASN------- 962
EK +G+K D S+
Sbjct: 398 NNQDNNSGFSSDDSEDNDYDPDCPEVDEKGQGDKSSSDKFDESDEFDESDESDFTSASDD 457

Query: 963 -----NNEKYLGLPSDDSEDDDFNPSATNRDNQVKQGXXXXXXXXXXXXLRALIEDETAT 1127
NNE+ LGLPSDDSEDDDF+P A D QV QG + D T+
Sbjct: 458 MVVSPNNEQCLGLPSDDSEDDDFDPDAPEIDEQVNQG--------------SSSSDFTSD 503

Query: 1128 SEDPWQTSSSAHHKQKSEGFGEEISNLGRKKRQSLKDELSYLMEASA----EPLSRKRHV 1295
SED T + +G E+ GRKK+ +LKDEL ++E+++ PLS KRHV
Sbjct: 504 SEDFTATLDRRNFSDNEDGLDEQ-RRFGRKKKDTLKDELLSVLESNSGQDNAPLSAKRHV 562

Query: 1296 ERLDYKKLNDETYGN--XXXXXXXXXXXXTITRKRTKSDRD--EVKFSDETLVTPSNTHK 1463
ERLDYKKL+DE YGN I RKR + V + T +T + T+
Sbjct: 563 ERLDYKKLHDEAYGNVSSDSSDDEDWTENVIPRKRKNLSGNVASVSPNGNTSITENGTNT 622

Query: 1464 ED-ENQIEKKH-FPKRTRRNRSDGHTTEXXXXXXXXXXXXXXXT--------HKRLGEAT 1613
+D ++ +E PKR R + + +T T +K+LGEA
Sbjct: 623 KDIKHDLEAAGCTPKRRTRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAV 682

Query: 1614 TQRLLAYFNLDQYPDRVVKENLAKELGLAVRQVGKWFENARWSYNHRPRVESDSSHQNAE 1793
T+RL F +QYPDR +KE LA+ELG+ RQV KWFENARWS+ HRP E+ + +
Sbjct: 683 TERLYKSFQENQYPDRAMKEKLAEELGITSRQVSKWFENARWSFRHRPPKEASAGKSAVK 742

Query: 1794 ESSAISAQ 1814
A ++Q
Sbjct: 743 KDASTSQ 749

>ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vitis vinifera]
Length = 968

Score = 442 bits (1138), Expect = e-121
Identities = 263/607 (43%), Positives = 332/607 (54%), Gaps = 30/607 (4%)
Frame = +3

Query: 84 VRRNAKLEGPXXXXXXXXXXXQEKTKAPEPVENVNEGSAIGEKKKRGRKPKNMQKTTVSE 263
V+R KL QEK KA +P +N SA E+K GRK K M KTT E
Sbjct: 162 VKRKYKLRSSVSGSRVLRSRSQEKPKASQPSDNFVNASASRERK--GRKKKRMNKTTADE 219

Query: 264 FSSKKTHLRYLMHRINYEQNLIDAYSAEGWRGQSXXXXXXXXXXXXXXSRILHYKLKIRA 443
F+ + HLRYL++R++YEQNLIDAYSAEGW+GQS S I KL+IR
Sbjct: 220 FARIRKHLRYLLNRMSYEQNLIDAYSAEGWKGQSVEKLKPEKELQRASSEISRRKLQIRD 279

Query: 444 LFQSLDQSLAMGKLPESLFDAQGEIDSEDIFCAKCGSKDLTLDNDIILCDGACERGFHQF 623
LFQ LD A G+ PESLFD++G+IDSEDIFCAKC SKD++ DNDIILCDGAC+RGFHQF
Sbjct: 280 LFQHLDSLCAEGRFPESLFDSEGQIDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQF 339

Query: 624 CLEPPLLKTDIPPGDESWLCPGCDCKADCIDMLKDFQETKISIVDSWEKIFPEAAAAAXX 803
CLEPPLLK +IPP DE WLCP CDCK DC+D+L D Q TK+S++DSWEK+FPEAAAA
Sbjct: 340 CLEPPLLKEEIPPDDEGWLCPACDCKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAAAG-- 397

Query: 804 XXXXXXXXXXXXXXXXXXXXXXXXXXXEKVEGNKXXXXXXXXXXXXXXLDASN------- 962
EK +G+K D S+
Sbjct: 398 NNQDNNSGFSSDDSEDNDYDPDCPEVDEKGQGDKSSSDKFDESDEFDESDESDFTSASDD 457

Query: 963 -----NNEKYLGLPSDDSEDDDFNPSATNRDNQVKQGXXXXXXXXXXXXLRALIEDETAT 1127
NNE+ LGLPSDDSEDDDF+P A D QV QG + D T+
Sbjct: 458 MVVSPNNEQCLGLPSDDSEDDDFDPDAPEIDEQVNQG--------------SSSSDFTSD 503

Query: 1128 SEDPWQTSSSAHHKQKSEGFGEEISNLGRKKRQSLKDELSYLMEASA----EPLSRKRHV 1295
SED T + +G E+ GRKK+ +LKDEL ++E+++ PLS KRHV
Sbjct: 504 SEDFTATLDRRNFSDNEDGLDEQ-RRFGRKKKDTLKDELLSVLESNSGQDNAPLSAKRHV 562

Query: 1296 ERLDYKKLNDETYGN--XXXXXXXXXXXXTITRKRTKSDRD--EVKFSDETLVTPSNTHK 1463
ERLDYKKL+DE YGN I RKR + V + T +T + T+
Sbjct: 563 ERLDYKKLHDEAYGNVSSDSSDDEDWTENVIPRKRKNLSGNVASVSPNGNTSITENGTNT 622

Query: 1464 ED-ENQIEKKH-FPKRTRRNRSDGHTTEXXXXXXXXXXXXXXXT--------HKRLGEAT 1613
+D ++ +E PKR R + + +T T +K+LGEA
Sbjct: 623 KDIKHDLEAAGCTPKRRTRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAV 682

Query: 1614 TQRLLAYFNLDQYPDRVVKENLAKELGLAVRQVGKWFENARWSYNHRPRVESDSSHQNAE 1793
T+RL F +QYPDR +KE LA+ELG+ RQV KWFENARWS+ HRP E+ + +
Sbjct: 683 TERLYKSFQENQYPDRAMKEKLAEELGITSRQVSKWFENARWSFRHRPPKEASAGKSAVK 742

Query: 1794 ESSAISAQ 1814
A ++Q
Sbjct: 743 KDASTSQ 749

>ref|XP_003535696.1| PREDICTED: uncharacterized protein LOC100306715 [Glycine max]
Length = 963

Score = 397 bits (1021), Expect = e-108
Identities = 238/599 (39%), Positives = 319/599 (53%), Gaps = 14/599 (2%)
Frame = +3

Query: 147 QEKTKAPEPVENVNEGSAIGEKKKRGRKPKNMQKTTVS-EFSSKKTHLRYLMHRINYEQN 323
+EK K PEP N+ +G+ G K+K GRK K ++ ++ +FS ++HLRYL++RI+YE +
Sbjct: 356 KEKPKEPEPTSNLVDGNNNGVKRKSGRKKKKRKEEGITNQFSRIRSHLRYLLNRISYENS 415

Query: 324 LIDAYSAEGWRGQSXXXXXXXXXXXXXXSRILHYKLKIRALFQSLDQSLAMGKLPESLFD 503
LIDAYS EGW+G S S IL KLKIR LFQ+LD A GK PESLFD
Sbjct: 416 LIDAYSGEGWKGYSIEKLKPEKELQRAKSEILRRKLKIRDLFQNLDSLCAEGKFPESLFD 475

Query: 504 AQGEIDSEDIFCAKCGSKDLTLDNDIILCDGACERGFHQFCLEPPLLKTDIPPGDESWLC 683
+ GEIDSEDIFCAKC SK+L+ +NDIILCDG C+RGFHQ CL+PP+L DIPPGDE WLC
Sbjct: 476 SAGEIDSEDIFCAKCQSKELSTNNDIILCDGVCDRGFHQLCLDPPMLTEDIPPGDEGWLC 535

Query: 684 PGCDCKADCIDMLKDFQETKISIVDSWEKIFPEAAAAAXXXXXXXXXXXXXXXXXXXXXX 863
PGCDCK DC+D++ D T +SI D+WE++FPEAA+ A
Sbjct: 536 PGCDCKDDCMDLVNDSFGTSLSISDTWERVFPEAASFAGNNMDNNSGVPSDDSDDDDYNP 595

Query: 864 XXXXXXXEKVEGNKXXXXXXXXXXXXXXLDASNNNEKYLGLPSDDSEDDDFNPSATNRDN 1043
KVEG++ L+ ++ ++YLGLPS+DS+D D++P A + +
Sbjct: 596 NGPDDV--KVEGDESSSDESEYASASEKLEGGSHEDQYLGLPSEDSDDGDYDPDAPDVEC 653

Query: 1044 QVKQGXXXXXXXXXXXXLRALIEDETATSEDPWQTSSSAHHKQKSEGFGEEISNLGRKKR 1223
+V + L A IED T+ +D +SS K K+
Sbjct: 654 KVNEESSSSDFTSDSEDLAAAIEDNTSPGQDGGISSSKKKGKVG--------------KK 699

Query: 1224 QSLKDELSYLMEASA-----EPLSRKRHVERLDYKKLNDETYGNXXXXXXXXXXXXTITR 1388
SL DELS L+E + P+S KRHVERLDYKKL +ETY +
Sbjct: 700 LSLPDELSSLLEPDSGQEAPTPVSGKRHVERLDYKKLYEETYHSDTSDDEDWNDTAAPSG 759

Query: 1389 KRTKSDRDEVKFSDETLVTPSNTHKEDENQIEKKHFPKRTRR-------NRSDGHTTEXX 1547
K+ + VTP + + N H PKR N S + E
Sbjct: 760 KKKLTGN----------VTPVSPNGNASNN--SIHTPKRNAHQNNVENTNNSPTKSLEGC 807

Query: 1548 XXXXXXXXXXXXXTHKRLGEATTQRLLAYFNLDQYPDRVVKENLAKELGLAVRQVGKWFE 1727
HKRLGEA QRL F +QYPDR KE+LA+ELGL +QV KWF
Sbjct: 808 SKSGSRDKKSGSSAHKRLGEAVVQRLHKSFKENQYPDRTTKESLAQELGLTYQQVAKWFG 867

Query: 1728 NARWSYNHRPRVESDSSHQNAESSAISAQNHIAKLRGGDLVT-DTSGDSPGTPATKKRK 1901
N RWS+ H ++E++S NA + + +L++ + SG+ TP ++KRK
Sbjct: 868 NTRWSFRHSSQMETNSG-INASQQVTDGRAENEGEKECELISLEFSGEKSKTPNSRKRK 925

>ref|XP_002300247.1| predicted protein [Populus trichocarpa] gi|222847505|gb|EEE85052.1| predicted protein [Populus trichocarpa]
Length = 930

Score = 396 bits (1018), Expect = e-107
Identities = 229/568 (40%), Positives = 300/568 (52%), Gaps = 16/568 (2%)
Frame = +3

Query: 147 QEKTKAPEPVENVNEGSAIGEKKKRGRKPKNMQKTTVSEFSSKKTHLRYLMHRINYEQNL 326
QEK KAPEP N ++ GE+K + RK + + E+S + LRYL++R++YEQ+L
Sbjct: 342 QEKPKAPEPSNNSTNVNSTGEEKGKRRKKRRGKSIVADEYSRIRARLRYLLNRMSYEQSL 401

Query: 327 IDAYSAEGWRGQSXXXXXXXXXXXXXXSRILHYKLKIRALFQSLDQSLAMGKLPESLFDA 506
I AYS EGW+G S S I+ K+KIR LFQ +D G+ P SLFD+
Sbjct: 402 ITAYSGEGWKGLSLEKLKPEKELQRATSEIIRRKVKIRDLFQHIDSLCGEGRFPASLFDS 461

Query: 507 QGEIDSEDIFCAKCGSKDLTLDNDIILCDGACERGFHQFCLEPPLLKTDIPPGDESWLCP 686
+G+IDSEDIFCAKCGSKDLT DNDIILCDGAC+RGFHQFCL PPLL+ DIPPGDE WLCP
Sbjct: 462 EGQIDSEDIFCAKCGSKDLTADNDIILCDGACDRGFHQFCLVPPLLREDIPPGDEGWLCP 521

Query: 687 GCDCKADCIDMLKDFQETKISIVDSWEKIFPEAAAAAXXXXXXXXXXXXXXXXXXXXXXX 866
GCDCK DCID+L D Q T ISI D W+ +FPEAAA A
Sbjct: 522 GCDCKVDCIDLLNDSQGTNISISDRWDNVFPEAAAVASGQKLDYNFGLSSDDSDDNDYDP 581

Query: 867 XXXXXXEKVEGNKXXXXXXXXXXXXXXLDASNNNEKYLGLPSDDSEDDDFNPSATNRDNQ 1046
EK + + +A ++++YLGLPSDDSEDDD++P A + +
Sbjct: 582 DGPDIDEKSQ-EESSSDESDFSSASDEFEAPPDDKQYLGLPSDDSEDDDYDPDAPVLEEK 640

Query: 1047 VKQGXXXXXXXXXXXXLRALIEDETATSEDPWQTSSSAHHKQKSEGFGEEISNLGRKKRQ 1226
+KQ L A + + + D + H E S G KK
Sbjct: 641 LKQESSSSDFTSDSEDLDATLNGDGLSLGDEYHMPIEPH-----EDSNGRRSRFGGKKNH 695

Query: 1227 SLKDELSYLMEASAE-----PLSRKRHVERLDYKKLNDETYGNXXXXXXXXXXXXTITRK 1391
SL +L ++E + P+S KR++ERLDYKKL DETYGN RK
Sbjct: 696 SLNSKLLSMLEPDSHQEKSAPVSGKRNIERLDYKKLYDETYGNICTSSDDDFTDTVAPRK 755

Query: 1392 RTKSDRDEVK--FSDETLVTPSNTHKEDENQIEKK--HFPKRTRRNR-------SDGHTT 1538
R K+ D + + VT + + ++ NQ KK H RT +N S T
Sbjct: 756 RRKNTGDVAMGIANGDASVTENGLNSKNMNQELKKNEHTSGRTHQNSSFQDTNVSPAKTH 815

Query: 1539 EXXXXXXXXXXXXXXXTHKRLGEATTQRLLAYFNLDQYPDRVVKENLAKELGLAVRQVGK 1718
+K+LGEA TQ+L ++F ++YPD+ K +LA+ELG+ QV K
Sbjct: 816 VGESLSGSSSKRVRPSAYKKLGEAVTQKLYSFFKENRYPDQAAKASLAEELGITFEQVNK 875

Query: 1719 WFENARWSYNHRPRVESDSSHQNAESSA 1802
WF NARWS+NH S AES++
Sbjct: 876 WFMNARWSFNH----SSPEGTSKAESAS 899

>ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like [Glycine max]
Length = 820

Score = 395 bits (1016), Expect = e-107
Identities = 242/641 (37%), Positives = 338/641 (52%), Gaps = 17/641 (2%)
Frame = +3

Query: 30 TNDSVLLENDGIGS-SRQRVRRNAKL-------EGPXXXXXXXXXXXQEKTKAPEPVENV 185
TN S + N S SR++ +RN+KL +EK K PEP N+
Sbjct: 166 TNCSEKMSNSPSHSQSRRKGKRNSKLLKKKYMLRSLGSSGRALRSRTKEKPKEPEPTSNL 225

Query: 186 NEGSAI-GEKKKRGRKPKNMQKTTVSE-FSSKKTHLRYLMHRINYEQNLIDAYSAEGWRG 359
+G++ G K+K GRK K ++ +++ FS ++HLRYL++RI+YE +LIDAYS EGW+G
Sbjct: 226 VDGNSNDGVKRKSGRKKKKRREEGITDQFSRIRSHLRYLLNRISYENSLIDAYSGEGWKG 285

Query: 360 QSXXXXXXXXXXXXXXSRILHYKLKIRALFQSLDQSLAMGKLPESLFDAQGEIDSEDIFC 539
S S IL KLKIR LF++LD A GK PESLFD+ GEIDSEDIFC
Sbjct: 286 YSMEKLKPEKELQRAKSEILRRKLKIRDLFRNLDSLCAEGKFPESLFDSAGEIDSEDIFC 345

Query: 540 AKCGSKDLTLDNDIILCDGACERGFHQFCLEPPLLKTDIPPGDESWLCPGCDCKADCIDM 719
AKC SK+L+ +NDIILCDG C+RGFHQ CL+PPLL DIPPGDE WLCPGCDCK DC+D+
Sbjct: 346 AKCQSKELSTNNDIILCDGVCDRGFHQLCLDPPLLTEDIPPGDEGWLCPGCDCKDDCMDL 405

Query: 720 LKDFQETKISIVDSWEKIFPEAAAAAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEKVEG 899
+ D T +SI D+WE++FPEAA+ A K+EG
Sbjct: 406 VNDSFGTSLSISDTWERVFPEAASFAGNNMDNNLGLPSDDSDDDDYNPNGSDDV--KIEG 463

Query: 900 NKXXXXXXXXXXXXXXLDASNNNEKYLGLPSDDSEDDDFNPSATNRDNQVKQGXXXXXXX 1079
++ L+ ++ ++YLGLPS+DS+D D++P A + D +V +
Sbjct: 464 DESSSDESEYASASEKLEGGSHEDQYLGLPSEDSDDGDYDPDAPDVDCKVNEESSSSDFT 523

Query: 1080 XXXXXLRALIEDETATSEDPWQTSSSAHHKQKSEGFGEEISNLGRKKRQSLKDELSYLME 1259
L A ED T+ +D SS K+K G+ + S+ DELS L+E
Sbjct: 524 SDSEDLAAAFEDNTSPGQDGGINSS----KKK-----------GKVGKLSMADELSSLLE 568

Query: 1260 ASA-----EPLSRKRHVERLDYKKLNDETYGNXXXXXXXXXXXXTITRKRTKSDRDEVKF 1424
+ P+S KRHVERLDYKKL +ETY + +RK+ +
Sbjct: 569 PDSGQGGPTPVSGKRHVERLDYKKLYEETYHSDTSDDEDWNDAAAPSRKKKLT------- 621

Query: 1425 SDETLVTPSNTHKEDENQIEKK--HFPKRTRRNRSDGHTTEXXXXXXXXXXXXXXXTHKR 1598
+ T V+P+ + K+ H K N S + + HKR
Sbjct: 622 GNVTPVSPNANASNNSIHTLKRNAHQNKVENTNSSPTKSLDGRSKSGSRDKRSGSSAHKR 681

Query: 1599 LGEATTQRLLAYFNLDQYPDRVVKENLAKELGLAVRQVGKWFENARWSYNHRPRVESDSS 1778
LGEA QRL F +QYPDR KE+LA+ELGL +QV KWF+N RWS+ H ++E++S
Sbjct: 682 LGEAVVQRLHKSFKENQYPDRSTKESLAQELGLTYQQVAKWFDNTRWSFRHSSQMETNSG 741

Query: 1779 HQNAESSAISAQNHIAKLRGGDLVTDTSGDSPGTPATKKRK 1901
+ + + + + + + SG + T +++KRK
Sbjct: 742 RNASPEATDGRAENEGEKQCESMSPEVSGKNSKTTSSRKRK 782
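
The per-hit statistics in this report (bit score, E-value, identities, positives, gaps) can be pulled out programmatically. The following is only a minimal sketch, assuming the legacy BLAST 2.2.x text layout seen in this record; the names HSP_STATS, parse_evalue and parse_hsp_stats are hypothetical helpers introduced for illustration and are not part of any BLAST tool. One detail from the report itself: very small E-values are printed without a mantissa ("e-121"), so a leading "1" has to be added before converting to a number.

```python
import re

# Statistics header of one hit, as printed by legacy BLAST 2.2.x (assumed layout).
HSP_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"
)


def parse_evalue(text):
    """Convert an E-value string; 'e-121' is read as 1e-121."""
    return float("1" + text if text.startswith("e") else text)


def parse_hsp_stats(block):
    """Return the statistics of the first hit header found in `block`."""
    m = HSP_STATS.search(block)
    if m is None:
        raise ValueError("no hit statistics found")
    length = int(m["length"])
    ident = int(m["ident"])
    pos = int(m["pos"])
    gaps = int(m["gaps"]) if m["gaps"] is not None else 0
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": parse_evalue(m["evalue"]),
        "identities": ident,
        "positives": pos,
        "gaps": gaps,
        "align_length": length,
        # Recomputed from the counts, not taken from the printed percentage.
        "pct_identity": 100.0 * ident / length,
        "pct_positive": 100.0 * pos / length,
    }


# Statistics of the top hit (emb|CBI22504.3|) copied from the report above.
example = (
    "Score = 442 bits (1138), Expect = e-121 "
    "Identities = 263/607 (43%), Positives = 332/607 (54%), "
    "Gaps = 30/607 (4%) Frame = +3"
)
stats = parse_hsp_stats(example)
print(stats["evalue"], round(stats["pct_identity"], 1))
```

The percentages are recomputed from the raw counts because the printed values are whole numbers; for instance 332/607 is about 54.7% but appears as 54% in the report.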