BLASTX nr result
ID: Perilla23_contig00019073
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00019073 (676 letters)

Database: ./nr
77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

ref|XP_011094382.1| PREDICTED: uncharacterized protein LOC105174... 307 4e-81
ref|XP_011094381.1| PREDICTED: uncharacterized protein LOC105174... 307 4e-81
ref|XP_012840132.1| PREDICTED: uncharacterized protein LOC105960... 291 2e-76
gb|EYU35152.1| hypothetical protein MIMGU_mgv1a001852mg [Erythra... 291 2e-76
ref|XP_012492709.1| PREDICTED: uncharacterized protein LOC105804... 221 4e-55
ref|XP_012492702.1| PREDICTED: uncharacterized protein LOC105804... 221 4e-55
ref|XP_011094383.1| PREDICTED: uncharacterized protein LOC105174... 218 2e-54
emb|CDP15391.1| unnamed protein product [Coffea canephora] 211 4e-52
ref|XP_012492696.1| PREDICTED: uncharacterized protein LOC105804... 210 6e-52
ref|XP_012492689.1| PREDICTED: uncharacterized protein LOC105804... 210 6e-52
ref|XP_012492674.1| PREDICTED: uncharacterized protein LOC105804... 210 6e-52
ref|XP_002311825.2| hypothetical protein POPTR_0008s20540g [Popu... 209 1e-51
gb|KDO61089.1| hypothetical protein CISIN_1g008363mg [Citrus sin... 207 4e-51
ref|XP_006481815.1| PREDICTED: uncharacterized protein LOC102609... 207 4e-51
ref|XP_006481813.1| PREDICTED: uncharacterized protein LOC102609... 207 4e-51
gb|KHG10240.1| General transcription factor 3C polypeptide 2 [Go... 207 5e-51
ref|XP_007049744.1| DNA binding protein, putative isoform 1 [The... 204 4e-50
ref|XP_011010279.1| PREDICTED: uncharacterized protein LOC105115... 196 8e-48
ref|XP_011010277.1| PREDICTED: uncharacterized protein LOC105115... 
196 8e-48 ref|XP_013733163.1| PREDICTED: uncharacterized protein LOC106436... 195 2e-47 >ref|XP_011094382.1| PREDICTED: uncharacterized protein LOC105174093 isoform X2 [Sesamum indicum] Length = 870 Score = 307 bits (786), Expect = 4e-81 Identities = 155/289 (53%), Positives = 185/289 (64%), Gaps = 64/289 (22%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLWVHPFQGITYGLDWLPDPRCIFGSM 496 AWAP+Q LE AN+IIT G KGFK WDIRDPF PLW H G+TYGL+WLPDPRC+FGS+ Sbjct: 557 AWAPVQGELESANVIITTGPKGFKVWDIRDPFRPLWDHHIPGVTYGLEWLPDPRCVFGSI 616 Query: 495 EDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGEE 316 +DGT+WLLSLERA+HDIPVTGK A KHGFHSF CSSF+IW + ASRLTGMVAYCGEE Sbjct: 617 DDGTLWLLSLERAAHDIPVTGKSITAAPKHGFHSFDCSSFSIWCIHASRLTGMVAYCGEE 676 Query: 315 GSTICFKPTTRSLKDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPGMKRSRG- 139 G+T CF+PTTRS++DPSRNR+ HYLCGS+L+EE A+I+ASP S Q +SPGMKRS G Sbjct: 677 GTTFCFQPTTRSVRDPSRNRLHHYLCGSLLEEETALIVASPSTSSFLQKRSPGMKRSGGA 736 Query: 138 KEEERRAKEQSNRQVT---------AWKEERDE----------EMEAVPPK--------- 43 K++E+R KEQ + VT W + +E + +A PK Sbjct: 737 KDQEKRVKEQMAKSVTCNEPPTPAICWSDHVEEHGSDKSSMVIKKQASKPKESSKTQSQA 796 Query: 42 -----------------------------------IVAMHRVRWNVNRG 1 IVAMHRVRWNVN+G Sbjct: 797 NQETVLCRSEDAGQLQREGSGKEEKGDTVEVFPPKIVAMHRVRWNVNKG 845 >ref|XP_011094381.1| PREDICTED: uncharacterized protein LOC105174093 isoform X1 [Sesamum indicum] Length = 874 Score = 307 bits (786), Expect = 4e-81 Identities = 155/289 (53%), Positives = 185/289 (64%), Gaps = 64/289 (22%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLWVHPFQGITYGLDWLPDPRCIFGSM 496 AWAP+Q LE AN+IIT G KGFK WDIRDPF PLW H G+TYGL+WLPDPRC+FGS+ Sbjct: 561 AWAPVQGELESANVIITTGPKGFKVWDIRDPFRPLWDHHIPGVTYGLEWLPDPRCVFGSI 620 Query: 495 EDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGEE 316 +DGT+WLLSLERA+HDIPVTGK A KHGFHSF CSSF+IW + ASRLTGMVAYCGEE Sbjct: 621 DDGTLWLLSLERAAHDIPVTGKSITAAPKHGFHSFDCSSFSIWCIHASRLTGMVAYCGEE 680 Query: 315 
GSTICFKPTTRSLKDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPGMKRSRG- 139 G+T CF+PTTRS++DPSRNR+ HYLCGS+L+EE A+I+ASP S Q +SPGMKRS G Sbjct: 681 GTTFCFQPTTRSVRDPSRNRLHHYLCGSLLEEETALIVASPSTSSFLQKRSPGMKRSGGA 740 Query: 138 KEEERRAKEQSNRQVT---------AWKEERDE----------EMEAVPPK--------- 43 K++E+R KEQ + VT W + +E + +A PK Sbjct: 741 KDQEKRVKEQMAKSVTCNEPPTPAICWSDHVEEHGSDKSSMVIKKQASKPKESSKTQSQA 800 Query: 42 -----------------------------------IVAMHRVRWNVNRG 1 IVAMHRVRWNVN+G Sbjct: 801 NQETVLCRSEDAGQLQREGSGKEEKGDTVEVFPPKIVAMHRVRWNVNKG 849 >ref|XP_012840132.1| PREDICTED: uncharacterized protein LOC105960491 [Erythranthe guttatus] Length = 808 Score = 291 bits (746), Expect = 2e-76 Identities = 138/249 (55%), Positives = 174/249 (69%), Gaps = 24/249 (9%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLWVHPFQGITYGLDWLPDPRCIFGSM 496 AWAP Q++LE AN+I+TAG KGFKFWDIRDPF PLW H QG+TYGL WL DPRC+FGS+ Sbjct: 535 AWAPNQTDLESANVIVTAGHKGFKFWDIRDPFRPLWDHAMQGVTYGLSWLRDPRCVFGSV 594 Query: 495 EDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGEE 316 +DGT+W LE + DIP+TGKC A+K GFHSF CSSF+IW++QAS LTG+VAYCGE Sbjct: 595 DDGTLWFHRLENTASDIPITGKCVAAATKQGFHSFDCSSFSIWNVQASPLTGVVAYCGEA 654 Query: 315 GSTICFKPTTRSLKDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPGMKRSRG- 139 G+T+CF+PT RS+KDPSRNR H LCGS+L+EE A+I+A+P +S + PGMKRS G Sbjct: 655 GTTLCFQPTARSVKDPSRNRRTHLLCGSLLEEEDALIVATPSTSTSHSRRYPGMKRSGGA 714 Query: 138 KEEERRAKEQSNRQ---VTAWKEE--------------------RDEEMEAVPPKIVAMH 28 K+ E++ KEQ N + W+ + D + E P K VA+H Sbjct: 715 KDLEKKFKEQINNEQPLAICWRGDLEETKKQEPKSKETNKDQLKNDNKREVFPGKNVAIH 774 Query: 27 RVRWNVNRG 1 RVRWN N+G Sbjct: 775 RVRWNANKG 783 >gb|EYU35152.1| hypothetical protein MIMGU_mgv1a001852mg [Erythranthe guttata] Length = 749 Score = 291 bits (746), Expect = 2e-76 Identities = 138/249 (55%), Positives = 174/249 (69%), Gaps = 24/249 (9%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLWVHPFQGITYGLDWLPDPRCIFGSM 496 AWAP Q++LE AN+I+TAG KGFKFWDIRDPF PLW H QG+TYGL WL DPRC+FGS+ Sbjct: 476 
AWAPNQTDLESANVIVTAGHKGFKFWDIRDPFRPLWDHAMQGVTYGLSWLRDPRCVFGSV 535 Query: 495 EDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGEE 316 +DGT+W LE + DIP+TGKC A+K GFHSF CSSF+IW++QAS LTG+VAYCGE Sbjct: 536 DDGTLWFHRLENTASDIPITGKCVAAATKQGFHSFDCSSFSIWNVQASPLTGVVAYCGEA 595 Query: 315 GSTICFKPTTRSLKDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPGMKRSRG- 139 G+T+CF+PT RS+KDPSRNR H LCGS+L+EE A+I+A+P +S + PGMKRS G Sbjct: 596 GTTLCFQPTARSVKDPSRNRRTHLLCGSLLEEEDALIVATPSTSTSHSRRYPGMKRSGGA 655 Query: 138 KEEERRAKEQSNRQ---VTAWKEE--------------------RDEEMEAVPPKIVAMH 28 K+ E++ KEQ N + W+ + D + E P K VA+H Sbjct: 656 KDLEKKFKEQINNEQPLAICWRGDLEETKKQEPKSKETNKDQLKNDNKREVFPGKNVAIH 715 Query: 27 RVRWNVNRG 1 RVRWN N+G Sbjct: 716 RVRWNANKG 724 >ref|XP_012492709.1| PREDICTED: uncharacterized protein LOC105804550 isoform X5 [Gossypium raimondii] Length = 852 Score = 221 bits (562), Expect = 4e-55 Identities = 124/298 (41%), Positives = 166/298 (55%), Gaps = 73/298 (24%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499 AWAP S++E +N+I+TAG G KFWDIRDPF PLW VHP Y LDWLP+PRC+ S Sbjct: 523 AWAPSGSDMESSNVILTAGHGGVKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVIIS 582 Query: 498 MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319 +DGTM LLSL +A+ D+PVTGK + + G H ++CSSFAIW +Q SRL GMVAYCG Sbjct: 583 FDDGTMKLLSLVQAACDVPVTGKPFGGSKQQGLHVYNCSSFAIWCVQVSRLIGMVAYCGA 642 Query: 318 EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPG----- 157 +G+ CF+ T++++ KD SRNR PH+ CGS+ +EE A+I+ +PL D+ K P Sbjct: 643 DGTAACFQLTSKAVDKDFSRNRSPHFACGSLTEEEPAVIVNTPLPDNPLPLKKPSSECGD 702 Query: 156 --------MKRSRGKE-EERRAKEQSNRQVT----------------------------- 91 + S GK ++R+AK ++ Q T Sbjct: 703 GQRSMRYFLTESLGKNAKDRKAKVPTSNQRTLALCYGNDPSLESEPEETLAALKSKMNPN 762 Query: 90 ---------------------AWKEERDE-------EMEAVPPKIVAMHRVRWNVNRG 1 A +ER+E +ME PPKIVAMHR+RWN+N+G Sbjct: 763 SKSDGKKKANDSQALAQGTKEATNKEREETEKEGESQMETFPPKIVAMHRLRWNMNKG 820 >ref|XP_012492702.1| PREDICTED: uncharacterized protein LOC105804550 
isoform X4 [Gossypium raimondii] Length = 852 Score = 221 bits (562), Expect = 4e-55 Identities = 124/298 (41%), Positives = 166/298 (55%), Gaps = 73/298 (24%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499 AWAP S++E +N+I+TAG G KFWDIRDPF PLW VHP Y LDWLP+PRC+ S Sbjct: 523 AWAPSGSDMESSNVILTAGHGGVKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVIIS 582 Query: 498 MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319 +DGTM LLSL +A+ D+PVTGK + + G H ++CSSFAIW +Q SRL GMVAYCG Sbjct: 583 FDDGTMKLLSLVQAACDVPVTGKPFGGSKQQGLHVYNCSSFAIWCVQVSRLIGMVAYCGA 642 Query: 318 EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPG----- 157 +G+ CF+ T++++ KD SRNR PH+ CGS+ +EE A+I+ +PL D+ K P Sbjct: 643 DGTAACFQLTSKAVDKDFSRNRSPHFACGSLTEEEPAVIVNTPLPDNPLPLKKPSSECGD 702 Query: 156 --------MKRSRGKE-EERRAKEQSNRQVT----------------------------- 91 + S GK ++R+AK ++ Q T Sbjct: 703 GQRSMRYFLTESLGKNAKDRKAKVPTSNQRTLALYDGNDPSVESEPEETLAALKSKMKPN 762 Query: 90 ---------------------AWKEERDE-------EMEAVPPKIVAMHRVRWNVNRG 1 A +ER+E +ME PPKIVAMHR+RWN+N+G Sbjct: 763 SKSDGKKKANDSQALAQGTKEATNKEREETEKEGESQMETFPPKIVAMHRLRWNMNKG 820 >ref|XP_011094383.1| PREDICTED: uncharacterized protein LOC105174093 isoform X3 [Sesamum indicum] Length = 692 Score = 218 bits (556), Expect = 2e-54 Identities = 94/127 (74%), Positives = 107/127 (84%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLWVHPFQGITYGLDWLPDPRCIFGSM 496 AWAP+Q LE AN+IIT G KGFK WDIRDPF PLW H G+TYGL+WLPDPRC+FGS+ Sbjct: 561 AWAPVQGELESANVIITTGPKGFKVWDIRDPFRPLWDHHIPGVTYGLEWLPDPRCVFGSI 620 Query: 495 EDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGEE 316 +DGT+WLLSLERA+HDIPVTGK A KHGFHSF CSSF+IW + ASRLTGMVAYCGEE Sbjct: 621 DDGTLWLLSLERAAHDIPVTGKSITAAPKHGFHSFDCSSFSIWCIHASRLTGMVAYCGEE 680 Query: 315 GSTICFK 295 G+T CF+ Sbjct: 681 GTTFCFQ 687 >emb|CDP15391.1| unnamed protein product [Coffea canephora] Length = 942 Score = 211 bits (536), Expect = 4e-52 Identities = 116/289 (40%), Positives = 157/289 (54%), Gaps = 
65/289 (22%) Frame = -2 Query: 672 WAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGSM 496 W P+ S E ANII+TAG +G KFWD+RDPF PLW +PFQ + Y LDWLPDPRCI S Sbjct: 630 WVPVSSYSESANIIVTAGHRGLKFWDLRDPFRPLWDFYPFQRVIYSLDWLPDPRCIIVSF 689 Query: 495 EDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGEE 316 +DG + +LSL +A++D PVTGK A + GFHS+ CS F IWS+ SRLTGMVAYCG + Sbjct: 690 DDGALRILSLLKAANDAPVTGKPFEGAQQKGFHSYLCSPFQIWSVHTSRLTGMVAYCGAD 749 Query: 315 GSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTK------SPG 157 G+ + F+ TTR++ KDP RNR PH+LCG++ +E + + + L ++ F + Sbjct: 750 GTALRFQLTTRAVEKDPLRNRAPHFLCGALTEENSTLTMFTSLPNTPFPMRKSLREWGEA 809 Query: 156 MKRSRG----KEEERRAKE-----------------------------------QSNRQV 94 + RG +E+RAK+ ++ + Sbjct: 810 PRTVRGYISVSNQEKRAKQKVVKVRSEEKHKALCKRGDLDSEFGPDCMAVTETREAGKVK 869 Query: 93 TAWKEERD------------------EEMEAVPPKIVAMHRVRWNVNRG 1 T+ E D EE+E P K VAMHRVRWN N+G Sbjct: 870 TSSNSEADQRPIMVGEDNPDIMRGEVEEVEVFPSKTVAMHRVRWNTNKG 918 >ref|XP_012492696.1| PREDICTED: uncharacterized protein LOC105804550 isoform X3 [Gossypium raimondii] Length = 877 Score = 210 bits (535), Expect = 6e-52 Identities = 109/231 (47%), Positives = 150/231 (64%), Gaps = 17/231 (7%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499 AWAP S++E +N+I+TAG G KFWDIRDPF PLW VHP Y LDWLP+PRC+ S Sbjct: 513 AWAPSGSDMESSNVILTAGHGGVKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVIIS 572 Query: 498 MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319 +DGTM LLSL +A+ D+PVTGK + + G H ++CSSFAIW +Q SRL GMVAYCG Sbjct: 573 FDDGTMKLLSLVQAACDVPVTGKPFGGSKQQGLHVYNCSSFAIWCVQVSRLIGMVAYCGA 632 Query: 318 EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPG----- 157 +G+ CF+ T++++ KD SRNR PH+ CGS+ +EE A+I+ +PL D+ K P Sbjct: 633 DGTAACFQLTSKAVDKDFSRNRSPHFACGSLTEEEPAVIVNTPLPDNPLPLKKPSSECGD 692 Query: 156 --------MKRSRGKE-EERRAK-EQSNRQVTAWKEERDEEMEAVPPKIVA 34 + S GK ++R+AK SN++ A + D +E+ P + +A Sbjct: 693 GQRSMRYFLTESLGKNAKDRKAKVPTSNQRTLALYDGNDPSVESEPEETLA 743 
>ref|XP_012492689.1| PREDICTED: uncharacterized protein LOC105804550 isoform X2 [Gossypium raimondii] gi|763743438|gb|KJB10937.1| hypothetical protein B456_001G233200 [Gossypium raimondii] Length = 886 Score = 210 bits (535), Expect = 6e-52 Identities = 109/231 (47%), Positives = 150/231 (64%), Gaps = 17/231 (7%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499 AWAP S++E +N+I+TAG G KFWDIRDPF PLW VHP Y LDWLP+PRC+ S Sbjct: 522 AWAPSGSDMESSNVILTAGHGGVKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVIIS 581 Query: 498 MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319 +DGTM LLSL +A+ D+PVTGK + + G H ++CSSFAIW +Q SRL GMVAYCG Sbjct: 582 FDDGTMKLLSLVQAACDVPVTGKPFGGSKQQGLHVYNCSSFAIWCVQVSRLIGMVAYCGA 641 Query: 318 EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPG----- 157 +G+ CF+ T++++ KD SRNR PH+ CGS+ +EE A+I+ +PL D+ K P Sbjct: 642 DGTAACFQLTSKAVDKDFSRNRSPHFACGSLTEEEPAVIVNTPLPDNPLPLKKPSSECGD 701 Query: 156 --------MKRSRGKE-EERRAK-EQSNRQVTAWKEERDEEMEAVPPKIVA 34 + S GK ++R+AK SN++ A + D +E+ P + +A Sbjct: 702 GQRSMRYFLTESLGKNAKDRKAKVPTSNQRTLALYDGNDPSVESEPEETLA 752 >ref|XP_012492674.1| PREDICTED: uncharacterized protein LOC105804550 isoform X1 [Gossypium raimondii] gi|823127124|ref|XP_012492682.1| PREDICTED: uncharacterized protein LOC105804550 isoform X1 [Gossypium raimondii] gi|763743433|gb|KJB10932.1| hypothetical protein B456_001G233200 [Gossypium raimondii] gi|763743437|gb|KJB10936.1| hypothetical protein B456_001G233200 [Gossypium raimondii] Length = 887 Score = 210 bits (535), Expect = 6e-52 Identities = 109/231 (47%), Positives = 150/231 (64%), Gaps = 17/231 (7%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499 AWAP S++E +N+I+TAG G KFWDIRDPF PLW VHP Y LDWLP+PRC+ S Sbjct: 523 AWAPSGSDMESSNVILTAGHGGVKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVIIS 582 Query: 498 MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319 +DGTM LLSL +A+ D+PVTGK + + G H ++CSSFAIW +Q SRL GMVAYCG Sbjct: 583 
FDDGTMKLLSLVQAACDVPVTGKPFGGSKQQGLHVYNCSSFAIWCVQVSRLIGMVAYCGA 642 Query: 318 EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPG----- 157 +G+ CF+ T++++ KD SRNR PH+ CGS+ +EE A+I+ +PL D+ K P Sbjct: 643 DGTAACFQLTSKAVDKDFSRNRSPHFACGSLTEEEPAVIVNTPLPDNPLPLKKPSSECGD 702 Query: 156 --------MKRSRGKE-EERRAK-EQSNRQVTAWKEERDEEMEAVPPKIVA 34 + S GK ++R+AK SN++ A + D +E+ P + +A Sbjct: 703 GQRSMRYFLTESLGKNAKDRKAKVPTSNQRTLALYDGNDPSVESEPEETLA 753 >ref|XP_002311825.2| hypothetical protein POPTR_0008s20540g [Populus trichocarpa] gi|550333546|gb|EEE89192.2| hypothetical protein POPTR_0008s20540g [Populus trichocarpa] Length = 813 Score = 209 bits (532), Expect = 1e-51 Identities = 118/291 (40%), Positives = 157/291 (53%), Gaps = 66/291 (22%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499 AW P +S+ E N+I+TAG G KFWDIRDPF PLW +HP + Y LDWLPDPRCI S Sbjct: 491 AWVPSESDQESPNLILTAGHLGLKFWDIRDPFRPLWDLHPAPKLIYSLDWLPDPRCIILS 550 Query: 498 MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319 +DGTM LLSL RA++D V GK + G H +CSSFAIWS+Q SRLTGMVAYC Sbjct: 551 FDDGTMRLLSLARAAYDAAVNGKPSVGPKQLGMHVVNCSSFAIWSVQVSRLTGMVAYCSA 610 Query: 318 EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPGMKRSR 142 +G+ F+ TT+++ KDPSR+R PH+ CGS+ ++E AII+ +PL D+ K P Sbjct: 611 DGTVCRFQLTTKAVEKDPSRHRAPHFGCGSLSEDESAIIVGTPLPDTPLPLKKPVNDVGN 670 Query: 141 GKEEERRAK---------------------------EQSNRQVTAWKEER---------- 73 + ++R S+ +TA K +R Sbjct: 671 NPKSKQRLSVSNKAAKIPTSDDPPLALCYGDDPGMDHGSDETLTATKSKRKPKSKSGSKQ 730 Query: 72 -----------DEEME----------------AVPPKIVAMHRVRWNVNRG 1 D+E + ++PPK+VAMHRVRWN+N+G Sbjct: 731 MEGEDQALVCIDDEQDVKQKGGGKEGAGNVVESIPPKMVAMHRVRWNMNKG 781 >gb|KDO61089.1| hypothetical protein CISIN_1g008363mg [Citrus sinensis] Length = 568 Score = 207 bits (528), Expect = 4e-51 Identities = 117/299 (39%), Positives = 161/299 (53%), Gaps = 74/299 (24%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499 +WAP +S+ + AN+I+TAG G 
KFWDIRDPF PLW +HP YGLDWLPDP C+ S Sbjct: 236 SWAPAESDSDSANVILTAGHGGLKFWDIRDPFRPLWDIHPAPKFIYGLDWLPDPGCVILS 295 Query: 498 MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319 +DG M ++SL +A++D+P TGK + G H +CSSFAIWS+Q SRLTGMVAYC Sbjct: 296 FDDGAMRIVSLLKAAYDVPATGKPFAGTKQQGLHLVNCSSFAIWSVQVSRLTGMVAYCSA 355 Query: 318 EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPL---------------- 190 +G+ F+ T +++ KD SRNR H+LCGS+ ++E AI + +PL Sbjct: 356 DGTVHRFQLTAKAVEKDHSRNRPMHFLCGSVTEDESAITVNTPLDNTPVPLKKTVHDAGE 415 Query: 189 ------LDSSFQTKSPGMKRSR------------------GKEEE--------RRAKEQS 106 L S +KSP K+ + G+E E + ++ Sbjct: 416 RSMRSFLIESNSSKSPNDKKGKNVLSSDNQPLALCYGNEPGEESEGDMTLAALKNKQKPK 475 Query: 105 NRQVTAWKEERDEEM------------------------EAVPPKIVAMHRVRWNVNRG 1 +R + KEE D+ M E +PPK+VAMHRVRWN+N+G Sbjct: 476 SRSSSKKKEEDDQAMVCIDEEATDIQGKENEKGEAGNGIEVLPPKVVAMHRVRWNMNKG 534 >ref|XP_006481815.1| PREDICTED: uncharacterized protein LOC102609984 isoform X3 [Citrus sinensis] Length = 801 Score = 207 bits (528), Expect = 4e-51 Identities = 117/299 (39%), Positives = 161/299 (53%), Gaps = 74/299 (24%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499 +WAP +S+ + AN+I+TAG G KFWDIRDPF PLW +HP YGLDWLPDP C+ S Sbjct: 469 SWAPAESDSDSANVILTAGHGGLKFWDIRDPFRPLWDIHPAPKFIYGLDWLPDPGCVILS 528 Query: 498 MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319 +DG M ++SL +A++D+P TGK + G H +CSSFAIWS+Q SRLTGMVAYC Sbjct: 529 FDDGAMRIVSLLKAAYDVPATGKPFAGTKQQGLHLVNCSSFAIWSVQVSRLTGMVAYCSA 588 Query: 318 EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPL---------------- 190 +G+ F+ T +++ KD SRNR H+LCGS+ ++E AI + +PL Sbjct: 589 DGTVHRFQLTAKAVEKDHSRNRPMHFLCGSVTEDESAITVNTPLDNTPVPLKKTVHDAGE 648 Query: 189 ------LDSSFQTKSPGMKRSR------------------GKEEE--------RRAKEQS 106 L S +KSP K+ + G+E E + ++ Sbjct: 649 RSMRSFLIESNSSKSPNDKKGKNVLSSDNQPLALCYGNEPGEESEGDMTLAALKNKQKPK 708 Query: 105 NRQVTAWKEERDEEM------------------------EAVPPKIVAMHRVRWNVNRG 1 +R + KEE D+ M E +PPK+VAMHRVRWN+N+G Sbjct: 
709 SRSSSKKKEEDDQAMVCIDEEATDIQGKENAKGEAGNGIEVLPPKVVAMHRVRWNMNKG 767 >ref|XP_006481813.1| PREDICTED: uncharacterized protein LOC102609984 isoform X1 [Citrus sinensis] gi|568856485|ref|XP_006481814.1| PREDICTED: uncharacterized protein LOC102609984 isoform X2 [Citrus sinensis] Length = 911 Score = 207 bits (528), Expect = 4e-51 Identities = 117/299 (39%), Positives = 161/299 (53%), Gaps = 74/299 (24%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499 +WAP +S+ + AN+I+TAG G KFWDIRDPF PLW +HP YGLDWLPDP C+ S Sbjct: 579 SWAPAESDSDSANVILTAGHGGLKFWDIRDPFRPLWDIHPAPKFIYGLDWLPDPGCVILS 638 Query: 498 MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319 +DG M ++SL +A++D+P TGK + G H +CSSFAIWS+Q SRLTGMVAYC Sbjct: 639 FDDGAMRIVSLLKAAYDVPATGKPFAGTKQQGLHLVNCSSFAIWSVQVSRLTGMVAYCSA 698 Query: 318 EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPL---------------- 190 +G+ F+ T +++ KD SRNR H+LCGS+ ++E AI + +PL Sbjct: 699 DGTVHRFQLTAKAVEKDHSRNRPMHFLCGSVTEDESAITVNTPLDNTPVPLKKTVHDAGE 758 Query: 189 ------LDSSFQTKSPGMKRSR------------------GKEEE--------RRAKEQS 106 L S +KSP K+ + G+E E + ++ Sbjct: 759 RSMRSFLIESNSSKSPNDKKGKNVLSSDNQPLALCYGNEPGEESEGDMTLAALKNKQKPK 818 Query: 105 NRQVTAWKEERDEEM------------------------EAVPPKIVAMHRVRWNVNRG 1 +R + KEE D+ M E +PPK+VAMHRVRWN+N+G Sbjct: 819 SRSSSKKKEEDDQAMVCIDEEATDIQGKENAKGEAGNGIEVLPPKVVAMHRVRWNMNKG 877 >gb|KHG10240.1| General transcription factor 3C polypeptide 2 [Gossypium arboreum] Length = 925 Score = 207 bits (527), Expect = 5e-51 Identities = 109/231 (47%), Positives = 148/231 (64%), Gaps = 17/231 (7%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499 AWAP S++E +N+I+TAG G KFWDIRDPF PLW VHP Y LDWLP+PRC+ S Sbjct: 561 AWAPSGSDMESSNVILTAGHGGVKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVIIS 620 Query: 498 MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319 +DGTM LLSL +A+ D+PVTGK + + G H ++CSSFAIW +Q SRLTGMVAYCG Sbjct: 621 
FDDGTMKLLSLVQAACDVPVTGKPFGGSKQQGLHVYNCSSFAIWCVQVSRLTGMVAYCGA 680 Query: 318 EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPG----- 157 +G+ F+ T++++ KD SRNR PH+ CGS+ +EE A+I+ +PL D K P Sbjct: 681 DGTATYFQLTSKAVDKDFSRNRSPHFACGSLTEEEPAVIVNTPLPDKPLPLKKPSSECGD 740 Query: 156 --------MKRSRGKE-EERRAK-EQSNRQVTAWKEERDEEMEAVPPKIVA 34 + S GK + R+AK SN++ A + D +E+ P + +A Sbjct: 741 GQRSMRYFLTESLGKNAKHRKAKVPTSNQRTLALYDGNDPSVESEPDETLA 791 >ref|XP_007049744.1| DNA binding protein, putative isoform 1 [Theobroma cacao] gi|508702005|gb|EOX93901.1| DNA binding protein, putative isoform 1 [Theobroma cacao] Length = 868 Score = 204 bits (519), Expect = 4e-50 Identities = 99/198 (50%), Positives = 132/198 (66%), Gaps = 2/198 (1%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499 AWAP S++E AN+++TAG G KFWDIRDPF PLW VHP Y LDWLP+PRC+ S Sbjct: 540 AWAPSGSDMESANVVLTAGHGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILS 599 Query: 498 MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319 +DGTM +LSL +A+ D+PVTGK + G H ++CSSFAIW++Q SRLTGMVAYCG Sbjct: 600 FDDGTMKMLSLIQAACDVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGA 659 Query: 318 EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPGMKRSR 142 +G+ F+ T++++ KD SRNR PH++CGS+ +EE AI++ +PL D K Sbjct: 660 DGNVTRFQLTSKAVDKDFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGE 719 Query: 141 GKEEERRAKEQSNRQVTA 88 G R +SN+ A Sbjct: 720 GPRSMRAFLTESNQAKNA 737 >ref|XP_011010279.1| PREDICTED: uncharacterized protein LOC105115164 isoform X2 [Populus euphratica] Length = 930 Score = 196 bits (499), Expect = 8e-48 Identities = 95/174 (54%), Positives = 121/174 (69%), Gaps = 2/174 (1%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499 AW P +S+ E N+I+TAG G KFWDIRDPF PLW +HP + Y LDWLPDPRCI S Sbjct: 608 AWVPSESDQESPNLILTAGHLGLKFWDIRDPFRPLWDLHPAPKLIYSLDWLPDPRCIILS 667 Query: 498 MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319 +DGTM LLSL RA++D V GK + G H +CSSFAIWS+Q SRLTGMVAYC 
Sbjct: 668 FDDGTMRLLSLARAAYDAAVNGKPSVGPKQLGMHVVNCSSFAIWSVQVSRLTGMVAYCSA 727 Query: 318 EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSP 160 +G+ F+ TT+++ KDPSR+R PH+ CG + ++E AII+ +PL D+ K P Sbjct: 728 DGTVCRFQLTTKAVEKDPSRHRAPHFGCGFLSEDESAIIVGTPLPDNPLPLKKP 781 >ref|XP_011010277.1| PREDICTED: uncharacterized protein LOC105115164 isoform X1 [Populus euphratica] gi|743931985|ref|XP_011010278.1| PREDICTED: uncharacterized protein LOC105115164 isoform X1 [Populus euphratica] Length = 931 Score = 196 bits (499), Expect = 8e-48 Identities = 95/174 (54%), Positives = 121/174 (69%), Gaps = 2/174 (1%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499 AW P +S+ E N+I+TAG G KFWDIRDPF PLW +HP + Y LDWLPDPRCI S Sbjct: 609 AWVPSESDQESPNLILTAGHLGLKFWDIRDPFRPLWDLHPAPKLIYSLDWLPDPRCIILS 668 Query: 498 MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319 +DGTM LLSL RA++D V GK + G H +CSSFAIWS+Q SRLTGMVAYC Sbjct: 669 FDDGTMRLLSLARAAYDAAVNGKPSVGPKQLGMHVVNCSSFAIWSVQVSRLTGMVAYCSA 728 Query: 318 EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSP 160 +G+ F+ TT+++ KDPSR+R PH+ CG + ++E AII+ +PL D+ K P Sbjct: 729 DGTVCRFQLTTKAVEKDPSRHRAPHFGCGFLSEDESAIIVGTPLPDNPLPLKKP 782 >ref|XP_013733163.1| PREDICTED: uncharacterized protein LOC106436761 [Brassica napus] Length = 2668 Score = 195 bits (496), Expect = 2e-47 Identities = 111/287 (38%), Positives = 156/287 (54%), Gaps = 62/287 (21%) Frame = -2 Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLWVHPF-QGITY-GLDWLPDPRCIFG 502 +WAP QS+ A +IITAG KG KFWD+RDPFHPL + QG+ +DWLP+PRCI Sbjct: 566 SWAPFQSDSGSATVIITAGHKGLKFWDLRDPFHPLREYNVGQGVNICSVDWLPEPRCIII 625 Query: 501 SMEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCG 322 S +DGT+ +LSL +A++D+PVTG + GFHSF S AIW++QASR+TG+VAYCG Sbjct: 626 SCDDGTLKILSLPKAAYDVPVTGNFLVGTKQQGFHSFSRSLLAIWNVQASRVTGLVAYCG 685 Query: 321 EEGSTICFKPTTRSLKDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPGMK--- 151 +G+ + F+ T+R D RNR PH+LCGS ++E I + +P+ +S F+T G + Sbjct: 686 
ADGTAVRFQLTSRMENDAVRNRTPHFLCGSFSEDESGISVVTPVPNSPFRTFYSGKQWRD 745 Query: 150 --------RSRGKEEERRAKEQSNRQVTAWKEERD------------------------- 70 + +E+RA EQS+ Q A D Sbjct: 746 TISRFPHGVNSVPNQEKRAMEQSDEQPLALCYGNDPNVEGGSDDELVAQKSKQASKAKTK 805 Query: 69 ------------------------EEMEAVPPKIVAMHRVRWNVNRG 1 E++E +PPK V+++RVRWN+NRG Sbjct: 806 TTSKKPKASDCALICNEEEPTRLREKLEELPPKDVSINRVRWNMNRG 852