BLASTX nr result
ID: Atropa21_contig00004350
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00004350
         (2093 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006345030.1| PREDICTED: uncharacterized protein LOC102588...  1104   0.0
ref|XP_004236128.1| PREDICTED: uncharacterized protein LOC101252...   988   0.0
gb|AAX73298.1| putative BAH domain-containing protein [Solanum l...   716   0.0
ref|XP_004242163.1| PREDICTED: uncharacterized protein LOC101255...   716   0.0
ref|XP_003634295.1| PREDICTED: uncharacterized protein LOC100248...   483   e-133
emb|CAN60153.1| hypothetical protein VITISV_021504 [Vitis vinifera]   480   e-133
ref|XP_002318026.2| hypothetical protein POPTR_0012s07900g [Popu...   467   e-129
ref|XP_002511441.1| DNA binding protein, putative [Ricinus commu...   462   e-127
gb|EOY20638.1| BAH domain,TFIIS helical bundle-like domain isofo...   459   e-126
gb|EOY20637.1| BAH domain,TFIIS helical bundle-like domain isofo...   459   e-126
gb|EOY20634.1| BAH domain,TFIIS helical bundle-like domain isofo...   459   e-126
ref|XP_002321574.2| hypothetical protein POPTR_0015s08400g [Popu...   456   e-125
ref|XP_002511444.1| conserved hypothetical protein [Ricinus comm...   455   e-125
gb|EXC31170.1| hypothetical protein L484_004936 [Morus notabilis]     443   e-121
gb|EMJ11634.1| hypothetical protein PRUPE_ppa000152mg [Prunus pe...   442   e-121
ref|XP_002321576.2| hypothetical protein POPTR_0015s08410g [Popu...   436   e-119
ref|XP_006376841.1| hypothetical protein POPTR_0012s07910g [Popu...   433   e-118
ref|XP_002318025.2| hypothetical protein POPTR_0012s07910g [Popu...   433   e-118
ref|XP_002318028.2| hypothetical protein POPTR_0012s07910g [Popu...
433 e-118 ref|XP_006439759.1| hypothetical protein CICLE_v10018474mg [Citr... 429 e-117 >ref|XP_006345030.1| PREDICTED: uncharacterized protein LOC102588004 isoform X1 [Solanum tuberosum] gi|565356351|ref|XP_006345031.1| PREDICTED: uncharacterized protein LOC102588004 isoform X2 [Solanum tuberosum] Length = 1638 Score = 1104 bits (2856), Expect = 0.0 Identities = 579/698 (82%), Positives = 607/698 (86%), Gaps = 1/698 (0%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLRQHK+TEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSI+K Sbjct: 391 NHLRQHKNTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSITK 450 Query: 181 NPGGSNDVTKSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSPASGKEGQPRVS 360 NPGG NDVTKSAV QFSAS+MASIKTSQGETT KSASLSPGSTKPASSPASGKEGQ RVS Sbjct: 451 NPGGPNDVTKSAVAQFSASRMASIKTSQGETTIKSASLSPGSTKPASSPASGKEGQHRVS 510 Query: 361 VGGSFDVPLAREDKXXXXXXXXXXXXXXXGKEDGRSSTAVSMNSIKISTGGSRHRKSVDG 540 VGGS DVP AREDK GKEDGRSSTAVSMNSIKISTGGSRHRKSV+G Sbjct: 511 VGGSCDVPSAREDKSSSSSQSHNHSQSISGKEDGRSSTAVSMNSIKISTGGSRHRKSVNG 570 Query: 541 YPGSSVSGSQKESPAGRGSHRNPSSEKLPQ-VMSGEKTVDVPVIEGSGHKLIVKISNRGR 717 YPGSSVSGSQKESPA R SHRNPSSEKLPQ +SGEKT+DVPV+EGSGHKLIVKI NRGR Sbjct: 571 YPGSSVSGSQKESPADRSSHRNPSSEKLPQPAVSGEKTMDVPVLEGSGHKLIVKIPNRGR 630 Query: 718 SPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEKTDAYRSNFDANSESWQSNDF 897 SPAQSASGGSYEDPTNMSSRASSPVLSEK+DQFD+TLKEKTDA RSN D N+ESWQSNDF Sbjct: 631 SPAQSASGGSYEDPTNMSSRASSPVLSEKSDQFDQTLKEKTDADRSNLDTNAESWQSNDF 690 Query: 898 KDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEVRATCTSGSEPKSGKLHEASFSSMNA 1077 KDILTGSD+GDGSPAA PEE RSKIVDD KSAEVRA CTSG+EPKSGKLHEAS+S MNA Sbjct: 691 KDILTGSDDGDGSPAAVPEEVRSKIVDDGRKSAEVRAACTSGTEPKSGKLHEASYSPMNA 750 Query: 1078 LIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSKSDMVSPSVSPQRNNPAAEEACPGD 1257 LIESCVKYSESNVPMLL DAIGMNLLASVAAEEMSKS+MVSPSVSPQRN PAAE+AC GD Sbjct: 751 LIESCVKYSESNVPMLLGDAIGMNLLASVAAEEMSKSNMVSPSVSPQRNIPAAEDACTGD 810 Query: 1258 DVKSKSPPSDISAGDCRNDDDGNREKLVSACASWSEDKLHPSKGAVMELSGDRKASFSPS 1437 D KSKSPP 
DISAGD +NDD GN EKLV A ASWS+DKL S GA MEL GDRKAS SPS Sbjct: 811 DAKSKSPPGDISAGDRKNDDAGNGEKLVIASASWSKDKLLSSMGAAMELPGDRKASISPS 870 Query: 1438 QETMIGGCNKQFNSPCIDSQTAVVKLEITEKSDEVGKYPPSPHSVSGKAIDGELSKQFQX 1617 QETM GGCNKQFNSPC DSQTA KLEITEKS EV KY SPHSVS KAIDGELSKQF Sbjct: 871 QETMTGGCNKQFNSPCFDSQTAGEKLEITEKSGEVEKYASSPHSVSEKAIDGELSKQFHE 930 Query: 1618 XXXXXXXXXXXGAVDAKLGGDGTSVLGDKVTNAGASLEDQKSTVEVCSSKFVSENKNSVS 1797 GA+DAKLGGDGTSVLGDKVT+A AS EDQK +VEVC+SKF SENKN V+ Sbjct: 931 EMVVSREVKVEGALDAKLGGDGTSVLGDKVTSAVASSEDQKPSVEVCTSKFESENKNGVN 990 Query: 1798 RVLNTASTEMKPSYVVVKSEKTGGSNKEERLPTSSSGDPTTVRGGRSDEASINHVDLSEK 1977 RVLN S MKPS VVV SEK GS+KEERLPTSSSGDPTTVRGGRSDE S+N V+LSEK Sbjct: 991 RVLNITSIGMKPSSVVVNSEKMEGSDKEERLPTSSSGDPTTVRGGRSDEVSLNLVNLSEK 1050 Query: 1978 TKSDHGTVEASVEDKARVETDITTRNQQGEASVERKDI 2091 KSD G VEASVEDKARVETD+TTRNQ+GEASVERKD+ Sbjct: 1051 AKSDQGNVEASVEDKARVETDVTTRNQKGEASVERKDV 1088 >ref|XP_004236128.1| PREDICTED: uncharacterized protein LOC101252674 [Solanum lycopersicum] Length = 1602 Score = 988 bits (2555), Expect = 0.0 Identities = 531/699 (75%), Positives = 567/699 (81%), Gaps = 2/699 (0%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLRQHK+TEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQA TWPSKSRLPEASHSISK Sbjct: 390 NHLRQHKNTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAATWPSKSRLPEASHSISK 449 Query: 181 NPGGSNDVTKSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSPASGKEGQPRVS 360 NPGGSNDVTKSAV Q SAS+MASIKTSQGETT KSASLSPGSTKPASSPASGKEGQ RVS Sbjct: 450 NPGGSNDVTKSAVAQLSASRMASIKTSQGETTVKSASLSPGSTKPASSPASGKEGQHRVS 509 Query: 361 VGGSFDVPLAREDKXXXXXXXXXXXXXXXGKEDGRSSTAVSMNSIKISTGGSRHRKSVDG 540 VGGS DVP AREDK GKEDGRSSTAVSMNSIKISTGGSRHRKS +G Sbjct: 510 VGGSCDVPSAREDKSSSSSQSHNHSQSISGKEDGRSSTAVSMNSIKISTGGSRHRKSNNG 569 Query: 541 YPGSSVSGSQKESPAGRGSHRNPSSEKLPQ-VMSGEKTVDVPVIEGSGHKLIVKISNRGR 717 YPGSS+SGSQKE+PAGR SHRNP+SEKLPQ +SGEK +DVPV+EGSGHKL VK+S+RGR Sbjct: 570 YPGSSISGSQKETPAGRSSHRNPTSEKLPQSAVSGEKIMDVPVLEGSGHKLKVKMSSRGR 629 Query: 718 
SPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEKTDAYRSNFDANSESWQSNDF 897 SPAQSASGGSYEDPTNMSSRASSPVLSEK+DQFDRTLKEKTDA RSN +AN+ESWQSNDF Sbjct: 630 SPAQSASGGSYEDPTNMSSRASSPVLSEKSDQFDRTLKEKTDADRSNLEANAESWQSNDF 689 Query: 898 KDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEVRATCTSGSEPKSGKLHEASFSSMNA 1077 KDILTGSD+GDGSPAA EEERSKIVDDS +SAEVRA CTSG+E KSGKLHEAS+S MNA Sbjct: 690 KDILTGSDDGDGSPAAVTEEERSKIVDDSRRSAEVRAACTSGTEAKSGKLHEASYSPMNA 749 Query: 1078 LIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSKSDMVSPSVSPQRNNPAAEEACPGD 1257 LIESCVKYSESNVPMLL DAIGMNLLASVAAEEMSKS+MVSPSVS RN PAAEEAC GD Sbjct: 750 LIESCVKYSESNVPMLLGDAIGMNLLASVAAEEMSKSNMVSPSVSSHRNTPAAEEACTGD 809 Query: 1258 DVKSKSPPSDISAGDCRNDD-DGNREKLVSACASWSEDKLHPSKGAVMELSGDRKASFSP 1434 D KSKSPP DI+AGD +NDD DGN E+L+ A ASWSEDKL S GA +EL GDRKAS SP Sbjct: 810 DAKSKSPPGDITAGDRKNDDGDGNGEELIIASASWSEDKLLSSMGAAIELPGDRKASVSP 869 Query: 1435 SQETMIGGCNKQFNSPCIDSQTAVVKLEITEKSDEVGKYPPSPHSVSGKAIDGELSKQFQ 1614 SQETM GGC KQFNSPC DSQTA KLEITEKS EV KY SP +VS KAIDGE SKQF Sbjct: 870 SQETMAGGC-KQFNSPCFDSQTAGEKLEITEKSGEVEKYASSPRTVSEKAIDGEASKQFH 928 Query: 1615 XXXXXXXXXXXXGAVDAKLGGDGTSVLGDKVTNAGASLEDQKSTVEVCSSKFVSENKNSV 1794 G +DAKLGGDG SVLGDKV + ASLEDQK +VEVC+SKF SENKN + Sbjct: 929 EETVVSREVKVEGPLDAKLGGDGASVLGDKVASTVASLEDQKPSVEVCTSKFESENKNGM 988 Query: 1795 SRVLNTASTEMKPSYVVVKSEKTGGSNKEERLPTSSSGDPTTVRGGRSDEASINHVDLSE 1974 +RVLN AS E KPS VVV SEK GS+KEERL Sbjct: 989 NRVLNIASAETKPSSVVVNSEKLEGSDKEERL---------------------------- 1020 Query: 1975 KTKSDHGTVEASVEDKARVETDITTRNQQGEASVERKDI 2091 +EASVEDKARV TDI TRNQ+GEASVERK++ Sbjct: 1021 ------ANIEASVEDKARVGTDIVTRNQKGEASVERKNV 1053 >gb|AAX73298.1| putative BAH domain-containing protein [Solanum lycopersicum] Length = 1608 Score = 716 bits (1848), Expect = 0.0 Identities = 410/699 (58%), Positives = 481/699 (68%), Gaps = 14/699 (2%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLRQHK+ EIQRKARSLVDTWKKRVEAEMN+ID+KSGSNQAVTWPSK+RLPEASHS K Sbjct: 364 
NHLRQHKNMEIQRKARSLVDTWKKRVEAEMNMIDSKSGSNQAVTWPSKARLPEASHSGEK 423 Query: 181 NPGGSNDVTKSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSPASGKEGQPRVS 360 N GGS D T+S+V QFSASK SIK + ET KSA SPG K AS P+SGK GQPR+S Sbjct: 424 NAGGSTDATRSSVTQFSASKTTSIKPTPVETNMKSACSSPGPIKQASPPSSGKVGQPRIS 483 Query: 361 VGGSFDVPLAREDKXXXXXXXXXXXXXXXGKEDGRSSTAVSMNSIKISTGGSRHRKSVDG 540 GS DVPLAREDK GKED RSSTAVSM+SIKIS+GGSRHRKS++G Sbjct: 484 AFGSSDVPLAREDKSSSSSQSHNHSQSLSGKEDARSSTAVSMSSIKISSGGSRHRKSING 543 Query: 541 YPGSSVSGSQKESPAGRGS--HRNPSSEK-LPQVMSGEKTVDVPVIEGSGHKLIVKISNR 711 PG SVS QKE R S HRNP++EK L +SGEKTVDVP +EGS HKLIVKI N+ Sbjct: 544 GPGPSVSAGQKEGSTNRSSSLHRNPTTEKSLQSALSGEKTVDVPAVEGSCHKLIVKIPNK 603 Query: 712 GRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEKTDAYRSN--FDANSESWQ 885 GRSPA+S SGGS EDP+ MSSRASSPVLSEKNDQ DR KEK DAYRS+ + N+ESWQ Sbjct: 604 GRSPARSVSGGSCEDPSIMSSRASSPVLSEKNDQLDRNSKEKKDAYRSDVTINVNTESWQ 663 Query: 886 SNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEVR--ATCTSGSEPKSGKLHEAS 1059 SN KD+LTGSDEGDGSP A EEER K + KSAEV + +SG+E KSGKLHEAS Sbjct: 664 SNVLKDVLTGSDEGDGSPVAVLEEERRKTAGEGRKSAEVAKPGSSSSGTELKSGKLHEAS 723 Query: 1060 FSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSKSDMVSPSVSPQRNNPAAE 1239 FSSMNALIESC KYSE+N M L DA+GMNLLASVA EEMSKS VSP VSPQ ++P+ Sbjct: 724 FSSMNALIESCAKYSEANASMSLSDAVGMNLLASVATEEMSKSGRVSPFVSPQGDSPSGG 783 Query: 1240 EACPGDDVKSKSPPSDISAGD--CRNDDDGNREK---LVSACASWSEDKLHPSKGAVMEL 1404 E C GD++K K+ P D S+G+ RND D N +K V A SWSE K+H ++ A+ + Sbjct: 784 ETCTGDELKPKTSPVDSSSGNHSGRNDGDANGDKEKQFVVANTSWSEGKVHANRSAMTDF 843 Query: 1405 SGDRKASFSPSQETMIGGCNKQFNSPCIDSQTA-VVKLEITEKSDEVGKYPPSPHSVSGK 1581 + +R+ S SPS+ET G C FNS C DSQ A +K + EK E+ K +P +V K Sbjct: 844 NRERRPSSSPSEETTTGEC---FNSSCTDSQMAGNLKSGVNEKLVEMAKSAAAPCNVFEK 900 Query: 1582 AIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSVLGDKVTNAGASLEDQKSTVEVCS 1761 A DGE S+QF +D + GG G+S+ DKVTN S+E K V + + Sbjct: 901 ASDGEQSRQFH-EEKVISTKTLDNVLDGESGGHGSSIGEDKVTNGLVSIEGLKRPVGISA 959 Query: 1762 SKFVSENKNSVSRVLNTASTEMKPSYVVVKSEKTGGSNKEERLPTSSSGDPTTVRGGRSD 1941 K+ ++KN 
VSRVL ASTE+KP VVVKSE T +KEE T SS D +GG SD Sbjct: 960 FKYEGDDKNDVSRVLGVASTEVKPPSVVVKSEATERGDKEELQQTGSSRDTIAGKGGHSD 1019 Query: 1942 EASINHVDLSEKTKSDHGTVEASV-EDKARVETDITTRN 2055 E N V SE+ SD TV+ SV EDKA E ++ RN Sbjct: 1020 EMDANSVLKSEQPNSDKKTVDTSVIEDKAASECNLAIRN 1058 >ref|XP_004242163.1| PREDICTED: uncharacterized protein LOC101255308 [Solanum lycopersicum] gi|113205156|gb|AAX95757.2| BAH domain-containing protein, putative [Solanum lycopersicum] Length = 1631 Score = 716 bits (1848), Expect = 0.0 Identities = 410/699 (58%), Positives = 481/699 (68%), Gaps = 14/699 (2%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLRQHK+ EIQRKARSLVDTWKKRVEAEMN+ID+KSGSNQAVTWPSK+RLPEASHS K Sbjct: 387 NHLRQHKNMEIQRKARSLVDTWKKRVEAEMNMIDSKSGSNQAVTWPSKARLPEASHSGEK 446 Query: 181 NPGGSNDVTKSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSPASGKEGQPRVS 360 N GGS D T+S+V QFSASK SIK + ET KSA SPG K AS P+SGK GQPR+S Sbjct: 447 NAGGSTDATRSSVTQFSASKTTSIKPTPVETNMKSACSSPGPIKQASPPSSGKVGQPRIS 506 Query: 361 VGGSFDVPLAREDKXXXXXXXXXXXXXXXGKEDGRSSTAVSMNSIKISTGGSRHRKSVDG 540 GS DVPLAREDK GKED RSSTAVSM+SIKIS+GGSRHRKS++G Sbjct: 507 AFGSSDVPLAREDKSSSSSQSHNHSQSLSGKEDARSSTAVSMSSIKISSGGSRHRKSING 566 Query: 541 YPGSSVSGSQKESPAGRGS--HRNPSSEK-LPQVMSGEKTVDVPVIEGSGHKLIVKISNR 711 PG SVS QKE R S HRNP++EK L +SGEKTVDVP +EGS HKLIVKI N+ Sbjct: 567 GPGPSVSAGQKEGSTNRSSSLHRNPTTEKSLQSALSGEKTVDVPAVEGSCHKLIVKIPNK 626 Query: 712 GRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEKTDAYRSN--FDANSESWQ 885 GRSPA+S SGGS EDP+ MSSRASSPVLSEKNDQ DR KEK DAYRS+ + N+ESWQ Sbjct: 627 GRSPARSVSGGSCEDPSIMSSRASSPVLSEKNDQLDRNSKEKKDAYRSDVTINVNTESWQ 686 Query: 886 SNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEVR--ATCTSGSEPKSGKLHEAS 1059 SN KD+LTGSDEGDGSP A EEER K + KSAEV + +SG+E KSGKLHEAS Sbjct: 687 SNVLKDVLTGSDEGDGSPVAVLEEERRKTAGEGRKSAEVAKPGSSSSGTELKSGKLHEAS 746 Query: 1060 FSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSKSDMVSPSVSPQRNNPAAE 1239 FSSMNALIESC KYSE+N M L DA+GMNLLASVA EEMSKS VSP VSPQ ++P+ Sbjct: 747 
FSSMNALIESCAKYSEANASMSLSDAVGMNLLASVATEEMSKSGRVSPFVSPQGDSPSGG 806 Query: 1240 EACPGDDVKSKSPPSDISAGD--CRNDDDGNREK---LVSACASWSEDKLHPSKGAVMEL 1404 E C GD++K K+ P D S+G+ RND D N +K V A SWSE K+H ++ A+ + Sbjct: 807 ETCTGDELKPKTSPVDSSSGNHSGRNDGDANGDKEKQFVVANTSWSEGKVHANRSAMTDF 866 Query: 1405 SGDRKASFSPSQETMIGGCNKQFNSPCIDSQTA-VVKLEITEKSDEVGKYPPSPHSVSGK 1581 + +R+ S SPS+ET G C FNS C DSQ A +K + EK E+ K +P +V K Sbjct: 867 NRERRPSSSPSEETTTGEC---FNSSCTDSQMAGNLKSGVNEKLVEMAKSAAAPCNVFEK 923 Query: 1582 AIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSVLGDKVTNAGASLEDQKSTVEVCS 1761 A DGE S+QF +D + GG G+S+ DKVTN S+E K V + + Sbjct: 924 ASDGEQSRQFH-EEKVISTKTLDNVLDGESGGHGSSIGEDKVTNGLVSIEGLKRPVGISA 982 Query: 1762 SKFVSENKNSVSRVLNTASTEMKPSYVVVKSEKTGGSNKEERLPTSSSGDPTTVRGGRSD 1941 K+ ++KN VSRVL ASTE+KP VVVKSE T +KEE T SS D +GG SD Sbjct: 983 FKYEGDDKNDVSRVLGVASTEVKPPSVVVKSEATERGDKEELQQTGSSRDTIAGKGGHSD 1042 Query: 1942 EASINHVDLSEKTKSDHGTVEASV-EDKARVETDITTRN 2055 E N V SE+ SD TV+ SV EDKA E ++ RN Sbjct: 1043 EMDANSVLKSEQPNSDKKTVDTSVIEDKAASECNLAIRN 1081 >ref|XP_003634295.1| PREDICTED: uncharacterized protein LOC100248456 [Vitis vinifera] Length = 1631 Score = 483 bits (1243), Expect = e-133 Identities = 328/732 (44%), Positives = 423/732 (57%), Gaps = 35/732 (4%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLR HK+ EIQ+KARSLVDTWKKRVEAEMNI DAKSGS+QAV W S+ RL E SH ++ Sbjct: 383 NHLRSHKNLEIQKKARSLVDTWKKRVEAEMNINDAKSGSSQAVAWSSRPRLSEVSHGGNR 442 Query: 181 NPGGSNDVT-KSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSPASG----KEG 345 + GGS+++ KS+V Q S+SK A +K QGE KS S S G TK A+SPAS K+G Sbjct: 443 HSGGSSEIAMKSSVTQLSSSKTAPVKLVQGEIA-KSGSASQGFTKSATSPASVSTSLKDG 501 Query: 346 QPRVS-VGGSFDVPLA--REDKXXXXXXXXXXXXXXX----------GKEDGRSSTAVSM 486 Q RV+ G + D PL R++K GKED RSSTA+SM Sbjct: 502 QTRVAGAGNASDPPLTTVRDEKSSSSSQSHNNSQSCSSDHAKTVGFSGKEDARSSTAMSM 561 Query: 487 NSIKISTGGSRHRKSVDGYPGSSVSGSQKESPAGRGS--HRNPSSEKLPQV-MSGEKTVD 657 + K S G SRHRKSV+GYPG +VSG Q+E+ + R S RNP+SEK+ Q ++ +K D Sbjct: 562 
SVSKTSGGASRHRKSVNGYPGPAVSGVQRETGSSRSSSFQRNPASEKVSQSGLTCDKAFD 621 Query: 658 VPVIEGSGHKLIVKISNRGRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEK 837 VP +EG+ HKLIVKI NRGRSPAQSASGGS+EDP+ ++S+ASSPVLS K+DQ DR LKEK Sbjct: 622 VPTVEGNSHKLIVKIPNRGRSPAQSASGGSFEDPSMVNSQASSPVLSGKHDQSDRNLKEK 681 Query: 838 TDAYRSN--FDANSESWQSNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEVRAT 1011 +D YR+N D N+ESWQSNDFKD +TGSDEGDGSPA P+EERS+ DD+ K A+ Sbjct: 682 SDVYRANNTSDVNTESWQSNDFKDAMTGSDEGDGSPATLPDEERSRTGDDTRKIK--TAS 739 Query: 1012 CTSGSEPKSGKLHEASFSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSKSD 1191 +SG EPKSGKL EASF+SMNALIESCVK E+N + + D +GMNLLASVAA EM+K + Sbjct: 740 SSSGIEPKSGKLVEASFTSMNALIESCVK-CEANASVSVVDDVGMNLLASVAAGEMAKRE 798 Query: 1192 MVSPSVSPQRNNPAAEEACPGDDVKSKSPPSDISAGDCRND--DDGNREKLVSACASWSE 1365 VSP+ SP RN E++ G+D KSK DI +++ G+ EK W++ Sbjct: 799 SVSPADSPLRNTAVIEDSSAGNDAKSKPTGDDILREQSQSNYGPTGDTEKQ----GFWAK 854 Query: 1366 DKLHPSKGAVMELSGDRKASFSPSQETMIGGCNKQFNSPCID-SQTAVVKLEITEKSDEV 1542 D LH P N+ NS ID +T+ + EI KSDE Sbjct: 855 DGLH----------------HLPKHALTNRENNEHINSTSIDLVRTSELCSEINRKSDET 898 Query: 1543 ---GKYPPSPHSVSGKAIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSVLGDKVTN 1713 SP S + K D E KQ G D K +S+ DKV + Sbjct: 899 VVGASVTASPVSTTEKGSDDEQGKQLHEKKAAVDGVNVDGIPDTKPKVSSSSLAEDKVND 958 Query: 1714 AGASLEDQKSTVEVCSSKFVSENKNSVSRVLNTASTEMKPSYVVVKSEKTGGSNKEERLP 1893 +E ++ S + E KN+V+ LN TE KP ++ S+ G+ KE LP Sbjct: 959 VLPCVELKEEQSSYASLEPDGE-KNNVNEGLN---TEQKPPASMIPSDFVKGTEKEVPLP 1014 Query: 1894 TSSSGD--PTTV---RGGRSDEASI-NHVDLSEKTKSDHGTVEASVEDKARVETDITTRN 2055 + S D P V + ++DE + NH + E E +E K T R Sbjct: 1015 SGSGKDLVPENVDQMKAEKADEICVSNHANQME---------EQRIEPKNHASTAAEDRR 1065 Query: 2056 QQGEASVERKDI 2091 + E ++ K++ Sbjct: 1066 ELMEENLGNKEV 1077 >emb|CAN60153.1| hypothetical protein VITISV_021504 [Vitis vinifera] Length = 1688 Score = 480 bits (1236), Expect = e-133 Identities = 321/696 (46%), Positives = 412/696 (59%), Gaps = 35/696 (5%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 
180 NHLR HK+ EIQ+KARSLVDTWKKRVEAEMNI DAKSGS+QAV W S+ RL E SH ++ Sbjct: 427 NHLRSHKNLEIQKKARSLVDTWKKRVEAEMNINDAKSGSSQAVAWSSRPRLSEVSHGGNR 486 Query: 181 NPGGSNDVT-KSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSPASG----KEG 345 + GGS+++ KS+V Q S+SK A +K QGE KS S S G TK A+SPAS K+G Sbjct: 487 HSGGSSEIAMKSSVTQLSSSKTAPVKLVQGEIA-KSGSASQGFTKSATSPASVSTSLKDG 545 Query: 346 QPRVS-VGGSFDVPLA--REDKXXXXXXXXXXXXXXX----------GKEDGRSSTAVSM 486 Q RV+ G + D PL R++K GKED RSSTA+SM Sbjct: 546 QTRVAGAGNASDPPLTTVRDEKSSSSSQSHNNSQSCSSDHAKTVGFSGKEDARSSTAMSM 605 Query: 487 NSIKISTGGSRHRKSVDGYPGSSVSGSQKESPAGRGS--HRNPSSEKLPQV-MSGEKTVD 657 + K S G SRHRKSV+GYPG +VSG Q+E+ + R S RNP+SEK+ Q ++ +K D Sbjct: 606 SVSKTSGGASRHRKSVNGYPGPAVSGVQRETGSSRSSSFQRNPASEKVSQSGLTCDKAFD 665 Query: 658 VPVIEGSGHKLIVKISNRGRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEK 837 VP +EG+ HKLIVKI NRGRSPAQSASGGS+EDP+ ++S+ASSPVLS K+DQ DR LKEK Sbjct: 666 VPTVEGNSHKLIVKIPNRGRSPAQSASGGSFEDPSMVNSQASSPVLSGKHDQSDRNLKEK 725 Query: 838 TDAYRSN--FDANSESWQSNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEVRAT 1011 +D YR+N D N+ESWQSNDFKD +TGSDEGDGSPA P+EERS+ DD+ K A+ Sbjct: 726 SDVYRANNTSDVNTESWQSNDFKDAMTGSDEGDGSPATLPDEERSRTGDDTRKIK--TAS 783 Query: 1012 CTSGSEPKSGKLHEASFSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSKSD 1191 +SG EPKSGKL EASF+SMNALIESCVK E+N + + D +GMNLLASVAA EM+K + Sbjct: 784 SSSGIEPKSGKLVEASFTSMNALIESCVK-CEANASVSVVDDVGMNLLASVAAGEMAKRE 842 Query: 1192 MVSPSVSPQRNNPAAEEACPGDDVKSKSPPSDISAGDCRND--DDGNREKLVSACASWSE 1365 VSP+ SP RN E++ G+D KSK DI +++ G+ EK W++ Sbjct: 843 SVSPADSPLRNTAVIEDSSAGNDAKSKPTGDDILREQSQSNYGPTGDTEKQ----GFWAK 898 Query: 1366 DKLHPSKGAVMELSGDRKASFSPSQETMIGGCNKQFNSPCID-SQTAVVKLEITEKSDEV 1542 D LH P N+ NS ID +T+ + EI KSDE Sbjct: 899 DGLH----------------HLPKHALTNRENNEHINSTSIDLVRTSELCSEINRKSDET 942 Query: 1543 ---GKYPPSPHSVSGKAIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSVLGDKVTN 1713 SP S + K D E KQ G D K +S+ DKV + Sbjct: 943 VVGASVTASPVSTTEKGSDDEQGKQLHEKKAAVDGVNVDGIPDTKPKVSSSSLAEDKVND 1002 Query: 1714 
AGASLEDQKSTVEVCSSKFVSENKNSVSRVLNTASTEMKPSYVVVKSEKTGGSNKEERLP 1893 +E ++ S + E KN+V+ LN TE KP ++ S+ G+ KE LP Sbjct: 1003 VLPCVELKEEQSSYASLEPDGE-KNNVNEGLN---TEQKPPASMIPSDFVKGTEKEVPLP 1058 Query: 1894 TSSSGD--PTTV---RGGRSDEASI-NHVDLSEKTK 1983 + S D P V + ++DE + NH + E+ + Sbjct: 1059 SGSGKDLVPENVDQMKAEKADEICVSNHANQMEEQR 1094 >ref|XP_002318026.2| hypothetical protein POPTR_0012s07900g [Populus trichocarpa] gi|550326617|gb|EEE96246.2| hypothetical protein POPTR_0012s07900g [Populus trichocarpa] Length = 1624 Score = 467 bits (1202), Expect = e-129 Identities = 320/738 (43%), Positives = 409/738 (55%), Gaps = 42/738 (5%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 N+LR HK+ EIQ+KARSLVDTWKKRVEAEM+ + KSGSNQ V+W ++SRLPE SH ++ Sbjct: 383 NNLRTHKNLEIQKKARSLVDTWKKRVEAEMDA-NTKSGSNQGVSWTARSRLPEISHGGNR 441 Query: 181 NPGGSNDVT-KSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSPASG----KEG 345 G S++V KS VVQ SASK S+K QGET +SAS SPG + +SP S KE Sbjct: 442 QFGVSSEVAMKSTVVQLSASKTGSVKVVQGETVARSASTSPGPIRSTASPGSAGNNSKEA 501 Query: 346 QPR-VSVGGSFD--VPLAREDKXXXXXXXXXXXXXXX----------GKEDGRSSTAVSM 486 PR G+ D V +AR++K GKED RSSTA SM Sbjct: 502 HPRNTGASGASDPSVVVARDEKSSSSSQSHNNSQSCSSDHAKNGGVSGKEDARSSTAGSM 561 Query: 487 NSIKISTGGSRHRKSVDGYPGSSVSGSQKESPAGRGS--HRNPSSEKLPQV-MSGEKTVD 657 K+ RHRKS +G+PG ++SG QKE+ + R S H+N SEKL Q ++ EK +D Sbjct: 562 MVSKMVGVSLRHRKSGNGFPGQAMSGVQKETGSSRNSSLHKNLGSEKLSQSSLTCEKALD 621 Query: 658 VPVIEGSGHKLIVKISNRGRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEK 837 VPV EG+GHK IVKI NRGRSPAQSASGGS EDP+ M+SRASSPVLSEK+D FDR LKEK Sbjct: 622 VPVAEGNGHKFIVKIPNRGRSPAQSASGGSLEDPSVMNSRASSPVLSEKHDHFDRNLKEK 681 Query: 838 TDAYRSNF--DANSESWQSNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEVRAT 1011 DAYR+N D N+ESWQSNDFK++LTGSDEGDGSP P+EE + DDS K AE Sbjct: 682 NDAYRANITSDVNTESWQSNDFKEVLTGSDEGDGSPTTVPDEEHCRTGDDSRKLAEASKA 741 Query: 1012 CTSGS--EPKSGKLHEASFSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSK 1185 +S S E K KLH+ASFSSMNALIESC KYSE+N M + D IGMNLLASVAA EMSK Sbjct: 742 
TSSSSANEEKMVKLHDASFSSMNALIESCAKYSEANASMSVGDDIGMNLLASVAAGEMSK 801 Query: 1186 SDMVSPSVSPQRNNPAAEEACPGDDVKSKSPPSDISAGDCRNDDDGNREKLVSACASWSE 1365 SD VSP+ SP+RN P E +C G D + KS P + A D R + V E Sbjct: 802 SDTVSPTDSPRRNTPVVESSCAGSDARPKSSPGEDPAQD--------RGQFVDVVNDEHE 853 Query: 1366 DKLHPSKGAVMELSGDRKASFSPSQETMIGGCNKQFNSPCIDSQTAVVKLEITEKSDE-- 1539 + ++ + D K + SQE + G N QFNS +D Q E KS+E Sbjct: 854 KRAIVLGTSLAAKNFDGK-TILISQEKLKGQLNGQFNSSNMDVQQTSECPESNLKSEEVL 912 Query: 1540 --VGKYPPSPHSVSGKAIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSVLGDKVTN 1713 V PSP +V + DG Q + DG S +K+ Sbjct: 913 VSVSVAVPSPSTVEKASFDGGKEPQEDKGV-------------GRSNADGVSAAKEKLHR 959 Query: 1714 AGASLEDQKSTVEVCSSKFVSENKNSVSRV-LNTASTEMKPSYVVVK----SEKTGGSNK 1878 + +E+K +++R+ + T + SY +K + K N Sbjct: 960 S-----------------ITTEDKVNITRMEVGTEVNNISSSYPSIKLNGENNKNMNEND 1002 Query: 1879 EERLPT--------SSSGDPTTVRGGRSDEASINHVDLSEKTKSDHGTVEASVEDKARVE 2034 EE+ PT S G+ G D S N +D + ++ T + + E ++ Sbjct: 1003 EEKPPTKMHPELTKGSDGEVLQPYGSSKDMVSEN-MDEVKAERAGEATEKRNSEHESNTG 1061 Query: 2035 TDITTRNQQGEASVERKD 2088 D T N +GE +R++ Sbjct: 1062 PDAT--NNKGECVDDRQE 1077 >ref|XP_002511441.1| DNA binding protein, putative [Ricinus communis] gi|223550556|gb|EEF52043.1| DNA binding protein, putative [Ricinus communis] Length = 1712 Score = 462 bits (1189), Expect = e-127 Identities = 320/722 (44%), Positives = 411/722 (56%), Gaps = 41/722 (5%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLR HK+ EIQ+KARSLVDTWKKRVEAEM DAKSGSNQAV+W ++ RLPE SH ++ Sbjct: 475 NHLRTHKNLEIQKKARSLVDTWKKRVEAEM---DAKSGSNQAVSWAARPRLPEVSHGGNR 531 Query: 181 NPGGSNDVT-KSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSPASG----KEG 345 + S++V KS+ Q SASK +K QGET TKS S SPGS K A S AS K+G Sbjct: 532 HLSASSEVAMKSSAAQISASKNTPVKLVQGETATKSTSASPGSLKSAPSSASVGNNIKDG 591 Query: 346 QPR-VSVGGSFDVPL--AREDKXXXXXXXXXXXXXXX----------GKEDGRSSTAVSM 486 QPR V G + PL A ++K GKED RSSTA+SM Sbjct: 592 QPRNTGVNGGSEPPLTVAGDEKSSSSSQSPNNSQSCSSDHGKTGGYSGKEDARSSTAISM 651 Query: 487 
NSIKISTGGSRHRKSVDGYPGSSVSGSQKESPAGRGS--HRNPSSEKLP-QVMSGEKTVD 657 + KI G SRHRKS +G+PG + SG QKE + R S HRNP SEKLP ++ EK VD Sbjct: 652 TANKIIGGSSRHRKSANGFPGHTSSGVQKEIGSSRNSSSHRNPGSEKLPLSSLTCEKAVD 711 Query: 658 VPVIEGSGHKLIVKISNRGRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEK 837 VPV EG+ HKLIVK+SNRGRSPA+S SGGS+EDP+ M+SRASSPVLSEK+D LKEK Sbjct: 712 VPVAEGNNHKLIVKLSNRGRSPARSGSGGSFEDPSVMNSRASSPVLSEKHD-----LKEK 766 Query: 838 TDAYRSNF--DANSESWQSNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEV--R 1005 D YR+N D N+ESWQSND K+ LTGSDEGDGSPA P+E+ S+ DD+ K E+ Sbjct: 767 NDVYRANTVSDVNNESWQSNDSKEFLTGSDEGDGSPATVPDEDNSRTGDDTRKLIEIPKA 826 Query: 1006 ATCTSGSEPKSGKLHEASFSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSK 1185 A+ +SG+E KSGKLHEASFSS+NALIESCVKYSE+N M + D +GMNLLASVAA EMSK Sbjct: 827 ASSSSGNERKSGKLHEASFSSINALIESCVKYSEANASMSVGDDVGMNLLASVAAGEMSK 886 Query: 1186 SDMVSPSVSPQRNNPAAEEACPGDDVKSKSPPSDISAGDCRNDDDGNREK----LVSACA 1353 SDM SPS SPQRN E + D++ KS P D A + D EK L ++ Sbjct: 887 SDMASPSPSPQRNVTVPEHSYTSTDLRMKSSPIDSLALNRGQSVDDEHEKGTTILSNSLV 946 Query: 1354 SWSEDKLHPSKGAVMELSGDRKASFSPSQETMIGGCNKQFNSPCIDSQTAVVKLEITEKS 1533 +EDK P + + +GD A + S +Q PCI+S VK E T Sbjct: 947 MNTEDK--PILISHEQPTGDHNAHLNSSIMDA-----QQVAEPCIESN---VKSEETSVG 996 Query: 1534 DEVGKYPPSPHSVSGKAIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSVLGDKVTN 1713 + PS +V K +DG + ++ V KL G S +++ N Sbjct: 997 TSLAL--PSASAVD-KTVDGGGTGTWEE------------KVRGKLNACGLSDAKEELCN 1041 Query: 1714 AGASLEDQKSTVEVCSSKFV------------SENKNSVSRVLNTASTEMKPSYVVVKSE 1857 + + E V + V + K ++ + ++ E KP+ +++ Sbjct: 1042 SFENEEKVDRLAVVGTEAAVRPSPLPSMEINSEKKKKMINELKSSVQAEQKPAAMML--- 1098 Query: 1858 KTGGSNKEERLPTSSSGDPTTVRGGRSDEASINHVDLSEKTKSDHGTVEASVEDKARVET 2037 +G +N E L S SGD S++ V K++ G+ V+ K E+ Sbjct: 1099 -SGSTNGREVLQHSESGDDMV-------SGSVSEVKGENTVKTEGGSQSLGVQ-KTEKES 1149 Query: 2038 DI 2043 +I Sbjct: 1150 NI 1151 >gb|EOY20638.1| BAH domain,TFIIS helical bundle-like domain isoform 5 [Theobroma cacao] Length = 1583 Score = 459 bits (1182), Expect = e-126 Identities = 317/704 (45%), 
Positives = 404/704 (57%), Gaps = 35/704 (4%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLR HK+ EIQ+KAR LVDTWKKRVEAEM DAKSGSNQAV W ++ R+ E SHS SK Sbjct: 343 NHLRSHKNLEIQKKARGLVDTWKKRVEAEM---DAKSGSNQAVPWSARPRISEVSHSGSK 399 Query: 181 NPGGSNDVTKSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSPASG----KEGQ 348 + G S KS+V QFSASK S+K +QGET TKSAS SPGS K A+SP S K+GQ Sbjct: 400 HSGSSEVAVKSSVTQFSASKTGSVKLAQGETPTKSASASPGSMKAATSPVSASTNLKDGQ 459 Query: 349 PR--VSVGGSFDVPLAREDKXXXXXXXXXXXXXXX----------GKEDGRSSTAVSMNS 492 R +VG S AR++K GKE+ RSS A S Sbjct: 460 ARNATAVGTSDPQTTARDEKSSSSSQSHNNSQSCSSDHAKTGGVSGKEEARSSAAGSGTV 519 Query: 493 IKISTGGSRHRKSVDGYPGSSVSGSQKESPAGRGS--HRNPSSEKLPQV-MSGEKTVDVP 663 KIS SRHRKS++G+PGSS G Q+E+ + + S HRNP+SEK+ Q ++ EK VD P Sbjct: 520 TKISGSSSRHRKSINGFPGSS--GVQRETGSSKNSSLHRNPASEKISQSGLTCEKAVDAP 577 Query: 664 VIEGSGHKLIVKISNRGRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEKTD 843 + EG+ HK IVKI NRGRSPAQS SGGS ED + M+SRASSPVLSEK++Q DR KEK++ Sbjct: 578 MAEGNSHKFIVKIPNRGRSPAQSVSGGSLEDLSVMNSRASSPVLSEKHEQSDRNTKEKSE 637 Query: 844 AYRSNF--DANSESWQSNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEVRATCT 1017 YR+N D N+ESWQSNDFKD+LTGSDEGDGSPAA P+EE +I +D+ K+ EV T + Sbjct: 638 TYRANVTTDVNTESWQSNDFKDVLTGSDEGDGSPAAVPDEEHCRIGEDARKTTEVTKTAS 697 Query: 1018 S--GSEPKSGKLHEASFSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSKSD 1191 S G+E KSGKL EASFSS+NALI+SCVKYSE+N M + D GMNLLASVAA E+SKSD Sbjct: 698 SSSGNELKSGKLQEASFSSINALIDSCVKYSEANACMPVGDDAGMNLLASVAAGEISKSD 757 Query: 1192 MVSPSVSPQRNNPAAEEACPGDDVKSKSPPSDISAGDCRNDDDGNREKLVSACASWSEDK 1371 + SP SPQRN P E + G+D + K SAGD D +R + V D Sbjct: 758 VASPIDSPQRNTPVVEHSSTGNDTRLKP-----SAGD---DVVRDRHQSVEGA-----DD 804 Query: 1372 LHPSKGAVMELSGDRKA--SFSPSQETMIGGCNKQFNSPCID-SQTAVVKLEITEKSDEV 1542 H +G V S + A SQE G N+ S + QTA LE + + V Sbjct: 805 EHLKQGTVAGNSWAKNADCKTGSSQEKSGGELNEHLISSSMGLPQTADQCLENGKLKEIV 864 Query: 1543 GKYPPSPHSVSGKAIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSVLGDKVTNAGA 1722 + S S ++ + ++D K G + V DKV + G Sbjct: 865 
AAALVNLPSGSTVEKTTDVGDSKEHLEKKAGGVDDDSSLDTKQKGSTSLVNEDKVVDPGV 924 Query: 1723 SLEDQ--KSTVEVCSSKFVSENKNSVSRVLNTASTEMKPSYVVVKSEKTGGSNKEERLPT 1896 +E + + V S + E+K +V+ L+ S + + V T G++KE P Sbjct: 925 KVEKEAVDGSSSVPSMEVDVEDKKNVTEGLD-RSLQTHENSAAVTGNSTKGADKEASPPG 983 Query: 1897 SSS-------GDPTTVRGGRSDEASINHVDLSEKTKSDHGTVEA 2007 S+ G+ + +D S HV +EK K + TV A Sbjct: 984 SAKDIVLEKVGEVKLEKDVETDARS--HVAHTEKQKPEWETVTA 1025 >gb|EOY20637.1| BAH domain,TFIIS helical bundle-like domain isoform 4 [Theobroma cacao] Length = 1442 Score = 459 bits (1182), Expect = e-126 Identities = 317/704 (45%), Positives = 404/704 (57%), Gaps = 35/704 (4%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLR HK+ EIQ+KAR LVDTWKKRVEAEM DAKSGSNQAV W ++ R+ E SHS SK Sbjct: 202 NHLRSHKNLEIQKKARGLVDTWKKRVEAEM---DAKSGSNQAVPWSARPRISEVSHSGSK 258 Query: 181 NPGGSNDVTKSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSPASG----KEGQ 348 + G S KS+V QFSASK S+K +QGET TKSAS SPGS K A+SP S K+GQ Sbjct: 259 HSGSSEVAVKSSVTQFSASKTGSVKLAQGETPTKSASASPGSMKAATSPVSASTNLKDGQ 318 Query: 349 PR--VSVGGSFDVPLAREDKXXXXXXXXXXXXXXX----------GKEDGRSSTAVSMNS 492 R +VG S AR++K GKE+ RSS A S Sbjct: 319 ARNATAVGTSDPQTTARDEKSSSSSQSHNNSQSCSSDHAKTGGVSGKEEARSSAAGSGTV 378 Query: 493 IKISTGGSRHRKSVDGYPGSSVSGSQKESPAGRGS--HRNPSSEKLPQV-MSGEKTVDVP 663 KIS SRHRKS++G+PGSS G Q+E+ + + S HRNP+SEK+ Q ++ EK VD P Sbjct: 379 TKISGSSSRHRKSINGFPGSS--GVQRETGSSKNSSLHRNPASEKISQSGLTCEKAVDAP 436 Query: 664 VIEGSGHKLIVKISNRGRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEKTD 843 + EG+ HK IVKI NRGRSPAQS SGGS ED + M+SRASSPVLSEK++Q DR KEK++ Sbjct: 437 MAEGNSHKFIVKIPNRGRSPAQSVSGGSLEDLSVMNSRASSPVLSEKHEQSDRNTKEKSE 496 Query: 844 AYRSNF--DANSESWQSNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEVRATCT 1017 YR+N D N+ESWQSNDFKD+LTGSDEGDGSPAA P+EE +I +D+ K+ EV T + Sbjct: 497 TYRANVTTDVNTESWQSNDFKDVLTGSDEGDGSPAAVPDEEHCRIGEDARKTTEVTKTAS 556 Query: 1018 S--GSEPKSGKLHEASFSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSKSD 1191 S G+E KSGKL EASFSS+NALI+SCVKYSE+N M + D GMNLLASVAA E+SKSD 
Sbjct: 557 SSSGNELKSGKLQEASFSSINALIDSCVKYSEANACMPVGDDAGMNLLASVAAGEISKSD 616 Query: 1192 MVSPSVSPQRNNPAAEEACPGDDVKSKSPPSDISAGDCRNDDDGNREKLVSACASWSEDK 1371 + SP SPQRN P E + G+D + K SAGD D +R + V D Sbjct: 617 VASPIDSPQRNTPVVEHSSTGNDTRLKP-----SAGD---DVVRDRHQSVEGA-----DD 663 Query: 1372 LHPSKGAVMELSGDRKA--SFSPSQETMIGGCNKQFNSPCID-SQTAVVKLEITEKSDEV 1542 H +G V S + A SQE G N+ S + QTA LE + + V Sbjct: 664 EHLKQGTVAGNSWAKNADCKTGSSQEKSGGELNEHLISSSMGLPQTADQCLENGKLKEIV 723 Query: 1543 GKYPPSPHSVSGKAIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSVLGDKVTNAGA 1722 + S S ++ + ++D K G + V DKV + G Sbjct: 724 AAALVNLPSGSTVEKTTDVGDSKEHLEKKAGGVDDDSSLDTKQKGSTSLVNEDKVVDPGV 783 Query: 1723 SLEDQ--KSTVEVCSSKFVSENKNSVSRVLNTASTEMKPSYVVVKSEKTGGSNKEERLPT 1896 +E + + V S + E+K +V+ L+ S + + V T G++KE P Sbjct: 784 KVEKEAVDGSSSVPSMEVDVEDKKNVTEGLD-RSLQTHENSAAVTGNSTKGADKEASPPG 842 Query: 1897 SSS-------GDPTTVRGGRSDEASINHVDLSEKTKSDHGTVEA 2007 S+ G+ + +D S HV +EK K + TV A Sbjct: 843 SAKDIVLEKVGEVKLEKDVETDARS--HVAHTEKQKPEWETVTA 884 >gb|EOY20634.1| BAH domain,TFIIS helical bundle-like domain isoform 1 [Theobroma cacao] gi|508773379|gb|EOY20635.1| BAH domain,TFIIS helical bundle-like domain isoform 1 [Theobroma cacao] gi|508773380|gb|EOY20636.1| BAH domain,TFIIS helical bundle-like domain isoform 1 [Theobroma cacao] gi|508773383|gb|EOY20639.1| BAH domain,TFIIS helical bundle-like domain isoform 1 [Theobroma cacao] Length = 1630 Score = 459 bits (1182), Expect = e-126 Identities = 317/704 (45%), Positives = 404/704 (57%), Gaps = 35/704 (4%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLR HK+ EIQ+KAR LVDTWKKRVEAEM DAKSGSNQAV W ++ R+ E SHS SK Sbjct: 390 NHLRSHKNLEIQKKARGLVDTWKKRVEAEM---DAKSGSNQAVPWSARPRISEVSHSGSK 446 Query: 181 NPGGSNDVTKSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSPASG----KEGQ 348 + G S KS+V QFSASK S+K +QGET TKSAS SPGS K A+SP S K+GQ Sbjct: 447 HSGSSEVAVKSSVTQFSASKTGSVKLAQGETPTKSASASPGSMKAATSPVSASTNLKDGQ 506 Query: 349 
PR--VSVGGSFDVPLAREDKXXXXXXXXXXXXXXX----------GKEDGRSSTAVSMNS 492 R +VG S AR++K GKE+ RSS A S Sbjct: 507 ARNATAVGTSDPQTTARDEKSSSSSQSHNNSQSCSSDHAKTGGVSGKEEARSSAAGSGTV 566 Query: 493 IKISTGGSRHRKSVDGYPGSSVSGSQKESPAGRGS--HRNPSSEKLPQV-MSGEKTVDVP 663 KIS SRHRKS++G+PGSS G Q+E+ + + S HRNP+SEK+ Q ++ EK VD P Sbjct: 567 TKISGSSSRHRKSINGFPGSS--GVQRETGSSKNSSLHRNPASEKISQSGLTCEKAVDAP 624 Query: 664 VIEGSGHKLIVKISNRGRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEKTD 843 + EG+ HK IVKI NRGRSPAQS SGGS ED + M+SRASSPVLSEK++Q DR KEK++ Sbjct: 625 MAEGNSHKFIVKIPNRGRSPAQSVSGGSLEDLSVMNSRASSPVLSEKHEQSDRNTKEKSE 684 Query: 844 AYRSNF--DANSESWQSNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEVRATCT 1017 YR+N D N+ESWQSNDFKD+LTGSDEGDGSPAA P+EE +I +D+ K+ EV T + Sbjct: 685 TYRANVTTDVNTESWQSNDFKDVLTGSDEGDGSPAAVPDEEHCRIGEDARKTTEVTKTAS 744 Query: 1018 S--GSEPKSGKLHEASFSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSKSD 1191 S G+E KSGKL EASFSS+NALI+SCVKYSE+N M + D GMNLLASVAA E+SKSD Sbjct: 745 SSSGNELKSGKLQEASFSSINALIDSCVKYSEANACMPVGDDAGMNLLASVAAGEISKSD 804 Query: 1192 MVSPSVSPQRNNPAAEEACPGDDVKSKSPPSDISAGDCRNDDDGNREKLVSACASWSEDK 1371 + SP SPQRN P E + G+D + K SAGD D +R + V D Sbjct: 805 VASPIDSPQRNTPVVEHSSTGNDTRLKP-----SAGD---DVVRDRHQSVEGA-----DD 851 Query: 1372 LHPSKGAVMELSGDRKA--SFSPSQETMIGGCNKQFNSPCID-SQTAVVKLEITEKSDEV 1542 H +G V S + A SQE G N+ S + QTA LE + + V Sbjct: 852 EHLKQGTVAGNSWAKNADCKTGSSQEKSGGELNEHLISSSMGLPQTADQCLENGKLKEIV 911 Query: 1543 GKYPPSPHSVSGKAIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSVLGDKVTNAGA 1722 + S S ++ + ++D K G + V DKV + G Sbjct: 912 AAALVNLPSGSTVEKTTDVGDSKEHLEKKAGGVDDDSSLDTKQKGSTSLVNEDKVVDPGV 971 Query: 1723 SLEDQ--KSTVEVCSSKFVSENKNSVSRVLNTASTEMKPSYVVVKSEKTGGSNKEERLPT 1896 +E + + V S + E+K +V+ L+ S + + V T G++KE P Sbjct: 972 KVEKEAVDGSSSVPSMEVDVEDKKNVTEGLD-RSLQTHENSAAVTGNSTKGADKEASPPG 1030 Query: 1897 SSS-------GDPTTVRGGRSDEASINHVDLSEKTKSDHGTVEA 2007 S+ G+ + +D S HV +EK K + TV A Sbjct: 1031 SAKDIVLEKVGEVKLEKDVETDARS--HVAHTEKQKPEWETVTA 1072 >ref|XP_002321574.2| hypothetical protein POPTR_0015s08400g [Populus 
trichocarpa] gi|566206600|ref|XP_002321573.2| hypothetical protein POPTR_0015s08400g [Populus trichocarpa] gi|550322306|gb|EEF05701.2| hypothetical protein POPTR_0015s08400g [Populus trichocarpa] gi|550322307|gb|EEF05700.2| hypothetical protein POPTR_0015s08400g [Populus trichocarpa] Length = 1633 Score = 456 bits (1174), Expect = e-125 Identities = 311/721 (43%), Positives = 411/721 (57%), Gaps = 28/721 (3%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 N LR HK+ EIQ+KARSLVDTWKKRVEAEM+ +AKS SNQ V+WP++SRL E H ++ Sbjct: 381 NLLRTHKNLEIQKKARSLVDTWKKRVEAEMDA-NAKSASNQGVSWPARSRLSEVPHGGNR 439 Query: 181 NPGGSNDVT-KSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSPAS----GKEG 345 G S++V KS+VVQ SASK S+K QG+T TKSAS SPG + +SP S KE Sbjct: 440 QSGVSSEVAMKSSVVQLSASKTGSVKAVQGDTVTKSASTSPGPVRSTTSPGSVGNNSKET 499 Query: 346 QPRVSVGGSFDVP---LAREDKXXXXXXXXXXXXXXX----------GKEDGRSSTAVSM 486 QPR + + P +AR++K GKED RSSTA SM Sbjct: 500 QPRNTGASAASDPSPTVARDEKSSSSSPSHNNSQSCSSDHAKTGGFSGKEDARSSTAGSM 559 Query: 487 NSIKISTGGSRHRKSVDGYPGSSVSGSQKESPAGRGS--HRNPSSEKLPQV-MSGEKTVD 657 + KI G RHRKSV+G+PG ++SG QKE+ + R S HRN SEKL ++ EK +D Sbjct: 560 TANKIIVGSLRHRKSVNGFPGQALSGVQKETGSSRNSSLHRNSGSEKLSHSSLTCEKALD 619 Query: 658 VPVIEGSGHKLIVKISNRGRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEK 837 VP+ EG+GHK IVKI NRGRSPAQS+SGG++ED + M+SRASSPV+SE++DQFD LKEK Sbjct: 620 VPMTEGNGHKFIVKIPNRGRSPAQSSSGGTFEDASVMNSRASSPVISERHDQFDHNLKEK 679 Query: 838 TDAYRSNF--DANSESWQSNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEV-RA 1008 D+YR+N D +ESWQSNDFK++LTGSDEG GSPA P+EE +I DD KS EV +A Sbjct: 680 NDSYRANITSDVKTESWQSNDFKEVLTGSDEGVGSPATVPDEEHGRIGDDGRKSGEVSKA 739 Query: 1009 TCTSG-SEPKSGKLHEASFSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSK 1185 T TS E K GKL++ASFSSMNALIESC KYSE N + + D GMNLLASVAA EMSK Sbjct: 740 TPTSTVCEHKLGKLNDASFSSMNALIESCAKYSEGNASLSVGDDGGMNLLASVAAGEMSK 799 Query: 1186 SDMVSPSVSPQRNNPAAEEACPGDDVKSKSPPSDISAGDCRNDDDGNREKLVSACASWSE 1365 SDMVSP+ SP+RN P E C +++KS P D A DG + + Sbjct: 800 
SDMVSPTGSPRRNMP-IEHPCVPSGLRAKSSPCDDPAQSQGKPVDG---------VDYED 849 Query: 1366 DKLHPSKGAVMELSGDRKASFSPSQETMIGGCNKQFNSPCID-SQTAVVKLEITEKSDEV 1542 +K + G + + + K SQE G N NS +D QTA LE KS+E Sbjct: 850 EKRGITVGTSLSKNTEAKTVLF-SQEKSTGELNGPPNSSHVDVQQTAKRCLESYLKSEET 908 Query: 1543 GKYPPSPHSVSGKAIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSVLGDKVTNAG- 1719 S S + K + + ++ KL G SV D + N G Sbjct: 909 LVAAVSSASTAVKTSNCGGKEPWEKEDGGRSNVDGISDDKEKLHG---SVFND-INNTGV 964 Query: 1720 -ASLEDQKSTVEVCSSKFVSENKNSVSRVLNTASTEMKPSYVVVKSEKTGGSNKEERLPT 1896 ++E + + +F +ENK ++++ LN + ++ S+ G+ E P+ Sbjct: 965 QVAIEAMEGSSSNHRVEFDAENKKNINKELNISIKAEPAPPAIMLSDFAKGTINEVLQPS 1024 Query: 1897 SSSGDPTTVRGGRSDEASINHVDLSEKTKSDHGTVEASVEDKARVETDITTRNQQGEASV 2076 SS D D +++ V E H T + +E+++ + T + +GE V Sbjct: 1025 SSGKD--------MDSENLHEVKAGETDGRSHSTEKNKIENESNTASAAT--DHEGECKV 1074 Query: 2077 E 2079 E Sbjct: 1075 E 1075 >ref|XP_002511444.1| conserved hypothetical protein [Ricinus communis] gi|223550559|gb|EEF52046.1| conserved hypothetical protein [Ricinus communis] Length = 1651 Score = 455 bits (1170), Expect = e-125 Identities = 263/490 (53%), Positives = 327/490 (66%), Gaps = 32/490 (6%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLR HK EIQ+KAR+LVDTWKKRVEAEM DA+SGSN AV+W ++ RLPE SH +++ Sbjct: 395 NHLRTHKHLEIQKKARTLVDTWKKRVEAEM---DARSGSNTAVSWAARPRLPEVSHGVNR 451 Query: 181 NPGGSNDVT-KSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSPASG----KEG 345 + G ++++ KS+V QFSASK +K Q ET KS ++SPGS KP S AS KEG Sbjct: 452 HSGAASEIAMKSSVAQFSASKNTPVKIGQMETMAKSLAVSPGSMKPVPSSASAGNSTKEG 511 Query: 346 QPR-VSVGGSFDVP--LAREDKXXXXXXXXXXXXXXX----------GKEDGRSSTAVSM 486 Q R VGG+ D+P R++K GKED RSSTAVSM Sbjct: 512 QVRNTGVGGASDLPSIATRDEKSSSSSQSHNNSQSCSSDHAKNGGVSGKEDARSSTAVSM 571 Query: 487 NSIKISTGGSRHRKSVDGYPGSSVSGSQKESPAGRGS--HRNPSSEKLPQV-MSGEKTVD 657 + K G SRHRKSV+G+ G +G Q++S + R + HR +EKL Q ++ +K VD Sbjct: 572 AANKTIGGSSRHRKSVNGFQGGGATGIQRDSGSSRNASLHRIQGAEKLSQSSLTCDKAVD 631 Query: 658 
VPVIEGSGHKLIVKISNRGRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEK 837 VP+ EG+ HKLIVKI NRGRSPAQSASGGS+EDP+ M+SRASSPVLS+K++Q DR LKEK Sbjct: 632 VPIAEGNNHKLIVKIPNRGRSPAQSASGGSFEDPSVMNSRASSPVLSDKHEQLDRNLKEK 691 Query: 838 TDAYRSNF--DANSESWQSNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEV--R 1005 D YR+N D N+ESWQSNDFK++LTGSDEGDGSPA AP+EE + DD K A+ Sbjct: 692 NDVYRTNVVSDVNNESWQSNDFKEVLTGSDEGDGSPAIAPDEENCRPGDDQRKLADAPKA 751 Query: 1006 ATCTSGSEPKSGKLHEASFSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSK 1185 A+ +SG+E K+GKLHE SFSSMNALIESCVKYSE PM + D +GMNLLA+VAA EMSK Sbjct: 752 ASSSSGNEHKTGKLHEGSFSSMNALIESCVKYSEVTAPMSVGDDVGMNLLATVAAGEMSK 811 Query: 1186 SDMVSPSVSPQRNNPAAEEACPGDDVKSKSPPSDISAGDCRNDDDG------NREKLV-S 1344 SDM SP SPQ N E C +D + KS P D D R DG NR+ ++ S Sbjct: 812 SDMASPKHSPQTNTTVVEHHCTSNDGRLKSSPGDNLPRDRRQSVDGVDDEHENRDSVIGS 871 Query: 1345 ACASWSEDKL 1374 + +EDK+ Sbjct: 872 SLPKITEDKI 881 >gb|EXC31170.1| hypothetical protein L484_004936 [Morus notabilis] Length = 1455 Score = 443 bits (1139), Expect = e-121 Identities = 320/724 (44%), Positives = 408/724 (56%), Gaps = 41/724 (5%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLR HK++EIQ+KARSLVDTWKKRVEAEMNI D KSGSNQ V+WP +SR PE + K Sbjct: 204 NHLRSHKNSEIQKKARSLVDTWKKRVEAEMNINDMKSGSNQVVSWPGRSR-PEVGN---K 259 Query: 181 NPGGSNDVT-KSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSPASG----KEG 345 +PGGS+D+ KSA F A+K S+K GE+TT+SAS SPGS K SPAS K+G Sbjct: 260 HPGGSSDIAIKSAYANFQATKYPSVKLVPGESTTRSASASPGSMKSVPSPASASTNLKDG 319 Query: 346 QPR-VSVGGSF-DVPL--AREDKXXXXXXXXXXXXXXX---------GKEDGRSSTAVSM 486 PR GGS DVPL AR++K GK++ RSS++ SM Sbjct: 320 HPRNTGAGGSMSDVPLTTARDEKSSSSSQSHNNSQSCSNDHARTGISGKDEARSSSSGSM 379 Query: 487 NSIKISTGGSRHRKSVDGYPGSSVSGSQKESPAGRGS--HRNPSSEKLPQV-MSGEKTVD 657 N+ K S G SR RKSV+G GS +SGSQ+ES GR S H+N + EK ++ EK VD Sbjct: 380 NANKASGGSSRPRKSVNGIQGS-LSGSQRESWTGRNSSLHKNAAVEKSSHSGLTSEKVVD 438 Query: 658 VPVIEGSGHKLIVKISNRGRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEK 837 EG+ HKLIVKI NRGRSP+QSA GGS++DPT 
+SSRASSPVL EK+DQFDR+LKEK Sbjct: 439 GATAEGNSHKLIVKIPNRGRSPSQSA-GGSFDDPTIISSRASSPVLREKHDQFDRSLKEK 497 Query: 838 TDAYRSN--FDANSESWQSNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEVRAT 1011 +DAYR+ D N+ESWQSNDFKD+LT SDEGDGSPA +EER + D++ K+ EV T Sbjct: 498 SDAYRATGASDVNAESWQSNDFKDVLTASDEGDGSPATMTDEERCRTGDENKKAVEVSKT 557 Query: 1012 CTS--GSEPKSGKLHEASFSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSK 1185 +S G+E KSG EASFSS+NALIESCVKYSE N + D +GMNLLASVAA E+SK Sbjct: 558 ASSSSGNEHKSGNFQEASFSSINALIESCVKYSEGNTSISAVDDLGMNLLASVAAGEISK 617 Query: 1186 SDMVSPSVSPQRNNPAAEEACPGDDVKSKSPPSD------ISAGDCRNDDDGNREK-LVS 1344 SD+VSPS SPQR+ P E G+D K K P+D +GD +D+ G V+ Sbjct: 618 SDLVSPSRSPQRDTP-VELPGTGNDSKVKLIPADDLCRNQSRSGDVTDDEHGKHSSDSVN 676 Query: 1345 ACASWSEDKLHPSKGAVMELSGDRKASFSPSQETMIGGCNKQFNSPCIDSQTAVVKLEIT 1524 A +DK +V+ G K+ + N +++ D Q A E Sbjct: 677 LEAKDGDDK------SVLCFEGKPKSKHTG---------NIEYSG--ADFQQAEGDEESN 719 Query: 1525 EKSDEVGKYP----PSPHSVSGKAIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSV 1692 KS+EV P PS S D E K Q G +D K + + Sbjct: 720 GKSNEVILAPVLASPSKTSEKTAGADSEEGKPTQ-EKLAVGGVNADGNLDVKHNRTDSLL 778 Query: 1693 LGDKVTNAGASLEDQKSTVEVCSSKFVSENKNSVSRVLNTA-----STEMKPSYVVVKSE 1857 DK + G++ + K++VE S E + LN T+ KP VVKS Sbjct: 779 REDKAGDGGSN-NEVKASVEESYSCPAIETDAKIKYCLNEGMDSILQTDEKPPVSVVKS- 836 Query: 1858 KTGGSNKEERLPTSSSGDPTTVRGGRSDEASINHVDLSEKTKSDHGTVEASVEDKARVET 2037 K+ E LP+ D + + + VD + K + AS + RV Sbjct: 837 KSVKETCEGMLPSDLGKDLVSEKAHEVKMEKPDTVDTRSENKRTDPEINASTTPENRVVA 896 Query: 2038 DITT 2049 +T+ Sbjct: 897 GVTS 900 >gb|EMJ11634.1| hypothetical protein PRUPE_ppa000152mg [Prunus persica] Length = 1613 Score = 442 bits (1136), Expect = e-121 Identities = 306/728 (42%), Positives = 412/728 (56%), Gaps = 32/728 (4%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLR HK+ EIQ+KARSLVDTWKKRV+AEM DA S N AV+W ++ RL EAS+ ++ Sbjct: 361 NHLRTHKNLEIQKKARSLVDTWKKRVQAEM---DANSNVNPAVSWSARPRLSEASNGGNR 417 Query: 181 NPGGSNDVT-KSAVVQFSASKMASIKTSQGETTTKSASLSPGSTK---PASSPASGKEGQ 
348 + GGS DV KS+V Q S SK AS+K QG++ TKSAS SPGS P S+ ++ K+GQ Sbjct: 418 HSGGSTDVAVKSSVTQLSVSKSASVKLVQGDSVTKSASASPGSKSVPSPVSASSNLKDGQ 477 Query: 349 PR-VSVGGSFDVPLA--REDKXXXXXXXXXXXXXXX----------GKEDGRSSTAVSMN 489 R V+VG + D+PL R++K GKED RSSTA SMN Sbjct: 478 SRIVAVGVTVDLPLTTPRDEKSSSSSQSHNNSQSCSNDHARTGGVSGKEDARSSTAGSMN 537 Query: 490 SIKISTGGSRHRKSVDGYPGSSVSGSQKESPAGRGS--HRNPSSEKLPQV-MSGEKTVDV 660 KIS G SR RKS++G+PGS++SG Q+E+ + R S H++P EK Q ++ EK +D Sbjct: 538 VNKISGGSSRPRKSINGFPGSALSGVQRETVSSRSSSLHKSPPPEKSSQPGLASEKVLDG 597 Query: 661 PVIEGSGHKLIVKISNRGRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEKT 840 EG+ HKLIVKI NRGRSPAQS SGGS+EDP+NM+SRASSP+ EK+DQ DR++KEK Sbjct: 598 SAAEGNSHKLIVKIPNRGRSPAQSGSGGSFEDPSNMNSRASSPMQLEKHDQLDRSVKEKA 657 Query: 841 DAYRSNF--DANSESWQSNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEV-RAT 1011 D YR+ D N+ESWQSNDFKD+LTGSDEGDGSPAA EE + D+S K AEV +A Sbjct: 658 DVYRATVTSDVNNESWQSNDFKDVLTGSDEGDGSPAAVTAEEDCRAGDNSKKIAEVPKAA 717 Query: 1012 CTSGSEPKSGKLHEASFSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSKSD 1191 +S KS L EASFSSM+ALIESCVKYSE N + D +GMNLLASVAA EMSKS+ Sbjct: 718 SSSSGNEKSDNLQEASFSSMHALIESCVKYSEGNAS--VGDDLGMNLLASVAAGEMSKSE 775 Query: 1192 MVSPSVSPQRNNPAAEEACPGDDVKSKSPPSDISAGDCRNDDDGNREKLVSACASWSEDK 1371 SP+ SPQR+ P +E C G+D + KSPP D A D +DG ++ E Sbjct: 776 --SPTDSPQRSTPVSEHLCEGNDSRVKSPPVDELARDESQSNDGADDEYQK---HGFEST 830 Query: 1372 LHPSKGAVMELSGDRKASFSPSQETMIGGCNKQFNSPCIDSQTAVVKLEITEKSDEVGKY 1551 +K V+ K+S Q ++ + S ++A + E EKS EV Sbjct: 831 TSGAKNGVV------KSSSVCEQNSVAEDPRNLYYSSVSIQRSAGLSPENKEKSSEVSLA 884 Query: 1552 P---PSPHSVSGKAIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSVLGDKVTNAGA 1722 P SP S K ++G+ K Q G D K G G G+KV++ + Sbjct: 885 PSGTASPPSTVEKIMEGD-GKPLQ-DKKIIGGVSADGIPDIKHGFSGLLSNGNKVSDVSS 942 Query: 1723 SLEDQKSTVEVCSSKFVSENKNSVSRVL-----NTASTEMKPSYVVVKSEKTGGSNKEER 1887 + K +E S + + + ++ E KPS + SE G+ ++ Sbjct: 943 RVAVGKEAIEESSLHAELDVDGKIKNLRYEGMDSSVPAEEKPSTLKRHSELVKGTCEDVL 1002 Query: 1888 LPTSSSGDPTTVRGGRSDEASINHVDLSEKTKSDHGTVEASVEDKARVETDITTRNQQG- 2064 L SSG + 
G++ E D ++ T + + ++ + +T + + Sbjct: 1003 L---SSGFRKDLISGKASELKAEKADETDDTGHHNQAENQRTDPESGSSSAVTDHDDEHV 1059 Query: 2065 EASVERKD 2088 E ++E K+ Sbjct: 1060 EENLESKE 1067 >ref|XP_002321576.2| hypothetical protein POPTR_0015s08410g [Populus trichocarpa] gi|550322308|gb|EEF05703.2| hypothetical protein POPTR_0015s08410g [Populus trichocarpa] Length = 1642 Score = 436 bits (1120), Expect = e-119 Identities = 303/724 (41%), Positives = 411/724 (56%), Gaps = 31/724 (4%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLR HK+ EIQ+KARSLVD WKKRVEAEM+ +AK SNQ VTW ++SR+PE S ++ Sbjct: 397 NHLRTHKNLEIQKKARSLVDMWKKRVEAEMDA-NAKFSSNQGVTWSTRSRIPEVSQVGNR 455 Query: 181 NPGGSNDVT-KSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSPASG----KEG 345 G S+++ KS+VVQ SASK +K QGET TKSAS SPG K +SP + K+G Sbjct: 456 PSGVSSEIAMKSSVVQLSASKSGPVKLVQGETVTKSAS-SPGPIKSTASPGTVGNNLKDG 514 Query: 346 QPR-VSVGGSFDVPL--AREDKXXXXXXXXXXXXXXX----------GKEDGRSSTAVSM 486 Q R + V G+ D+P A+++K GKED RSSTAVSM Sbjct: 515 QLRNIGVSGASDLPASAAKDEKSSSSSQSLNNSQSCSSDHAKTSGLPGKEDARSSTAVSM 574 Query: 487 NSIKISTGGSRHRKSVDGYPGSSVSGSQKESPAGRGS--HRNPSSEKLPQV-MSGEKTVD 657 + KI G R RKSV+G+PG +VSG Q++S + R S HRNP SEKL Q ++ ++ +D Sbjct: 575 ATNKIIGGSLRQRKSVNGFPGPAVSGVQRDSGSSRSSPLHRNPGSEKLQQSSLACDQALD 634 Query: 658 VPVIEGSGHKLIVKISNRGRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEK 837 VP EG HK IVKI +GRSPAQS+SGG+ ED + M+SR SSPV SE++DQFD LKEK Sbjct: 635 VPTAEGFSHKFIVKIPTKGRSPAQSSSGGTLEDTSVMNSRDSSPVPSERHDQFDHNLKEK 694 Query: 838 TDAYRSNF--DANSESWQSNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEVRAT 1011 ++YR N D +ESWQSNDFK++LTGSDEGDGSPA P+EE + DD+ K EV Sbjct: 695 INSYRVNIASDVKTESWQSNDFKEVLTGSDEGDGSPATVPDEEHGCMGDDASKLGEVSKA 754 Query: 1012 CTSGS--EPKSGKLHEASFSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSK 1185 S + E K GKLH+ASFSSMNALIESC KYS+ N M + D +GMNLLASVAA EMSK Sbjct: 755 TPSSNVYEHKFGKLHDASFSSMNALIESCAKYSDGNASMSVGDDVGMNLLASVAAGEMSK 814 Query: 1186 SDMVSPSVSPQRNNPAAEEACPGDDVKSKSPPSDISA-GDCRNDDDGNREKLVSACASWS 1362 SDMVSP+ SP+RN P E 
C ++KS P D+ A + DD + ++ ++ S S Sbjct: 815 SDMVSPTDSPRRNMP-IEHPCAPSGSRAKSSPRDVPAQSQGKPVDDEDEKQGITVGTSLS 873 Query: 1363 EDKLHPSKGAVMELSGDRKASFSPSQETMIGGCNKQFNSPCID-SQTAVVKLEITEKSDE 1539 ++ G + F SQE G N NS +D + A LE KS+E Sbjct: 874 KN------------IGAKTVLF--SQEKHTGELNGPPNSSHVDGKKIAEPCLESNVKSEE 919 Query: 1540 VGKYPPSPHSVSGKAIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSVLGDKVTNAG 1719 + S S++ K + + ++ KL G SVL +++ N G Sbjct: 920 ILLAAVSSESMAVKTSNCRGKELWEKEGGGRSNLDGISDEKEKLHG---SVL-NEINNTG 975 Query: 1720 ASLEDQKSTVEVCSSKFV----SENKNSVSRVLNTASTEMKPSYVVVKSEKTGGSNKEER 1887 ++D ++V S+ ENK +++ L+ + + +++S+ G+N E R Sbjct: 976 --VQDGTDAIDVSSTNHPVETDGENKKKMNKELDVSVGDEPKPPAMLQSDFAKGTNDEVR 1033 Query: 1888 LPTSSSGDPTTVRGGRSDEASINHVDLSEKTKSDHGTVEASVEDKARVETDITTRNQQGE 2067 P+SS D + +++ V E H T + +E + T T + +GE Sbjct: 1034 EPSSSGKDVVS--------ENMHDVKAGETDGRSHSTEKNKIEHEC--NTASATTDYEGE 1083 Query: 2068 ASVE 2079 VE Sbjct: 1084 CKVE 1087 >ref|XP_006376841.1| hypothetical protein POPTR_0012s07910g [Populus trichocarpa] gi|550326620|gb|ERP54638.1| hypothetical protein POPTR_0012s07910g [Populus trichocarpa] Length = 1542 Score = 433 bits (1114), Expect = e-118 Identities = 306/733 (41%), Positives = 410/733 (55%), Gaps = 47/733 (6%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLR HK+ EIQ+KARSLVDTWKKRVEAEM+ + KSGSN V+W ++SRLPE SH Sbjct: 397 NHLRTHKNLEIQKKARSLVDTWKKRVEAEMDA-NTKSGSNHGVSWTARSRLPEVSHG-GN 454 Query: 181 NPGGSNDVT-KSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSP-ASG---KEG 345 PG S++V KS+VVQ SASK +K QGET TKS S SPG KPA+SP A+G K+G Sbjct: 455 RPGVSSEVAMKSSVVQLSASKSGPVKLVQGETVTKSGS-SPGPIKPAASPNAAGNNLKDG 513 Query: 346 QPR-VSVGGSFDVPL--AREDKXXXXXXXXXXXXXXX----------GKEDGRSSTAVSM 486 QPR V G+ D+P+ AR++K GK+D RSSTAVSM Sbjct: 514 QPRNTGVSGAMDLPVSAARDEKSSSSSQSHNNSQSCSSEHAKTVGLSGKDDARSSTAVSM 573 Query: 487 NSIKISTGGSRHRKSVDGYPGSSVSGSQKESPAGRGS--HRNPSSEKLPQV-MSGEKTVD 657 + KI G RHRK V+G+ G ++SG+Q++S + R S H+NP SEKL Q ++ EK +D Sbjct: 574 
AANKIIGGSLRHRKPVNGFSGPALSGAQRDSGSSRSSPLHKNPGSEKLQQSSLACEKVLD 633 Query: 658 VPVIEGSGHKLIVKISNRGRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEK 837 P+ EG+ HK+IVKI NRGRSPAQS+SGG++ED MSSRASSPV+SE+++QFD LKEK Sbjct: 634 APMAEGNNHKIIVKIPNRGRSPAQSSSGGTFEDALVMSSRASSPVVSERHEQFDHNLKEK 693 Query: 838 TDAYRSNFDAN--SESWQSNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEVRAT 1011 D YR+N +N +ESWQSNDFK++LTGSDE DG PA P++E + DD+ K EV T Sbjct: 694 NDPYRANITSNVKTESWQSNDFKEVLTGSDERDGLPANVPDKEHGQTGDDARKLGEVSKT 753 Query: 1012 CTSGS--EPKSGKLHEASFSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSK 1185 S + E KS K ++ASFSSMNALIESC KYSE N M + D +GMNLLASVAA EMSK Sbjct: 754 TPSLTVFELKSEKSYDASFSSMNALIESCAKYSEGNAAMTVGDDVGMNLLASVAAGEMSK 813 Query: 1186 SDMVSPSVSPQRNNPAAEEACPGDDVKSKSPPSDISA------GDCRNDDDGNREKLVSA 1347 SD+VSP+ SP + P P ++ KS P D A D +DDD R +V Sbjct: 814 SDVVSPTNSPCISMPIERSWAP-SGLRGKSSPCDDPAQSQGKSADGVDDDDEKRVTVVGT 872 Query: 1348 CASWSEDKLHPSKGAVMELSGDRKASFSPSQETMIGGCNKQFNSPCIDSQTAVVKLEITE 1527 PSK + SQE G N NS +D+ A +E Sbjct: 873 ---------PPSKNT-------EAKTVLFSQEKHAGELNGPSNSSNVDA--AEPCMESNV 914 Query: 1528 KSDEVGKYPPSPHSVSGKAIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSVLGDKV 1707 KSDE P S S++ + + G + GDG S +K+ Sbjct: 915 KSDETLAAPVSSASMAVRTSN------------------CGGKEPWEKEGDGISDDKNKL 956 Query: 1708 TNAGASLEDQKSTVEVCSS-----------KFVSENKNSVSRVLNTASTEMKPSYVVVKS 1854 ++ E + V+V + + EN ++++ LN + +++S Sbjct: 957 LHSSVLTEVNYTGVQVGTEAIEGSSSNHHVEVDGENNKNMNKELNVSIHADPKPPAMMQS 1016 Query: 1855 EKTGGSNKEERLPTSSSGDPTT-----VRGGRSDEASINHVDLSEKTKSDHGTVEASVED 2019 + + G+N E P+SS D + V+ G +D S H +K K + T A+ + Sbjct: 1017 DFSKGTNDEMPQPSSSGKDMISENMHDVKAGETDGRS--HSTEKKKIKHESNTAPAATDH 1074 Query: 2020 KARVETDITTRNQ 2058 ++ + + NQ Sbjct: 1075 ESECKVESLGGNQ 1087 >ref|XP_002318025.2| hypothetical protein POPTR_0012s07910g [Populus trichocarpa] gi|550326619|gb|EEE96245.2| hypothetical protein POPTR_0012s07910g [Populus trichocarpa] Length = 1536 Score = 433 bits (1114), Expect = e-118 Identities = 306/733 (41%), Positives = 410/733 (55%), Gaps = 
47/733 (6%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLR HK+ EIQ+KARSLVDTWKKRVEAEM+ + KSGSN V+W ++SRLPE SH Sbjct: 397 NHLRTHKNLEIQKKARSLVDTWKKRVEAEMDA-NTKSGSNHGVSWTARSRLPEVSHG-GN 454 Query: 181 NPGGSNDVT-KSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSP-ASG---KEG 345 PG S++V KS+VVQ SASK +K QGET TKS S SPG KPA+SP A+G K+G Sbjct: 455 RPGVSSEVAMKSSVVQLSASKSGPVKLVQGETVTKSGS-SPGPIKPAASPNAAGNNLKDG 513 Query: 346 QPR-VSVGGSFDVPL--AREDKXXXXXXXXXXXXXXX----------GKEDGRSSTAVSM 486 QPR V G+ D+P+ AR++K GK+D RSSTAVSM Sbjct: 514 QPRNTGVSGAMDLPVSAARDEKSSSSSQSHNNSQSCSSEHAKTVGLSGKDDARSSTAVSM 573 Query: 487 NSIKISTGGSRHRKSVDGYPGSSVSGSQKESPAGRGS--HRNPSSEKLPQV-MSGEKTVD 657 + KI G RHRK V+G+ G ++SG+Q++S + R S H+NP SEKL Q ++ EK +D Sbjct: 574 AANKIIGGSLRHRKPVNGFSGPALSGAQRDSGSSRSSPLHKNPGSEKLQQSSLACEKVLD 633 Query: 658 VPVIEGSGHKLIVKISNRGRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEK 837 P+ EG+ HK+IVKI NRGRSPAQS+SGG++ED MSSRASSPV+SE+++QFD LKEK Sbjct: 634 APMAEGNNHKIIVKIPNRGRSPAQSSSGGTFEDALVMSSRASSPVVSERHEQFDHNLKEK 693 Query: 838 TDAYRSNFDAN--SESWQSNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEVRAT 1011 D YR+N +N +ESWQSNDFK++LTGSDE DG PA P++E + DD+ K EV T Sbjct: 694 NDPYRANITSNVKTESWQSNDFKEVLTGSDERDGLPANVPDKEHGQTGDDARKLGEVSKT 753 Query: 1012 CTSGS--EPKSGKLHEASFSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSK 1185 S + E KS K ++ASFSSMNALIESC KYSE N M + D +GMNLLASVAA EMSK Sbjct: 754 TPSLTVFELKSEKSYDASFSSMNALIESCAKYSEGNAAMTVGDDVGMNLLASVAAGEMSK 813 Query: 1186 SDMVSPSVSPQRNNPAAEEACPGDDVKSKSPPSDISA------GDCRNDDDGNREKLVSA 1347 SD+VSP+ SP + P P ++ KS P D A D +DDD R +V Sbjct: 814 SDVVSPTNSPCISMPIERSWAP-SGLRGKSSPCDDPAQSQGKSADGVDDDDEKRVTVVGT 872 Query: 1348 CASWSEDKLHPSKGAVMELSGDRKASFSPSQETMIGGCNKQFNSPCIDSQTAVVKLEITE 1527 PSK + SQE G N NS +D+ A +E Sbjct: 873 ---------PPSKNT-------EAKTVLFSQEKHAGELNGPSNSSNVDA--AEPCMESNV 914 Query: 1528 KSDEVGKYPPSPHSVSGKAIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSVLGDKV 1707 KSDE P S S++ + + G + GDG S +K+ Sbjct: 915 
KSDETLAAPVSSASMAVRTSN------------------CGGKEPWEKEGDGISDDKNKL 956 Query: 1708 TNAGASLEDQKSTVEVCSS-----------KFVSENKNSVSRVLNTASTEMKPSYVVVKS 1854 ++ E + V+V + + EN ++++ LN + +++S Sbjct: 957 LHSSVLTEVNYTGVQVGTEAIEGSSSNHHVEVDGENNKNMNKELNVSIHADPKPPAMMQS 1016 Query: 1855 EKTGGSNKEERLPTSSSGDPTT-----VRGGRSDEASINHVDLSEKTKSDHGTVEASVED 2019 + + G+N E P+SS D + V+ G +D S H +K K + T A+ + Sbjct: 1017 DFSKGTNDEMPQPSSSGKDMISENMHDVKAGETDGRS--HSTEKKKIKHESNTAPAATDH 1074 Query: 2020 KARVETDITTRNQ 2058 ++ + + NQ Sbjct: 1075 ESECKVESLGGNQ 1087 >ref|XP_002318028.2| hypothetical protein POPTR_0012s07910g [Populus trichocarpa] gi|566197345|ref|XP_002318027.2| hypothetical protein POPTR_0012s07910g [Populus trichocarpa] gi|550326618|gb|EEE96248.2| hypothetical protein POPTR_0012s07910g [Populus trichocarpa] gi|550326621|gb|EEE96247.2| hypothetical protein POPTR_0012s07910g [Populus trichocarpa] Length = 1640 Score = 433 bits (1114), Expect = e-118 Identities = 306/733 (41%), Positives = 410/733 (55%), Gaps = 47/733 (6%) Frame = +1 Query: 1 NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLR HK+ EIQ+KARSLVDTWKKRVEAEM+ + KSGSN V+W ++SRLPE SH Sbjct: 397 NHLRTHKNLEIQKKARSLVDTWKKRVEAEMDA-NTKSGSNHGVSWTARSRLPEVSHG-GN 454 Query: 181 NPGGSNDVT-KSAVVQFSASKMASIKTSQGETTTKSASLSPGSTKPASSP-ASG---KEG 345 PG S++V KS+VVQ SASK +K QGET TKS S SPG KPA+SP A+G K+G Sbjct: 455 RPGVSSEVAMKSSVVQLSASKSGPVKLVQGETVTKSGS-SPGPIKPAASPNAAGNNLKDG 513 Query: 346 QPR-VSVGGSFDVPL--AREDKXXXXXXXXXXXXXXX----------GKEDGRSSTAVSM 486 QPR V G+ D+P+ AR++K GK+D RSSTAVSM Sbjct: 514 QPRNTGVSGAMDLPVSAARDEKSSSSSQSHNNSQSCSSEHAKTVGLSGKDDARSSTAVSM 573 Query: 487 NSIKISTGGSRHRKSVDGYPGSSVSGSQKESPAGRGS--HRNPSSEKLPQV-MSGEKTVD 657 + KI G RHRK V+G+ G ++SG+Q++S + R S H+NP SEKL Q ++ EK +D Sbjct: 574 AANKIIGGSLRHRKPVNGFSGPALSGAQRDSGSSRSSPLHKNPGSEKLQQSSLACEKVLD 633 Query: 658 VPVIEGSGHKLIVKISNRGRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKEK 837 P+ EG+ HK+IVKI NRGRSPAQS+SGG++ED MSSRASSPV+SE+++QFD LKEK Sbjct: 634 
APMAEGNNHKIIVKIPNRGRSPAQSSSGGTFEDALVMSSRASSPVVSERHEQFDHNLKEK 693 Query: 838 TDAYRSNFDAN--SESWQSNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEVRAT 1011 D YR+N +N +ESWQSNDFK++LTGSDE DG PA P++E + DD+ K EV T Sbjct: 694 NDPYRANITSNVKTESWQSNDFKEVLTGSDERDGLPANVPDKEHGQTGDDARKLGEVSKT 753 Query: 1012 CTSGS--EPKSGKLHEASFSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMSK 1185 S + E KS K ++ASFSSMNALIESC KYSE N M + D +GMNLLASVAA EMSK Sbjct: 754 TPSLTVFELKSEKSYDASFSSMNALIESCAKYSEGNAAMTVGDDVGMNLLASVAAGEMSK 813 Query: 1186 SDMVSPSVSPQRNNPAAEEACPGDDVKSKSPPSDISA------GDCRNDDDGNREKLVSA 1347 SD+VSP+ SP + P P ++ KS P D A D +DDD R +V Sbjct: 814 SDVVSPTNSPCISMPIERSWAP-SGLRGKSSPCDDPAQSQGKSADGVDDDDEKRVTVVGT 872 Query: 1348 CASWSEDKLHPSKGAVMELSGDRKASFSPSQETMIGGCNKQFNSPCIDSQTAVVKLEITE 1527 PSK + SQE G N NS +D+ A +E Sbjct: 873 ---------PPSKNT-------EAKTVLFSQEKHAGELNGPSNSSNVDA--AEPCMESNV 914 Query: 1528 KSDEVGKYPPSPHSVSGKAIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSVLGDKV 1707 KSDE P S S++ + + G + GDG S +K+ Sbjct: 915 KSDETLAAPVSSASMAVRTSN------------------CGGKEPWEKEGDGISDDKNKL 956 Query: 1708 TNAGASLEDQKSTVEVCSS-----------KFVSENKNSVSRVLNTASTEMKPSYVVVKS 1854 ++ E + V+V + + EN ++++ LN + +++S Sbjct: 957 LHSSVLTEVNYTGVQVGTEAIEGSSSNHHVEVDGENNKNMNKELNVSIHADPKPPAMMQS 1016 Query: 1855 EKTGGSNKEERLPTSSSGDPTT-----VRGGRSDEASINHVDLSEKTKSDHGTVEASVED 2019 + + G+N E P+SS D + V+ G +D S H +K K + T A+ + Sbjct: 1017 DFSKGTNDEMPQPSSSGKDMISENMHDVKAGETDGRS--HSTEKKKIKHESNTAPAATDH 1074 Query: 2020 KARVETDITTRNQ 2058 ++ + + NQ Sbjct: 1075 ESECKVESLGGNQ 1087 >ref|XP_006439759.1| hypothetical protein CICLE_v10018474mg [Citrus clementina] gi|567894544|ref|XP_006439760.1| hypothetical protein CICLE_v10018474mg [Citrus clementina] gi|557542021|gb|ESR52999.1| hypothetical protein CICLE_v10018474mg [Citrus clementina] gi|557542022|gb|ESR53000.1| hypothetical protein CICLE_v10018474mg [Citrus clementina] Length = 1634 Score = 429 bits (1103), Expect = e-117 Identities = 305/700 (43%), Positives = 398/700 (56%), Gaps = 37/700 (5%) Frame = +1 Query: 1 
NHLRQHKSTEIQRKARSLVDTWKKRVEAEMNIIDAKSGSNQAVTWPSKSRLPEASHSISK 180 NHLR HK+ EIQ+KARSLVDTWKKRVEAEM DAKSGSNQAV+ P++ R+PE SH ++ Sbjct: 389 NHLRTHKNLEIQKKARSLVDTWKKRVEAEM---DAKSGSNQAVSGPARPRIPEVSHGGNR 445 Query: 181 NPGGSNDVT-KSAVVQFSASKMASIKTSQGETTTKSASL--SPGSTKPASSPASG----K 339 N G S+++ KS+ +Q S SK S+K QGET K AS SP STK A SPASG K Sbjct: 446 NSGSSSEIAIKSSSMQLSTSKTPSVKLVQGETVAKPASACASPASTKSAPSPASGSTNLK 505 Query: 340 EGQPRVSVGGSFDVPL--AREDKXXXXXXXXXXXXXXX----------GKEDGRSSTAVS 483 +GQ R + G+ D+P AR++K GKED RSSTA S Sbjct: 506 DGQLR-NTSGTSDLPSTPARDEKSSSSSQSHNNSQSCSSDHAKTGGFSGKEDARSSTAGS 564 Query: 484 MNSIKISTGGSRHRKSVDGYPGSSVSGSQKESPAGRGS--HRNPSSEKLPQV-MSGEKTV 654 M KIS G SR RKS +G+P +++SG Q++ + R S H+NP SEKL Q ++ EK V Sbjct: 565 MTVNKISGGSSRPRKSANGFPSTALSGVQRDHGSSRNSSSHKNPGSEKLSQSSLTCEKVV 624 Query: 655 DVPVIEGSGHKLIVKISNRGRSPAQSASGGSYEDPTNMSSRASSPVLSEKNDQFDRTLKE 834 D+ V+EG+ HKLIVKI NRGRSPAQSA S E+P+ M+SRASSPV +K+D+FDR+ KE Sbjct: 625 DMSVVEGNTHKLIVKIPNRGRSPAQSAYAVSLEEPSVMNSRASSPVPLDKHDRFDRSFKE 684 Query: 835 KTDAYRSNF--DANSESWQSNDFKDILTGSDEGDGSPAAAPEEERSKIVDDSMKSAEVRA 1008 K+D YR N D N+ESWQSNDFKD+LTGSDEGDGSPA P+EE+ + DD K+AEV Sbjct: 685 KSDGYRHNVTSDVNNESWQSNDFKDVLTGSDEGDGSPATVPDEEQCRAGDDPGKTAEVSK 744 Query: 1009 TCTS--GSEPKSGKLHEASFSSMNALIESCVKYSESNVPMLLDDAIGMNLLASVAAEEMS 1182 T +S G+E KSGK H+ SF S+NALIESCVKYSE+ +++ D GMNLLASVAA E+S Sbjct: 745 TASSSSGNELKSGKSHDVSFRSINALIESCVKYSEAKTSVVVGDDAGMNLLASVAAGEIS 804 Query: 1183 KSDMVSPSVSPQRNNPAAEEACPGDDVKSKSPPSDISAGDCRNDDDGNREKLVSACASWS 1362 KSD+VSP SP+R P E +D + KS P D D D G KL SW+ Sbjct: 805 KSDVVSPVGSPRRRTPVYEPFGNENDSRVKSFPGD-QFSDGAGDAHG---KLGVDHTSWA 860 Query: 1363 EDKLHPSKGAVMELSGDRKASFSPSQETMIGGCNKQFNSPCIDSQTAVVKLEITEKS-DE 1539 ++ + +L+G + + SP +Q PC ++ K+ +T+ + D Sbjct: 861 KNGDSNQEKPAGDLTG--RINTSPMD-------LQQSGDPCQENIENSNKIVMTKGTPDC 911 Query: 1540 VGKYPPSPHSVSGKAIDGELSKQFQXXXXXXXXXXXXGAVDAKLGGDGTSVLGDKVT--N 1713 GK P +G +D G D K + DKV+ N Sbjct: 912 AGKNP--EEDKAGVRVD------------------TNGTSDDKQRSSASLSQEDKVSELN 951 Query: 
1714 AGASLEDQKSTVEVCSSKFVSENKNSVSRVLNT-ASTEMKPSYVVVKSEKTGGSNKEERL 1890 G ++ S +F ENK + L TE KP + E G++ E L Sbjct: 952 QGVECNVVDGSLSHPSLEFHCENKKTACEGLKCFEQTEQKPPLIATHPENVKGAD-GELL 1010 Query: 1891 PTSSSGDPTT------VRGGRSDEA-SINHVDLSEKTKSD 1989 S G+ V+ DE S ++V+ SE+ KSD Sbjct: 1011 HESGPGEDMASKNIDEVKDEMVDEVDSKSNVNHSEEQKSD 1050