BLASTX nr result
ID: Akebia24_contig00005831
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00005831
         (2856 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score   E
Sequences producing significant alignments:                           (bits)  Value

emb|CBI30249.3| unnamed protein product [Vitis vinifera]              583   e-163
ref|XP_006430296.1| hypothetical protein CICLE_v10010952mg [Citr...   535   e-149
ref|XP_006481885.1| PREDICTED: ubiquitin-associated protein 2-li...   535   e-149
ref|XP_006481887.1| PREDICTED: ubiquitin-associated protein 2-li...   535   e-149
ref|XP_006430297.1| hypothetical protein CICLE_v10010952mg [Citr...   532   e-148
ref|XP_007203792.1| hypothetical protein PRUPE_ppa001273mg [Prun...   522   e-145
ref|XP_004303026.1| PREDICTED: uncharacterized protein LOC101305...   497   e-137
ref|XP_006381311.1| hypothetical protein POPTR_0006s11660g [Popu...   494   e-136
ref|XP_007027622.1| ENTH/VHS family protein, putative isoform 3 ...   476   e-131
ref|XP_007027620.1| ENTH/VHS family protein, putative isoform 1 ...   476   e-131
gb|EXB37772.1| Pre-mRNA cleavage complex 2 protein Pcf11 [Morus ...   474   e-130
ref|XP_002528590.1| conserved hypothetical protein [Ricinus comm...   472   e-130
ref|XP_007027621.1| ENTH/VHS family protein, putative isoform 2 ...   471   e-130
ref|XP_006851712.1| hypothetical protein AMTR_s00040p00210200 [A...   467   e-128
ref|XP_006341164.1| PREDICTED: uncharacterized protein LOC102593...   450   e-123
ref|XP_006430295.1| hypothetical protein CICLE_v10010952mg [Citr...   434   e-119
ref|XP_004246564.1| PREDICTED: uncharacterized protein LOC101244...   426   e-116
ref|XP_002277320.2| PREDICTED: uncharacterized protein LOC100251...   404   e-109
ref|XP_006339117.1| PREDICTED: uncharacterized protein LOC102597...
358 6e-96 ref|XP_003625749.1| Pre-mRNA cleavage complex 2 protein Pcf11 [M... 351 9e-94 >emb|CBI30249.3| unnamed protein product [Vitis vinifera] Length = 1049 Score = 583 bits (1504), Expect = e-163 Identities = 376/906 (41%), Positives = 476/906 (52%), Gaps = 68/906 (7%) Frame = -3 Query: 2854 SIHSGMRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHV 2675 SIH GMRHLFGTWKGVFP A LQMIEKELGF A+NGSS G TSR DSQSQRPP+SIHV Sbjct: 246 SIHPGMRHLFGTWKGVFPLAPLQMIEKELGFPPAINGSSPGIATSRSDSQSQRPPHSIHV 305 Query: 2674 NPKYLEARQRLQQSN----------------------------------------KDPRL 2615 NPKYLEARQRLQQS+ K + Sbjct: 306 NPKYLEARQRLQQSSRTKGAANDVTGTMVNSTEDADRLDRTAGINAGRPWDDLPAKSIQH 365 Query: 2614 SQREASSEPVHEKKSSAGYEYLESGSDLSRHSDLVIGRDYERVNEQDGLEKPLYGSGSNA 2435 S REA E V EKK A Y E G+DLSR+ L IGR E+ G +KP Y +G Sbjct: 366 SHREAIGELV-EKKIGAPYGDYEYGTDLSRNPGLGIGRPSEQ-----GHDKPWYKAGGRV 419 Query: 2434 AETNIGRRNAFDTQDGFSKYQAPKSAQVLSQLQPTD-VGNRGSRGMNKSWKNSEEEEYIW 2258 ET +RN FD + GF Y AP+SA + LQPT NR + GM++SWKNSEEEEY+W Sbjct: 420 VETFSSQRNGFDIKHGFPNYPAPRSANADAHLQPTQSTVNRSNSGMSRSWKNSEEEEYMW 479 Query: 2257 DDMNSRLTDHGGPDSSSADGWSTDDAEKPEIEDHLPQAHGEHDIGSRIYTETSSDSLSIA 2078 DDMNS++T+H + S D W+ DD+EK + E+ L + +D+GS + ETS+DS+S Sbjct: 480 DDMNSKMTEHSAANHSKKDRWTPDDSEKLDFENQLQKPQSIYDVGSSVDRETSTDSMSSE 539 Query: 2077 QRAQASFGHRTTSIWPSQEPRSVDGLKHISITTRISGHSEGHPXXXXXXXXXXXXXXXGR 1898 QR Q +FGHR +S+WP QEP S DGLKH +T I GHSEG+P Sbjct: 540 QREQGAFGHRMSSLWPLQEPHSTDGLKHSGTSTLILGHSEGYP----------------- 582 Query: 1897 TGFETLTGPSVVSIPNVGSMVDRVSGSGGFSGQQRHHSFTEKDHLRIQSSQPGQKTSHLP 1718 T F P ++ Q +Q G LP Sbjct: 583 TQFTLDALPKLI-----------------------------------QKAQLGDLQKLLP 607 Query: 1717 GNLSQAPYGQLPQDSSLPVRPQNHIKSQPPQHIHASFPQLRHPGLFSQQLHSEPTQSLPS 1538 NL P S+P+R H FS QL +P Q PS Sbjct: 608 HNLQSLS----PAVPSVPIR---------------------HHAPFSPQLQPDPLQPEPS 642 Query: 1537 SQTPK-PLPQPSISGSP-----PIMGHS-APGLDVPGQPSTGNLLAAIMKSGLLSSNSVT 1379 Q K LPQ SI +P P++ HS P + G+ ST NLLAA+MKSG+LS++SV+ Sbjct: 643 
GQAQKTSLPQTSIFEAPSTIENPVLEHSNYPAAESTGKLSTSNLLAAVMKSGILSNSSVS 702 Query: 1378 GGLPNPSFKDSGVLPSHLSIQHPLPSGPPTQLXXXXXXXXXXXPLGSTSSLSTHPQRTXX 1199 G +P SF+D+G + + IQ PLPSGPP + S S QR Sbjct: 703 GSIPKTSFQDTGAVLQSV-IQPPLPSGPPP----------------AHKSASNLSQRKVE 745 Query: 1198 XXXXXXXXXXXXXXXXXXXXXXSNVASAVPNPLSSLLSTLVAKGLISSPSKEMPTLTSPQ 1019 SNV S NP+++LLS+LVAKGLIS+ E T Q Sbjct: 746 RPPLPPGPPPPSSLAGSGLPQSSNVTSNASNPIANLLSSLVAKGLISASKTESSTHVPTQ 805 Query: 1018 VASRLPKQXXXXXXXXXXXXXXXXXXXXXSGNDLLFKGS----AAKITSTVSKPMKVERK 851 + +RL Q + + S AAK + V++ VE K Sbjct: 806 MPARLQNQSAGISTISPIPVSSVSVASSVPLSSTMDAVSHTEPAAKASVAVTQSTSVEVK 865 Query: 850 NLLGIEFKPEIIRESHPSVISDLFDDLPHKCSICGHRLKFQEQLDLHLEWHASK------ 689 NL+G EFK +IIRESHPSVIS+LFDDLPH+CSICG RLK +E+LD HLEWHA K Sbjct: 866 NLIGFEFKSDIIRESHPSVISELFDDLPHQCSICGLRLKLRERLDRHLEWHALKKSEPNG 925 Query: 688 --TLSRRWYPSLGVWVAGNEG--------SSSGPSVETAEKSEPVVPADESQCVCILCGE 539 SR W+ + G W+A G S +G S + E SE +VPADE+QCVC+LCGE Sbjct: 926 LNRASRSWFVNSGEWIAEVAGFPTEAKSTSPAGESGKPLETSEQMVPADENQCVCVLCGE 985 Query: 538 PFEDFYSHDRDEWMYKGATYMSLPAVDGDIGTTDGCASLGPIVHANCASPTSVSDLGLSK 359 FEDFYS + D+WM++GA M++P+ G++GT + GPIVHA+C + +SV DLGL+ Sbjct: 986 VFEDFYSQEMDKWMFRGAVKMTVPSQGGELGT----KNQGPIVHADCITESSVHDLGLAC 1041 Query: 358 NIKPEQ 341 +IK E+ Sbjct: 1042 DIKVEK 1047 >ref|XP_006430296.1| hypothetical protein CICLE_v10010952mg [Citrus clementina] gi|557532353|gb|ESR43536.1| hypothetical protein CICLE_v10010952mg [Citrus clementina] Length = 1073 Score = 535 bits (1379), Expect = e-149 Identities = 367/940 (39%), Positives = 494/940 (52%), Gaps = 102/940 (10%) Frame = -3 Query: 2854 SIHSGMRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHV 2675 ++ S MRHLFGTWKGVFPP +LQ+IEKELGF++ VNGSSSGATTSR DSQSQRPP+SIHV Sbjct: 163 AVRSSMRHLFGTWKGVFPPMTLQIIEKELGFTSVVNGSSSGATTSRHDSQSQRPPHSIHV 222 Query: 2674 NPKYLEARQRLQQSNK-----------------------------------DPRL----S 2612 NPKYLE RQRLQQ+++ DP + S Sbjct: 223 NPKYLE-RQRLQQTSRAKGLVNDMNGAVASSTVDAERPDRASSMSASRPWVDPTVKMQHS 281 Query: 2611 
QREASSEPVHEKKSSAGYEYLESGSDLSRHSDLVIGRDYERVNEQDGLEKPLYGSGSNAA 2432 QR+A SEP+HEK A +Y + GS+LSR S L GR RV++Q G EKP YGSGSN + Sbjct: 282 QRDALSEPIHEKNIGAYGDY-DYGSELSRSSGLGSGRTTGRVSDQ-GYEKPWYGSGSNIS 339 Query: 2431 ETNIGRRNAFDTQDGFSKYQAPKSAQVLSQLQPTDVGNRGSRGMNKSWKNSEEEEYIWDD 2252 ET G+RN F+ + GF Y A KSA + LQ + S SWKNSEEEE++W D Sbjct: 340 ETIAGQRNGFNKKQGFPNYSASKSANAAAHLQQVQSIPKSSSSGLSSWKNSEEEEFMW-D 398 Query: 2251 MNSRLTDHGGPD---SSSADGWSTDDAEKPEIEDHLPQAHGEHDIGSRIYTETSSDSLSI 2081 M+ R +DH + +S D + D EK E+++HL + G HD+ S ETSSDSLS Sbjct: 399 MHPRTSDHDAANISKNSRKDHLAVDGPEKLELDNHLRKPQGIHDVSSSFDRETSSDSLST 458 Query: 2080 AQRAQASFGHRTTSIWPSQEPRSVDGLKHISI------TTRISGHSEGHPXXXXXXXXXX 1919 Q+ QA++ H+ S W +E DGL ++ ++ + GHP Sbjct: 459 EQKDQAAYRHQMPSPWQLKE---ADGLIAATLGGFPASSSSSLARTGGHP--------PV 507 Query: 1918 XXXXXGRTGFETLTGPSVVSIPNVGSMVDRVSGSGGFSGQ--QRHHS------------- 1784 G +GF TL + S ++ + + + +G SG HHS Sbjct: 508 VSSHIGTSGFGTLASSASGSTGSLATQRFQSARAGSPSGHSPMHHHSPSPSVPAHHPRQN 567 Query: 1783 ---FTEKDHLRIQS-SQPGQKTSHLPGNLSQAPYGQLPQDSSLPVRPQNHIKSQP---PQ 1625 T++D+ Q S+P KTS PG +S P G +DS + P + + + P PQ Sbjct: 568 MQNCTDRDYPHAQPLSRPDLKTSSFPGLVSSGPRGHSTKDSPSILHPNSQLGNLPKVQPQ 627 Query: 1624 HIHASFPQLRHPGLFSQQLHSEPTQSLPSSQTPKPLPQPSISGSP----PIMGHSAPGLD 1457 + S P + S QL+ + + L LPQ S G+P + HS P LD Sbjct: 628 DLKGS-----SPAVTSFQLNCQSQKPL--------LPQVSNFGAPSTKEAVSDHSNP-LD 673 Query: 1456 VP--GQPSTGNLLAAIMKSGLLSSNSVTGGLPNPSFKDSGVLPSHLSIQHPLPSG-PPTQ 1286 GQ T +LLA+++KSG+L+S S+T GL N + K+ G +P L IQ PLPSG PP Sbjct: 674 AEGLGQSGTSSLLASVLKSGILNS-SITDGLANRALKEVGQIPLQLDIQPPLPSGPPPPS 732 Query: 1285 LXXXXXXXXXXXPLGSTS-----SLSTHPQRTXXXXXXXXXXXXXXXXXXXXXXXXSNVA 1121 L L S + T QR S+V Sbjct: 733 LLTSSGARVGSGSLSGPSQEDPPATMTSSQR-KVEQPPLPPGPPPSSLASSTSPKASSVE 791 Query: 1120 SAVPNPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXX 941 S NP+S+LLSTLVAKGLIS+ E P+ T+PQV SR+ + Sbjct: 792 SKTSNPISNLLSTLVAKGLISASKTEPPSHTTPQVTSRMQNESPGISSSSPATVSSVPNL 851 Query: 940 
XXXSGNDLLFKGS----AAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDD 773 + + + S A + + +S+ VE +NL+G++FKP++IRE H SVI LFD Sbjct: 852 LPIPPSSTVDETSLPAPAGESSFALSESTTVETQNLIGLKFKPDVIREFHESVIKRLFDG 911 Query: 772 LPHKCSICGHRLKFQEQLDLHLEWHASK--------TLSRRWYPSLGVWVAGNEGSSSG- 620 PH CSICG RLK QEQLD HLEWHA + +SRRWY + WVAG G G Sbjct: 912 FPHLCSICGLRLKLQEQLDRHLEWHALRKPGLDDVDKISRRWYANSDDWVAGKAGLPLGL 971 Query: 619 -------PSVETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAV 461 S +T ++ EP+VPAD++QC C++CGE FED Y+ R EWM+K A YM +P+ Sbjct: 972 ESISCMEDSGKTIDEGEPMVPADDNQCACVMCGELFEDCYNQARGEWMFKAAVYMMIPSG 1031 Query: 460 DGDIGTTDGCASLGPIVHANCASPTSVSDLGLSKNIKPEQ 341 +G++GTT+ ++ GPIVH NC S SV DL + +K E+ Sbjct: 1032 NGEVGTTNESSAKGPIVHGNCISENSVHDLRVISKVKVEK 1071 >ref|XP_006481885.1| PREDICTED: ubiquitin-associated protein 2-like isoform X1 [Citrus sinensis] gi|568856635|ref|XP_006481886.1| PREDICTED: ubiquitin-associated protein 2-like isoform X2 [Citrus sinensis] Length = 1073 Score = 535 bits (1378), Expect = e-149 Identities = 367/950 (38%), Positives = 485/950 (51%), Gaps = 112/950 (11%) Frame = -3 Query: 2854 SIHSGMRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHV 2675 ++ S MRHLFGTWKGVFPP +LQ+IEKELGF++ VNGSSSGATTSR DSQSQRPP+SIHV Sbjct: 163 AVRSSMRHLFGTWKGVFPPMTLQIIEKELGFTSVVNGSSSGATTSRHDSQSQRPPHSIHV 222 Query: 2674 NPKYLEARQRLQQSNK-----------------------------------DPRL----S 2612 NPKYLE RQRLQQ+++ DP + S Sbjct: 223 NPKYLE-RQRLQQTSRAKGLVNDMNGAVASSTVDAERPDRASSMSASRPWVDPTVKMQHS 281 Query: 2611 QREASSEPVHEKKSSAGYEYLESGSDLSRHSDLVIGRDYERVNEQDGLEKPLYGSGSNAA 2432 QR+A SEP+HEK Y + GS+LSR S L GR RV++Q G EKP YGSGSN + Sbjct: 282 QRDALSEPIHEKNIGGAYGDYDYGSELSRSSGLGSGRTTGRVSDQ-GYEKPWYGSGSNIS 340 Query: 2431 ETNIGRRNAFDTQDGFSKYQAPKSAQVLSQLQPTDVGNRGSRGMNKSWKNSEEEEYIWDD 2252 ET G+RN F+ + GF Y A KSA + LQ + S SWKNSEEEE++WD Sbjct: 341 ETIAGQRNGFNKKQGFPNYSASKSANAAAHLQQVQSIPKSSSSGLSSWKNSEEEEFMWD- 399 Query: 2251 MNSRLTDHGGPD---SSSADGWSTDDAEKPEIEDHLPQAHGEHDIGSRIYTETSSDSLSI 2081 M+ R +DH + +S D + D EK E+++HL + G 
HD+ S ETSSDSLS Sbjct: 400 MHPRTSDHDAANISKNSRKDHLAVDGPEKLELDNHLRKPQGIHDVSSSFDIETSSDSLST 459 Query: 2080 AQRAQASFGHRTTSIWPSQEPRSVDGLKHISITTRISGHSEGHPXXXXXXXXXXXXXXXG 1901 Q+ QA++ H+ S W +E DGL I+ G P Sbjct: 460 EQKDQAAYRHQMPSPWQLKE---ADGL--------IAATLGGFPASSSSSLA-------- 500 Query: 1900 RTGFETLTGPSVVSIPNVGSMVDRVSGSGGFSGQQR----------------HHS----- 1784 RTG G S + G++ SGS G QR HHS Sbjct: 501 RTGGHPPVGSSHIGTSGFGTLASSASGSTGSLATQRFQSAPAGSPSGHSPMHHHSPSPSV 560 Query: 1783 -----------FTEKDHLRIQS-SQPGQKTSHLPGNLSQAPYGQLPQDSSLPVRPQN--- 1649 T++D+ Q S+P KTS PG +S P G +D + P + Sbjct: 561 PAHHPRQNMQNCTDRDYPHAQPLSRPDLKTSSFPGLVSSGPRGHSTKDLPSILHPNSQLG 620 Query: 1648 HIKSQPPQHIHASFPQLRHPGLFSQQLHSEPTQSLPSSQTPKPLPQPSISGSPP----IM 1481 ++ PQ + S P + S QL+ + + L LPQ S G+P + Sbjct: 621 NLHKVQPQDLKGS-----SPAVTSFQLNCQSQKPL--------LPQVSNFGAPSSKEAVS 667 Query: 1480 GHSAPGLDVPG--QPSTGNLLAAIMKSGLLSSNSVTGGLPNPSFKDSGVLPSHLSIQHPL 1307 HS P LD G Q T +LLA+++KSG+L+S S+T GL N + ++ G +P L IQ PL Sbjct: 668 DHSNP-LDAEGLGQSGTSSLLASVLKSGILNS-SITDGLANRALREVGQIPLQLDIQPPL 725 Query: 1306 PSGPPTQLXXXXXXXXXXXPLGSTSSLS--------THPQRTXXXXXXXXXXXXXXXXXX 1151 PSGPP L GS+S S T QR Sbjct: 726 PSGPPPSLLTSSGARVGS---GSSSGPSQEDPPATMTGSQRKVEQPPLPPGPPPSSLASS 782 Query: 1150 XXXXXXSNVASAVPNPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXX 971 +V S NP+S+LLSTLVAKGLIS+ E P+ T+PQV SR+ + Sbjct: 783 TSPKVS-SVESKTSNPISNLLSTLVAKGLISASKTEPPSHTTPQVTSRMQNESPGISSSS 841 Query: 970 XXXXXXXXXXXXXSGNDLLFKGS----AAKITSTVSKPMKVERKNLLGIEFKPEIIRESH 803 + + + S A + + +S+ VE +NL+G++FKP++IRE H Sbjct: 842 PAAVSSVPNLLPIPPSSTVDETSLPAPAGESSFALSESTTVETQNLIGLKFKPDVIREFH 901 Query: 802 PSVISDLFDDLPHKCSICGHRLKFQEQLDLHLEWHASKT--------LSRRWYPSLGVWV 647 SVI LFD PH CSICG RLK QEQLD HLEWHA + +SRRWY + WV Sbjct: 902 ESVIKRLFDGFPHLCSICGLRLKLQEQLDRHLEWHALRKPGLDDVDKVSRRWYANSDDWV 961 Query: 646 AGNEGSSSG--------PSVETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYK 491 AG G G S +T ++ EP+VPAD++QC C++CGE FED Y+ R EWM+K Sbjct: 962 
AGKAGLPLGLESISCMEDSGKTIDEGEPMVPADDNQCACVMCGELFEDCYNQARGEWMFK 1021 Query: 490 GATYMSLPAVDGDIGTTDGCASLGPIVHANCASPTSVSDLGLSKNIKPEQ 341 A YM +P+ +G++GTT+ ++ GPIVH NC S SV DL + +K E+ Sbjct: 1022 AAVYMMIPSGNGEVGTTNESSAKGPIVHGNCISENSVHDLRVISKVKVEK 1071 >ref|XP_006481887.1| PREDICTED: ubiquitin-associated protein 2-like isoform X3 [Citrus sinensis] Length = 1070 Score = 535 bits (1377), Expect = e-149 Identities = 366/947 (38%), Positives = 484/947 (51%), Gaps = 109/947 (11%) Frame = -3 Query: 2854 SIHSGMRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHV 2675 ++ S MRHLFGTWKGVFPP +LQ+IEKELGF++ VNGSSSGATTSR DSQSQRPP+SIHV Sbjct: 163 AVRSSMRHLFGTWKGVFPPMTLQIIEKELGFTSVVNGSSSGATTSRHDSQSQRPPHSIHV 222 Query: 2674 NPKYLEARQRLQQSNK-----------------------------------DPRLS-QRE 2603 NPKYLE RQRLQQ+++ DP + QR+ Sbjct: 223 NPKYLE-RQRLQQTSRAKGLVNDMNGAVASSTVDAERPDRASSMSASRPWVDPTVKMQRD 281 Query: 2602 ASSEPVHEKKSSAGYEYLESGSDLSRHSDLVIGRDYERVNEQDGLEKPLYGSGSNAAETN 2423 A SEP+HEK Y + GS+LSR S L GR RV++Q G EKP YGSGSN +ET Sbjct: 282 ALSEPIHEKNIGGAYGDYDYGSELSRSSGLGSGRTTGRVSDQ-GYEKPWYGSGSNISETI 340 Query: 2422 IGRRNAFDTQDGFSKYQAPKSAQVLSQLQPTDVGNRGSRGMNKSWKNSEEEEYIWDDMNS 2243 G+RN F+ + GF Y A KSA + LQ + S SWKNSEEEE++WD M+ Sbjct: 341 AGQRNGFNKKQGFPNYSASKSANAAAHLQQVQSIPKSSSSGLSSWKNSEEEEFMWD-MHP 399 Query: 2242 RLTDHGGPD---SSSADGWSTDDAEKPEIEDHLPQAHGEHDIGSRIYTETSSDSLSIAQR 2072 R +DH + +S D + D EK E+++HL + G HD+ S ETSSDSLS Q+ Sbjct: 400 RTSDHDAANISKNSRKDHLAVDGPEKLELDNHLRKPQGIHDVSSSFDIETSSDSLSTEQK 459 Query: 2071 AQASFGHRTTSIWPSQEPRSVDGLKHISITTRISGHSEGHPXXXXXXXXXXXXXXXGRTG 1892 QA++ H+ S W +E DGL I+ G P RTG Sbjct: 460 DQAAYRHQMPSPWQLKE---ADGL--------IAATLGGFPASSSSSLA--------RTG 500 Query: 1891 FETLTGPSVVSIPNVGSMVDRVSGSGGFSGQQR----------------HHS-------- 1784 G S + G++ SGS G QR HHS Sbjct: 501 GHPPVGSSHIGTSGFGTLASSASGSTGSLATQRFQSAPAGSPSGHSPMHHHSPSPSVPAH 560 Query: 1783 --------FTEKDHLRIQS-SQPGQKTSHLPGNLSQAPYGQLPQDSSLPVRPQN---HIK 1640 T++D+ Q S+P KTS PG +S P G +D + P + ++ Sbjct: 561 
HPRQNMQNCTDRDYPHAQPLSRPDLKTSSFPGLVSSGPRGHSTKDLPSILHPNSQLGNLH 620 Query: 1639 SQPPQHIHASFPQLRHPGLFSQQLHSEPTQSLPSSQTPKPLPQPSISGSPP----IMGHS 1472 PQ + S P + S QL+ + + L LPQ S G+P + HS Sbjct: 621 KVQPQDLKGS-----SPAVTSFQLNCQSQKPL--------LPQVSNFGAPSSKEAVSDHS 667 Query: 1471 APGLDVPG--QPSTGNLLAAIMKSGLLSSNSVTGGLPNPSFKDSGVLPSHLSIQHPLPSG 1298 P LD G Q T +LLA+++KSG+L+S S+T GL N + ++ G +P L IQ PLPSG Sbjct: 668 NP-LDAEGLGQSGTSSLLASVLKSGILNS-SITDGLANRALREVGQIPLQLDIQPPLPSG 725 Query: 1297 PPTQLXXXXXXXXXXXPLGSTSSLS--------THPQRTXXXXXXXXXXXXXXXXXXXXX 1142 PP L GS+S S T QR Sbjct: 726 PPPSLLTSSGARVGS---GSSSGPSQEDPPATMTGSQRKVEQPPLPPGPPPSSLASSTSP 782 Query: 1141 XXXSNVASAVPNPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXX 962 +V S NP+S+LLSTLVAKGLIS+ E P+ T+PQV SR+ + Sbjct: 783 KVS-SVESKTSNPISNLLSTLVAKGLISASKTEPPSHTTPQVTSRMQNESPGISSSSPAA 841 Query: 961 XXXXXXXXXXSGNDLLFKGS----AAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSV 794 + + + S A + + +S+ VE +NL+G++FKP++IRE H SV Sbjct: 842 VSSVPNLLPIPPSSTVDETSLPAPAGESSFALSESTTVETQNLIGLKFKPDVIREFHESV 901 Query: 793 ISDLFDDLPHKCSICGHRLKFQEQLDLHLEWHASKT--------LSRRWYPSLGVWVAGN 638 I LFD PH CSICG RLK QEQLD HLEWHA + +SRRWY + WVAG Sbjct: 902 IKRLFDGFPHLCSICGLRLKLQEQLDRHLEWHALRKPGLDDVDKVSRRWYANSDDWVAGK 961 Query: 637 EGSSSG--------PSVETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGAT 482 G G S +T ++ EP+VPAD++QC C++CGE FED Y+ R EWM+K A Sbjct: 962 AGLPLGLESISCMEDSGKTIDEGEPMVPADDNQCACVMCGELFEDCYNQARGEWMFKAAV 1021 Query: 481 YMSLPAVDGDIGTTDGCASLGPIVHANCASPTSVSDLGLSKNIKPEQ 341 YM +P+ +G++GTT+ ++ GPIVH NC S SV DL + +K E+ Sbjct: 1022 YMMIPSGNGEVGTTNESSAKGPIVHGNCISENSVHDLRVISKVKVEK 1068 >ref|XP_006430297.1| hypothetical protein CICLE_v10010952mg [Citrus clementina] gi|557532354|gb|ESR43537.1| hypothetical protein CICLE_v10010952mg [Citrus clementina] Length = 906 Score = 532 bits (1371), Expect = e-148 Identities = 366/935 (39%), Positives = 491/935 (52%), Gaps = 102/935 (10%) Frame = -3 Query: 2839 MRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHVNPKYL 2660 MRHLFGTWKGVFPP 
+LQ+IEKELGF++ VNGSSSGATTSR DSQSQRPP+SIHVNPKYL Sbjct: 1 MRHLFGTWKGVFPPMTLQIIEKELGFTSVVNGSSSGATTSRHDSQSQRPPHSIHVNPKYL 60 Query: 2659 EARQRLQQSNK-----------------------------------DPRL----SQREAS 2597 E RQRLQQ+++ DP + SQR+A Sbjct: 61 E-RQRLQQTSRAKGLVNDMNGAVASSTVDAERPDRASSMSASRPWVDPTVKMQHSQRDAL 119 Query: 2596 SEPVHEKKSSAGYEYLESGSDLSRHSDLVIGRDYERVNEQDGLEKPLYGSGSNAAETNIG 2417 SEP+HEK A +Y + GS+LSR S L GR RV++Q G EKP YGSGSN +ET G Sbjct: 120 SEPIHEKNIGAYGDY-DYGSELSRSSGLGSGRTTGRVSDQ-GYEKPWYGSGSNISETIAG 177 Query: 2416 RRNAFDTQDGFSKYQAPKSAQVLSQLQPTDVGNRGSRGMNKSWKNSEEEEYIWDDMNSRL 2237 +RN F+ + GF Y A KSA + LQ + S SWKNSEEEE++W DM+ R Sbjct: 178 QRNGFNKKQGFPNYSASKSANAAAHLQQVQSIPKSSSSGLSSWKNSEEEEFMW-DMHPRT 236 Query: 2236 TDHGGPD---SSSADGWSTDDAEKPEIEDHLPQAHGEHDIGSRIYTETSSDSLSIAQRAQ 2066 +DH + +S D + D EK E+++HL + G HD+ S ETSSDSLS Q+ Q Sbjct: 237 SDHDAANISKNSRKDHLAVDGPEKLELDNHLRKPQGIHDVSSSFDRETSSDSLSTEQKDQ 296 Query: 2065 ASFGHRTTSIWPSQEPRSVDGLKHISI------TTRISGHSEGHPXXXXXXXXXXXXXXX 1904 A++ H+ S W +E DGL ++ ++ + GHP Sbjct: 297 AAYRHQMPSPWQLKE---ADGLIAATLGGFPASSSSSLARTGGHP--------PVVSSHI 345 Query: 1903 GRTGFETLTGPSVVSIPNVGSMVDRVSGSGGFSGQ--QRHHS----------------FT 1778 G +GF TL + S ++ + + + +G SG HHS T Sbjct: 346 GTSGFGTLASSASGSTGSLATQRFQSARAGSPSGHSPMHHHSPSPSVPAHHPRQNMQNCT 405 Query: 1777 EKDHLRIQS-SQPGQKTSHLPGNLSQAPYGQLPQDSSLPVRPQNHIKSQP---PQHIHAS 1610 ++D+ Q S+P KTS PG +S P G +DS + P + + + P PQ + S Sbjct: 406 DRDYPHAQPLSRPDLKTSSFPGLVSSGPRGHSTKDSPSILHPNSQLGNLPKVQPQDLKGS 465 Query: 1609 FPQLRHPGLFSQQLHSEPTQSLPSSQTPKPLPQPSISGSP----PIMGHSAPGLDVP--G 1448 P + S QL+ + + L LPQ S G+P + HS P LD G Sbjct: 466 -----SPAVTSFQLNCQSQKPL--------LPQVSNFGAPSTKEAVSDHSNP-LDAEGLG 511 Query: 1447 QPSTGNLLAAIMKSGLLSSNSVTGGLPNPSFKDSGVLPSHLSIQHPLPSG-PPTQLXXXX 1271 Q T +LLA+++KSG+L+S S+T GL N + K+ G +P L IQ PLPSG PP L Sbjct: 512 QSGTSSLLASVLKSGILNS-SITDGLANRALKEVGQIPLQLDIQPPLPSGPPPPSLLTSS 570 Query: 1270 XXXXXXXPLGSTS-----SLSTHPQRTXXXXXXXXXXXXXXXXXXXXXXXXSNVASAVPN 1106 L S + T QR S+V S N Sbjct: 571 
GARVGSGSLSGPSQEDPPATMTSSQR-KVEQPPLPPGPPPSSLASSTSPKASSVESKTSN 629 Query: 1105 PLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXXXXXSG 926 P+S+LLSTLVAKGLIS+ E P+ T+PQV SR+ + Sbjct: 630 PISNLLSTLVAKGLISASKTEPPSHTTPQVTSRMQNESPGISSSSPATVSSVPNLLPIPP 689 Query: 925 NDLLFKGS----AAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKC 758 + + + S A + + +S+ VE +NL+G++FKP++IRE H SVI LFD PH C Sbjct: 690 SSTVDETSLPAPAGESSFALSESTTVETQNLIGLKFKPDVIREFHESVIKRLFDGFPHLC 749 Query: 757 SICGHRLKFQEQLDLHLEWHASK--------TLSRRWYPSLGVWVAGNEGSSSG------ 620 SICG RLK QEQLD HLEWHA + +SRRWY + WVAG G G Sbjct: 750 SICGLRLKLQEQLDRHLEWHALRKPGLDDVDKISRRWYANSDDWVAGKAGLPLGLESISC 809 Query: 619 --PSVETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIG 446 S +T ++ EP+VPAD++QC C++CGE FED Y+ R EWM+K A YM +P+ +G++G Sbjct: 810 MEDSGKTIDEGEPMVPADDNQCACVMCGELFEDCYNQARGEWMFKAAVYMMIPSGNGEVG 869 Query: 445 TTDGCASLGPIVHANCASPTSVSDLGLSKNIKPEQ 341 TT+ ++ GPIVH NC S SV DL + +K E+ Sbjct: 870 TTNESSAKGPIVHGNCISENSVHDLRVISKVKVEK 904 >ref|XP_007203792.1| hypothetical protein PRUPE_ppa001273mg [Prunus persica] gi|462399323|gb|EMJ04991.1| hypothetical protein PRUPE_ppa001273mg [Prunus persica] Length = 866 Score = 522 bits (1345), Expect = e-145 Identities = 355/916 (38%), Positives = 454/916 (49%), Gaps = 83/916 (9%) Frame = -3 Query: 2839 MRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHVNPKYL 2660 MRHLFGTWKGVFP +LQMIEKELGF++ NGSSSGA TSR DSQSQRP +SIHVNPKYL Sbjct: 1 MRHLFGTWKGVFPAQTLQMIEKELGFASTANGSSSGAATSRLDSQSQRPAHSIHVNPKYL 60 Query: 2659 EARQRLQQSNK-----------------------------------DPRL-------SQR 2606 E RQRLQQ + DP + S Sbjct: 61 E-RQRLQQPTRTKGMASDFSGAMANSIDDAERPDRVASLSAGRPWVDPTVKMHNMQRSNT 119 Query: 2605 EASSEPVHEKKSSAGYEYLESGSDLSRHSDLVIGRDYERVNEQDGLEKPLYGSGSNAAET 2426 +A SE VHEK A Y E GSDL R S+L IGR ++ EQ G +KP YG GS+ AET Sbjct: 120 DALSERVHEKNIGAEYGEYEYGSDLPRSSNLGIGRIGGKITEQ-GNDKPWYGGGSSVAET 178 Query: 2425 NIGRRNAFDTQDGFSKYQAPKSAQVLSQLQPTD-VGNRGSRGMNKSWKNSEEEEYIWDDM 2249 +RN F+ + G + Y APKSA +L+ + +R S ++ 
SWKNSEEEE+ WDDM Sbjct: 179 ISSQRNGFNIKHGLTNYSAPKSANADPRLKTAPAIASRSSGVLSNSWKNSEEEEFKWDDM 238 Query: 2248 NSRLTDHGGPDSSS---ADGWSTDDAEKPEIEDHLPQAHGEHDIGSRIYTETSSDSLSIA 2078 NSRLTDHG PD SS D W++DD+EK H + G +D + + +TS+D Sbjct: 239 NSRLTDHGPPDISSNSRKDCWTSDDSEKLGFGGHFRKPKGANDFATTVDLDTSADPTE-- 296 Query: 2077 QRAQASFGHRTTSIWPSQEPRSVDGLKHISITTRISGHSEGHPXXXXXXXXXXXXXXXGR 1898 ++ GHR +S WP + +DGL S HSE + Sbjct: 297 HNDLSALGHRMSSPWPLSDSHGMDGLTPTGTPVISSVHSERYASSL-------------- 342 Query: 1897 TGFETLTGPSVVSIPNVGSMVDRVSGSGGFS-GQQRHHSFTEKDHLRIQSSQPGQKTSHL 1721 +G T SV + + + G+ F G + ++QS + Sbjct: 343 SGLSTSGDSSVARLGSRAQVASSRIGASSFGFGATSGPAVAVGKQKQLQSVR-------- 394 Query: 1720 PGNLSQAPYGQLPQDSSLPVRPQNHIKSQPPQHIHASFPQLRHPGLFSQQLHSEPTQS-L 1544 + +P GQ S QH A + HP L S P Q L Sbjct: 395 ----AASPSGQ----------------SLVHQHSPAPTSTVHHP---HHHLQSLPEQDYL 431 Query: 1543 PSSQTPKPLPQPSISGSPPIMGHSAP------GLDVPGQPSTGNLLAAIMKSGLLSSNSV 1382 S P P + S +P G S P + GQ ST +LLAA+MK+G+LS S+ Sbjct: 432 ESPSLPPPDSKLSTYVTPSTAGISLPDHSNLRAAETSGQSSTSSLLAAVMKTGILSDKSI 491 Query: 1381 TGGLPNPSFKDSGVLPSHLSIQHPLPSGPP-TQLXXXXXXXXXXXPLGSTSSLSTHPQRT 1205 TG LP+ + +D G S +Q PLPSGPP TQ+ S+S LS Sbjct: 492 TGSLPSLNLRDMGQNQSQSGVQPPLPSGPPPTQVALPGSKVASAP---SSSHLSHENSPA 548 Query: 1204 XXXXXXXXXXXXXXXXXXXXXXXXSNVASA--------VPNPLSSLLSTLVAKGLISSPS 1049 ASA +P+S+LLS+LVAKGLIS+ Sbjct: 549 SSDISLKKVGHPPLPPSQPLSSSLEGTASANASTVVNNASDPISNLLSSLVAKGLISASK 608 Query: 1048 KEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXXXXXS----GNDLLFKGSAAKITST 881 E PT S Q+ + L Q +D+ AK ++ Sbjct: 609 SESPTPVSSQMPNELQNQSVSTPVTSSVSVSPVSASPSLPVSSRTDDVSLAEPLAKTSAA 668 Query: 880 VSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKCSICGHRLKFQEQLDLHLEW 701 + + K+E KN +GIEFKP+ IRE HPSVI +LFDDLPHKCSICG RLK +E+L+ HLEW Sbjct: 669 LPQSSKIETKNPIGIEFKPDKIREFHPSVIEELFDDLPHKCSICGLRLKLKERLERHLEW 728 Query: 700 HASKT--------LSRRWYPSLGVWVAGNEGSSSGPS--------VETAEKSEPVVPADE 569 HA KT SRRWY WVAG G GP ET + EP+VPADE Sbjct: 729 HALKTPEFNGSVKASRRWYADSTNWVAGKAGPPLGPEDNMSIDKPSETMDNGEPMVPADE 788 Query: 568 
SQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIGTTDGCASLGPIVHANCASP 389 SQCVC++CG FED Y +RDEWM+KGA+Y+S+P GD+GTT+ GPIVHANC + Sbjct: 789 SQCVCVICGYIFEDLYCQERDEWMFKGASYLSIPYGVGDLGTTEESVVKGPIVHANCIAE 848 Query: 388 TSVSDLGLSKNIKPEQ 341 S+SDLGL+ IK E+ Sbjct: 849 NSLSDLGLASRIKLEK 864 >ref|XP_004303026.1| PREDICTED: uncharacterized protein LOC101305191 [Fragaria vesca subsp. vesca] Length = 1110 Score = 497 bits (1279), Expect = e-137 Identities = 351/944 (37%), Positives = 448/944 (47%), Gaps = 105/944 (11%) Frame = -3 Query: 2851 IHSGMRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHVN 2672 IH MRHLFGTWKGVFP +LQMIEKELGF+ A NGSSSG ++SRPDSQSQRP NSIHVN Sbjct: 184 IHQSMRHLFGTWKGVFPAQTLQMIEKELGFTTAANGSSSGVSSSRPDSQSQRPANSIHVN 243 Query: 2671 PKYLEARQRLQQ-----------------------------------SNKDPRL------ 2615 PKYLE RQRLQQ S DP + Sbjct: 244 PKYLE-RQRLQQPVRTKGMASDFDGTMTNSIDDIERSDRVASISAGRSWADPPVKMPNIQ 302 Query: 2614 -SQREASSEPVHEKKSSAGYEYLESGSDLSRHSDLVIGRDYERVNEQDGLEKPLYGSGSN 2438 S R+A SE HEK Y+ + SDL R S L IGR + EQ G +KP YG S+ Sbjct: 303 RSTRDALSERFHEKNVGGEYDESDYDSDLPRSSSLAIGRSGGNIIEQ-GHDKPWYGGVSS 361 Query: 2437 AAETNIGRRNAFDTQDGFSKYQAPKSAQVLSQLQ-PTDVGNRGSRGMNKSWKNSEEEEYI 2261 AAET G+RN F+ + G + Y APKSA +LQ P + +R G++ SWKNSEEEEY+ Sbjct: 362 AAETISGQRNGFNKKHGLN-YSAPKSANADPRLQTPQAIASRNRGGLSSSWKNSEEEEYM 420 Query: 2260 WDDMNSRLTDHGGPDSSS---ADGWSTDDAEKPEIEDHLPQAHGEHDIGSRIYTETSSDS 2090 WDDMNSRLTDH PD SS + W +DD+EK G + + D+ Sbjct: 421 WDDMNSRLTDHVTPDLSSNSRKERWISDDSEK--------MGFGGGSRKLKRVNDLDMDT 472 Query: 2089 LSIAQRAQASFGHRTTSIWPSQEPRSVDGLKHISITTRISGHSEGHPXXXXXXXXXXXXX 1910 + Q+ ++ GHR S W QE VD L S HSE + Sbjct: 473 DIVEQKDISALGHRMPSPWSLQESHVVDRLTSSGTPVMNSAHSERY-VSSLSGLSTSGDS 531 Query: 1909 XXGRTGFETLTGPSVVSIPNVGSMVDRVSGSGGFSGQQRH-------------------- 1790 R G S V + G + SGS G G+Q+ Sbjct: 532 SVARLGNRAQMMSSHVGASSFGLPTNAASGSNGAVGKQQQIQSVRAASPSGQLLMHQHAP 591 Query: 1789 ----------HSFTEKDHLRIQSSQPGQKTSHLPGNLSQAPYGQLPQDSSLPVRPQN--- 1649 H E+D + S P K S + G + Q +DS LP+ N Sbjct: 592 
LPASKIQNPRHYLAEQDPAQAPSLPPDLKVSQILGKSDSGLHSQYTEDS-LPIPTSNLRL 650 Query: 1648 --HIKSQPPQHIHASFP----QLRHPGLFSQQLHSEPTQSLPSSQTPKPLPQPSISGSPP 1487 KSQP + S Q +H F QQ +EP S QT KP PS + Sbjct: 651 GGMAKSQPQELKALSSSMAAIQSKHHYPFQQQDITEPESS---DQTEKPHKMPSTVRNSI 707 Query: 1486 IMGHSAPGLDVPGQPSTGNLLAAIMKSGLLSSNSVTGGLPNPSFKDSGVLPSHLSIQHPL 1307 + + GQ ST +LLAA++K+G+LS+ S+TG LP+ SF D +P Q PL Sbjct: 708 SDLSNLLAAETSGQSSTSSLLAAVLKTGILSNKSITGSLPSSSFGDMEKMPPQSVSQPPL 767 Query: 1306 PSG-PPTQLXXXXXXXXXXXPLGSTSSLSTHPQRTXXXXXXXXXXXXXXXXXXXXXXXXS 1130 P G PPT+ LG S P + + Sbjct: 768 PIGRPPTKAALPGLKVAPAPSLGHPSR-DNSPTTSSTLQKVGHPPLPPGQPPLSQEGGST 826 Query: 1129 NVASAVPNPLSSLLSTLVAKGLISSPSKEM--PTLTSPQVASRLPKQXXXXXXXXXXXXX 956 S +P+S+LLS+LVAKGLIS+ E P + ++ K Sbjct: 827 AKDSNAKDPISNLLSSLVAKGLISASKSESTTPLPSHKPTEVQIQKLPTTTVSSISPGSA 886 Query: 955 XXXXXXXXSGNDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFD 776 ++ K ++ +++ K E+KN +G EFKP+ IRE HPSVI +LFD Sbjct: 887 SSIVPGSSRRDNAPLAEQVVKPSAALAQSTKTEKKNPIGFEFKPDKIRELHPSVIDELFD 946 Query: 775 DLPHKCSICGHRLKFQEQLDLHLEWHASKT--------LSRRWYPSLGVWVAGNEGSSSG 620 DL HKC +CG RLK +E+LD HLEWHA KT SR WY + WV G GSSS Sbjct: 947 DLQHKCILCGLRLKLKERLDRHLEWHALKTPEADGSIKASRGWYANSANWVTGKAGSSSD 1006 Query: 619 PSVE--------TAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPA 464 T +EP VPADESQC CI+CG FEDFY + D+WM+KGA YM++PA Sbjct: 1007 LDSNNSNDMTGMTVASNEPTVPADESQCACIICGNTFEDFYCQESDDWMFKGAVYMTVPA 1066 Query: 463 VDGDIGTTDGCASLGPIVHANCASPTSVSDLGL-SKNIKPEQMD 335 DG++GT G GPIVHA C S+ +LGL + +K E+ D Sbjct: 1067 GDGELGTAGGSVLKGPIVHATCIDENSLEELGLAATRVKLEKDD 1110 >ref|XP_006381311.1| hypothetical protein POPTR_0006s11660g [Populus trichocarpa] gi|550336013|gb|ERP59108.1| hypothetical protein POPTR_0006s11660g [Populus trichocarpa] Length = 908 Score = 494 bits (1271), Expect = e-136 Identities = 357/931 (38%), Positives = 469/931 (50%), Gaps = 98/931 (10%) Frame = -3 Query: 2839 MRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHVNPKYL 2660 MRHLFGTWKGVFPP LQMIEKELG + AVNGSS+GA SR 
+SQSQRPPNSIHVNPKYL Sbjct: 1 MRHLFGTWKGVFPPQPLQMIEKELGLAPAVNGSSAGAAASRSESQSQRPPNSIHVNPKYL 60 Query: 2659 EARQRLQQSNK-----------------------------------DPRL-------SQR 2606 E RQR+QQS++ DP + S R Sbjct: 61 E-RQRIQQSSRAKGVSNVLTVPVANSIEDVEGPDRAVSIDTRRPWVDPPVKTQTLQRSHR 119 Query: 2605 EASSEPVHEKKS-SAGYEYLESGSDLSRHSDLVIGRDYERVNEQ-DGLEKPLYGSGSNAA 2432 EA +EPVHEKK A YE E GSD+SR S L IGR RV EQ G E P YG+ SNAA Sbjct: 120 EALNEPVHEKKKIGAIYEDFEYGSDVSRKSGLGIGRASGRVAEQGQGQENPCYGTSSNAA 179 Query: 2431 ETNIGRRNAFDTQDGFSKYQAPKSAQVLSQLQPTDVGNRGSRGMNKSWKNSEEEEYIWDD 2252 E G+RN F+ + GF Y A KS+ V LQPT R G++ +WKNSEEEEYIWD Sbjct: 180 ELISGQRNGFNMKHGFPNYPASKSSMVDLHLQPTQRIGRSETGISANWKNSEEEEYIWD- 238 Query: 2251 MNSRLTDH---GGPDSSSADGWSTDDAEKPEIEDHLPQAHGEHDIGSRIYTETSSDSLSI 2081 M+SRL+DH G ++S D W DD++K ++E R+ ETSSDSLS Sbjct: 239 MHSRLSDHNAAGLSNNSRKDHWIPDDSDKMDLE--------------RLDGETSSDSLST 284 Query: 2080 AQRAQASFGHRTTSIWPSQEPRSVDGLKHISITTRISGHSEGHPXXXXXXXXXXXXXXXG 1901 Q+ A+ G R +S W E S DGL +T +GH EG+ Sbjct: 285 EQKEHATIGSRLSSPWKLPESHSTDGLILSGTSTTNTGHVEGYSATVGGVATSSRSSLGR 344 Query: 1900 -------------RTGFETLTGPSVVSIPNVGSMVDRVSGSGGFSGQ----QRHHSFT-E 1775 + G + T S++S +G + G+ SGQ QR S + Sbjct: 345 MAVRPRLGSSHIGKAGLASSTNTSLLSTETLGQQKFQSQGAASPSGQSPIRQRPSSPAFQ 404 Query: 1774 KDHLRIQSSQPGQKTSHLPGNLSQAPYGQLPQDSSLPVRPQNHIKSQPPQHIH----ASF 1607 + ++Q+S G++ H +++Q Y + LP Q + S P H S Sbjct: 405 ACYPQLQNS--GEQDYHQSQSMTQPDYRAQFSGNLLPSNVQ--LGSLPKLHSEDLQAPSL 460 Query: 1606 P--QLRHPGLFSQQLHSEPTQSLPSSQTPKP-LPQPSISGSPPIMGHSAPGLDVP----- 1451 P QL H SQ+ + +S Q +P LP S G+ SA P Sbjct: 461 PSFQLSHQHRLSQRRQPDSKESEAFGQIQRPHLPPVSNFGTSSTSVSSAADHLNPFTAGT 520 Query: 1450 -GQPSTGNLLAAIMKSGLLSSNSVTGGLPNPSFKDSGVLPSHLSIQHPLPSGPPTQLXXX 1274 GQ ST +LLAA+MK+G+LS + +G +P+ +F+D G +PS IQ PLPSGPP Q Sbjct: 521 SGQSSTSSLLAAVMKTGILSKIN-SGVVPDRNFQDIGKMPSQSIIQPPLPSGPPPQFSFS 579 Query: 1273 XXXXXXXXPLGSTSSLSTHPQ----RTXXXXXXXXXXXXXXXXXXXXXXXXSNVASAVPN 1106 + S SS Q ++ + PN Sbjct: 580 
EAR------IESASSAPAQSQDKLPTVSNISQRKDERPPPPLGSPPSSEQTTDAVNKAPN 633 Query: 1105 PLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXXXXXSG 926 P+S+LLS+LVAKGLIS+ E + QV S+L K+ G Sbjct: 634 PISNLLSSLVAKGLISTSKSETSSPLPTQVPSQLQKKNPSITSPSSEPISSATLHSSTVG 693 Query: 925 NDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKCSICG 746 + + K + +S+ KVE +L+G+EFKPE+IRE HP VIS LF+DLPH+CS+CG Sbjct: 694 EASIPEPDT-KCSVALSQTTKVEIDDLIGLEFKPEVIRELHPPVISSLFEDLPHRCSLCG 752 Query: 745 HRLKFQEQLDLHLEWHASKT--------LSRRWYPSLGVWVAGNEG-----SSSGPS--- 614 +LK +E+L HLEWH + +R WY LG W+ N+G SS P Sbjct: 753 LQLKLKERLHRHLEWHNQRKPESDGINGPTRGWYADLGHWLTVNDGLPLGVESSCPMDDF 812 Query: 613 VETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIGTTDG 434 ET E + V A E CVC+LCG+ FED+Y +R++WM+KGA M+LP+ DG +GT Sbjct: 813 EETTECDDKTVLAHEDHCVCVLCGKLFEDYYCEERNKWMFKGAVRMTLPSGDGQMGTAKE 872 Query: 433 CASLGPIVHANCASPTSVSDLGLSKNIKPEQ 341 A GP VH NC S +S+ DL L+ IK E+ Sbjct: 873 SAK-GPTVHVNCISESSLCDLVLASGIKMEK 902 >ref|XP_007027622.1| ENTH/VHS family protein, putative isoform 3 [Theobroma cacao] gi|508716227|gb|EOY08124.1| ENTH/VHS family protein, putative isoform 3 [Theobroma cacao] Length = 1091 Score = 476 bits (1224), Expect = e-131 Identities = 343/974 (35%), Positives = 463/974 (47%), Gaps = 138/974 (14%) Frame = -3 Query: 2851 IHSGMRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHVN 2672 +H MRHLFGTWKGVFPP LQMIEKELGF+ +NGSSSG TTSRPD SQRPP+SIHVN Sbjct: 146 VHQSMRHLFGTWKGVFPPQPLQMIEKELGFAPMINGSSSGTTTSRPDPLSQRPPHSIHVN 205 Query: 2671 PKYLEARQRLQQSNK----------------------------------DPRL------- 2615 PKYLE +QRLQQS++ DP + Sbjct: 206 PKYLE-KQRLQQSSRVKGMVNDMTETMSSSKEDSERPDRAAITAGRPYVDPSVKMNNIQR 264 Query: 2614 SQREASSEPVHEKKSSAGYEYLESGSDLSRHSDLVIGRDYERVNEQDGLEKPLYGSGSNA 2435 S R+ +EPV EK A + + GSDL + + +GR +V +Q G ++P YG+ S+ Sbjct: 265 SHRDMFNEPVREKNIGATFGDYDYGSDLLQTPGMGVGRTGGKVTDQ-GNDRPWYGATSSV 323 Query: 2434 AETNIGRRNAFDTQDGFSKYQAPKSAQVLSQLQPT-DVGNRGSRGMNKSWKNSEEEEYIW 2258 E +RN F+ + G Y A KS +LQ T ++ R S G++ 
SWKNSEEEE++W Sbjct: 324 TEMISSQRNGFNIKHGSQNYSASKSVNADPRLQATKNIAGRSSSGLSSSWKNSEEEEFMW 383 Query: 2257 DDMNSRLTDHGGPD---SSSADGWSTDDAEKPEIEDHLPQAHGEHDIGSRI--YTETSSD 2093 + M+SRL++H + +S D W+ D +EK + E L +A HD+GSR ET++D Sbjct: 384 E-MHSRLSEHDAANISNNSRKDHWTPDVSEKLDFETQLRKAQSVHDVGSRFDRERETTAD 442 Query: 2092 SLSIAQRAQASFGHRTTSIWPSQEPRSVDGLKHISITTRISGHSEGHPXXXXXXXXXXXX 1913 SLS Q+ + S+G R +S WP E DGL T GHSE + Sbjct: 443 SLSTEQKDKTSYGRRISSAWPLLESNKTDGLP-----TNNLGHSESYSATIGGLP----- 492 Query: 1912 XXXGRTGFETLTGPSVVSIPNVGSMVDRV-----SGSGGFSGQQRHH-----SFTEKDHL 1763 TG S S+ +G ++ SGS GQQR S E+ + Sbjct: 493 -----------TGASS-SLARIGMRPQKILANVASGSTSTLGQQRFQPLGTASPPEQSPM 540 Query: 1762 RIQSSQPGQKTSHLPGNLSQAPYGQLPQDSSLP--------------VRPQNHIKSQPPQ 1625 R S P H L + PQ SLP V H Sbjct: 541 RQHSPSPSFPGRHPHQQLQKLAEQDYPQAHSLPRTDPKPSHFSGKLNVGSHKHSSQASSA 600 Query: 1624 HIHASFPQLRHPGLFSQQLHSEPTQSLPSSQTPKPLP-QPSISGSPPIMGHSAP-----G 1463 I + P +P F Q + Q+ PSSQT KPLP Q S G+ +G ++ Sbjct: 601 LISSYQPSCHYP--FGQPPQPDSVQAEPSSQTQKPLPSQISKVGAASTLGIASEQANPLA 658 Query: 1462 LDVPGQPSTGNLLAAIMKSGLLSSNSVTGGLPNPSFKDSGVLPSHLSIQHPLPSGPPTQL 1283 + ST +LLAA+MKSG+LSSNS TG LPN +D G +PS Q PLP+GPP + Sbjct: 659 IGTSELSSTSSLLAAVMKSGILSSNSFTGSLPNKISQDVGQIPS----QPPLPNGPPPAV 714 Query: 1282 XXXXXXXXXXXPLGSTSS-----LSTHPQRTXXXXXXXXXXXXXXXXXXXXXXXXSNVAS 1118 ++S +T+ + S+ S Sbjct: 715 FTSSGLRVDSGTSSGSASHDALAATTNSSQGKVEQPPLPPGPPPPALVSNAPAQTSDAES 774 Query: 1117 AVPNPLSSLLSTLVA--------KGLISSPSKEMPT------------------------ 1034 NP+S+LLS+LVA K S S ++PT Sbjct: 775 KASNPISNLLSSLVAKGLISASKKDASSLLSHQIPTQMQESLGMERPTQMQESLGMERHT 834 Query: 1033 -LTSPQVASRLPKQXXXXXXXXXXXXXXXXXXXXXSGND--------LLFKGSAAKITST 881 + + +P + S +D + F A K + Sbjct: 835 QMQKESLGMEMPTESPNQSSGISTSSPLPASSIPSSSDDPSSSTMDEVSFAEPATKSSVA 894 Query: 880 VSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKCSICGHRLKFQEQLDLHLEW 701 + + +E +NL+G+EF+P++IRE H SVIS L DDLPH CS+CG RLK QE+LD HLE Sbjct: 895 LHQSAAMEEENLIGLEFRPDVIREFHSSVISKLLDDLPHCCSLCGLRLKLQERLDRHLEC 954 Query: 700 
HASKTLS--------RRWYPSLGVWVAGNEGSSSGPSV-------ETAEKSEPVVPADES 566 HA K R WY W+ G G + S +T KSE +VPADE+ Sbjct: 955 HAMKKTESEGSNRALRGWYARSDDWIGGKPGQFAFESTGSVNQLEKTTAKSELMVPADEN 1014 Query: 565 QCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIGTTDGCASLGPIVHANCASPT 386 Q C+LCGE FED++ R EWM+KGA Y+++P+ DG++GTT+G A GPIVHANC S + Sbjct: 1015 QYACMLCGELFEDYFCQIRGEWMFKGAVYLTIPSKDGEVGTTNGSAGNGPIVHANCISES 1074 Query: 385 SVSDLGLSKNIKPE 344 SV DLGL+ +K E Sbjct: 1075 SVHDLGLAGGVKLE 1088 >ref|XP_007027620.1| ENTH/VHS family protein, putative isoform 1 [Theobroma cacao] gi|508716225|gb|EOY08122.1| ENTH/VHS family protein, putative isoform 1 [Theobroma cacao] Length = 1125 Score = 476 bits (1224), Expect = e-131 Identities = 343/974 (35%), Positives = 463/974 (47%), Gaps = 138/974 (14%) Frame = -3 Query: 2851 IHSGMRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHVN 2672 +H MRHLFGTWKGVFPP LQMIEKELGF+ +NGSSSG TTSRPD SQRPP+SIHVN Sbjct: 180 VHQSMRHLFGTWKGVFPPQPLQMIEKELGFAPMINGSSSGTTTSRPDPLSQRPPHSIHVN 239 Query: 2671 PKYLEARQRLQQSNK----------------------------------DPRL------- 2615 PKYLE +QRLQQS++ DP + Sbjct: 240 PKYLE-KQRLQQSSRVKGMVNDMTETMSSSKEDSERPDRAAITAGRPYVDPSVKMNNIQR 298 Query: 2614 SQREASSEPVHEKKSSAGYEYLESGSDLSRHSDLVIGRDYERVNEQDGLEKPLYGSGSNA 2435 S R+ +EPV EK A + + GSDL + + +GR +V +Q G ++P YG+ S+ Sbjct: 299 SHRDMFNEPVREKNIGATFGDYDYGSDLLQTPGMGVGRTGGKVTDQ-GNDRPWYGATSSV 357 Query: 2434 AETNIGRRNAFDTQDGFSKYQAPKSAQVLSQLQPT-DVGNRGSRGMNKSWKNSEEEEYIW 2258 E +RN F+ + G Y A KS +LQ T ++ R S G++ SWKNSEEEE++W Sbjct: 358 TEMISSQRNGFNIKHGSQNYSASKSVNADPRLQATKNIAGRSSSGLSSSWKNSEEEEFMW 417 Query: 2257 DDMNSRLTDHGGPD---SSSADGWSTDDAEKPEIEDHLPQAHGEHDIGSRI--YTETSSD 2093 + M+SRL++H + +S D W+ D +EK + E L +A HD+GSR ET++D Sbjct: 418 E-MHSRLSEHDAANISNNSRKDHWTPDVSEKLDFETQLRKAQSVHDVGSRFDRERETTAD 476 Query: 2092 SLSIAQRAQASFGHRTTSIWPSQEPRSVDGLKHISITTRISGHSEGHPXXXXXXXXXXXX 1913 SLS Q+ + S+G R +S WP E DGL T GHSE + Sbjct: 477 SLSTEQKDKTSYGRRISSAWPLLESNKTDGLP-----TNNLGHSESYSATIGGLP----- 526 Query: 1912 
XXXGRTGFETLTGPSVVSIPNVGSMVDRV-----SGSGGFSGQQRHH-----SFTEKDHL 1763 TG S S+ +G ++ SGS GQQR S E+ + Sbjct: 527 -----------TGASS-SLARIGMRPQKILANVASGSTSTLGQQRFQPLGTASPPEQSPM 574 Query: 1762 RIQSSQPGQKTSHLPGNLSQAPYGQLPQDSSLP--------------VRPQNHIKSQPPQ 1625 R S P H L + PQ SLP V H Sbjct: 575 RQHSPSPSFPGRHPHQQLQKLAEQDYPQAHSLPRTDPKPSHFSGKLNVGSHKHSSQASSA 634 Query: 1624 HIHASFPQLRHPGLFSQQLHSEPTQSLPSSQTPKPLP-QPSISGSPPIMGHSAP-----G 1463 I + P +P F Q + Q+ PSSQT KPLP Q S G+ +G ++ Sbjct: 635 LISSYQPSCHYP--FGQPPQPDSVQAEPSSQTQKPLPSQISKVGAASTLGIASEQANPLA 692 Query: 1462 LDVPGQPSTGNLLAAIMKSGLLSSNSVTGGLPNPSFKDSGVLPSHLSIQHPLPSGPPTQL 1283 + ST +LLAA+MKSG+LSSNS TG LPN +D G +PS Q PLP+GPP + Sbjct: 693 IGTSELSSTSSLLAAVMKSGILSSNSFTGSLPNKISQDVGQIPS----QPPLPNGPPPAV 748 Query: 1282 XXXXXXXXXXXPLGSTSS-----LSTHPQRTXXXXXXXXXXXXXXXXXXXXXXXXSNVAS 1118 ++S +T+ + S+ S Sbjct: 749 FTSSGLRVDSGTSSGSASHDALAATTNSSQGKVEQPPLPPGPPPPALVSNAPAQTSDAES 808 Query: 1117 AVPNPLSSLLSTLVA--------KGLISSPSKEMPT------------------------ 1034 NP+S+LLS+LVA K S S ++PT Sbjct: 809 KASNPISNLLSSLVAKGLISASKKDASSLLSHQIPTQMQESLGMERPTQMQESLGMERHT 868 Query: 1033 -LTSPQVASRLPKQXXXXXXXXXXXXXXXXXXXXXSGND--------LLFKGSAAKITST 881 + + +P + S +D + F A K + Sbjct: 869 QMQKESLGMEMPTESPNQSSGISTSSPLPASSIPSSSDDPSSSTMDEVSFAEPATKSSVA 928 Query: 880 VSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKCSICGHRLKFQEQLDLHLEW 701 + + +E +NL+G+EF+P++IRE H SVIS L DDLPH CS+CG RLK QE+LD HLE Sbjct: 929 LHQSAAMEEENLIGLEFRPDVIREFHSSVISKLLDDLPHCCSLCGLRLKLQERLDRHLEC 988 Query: 700 HASKTLS--------RRWYPSLGVWVAGNEGSSSGPSV-------ETAEKSEPVVPADES 566 HA K R WY W+ G G + S +T KSE +VPADE+ Sbjct: 989 HAMKKTESEGSNRALRGWYARSDDWIGGKPGQFAFESTGSVNQLEKTTAKSELMVPADEN 1048 Query: 565 QCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIGTTDGCASLGPIVHANCASPT 386 Q C+LCGE FED++ R EWM+KGA Y+++P+ DG++GTT+G A GPIVHANC S + Sbjct: 1049 QYACMLCGELFEDYFCQIRGEWMFKGAVYLTIPSKDGEVGTTNGSAGNGPIVHANCISES 1108 Query: 385 SVSDLGLSKNIKPE 344 SV DLGL+ +K E Sbjct: 1109 SVHDLGLAGGVKLE 1122 >gb|EXB37772.1| Pre-mRNA cleavage 
complex 2 protein Pcf11 [Morus notabilis] Length = 1101 Score = 474 bits (1219), Expect = e-130 Identities = 355/971 (36%), Positives = 472/971 (48%), Gaps = 136/971 (14%) Frame = -3 Query: 2854 SIHSGMRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRP-PNSIH 2678 S+H MRHLFGTWKGVFP +L++IEKEL F+ A NGSS+GA TSRP++QS RP NSIH Sbjct: 180 SVHQSMRHLFGTWKGVFPLQTLRVIEKELDFAPAANGSSTGAATSRPETQSNRPLQNSIH 239 Query: 2677 VNPKYLEARQRLQQSNK-----------DPRLSQREAS---------------------- 2597 VNPKYLE RQRLQQ N+ D L +E S Sbjct: 240 VNPKYLE-RQRLQQPNRVSGMLKPILLWDHELEAKELSSDVSGSIANSIEDAESMERATS 298 Query: 2596 -------------------------SEPVHEKKSSAGYEYLESGSDLSRHSDLVIGRDYE 2492 SE +HEK S + SDL R+S L I R Sbjct: 299 IGTGRSWVDPSVKMHNLQRSTRGTTSEVIHEKNISVESPDYDYSSDLPRNSSLGIVRASG 358 Query: 2491 RVNEQDGLEKPLYGSGSNAAETNIGRRNAFDTQDGFSKYQAPKSAQVLSQLQPTD--VGN 2318 R+ EQ G EK +G GS+ AE+ G+RN+F+ + GF Y PKS +QLQ Sbjct: 359 RIAEQ-GNEKVWHGGGSSFAESVSGQRNSFNIKHGFPNYPGPKSISANTQLQSAQNISSR 417 Query: 2317 RGSRGMNKSWKNSEEEEYIWDDMNSRLTDHGGPDSSS---ADGWSTDDAEKPEIEDHLPQ 2147 R + SWKNSEEEE+ WDDMNSRLTDHG D S+ D + +DA+K EDH+ + Sbjct: 418 RSGAAASSSWKNSEEEEFTWDDMNSRLTDHGASDISTNFRVDRSAYEDADKSGFEDHIHK 477 Query: 2146 AHGEHDIGSRIYTETSSDSLSIAQRAQASFGHRTTSIWPSQEPRSVDGLKHISITTRISG 1967 D SR+ E S+D+ ++ Q +R +S W SQE S+DGL SG Sbjct: 478 PLSIRDYASRVNKEVSADTFAVEQ-------NRISSPWLSQESHSIDGLSR-------SG 523 Query: 1966 HSEGHPXXXXXXXXXXXXXXXGRTGFETLTGPSVVSIPNVGSMVD---------RVSGSG 1814 S GF T + P + G++ + S S Sbjct: 524 TSS--------------------FGFPTNSVPG-----STGALTQQRFPPPTLRQRSPSP 558 Query: 1813 GFSGQQRH---HSFTEKDHLRIQS-SQPGQKTSHLPGNLSQAPYGQLPQDSSLPVRPQNH 1646 S ++ H + TE+D + QS + P K S G ++ + Q QD SLPV P + Sbjct: 559 TLSARRPHLQLQNLTEQDRAKAQSPAHPDSKVSQSLGQSTREVHNQYAQD-SLPVLPSHV 617 Query: 1645 IKSQPPQHIHASFPQLRHPGLFSQQLHSEPTQSLPSSQTPK-PLPQPSISGSPPIMGHSA 1469 ++ + H + P RH F QQ+ + T S P Q K PLPQ S SG P +G SA Sbjct: 618 RLNKMVKSQHHNMPP-RHQYPFLQQV-EDSTDSEPLGQIQKLPLPQASNSGPPATLGSSA 675 Query: 1468 
P------GLDVPGQPSTGNLLAAIMKSGLLSSNSV-TGGLPNPSFKDSGVLPSHLSIQHP 1310 P ++ G ST +LLAA+MKSG+LS++S+ T L N +F+ S LPS Q P Sbjct: 676 PDRLNALAVETSGDSSTSSLLAAVMKSGILSNSSITTSSLSNLNFQSSAQLPSQAG-QPP 734 Query: 1309 LPSGPPTQLXXXXXXXXXXXPLGSTSSL--STHP---------QRTXXXXXXXXXXXXXX 1163 LP+G T L STSS+ S+H Q+ Sbjct: 735 LPTGTHTNLGSKAT---------STSSISHSSHDGLSVSSKIFQKKTQSAPLPTGPPPSS 785 Query: 1162 XXXXXXXXXXSNVASAVPNPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXX 983 S+VA+ P+P+S+LLS+LVAKGLIS+ KE P P V + K+ Sbjct: 786 SPLRSASENASSVANNTPDPISNLLSSLVAKGLISASKKESPQAIPPVVPTETQKKSPSI 845 Query: 982 XXXXXXXXXXXXXXXXXSGND----------------LLFKGSAAKITSTV--------- 878 S D K + +I + + Sbjct: 846 TGTGSVPVSLVSGSTVSSTRDDSSISEPTADSPVSLPESTKSTNLEIKNLIGFDFKPDES 905 Query: 877 SKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKCSICGHRLKFQEQLDLHLEWH 698 +K +E KNL+G +FKP+++RE HPSV+SDL D H+C++CG +LK +E+L HLEWH Sbjct: 906 TKSTNLEIKNLIGFDFKPDVVREFHPSVVSDLLDGFEHQCNMCGLQLKLKERLTRHLEWH 965 Query: 697 ASKTL--------SRRWYPSLGVWVAGNEGSSSG----PSVE---TAEKSEPVVPADESQ 563 +K L SR WY + W+ G G SSG SV+ +K E +V ADESQ Sbjct: 966 NTKKLDANGPTKASRMWYANPSDWINGVAGFSSGLESAKSVDKPGKTDKGESMVVADESQ 1025 Query: 562 CVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIGTTDGCASLGPIVHANCASPTS 383 CVC+LCGE FEDFY +RDEWM+KGA +M +P+ G+ G+ + GPIVHANC S S Sbjct: 1026 CVCVLCGEIFEDFYCQERDEWMFKGAMHMIIPSATGETGSNGEGSRKGPIVHANCISECS 1085 Query: 382 VSDLGLSKNIK 350 + DLGL IK Sbjct: 1086 LQDLGLVSRIK 1096 >ref|XP_002528590.1| conserved hypothetical protein [Ricinus communis] gi|223531986|gb|EEF33798.1| conserved hypothetical protein [Ricinus communis] Length = 1123 Score = 472 bits (1214), Expect = e-130 Identities = 351/934 (37%), Positives = 462/934 (49%), Gaps = 100/934 (10%) Frame = -3 Query: 2851 IHSGMRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHVN 2672 +HS MRHLFGTWKGVFPP SLQMIEKELGF++A+NGSSS A TSR DSQS+R SIH+N Sbjct: 174 VHSSMRHLFGTWKGVFPPQSLQMIEKELGFASALNGSSSSAATSRLDSQSRR---SIHIN 230 Query: 2671 PKYLEARQRLQQSNK-----------------------------------DPRL------ 2615 PK LE Q LQQS++ 
DP + Sbjct: 231 PKILEI-QHLQQSSRAKGMATDLTVPIPNTAEDVERPERAASIAAGRSWVDPPVKMHNIQ 289 Query: 2614 -SQREASSEPVHEKKSSAGYEYLESGSDLSRHSDLVIGRDYERVNEQDGLEKPLYGSGSN 2438 +QRE S+P HEKK + Y E S++SR S L IGR RV +G EKP YG+G++ Sbjct: 290 HTQREILSDPGHEKKIGSTYGDFEYNSEISRISGLGIGRTSGRV-AAEGHEKPWYGAGNS 348 Query: 2437 AAETNIGRRNAFDTQDGFSKYQAPKSAQVLSQLQPTDVG-NRGSRGMNKSWKNSEEEEYI 2261 A ET G++N F + GF Y K V LQ T ++ + ++ SWKNSEEEE++ Sbjct: 349 ATETISGQKNGFTVKHGFPNYSTSKPVNVDLHLQRTQSNASKSTTAVSASWKNSEEEEFM 408 Query: 2260 WDDMNSRLTDHGGPD---SSSADGWSTDDAEKPEIEDHLPQAHGEHDIGSRIYTETSSDS 2090 WD M+SRL+DH + +S D W+ D +EK E E+ + ++ SR ETSSDS Sbjct: 409 WD-MHSRLSDHDAANLSITSRKDRWTPDGSEKLEFENQFRKPQNALEVMSRFERETSSDS 467 Query: 2089 LSIAQRAQASFGHRTTSIWPSQEPRSVDGLKHISITTRISGHSEGH-------------- 1952 S QR Q S GHR +S W +E DGL + +G ++G+ Sbjct: 468 QSTEQREQISLGHRLSSPWRLKESHPTDGLLIPGSSGSNTGQTDGYSATLGGLSASSSLA 527 Query: 1951 --PXXXXXXXXXXXXXXXGRTGFE-TLTGPSVVS----IPNVGSMVDRVSGSGGFSG--- 1802 P ++G TL S +P+ S V + S F Sbjct: 528 RMPVRPHTGNSGSGFSANTKSGSHGTLAQQRFQSPGAALPSGQSPVHQNPLSPSFPALYP 587 Query: 1801 QQRHHSFTEKDHLRIQS-SQPGQKTSHLPGNL--SQAPYGQLPQDSSLPVRPQNHIKSQP 1631 Q+ S E+D QS +P KT L GNL S+ G L + ++ ++ S P Sbjct: 588 NQQFQSSAEQDLPLSQSLPRPDYKTHQLSGNLLPSKVQPGSLKR-----LQNEDSPTSAP 642 Query: 1630 PQHIHASFPQLRHPGLFSQQLHSEPTQSLPSSQTPKP--LPQPSISGSPPIMGHSAPGLD 1457 P QL FSQ +E PS Q KP +P +I G+ SAP + Sbjct: 643 P----LPSIQLNRQYPFSQPRQAESKHVEPSGQIKKPHLIPVSNI-GTSSTSESSAPDMS 697 Query: 1456 VP------GQPSTGNLLAAIMKSGLLSSNSVTGGLPNPSFKDSGVLPSHLSIQHPLPSGP 1295 P GQ ST +LLAA+M SG+LSS + GGLP+ SF+D G PS SIQ PLPSGP Sbjct: 698 TPLSAQTSGQSSTSSLLAAVMSSGILSSIT-NGGLPSKSFQDVGKTPSQSSIQPPLPSGP 756 Query: 1294 PTQLXXXXXXXXXXXPLGSTSSLSTHPQRTXXXXXXXXXXXXXXXXXXXXXXXXSNVASA 1115 P Q S + S T SN + Sbjct: 757 PPQYKSSGARISSASAPLSDNDTSV----TSNISEKKEEQPPLPPGPPPSSIQSSNSVNK 812 Query: 1114 VPNPLSSLLSTLVAKGLISSPSKEMPTLTSPQVA----SRLPKQXXXXXXXXXXXXXXXX 947 NP+S+LLS+LVAKGLIS+ E + P+ S+ P Sbjct: 813 
AANPISNLLSSLVAKGLISASKSETSSPLPPESPTPSQSQNPTITNSSSKPASSVPASSA 872 Query: 946 XXXXXSGNDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLP 767 + ++ F K ++ V +P E ++L+G+EFK ++IRESHP VI LFDD P Sbjct: 873 TSLSSTKDEASFPKPDVKSSAAVPQPTAPEIESLIGLEFKSDVIRESHPHVIGALFDDFP 932 Query: 766 HKCSICGHRLKFQEQLDLHLEWHA-------SKTLSRRWYPSLGVWVAGNE----GSSSG 620 H+CSICG +LK +E+LD HLEWH RRWY LG WVAG G S Sbjct: 933 HQCSICGLQLKLKERLDRHLEWHIWSKPEPDGLNRVRRWYADLGNWVAGKAEIPFGIESS 992 Query: 619 PSVE----TAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGD 452 S++ T ++ EP+V ADE+QCVC+LCGE FED+YS R +WM+K A +++L GD Sbjct: 993 VSMDEFGRTVDEDEPMVLADENQCVCVLCGELFEDYYSQQRKKWMFKAAMHLTLSLKGGD 1052 Query: 451 IGTTDGCASLGPIVHANCASPTSVSDLGLSKNIK 350 IGT + S GPIVH NC S +SV DL L+ K Sbjct: 1053 IGTANE-NSKGPIVHVNCMSESSVHDLELTSGTK 1085 >ref|XP_007027621.1| ENTH/VHS family protein, putative isoform 2 [Theobroma cacao] gi|508716226|gb|EOY08123.1| ENTH/VHS family protein, putative isoform 2 [Theobroma cacao] Length = 1091 Score = 471 bits (1212), Expect = e-130 Identities = 335/941 (35%), Positives = 456/941 (48%), Gaps = 105/941 (11%) Frame = -3 Query: 2851 IHSGMRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHVN 2672 +H MRHLFGTWKGVFPP LQMIEKELGF+ +NGSSSG TTSRPD SQRPP+SIHVN Sbjct: 180 VHQSMRHLFGTWKGVFPPQPLQMIEKELGFAPMINGSSSGTTTSRPDPLSQRPPHSIHVN 239 Query: 2671 PKYLEARQRLQQSNK--------DPRLSQREASSEPVHEKKSSAGYEYLESGSDLSRHSD 2516 PKYLE +QRLQQS++ +S + SE +AG Y++ ++ Sbjct: 240 PKYLE-KQRLQQSSRVKGMVNDMTETMSSSKEDSERPDRAAITAGRPYVDPSVKMNTPG- 297 Query: 2515 LVIGRDYERVNEQDGLEKPLYGSGSNAAETNIGRRNAFDTQDGFSKYQAPKSAQVLSQLQ 2336 + +GR +V +Q G ++P YG+ S+ E +RN F+ + G Y A KS +LQ Sbjct: 298 MGVGRTGGKVTDQ-GNDRPWYGATSSVTEMISSQRNGFNIKHGSQNYSASKSVNADPRLQ 356 Query: 2335 PT-DVGNRGSRGMNKSWKNSEEEEYIWDDMNSRLTDHGGPD---SSSADGWSTDDAEKPE 2168 T ++ R S G++ SWKNSEEEE++W+ M+SRL++H + +S D W+ D +EK + Sbjct: 357 ATKNIAGRSSSGLSSSWKNSEEEEFMWE-MHSRLSEHDAANISNNSRKDHWTPDVSEKLD 415 Query: 2167 IEDHLPQAHGEHDIGSRI--YTETSSDSLSIAQRAQASFGHRTTSIWPSQEPRSVDGLKH 1994 
E L +A HD+GSR ET++DSLS Q+ + S+G R +S WP E DGL Sbjct: 416 FETQLRKAQSVHDVGSRFDRERETTADSLSTEQKDKTSYGRRISSAWPLLESNKTDGLP- 474 Query: 1993 ISITTRISGHSEGHPXXXXXXXXXXXXXXXGRTGFETLTGPSVVSIPNVGSMVDRV---- 1826 T GHSE + TG S S+ +G ++ Sbjct: 475 ----TNNLGHSESYSATIGGLP----------------TGASS-SLARIGMRPQKILANV 513 Query: 1825 -SGSGGFSGQQRHH-----SFTEKDHLRIQSSQPGQKTSHLPGNLSQAPYGQLPQDSSLP 1664 SGS GQQR S E+ +R S P H L + PQ SLP Sbjct: 514 ASGSTSTLGQQRFQPLGTASPPEQSPMRQHSPSPSFPGRHPHQQLQKLAEQDYPQAHSLP 573 Query: 1663 --------------VRPQNHIKSQPPQHIHASFPQLRHPGLFSQQLHSEPTQSLPSSQTP 1526 V H I + P +P F Q + Q+ PSSQT Sbjct: 574 RTDPKPSHFSGKLNVGSHKHSSQASSALISSYQPSCHYP--FGQPPQPDSVQAEPSSQTQ 631 Query: 1525 KPLP-QPSISGSPPIMGHSAP-----GLDVPGQPSTGNLLAAIMKSGLLSSNSVTGGLPN 1364 KPLP Q S G+ +G ++ + ST +LLAA+MKSG+LSSNS TG LPN Sbjct: 632 KPLPSQISKVGAASTLGIASEQANPLAIGTSELSSTSSLLAAVMKSGILSSNSFTGSLPN 691 Query: 1363 PSFKDSGVLPSHLSIQHPLPSGPPTQLXXXXXXXXXXXPLGSTSS-----LSTHPQRTXX 1199 +D G +PS Q PLP+GPP + ++S +T+ + Sbjct: 692 KISQDVGQIPS----QPPLPNGPPPAVFTSSGLRVDSGTSSGSASHDALAATTNSSQGKV 747 Query: 1198 XXXXXXXXXXXXXXXXXXXXXXSNVASAVPNPLSSLLSTLVA--------KGLISSPSKE 1043 S+ S NP+S+LLS+LVA K S S + Sbjct: 748 EQPPLPPGPPPPALVSNAPAQTSDAESKASNPISNLLSSLVAKGLISASKKDASSLLSHQ 807 Query: 1042 MPT-------------------------LTSPQVASRLPKQXXXXXXXXXXXXXXXXXXX 938 +PT + + +P + Sbjct: 808 IPTQMQESLGMERPTQMQESLGMERHTQMQKESLGMEMPTESPNQSSGISTSSPLPASSI 867 Query: 937 XXSGND--------LLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDL 782 S +D + F A K + + + +E +NL+G+EF+P++IRE H SVIS L Sbjct: 868 PSSSDDPSSSTMDEVSFAEPATKSSVALHQSAAMEEENLIGLEFRPDVIREFHSSVISKL 927 Query: 781 FDDLPHKCSICGHRLKFQEQLDLHLEWHASKTLS--------RRWYPSLGVWVAGNEGSS 626 DDLPH CS+CG RLK QE+LD HLE HA K R WY W+ G G Sbjct: 928 LDDLPHCCSLCGLRLKLQERLDRHLECHAMKKTESEGSNRALRGWYARSDDWIGGKPGQF 987 Query: 625 SGPSV-------ETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLP 467 + S +T KSE +VPADE+Q C+LCGE FED++ R EWM+KGA Y+++P Sbjct: 988 AFESTGSVNQLEKTTAKSELMVPADENQYACMLCGELFEDYFCQIRGEWMFKGAVYLTIP 1047 
Query: 466 AVDGDIGTTDGCASLGPIVHANCASPTSVSDLGLSKNIKPE 344 + DG++GTT+G A GPIVHANC S +SV DLGL+ +K E Sbjct: 1048 SKDGEVGTTNGSAGNGPIVHANCISESSVHDLGLAGGVKLE 1088 >ref|XP_006851712.1| hypothetical protein AMTR_s00040p00210200 [Amborella trichopoda] gi|548855292|gb|ERN13179.1| hypothetical protein AMTR_s00040p00210200 [Amborella trichopoda] Length = 1173 Score = 467 bits (1201), Expect = e-128 Identities = 368/1031 (35%), Positives = 468/1031 (45%), Gaps = 190/1031 (18%) Frame = -3 Query: 2854 SIHSGMRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHV 2675 SIH+GM HLF TWKGVFPPA LQ+IEK+L F A N SSSGA SRPDSQ RPP+SIHV Sbjct: 183 SIHAGMHHLFRTWKGVFPPAPLQIIEKQLDFPPATNSSSSGAPASRPDSQ--RPPHSIHV 240 Query: 2674 NPKYLEARQRLQQSNKDPRLS-----------------------------------QREA 2600 NPKYLEARQRLQQS++ +S QR Sbjct: 241 NPKYLEARQRLQQSSRAKGISADNNGVSLADHMESSDRAMTSGSPKQWPDLPVKNIQRPQ 300 Query: 2599 SSEPVHE----KKSSAGYEYLESGSDLSRHSDLVIGRDYERVNE-QDGLEKPLYGSGSNA 2435 S EP+ E KK S GY + SD +R SD+ R ERV E ++GL++ YG G Sbjct: 301 SGEPLSESLFGKKPSTGYGDYKFASDRARRSDIRTVRSIERVVEKEEGLDRGRYG-GVEG 359 Query: 2434 AETN--IGRRNAFDTQ--------DGFSKYQAPKSAQVLSQLQPTD--VGNRGSRGMNKS 2291 TN G +N D + ++ + A V+ QL P G G G++++ Sbjct: 360 TTTNPPFGPKNGHSMPQLPQRGLTDAYGSHRPSRPAHVVPQLPPPQDVAGKSGRGGISRN 419 Query: 2290 WKNSEEEEYIWDDMNSRLTDHGGPDSSSADGWSTDDA----------------------- 2180 WKNSEEEEY+WDDMNSRLT+HGG D SS D W +DDA Sbjct: 420 WKNSEEEEYMWDDMNSRLTEHGGADRSSKDPWVSDDAGNPTSMTRGKWMPSESDPLDANW 479 Query: 2179 ---------EKP-----------EIEDHLPQAHGEHDIGSRIYTETSSDSLSIAQRAQAS 2060 EKP E +D Q+HG+ DI R +TS++S S + Sbjct: 480 NSLETSSRLEKPIVGEDGMSLKREPDDPQLQSHGQQDIDPRSRRDTSAESPSQGG-GPSE 538 Query: 2059 FGHRTTSIWPSQEPRS----------VDGLKHISITTRISGHSEGHPXXXXXXXXXXXXX 1910 F R S WP Q+ S VDGL + T ++ S G Sbjct: 539 FERRLLSGWPPQQNMSMSQLRPRIHPVDGLIQTGLPTSLASSSFG--------------- 583 Query: 1909 XXGRTGFETLTGPSVVSIPN-VGSMVDRVSGSGGFSGQQRH------------------- 1790 + G ++ G + SIP+ G + GS G G QR Sbjct: 584 
---KAGNQSNLGMPLGSIPSSFGPTSQMIPGSSGLFGHQRQQPQRPPSPSSQLPFHHLPY 640 Query: 1789 ------------HSFTEKDHLRIQS-SQPGQKTSHLPGNLSQAPYGQLPQ--DSSLPVRP 1655 H + QS +QPGQK S +Q P+ +SS+ Sbjct: 641 SSQIPLHQPPSLHDLDPMQQAQAQSFTQPGQKGSQAINQSTQNQDSFSPKRHNSSILQSL 700 Query: 1654 QNHIKSQPPQHIHASFPQLRHPGLFSQQLHSEPTQSLPSSQTPK---PLPQPSISGSPPI 1484 Q ++ QPP H + L P S+Q H + L Q P P QP G P Sbjct: 701 QAPLQIQPPLRFHGASSSLLPP---SKQGHHQ----LHFGQPPNLEIPHAQPPTFGPPRT 753 Query: 1483 MGHSAPGL------DVPGQPSTGNLLAAIMKSGLLSSNSV--------TGGLPNPSFKDS 1346 G+S GL + GQ ST LLA I++SG+L S T P DS Sbjct: 754 SGYSGAGLPKNLPVEPQGQSSTETLLATILQSGILPLESTPSNTQPLSTSSSAIPRHSDS 813 Query: 1345 GVLPSHLSIQHPLPSGPP------TQLXXXXXXXXXXXPLGSTSSLSTHPQRTXXXXXXX 1184 PS+L+IQ PLP+GPP + PLG+ SSLST P Sbjct: 814 MSTPSNLNIQPPLPTGPPPIPQTSSLPVTSVSSLLGPNPLGNMSSLSTQP---VGMLQPP 870 Query: 1183 XXXXXXXXXXXXXXXXXSNVASAVPNPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRL 1004 S+ AS V N LS LLS+LVAKGLIS+P+ E + + Sbjct: 871 LPPGPPPASSIAGSSQASSTASGVSNQLSGLLSSLVAKGLISAPTSESSNPPVSHAPTEV 930 Query: 1003 PKQXXXXXXXXXXXXXXXXXXXXXSGNDLLFKGSAAKITSTVSK------------PMKV 860 Q + +++++S P+ + Sbjct: 931 QHQTAVVATSATSMLSSRSLVSSTPPTSIPIDEPELWVSTSISSAPPQAPRVDTKDPIAI 990 Query: 859 ERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKCSICGHRLKFQEQLDLHLEWHASKT-- 686 E NL+GIEFKPE+IRE HPSVIS LFD +PH+CS CG R QE+L HLEWHASK Sbjct: 991 E-PNLIGIEFKPEVIRERHPSVISGLFDAMPHRCSACGLRFNRQEELSKHLEWHASKNHE 1049 Query: 685 ------LSRRWYPSLGVWVAGNEGSSSGPS-------VETAEKSEPVVPADESQCVCILC 545 + R WY SL WV G+ G S+G + + EK EPVVPADESQC+CILC Sbjct: 1050 QSSGKRVLRNWYVSLRNWVEGDVGPSTGDASFPLDEKLSNVEKEEPVVPADESQCICILC 1109 Query: 544 GEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIGTTDGCASLGPIVHANCASPTSVSDLGL 365 GEPFED+YSH+RDEWMYKGATYMS G+ G DG +S IVH NC S + DL Sbjct: 1110 GEPFEDYYSHERDEWMYKGATYMS-----GNGG--DGSSSPVSIVHVNCISKGAADDLLE 1162 Query: 364 SKNIKPEQMDG 332 ++N ++ DG Sbjct: 1163 AENDNVDKADG 1173 >ref|XP_006341164.1| PREDICTED: uncharacterized protein LOC102593629 [Solanum tuberosum] Length = 1046 Score = 450 bits (1157), Expect = e-123 
Identities = 326/924 (35%), Positives = 446/924 (48%), Gaps = 93/924 (10%) Frame = -3 Query: 2854 SIHSGMRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHV 2675 S+H GMRHLFGTWKGVFPP LQ+IEKELGF+ VNGSSSG TSRPD Q+QRP +SIHV Sbjct: 177 SVHPGMRHLFGTWKGVFPPQQLQLIEKELGFTTGVNGSSSG--TSRPDPQAQRPAHSIHV 234 Query: 2674 NPKYLEARQRLQQSNK----------------------------------DPRL--SQRE 2603 NPKYLEARQRLQQS K DP + +Q+E Sbjct: 235 NPKYLEARQRLQQSTKAKGAVSDISSTLNVNEDAERPERTTSVSSGRPWIDPSIKRAQKE 294 Query: 2602 ASSEPVHEKKSSAGYEYLESGSDLSRHSDLVIGRDYERVNEQDGLEKPLYGSGSNAAETN 2423 +E V EK Y + SDLSR + +GR ER EQ G +KP Y SG+ Sbjct: 295 KLNEHVPEKTIGTAYGDSDYVSDLSRRAAFGVGRGGERFKEQ-GFDKPWYDSGTGKI--- 350 Query: 2422 IGRRNAFDTQDGFSKY-QAPKSAQVLSQLQPTDVGNRGSRGMNKSWKNSEEEEYIWDDMN 2246 + +R+ D + GF Q ++ QL P+ + NR S ++SWKNSEEEEY+WDD+N Sbjct: 351 LNQRSGLDIKHGFQSIPQKSATSDAHPQLIPS-LPNRTSTLTDRSWKNSEEEEYMWDDVN 409 Query: 2245 SRLTDHGGPDSSSADGWSTDDAEKPEIEDHLPQAHGEHDIGSRIYTETSSDSLSIAQRAQ 2066 +++ D W+++D++K ++E+ L + D+G R +E S+DSLS +R Sbjct: 410 ----------NAAKDRWASEDSDKSDLENQLRRPQSTRDVGLRADSEASADSLSAEERGS 459 Query: 2065 ASFGHRTTSIWPSQEPRSVDGLKHISITTRISGHSEGHPXXXXXXXXXXXXXXXGRTGFE 1886 ASFG++ +++W S+E ++DG +H + H EG+ R ++ Sbjct: 460 ASFGNQMSAMW-SRESHALDGARHSASVQGAPVHPEGY--QTSFCGLSKAANSVSRASYK 516 Query: 1885 TLTGPSVVSIPNVGSMVDRVSGSGGF------------SGQQRHHSFTEKDHLRIQS--- 1751 TG V PN+G M + G S Q H L + Sbjct: 517 LQTGSVHVGTPNIGPMNATLESRGSIVQQGETLRAASPSAQSPMHQRPPSPSLITSNTNQ 576 Query: 1750 --SQPGQKTSHLPGNLSQAPYGQLPQDSSLPVRPQ----------------NHIKSQPPQ 1625 + PG++ + S Q+ + S+L R Q N + QPP Sbjct: 577 VINSPGEQYQMQTSSRSDPRLSQISRRSNLDPRNQFAQESLAMPSRNSVSVNSQRQQPPS 636 Query: 1624 HIHASFPQLRHPGLFSQQLHSEPTQSLPSSQTPKPLPQPSISGSPPIMGHSAPGLDVPGQ 1445 ++S H Q H +SL S + Q S +P I G P Sbjct: 637 LQNSSALSSSH-----QSRHKVQRESLESEYS----GQTKNSTAPQISG-------FPDP 680 Query: 1444 PSTGNLLAAIMKSGLLSSNSVTGGLPNPSFKDSGVLPSHLSIQ-HPL---PSGPPTQLXX 1277 ST +LLAA++KSG++ + S +G S D G L S S Q HP PSGP L Sbjct: 681 
SSTSSLLAAVLKSGVIGNKSSSG--TTSSSLDKGALSSQASAQPHPAQFSPSGPRIPLAS 738 Query: 1276 XXXXXXXXXPLGSTSSLSTHPQRTXXXXXXXXXXXXXXXXXXXXXXXXSNVASAVPNPLS 1097 + S+ +PQR N + +PLS Sbjct: 739 VTSLSMDR----NASNPPNYPQRN--VEQPPLPPGLPRTLVGSASLQTPNAPNTASSPLS 792 Query: 1096 SLLSTLVAKGLISSPSKE----MPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXXXXXS 929 S+LSTLVAKGLIS+ K+ P+ T PQ + +P Sbjct: 793 SILSTLVAKGLISASKKDPPIYTPSDTPPQTQNLIPPASSISTPALSAPISASVPSSAPK 852 Query: 928 GNDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKCSIC 749 ++L +AK + + E K+L+G+ FKP++IR SHP+VISDL DD+PH+C IC Sbjct: 853 -DELSHSKPSAKTLEVLLQSTNEEAKSLIGLVFKPDVIRNSHPAVISDLLDDVPHQCGIC 911 Query: 748 GHRLKFQEQLDLHLEWHASK-------TLSRRWYPSLGVWVAGNEG-----SSSGP---S 614 G LK QE+LD HLEWH+ + SR+WY + G W+A G S GP S Sbjct: 912 GFGLKLQEKLDRHLEWHSLRNPDVKLLNNSRKWYLNSGEWIAAFGGLPCGDKSKGPAGGS 971 Query: 613 VETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIGTTDG 434 ET+E +E +VPADE QCVC+LCGE FEDFY+ + DEWM+K A YMS+P + Sbjct: 972 SETSECTETMVPADECQCVCVLCGEFFEDFYNEESDEWMFKDAVYMSIP-------SESD 1024 Query: 433 CASLGPIVHANCASPTSVSDLGLS 362 C GPIVH NC S +S +LGL+ Sbjct: 1025 CQ--GPIVHKNCISESSCQELGLA 1046 >ref|XP_006430295.1| hypothetical protein CICLE_v10010952mg [Citrus clementina] gi|557532352|gb|ESR43535.1| hypothetical protein CICLE_v10010952mg [Citrus clementina] Length = 829 Score = 434 bits (1117), Expect = e-119 Identities = 313/864 (36%), Positives = 437/864 (50%), Gaps = 63/864 (7%) Frame = -3 Query: 2743 SSSGATTSRPDSQSQRPPNSIHVNPKYLEARQRLQQSNKDPRLSQREASSEPVHEKKSSA 2564 +SS RPD S S+ + +++ ++Q S QR+A SEP+HEK A Sbjct: 6 ASSTVDAERPDRAS-----SMSASRPWVDPTVKMQHS-------QRDALSEPIHEKNIGA 53 Query: 2563 GYEYLESGSDLSRHSDLVIGRDYERVNEQDGLEKPLYGSGSNAAETNIGRRNAFDTQDGF 2384 +Y + GS+LSR S L GR RV++Q G EKP YGSGSN +ET G+RN F+ + GF Sbjct: 54 YGDY-DYGSELSRSSGLGSGRTTGRVSDQ-GYEKPWYGSGSNISETIAGQRNGFNKKQGF 111 Query: 2383 SKYQAPKSAQVLSQLQPTDVGNRGSRGMNKSWKNSEEEEYIWDDMNSRLTDHGGPD---S 2213 Y A KSA + LQ + S SWKNSEEEE++W DM+ R +DH + + Sbjct: 112 
PNYSASKSANAAAHLQQVQSIPKSSSSGLSSWKNSEEEEFMW-DMHPRTSDHDAANISKN 170 Query: 2212 SSADGWSTDDAEKPEIEDHLPQAHGEHDIGSRIYTETSSDSLSIAQRAQASFGHRTTSIW 2033 S D + D EK E+++HL + G HD+ S ETSSDSLS Q+ QA++ H+ S W Sbjct: 171 SRKDHLAVDGPEKLELDNHLRKPQGIHDVSSSFDRETSSDSLSTEQKDQAAYRHQMPSPW 230 Query: 2032 PSQEPRSVDGLKHISI------TTRISGHSEGHPXXXXXXXXXXXXXXXGRTGFETLTGP 1871 +E DGL ++ ++ + GHP G +GF TL Sbjct: 231 QLKE---ADGLIAATLGGFPASSSSSLARTGGHP--------PVVSSHIGTSGFGTLASS 279 Query: 1870 SVVSIPNVGSMVDRVSGSGGFSGQ--QRHHS----------------FTEKDHLRIQS-S 1748 + S ++ + + + +G SG HHS T++D+ Q S Sbjct: 280 ASGSTGSLATQRFQSARAGSPSGHSPMHHHSPSPSVPAHHPRQNMQNCTDRDYPHAQPLS 339 Query: 1747 QPGQKTSHLPGNLSQAPYGQLPQDSSLPVRPQNHIKSQP---PQHIHASFPQLRHPGLFS 1577 +P KTS PG +S P G +DS + P + + + P PQ + S P + S Sbjct: 340 RPDLKTSSFPGLVSSGPRGHSTKDSPSILHPNSQLGNLPKVQPQDLKGS-----SPAVTS 394 Query: 1576 QQLHSEPTQSLPSSQTPKPLPQPSISGSP----PIMGHSAPGLDVP--GQPSTGNLLAAI 1415 QL+ + + L LPQ S G+P + HS P LD GQ T +LLA++ Sbjct: 395 FQLNCQSQKPL--------LPQVSNFGAPSTKEAVSDHSNP-LDAEGLGQSGTSSLLASV 445 Query: 1414 MKSGLLSSNSVTGGLPNPSFKDSGVLPSHLSIQHPLPSG-PPTQLXXXXXXXXXXXPLGS 1238 +KSG+L+S S+T GL N + K+ G +P L IQ PLPSG PP L L Sbjct: 446 LKSGILNS-SITDGLANRALKEVGQIPLQLDIQPPLPSGPPPPSLLTSSGARVGSGSLSG 504 Query: 1237 TS-----SLSTHPQRTXXXXXXXXXXXXXXXXXXXXXXXXSNVASAVPNPLSSLLSTLVA 1073 S + T QR S+V S NP+S+LLSTLVA Sbjct: 505 PSQEDPPATMTSSQR-KVEQPPLPPGPPPSSLASSTSPKASSVESKTSNPISNLLSTLVA 563 Query: 1072 KGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXXXXXSGNDLLFKGS--- 902 KGLIS+ E P+ T+PQV SR+ + + + + S Sbjct: 564 KGLISASKTEPPSHTTPQVTSRMQNESPGISSSSPATVSSVPNLLPIPPSSTVDETSLPA 623 Query: 901 -AAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKCSICGHRLKFQE 725 A + + +S+ VE +NL+G++FKP++IRE H SVI LFD PH CSICG RLK QE Sbjct: 624 PAGESSFALSESTTVETQNLIGLKFKPDVIREFHESVIKRLFDGFPHLCSICGLRLKLQE 683 Query: 724 QLDLHLEWHASK--------TLSRRWYPSLGVWVAGNEGSSSG--------PSVETAEKS 593 QLD HLEWHA + +SRRWY + WVAG G G S +T ++ Sbjct: 684 QLDRHLEWHALRKPGLDDVDKISRRWYANSDDWVAGKAGLPLGLESISCMEDSGKTIDEG 
743 Query: 592 EPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIGTTDGCASLGPI 413 EP+VPAD++QC C++CGE FED Y+ R EWM+K A YM +P+ +G++GTT+ ++ GPI Sbjct: 744 EPMVPADDNQCACVMCGELFEDCYNQARGEWMFKAAVYMMIPSGNGEVGTTNESSAKGPI 803 Query: 412 VHANCASPTSVSDLGLSKNIKPEQ 341 VH NC S SV DL + +K E+ Sbjct: 804 VHGNCISENSVHDLRVISKVKVEK 827 >ref|XP_004246564.1| PREDICTED: uncharacterized protein LOC101244024 [Solanum lycopersicum] Length = 1040 Score = 426 bits (1096), Expect = e-116 Identities = 326/931 (35%), Positives = 445/931 (47%), Gaps = 100/931 (10%) Frame = -3 Query: 2854 SIHSGMRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHV 2675 S+H GMRHLFGTWKGVFPP LQ+IEKELGF+ VNGSSSG TSRPD Q+QRP +SIHV Sbjct: 171 SVHPGMRHLFGTWKGVFPPQQLQLIEKELGFTTGVNGSSSG--TSRPDPQAQRPAHSIHV 228 Query: 2674 NPKYLEARQRLQQSNK----------------------------------DPRL--SQRE 2603 NPKYLEARQRLQQS + DP + +Q+E Sbjct: 229 NPKYLEARQRLQQSTRAKGAASDISSTVNVNEDAERPERTTSVSSGRSWIDPSIKRAQKE 288 Query: 2602 ASSEPVHEKKSSAGYEYLESGSDLSRHSDLVIGRDYERVNEQDGLEKPLYGSGSNAAETN 2423 +E V EK SA Y + SDL + +GR ER EQ G +KP Y SG+ Sbjct: 289 KLNEHVPEKTISAAYGDSDYASDLPSRAAFGVGRGGERFKEQ-GFDKPWYDSGAGKI--- 344 Query: 2422 IGRRNAFDTQDGFSKY-QAPKSAQVLSQLQPTDVGNRGSRGMNKSWKNSEEEEYIWDDMN 2246 + +R++ DT+ F Q ++ QL P+ + NR S ++SWKNSEEEEY+WDD+N Sbjct: 345 LSQRSSLDTKHDFQSIPQKSATSDAHPQLIPS-LPNRTSTLTDRSWKNSEEEEYMWDDVN 403 Query: 2245 SRLTDHGGPDSSSADGWSTDDAEKPEIEDHLPQAHGEHDIGSRIYTETSSDSLSIAQRAQ 2066 +++ D W+++D++K ++E+ L + ++G R +E S+DS S +R Sbjct: 404 ----------NAAKDRWASEDSDKSDLENQLRRPQSIREVGLRADSEASADSPSAEERGP 453 Query: 2065 ASFGHRTTSIWPSQEPRSVDGLKHISITTRISGHSEGHPXXXXXXXXXXXXXXXGRTGFE 1886 ASFG++ +++W S+ ++DG +H + HSEG+ R ++ Sbjct: 454 ASFGNQMSAMW-SRGSHALDGARHSASVQGAPVHSEGY--QTSFSGLSKVANSVSRASYK 510 Query: 1885 TLTGPSVVSIPNVGSMVDRVSGSGGF------------SGQQRHHSFTEKDHLRIQSSQ- 1745 TG V N+G M + G S Q H L +S Sbjct: 511 LQTGSVHVGTQNIGPMNATLESRGSIVQQGETLRAASPSAQSPMHHLPPSPSLITSNSNQ 570 Query: 1744 ---------PGQKTSHLPGNLSQA-------PYGQLPQDS-SLPVRPQNHIKSQ---PPQ 1625 Q +S LSQ P 
Q Q+S ++P R + SQ PP Sbjct: 571 VINSPAEQYQMQTSSRSDPRLSQISRRSNLDPRNQYAQESLTMPSRNTISVNSQRQHPPS 630 Query: 1624 HIHASFPQLRHPGLFSQQLHSEPTQSLPSSQTPKPLPQPSISGSPPIMGHSAPGLDVPGQ 1445 ++S H Q++ E +S S QT K P ISG P Sbjct: 631 LQNSSALSSSHQ--LRQKVQRESLESEYSVQT-KNSTVPEISG-------------FPDP 674 Query: 1444 PSTGNLLAAIMKSGLLSSNSVTGGLPNPSFKDSGVLPSHLSIQ-HPL---PSGPPTQLXX 1277 ST +LLAA++KSG++ + S +G S D G L S S Q HP SGP Sbjct: 675 SSTSSLLAAVLKSGVIGNKSSSG--TTSSSLDKGALSSQASAQPHPAQFSTSGP------ 726 Query: 1276 XXXXXXXXXPLGSTSSLSTHPQRTXXXXXXXXXXXXXXXXXXXXXXXXSNVASAVPN--- 1106 P S +SLS + +S PN Sbjct: 727 -------RIPPASVTSLSMDRNASNSPNYSQRNVEQPPLPPGLPPTLAGTASSQTPNAPN 779 Query: 1105 ----PLSSLLSTLVAKGLISSPSKE----MPTLTSPQVASRLPKQXXXXXXXXXXXXXXX 950 PLSS+LSTLVAKGLIS+ K+ P+ T PQ + +P Sbjct: 780 IASSPLSSILSTLVAKGLISASKKDPPIYTPSDTPPQTQNLIPPASSISTPALSAPTSSS 839 Query: 949 XXXXXXSGNDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDL 770 ++L +A+ + + MK E K+L+G+ FKP++IR SHP+VISDL DD+ Sbjct: 840 VPSSAHK-DELSHSKPSAETPEVLLQSMKEEAKSLIGLVFKPDVIRNSHPAVISDLVDDV 898 Query: 769 PHKCSICGHRLKFQEQLDLHLEWHASK-------TLSRRWYPSLGVWVAGNEG-----SS 626 P +C ICG KFQ +LD HLEWH+ + SR+WY + G W+A G S Sbjct: 899 PLQCGICGFGFKFQVKLDRHLEWHSLRNPDVKLLNNSRKWYLNSGEWIAAFGGLPCGDKS 958 Query: 625 SGP---SVETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDG 455 GP S ET+E +E +VPADE QCVC+LCGE FEDFY+ + DEWM+K A YMS+P Sbjct: 959 EGPAGGSSETSECTETMVPADECQCVCVLCGEFFEDFYNEESDEWMFKDAVYMSIP---- 1014 Query: 454 DIGTTDGCASLGPIVHANCASPTSVSDLGLS 362 + C GPIVH NC S +S +LG + Sbjct: 1015 ---SESDCQ--GPIVHKNCISESSCQELGFA 1040 >ref|XP_002277320.2| PREDICTED: uncharacterized protein LOC100251089 [Vitis vinifera] Length = 801 Score = 404 bits (1038), Expect = e-109 Identities = 259/591 (43%), Positives = 324/591 (54%), Gaps = 70/591 (11%) Frame = -3 Query: 2854 SIHSGMRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHV 2675 SIH GMRHLFGTWKGVFP A LQMIEKELGF A+NGSS G TSR DSQSQRPP+SIHV Sbjct: 167 SIHPGMRHLFGTWKGVFPLAPLQMIEKELGFPPAINGSSPGIATSRSDSQSQRPPHSIHV 226 
Query: 2674 NPKYLEARQRLQQSN----------------------------------------KDPRL 2615 NPKYLEARQRLQQS+ K + Sbjct: 227 NPKYLEARQRLQQSSRTKGAANDVTGTMVNSTEDADRLDRTAGINAGRPWDDLPAKSIQH 286 Query: 2614 SQREASSEPVHEKKSSAGYEYLESGSDLSRHSDLVIGRDYERVNEQDGLEKPLYGSGSNA 2435 S REA E V EKK A Y E G+DLSR+ L IGR E+ G +KP Y +G Sbjct: 287 SHREAIGELV-EKKIGAPYGDYEYGTDLSRNPGLGIGRPSEQ-----GHDKPWYKAGGRV 340 Query: 2434 AETNIGRRNAFDTQDGFSKYQAPKSAQVLSQLQPTD-VGNRGSRGMNKSWKNSEEEEYIW 2258 ET +RN FD + GF Y AP+SA + LQPT NR + GM++SWKNSEEEEY+W Sbjct: 341 VETFSSQRNGFDIKHGFPNYPAPRSANADAHLQPTQSTVNRSNSGMSRSWKNSEEEEYMW 400 Query: 2257 DDMNSRLTDHGGPDSSSADGWSTDDAEKPEIEDHLPQAHGEHDIGSRIYTETSSDSLSIA 2078 DDMNS++T+H + S D W+ DD+EK + E+ L + +D+GS + ETS+DS+S Sbjct: 401 DDMNSKMTEHSAANHSKKDRWTPDDSEKLDFENQLQKPQSIYDVGSSVDRETSTDSMSSE 460 Query: 2077 QRAQASFGHRTTSIWPSQEPRSVDGLKHISITTRISGHSEGHPXXXXXXXXXXXXXXXGR 1898 QR Q +FGHR +S+WP QEP S DGLKH +T I GHSEG+P R Sbjct: 461 QREQGAFGHRMSSLWPLQEPHSTDGLKHSGTSTLILGHSEGYP--TVSGLSTSASSSLAR 518 Query: 1897 TGFETLTGPSVVSIPNVGSMVDRVSGS-GGFSGQQRHHS-----------FTEKDHLRIQ 1754 TG L G S G + + SGS G GQQR S + DHL + Sbjct: 519 TGLRPLMGSSHAGASGFGFLTNASSGSTTGTVGQQRLQSVGAASPSGQSPMHQPDHLPVH 578 Query: 1753 S-SQPGQKTSHLPGNLSQAPYGQLPQDSSLPVRPQ----NHIKSQPPQHIHASFP----- 1604 S P K S G + + Q D +LP Q ++ P ++ + P Sbjct: 579 SLPLPDIKASQFSGQFNIGSHKQFTLD-ALPKLIQKAQLGDLQKLLPHNLQSLSPAVPSV 637 Query: 1603 QLRHPGLFSQQLHSEPTQSLPSSQTPK-PLPQPSISGSP-----PIMGHS-APGLDVPGQ 1445 +RH FS QL +P Q PS Q K LPQ SI +P P++ HS P + G+ Sbjct: 638 PIRHHAPFSPQLQPDPLQPEPSGQAQKTSLPQTSIFEAPSTIENPVLEHSNYPAAESTGK 697 Query: 1444 PSTGNLLAAIMKSGLLSSNSVTGGLPNPSFKDSGVLPSHLSIQHPLPSGPP 1292 ST NLLAA+MKSG+LS++SV+G +P SF+D+G + + IQ PLPSGPP Sbjct: 698 LSTSNLLAAVMKSGILSNSSVSGSIPKTSFQDTGAVLQSV-IQPPLPSGPP 747 >ref|XP_006339117.1| PREDICTED: uncharacterized protein LOC102597998 [Solanum tuberosum] Length = 1066 Score = 358 bits (920), Expect = 6e-96 Identities = 300/937 (32%), Positives = 422/937 (45%), Gaps = 97/937 (10%) Frame = -3 Query: 2854 
SIHSGMRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHV 2675 S+HSGM+ LF TW+ VFPP LQ+IEKELGF+ VNGSSSGA R DS++Q+ +SIHV Sbjct: 174 SVHSGMQRLFVTWRKVFPPQQLQLIEKELGFTTGVNGSSSGAR--RDDSKAQQTAHSIHV 231 Query: 2674 NPKYLEARQRLQQSNK-------------------------------DPRLSQREASSEP 2588 NPKYLEARQ LQQ + + Q+E +E Sbjct: 232 NPKYLEARQCLQQPTRAKGSADDITPGDIQKPERATSVGSERSWFDISAKCVQKEQLNER 291 Query: 2587 VHEKKSSAGYEYLESGSDLSRHSDLVIGRDYERVNEQDGLEKPLYGSGSNAAETNIGRRN 2408 + EK +SA Y E SDLSR S + E++ E+ G +K Y + + +RN Sbjct: 292 IREKTTSAAYGDPEYVSDLSRGSGFGLRITGEKLKEE-GRDKSWYNPANGKI---LSQRN 347 Query: 2407 AFDTQDGFSKYQAPKSAQVLSQLQPT-DVGNRGSRGMNKSWKNSEEEEYIWDDMNSRLTD 2231 D + G + +A + QPT N+ S M++SW++S+EEEY+WDD+N Sbjct: 348 GLDLKHGVQSL-SQNTANSDAYPQPTHSFANQSSTLMDRSWQSSDEEEYMWDDVNC---- 402 Query: 2230 HGGPDSSSADGWSTDDAEKPEIEDHLPQAHGEHDIGSRIYTETSSDSLSIAQRAQASFGH 2051 + D ++ D K +++ P+ ++ G + +E S+DSLS QAS + Sbjct: 403 ------ADKDQRASKDPYKTGLDNQHPRP--QNMFGLKAESEASADSLSREDNGQASSEN 454 Query: 2050 RTTSIWPSQEPRSVDGLKHISITTRISGHSEGHPXXXXXXXXXXXXXXXGRTGFETLTGP 1871 + +S+W D +H++ H GH F++ Sbjct: 455 QISSMWS-------DEARHLASVQSTPDHPRGH--LTSFSGLPTATNSIVGKSFQSQKDS 505 Query: 1870 SVVSIPNVGSMVDRVSGSGGFSGQQRHHSFTEKDHLRIQSSQPGQKTSHLPGNLSQA--- 1700 S V P+ G + +GS G Q R L Q S GN SQ Sbjct: 506 SHVGTPSYG-IAKTANGSRGTIMQPRETQGAAPPSLESAMRQLPPSPSISTGNFSQVVNS 564 Query: 1699 ---------------------------PYGQLPQDSSLPVRPQN-HIKSQP--------P 1628 P Q+PQDS LP+ Q+ H+ S P Sbjct: 565 LTRDYHTQTESHADPRMSQFSRRSNLDPRKQVPQDS-LPMTSQSAHLVSSQISQTPIYNP 623 Query: 1627 QHIHASFPQLRHPGLFSQQLHSEPTQSLPSSQTPKPLPQPSISGSPPIMGHSAP------ 1466 + +SF + H F +++ E P S+ P + ++ HS Sbjct: 624 SSMMSSFQEEHHVS-FPEKIQQES----PESEFSIPSQKSIVTQLSGFADHSGTVPSILQ 678 Query: 1465 GLDVPGQPSTGNLLAAIMKSGLLSSNSVTGGLPNPSFKDSGVLPSHLSIQHPLPSGPPTQ 1286 G + GQ S +LLAA+MKSG+L+S+S G N +D G L S Q P+PSGPP Q Sbjct: 679 GSESSGQTSMSSLLAAVMKSGVLNSSSSVGTPLNS--RDKGPLSSQAGAQPPIPSGPPIQ 736 Query: 1285 LXXXXXXXXXXXP-LGSTSSLSTHPQRTXXXXXXXXXXXXXXXXXXXXXXXXS-NVASAV 1112 L + S ++S P + + NV +A 
Sbjct: 737 LLSSGPKAPHSVVSVQSDRNVSNAPSYSQRNGERPRLPPDPAPTPVGSESLQAPNVVNAA 796 Query: 1111 PNPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXXXXX 932 NP++ LL++L+AKGLIS+ +E PT T P + Q Sbjct: 797 SNPVAKLLNSLMAKGLISASKEESPTSTPPPTPPQTRFQCPPASISSTPGVSAPISSSTC 856 Query: 931 SG--NDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKC 758 S ++L AAKI + + K ER++ FKP +IRES+P VIS+L DD+PH+C Sbjct: 857 SSQKDELSLSKPAAKIPDALPQSNKEEREDA----FKPGVIRESNPGVISELLDDVPHQC 912 Query: 757 SICGHRLKFQEQLDLHLEWHASKT-------LSRRWYPSLGVWVAGNEGS---------S 626 ICG RLK + QLD HLEWHA + RRWY + G W AG GS Sbjct: 913 GICGLRLKLRVQLDRHLEWHALRNPDGKRLHSERRWYLNSGEWFAGT-GSVPHCGILAVP 971 Query: 625 SGPSVETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIG 446 +G S + +E +E +VPADESQCVC+LCG+ FEDFY D+WM+KGA YM + I Sbjct: 972 TGGSSKLSECTEVMVPADESQCVCVLCGQVFEDFYDEKSDKWMFKGAVYMDDSLNESGI- 1030 Query: 445 TTDGCASLGPIVHANCASPTSVSDLGLSKNIKPEQMD 335 PIVH NC S S + + L +IK E D Sbjct: 1031 -------QNPIVHKNCTSEDSQNWM-LKDDIKQESED 1059 >ref|XP_003625749.1| Pre-mRNA cleavage complex 2 protein Pcf11 [Medicago truncatula] gi|355500764|gb|AES81967.1| Pre-mRNA cleavage complex 2 protein Pcf11 [Medicago truncatula] Length = 1039 Score = 351 bits (901), Expect = 9e-94 Identities = 291/919 (31%), Positives = 418/919 (45%), Gaps = 85/919 (9%) Frame = -3 Query: 2851 IHSGMRHLFGTWKGVFPPASLQMIEKELGFSAAVNGSSSGATTSRPDSQSQRPPNSIHVN 2672 +HS MRHLFGTW+GVFPP +LQ+IEKEL F+ AVNGS+S + T R DSQSQRP +SIHVN Sbjct: 184 VHSSMRHLFGTWRGVFPPQTLQIIEKELNFNPAVNGSASASATLRSDSQSQRPSHSIHVN 243 Query: 2671 PKYLEARQRLQQSNK---------------------------------DPRLSQ------ 2609 PKYLE RQRLQQS++ DPRL+ Sbjct: 244 PKYLE-RQRLQQSSRTKGVFDDMAGVISNANEGAERPDRALGAARPWLDPRLNMHNNQHT 302 Query: 2608 -REASSEPVHEKKSSAGYEYLESGSDLSRHSDLVIGRDYERVNEQDGLEKPLYGSGSNAA 2432 R A ++ V EK Y E S +S +GR R+ A Sbjct: 303 HRGALNDSVPEKSIGGAYGDDEYNSSVSNSLGSGVGRTGSRLI-------------GGVA 349 Query: 2431 ETNIGRRNAFDTQDGFSKYQAPKSAQVLSQLQPTDVGNRGSRGMNKSWKNSEEEEYIWDD 2252 ET G+RN F + FS ++APKS + D N S 
M+K+WKNSEEEE++WD+ Sbjct: 350 ETLSGQRNGFSLKHSFSNHEAPKSVNL-------DAHNIRSSAMSKNWKNSEEEEFMWDE 402 Query: 2251 MNSRLTDH--GGPDSSSADGWSTDDAEKPEIEDHLPQAHGEHDIGSRIYTETSSDSLSIA 2078 +N L+D+ ++ S+D W DD + E EDHL H IG+++ S+ + Sbjct: 403 VNPGLSDNVPNVSNNLSSDQWMADD-DNLESEDHLQFTH---PIGTKVNKGIST----VK 454 Query: 2077 QRAQASFGHRTTSIWPSQE--PRSVDGLKHISITTRISGHSEGHPXXXXXXXXXXXXXXX 1904 ++ +S GH + S W Q+ P + +K GHSE Sbjct: 455 KQLPSSGGHSSLS-WELQKQVPSAKLNMK--------PGHSE--------------IFVS 491 Query: 1903 GRTGFETLTGPSVVSIPNVGSMVDRVSGSGGFSGQQRHHSFTEKDHLRIQSSQPGQKTSH 1724 +G S I N SM G +GQQ+ S + QSS Q++ Sbjct: 492 APSGLPKNPNSSAARIRNQSSMPHTTIGMSKITGQQQFDSEGTESPSE-QSSPLRQQSPK 550 Query: 1723 LPGNLSQAPYGQ--LPQDSSLPVRPQNHIKSQPPQHIHASFPQLR---HPGLFSQQLHSE 1559 +P + P + QD ++ H+ Q+I P +R G + + Sbjct: 551 VPVTIRNPPSMRNLAEQDCPTTLKTSQHLGGLQSQYIRDPVPAIRSNVQVGNLRKSQEKD 610 Query: 1558 PTQSLPSSQTPKPLPQPSISGSPPI---------MGHSAPGLD--VPGQPSTGNLLAA-I 1415 L S+ + +P PQ GS + AP + V + ST L A Sbjct: 611 MRGPLSSATSFQPKPQQQQLGSSQAEVTLKAKQPLKSKAPLVKAKVTSEKSTTKCLPAPS 670 Query: 1414 MKSGLLSSNSVTGGLPNPSFKDSGVLPSHLSIQHPLPSGPPTQLXXXXXXXXXXXPLGS- 1238 +KSG++ + S+T L D+ PS + ++ P SG P+ LGS Sbjct: 671 VKSGIIPNKSITRNL------DASNRPSQIGVK-PTRSGGPSPATLISSGSPAMS-LGSP 722 Query: 1237 ---TSSLSTHPQRTXXXXXXXXXXXXXXXXXXXXXXXXSNVASA-VPNPLSSLLSTLVAK 1070 + +L PQ SN A+ NP+S+LLS+LVAK Sbjct: 723 DDYSPTLPKLPQGKAGKKQNDSTQPSTSSNNRGASAPSSNTANKNTLNPISNLLSSLVAK 782 Query: 1069 GLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXXXXXSGNDLLFKGSAAKI 890 GLIS+ ++ T+ S V + + + AAK Sbjct: 783 GLISAGTESATTVRSETVMRSKDQTESIAVSSSLPVASVPVSSAVPVKSSRIEADDAAKA 842 Query: 889 TSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKCSICGHRLKFQEQLDLH 710 + +S+ E +NL+G +FKP++IRE HP VI +L D+LPH C CG RLK QEQ + H Sbjct: 843 SLALSQSTSTEIRNLIGFDFKPDVIREMHPHVIEELLDELPHHCGDCGIRLKQQEQFNRH 902 Query: 709 LEWHASK--------TLSRRWYPSLGVWVAGN----EGSSSGPSVETAEKS-------EP 587 LEWHA+K SRRWY + W+A S SV+ + + + Sbjct: 903 LEWHATKEREQNGLTVASRRWYVTSDDWIASKAECLSESEFTDSVDEYDDNKTDGSQLDT 962 Query: 586 
VVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIGTTDGCASLGPIVH 407 +V ADE+QC+C+LCGE FED Y +RDEWM+KGA Y++ P D ++ + ++GPI+H Sbjct: 963 MVVADENQCLCVLCGELFEDVYCQERDEWMFKGAVYLNNPDSDSEMES----RNVGPIIH 1018 Query: 406 ANCASPTSVSDLGLSKNIK 350 A C S S+ LG++ ++ Sbjct: 1019 ARCLSDNSI--LGVTNTVR 1035