BLASTX nr result
ID: Rehmannia22_contig00002471
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00002471 (2086 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI35079.3| unnamed protein product [Vitis vinifera]             984   0.0
ref|XP_002264619.2| PREDICTED: THO complex subunit 1-like [Vitis...  983   0.0
emb|CBI35093.3| unnamed protein product [Vitis vinifera]             980   0.0
ref|XP_002263874.1| PREDICTED: THO complex subunit 1-like [Vitis...  979   0.0
gb|EMJ05791.1| hypothetical protein PRUPE_ppa003099mg [Prunus pe...  960   0.0
ref|XP_006432406.1| hypothetical protein CICLE_v10000631mg [Citr...  946   0.0
ref|XP_002299188.1| hypothetical protein POPTR_0001s06900g [Popu...  943   0.0
ref|XP_006465777.1| PREDICTED: THO complex subunit 1-like isofor...  941   0.0
ref|XP_002529986.1| nuclear matrix protein, putative [Ricinus co...  938   0.0
ref|XP_004230044.1| PREDICTED: THO complex subunit 1-like [Solan...  938   0.0
gb|EOY19638.1| Nuclear matrix protein-related isoform 1 [Theobro...  937   0.0
ref|XP_006465778.1| PREDICTED: THO complex subunit 1-like isofor...  934   0.0
gb|ESW20659.1| hypothetical protein PHAVU_005G004500g [Phaseolus...  927   0.0
ref|XP_006347676.1| PREDICTED: THO complex subunit 1-like [Solan...  926   0.0
ref|XP_004307195.1| PREDICTED: THO complex subunit 1-like [Fraga...  922   0.0
ref|XP_004140313.1| PREDICTED: THO complex subunit 1-like [Cucum...  920   0.0
ref|XP_003522894.1| PREDICTED: THO complex subunit 1 isoform X1 ...  914   0.0
ref|XP_002303943.2| hypothetical protein POPTR_0003s19340g [Popu...  892   0.0
gb|EPS68583.1| hypothetical protein M569_06184, partial [Genlise...
891 0.0 gb|EOY19639.1| Nuclear matrix protein-related isoform 2 [Theobro... 889 0.0 >emb|CBI35079.3| unnamed protein product [Vitis vinifera] Length = 613 Score = 984 bits (2544), Expect = 0.0 Identities = 489/614 (79%), Positives = 533/614 (86%) Frame = -1 Query: 2065 FLTMDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAA 1886 F+ +++FK+A+L PGPP+ FAL QDENQLLENILR LLQELVS A Sbjct: 10 FILVEIFKQALLKPGPPESFALQVVQEAIKPQKQTKLAQDENQLLENILRKLLQELVSCA 69 Query: 1885 VQSGEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRN 1706 VQSGE+IMQYGQSI D + Q+PRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRN Sbjct: 70 VQSGEKIMQYGQSIDDEEAIQSQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRN 129 Query: 1705 CKDIFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFP 1526 CKDIF YIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFP Sbjct: 130 CKDIFAYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFP 189 Query: 1525 LSERSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWH 1346 LSERSAVNIKGVFNTSNETKYEK+AP+ +IDFNFYKTFWSLQE F NPAS++ A TKW Sbjct: 190 LSERSAVNIKGVFNTSNETKYEKDAPEGISIDFNFYKTFWSLQEHFCNPASISLAPTKWQ 249 Query: 1345 KFTSSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVL 1166 KFTS+L VVLNTFEAQPLSDEEG+A NLE+E + FSIKYLTSS LMGLELKDPSFRRH+L Sbjct: 250 KFTSNLMVVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHIL 309 Query: 1165 VQCLILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILER 986 VQCLILFDYLKAPGKNDKDLPSD+MKEEIKSCEERVKKLLEMTPP+GKEFLH+IEHILER Sbjct: 310 VQCLILFDYLKAPGKNDKDLPSDSMKEEIKSCEERVKKLLEMTPPKGKEFLHNIEHILER 369 Query: 985 ERNWVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDP 806 E+NWVWWKRDGCP FE+QPIEKK Q+G +KRRPRWR+GNKELSQLWKWADQNPNALTDP Sbjct: 370 EKNWVWWKRDGCPPFERQPIEKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALTDP 429 Query: 805 QRVRTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHG 626 QR RTPA+ +YWKPLAEDMD SAGIE EYHHKN+RVYCWKGLRF+ARQDL+GFSRFTE+G Sbjct: 430 QRARTPAVSEYWKPLAEDMDLSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTEYG 489 Query: 625 
IEGVVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSR 446 IEGVVP+ELLP DVRSKYQAKP DRSKRAKKEETKG+ QQ EE+QIA TPASE D EG+R Sbjct: 490 IEGVVPMELLPSDVRSKYQAKPSDRSKRAKKEETKGAAQQAEENQIA-TPASEIDGEGTR 548 Query: 445 MDPEASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQIEADTEAEQGM 266 +D EAS AP D D T +PT+DE+QKQ+SD D G EAGQ EAD EAE GM Sbjct: 549 VDLEASAAPMDTDVTAT----------TPTADENQKQSSDTDAGQEAGQSEADAEAEAGM 598 Query: 265 IDGEMDAEVDLDVV 224 IDGE DAEVDLD V Sbjct: 599 IDGETDAEVDLDAV 612 >ref|XP_002264619.2| PREDICTED: THO complex subunit 1-like [Vitis vinifera] Length = 601 Score = 983 bits (2541), Expect = 0.0 Identities = 489/611 (80%), Positives = 531/611 (86%) Frame = -1 Query: 2056 MDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQS 1877 M++FK+A+L PGPP+ FAL QDENQLLENILR LLQELVS AVQS Sbjct: 1 MEIFKQALLKPGPPESFALQVVQEAIKPQKQTKLAQDENQLLENILRKLLQELVSCAVQS 60 Query: 1876 GEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 1697 GE+IMQYGQSI D + Q+PRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD Sbjct: 61 GEKIMQYGQSIDDEEAIQSQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 120 Query: 1696 IFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 1517 IF YIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSE Sbjct: 121 IFAYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1516 RSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFT 1337 RSAVNIKGVFNTSNETKYEK+AP+ +IDFNFYKTFWSLQE F NPAS++ A TKW KFT Sbjct: 181 RSAVNIKGVFNTSNETKYEKDAPEGISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKFT 240 Query: 1336 SSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQC 1157 S+L VVLNTFEAQPLSDEEG+A NLE+E + FSIKYLTSS LMGLELKDPSFRRH+LVQC Sbjct: 241 SNLMVVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQC 300 Query: 1156 LILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERERN 977 LILFDYLKAPGKNDKDLPSD+MKEEIKSCEERVKKLLEMTPP+GKEFLH+IEHILERE+N Sbjct: 301 LILFDYLKAPGKNDKDLPSDSMKEEIKSCEERVKKLLEMTPPKGKEFLHNIEHILEREKN 360 Query: 976 
WVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 797 WVWWKRDGCP FE+QPIEKK Q+G +KRRPRWR+GNKELSQLWKWADQNPNALTDPQR Sbjct: 361 WVWWKRDGCPPFERQPIEKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALTDPQRA 420 Query: 796 RTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGIEG 617 RTPA+ +YWKPLAEDMD SAGIE EYHHKN+RVYCWKGLRF+ARQDL+GFSRFTE+GIEG Sbjct: 421 RTPAVSEYWKPLAEDMDLSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTEYGIEG 480 Query: 616 VVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSRMDP 437 VVP+ELLP DVRSKYQAKP DRSKRAKKEETKG+ QQ EE+QIA TPASE D EG+R+D Sbjct: 481 VVPMELLPSDVRSKYQAKPSDRSKRAKKEETKGAAQQAEENQIA-TPASEIDGEGTRVDL 539 Query: 436 EASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQIEADTEAEQGMIDG 257 EAS AP D D T +PT+DE+QKQ+SD D G EAGQ EAD EAE GMIDG Sbjct: 540 EASAAPMDTDVTAT----------TPTADENQKQSSDTDAGQEAGQSEADAEAEAGMIDG 589 Query: 256 EMDAEVDLDVV 224 E DAEVDLD V Sbjct: 590 ETDAEVDLDAV 600 >emb|CBI35093.3| unnamed protein product [Vitis vinifera] Length = 613 Score = 980 bits (2533), Expect = 0.0 Identities = 487/611 (79%), Positives = 530/611 (86%) Frame = -1 Query: 2056 MDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQS 1877 +++FK+A+L PGPP+ FAL QDENQLLENILR LLQELVS AVQS Sbjct: 13 VEIFKQALLKPGPPESFALQVVQEAIKPQKQTKLAQDENQLLENILRKLLQELVSCAVQS 72 Query: 1876 GEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 1697 GE+IM YGQSI D + Q+PRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD Sbjct: 73 GEKIMHYGQSIDDEEAIQSQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 132 Query: 1696 IFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 1517 IF YIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSE Sbjct: 133 IFAYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 192 Query: 1516 RSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFT 1337 RSAVNIKGVFNTSNETKYEK+AP+ +IDFNFYKTFWSLQE F NPAS++ A TKW KFT Sbjct: 193 RSAVNIKGVFNTSNETKYEKDAPEGISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKFT 252 Query: 1336 SSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQC 1157 S+L 
VVLNTFEAQPLSDEEG+A NLE+E + FSIKYLTSS LMGLELKDPSFRRH+LVQC Sbjct: 253 SNLMVVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQC 312 Query: 1156 LILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERERN 977 LILFDYLKAPGKNDKDLPSD+MKEEIKSCEERVKKLLE TPP+GKEFLH+IEHILERE+N Sbjct: 313 LILFDYLKAPGKNDKDLPSDSMKEEIKSCEERVKKLLETTPPKGKEFLHNIEHILEREKN 372 Query: 976 WVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 797 WVWWKRDGCP FE+QPIEKK Q+G +KRRPRWR+GNKELSQLWKWADQNPNALTDPQRV Sbjct: 373 WVWWKRDGCPPFERQPIEKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALTDPQRV 432 Query: 796 RTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGIEG 617 RTPA+ +YWKPLAEDMD SAGIE EYHHKN+RVYCWKGLRF+ARQDL+GFSRFTE+GIEG Sbjct: 433 RTPAVSEYWKPLAEDMDSSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTEYGIEG 492 Query: 616 VVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSRMDP 437 VVP+ELLP DVRSKYQAKP DRSKRAKKEETKG+ QQ EE+QIA TPASE D EG+R+D Sbjct: 493 VVPMELLPSDVRSKYQAKPSDRSKRAKKEETKGAAQQAEENQIA-TPASEIDGEGTRVDL 551 Query: 436 EASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQIEADTEAEQGMIDG 257 EAS AP D D T +PT+DE+QKQ+SD D G EAGQ EAD EAE GMIDG Sbjct: 552 EASAAPMDTDVTAT----------TPTADENQKQSSDTDAGQEAGQSEADAEAEAGMIDG 601 Query: 256 EMDAEVDLDVV 224 E DAEVDLD V Sbjct: 602 ETDAEVDLDAV 612 >ref|XP_002263874.1| PREDICTED: THO complex subunit 1-like [Vitis vinifera] Length = 607 Score = 979 bits (2532), Expect = 0.0 Identities = 487/610 (79%), Positives = 529/610 (86%) Frame = -1 Query: 2053 DLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQSG 1874 ++FK+A+L PGPP+ FAL QDENQLLENILR LLQELVS AVQSG Sbjct: 8 EIFKQALLKPGPPESFALQVVQEAIKPQKQTKLAQDENQLLENILRKLLQELVSCAVQSG 67 Query: 1873 EEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKDI 1694 E+IM YGQSI D + Q+PRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKDI Sbjct: 68 EKIMHYGQSIDDEEAIQSQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKDI 127 Query: 1693 FGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSER 1514 F YIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSER Sbjct: 
128 FAYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSER 187 Query: 1513 SAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFTS 1334 SAVNIKGVFNTSNETKYEK+AP+ +IDFNFYKTFWSLQE F NPAS++ A TKW KFTS Sbjct: 188 SAVNIKGVFNTSNETKYEKDAPEGISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKFTS 247 Query: 1333 SLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQCL 1154 +L VVLNTFEAQPLSDEEG+A NLE+E + FSIKYLTSS LMGLELKDPSFRRH+LVQCL Sbjct: 248 NLMVVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL 307 Query: 1153 ILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERERNW 974 ILFDYLKAPGKNDKDLPSD+MKEEIKSCEERVKKLLE TPP+GKEFLH+IEHILERE+NW Sbjct: 308 ILFDYLKAPGKNDKDLPSDSMKEEIKSCEERVKKLLETTPPKGKEFLHNIEHILEREKNW 367 Query: 973 VWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR 794 VWWKRDGCP FE+QPIEKK Q+G +KRRPRWR+GNKELSQLWKWADQNPNALTDPQRVR Sbjct: 368 VWWKRDGCPPFERQPIEKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALTDPQRVR 427 Query: 793 TPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGIEGV 614 TPA+ +YWKPLAEDMD SAGIE EYHHKN+RVYCWKGLRF+ARQDL+GFSRFTE+GIEGV Sbjct: 428 TPAVSEYWKPLAEDMDSSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTEYGIEGV 487 Query: 613 VPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSRMDPE 434 VP+ELLP DVRSKYQAKP DRSKRAKKEETKG+ QQ EE+QIA TPASE D EG+R+D E Sbjct: 488 VPMELLPSDVRSKYQAKPSDRSKRAKKEETKGAAQQAEENQIA-TPASEIDGEGTRVDLE 546 Query: 433 ASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQIEADTEAEQGMIDGE 254 AS AP D D T +PT+DE+QKQ+SD D G EAGQ EAD EAE GMIDGE Sbjct: 547 ASAAPMDTDVTAT----------TPTADENQKQSSDTDAGQEAGQSEADAEAEAGMIDGE 596 Query: 253 MDAEVDLDVV 224 DAEVDLD V Sbjct: 597 TDAEVDLDAV 606 >gb|EMJ05791.1| hypothetical protein PRUPE_ppa003099mg [Prunus persica] Length = 604 Score = 960 bits (2481), Expect = 0.0 Identities = 482/611 (78%), Positives = 519/611 (84%) Frame = -1 Query: 2056 MDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQS 1877 M++F++AIL PGPP++FAL QDENQLLENILRTLLQELVS Sbjct: 1 MEVFRRAILQPGPPENFALQTVQQVIKPQKQTKLVQDENQLLENILRTLLQELVS----- 55 
Query: 1876 GEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 1697 GE+IMQYGQSI DG+ G +PRLLDIVLYLCE EH+EGGMIFQLLEDLTEMSTMRNCKD Sbjct: 56 GEQIMQYGQSIDDGETTQGHIPRLLDIVLYLCENEHIEGGMIFQLLEDLTEMSTMRNCKD 115 Query: 1696 IFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 1517 +FGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSE Sbjct: 116 VFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 175 Query: 1516 RSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFT 1337 RSAVNIKGVFNTSNETKYEK+ PD +IDFNFYKTFWSLQE F NP SLT A TKW KFT Sbjct: 176 RSAVNIKGVFNTSNETKYEKDPPDGISIDFNFYKTFWSLQEHFCNPPSLTLAPTKWKKFT 235 Query: 1336 SSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQC 1157 S L VVLNTFEAQPLSDEEG A +LE+E +NFSIKYLTSS LMGLELKDPSFRRH+LVQC Sbjct: 236 SGLMVVLNTFEAQPLSDEEGDANSLEEEAANFSIKYLTSSKLMGLELKDPSFRRHILVQC 295 Query: 1156 LILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERERN 977 LILFDYLKAPGK++KDLPSD+MKEEIKSCEERVKKLLEMTPP+G+ FLH IEHILERE+N Sbjct: 296 LILFDYLKAPGKSEKDLPSDSMKEEIKSCEERVKKLLEMTPPKGENFLHKIEHILEREKN 355 Query: 976 WVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 797 WVWWKRDGCP FEKQP EKK+ QEG +KRRPRWR+GNKELS LWKWADQNPNALTDPQRV Sbjct: 356 WVWWKRDGCPPFEKQPAEKKVVQEGAKKRRPRWRMGNKELSLLWKWADQNPNALTDPQRV 415 Query: 796 RTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGIEG 617 RTPAI DYWKPLA+DMD +AGIE EYHHKN+RVYCWKGLRFSARQDLEGFSRFTE GIEG Sbjct: 416 RTPAITDYWKPLADDMDPAAGIEAEYHHKNNRVYCWKGLRFSARQDLEGFSRFTEFGIEG 475 Query: 616 VVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSRMDP 437 VVPLELL P+ RSKYQAKP D+SKRAKKEETKG+ QVEE+QIAT A+E D EG R Sbjct: 476 VVPLELLTPEERSKYQAKPNDKSKRAKKEETKGAAHQVEENQIATA-ANEIDGEGIRAVL 534 Query: 436 EASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQIEADTEAEQGMIDG 257 EASV P D DA T SP DEHQKQ+SD DVG EAGQ+EAD E E GMIDG Sbjct: 535 EASVTPTDTDA--TVATGDMSQGGSPIPDEHQKQSSDTDVGQEAGQMEADAEVEAGMIDG 592 Query: 256 EMDAEVDLDVV 224 MD EVDLD V Sbjct: 593 GMDTEVDLDPV 603 >ref|XP_006432406.1| 
hypothetical protein CICLE_v10000631mg [Citrus clementina] gi|557534528|gb|ESR45646.1| hypothetical protein CICLE_v10000631mg [Citrus clementina] Length = 608 Score = 946 bits (2446), Expect = 0.0 Identities = 473/611 (77%), Positives = 521/611 (85%) Frame = -1 Query: 2056 MDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQS 1877 M++F++AILH GPP++FAL QDENQLLEN+LRTLLQELVS+AVQS Sbjct: 1 MEVFRRAILHAGPPENFALQTVQEVIKPQKQTKLAQDENQLLENMLRTLLQELVSSAVQS 60 Query: 1876 GEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 1697 GE IM YGQSI DG+ Q+PRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTM+NCKD Sbjct: 61 GEPIMHYGQSIDDGETSQAQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKD 120 Query: 1696 IFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 1517 IFGYIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKLELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1516 RSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFT 1337 RSAVNIKGVFNTSNETKYEK+ PD +DFNFYKTFWSLQE F NPA LT A TKW KFT Sbjct: 181 RSAVNIKGVFNTSNETKYEKDPPDGIPVDFNFYKTFWSLQEYFCNPA-LTLAPTKWQKFT 239 Query: 1336 SSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQC 1157 SSL VVLNTF+AQPLSDE G A LE+E + F+IKYLTSS LMGLELKDPSFRRHVLVQC Sbjct: 240 SSLMVVLNTFDAQPLSDEVGDANVLEEEAATFNIKYLTSSKLMGLELKDPSFRRHVLVQC 299 Query: 1156 LILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERERN 977 LILFDYLKAPGKNDKDLPS++MKEE+KSCEERVKKLLEMTPP+GK+FLHSIEHILERE+N Sbjct: 300 LILFDYLKAPGKNDKDLPSESMKEEMKSCEERVKKLLEMTPPKGKDFLHSIEHILEREKN 359 Query: 976 WVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 797 WVWWKRDGCP FEKQ +EKK Q+G +KRRPRWRLGNKELSQLWKWADQNPNALTDPQRV Sbjct: 360 WVWWKRDGCPPFEKQSMEKKAVQDGPKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 419 Query: 796 RTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGIEG 617 RTPAI +YWKPLAEDMD SAGIE EYHHKNSRVYCWKGLRFSARQDL+GFSRFT+HGIEG Sbjct: 420 RTPAITEYWKPLAEDMDPSAGIEAEYHHKNSRVYCWKGLRFSARQDLDGFSRFTDHGIEG 479 Query: 616 
VVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSRMDP 437 VVPLELLPP VRS+Y+ K DRSKRAKKE++K + Q EE+QIA + ASE D +G R D Sbjct: 480 VVPLELLPPHVRSRYEGKANDRSKRAKKEDSKVAPSQAEENQIAAS-ASENDGDGIRADL 538 Query: 436 EASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQIEADTEAEQGMIDG 257 EAS P + D +T + T DEHQKQ+SD D+G EAGQ++AD EA+ GM+DG Sbjct: 539 EASATPVETD--VTAGTGNISQSGTATPDEHQKQSSDTDMGQEAGQLDADAEADAGMMDG 596 Query: 256 EMDAEVDLDVV 224 E DAEVDL+ V Sbjct: 597 ETDAEVDLEAV 607 >ref|XP_002299188.1| hypothetical protein POPTR_0001s06900g [Populus trichocarpa] gi|222846446|gb|EEE83993.1| hypothetical protein POPTR_0001s06900g [Populus trichocarpa] Length = 608 Score = 943 bits (2438), Expect = 0.0 Identities = 473/611 (77%), Positives = 519/611 (84%) Frame = -1 Query: 2056 MDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQS 1877 M+ F++AIL PGP + FAL QDENQLLEN+LRTLLQELVS+A QS Sbjct: 1 MEEFRRAILQPGPVETFALKTVQEFIKPQKQTKLVQDENQLLENMLRTLLQELVSSAAQS 60 Query: 1876 GEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 1697 GEEIM G+SI D + GQ+PRLLD VLYLCE+EH+EGGMIFQLLEDLTEMSTMRNCKD Sbjct: 61 GEEIMLSGKSIDDEENSQGQIPRLLDAVLYLCEREHIEGGMIFQLLEDLTEMSTMRNCKD 120 Query: 1696 IFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 1517 IFGYIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1516 RSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFT 1337 RSAVNIKGVFNTSNETKYEKE P ++DFNFYKT WSLQE F +P SLT + KW KF+ Sbjct: 181 RSAVNIKGVFNTSNETKYEKEPPAAISLDFNFYKTLWSLQEYFCDP-SLTLSPIKWQKFS 239 Query: 1336 SSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQC 1157 SSL VVLNTFEAQPLS+EEG A NLE+E + F+IKYLTSS LMGLELKDPSFRRHVLVQC Sbjct: 240 SSLMVVLNTFEAQPLSEEEGDANNLEEEAAAFNIKYLTSSKLMGLELKDPSFRRHVLVQC 299 Query: 1156 LILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERERN 977 LILFDYLKAPGKNDKDL S++MKEEI+S EE VKKLLEMTPP+GK+FLH +EHILERE+N Sbjct: 300 
LILFDYLKAPGKNDKDLTSESMKEEIRSREEHVKKLLEMTPPKGKDFLHMVEHILEREKN 359 Query: 976 WVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 797 W+WWKRDGCP FEKQPIE K Q+GG+KRRPRWRLGNKELSQLWKWADQNPNALTDPQRV Sbjct: 360 WLWWKRDGCPPFEKQPIENKTVQDGGKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 419 Query: 796 RTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGIEG 617 RTP I DYWKPLAEDMD SAGI+ EYHHKN+RVYCWKGLRFSARQDL+GFSRFT+HGIEG Sbjct: 420 RTPIITDYWKPLAEDMDPSAGIDAEYHHKNNRVYCWKGLRFSARQDLDGFSRFTDHGIEG 479 Query: 616 VVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSRMDP 437 VVPLELLPPDVRSKYQAKP DRSKRAKK+E KG++ QVE++QI +TPASE D EG R+D Sbjct: 480 VVPLELLPPDVRSKYQAKPNDRSKRAKKDEPKGALHQVEDNQI-STPASEIDGEGIRIDL 538 Query: 436 EASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQIEADTEAEQGMIDG 257 EAS AP D D +T +PT DEHQKQ SD D G EAGQ+EAD EAE GMIDG Sbjct: 539 EASAAPMDTD--VTATTGSISQSGTPTPDEHQKQGSDTDGGQEAGQLEADAEAEAGMIDG 596 Query: 256 EMDAEVDLDVV 224 E DAEVDL+ V Sbjct: 597 ETDAEVDLEAV 607 >ref|XP_006465777.1| PREDICTED: THO complex subunit 1-like isoform X1 [Citrus sinensis] Length = 608 Score = 941 bits (2432), Expect = 0.0 Identities = 471/611 (77%), Positives = 519/611 (84%) Frame = -1 Query: 2056 MDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQS 1877 M++F++AIL GPP++FAL QDENQLLEN+LRTLLQELVS+AVQS Sbjct: 1 MEVFRRAILQAGPPENFALQTVQEVIKPQKQTKLAQDENQLLENMLRTLLQELVSSAVQS 60 Query: 1876 GEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 1697 GE IM YGQSI DG+ Q+PRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTM+NCKD Sbjct: 61 GEPIMHYGQSIDDGETSQAQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKD 120 Query: 1696 IFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 1517 IFGYIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKLELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1516 RSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFT 1337 RSAVNIKGVFNTSNETKYEK+ PD +DFNFYKTFWSLQE F NPA LT A TKW KFT Sbjct: 181 
RSAVNIKGVFNTSNETKYEKDPPDGIPVDFNFYKTFWSLQEYFCNPA-LTLAPTKWQKFT 239 Query: 1336 SSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQC 1157 SSL VVLNTF+AQPLSDE G A LE+E + F+IKYLTSS LMGLELKDPSFRRHVLVQC Sbjct: 240 SSLMVVLNTFDAQPLSDEVGDANVLEEEAATFNIKYLTSSKLMGLELKDPSFRRHVLVQC 299 Query: 1156 LILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERERN 977 LILFDYLKAPGKNDKDLPS++MKEE+KSCEERVKKLLE TPP+GK+FLHSIEHILERE+N Sbjct: 300 LILFDYLKAPGKNDKDLPSESMKEEMKSCEERVKKLLETTPPKGKDFLHSIEHILEREKN 359 Query: 976 WVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 797 WVWWKRDGCP FEKQ +EKK Q+G +KRRPRWRLGNKELSQLWKWADQNPNALTDPQRV Sbjct: 360 WVWWKRDGCPPFEKQSMEKKAVQDGPKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 419 Query: 796 RTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGIEG 617 RTPAI +YWKPLA+DMD SAGIE EYHHKNSRVYCWKGLRFSARQDL+GFSRFT+HGIEG Sbjct: 420 RTPAITEYWKPLADDMDPSAGIEAEYHHKNSRVYCWKGLRFSARQDLDGFSRFTDHGIEG 479 Query: 616 VVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSRMDP 437 VVPLELLPP VRS+Y+ K DRSKRAKKE++K + Q EE+QIA + ASE D EG R D Sbjct: 480 VVPLELLPPHVRSRYEGKANDRSKRAKKEDSKVAPSQAEENQIAAS-ASENDGEGIRADL 538 Query: 436 EASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQIEADTEAEQGMIDG 257 EAS P + D +T + T DEHQKQ+SD D+G EAGQ++AD EA+ GM+DG Sbjct: 539 EASATPVETD--VTAGTGNISQSGTATPDEHQKQSSDTDMGQEAGQLDADAEADAGMMDG 596 Query: 256 EMDAEVDLDVV 224 E DAEVDL+ V Sbjct: 597 ETDAEVDLEAV 607 >ref|XP_002529986.1| nuclear matrix protein, putative [Ricinus communis] gi|223530509|gb|EEF32391.1| nuclear matrix protein, putative [Ricinus communis] Length = 608 Score = 938 bits (2425), Expect = 0.0 Identities = 466/612 (76%), Positives = 519/612 (84%) Frame = -1 Query: 2056 MDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQS 1877 M+ FK AIL PGPP++FAL QDENQLLEN+LRTLLQELV++AV S Sbjct: 1 MEEFKNAILQPGPPENFALQTVQEFIKPQRQTKLAQDENQLLENMLRTLLQELVASAVHS 60 Query: 1876 GEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 1697 GE+IM YGQS+ +G+ GQ+PRLLD+VL+LCE+EHVEGGMIFQLLEDLTEMSTM+NC+D 
Sbjct: 61 GEQIMLYGQSVDEGEKSQGQIPRLLDVVLHLCEREHVEGGMIFQLLEDLTEMSTMKNCQD 120 Query: 1696 IFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 1517 IFGYIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1516 RSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFT 1337 RSAVNIKGVFNTSNETKYEK+ P ++DFNFYKT WSLQE+F NPA LT A TKWHKFT Sbjct: 181 RSAVNIKGVFNTSNETKYEKDPPAGISVDFNFYKTLWSLQENFCNPAPLTLAPTKWHKFT 240 Query: 1336 SSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQC 1157 SSL VVLNTFEAQPLS+EEG A NLE+E + F+IKYLTSS LMGLELKDPSFRRH+LVQC Sbjct: 241 SSLMVVLNTFEAQPLSEEEGDANNLEEEAATFNIKYLTSSKLMGLELKDPSFRRHILVQC 300 Query: 1156 LILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERERN 977 LILFDYLKAPGKNDKD S++MKE+I++CEERVKKLLEMTPP+GK+FL IEH+LERE+N Sbjct: 301 LILFDYLKAPGKNDKDSTSESMKEDIRTCEERVKKLLEMTPPKGKDFLQKIEHVLEREKN 360 Query: 976 WVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 797 WV WKRDGC FEKQPIE K QEG +KR+PRWRLGNKELSQLWKWADQNPNALTDPQRV Sbjct: 361 WVCWKRDGCQPFEKQPIENKTIQEGSKKRKPRWRLGNKELSQLWKWADQNPNALTDPQRV 420 Query: 796 RTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGIEG 617 RTPAI +YWKPLAEDMD SAGIE EYHHKN+RVYCWKGLRFSARQDL+GFSRFT+HGIEG Sbjct: 421 RTPAITEYWKPLAEDMDPSAGIEAEYHHKNNRVYCWKGLRFSARQDLDGFSRFTDHGIEG 480 Query: 616 VVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSRMDP 437 VVPLELLPPDVRSKYQAKP DRSKRAKK++ KG Q EE+QIA TPASE D EG R D Sbjct: 481 VVPLELLPPDVRSKYQAKPNDRSKRAKKDDIKGGSNQTEENQIA-TPASEIDGEGIRAD- 538 Query: 436 EASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQIEADTEAEQGMIDG 257 EA+ AP D DA+ T +PT DE Q+Q+ D D G EAG +EAD E E GMIDG Sbjct: 539 EAAAAPMDTDAMAT--AGSTSQGGTPTPDERQRQSPDADDGQEAGHLEADGEVEAGMIDG 596 Query: 256 EMDAEVDLDVVA 221 E DAEVDL+ ++ Sbjct: 597 ETDAEVDLEAIS 608 >ref|XP_004230044.1| PREDICTED: THO complex subunit 1-like [Solanum lycopersicum] Length = 608 Score = 938 bits (2424), Expect = 0.0 Identities = 473/609 (77%), 
Positives = 517/609 (84%) Frame = -1 Query: 2056 MDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQS 1877 MDLF++AIL GPP++FAL QDENQLLENILR+LLQELV+AAVQS Sbjct: 1 MDLFRQAILRQGPPEEFALLTVQEAIKPQKQTKLVQDENQLLENILRSLLQELVAAAVQS 60 Query: 1876 GEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 1697 G+++M+YG SI DG+ GQ+PRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNC+D Sbjct: 61 GQKLMKYGVSIVDGESSQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCED 120 Query: 1696 IFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 1517 +FGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE Sbjct: 121 VFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 180 Query: 1516 RSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFT 1337 RSAVNIKGVFNTSNETKYE E PD +IDFNFY+T WSLQE F NP SL A KWHKFT Sbjct: 181 RSAVNIKGVFNTSNETKYETEVPDGISIDFNFYRTLWSLQEYFCNPPSLINAPGKWHKFT 240 Query: 1336 SSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQC 1157 SSLT+VLNTFEAQPLSDEEG+A NLED+ + F+IKYLTSS LMGLELKDPSFRRHVLVQC Sbjct: 241 SSLTLVLNTFEAQPLSDEEGNAHNLEDDAATFNIKYLTSSKLMGLELKDPSFRRHVLVQC 300 Query: 1156 LILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERERN 977 LILFDYLKAPGK++K+LPS+ MKEEIK+ EER KKLLEMTPP+G +FL SIEHILERERN Sbjct: 301 LILFDYLKAPGKSEKELPSEAMKEEIKTSEERAKKLLEMTPPKGIDFLRSIEHILERERN 360 Query: 976 WVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 797 WVWWKRDGCP FEKQP+EKKL Q+G +KRR RW LGNKELSQLWKWADQ ALTD +RV Sbjct: 361 WVWWKRDGCPPFEKQPVEKKLVQDGTKKRRTRWSLGNKELSQLWKWADQYSGALTDAERV 420 Query: 796 RTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGIEG 617 TPAI YWKPLAEDMDESAGIE EYHHKN+RVYCWKGLRFSARQDLEGFSRFTEHGIEG Sbjct: 421 ATPAITKYWKPLAEDMDESAGIEAEYHHKNNRVYCWKGLRFSARQDLEGFSRFTEHGIEG 480 Query: 616 VVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSRMDP 437 VVPLELLP +VR+KYQAKP +R+KR KKE+TK S QQ EE+QIA TP SE D E R DP Sbjct: 481 VVPLELLPNEVRAKYQAKPSERTKRTKKEDTKNSAQQAEENQIA-TPPSEMDNEVGRADP 539 Query: 436 
EASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQIEADTEAEQGMIDG 257 EAS AP D DA I +PT +++QKQ+SD DV EAGQIEADTEAE GMIDG Sbjct: 540 EASAAPMDTDAGIA--TVNICQEETPTPEDNQKQSSDTDVAQEAGQIEADTEAETGMIDG 597 Query: 256 EMDAEVDLD 230 E DAE DLD Sbjct: 598 ETDAE-DLD 605 >gb|EOY19638.1| Nuclear matrix protein-related isoform 1 [Theobroma cacao] Length = 602 Score = 937 bits (2421), Expect = 0.0 Identities = 473/605 (78%), Positives = 512/605 (84%) Frame = -1 Query: 2056 MDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQS 1877 M+ F++AIL PGPP+ FAL QDENQLLEN+LRTLLQELVS++V S Sbjct: 2 MEAFRRAILQPGPPETFALKIVQEVIKPQKQTKLAQDENQLLENMLRTLLQELVSSSVPS 61 Query: 1876 GEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 1697 GEEIMQYG+SI D G +PRLLD VLYLCEKEHVEGGMIFQLLEDL EMSTMRNCKD Sbjct: 62 GEEIMQYGKSIDDESDTQGVIPRLLDFVLYLCEKEHVEGGMIFQLLEDLNEMSTMRNCKD 121 Query: 1696 IFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 1517 IF YIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSE Sbjct: 122 IFRYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 181 Query: 1516 RSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFT 1337 RSAVNIKGVFNTSNETKYEK+ P+ ++DFNFYKTFWSLQ+ F NPASL+ A KW KFT Sbjct: 182 RSAVNIKGVFNTSNETKYEKDPPEGISVDFNFYKTFWSLQDYFCNPASLSTAPVKWQKFT 241 Query: 1336 SSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQC 1157 SSL VVLNTFEAQPLS+EEG+ NLE+E + F+IKYLTSS LMGLELKDPSFRRH+L+QC Sbjct: 242 SSLMVVLNTFEAQPLSEEEGADNNLEEEATTFNIKYLTSSKLMGLELKDPSFRRHILLQC 301 Query: 1156 LILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERERN 977 LILFDYLKAPGKNDKD S++MKEEIKSCE+RVKKLLE+TPP+GK+FL SIEHILERE+N Sbjct: 302 LILFDYLKAPGKNDKD-SSESMKEEIKSCEDRVKKLLEVTPPKGKDFLCSIEHILEREKN 360 Query: 976 WVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 797 WVWWKRDGCP FEKQPIEKK Q G +KRRPRWRLGNKELSQLWKWADQNPNALTDPQRV Sbjct: 361 WVWWKRDGCPPFEKQPIEKKPVQNGAKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 420 Query: 796 RTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGIEG 617 RTPAI 
DYWKPLAEDMDESAGIE EYHHKN+RVYCWKGLRF+ARQDLEGFS+FTEHGIEG Sbjct: 421 RTPAITDYWKPLAEDMDESAGIEAEYHHKNNRVYCWKGLRFAARQDLEGFSKFTEHGIEG 480 Query: 616 VVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSRMDP 437 VVPLELLPPDVRSK+Q KP DRSKRAKKEETK S QVEESQIA TPASE D EG R D Sbjct: 481 VVPLELLPPDVRSKFQGKPSDRSKRAKKEETKTSSHQVEESQIA-TPASEVDGEGMRADM 539 Query: 436 EASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQIEADTEAEQGMIDG 257 EAS A D D +T +PT DEHQKQ+ D DVG EAGQ+EAD E E G IDG Sbjct: 540 EASAALMDAD--VTAGTGNNSQGGTPTPDEHQKQSPDTDVGQEAGQLEADAEVEAG-IDG 596 Query: 256 EMDAE 242 E D E Sbjct: 597 ETDPE 601 >ref|XP_006465778.1| PREDICTED: THO complex subunit 1-like isoform X2 [Citrus sinensis] Length = 607 Score = 934 bits (2415), Expect = 0.0 Identities = 470/611 (76%), Positives = 518/611 (84%) Frame = -1 Query: 2056 MDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQS 1877 M++F++AIL GPP++FAL QDENQLLEN+LRTLLQELVS+AVQS Sbjct: 1 MEVFRRAILQAGPPENFALQTVQEVIKPQKQTKLAQDENQLLENMLRTLLQELVSSAVQS 60 Query: 1876 GEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 1697 GE IM YGQSI DG+ Q+PRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTM+NCKD Sbjct: 61 GEPIMHYGQSIDDGETSQAQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKD 120 Query: 1696 IFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 1517 IFGYIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKLELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1516 RSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFT 1337 RSAVNIKGVFNTSNETKYEK+ PD +DFNFYKTFWSLQE F NPA LT A TKW KFT Sbjct: 181 RSAVNIKGVFNTSNETKYEKDPPDGIPVDFNFYKTFWSLQEYFCNPA-LTLAPTKWQKFT 239 Query: 1336 SSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQC 1157 SSL VVLNTF+AQPLSDE G A LE+E + F+IKYLTSS LMGLELKDPSFRRHVLVQC Sbjct: 240 SSLMVVLNTFDAQPLSDEVGDANVLEEEAATFNIKYLTSSKLMGLELKDPSFRRHVLVQC 299 Query: 1156 LILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERERN 977 LILFDYLKAPGKNDKDLPS++MKEE+KSCEERVKKLLE TPP+GK+FLHSIEHILERE+N Sbjct: 300 
LILFDYLKAPGKNDKDLPSESMKEEMKSCEERVKKLLETTPPKGKDFLHSIEHILEREKN 359 Query: 976 WVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 797 WVWWKRDGCP FEKQ +EKK Q+ G K+RPRWRLGNKELSQLWKWADQNPNALTDPQRV Sbjct: 360 WVWWKRDGCPPFEKQSMEKKAVQD-GPKKRPRWRLGNKELSQLWKWADQNPNALTDPQRV 418 Query: 796 RTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGIEG 617 RTPAI +YWKPLA+DMD SAGIE EYHHKNSRVYCWKGLRFSARQDL+GFSRFT+HGIEG Sbjct: 419 RTPAITEYWKPLADDMDPSAGIEAEYHHKNSRVYCWKGLRFSARQDLDGFSRFTDHGIEG 478 Query: 616 VVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSRMDP 437 VVPLELLPP VRS+Y+ K DRSKRAKKE++K + Q EE+QIA + ASE D EG R D Sbjct: 479 VVPLELLPPHVRSRYEGKANDRSKRAKKEDSKVAPSQAEENQIAAS-ASENDGEGIRADL 537 Query: 436 EASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQIEADTEAEQGMIDG 257 EAS P + D +T + T DEHQKQ+SD D+G EAGQ++AD EA+ GM+DG Sbjct: 538 EASATPVETD--VTAGTGNISQSGTATPDEHQKQSSDTDMGQEAGQLDADAEADAGMMDG 595 Query: 256 EMDAEVDLDVV 224 E DAEVDL+ V Sbjct: 596 ETDAEVDLEAV 606 >gb|ESW20659.1| hypothetical protein PHAVU_005G004500g [Phaseolus vulgaris] Length = 604 Score = 927 bits (2397), Expect = 0.0 Identities = 463/611 (75%), Positives = 512/611 (83%) Frame = -1 Query: 2056 MDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQS 1877 M++FK+AIL PGPP++FAL QDENQ LENILR LLQE VSAAV S Sbjct: 1 MEVFKRAILQPGPPENFALKTVQEVIKPQKQTKLAQDENQFLENILRMLLQEFVSAAV-S 59 Query: 1876 GEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 1697 E+IMQ+GQSI + G +PRLLDIVLYLCEKEH+EGGMIFQLLEDLTEMSTM+NCKD Sbjct: 60 AEKIMQFGQSIDSNETTQGHIPRLLDIVLYLCEKEHIEGGMIFQLLEDLTEMSTMKNCKD 119 Query: 1696 IFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 1517 +FGYIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSE Sbjct: 120 VFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 179 Query: 1516 RSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFT 1337 RSA+NIKGVFNTSNETK+EKE + IDFNFY+TFW LQE FSNP S++ A KW KFT Sbjct: 180 RSALNIKGVFNTSNETKFEKEPLEGICIDFNFYQTFWGLQEFFSNPTSISHAPVKWQKFT 239 Query: 
1336 SSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQC 1157 SSL+VVLNTFEAQPLSDEEG A NLE+E NFSIKYLTSS LMGLELKDPSFRRHVLVQC Sbjct: 240 SSLSVVLNTFEAQPLSDEEGDANNLEEEAVNFSIKYLTSSKLMGLELKDPSFRRHVLVQC 299 Query: 1156 LILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERERN 977 LILFDYLKAPGK DKDLPS+ MKEEI SCEERVKKLLE+TPP+G EFLH IEHILERE+N Sbjct: 300 LILFDYLKAPGKGDKDLPSENMKEEITSCEERVKKLLELTPPKGSEFLHKIEHILEREKN 359 Query: 976 WVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 797 WVWWKRDGC +EKQPIEKK EG +KRRPRWRLGNKELSQLWKWADQNPNALTDPQRV Sbjct: 360 WVWWKRDGCLPYEKQPIEKKAVPEGSKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 419 Query: 796 RTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGIEG 617 +TP+IM+YWKPLA+DMD SAGIE EYHHKN+RVYCWKGLR +ARQDLEGFS+FT+HGIEG Sbjct: 420 QTPSIMEYWKPLADDMDPSAGIEAEYHHKNNRVYCWKGLRLAARQDLEGFSKFTDHGIEG 479 Query: 616 VVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSRMDP 437 VVPLELLPPDVRSKYQAKP DRSKR+KKEETKGS QVEE+QIATT A+E D +G R D Sbjct: 480 VVPLELLPPDVRSKYQAKPNDRSKRSKKEETKGSAHQVEENQIATT-ATELDGDGIRTDT 538 Query: 436 EASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQIEADTEAEQGMIDG 257 A+ DG ++ +PT +E K +SD DVG EAGQ+EA+ E E G+IDG Sbjct: 539 TATPMEFDGASV------PGTQGGTPTPEELHKHSSDTDVGQEAGQLEAEAEVEAGIIDG 592 Query: 256 EMDAEVDLDVV 224 E DA+VDLD V Sbjct: 593 ETDADVDLDTV 603 >ref|XP_006347676.1| PREDICTED: THO complex subunit 1-like [Solanum tuberosum] Length = 609 Score = 926 bits (2392), Expect = 0.0 Identities = 462/605 (76%), Positives = 512/605 (84%) Frame = -1 Query: 2056 MDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQS 1877 MDLF++AIL GPP++FAL QDENQLLENILR+LLQELV+AAVQS Sbjct: 1 MDLFRQAILRQGPPEEFALLTVQEAIKPQKQTKLVQDENQLLENILRSLLQELVAAAVQS 60 Query: 1876 GEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 1697 G+++M+YG SI DG+ GQ+PRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNC+D Sbjct: 61 GQKVMKYGVSIVDGESSQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCED 120 Query: 1696 IFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 1517 
+FGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE Sbjct: 121 VFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 180 Query: 1516 RSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFT 1337 RSAVNIKGVFNTSNETKYE E P+ +IDFNFY+T WSLQE F NP SL A KWHKFT Sbjct: 181 RSAVNIKGVFNTSNETKYETEVPEGISIDFNFYRTLWSLQEYFCNPPSLINAPGKWHKFT 240 Query: 1336 SSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQC 1157 SSLT+VLNTFEAQPLSDEEG+ NLED+ + F+IKYLTSS LMGLELKDPSFRRHVLVQC Sbjct: 241 SSLTLVLNTFEAQPLSDEEGNVHNLEDDAATFNIKYLTSSKLMGLELKDPSFRRHVLVQC 300 Query: 1156 LILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERERN 977 LILFDYLK PGK++K+LPS+ MKEEIK+ EE+ KKLLEMTPP+G +FLHSIEHILERERN Sbjct: 301 LILFDYLKEPGKSEKELPSEAMKEEIKTSEEQAKKLLEMTPPKGIDFLHSIEHILERERN 360 Query: 976 WVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 797 WVWWKRDGCP FEKQP+EKKL Q+G +KRRPRW LGN+ELSQLWKWADQ +ALTD QRV Sbjct: 361 WVWWKRDGCPPFEKQPVEKKLVQDGTKKRRPRWSLGNRELSQLWKWADQYSSALTDAQRV 420 Query: 796 RTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGIEG 617 TPAI YWKPLAEDMDESAGIE EYHHKN+RVYCWKGLRFSARQDLEGFSRFTEHGIEG Sbjct: 421 STPAITKYWKPLAEDMDESAGIEAEYHHKNNRVYCWKGLRFSARQDLEGFSRFTEHGIEG 480 Query: 616 VVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSRMDP 437 VVPLELL +VR++YQAKP +R+KR KKE+TK S QQ +E+QIA TP SE D E + DP Sbjct: 481 VVPLELLSNEVRARYQAKPSERTKRTKKEDTKNSAQQADENQIA-TPPSEMDNEVGQADP 539 Query: 436 EASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQIEADTEAEQGMIDG 257 EAS AP D DA I +PT +++QKQ+SD DV EAGQ EADTEAE MIDG Sbjct: 540 EASAAPMDTDAGIA--TVNISQEETPTPEDNQKQSSDTDVAQEAGQTEADTEAETAMIDG 597 Query: 256 EMDAE 242 E DAE Sbjct: 598 ETDAE 602 >ref|XP_004307195.1| PREDICTED: THO complex subunit 1-like [Fragaria vesca subsp. 
vesca] Length = 611 Score = 922 bits (2382), Expect = 0.0 Identities = 464/613 (75%), Positives = 512/613 (83%), Gaps = 4/613 (0%) Frame = -1 Query: 2056 MDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQS 1877 M++F+ AIL PGPP+ FAL QDENQLLENILRTLLQELVS+AVQS Sbjct: 1 MEVFRSAILQPGPPETFALQTVQQVIKPQKGTKLVQDENQLLENILRTLLQELVSSAVQS 60 Query: 1876 GEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 1697 GE+IMQYGQSI DG+ G +PRLLD+VLYLCE EHVEGGMIFQLLEDLTEMSTMRNCKD Sbjct: 61 GEQIMQYGQSIDDGEATRGHIPRLLDVVLYLCENEHVEGGMIFQLLEDLTEMSTMRNCKD 120 Query: 1696 IFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 1517 +FGYIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSE Sbjct: 121 VFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1516 RSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFT 1337 RSAVNIKGVFNTSNETKYEK+APD +IDFNFYKTFWSLQE F NPA LT A TKW KFT Sbjct: 181 RSAVNIKGVFNTSNETKYEKDAPDGISIDFNFYKTFWSLQEYFCNPAPLTVAPTKWQKFT 240 Query: 1336 SSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQC 1157 SSL VVLNTFEAQPLSDEEG A NLE E +NFSIKYLTSS LMGLELKDPSFRRH+LVQC Sbjct: 241 SSLKVVLNTFEAQPLSDEEGEANNLE-ESANFSIKYLTSSKLMGLELKDPSFRRHILVQC 299 Query: 1156 LILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERERN 977 LILFDYLKAPGK++KDLPS++MKEEI S EE VKKLLEMTPP+G+ FLH IEHILERE+N Sbjct: 300 LILFDYLKAPGKSEKDLPSESMKEEINSYEEHVKKLLEMTPPKGESFLHKIEHILEREKN 359 Query: 976 WVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 797 WVWWKRDGCP FEKQPIEKK Q+G +KR+PRWRLGNKELSQLWKWADQNPNALTD QR+ Sbjct: 360 WVWWKRDGCPPFEKQPIEKKTVQDGAKKRKPRWRLGNKELSQLWKWADQNPNALTDTQRL 419 Query: 796 RTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGIEG 617 RTP+I +YWKPLAEDMD +AGIE EYHHKN+RVYCWKGLRFSARQDLEGFS+FTE GIEG Sbjct: 420 RTPSITEYWKPLAEDMDPAAGIEAEYHHKNNRVYCWKGLRFSARQDLEGFSKFTEFGIEG 479 Query: 616 VVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSRMDP 437 VVPLELLPP+ R+KY K ++SKRAKKE+ K +V VEE+Q+AT A++ D E R D Sbjct: 480 
VVPLELLPPEERAKYAPKTNEKSKRAKKEDAKAAVHHVEENQVATA-ATDVDGEVLRTDV 538 Query: 436 EASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQI----EADTEAEQG 269 A VAP D D + SP +DEHQKQ+SD D G EAGQ+ E D E + G Sbjct: 539 GALVAPLDTDNTMV---CNTSQGNSPMADEHQKQSSDTDGGQEAGQLEDDAEVDAEGDAG 595 Query: 268 MIDGEMDAEVDLD 230 MIDGE++ EVDLD Sbjct: 596 MIDGEIEPEVDLD 608 >ref|XP_004140313.1| PREDICTED: THO complex subunit 1-like [Cucumis sativus] Length = 607 Score = 920 bits (2379), Expect = 0.0 Identities = 465/611 (76%), Positives = 509/611 (83%) Frame = -1 Query: 2062 LTMDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAV 1883 L ++ F+KAIL GPP++FAL QDENQLLENILR LLQELVS+AV Sbjct: 5 LYLEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAV 64 Query: 1882 QSGEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNC 1703 QS E +MQYG SI + + DIVLYLCEKEHVEGGMIFQLLEDLTEMST+RNC Sbjct: 65 QSTEPVMQYGMSIDEKETSQ-------DIVLYLCEKEHVEGGMIFQLLEDLTEMSTLRNC 117 Query: 1702 KDIFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPL 1523 KDIFGYIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRI+MFLAHFFPL Sbjct: 118 KDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPL 177 Query: 1522 SERSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHK 1343 SERSAVNIKGVFNTSNETKYEK+ PD +IDFNFYKTFWSLQE F NPASL A TKW K Sbjct: 178 SERSAVNIKGVFNTSNETKYEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQK 237 Query: 1342 FTSSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLV 1163 FTSSL VVLNTF+AQPLSDEEG A LE+E + FSIKYLTSS LMGLELKDPSFRRHVL+ Sbjct: 238 FTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLM 297 Query: 1162 QCLILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERE 983 QCLILFDYLKAPGKN+KD+PS+TM+EEIKSCEERVKKLLE+TPPRGK+FL IEHIL+RE Sbjct: 298 QCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRE 357 Query: 982 RNWVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQ 803 NWVWWKRDGC FEKQPIEKK + +KRRPRWRLGNKELSQLWKW+DQNPNALTDPQ Sbjct: 358 NNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGNKELSQLWKWSDQNPNALTDPQ 417 Query: 
802 RVRTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGI 623 RVR+PAI DYWKPLAEDMDESAGIE EYHH+N+RVYCWKGLRFSARQDLEGFSRFT+HGI Sbjct: 418 RVRSPAISDYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGI 477 Query: 622 EGVVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSRM 443 EGVVPLELLPPDVR+KYQAKP +RSKRAKKEE KG+VQQV+E+Q+A TPASE D EG+R Sbjct: 478 EGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMA-TPASENDGEGTRS 536 Query: 442 DPEASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQIEADTEAEQGMI 263 DP+ A D D I +P E K +SD D+G EAGQ+EAD E E GMI Sbjct: 537 DPDGPSAGMDVDTAIATGNVSQGGISTP---EENKLSSDTDIGQEAGQLEADAEVEPGMI 593 Query: 262 DGEMDAEVDLD 230 DGE DAEVDLD Sbjct: 594 DGETDAEVDLD 604 >ref|XP_003522894.1| PREDICTED: THO complex subunit 1 isoform X1 [Glycine max] gi|571450424|ref|XP_006578423.1| PREDICTED: THO complex subunit 1 isoform X2 [Glycine max] Length = 605 Score = 914 bits (2363), Expect = 0.0 Identities = 459/611 (75%), Positives = 504/611 (82%) Frame = -1 Query: 2056 MDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQS 1877 M++FK+AI+ PGPP+ FAL QDENQ LENILR LLQE VSAAVQ Sbjct: 1 MEVFKRAIIQPGPPESFALRTVQEVIKPQKQTKLAQDENQFLENILRMLLQEFVSAAVQF 60 Query: 1876 GEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 1697 GE+IMQ+GQSI + G +PRLLDIVLYLCEKEH+EGGMIFQLLEDLTEMSTM+NCKD Sbjct: 61 GEKIMQFGQSIDSSETTQGHIPRLLDIVLYLCEKEHIEGGMIFQLLEDLTEMSTMKNCKD 120 Query: 1696 IFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 1517 IFGYIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1516 RSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFT 1337 RSA+NIKGVFNTSNETKYEKE + IDFNFY+TFW LQE FSNP S++ A KW KFT Sbjct: 181 RSALNIKGVFNTSNETKYEKEPLEGICIDFNFYQTFWGLQEYFSNPTSISHAPAKWQKFT 240 Query: 1336 SSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQC 1157 SL+VVLNTFEAQPLSDEEG A NLE+E NFSIKYLTSS LMGLELKDPSFRRHVLVQC Sbjct: 241 
LSLSVVLNTFEAQPLSDEEGDANNLEEEAVNFSIKYLTSSKLMGLELKDPSFRRHVLVQC 300 Query: 1156 LILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERERN 977 LILFDYLKAPGK DKDLPS+ MKEEI S EERVKKLLE+TPP+G EFLH IEHILERE+N Sbjct: 301 LILFDYLKAPGKGDKDLPSENMKEEITSWEERVKKLLELTPPKGTEFLHKIEHILEREKN 360 Query: 976 WVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 797 WVWWKRDGC +EKQ IEKK +G +KRRPRWRLGNKELSQLWKWADQNPNALTDPQRV Sbjct: 361 WVWWKRDGCLPYEKQRIEKKAVPDGPKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 420 Query: 796 RTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGIEG 617 +TP+IM+YWKPLAEDMD SAGIE +YHHKN+RVYCWKGLR SARQDLEGFS+FT+HGIEG Sbjct: 421 QTPSIMEYWKPLAEDMDPSAGIEADYHHKNNRVYCWKGLRLSARQDLEGFSKFTDHGIEG 480 Query: 616 VVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEESQIATTPASETDVEGSRMDP 437 VVPLELLPPDVRSKYQAKP DRSKR+KKEETKG+ Q+EE+QIAT A+E D +G R D Sbjct: 481 VVPLELLPPDVRSKYQAKPNDRSKRSKKEETKGTAHQIEENQIATN-ATEIDGDGIRTDT 539 Query: 436 EASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDVGLEAGQIEADTEAEQGMIDG 257 A+ D + T +E QK +SD D G EAGQ+EAD E E GMIDG Sbjct: 540 TATSMEFDA------ATAPGTQGGTTTPEELQKLSSDTDGGQEAGQLEADAEVEAGMIDG 593 Query: 256 EMDAEVDLDVV 224 E DA+VDLD V Sbjct: 594 ETDADVDLDTV 604 >ref|XP_002303943.2| hypothetical protein POPTR_0003s19340g [Populus trichocarpa] gi|550343535|gb|EEE78922.2| hypothetical protein POPTR_0003s19340g [Populus trichocarpa] Length = 629 Score = 892 bits (2304), Expect = 0.0 Identities = 457/631 (72%), Positives = 507/631 (80%), Gaps = 20/631 (3%) Frame = -1 Query: 2056 MDLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQS 1877 M+ F++AIL GP + FAL QDENQLLEN+LRTLLQELVS++ QS Sbjct: 1 MEEFRRAILQSGPIESFALQTVQEFIKPQKQTKLVQDENQLLENMLRTLLQELVSSSAQS 60 Query: 1876 GEEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 1697 EEIM YG+SI DG+ GQ+PRLLD+VLYLCE++ VEGGMIFQLLEDLTEMSTMRNCKD Sbjct: 61 REEIMLYGKSIEDGEDSQGQIPRLLDVVLYLCERDFVEGGMIFQLLEDLTEMSTMRNCKD 120 Query: 1696 IFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 1517 IFGYIESKQDILGK 
ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1516 RSAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFT 1337 RSAVNIKGVFNTSNETKYEKE P + ++ QE F +P SLT + KW KF+ Sbjct: 181 RSAVNIKGVFNTSNETKYEKEPPAATCCMYSDKLVCLLFQEYFCDP-SLTLSPIKWQKFS 239 Query: 1336 SSLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQC 1157 SL V+LN FEAQPLS+EEGSA NLE+E ++F+IKYLTSS LMGLELKDPSFRRHVLVQC Sbjct: 240 LSLMVILNAFEAQPLSEEEGSANNLEEEAASFNIKYLTSSKLMGLELKDPSFRRHVLVQC 299 Query: 1156 LILFDYLKAPGKNDKDLPSDTM-------------------KEEIKSCEERVKKLLEMTP 1034 LILFDYLKAPGKNDKDL S++M KEEIKS EE VKKLLEMTP Sbjct: 300 LILFDYLKAPGKNDKDLTSESMVSAVPLLILILSALNSCLCKEEIKSREEHVKKLLEMTP 359 Query: 1033 PRGKEFLHSIEHILERERNWVWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELS 854 P+GK+FLH +EHILERE+NW+WWKRDGCP FEKQPIE K Q+GG+KRRPRWRLGNKELS Sbjct: 360 PKGKDFLHKVEHILEREKNWLWWKRDGCPPFEKQPIENKTVQDGGKKRRPRWRLGNKELS 419 Query: 853 QLWKWADQNPNALTDPQRVRTPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRF 674 QLWKWADQNPNALTDPQRVRTPAI DYWKPLAEDMD SA IE +YHHKN+RVYCWKGLR Sbjct: 420 QLWKWADQNPNALTDPQRVRTPAITDYWKPLAEDMDPSASIEADYHHKNNRVYCWKGLRV 479 Query: 673 SARQDLEGFSRFTEHGIEGVVPLELLPPDVRSKYQAKPGDRSKRAKKEETKGSVQQVEES 494 SARQDL+GFSRFT+HGIEGVVPLELLPPDVRSK+QAKP DRSKRAKK+E KG+ QVE++ Sbjct: 480 SARQDLDGFSRFTDHGIEGVVPLELLPPDVRSKHQAKPNDRSKRAKKDEPKGASHQVEDN 539 Query: 493 QIA-TTPASETDVEGSRMDPEASVAPADGDAIITXXXXXXXXXXSPTSDEHQKQNSDGDV 317 Q++ TPASE D EG R D EASV P D DA+ T +PT DEHQKQ+ D D Sbjct: 540 QVSIATPASEIDGEGIRTDLEASVTPMDSDAMAT--TSNISQSSTPTPDEHQKQSPDTDG 597 Query: 316 GLEAGQIEADTEAEQGMIDGEMDAEVDLDVV 224 G EAG IEAD EAE GMIDGE DAEVDL+ V Sbjct: 598 GQEAGHIEADAEAEAGMIDGETDAEVDLEAV 628 >gb|EPS68583.1| hypothetical protein M569_06184, partial [Genlisea aurea] Length = 510 Score = 891 bits (2302), Expect = 0.0 Identities = 431/509 (84%), Positives = 466/509 (91%) Frame = -1 Query: 2053 DLFKKAILHPGPPQDFALXXXXXXXXXXXXXXXXQDENQLLENILRTLLQELVSAAVQSG 1874 ++F++AI++PGPPQDFAL 
QDENQLLENILR LLQELVSAAVQSG Sbjct: 2 EIFREAIMNPGPPQDFALRTVEQVIKPQNIDKLLQDENQLLENILRALLQELVSAAVQSG 61 Query: 1873 EEIMQYGQSIADGDIRPGQVPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKDI 1694 E IM YGQS+A+G++R G++PRLLDIVLYLCEKEH+EGGMIFQLLEDLTEMSTMRNCKDI Sbjct: 62 EHIMHYGQSVAEGEVRYGEIPRLLDIVLYLCEKEHIEGGMIFQLLEDLTEMSTMRNCKDI 121 Query: 1693 FGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSER 1514 FGYIESKQDILGKPELF RGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSER Sbjct: 122 FGYIESKQDILGKPELFTRGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSER 181 Query: 1513 SAVNIKGVFNTSNETKYEKEAPDCSTIDFNFYKTFWSLQESFSNPASLTPALTKWHKFTS 1334 SAVNIKGVFNTSNETKYEKE P+C++IDFNFYKTFWSLQE FSNPASLTP TKW KF S Sbjct: 182 SAVNIKGVFNTSNETKYEKETPECTSIDFNFYKTFWSLQEFFSNPASLTPVATKWPKFCS 241 Query: 1333 SLTVVLNTFEAQPLSDEEGSAINLEDEGSNFSIKYLTSSNLMGLELKDPSFRRHVLVQCL 1154 SL VVLNTFEAQPL DEE +AINLEDE +NFSIKYLTSSNLMGLELKDPSFRRH LVQCL Sbjct: 242 SLMVVLNTFEAQPLRDEECNAINLEDE-ANFSIKYLTSSNLMGLELKDPSFRRHFLVQCL 300 Query: 1153 ILFDYLKAPGKNDKDLPSDTMKEEIKSCEERVKKLLEMTPPRGKEFLHSIEHILERERNW 974 ILFDYLK+PGKNDKDL D+M+EEIK+CEE+VKKLLEMTPPRGKEFLH I H+LERERNW Sbjct: 301 ILFDYLKSPGKNDKDLLLDSMREEIKNCEEQVKKLLEMTPPRGKEFLHGIGHVLERERNW 360 Query: 973 VWWKRDGCPAFEKQPIEKKLAQEGGRKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR 794 VWWKRD CP FEKQP+E+K+AQ+G RKRR RWRLGNKELSQLWKWADQNPNALTDP+RV Sbjct: 361 VWWKRDSCPPFEKQPLERKVAQDGARKRRLRWRLGNKELSQLWKWADQNPNALTDPRRVC 420 Query: 793 TPAIMDYWKPLAEDMDESAGIEEEYHHKNSRVYCWKGLRFSARQDLEGFSRFTEHGIEGV 614 TP+IMDYWKPLA+DMDE+AGIEEEYHHKN+RVYCWKGLRFSARQDLEGFSRFTEHGIEGV Sbjct: 421 TPSIMDYWKPLADDMDEAAGIEEEYHHKNNRVYCWKGLRFSARQDLEGFSRFTEHGIEGV 480 Query: 613 VPLELLPPDVRSKYQAKPGDRSKRAKKEE 527 VPLELLPPD RSKYQ KPGDRSKRAK++E Sbjct: 481 VPLELLPPDTRSKYQMKPGDRSKRAKRDE 509 >gb|EOY19639.1| Nuclear matrix protein-related isoform 2 [Theobroma cacao] Length = 572 Score = 889 bits (2296), Expect = 0.0 Identities = 451/576 (78%), Positives = 487/576 (84%), Gaps = 15/576 (2%) Frame = -1 Query: 1924 ILRTLLQELVSAAVQSGEEIMQYGQSIADGDIRPGQVPRLL---------------DIVL 
1790 +LRTLLQELVS++V SGEEIMQYG+SI D G +PRLL + VL Sbjct: 1 MLRTLLQELVSSSVPSGEEIMQYGKSIDDESDTQGVIPRLLGYVRVLIAEMTTIMQNFVL 60 Query: 1789 YLCEKEHVEGGMIFQLLEDLTEMSTMRNCKDIFGYIESKQDILGKPELFARGKLVMLRTC 1610 YLCEKEHVEGGMIFQLLEDL EMSTMRNCKDIF YIESKQDILGK ELFARGKLVMLRTC Sbjct: 61 YLCEKEHVEGGMIFQLLEDLNEMSTMRNCKDIFRYIESKQDILGKQELFARGKLVMLRTC 120 Query: 1609 NQLLRRLSKANDVVFCGRIIMFLAHFFPLSERSAVNIKGVFNTSNETKYEKEAPDCSTID 1430 NQLLRRLSKANDVVFCGRI+MFLAHFFPLSERSAVNIKGVFNTSNETKYEK+ P+ ++D Sbjct: 121 NQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKDPPEGISVD 180 Query: 1429 FNFYKTFWSLQESFSNPASLTPALTKWHKFTSSLTVVLNTFEAQPLSDEEGSAINLEDEG 1250 FNFYKTFWSLQ+ F NPASL+ A KW KFTSSL VVLNTFEAQPLS+EEG+ NLE+E Sbjct: 181 FNFYKTFWSLQDYFCNPASLSTAPVKWQKFTSSLMVVLNTFEAQPLSEEEGADNNLEEEA 240 Query: 1249 SNFSIKYLTSSNLMGLELKDPSFRRHVLVQCLILFDYLKAPGKNDKDLPSDTMKEEIKSC 1070 + F+IKYLTSS LMGLELKDPSFRRH+L+QCLILFDYLKAPGKNDKD S++MKEEIKSC Sbjct: 241 TTFNIKYLTSSKLMGLELKDPSFRRHILLQCLILFDYLKAPGKNDKD-SSESMKEEIKSC 299 Query: 1069 EERVKKLLEMTPPRGKEFLHSIEHILERERNWVWWKRDGCPAFEKQPIEKKLAQEGGRKR 890 E+RVKKLLE+TPP+GK+FL SIEHILERE+NWVWWKRDGCP FEKQPIEKK Q G +KR Sbjct: 300 EDRVKKLLEVTPPKGKDFLCSIEHILEREKNWVWWKRDGCPPFEKQPIEKKPVQNGAKKR 359 Query: 889 RPRWRLGNKELSQLWKWADQNPNALTDPQRVRTPAIMDYWKPLAEDMDESAGIEEEYHHK 710 RPRWRLGNKELSQLWKWADQNPNALTDPQRVRTPAI DYWKPLAEDMDESAGIE EYHHK Sbjct: 360 RPRWRLGNKELSQLWKWADQNPNALTDPQRVRTPAITDYWKPLAEDMDESAGIEAEYHHK 419 Query: 709 NSRVYCWKGLRFSARQDLEGFSRFTEHGIEGVVPLELLPPDVRSKYQAKPGDRSKRAKKE 530 N+RVYCWKGLRF+ARQDLEGFS+FTEHGIEGVVPLELLPPDVRSK+Q KP DRSKRAKKE Sbjct: 420 NNRVYCWKGLRFAARQDLEGFSKFTEHGIEGVVPLELLPPDVRSKFQGKPSDRSKRAKKE 479 Query: 529 ETKGSVQQVEESQIATTPASETDVEGSRMDPEASVAPADGDAIITXXXXXXXXXXSPTSD 350 ETK S QVEESQIA TPASE D EG R D EAS A D D +T +PT D Sbjct: 480 ETKTSSHQVEESQIA-TPASEVDGEGMRADMEASAALMDAD--VTAGTGNNSQGGTPTPD 536 Query: 349 EHQKQNSDGDVGLEAGQIEADTEAEQGMIDGEMDAE 242 EHQKQ+ D DVG EAGQ+EAD E E G IDGE D E Sbjct: 537 EHQKQSPDTDVGQEAGQLEADAEVEAG-IDGETDPE 571