BLASTX nr result
ID: Paeonia24_contig00002961
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Paeonia24_contig00002961 (2183 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI35079.3| unnamed protein product [Vitis vinifera] 991 0.0 ref|XP_002264619.2| PREDICTED: THO complex subunit 1-like [Vitis... 991 0.0 emb|CBI35093.3| unnamed protein product [Vitis vinifera] 988 0.0 ref|XP_002263874.1| PREDICTED: THO complex subunit 1-like [Vitis... 987 0.0 ref|XP_007204592.1| hypothetical protein PRUPE_ppa003099mg [Prun... 980 0.0 ref|XP_006465777.1| PREDICTED: THO complex subunit 1-like isofor... 961 0.0 ref|XP_007010828.1| Nuclear matrix protein-related isoform 1 [Th... 961 0.0 ref|XP_002529986.1| nuclear matrix protein, putative [Ricinus co... 959 0.0 ref|XP_006432406.1| hypothetical protein CICLE_v10000631mg [Citr... 959 0.0 ref|XP_006465778.1| PREDICTED: THO complex subunit 1-like isofor... 954 0.0 ref|XP_002299188.1| hypothetical protein POPTR_0001s06900g [Popu... 949 0.0 ref|XP_004307195.1| PREDICTED: THO complex subunit 1-like [Fraga... 946 0.0 ref|XP_004230044.1| PREDICTED: THO complex subunit 1-like [Solan... 939 0.0 ref|XP_007148665.1| hypothetical protein PHAVU_005G004500g [Phas... 938 0.0 ref|XP_003522894.1| PREDICTED: THO complex subunit 1 isoform X1 ... 932 0.0 ref|XP_006347676.1| PREDICTED: THO complex subunit 1-like [Solan... 928 0.0 ref|XP_004140313.1| PREDICTED: THO complex subunit 1-like [Cucum... 920 0.0 ref|NP_568219.1| THO complex subunit 1 [Arabidopsis thaliana] g... 915 0.0 ref|XP_006287318.1| hypothetical protein CARUB_v10000519mg [Caps... 914 0.0 ref|XP_002871388.1| hypothetical protein ARALYDRAFT_908935 [Arab... 912 0.0 >emb|CBI35079.3| unnamed protein product [Vitis vinifera] Length = 613 Score = 991 bits (2563), Expect = 0.0 Identities = 485/603 (80%), Positives = 531/603 (88%) Frame = -3 Query: 2070 IKDFLIVC*IFVMEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRT 1891 + F I C I V EIF+ ALL+ GPP+SFAL+ VQEAIKPQKQTKL QDENQLLENILR Sbjct: 2 VSPFNINCFILV-EIFKQALLKPGPPESFALQVVQEAIKPQKQTKLAQDENQLLENILRK 60 Query: 1890 LLQELVSTVAQSGESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLED 1711 LLQELVS QSGE IMQYGQS+DD E Q QIPRLLDIVLYLCEKEHVEGGMIFQLLED Sbjct: 61 LLQELVSCAVQSGEKIMQYGQSIDDEEAIQSQIPRLLDIVLYLCEKEHVEGGMIFQLLED 120 Query: 1710 LTEMSTMRNCQDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI 1531 LTEMSTMRNC+DIF YIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI Sbjct: 121 LTEMSTMRNCKDIFAYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI 180 Query: 1530 LTFLAHFFPLSERSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPAS 1351 L FLAHFFPLSERSAVNIKGVFNTSNETKYEK+AP+GIS+DFNFYKTFWSLQEHF NPAS Sbjct: 181 LMFLAHFFPLSERSAVNIKGVFNTSNETKYEKDAPEGISIDFNFYKTFWSLQEHFCNPAS 240 Query: 1350 ITAAPTKWQKFTSSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELK 1171 I+ APTKWQKFTS+L+VVL+TFEAQPLSDEEGN+NNLEEEAATFSIKYLTSSKLMGLELK Sbjct: 241 ISLAPTKWQKFTSNLMVVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELK 300 Query: 1170 DSSFRRHILVQCLILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFL 991 D SFRRHILVQCLILFDYLKAPGK DKD PS+SMKEEIKSCE+RVKK+LEMTPPKG +FL Sbjct: 301 DPSFRRHILVQCLILFDYLKAPGKNDKDLPSDSMKEEIKSCEERVKKLLEMTPPKGKEFL 360 Query: 990 RQVEHILEREKNWVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWAD 811 +EHILEREKNWVWWKRDGCP FE+QP EKK V GAKKRRPRWRMGNKEL QLW+WAD Sbjct: 361 HNIEHILEREKNWVWWKRDGCPPFERQPIEKKAVQDGAKKRRPRWRMGNKELSQLWKWAD 420 Query: 810 QNPKVLTDAQRVRAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLE 631 QNP LTD QR R PA+ +YWKPLA+DMD S+GIE EYHHKNNRVYCWKGLR++ARQDL+ Sbjct: 421 QNPNALTDPQRARTPAVSEYWKPLAEDMDLSAGIEAEYHHKNNRVYCWKGLRFAARQDLD 480 Query: 630 GFSRFTDKDIGRVVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPAN 451 GFSRFT+ I VVP ELL VR+KYQ+KP D++KRAKKE+ KGA+ Q+E+NQIATPA+ Sbjct: 481 GFSRFTEYGIEGVVPMELLPSDVRSKYQAKPSDRSKRAKKEETKGAAQQAEENQIATPAS 540 Query: 450 ELDGEGVRGDHEGLAAQMDTDGEVDPPTPEEHEKESPDTDMGQEAGQLEADTESDTGMID 271 E+DGEG R D E AA MDTD PT +E++K+S DTD GQEAGQ EAD E++ GMID Sbjct: 541 EIDGEGTRVDLEASAAPMDTDVTATTPTADENQKQSSDTDAGQEAGQSEADAEAEAGMID 600 Query: 270 GET 262 GET Sbjct: 601 GET 603 >ref|XP_002264619.2| PREDICTED: THO complex subunit 1-like [Vitis vinifera] Length = 601 Score = 991 bits (2561), Expect = 0.0 Identities = 481/591 (81%), Positives = 526/591 (89%) Frame = -3 Query: 2034 MEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQS 1855 MEIF+ ALL+ GPP+SFAL+ VQEAIKPQKQTKL QDENQLLENILR LLQELVS QS Sbjct: 1 MEIFKQALLKPGPPESFALQVVQEAIKPQKQTKLAQDENQLLENILRKLLQELVSCAVQS 60 Query: 1854 GESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCQD 1675 GE IMQYGQS+DD E Q QIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNC+D Sbjct: 61 GEKIMQYGQSIDDEEAIQSQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 120 Query: 1674 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPLSE 1495 IF YIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIL FLAHFFPLSE Sbjct: 121 IFAYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1494 RSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQKFT 1315 RSAVNIKGVFNTSNETKYEK+AP+GIS+DFNFYKTFWSLQEHF NPASI+ APTKWQKFT Sbjct: 181 RSAVNIKGVFNTSNETKYEKDAPEGISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKFT 240 Query: 1314 SSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILVQC 1135 S+L+VVL+TFEAQPLSDEEGN+NNLEEEAATFSIKYLTSSKLMGLELKD SFRRHILVQC Sbjct: 241 SNLMVVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQC 300 Query: 1134 LILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILEREKN 955 LILFDYLKAPGK DKD PS+SMKEEIKSCE+RVKK+LEMTPPKG +FL +EHILEREKN Sbjct: 301 LILFDYLKAPGKNDKDLPSDSMKEEIKSCEERVKKLLEMTPPKGKEFLHNIEHILEREKN 360 Query: 954 WVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQRV 775 WVWWKRDGCP FE+QP EKK V GAKKRRPRWRMGNKEL QLW+WADQNP LTD QR Sbjct: 361 WVWWKRDGCPPFERQPIEKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALTDPQRA 420 Query: 774 RAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDIGR 595 R PA+ +YWKPLA+DMD S+GIE EYHHKNNRVYCWKGLR++ARQDL+GFSRFT+ I Sbjct: 421 RTPAVSEYWKPLAEDMDLSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTEYGIEG 480 Query: 594 VVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGDHE 415 VVP ELL VR+KYQ+KP D++KRAKKE+ KGA+ Q+E+NQIATPA+E+DGEG R D E Sbjct: 481 VVPMELLPSDVRSKYQAKPSDRSKRAKKEETKGAAQQAEENQIATPASEIDGEGTRVDLE 540 Query: 414 GLAAQMDTDGEVDPPTPEEHEKESPDTDMGQEAGQLEADTESDTGMIDGET 262 AA MDTD PT +E++K+S DTD GQEAGQ EAD E++ GMIDGET Sbjct: 541 ASAAPMDTDVTATTPTADENQKQSSDTDAGQEAGQSEADAEAEAGMIDGET 591 >emb|CBI35093.3| unnamed protein product [Vitis vinifera] Length = 613 Score = 988 bits (2554), Expect = 0.0 Identities = 482/594 (81%), Positives = 527/594 (88%) Frame = -3 Query: 2043 IFVMEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTV 1864 IFV EIF+ ALL+ GPP+SFAL+ VQEAIKPQKQTKL QDENQLLENILR LLQELVS Sbjct: 11 IFV-EIFKQALLKPGPPESFALQVVQEAIKPQKQTKLAQDENQLLENILRKLLQELVSCA 69 Query: 1863 AQSGESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRN 1684 QSGE IM YGQS+DD E Q QIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRN Sbjct: 70 VQSGEKIMHYGQSIDDEEAIQSQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRN 129 Query: 1683 CQDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFP 1504 C+DIF YIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIL FLAHFFP Sbjct: 130 CKDIFAYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFP 189 Query: 1503 LSERSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQ 1324 LSERSAVNIKGVFNTSNETKYEK+AP+GIS+DFNFYKTFWSLQEHF NPASI+ APTKWQ Sbjct: 190 LSERSAVNIKGVFNTSNETKYEKDAPEGISIDFNFYKTFWSLQEHFCNPASISLAPTKWQ 249 Query: 1323 KFTSSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHIL 1144 KFTS+L+VVL+TFEAQPLSDEEGN+NNLEEEAATFSIKYLTSSKLMGLELKD SFRRHIL Sbjct: 250 KFTSNLMVVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHIL 309 Query: 1143 VQCLILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILER 964 VQCLILFDYLKAPGK DKD PS+SMKEEIKSCE+RVKK+LE TPPKG +FL +EHILER Sbjct: 310 VQCLILFDYLKAPGKNDKDLPSDSMKEEIKSCEERVKKLLETTPPKGKEFLHNIEHILER 369 Query: 963 EKNWVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDA 784 EKNWVWWKRDGCP FE+QP EKK V GAKKRRPRWRMGNKEL QLW+WADQNP LTD Sbjct: 370 EKNWVWWKRDGCPPFERQPIEKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALTDP 429 Query: 783 QRVRAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKD 604 QRVR PA+ +YWKPLA+DMD S+GIE EYHHKNNRVYCWKGLR++ARQDL+GFSRFT+ Sbjct: 430 QRVRTPAVSEYWKPLAEDMDSSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTEYG 489 Query: 603 IGRVVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRG 424 I VVP ELL VR+KYQ+KP D++KRAKKE+ KGA+ Q+E+NQIATPA+E+DGEG R Sbjct: 490 IEGVVPMELLPSDVRSKYQAKPSDRSKRAKKEETKGAAQQAEENQIATPASEIDGEGTRV 549 Query: 423 DHEGLAAQMDTDGEVDPPTPEEHEKESPDTDMGQEAGQLEADTESDTGMIDGET 262 D E AA MDTD PT +E++K+S DTD GQEAGQ EAD E++ GMIDGET Sbjct: 550 DLEASAAPMDTDVTATTPTADENQKQSSDTDAGQEAGQSEADAEAEAGMIDGET 603 >ref|XP_002263874.1| PREDICTED: THO complex subunit 1-like [Vitis vinifera] Length = 607 Score = 987 bits (2552), Expect = 0.0 Identities = 479/590 (81%), Positives = 524/590 (88%) Frame = -3 Query: 2031 EIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQSG 1852 EIF+ ALL+ GPP+SFAL+ VQEAIKPQKQTKL QDENQLLENILR LLQELVS QSG Sbjct: 8 EIFKQALLKPGPPESFALQVVQEAIKPQKQTKLAQDENQLLENILRKLLQELVSCAVQSG 67 Query: 1851 ESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCQDI 1672 E IM YGQS+DD E Q QIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNC+DI Sbjct: 68 EKIMHYGQSIDDEEAIQSQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKDI 127 Query: 1671 FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPLSER 1492 F YIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIL FLAHFFPLSER Sbjct: 128 FAYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSER 187 Query: 1491 SAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQKFTS 1312 SAVNIKGVFNTSNETKYEK+AP+GIS+DFNFYKTFWSLQEHF NPASI+ APTKWQKFTS Sbjct: 188 SAVNIKGVFNTSNETKYEKDAPEGISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKFTS 247 Query: 1311 SLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILVQCL 1132 +L+VVL+TFEAQPLSDEEGN+NNLEEEAATFSIKYLTSSKLMGLELKD SFRRHILVQCL Sbjct: 248 NLMVVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL 307 Query: 1131 ILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILEREKNW 952 ILFDYLKAPGK DKD PS+SMKEEIKSCE+RVKK+LE TPPKG +FL +EHILEREKNW Sbjct: 308 ILFDYLKAPGKNDKDLPSDSMKEEIKSCEERVKKLLETTPPKGKEFLHNIEHILEREKNW 367 Query: 951 VWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQRVR 772 VWWKRDGCP FE+QP EKK V GAKKRRPRWRMGNKEL QLW+WADQNP LTD QRVR Sbjct: 368 VWWKRDGCPPFERQPIEKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALTDPQRVR 427 Query: 771 APAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDIGRV 592 PA+ +YWKPLA+DMD S+GIE EYHHKNNRVYCWKGLR++ARQDL+GFSRFT+ I V Sbjct: 428 TPAVSEYWKPLAEDMDSSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTEYGIEGV 487 Query: 591 VPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGDHEG 412 VP ELL VR+KYQ+KP D++KRAKKE+ KGA+ Q+E+NQIATPA+E+DGEG R D E Sbjct: 488 VPMELLPSDVRSKYQAKPSDRSKRAKKEETKGAAQQAEENQIATPASEIDGEGTRVDLEA 547 Query: 411 LAAQMDTDGEVDPPTPEEHEKESPDTDMGQEAGQLEADTESDTGMIDGET 262 AA MDTD PT +E++K+S DTD GQEAGQ EAD E++ GMIDGET Sbjct: 548 SAAPMDTDVTATTPTADENQKQSSDTDAGQEAGQSEADAEAEAGMIDGET 597 >ref|XP_007204592.1| hypothetical protein PRUPE_ppa003099mg [Prunus persica] gi|462400123|gb|EMJ05791.1| hypothetical protein PRUPE_ppa003099mg [Prunus persica] Length = 604 Score = 980 bits (2533), Expect = 0.0 Identities = 477/597 (79%), Positives = 525/597 (87%), Gaps = 8/597 (1%) Frame = -3 Query: 2034 MEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQS 1855 ME+FR A+LQ GPP++FAL+TVQ+ IKPQKQTKL+QDENQLLENILRTLLQELVS Sbjct: 1 MEVFRRAILQPGPPENFALQTVQQVIKPQKQTKLVQDENQLLENILRTLLQELVS----- 55 Query: 1854 GESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCQD 1675 GE IMQYGQS+DDGETTQG IPRLLDIVLYLCE EH+EGGMIFQLLEDLTEMSTMRNC+D Sbjct: 56 GEQIMQYGQSIDDGETTQGHIPRLLDIVLYLCENEHIEGGMIFQLLEDLTEMSTMRNCKD 115 Query: 1674 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPLSE 1495 +FGYIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIL FLAHFFPLSE Sbjct: 116 VFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 175 Query: 1494 RSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQKFT 1315 RSAVNIKGVFNTSNETKYEK+ PDGIS+DFNFYKTFWSLQEHF NP S+T APTKW+KFT Sbjct: 176 RSAVNIKGVFNTSNETKYEKDPPDGISIDFNFYKTFWSLQEHFCNPPSLTLAPTKWKKFT 235 Query: 1314 SSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILVQC 1135 S L+VVL+TFEAQPLSDEEG++N+LEEEAA FSIKYLTSSKLMGLELKD SFRRHILVQC Sbjct: 236 SGLMVVLNTFEAQPLSDEEGDANSLEEEAANFSIKYLTSSKLMGLELKDPSFRRHILVQC 295 Query: 1134 LILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILEREKN 955 LILFDYLKAPGK++KD PS+SMKEEIKSCE+RVKK+LEMTPPKG +FL ++EHILEREKN Sbjct: 296 LILFDYLKAPGKSEKDLPSDSMKEEIKSCEERVKKLLEMTPPKGENFLHKIEHILEREKN 355 Query: 954 WVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQRV 775 WVWWKRDGCP FEKQP EKKVV GAKKRRPRWRMGNKEL LW+WADQNP LTD QRV Sbjct: 356 WVWWKRDGCPPFEKQPAEKKVVQEGAKKRRPRWRMGNKELSLLWKWADQNPNALTDPQRV 415 Query: 774 RAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDIGR 595 R PAI DYWKPLADDMD ++GIE EYHHKNNRVYCWKGLR+SARQDLEGFSRFT+ I Sbjct: 416 RTPAITDYWKPLADDMDPAAGIEAEYHHKNNRVYCWKGLRFSARQDLEGFSRFTEFGIEG 475 Query: 594 VVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGDHE 415 VVP ELLTP+ R+KYQ+KP DK+KRAKKE+ KGA+HQ E+NQIAT ANE+DGEG+R E Sbjct: 476 VVPLELLTPEERSKYQAKPNDKSKRAKKEETKGAAHQVEENQIATAANEIDGEGIRAVLE 535 Query: 414 GLAAQMDTDGEV--------DPPTPEEHEKESPDTDMGQEAGQLEADTESDTGMIDG 268 DTD V P P+EH+K+S DTD+GQEAGQ+EAD E + GMIDG Sbjct: 536 ASVTPTDTDATVATGDMSQGGSPIPDEHQKQSSDTDVGQEAGQMEADAEVEAGMIDG 592 >ref|XP_006465777.1| PREDICTED: THO complex subunit 1-like isoform X1 [Citrus sinensis] Length = 608 Score = 961 bits (2484), Expect = 0.0 Identities = 470/599 (78%), Positives = 521/599 (86%), Gaps = 8/599 (1%) Frame = -3 Query: 2034 MEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQS 1855 ME+FR A+LQ+GPP++FAL+TVQE IKPQKQTKL QDENQLLEN+LRTLLQELVS+ QS Sbjct: 1 MEVFRRAILQAGPPENFALQTVQEVIKPQKQTKLAQDENQLLENMLRTLLQELVSSAVQS 60 Query: 1854 GESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCQD 1675 GE IM YGQS+DDGET+Q QIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTM+NC+D Sbjct: 61 GEPIMHYGQSIDDGETSQAQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKD 120 Query: 1674 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPLSE 1495 IFGYIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIL FLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKLELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1494 RSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQKFT 1315 RSAVNIKGVFNTSNETKYEK+ PDGI VDFNFYKTFWSLQE+F NPA +T APTKWQKFT Sbjct: 181 RSAVNIKGVFNTSNETKYEKDPPDGIPVDFNFYKTFWSLQEYFCNPA-LTLAPTKWQKFT 239 Query: 1314 SSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILVQC 1135 SSL+VVL+TF+AQPLSDE G++N LEEEAATF+IKYLTSSKLMGLELKD SFRRH+LVQC Sbjct: 240 SSLMVVLNTFDAQPLSDEVGDANVLEEEAATFNIKYLTSSKLMGLELKDPSFRRHVLVQC 299 Query: 1134 LILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILEREKN 955 LILFDYLKAPGK DKD PSESMKEE+KSCE+RVKK+LE TPPKG DFL +EHILEREKN Sbjct: 300 LILFDYLKAPGKNDKDLPSESMKEEMKSCEERVKKLLETTPPKGKDFLHSIEHILEREKN 359 Query: 954 WVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQRV 775 WVWWKRDGCP FEKQ EKK V G KKRRPRWR+GNKEL QLW+WADQNP LTD QRV Sbjct: 360 WVWWKRDGCPPFEKQSMEKKAVQDGPKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 419 Query: 774 RAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDIGR 595 R PAI +YWKPLADDMD S+GIE EYHHKN+RVYCWKGLR+SARQDL+GFSRFTD I Sbjct: 420 RTPAITEYWKPLADDMDPSAGIEAEYHHKNSRVYCWKGLRFSARQDLDGFSRFTDHGIEG 479 Query: 594 VVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGDHE 415 VVP ELL P VR++Y+ K D++KRAKKED K A Q+E+NQIA A+E DGEG+R D E Sbjct: 480 VVPLELLPPHVRSRYEGKANDRSKRAKKEDSKVAPSQAEENQIAASASENDGEGIRADLE 539 Query: 414 GLAAQMDTD--------GEVDPPTPEEHEKESPDTDMGQEAGQLEADTESDTGMIDGET 262 A ++TD + TP+EH+K+S DTDMGQEAGQL+AD E+D GM+DGET Sbjct: 540 ASATPVETDVTAGTGNISQSGTATPDEHQKQSSDTDMGQEAGQLDADAEADAGMMDGET 598 >ref|XP_007010828.1| Nuclear matrix protein-related isoform 1 [Theobroma cacao] gi|508727741|gb|EOY19638.1| Nuclear matrix protein-related isoform 1 [Theobroma cacao] Length = 602 Score = 961 bits (2483), Expect = 0.0 Identities = 473/600 (78%), Positives = 522/600 (87%), Gaps = 8/600 (1%) Frame = -3 Query: 2037 VMEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQ 1858 +ME FR A+LQ GPP++FAL+ VQE IKPQKQTKL QDENQLLEN+LRTLLQELVS+ Sbjct: 1 MMEAFRRAILQPGPPETFALKIVQEVIKPQKQTKLAQDENQLLENMLRTLLQELVSSSVP 60 Query: 1857 SGESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCQ 1678 SGE IMQYG+S+DD TQG IPRLLD VLYLCEKEHVEGGMIFQLLEDL EMSTMRNC+ Sbjct: 61 SGEEIMQYGKSIDDESDTQGVIPRLLDFVLYLCEKEHVEGGMIFQLLEDLNEMSTMRNCK 120 Query: 1677 DIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPLS 1498 DIF YIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIL FLAHFFPLS Sbjct: 121 DIFRYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLS 180 Query: 1497 ERSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQKF 1318 ERSAVNIKGVFNTSNETKYEK+ P+GISVDFNFYKTFWSLQ++F NPAS++ AP KWQKF Sbjct: 181 ERSAVNIKGVFNTSNETKYEKDPPEGISVDFNFYKTFWSLQDYFCNPASLSTAPVKWQKF 240 Query: 1317 TSSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILVQ 1138 TSSL+VVL+TFEAQPLS+EEG NNLEEEA TF+IKYLTSSKLMGLELKD SFRRHIL+Q Sbjct: 241 TSSLMVVLNTFEAQPLSEEEGADNNLEEEATTFNIKYLTSSKLMGLELKDPSFRRHILLQ 300 Query: 1137 CLILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILEREK 958 CLILFDYLKAPGK DKD SESMKEEIKSCEDRVKK+LE+TPPKG DFL +EHILEREK Sbjct: 301 CLILFDYLKAPGKNDKD-SSESMKEEIKSCEDRVKKLLEVTPPKGKDFLCSIEHILEREK 359 Query: 957 NWVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQR 778 NWVWWKRDGCP FEKQP EKK V GAKKRRPRWR+GNKEL QLW+WADQNP LTD QR Sbjct: 360 NWVWWKRDGCPPFEKQPIEKKPVQNGAKKRRPRWRLGNKELSQLWKWADQNPNALTDPQR 419 Query: 777 VRAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDIG 598 VR PAI DYWKPLA+DMDES+GIE EYHHKNNRVYCWKGLR++ARQDLEGFS+FT+ I Sbjct: 420 VRTPAITDYWKPLAEDMDESAGIEAEYHHKNNRVYCWKGLRFAARQDLEGFSKFTEHGIE 479 Query: 597 RVVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGDH 418 VVP ELL P VR+K+Q KP D++KRAKKE+ K +SHQ E++QIATPA+E+DGEG+R D Sbjct: 480 GVVPLELLPPDVRSKFQGKPSDRSKRAKKEETKTSSHQVEESQIATPASEVDGEGMRADM 539 Query: 417 EGLAAQMDTD--------GEVDPPTPEEHEKESPDTDMGQEAGQLEADTESDTGMIDGET 262 E AA MD D + PTP+EH+K+SPDTD+GQEAGQLEAD E + G IDGET Sbjct: 540 EASAALMDADVTAGTGNNSQGGTPTPDEHQKQSPDTDVGQEAGQLEADAEVEAG-IDGET 598 >ref|XP_002529986.1| nuclear matrix protein, putative [Ricinus communis] gi|223530509|gb|EEF32391.1| nuclear matrix protein, putative [Ricinus communis] Length = 608 Score = 959 bits (2480), Expect = 0.0 Identities = 466/599 (77%), Positives = 527/599 (87%), Gaps = 8/599 (1%) Frame = -3 Query: 2034 MEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQS 1855 ME F++A+LQ GPP++FAL+TVQE IKPQ+QTKL QDENQLLEN+LRTLLQELV++ S Sbjct: 1 MEEFKNAILQPGPPENFALQTVQEFIKPQRQTKLAQDENQLLENMLRTLLQELVASAVHS 60 Query: 1854 GESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCQD 1675 GE IM YGQSVD+GE +QGQIPRLLD+VL+LCE+EHVEGGMIFQLLEDLTEMSTM+NCQD Sbjct: 61 GEQIMLYGQSVDEGEKSQGQIPRLLDVVLHLCEREHVEGGMIFQLLEDLTEMSTMKNCQD 120 Query: 1674 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPLSE 1495 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIL FLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1494 RSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQKFT 1315 RSAVNIKGVFNTSNETKYEK+ P GISVDFNFYKT WSLQE+F NPA +T APTKW KFT Sbjct: 181 RSAVNIKGVFNTSNETKYEKDPPAGISVDFNFYKTLWSLQENFCNPAPLTLAPTKWHKFT 240 Query: 1314 SSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILVQC 1135 SSL+VVL+TFEAQPLS+EEG++NNLEEEAATF+IKYLTSSKLMGLELKD SFRRHILVQC Sbjct: 241 SSLMVVLNTFEAQPLSEEEGDANNLEEEAATFNIKYLTSSKLMGLELKDPSFRRHILVQC 300 Query: 1134 LILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILEREKN 955 LILFDYLKAPGK DKD SESMKE+I++CE+RVKK+LEMTPPKG DFL+++EH+LEREKN Sbjct: 301 LILFDYLKAPGKNDKDSTSESMKEDIRTCEERVKKLLEMTPPKGKDFLQKIEHVLEREKN 360 Query: 954 WVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQRV 775 WV WKRDGC FEKQP E K + G+KKR+PRWR+GNKEL QLW+WADQNP LTD QRV Sbjct: 361 WVCWKRDGCQPFEKQPIENKTIQEGSKKRKPRWRLGNKELSQLWKWADQNPNALTDPQRV 420 Query: 774 RAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDIGR 595 R PAI +YWKPLA+DMD S+GIE EYHHKNNRVYCWKGLR+SARQDL+GFSRFTD I Sbjct: 421 RTPAITEYWKPLAEDMDPSAGIEAEYHHKNNRVYCWKGLRFSARQDLDGFSRFTDHGIEG 480 Query: 594 VVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGDHE 415 VVP ELL P VR+KYQ+KP D++KRAKK+DIKG S+Q+E+NQIATPA+E+DGEG+R D E Sbjct: 481 VVPLELLPPDVRSKYQAKPNDRSKRAKKDDIKGGSNQTEENQIATPASEIDGEGIRAD-E 539 Query: 414 GLAAQMDTD--------GEVDPPTPEEHEKESPDTDMGQEAGQLEADTESDTGMIDGET 262 AA MDTD + PTP+E +++SPD D GQEAG LEAD E + GMIDGET Sbjct: 540 AAAAPMDTDAMATAGSTSQGGTPTPDERQRQSPDADDGQEAGHLEADGEVEAGMIDGET 598 >ref|XP_006432406.1| hypothetical protein CICLE_v10000631mg [Citrus clementina] gi|557534528|gb|ESR45646.1| hypothetical protein CICLE_v10000631mg [Citrus clementina] Length = 608 Score = 959 bits (2478), Expect = 0.0 Identities = 468/599 (78%), Positives = 521/599 (86%), Gaps = 8/599 (1%) Frame = -3 Query: 2034 MEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQS 1855 ME+FR A+L +GPP++FAL+TVQE IKPQKQTKL QDENQLLEN+LRTLLQELVS+ QS Sbjct: 1 MEVFRRAILHAGPPENFALQTVQEVIKPQKQTKLAQDENQLLENMLRTLLQELVSSAVQS 60 Query: 1854 GESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCQD 1675 GE IM YGQS+DDGET+Q QIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTM+NC+D Sbjct: 61 GEPIMHYGQSIDDGETSQAQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKD 120 Query: 1674 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPLSE 1495 IFGYIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIL FLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKLELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1494 RSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQKFT 1315 RSAVNIKGVFNTSNETKYEK+ PDGI VDFNFYKTFWSLQE+F NPA +T APTKWQKFT Sbjct: 181 RSAVNIKGVFNTSNETKYEKDPPDGIPVDFNFYKTFWSLQEYFCNPA-LTLAPTKWQKFT 239 Query: 1314 SSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILVQC 1135 SSL+VVL+TF+AQPLSDE G++N LEEEAATF+IKYLTSSKLMGLELKD SFRRH+LVQC Sbjct: 240 SSLMVVLNTFDAQPLSDEVGDANVLEEEAATFNIKYLTSSKLMGLELKDPSFRRHVLVQC 299 Query: 1134 LILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILEREKN 955 LILFDYLKAPGK DKD PSESMKEE+KSCE+RVKK+LEMTPPKG DFL +EHILEREKN Sbjct: 300 LILFDYLKAPGKNDKDLPSESMKEEMKSCEERVKKLLEMTPPKGKDFLHSIEHILEREKN 359 Query: 954 WVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQRV 775 WVWWKRDGCP FEKQ EKK V G KKRRPRWR+GNKEL QLW+WADQNP LTD QRV Sbjct: 360 WVWWKRDGCPPFEKQSMEKKAVQDGPKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 419 Query: 774 RAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDIGR 595 R PAI +YWKPLA+DMD S+GIE EYHHKN+RVYCWKGLR+SARQDL+GFSRFTD I Sbjct: 420 RTPAITEYWKPLAEDMDPSAGIEAEYHHKNSRVYCWKGLRFSARQDLDGFSRFTDHGIEG 479 Query: 594 VVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGDHE 415 VVP ELL P VR++Y+ K D++KRAKKED K A Q+E+NQIA A+E DG+G+R D E Sbjct: 480 VVPLELLPPHVRSRYEGKANDRSKRAKKEDSKVAPSQAEENQIAASASENDGDGIRADLE 539 Query: 414 GLAAQMDTD--------GEVDPPTPEEHEKESPDTDMGQEAGQLEADTESDTGMIDGET 262 A ++TD + TP+EH+K+S DTDMGQEAGQL+AD E+D GM+DGET Sbjct: 540 ASATPVETDVTAGTGNISQSGTATPDEHQKQSSDTDMGQEAGQLDADAEADAGMMDGET 598 >ref|XP_006465778.1| PREDICTED: THO complex subunit 1-like isoform X2 [Citrus sinensis] Length = 607 Score = 954 bits (2467), Expect = 0.0 Identities = 469/599 (78%), Positives = 520/599 (86%), Gaps = 8/599 (1%) Frame = -3 Query: 2034 MEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQS 1855 ME+FR A+LQ+GPP++FAL+TVQE IKPQKQTKL QDENQLLEN+LRTLLQELVS+ QS Sbjct: 1 MEVFRRAILQAGPPENFALQTVQEVIKPQKQTKLAQDENQLLENMLRTLLQELVSSAVQS 60 Query: 1854 GESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCQD 1675 GE IM YGQS+DDGET+Q QIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTM+NC+D Sbjct: 61 GEPIMHYGQSIDDGETSQAQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKD 120 Query: 1674 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPLSE 1495 IFGYIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIL FLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKLELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1494 RSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQKFT 1315 RSAVNIKGVFNTSNETKYEK+ PDGI VDFNFYKTFWSLQE+F NPA +T APTKWQKFT Sbjct: 181 RSAVNIKGVFNTSNETKYEKDPPDGIPVDFNFYKTFWSLQEYFCNPA-LTLAPTKWQKFT 239 Query: 1314 SSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILVQC 1135 SSL+VVL+TF+AQPLSDE G++N LEEEAATF+IKYLTSSKLMGLELKD SFRRH+LVQC Sbjct: 240 SSLMVVLNTFDAQPLSDEVGDANVLEEEAATFNIKYLTSSKLMGLELKDPSFRRHVLVQC 299 Query: 1134 LILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILEREKN 955 LILFDYLKAPGK DKD PSESMKEE+KSCE+RVKK+LE TPPKG DFL +EHILEREKN Sbjct: 300 LILFDYLKAPGKNDKDLPSESMKEEMKSCEERVKKLLETTPPKGKDFLHSIEHILEREKN 359 Query: 954 WVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQRV 775 WVWWKRDGCP FEKQ EKK V G KK RPRWR+GNKEL QLW+WADQNP LTD QRV Sbjct: 360 WVWWKRDGCPPFEKQSMEKKAVQDGPKK-RPRWRLGNKELSQLWKWADQNPNALTDPQRV 418 Query: 774 RAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDIGR 595 R PAI +YWKPLADDMD S+GIE EYHHKN+RVYCWKGLR+SARQDL+GFSRFTD I Sbjct: 419 RTPAITEYWKPLADDMDPSAGIEAEYHHKNSRVYCWKGLRFSARQDLDGFSRFTDHGIEG 478 Query: 594 VVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGDHE 415 VVP ELL P VR++Y+ K D++KRAKKED K A Q+E+NQIA A+E DGEG+R D E Sbjct: 479 VVPLELLPPHVRSRYEGKANDRSKRAKKEDSKVAPSQAEENQIAASASENDGEGIRADLE 538 Query: 414 GLAAQMDTD--------GEVDPPTPEEHEKESPDTDMGQEAGQLEADTESDTGMIDGET 262 A ++TD + TP+EH+K+S DTDMGQEAGQL+AD E+D GM+DGET Sbjct: 539 ASATPVETDVTAGTGNISQSGTATPDEHQKQSSDTDMGQEAGQLDADAEADAGMMDGET 597 >ref|XP_002299188.1| hypothetical protein POPTR_0001s06900g [Populus trichocarpa] gi|222846446|gb|EEE83993.1| hypothetical protein POPTR_0001s06900g [Populus trichocarpa] Length = 608 Score = 949 bits (2453), Expect = 0.0 Identities = 466/599 (77%), Positives = 519/599 (86%), Gaps = 8/599 (1%) Frame = -3 Query: 2034 MEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQS 1855 ME FR A+LQ GP ++FAL+TVQE IKPQKQTKL+QDENQLLEN+LRTLLQELVS+ AQS Sbjct: 1 MEEFRRAILQPGPVETFALKTVQEFIKPQKQTKLVQDENQLLENMLRTLLQELVSSAAQS 60 Query: 1854 GESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCQD 1675 GE IM G+S+DD E +QGQIPRLLD VLYLCE+EH+EGGMIFQLLEDLTEMSTMRNC+D Sbjct: 61 GEEIMLSGKSIDDEENSQGQIPRLLDAVLYLCEREHIEGGMIFQLLEDLTEMSTMRNCKD 120 Query: 1674 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPLSE 1495 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIL FLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1494 RSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQKFT 1315 RSAVNIKGVFNTSNETKYEKE P IS+DFNFYKT WSLQE+F +P S+T +P KWQKF+ Sbjct: 181 RSAVNIKGVFNTSNETKYEKEPPAAISLDFNFYKTLWSLQEYFCDP-SLTLSPIKWQKFS 239 Query: 1314 SSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILVQC 1135 SSL+VVL+TFEAQPLS+EEG++NNLEEEAA F+IKYLTSSKLMGLELKD SFRRH+LVQC Sbjct: 240 SSLMVVLNTFEAQPLSEEEGDANNLEEEAAAFNIKYLTSSKLMGLELKDPSFRRHVLVQC 299 Query: 1134 LILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILEREKN 955 LILFDYLKAPGK DKD SESMKEEI+S E+ VKK+LEMTPPKG DFL VEHILEREKN Sbjct: 300 LILFDYLKAPGKNDKDLTSESMKEEIRSREEHVKKLLEMTPPKGKDFLHMVEHILEREKN 359 Query: 954 WVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQRV 775 W+WWKRDGCP FEKQP E K V G KKRRPRWR+GNKEL QLW+WADQNP LTD QRV Sbjct: 360 WLWWKRDGCPPFEKQPIENKTVQDGGKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 419 Query: 774 RAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDIGR 595 R P I DYWKPLA+DMD S+GI+ EYHHKNNRVYCWKGLR+SARQDL+GFSRFTD I Sbjct: 420 RTPIITDYWKPLAEDMDPSAGIDAEYHHKNNRVYCWKGLRFSARQDLDGFSRFTDHGIEG 479 Query: 594 VVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGDHE 415 VVP ELL P VR+KYQ+KP D++KRAKK++ KGA HQ EDNQI+TPA+E+DGEG+R D E Sbjct: 480 VVPLELLPPDVRSKYQAKPNDRSKRAKKDEPKGALHQVEDNQISTPASEIDGEGIRIDLE 539 Query: 414 GLAAQMDTD--------GEVDPPTPEEHEKESPDTDMGQEAGQLEADTESDTGMIDGET 262 AA MDTD + PTP+EH+K+ DTD GQEAGQLEAD E++ GMIDGET Sbjct: 540 ASAAPMDTDVTATTGSISQSGTPTPDEHQKQGSDTDGGQEAGQLEADAEAEAGMIDGET 598 >ref|XP_004307195.1| PREDICTED: THO complex subunit 1-like [Fragaria vesca subsp. vesca] Length = 611 Score = 946 bits (2444), Expect = 0.0 Identities = 464/601 (77%), Positives = 516/601 (85%), Gaps = 11/601 (1%) Frame = -3 Query: 2034 MEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQS 1855 ME+FRSA+LQ GPP++FAL+TVQ+ IKPQK TKL+QDENQLLENILRTLLQELVS+ QS Sbjct: 1 MEVFRSAILQPGPPETFALQTVQQVIKPQKGTKLVQDENQLLENILRTLLQELVSSAVQS 60 Query: 1854 GESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCQD 1675 GE IMQYGQS+DDGE T+G IPRLLD+VLYLCE EHVEGGMIFQLLEDLTEMSTMRNC+D Sbjct: 61 GEQIMQYGQSIDDGEATRGHIPRLLDVVLYLCENEHVEGGMIFQLLEDLTEMSTMRNCKD 120 Query: 1674 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPLSE 1495 +FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIL FLAHFFPLSE Sbjct: 121 VFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1494 RSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQKFT 1315 RSAVNIKGVFNTSNETKYEK+APDGIS+DFNFYKTFWSLQE+F NPA +T APTKWQKFT Sbjct: 181 RSAVNIKGVFNTSNETKYEKDAPDGISIDFNFYKTFWSLQEYFCNPAPLTVAPTKWQKFT 240 Query: 1314 SSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILVQC 1135 SSL VVL+TFEAQPLSDEEG +NNL EE+A FSIKYLTSSKLMGLELKD SFRRHILVQC Sbjct: 241 SSLKVVLNTFEAQPLSDEEGEANNL-EESANFSIKYLTSSKLMGLELKDPSFRRHILVQC 299 Query: 1134 LILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILEREKN 955 LILFDYLKAPGK++KD PSESMKEEI S E+ VKK+LEMTPPKG FL ++EHILEREKN Sbjct: 300 LILFDYLKAPGKSEKDLPSESMKEEINSYEEHVKKLLEMTPPKGESFLHKIEHILEREKN 359 Query: 954 WVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQRV 775 WVWWKRDGCP FEKQP EKK V GAKKR+PRWR+GNKEL QLW+WADQNP LTD QR+ Sbjct: 360 WVWWKRDGCPPFEKQPIEKKTVQDGAKKRKPRWRLGNKELSQLWKWADQNPNALTDTQRL 419 Query: 774 RAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDIGR 595 R P+I +YWKPLA+DMD ++GIE EYHHKNNRVYCWKGLR+SARQDLEGFS+FT+ I Sbjct: 420 RTPSITEYWKPLAEDMDPAAGIEAEYHHKNNRVYCWKGLRFSARQDLEGFSKFTEFGIEG 479 Query: 594 VVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGDHE 415 VVP ELL P+ RAKY K +K+KRAKKED K A H E+NQ+AT A ++DGE +R D Sbjct: 480 VVPLELLPPEERAKYAPKTNEKSKRAKKEDAKAAVHHVEENQVATAATDVDGEVLRTDVG 539 Query: 414 GLAAQMDTDGEV-------DPPTPEEHEKESPDTDMGQEAGQL----EADTESDTGMIDG 268 L A +DTD + + P +EH+K+S DTD GQEAGQL E D E D GMIDG Sbjct: 540 ALVAPLDTDNTMVCNTSQGNSPMADEHQKQSSDTDGGQEAGQLEDDAEVDAEGDAGMIDG 599 Query: 267 E 265 E Sbjct: 600 E 600 >ref|XP_004230044.1| PREDICTED: THO complex subunit 1-like [Solanum lycopersicum] Length = 608 Score = 939 bits (2426), Expect = 0.0 Identities = 453/599 (75%), Positives = 519/599 (86%), Gaps = 8/599 (1%) Frame = -3 Query: 2034 MEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQS 1855 M++FR A+L+ GPP+ FAL TVQEAIKPQKQTKL+QDENQLLENILR+LLQELV+ QS Sbjct: 1 MDLFRQAILRQGPPEEFALLTVQEAIKPQKQTKLVQDENQLLENILRSLLQELVAAAVQS 60 Query: 1854 GESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCQD 1675 G+ +M+YG S+ DGE++QGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNC+D Sbjct: 61 GQKLMKYGVSIVDGESSQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCED 120 Query: 1674 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPLSE 1495 +FGYIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+ FLAHFFPLSE Sbjct: 121 VFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 180 Query: 1494 RSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQKFT 1315 RSAVNIKGVFNTSNETKYE E PDGIS+DFNFY+T WSLQE+F NP S+ AP KW KFT Sbjct: 181 RSAVNIKGVFNTSNETKYETEVPDGISIDFNFYRTLWSLQEYFCNPPSLINAPGKWHKFT 240 Query: 1314 SSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILVQC 1135 SSL +VL+TFEAQPLSDEEGN++NLE++AATF+IKYLTSSKLMGLELKD SFRRH+LVQC Sbjct: 241 SSLTLVLNTFEAQPLSDEEGNAHNLEDDAATFNIKYLTSSKLMGLELKDPSFRRHVLVQC 300 Query: 1134 LILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILEREKN 955 LILFDYLKAPGK++K+ PSE+MKEEIK+ E+R KK+LEMTPPKG DFLR +EHILERE+N Sbjct: 301 LILFDYLKAPGKSEKELPSEAMKEEIKTSEERAKKLLEMTPPKGIDFLRSIEHILERERN 360 Query: 954 WVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQRV 775 WVWWKRDGCP FEKQP EKK+V G KKRR RW +GNKEL QLW+WADQ LTDA+RV Sbjct: 361 WVWWKRDGCPPFEKQPVEKKLVQDGTKKRRTRWSLGNKELSQLWKWADQYSGALTDAERV 420 Query: 774 RAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDIGR 595 PAI YWKPLA+DMDES+GIE EYHHKNNRVYCWKGLR+SARQDLEGFSRFT+ I Sbjct: 421 ATPAITKYWKPLAEDMDESAGIEAEYHHKNNRVYCWKGLRFSARQDLEGFSRFTEHGIEG 480 Query: 594 VVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGDHE 415 VVP ELL +VRAKYQ+KP ++ KR KKED K ++ Q+E+NQIATP +E+D E R D E Sbjct: 481 VVPLELLPNEVRAKYQAKPSERTKRTKKEDTKNSAQQAEENQIATPPSEMDNEVGRADPE 540 Query: 414 GLAAQMDTDGEV--------DPPTPEEHEKESPDTDMGQEAGQLEADTESDTGMIDGET 262 AA MDTD + + PTPE+++K+S DTD+ QEAGQ+EADTE++TGMIDGET Sbjct: 541 ASAAPMDTDAGIATVNICQEETPTPEDNQKQSSDTDVAQEAGQIEADTEAETGMIDGET 599 >ref|XP_007148665.1| hypothetical protein PHAVU_005G004500g [Phaseolus vulgaris] gi|561021929|gb|ESW20659.1| hypothetical protein PHAVU_005G004500g [Phaseolus vulgaris] Length = 604 Score = 938 bits (2424), Expect = 0.0 Identities = 459/597 (76%), Positives = 515/597 (86%), Gaps = 6/597 (1%) Frame = -3 Query: 2034 MEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQS 1855 ME+F+ A+LQ GPP++FAL+TVQE IKPQKQTKL QDENQ LENILR LLQE VS A S Sbjct: 1 MEVFKRAILQPGPPENFALKTVQEVIKPQKQTKLAQDENQFLENILRMLLQEFVSA-AVS 59 Query: 1854 GESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCQD 1675 E IMQ+GQS+D ETTQG IPRLLDIVLYLCEKEH+EGGMIFQLLEDLTEMSTM+NC+D Sbjct: 60 AEKIMQFGQSIDSNETTQGHIPRLLDIVLYLCEKEHIEGGMIFQLLEDLTEMSTMKNCKD 119 Query: 1674 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPLSE 1495 +FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIL FLAHFFPLSE Sbjct: 120 VFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 179 Query: 1494 RSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQKFT 1315 RSA+NIKGVFNTSNETK+EKE +GI +DFNFY+TFW LQE FSNP SI+ AP KWQKFT Sbjct: 180 RSALNIKGVFNTSNETKFEKEPLEGICIDFNFYQTFWGLQEFFSNPTSISHAPVKWQKFT 239 Query: 1314 SSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILVQC 1135 SSL VVL+TFEAQPLSDEEG++NNLEEEA FSIKYLTSSKLMGLELKD SFRRH+LVQC Sbjct: 240 SSLSVVLNTFEAQPLSDEEGDANNLEEEAVNFSIKYLTSSKLMGLELKDPSFRRHVLVQC 299 Query: 1134 LILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILEREKN 955 LILFDYLKAPGK DKD PSE+MKEEI SCE+RVKK+LE+TPPKG++FL ++EHILEREKN Sbjct: 300 LILFDYLKAPGKGDKDLPSENMKEEITSCEERVKKLLELTPPKGSEFLHKIEHILEREKN 359 Query: 954 WVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQRV 775 WVWWKRDGC +EKQP EKK V G+KKRRPRWR+GNKEL QLW+WADQNP LTD QRV Sbjct: 360 WVWWKRDGCLPYEKQPIEKKAVPEGSKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 419 Query: 774 RAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDIGR 595 + P+IM+YWKPLADDMD S+GIE EYHHKNNRVYCWKGLR +ARQDLEGFS+FTD I Sbjct: 420 QTPSIMEYWKPLADDMDPSAGIEAEYHHKNNRVYCWKGLRLAARQDLEGFSKFTDHGIEG 479 Query: 594 VVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGDHE 415 VVP ELL P VR+KYQ+KP D++KR+KKE+ KG++HQ E+NQIAT A ELDG+G+R D Sbjct: 480 VVPLELLPPDVRSKYQAKPNDRSKRSKKEETKGSAHQVEENQIATTATELDGDGIRTD-- 537 Query: 414 GLAAQMDTDGEVDP------PTPEEHEKESPDTDMGQEAGQLEADTESDTGMIDGET 262 A M+ DG P PTPEE K S DTD+GQEAGQLEA+ E + G+IDGET Sbjct: 538 TTATPMEFDGASVPGTQGGTPTPEELHKHSSDTDVGQEAGQLEAEAEVEAGIIDGET 594 >ref|XP_003522894.1| PREDICTED: THO complex subunit 1 isoform X1 [Glycine max] gi|571450424|ref|XP_006578423.1| PREDICTED: THO complex subunit 1 isoform X2 [Glycine max] Length = 605 Score = 932 bits (2409), Expect = 0.0 Identities = 457/597 (76%), Positives = 508/597 (85%), Gaps = 6/597 (1%) Frame = -3 Query: 2034 MEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQS 1855 ME+F+ A++Q GPP+SFAL TVQE IKPQKQTKL QDENQ LENILR LLQE VS Q Sbjct: 1 MEVFKRAIIQPGPPESFALRTVQEVIKPQKQTKLAQDENQFLENILRMLLQEFVSAAVQF 60 Query: 1854 GESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCQD 1675 GE IMQ+GQS+D ETTQG IPRLLDIVLYLCEKEH+EGGMIFQLLEDLTEMSTM+NC+D Sbjct: 61 GEKIMQFGQSIDSSETTQGHIPRLLDIVLYLCEKEHIEGGMIFQLLEDLTEMSTMKNCKD 120 Query: 1674 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPLSE 1495 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIL FLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1494 RSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQKFT 1315 RSA+NIKGVFNTSNETKYEKE +GI +DFNFY+TFW LQE+FSNP SI+ AP KWQKFT Sbjct: 181 RSALNIKGVFNTSNETKYEKEPLEGICIDFNFYQTFWGLQEYFSNPTSISHAPAKWQKFT 240 Query: 1314 SSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILVQC 1135 SL VVL+TFEAQPLSDEEG++NNLEEEA FSIKYLTSSKLMGLELKD SFRRH+LVQC Sbjct: 241 LSLSVVLNTFEAQPLSDEEGDANNLEEEAVNFSIKYLTSSKLMGLELKDPSFRRHVLVQC 300 Query: 1134 LILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILEREKN 955 LILFDYLKAPGK DKD PSE+MKEEI S E+RVKK+LE+TPPKGT+FL ++EHILEREKN Sbjct: 301 LILFDYLKAPGKGDKDLPSENMKEEITSWEERVKKLLELTPPKGTEFLHKIEHILEREKN 360 Query: 954 WVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQRV 775 WVWWKRDGC +EKQ EKK V G KKRRPRWR+GNKEL QLW+WADQNP LTD QRV Sbjct: 361 WVWWKRDGCLPYEKQRIEKKAVPDGPKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRV 420 Query: 774 RAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDIGR 595 + P+IM+YWKPLA+DMD S+GIE +YHHKNNRVYCWKGLR SARQDLEGFS+FTD I Sbjct: 421 QTPSIMEYWKPLAEDMDPSAGIEADYHHKNNRVYCWKGLRLSARQDLEGFSKFTDHGIEG 480 Query: 594 VVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGDHE 415 VVP ELL P VR+KYQ+KP D++KR+KKE+ KG +HQ E+NQIAT A E+DG+G+R D Sbjct: 481 VVPLELLPPDVRSKYQAKPNDRSKRSKKEETKGTAHQIEENQIATNATEIDGDGIRTD-- 538 Query: 414 GLAAQMDTDGEVDP------PTPEEHEKESPDTDMGQEAGQLEADTESDTGMIDGET 262 A M+ D P TPEE +K S DTD GQEAGQLEAD E + GMIDGET Sbjct: 539 TTATSMEFDAATAPGTQGGTTTPEELQKLSSDTDGGQEAGQLEADAEVEAGMIDGET 595 >ref|XP_006347676.1| PREDICTED: THO complex subunit 1-like [Solanum tuberosum] Length = 609 Score = 928 bits (2399), Expect = 0.0 Identities = 446/599 (74%), Positives = 516/599 (86%), Gaps = 8/599 (1%) Frame = -3 Query: 2034 MEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQS 1855 M++FR A+L+ GPP+ FAL TVQEAIKPQKQTKL+QDENQLLENILR+LLQELV+ QS Sbjct: 1 MDLFRQAILRQGPPEEFALLTVQEAIKPQKQTKLVQDENQLLENILRSLLQELVAAAVQS 60 Query: 1854 GESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCQD 1675 G+ +M+YG S+ DGE++QGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNC+D Sbjct: 61 GQKVMKYGVSIVDGESSQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCED 120 Query: 1674 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPLSE 1495 +FGYIESKQDILGK ELFARGKLVMLRTCNQLLRRLSKANDVVFCGRI+ FLAHFFPLSE Sbjct: 121 VFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSE 180 Query: 1494 RSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQKFT 1315 RSAVNIKGVFNTSNETKYE E P+GIS+DFNFY+T WSLQE+F NP S+ AP KW KFT Sbjct: 181 RSAVNIKGVFNTSNETKYETEVPEGISIDFNFYRTLWSLQEYFCNPPSLINAPGKWHKFT 240 Query: 1314 SSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILVQC 1135 SSL +VL+TFEAQPLSDEEGN +NLE++AATF+IKYLTSSKLMGLELKD SFRRH+LVQC Sbjct: 241 SSLTLVLNTFEAQPLSDEEGNVHNLEDDAATFNIKYLTSSKLMGLELKDPSFRRHVLVQC 300 Query: 1134 LILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILEREKN 955 LILFDYLK PGK++K+ PSE+MKEEIK+ E++ KK+LEMTPPKG DFL +EHILERE+N Sbjct: 301 LILFDYLKEPGKSEKELPSEAMKEEIKTSEEQAKKLLEMTPPKGIDFLHSIEHILERERN 360 Query: 954 WVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQRV 775 WVWWKRDGCP FEKQP EKK+V G KKRRPRW +GN+EL QLW+WADQ LTDAQRV Sbjct: 361 WVWWKRDGCPPFEKQPVEKKLVQDGTKKRRPRWSLGNRELSQLWKWADQYSSALTDAQRV 420 Query: 774 RAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDIGR 595 PAI YWKPLA+DMDES+GIE EYHHKNNRVYCWKGLR+SARQDLEGFSRFT+ I Sbjct: 421 STPAITKYWKPLAEDMDESAGIEAEYHHKNNRVYCWKGLRFSARQDLEGFSRFTEHGIEG 480 Query: 594 VVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGDHE 415 VVP ELL+ +VRA+YQ+KP ++ KR KKED K ++ Q+++NQIATP +E+D E + D E Sbjct: 481 VVPLELLSNEVRARYQAKPSERTKRTKKEDTKNSAQQADENQIATPPSEMDNEVGQADPE 540 Query: 414 GLAAQMDTDGEV--------DPPTPEEHEKESPDTDMGQEAGQLEADTESDTGMIDGET 262 AA MDTD + + PTPE+++K+S DTD+ QEAGQ EADTE++T MIDGET Sbjct: 541 ASAAPMDTDAGIATVNISQEETPTPEDNQKQSSDTDVAQEAGQTEADTEAETAMIDGET 599 >ref|XP_004140313.1| PREDICTED: THO complex subunit 1-like [Cucumis sativus] Length = 607 Score = 920 bits (2377), Expect = 0.0 Identities = 453/599 (75%), Positives = 513/599 (85%), Gaps = 8/599 (1%) Frame = -3 Query: 2034 MEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQS 1855 +E FR A+LQ GPP++FAL+ VQ+ I+PQK TKL QDENQLLENILR LLQELVS+ QS Sbjct: 7 LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQS 66 Query: 1854 GESIMQYGQSVDDGETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCQD 1675 E +MQYG S+D+ ET+Q DIVLYLCEKEHVEGGMIFQLLEDLTEMST+RNC+D Sbjct: 67 TEPVMQYGMSIDEKETSQ-------DIVLYLCEKEHVEGGMIFQLLEDLTEMSTLRNCKD 119 Query: 1674 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPLSE 1495 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRIL FLAHFFPLSE Sbjct: 120 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSE 179 Query: 1494 RSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQKFT 1315 RSAVNIKGVFNTSNETKYEK+ PDG S+DFNFYKTFWSLQE F NPAS+ A TKWQKFT Sbjct: 180 RSAVNIKGVFNTSNETKYEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFT 239 Query: 1314 SSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILVQC 1135 SSL+VVL+TF+AQPLSDEEG++N LEEE+ATFSIKYLTSSKLMGLELKD SFRRH+L+QC Sbjct: 240 SSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC 299 Query: 1134 LILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILEREKN 955 LILFDYLKAPGK +KD PSE+M+EEIKSCE+RVKK+LE+TPP+G DFL+++EHIL+RE N Sbjct: 300 LILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENN 359 Query: 954 WVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQRV 775 WVWWKRDGC FEKQP EKK + KKRRPRWR+GNKEL QLW+W+DQNP LTD QRV Sbjct: 360 WVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGNKELSQLWKWSDQNPNALTDPQRV 419 Query: 774 RAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDIGR 595 R+PAI DYWKPLA+DMDES+GIE EYHH+NNRVYCWKGLR+SARQDLEGFSRFTD I Sbjct: 420 RSPAISDYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEG 479 Query: 594 VVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGDHE 415 VVP ELL P VRAKYQ+KP +++KRAKKE+ KGA Q ++NQ+ATPA+E DGEG R D + Sbjct: 480 VVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPD 539 Query: 414 GLAAQMDTD-----GEVDP---PTPEEHEKESPDTDMGQEAGQLEADTESDTGMIDGET 262 G +A MD D G V TPEE+ K S DTD+GQEAGQLEAD E + GMIDGET Sbjct: 540 GPSAGMDVDTAIATGNVSQGGISTPEEN-KLSSDTDIGQEAGQLEADAEVEPGMIDGET 597 >ref|NP_568219.1| THO complex subunit 1 [Arabidopsis thaliana] gi|75163171|sp|Q93VM9.1|THOC1_ARATH RecName: Full=THO complex subunit 1; Short=AtTHO1; AltName: Full=HPR1 homolog; Short=AtHPR1 gi|15983384|gb|AAL11560.1|AF424566_1 AT5g09860/MYH9_7 [Arabidopsis thaliana] gi|16226756|gb|AAL16253.1|AF428323_1 AT5g09860/MYH9_7 [Arabidopsis thaliana] gi|332004073|gb|AED91456.1| THO complex subunit 1 [Arabidopsis thaliana] Length = 599 Score = 915 bits (2364), Expect = 0.0 Identities = 446/592 (75%), Positives = 507/592 (85%), Gaps = 4/592 (0%) Frame = -3 Query: 2034 MEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQS 1855 M+ FR A+LQ P ++FAL+TVQ I+PQKQTKL QDENQ+LEN+LRTLLQELV+ AQS Sbjct: 1 MDAFRDAILQRAPIETFALKTVQHFIQPQKQTKLAQDENQMLENMLRTLLQELVAAAAQS 60 Query: 1854 GESIMQYGQSVDDGETTQ---GQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRN 1684 GE IMQYGQ +DD + GQIP LLD+VLYLCEKEHVEGGMIFQLLEDLTEMSTM+N Sbjct: 61 GEQIMQYGQLIDDDDDDDDIHGQIPHLLDVVLYLCEKEHVEGGMIFQLLEDLTEMSTMKN 120 Query: 1683 CQDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFP 1504 C+D+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIL FLAHFFP Sbjct: 121 CKDVFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFP 180 Query: 1503 LSERSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQ 1324 LSERSAVNIKGVFNTSNETKYEK+ P GISVDFNFYKTFWSLQE+F NPAS+T+A TKWQ Sbjct: 181 LSERSAVNIKGVFNTSNETKYEKDPPKGISVDFNFYKTFWSLQEYFCNPASLTSASTKWQ 240 Query: 1323 KFTSSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHIL 1144 KF+SSL VVL+TF+AQPLS+EEG +N+LEEEAATF+IKYLTSSKLMGLELKDSSFRRHIL Sbjct: 241 KFSSSLAVVLNTFDAQPLSEEEGEANSLEEEAATFNIKYLTSSKLMGLELKDSSFRRHIL 300 Query: 1143 VQCLILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILER 964 +QCLI+FDYL+APGK DKD PSE+MKEE+KSCEDRVKK+LE+TPPKG +FLR VEHILER Sbjct: 301 LQCLIMFDYLRAPGKNDKDLPSETMKEELKSCEDRVKKLLEITPPKGKEFLRAVEHILER 360 Query: 963 EKNWVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDA 784 EKNWVWWKRDGCP FEKQP +KK G KKRR RWR+GNKEL QLWRWADQNP LTD+ Sbjct: 361 EKNWVWWKRDGCPPFEKQPIDKKSPNAGQKKRRQRWRLGNKELSQLWRWADQNPNALTDS 420 Query: 783 QRVRAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKD 604 QRVR P I DYWKPLA+DMD S+GIE+EYHHKNNRVYCWKGLR++ARQDLEGFSRFT+ Sbjct: 421 QRVRTPDIADYWKPLAEDMDPSAGIEDEYHHKNNRVYCWKGLRFTARQDLEGFSRFTEMG 480 Query: 603 IGRVVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRG 424 I VVP ELL P+VR+KYQ+KP +KAKRAKKE+ KG SH++E NQI +E + EG RG Sbjct: 481 IEGVVPVELLPPEVRSKYQAKPNEKAKRAKKEETKGGSHETEGNQIGVSNSEAEAEGGRG 540 Query: 423 DHEGLAAQMDTDGEVDPPTPEEHEK-ESPDTDMGQEAGQLEADTESDTGMID 271 D A M++D D PTPEE ++ DT+ GQEAGQ+E + G++D Sbjct: 541 D----AETMESDAIADTPTPEEQQRLGGSDTENGQEAGQIEDGETEEAGLMD 588 >ref|XP_006287318.1| hypothetical protein CARUB_v10000519mg [Capsella rubella] gi|482556024|gb|EOA20216.1| hypothetical protein CARUB_v10000519mg [Capsella rubella] Length = 598 Score = 914 bits (2362), Expect = 0.0 Identities = 446/593 (75%), Positives = 508/593 (85%), Gaps = 3/593 (0%) Frame = -3 Query: 2034 MEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQS 1855 M+ FR A+LQ P +SFAL+TVQ+ I+PQKQTKL QDENQ+LEN+LRTLLQELV++ AQS Sbjct: 1 MDAFRDAILQRAPIESFALKTVQQFIQPQKQTKLAQDENQMLENMLRTLLQELVASAAQS 60 Query: 1854 GESIMQYGQSVDDG--ETTQGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNC 1681 GE IMQYGQ +DD E GQIP LLD+VLYLCE EHVEGGMIFQLLEDLTEMSTM+NC Sbjct: 61 GEQIMQYGQLIDDDVDENIHGQIPHLLDVVLYLCENEHVEGGMIFQLLEDLTEMSTMKNC 120 Query: 1680 QDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPL 1501 +D+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIL FLAHFFPL Sbjct: 121 KDVFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPL 180 Query: 1500 SERSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQK 1321 SERSAVNIKGVFNTSNETKYEK+ P+GISVDFNFYKTFWSLQE+F NPAS+T+A TKWQK Sbjct: 181 SERSAVNIKGVFNTSNETKYEKDPPNGISVDFNFYKTFWSLQEYFCNPASLTSASTKWQK 240 Query: 1320 FTSSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILV 1141 F+SSL VVL+TF+AQPLS+EEG +N+LEEEAATF+IKYLTSSKLMGLELKDSSFRRHIL+ Sbjct: 241 FSSSLAVVLNTFDAQPLSEEEGEANSLEEEAATFNIKYLTSSKLMGLELKDSSFRRHILL 300 Query: 1140 QCLILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILERE 961 QCLILFDYL+APGK DKD PSE+MKEE+KSCEDRVKK+LE+TPPKG +FLR VEHILERE Sbjct: 301 QCLILFDYLRAPGKNDKDLPSETMKEELKSCEDRVKKLLEITPPKGKEFLRAVEHILERE 360 Query: 960 KNWVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQ 781 KNWVWWKRDGCP FEKQ +KK G KKRR RWR+GNKEL QLWRWADQNP LTD+Q Sbjct: 361 KNWVWWKRDGCPPFEKQLIDKKSANAGQKKRRQRWRLGNKELSQLWRWADQNPNALTDSQ 420 Query: 780 RVRAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDI 601 RV PAI DYWKPLA+DMD S+GIE+EYHHKNNRVYCWKGLR++ARQDLEGFSRFTD + Sbjct: 421 RVLTPAIADYWKPLAEDMDPSAGIEDEYHHKNNRVYCWKGLRFAARQDLEGFSRFTDFGV 480 Query: 600 GRVVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGD 421 VVP ELL P+VR+KYQ+KP +KAKRAKKE+ KG S ++E NQI +E + EG RG+ Sbjct: 481 EGVVPVELLPPEVRSKYQAKPNEKAKRAKKEETKGGSQETEGNQIGVSNSEAEAEGGRGE 540 Query: 420 HEGLAAQMDTDGEVDPPTPEEHEK-ESPDTDMGQEAGQLEADTESDTGMIDGE 265 E M++D D PTPEE ++ DTD GQEAGQ+E + G++D + Sbjct: 541 AEA----MESDAVADTPTPEEQQRLGGSDTDNGQEAGQIEDTETEEAGLMDAD 589 >ref|XP_002871388.1| hypothetical protein ARALYDRAFT_908935 [Arabidopsis lyrata subsp. lyrata] gi|297317225|gb|EFH47647.1| hypothetical protein ARALYDRAFT_908935 [Arabidopsis lyrata subsp. lyrata] Length = 597 Score = 912 bits (2357), Expect = 0.0 Identities = 445/590 (75%), Positives = 506/590 (85%), Gaps = 2/590 (0%) Frame = -3 Query: 2034 MEIFRSALLQSGPPDSFALETVQEAIKPQKQTKLLQDENQLLENILRTLLQELVSTVAQS 1855 M+ FR A+LQ P ++FAL+TVQE I+PQKQTKL QDENQ+LEN+LRTLLQELV+ AQS Sbjct: 1 MDAFRDAILQRAPIETFALKTVQEFIQPQKQTKLAQDENQMLENMLRTLLQELVAAAAQS 60 Query: 1854 GESIMQYGQSVDDGETT-QGQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCQ 1678 GE IMQYGQ +DD + GQIP LLD+VLYLCEKEHVEGGMIFQLLEDLTEMSTM+NC+ Sbjct: 61 GEQIMQYGQLIDDDDDNIHGQIPHLLDVVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCK 120 Query: 1677 DIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILTFLAHFFPLS 1498 D+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRIL FLAHFFPLS Sbjct: 121 DVFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLS 180 Query: 1497 ERSAVNIKGVFNTSNETKYEKEAPDGISVDFNFYKTFWSLQEHFSNPASITAAPTKWQKF 1318 ERSAVNIKGVFNTSNETKYEK+ P GISVDFNFYKTFWSLQE+F NPAS+ +A TKWQKF Sbjct: 181 ERSAVNIKGVFNTSNETKYEKDPPKGISVDFNFYKTFWSLQEYFCNPASLISASTKWQKF 240 Query: 1317 TSSLLVVLSTFEAQPLSDEEGNSNNLEEEAATFSIKYLTSSKLMGLELKDSSFRRHILVQ 1138 +SSL VVL+TF+AQPLS+EEG +N+LEEEAATF+IKYLTSSKLMGLELKDSSFRRHIL+Q Sbjct: 241 SSSLAVVLNTFDAQPLSEEEGEANSLEEEAATFNIKYLTSSKLMGLELKDSSFRRHILLQ 300 Query: 1137 CLILFDYLKAPGKTDKDCPSESMKEEIKSCEDRVKKMLEMTPPKGTDFLRQVEHILEREK 958 CLI+FDYL+APGK DKD PSE+MKEE+KSCEDRVKK+LE+TPPKG +FLR VEHILEREK Sbjct: 301 CLIMFDYLRAPGKNDKDLPSETMKEELKSCEDRVKKLLEITPPKGKEFLRAVEHILEREK 360 Query: 957 NWVWWKRDGCPAFEKQPTEKKVVTGGAKKRRPRWRMGNKELHQLWRWADQNPKVLTDAQR 778 NWVWWKRDGCP FEKQP +KK G KKRR RWR+GNKEL QLWRWADQNP LTD+QR Sbjct: 361 NWVWWKRDGCPPFEKQPIDKKSPNAGQKKRRQRWRLGNKELSQLWRWADQNPNALTDSQR 420 Query: 777 VRAPAIMDYWKPLADDMDESSGIEEEYHHKNNRVYCWKGLRYSARQDLEGFSRFTDKDIG 598 VR P I DYWKPLA+DMD S+GIE+EYHHKNNRVYCWKGLR++ARQDLEGFSRFT+ I Sbjct: 421 VRTPDIADYWKPLAEDMDPSAGIEDEYHHKNNRVYCWKGLRFTARQDLEGFSRFTEMGIE 480 Query: 597 RVVPPELLTPKVRAKYQSKPGDKAKRAKKEDIKGASHQSEDNQIATPANELDGEGVRGDH 418 VVP ELL P+VR+KYQ+KP +KAKRAKKE+ KG S ++E NQI +E + EG RGD Sbjct: 481 GVVPVELLPPEVRSKYQAKPNEKAKRAKKEEAKGGSQETEGNQIGVSNSEAEAEGGRGD- 539 Query: 417 EGLAAQMDTDGEVDPPTPEEHEK-ESPDTDMGQEAGQLEADTESDTGMID 271 A M++D D PTPEE ++ DT+ GQEAGQ+E + G++D Sbjct: 540 ---AETMESDAIADTPTPEEQQRLGGSDTENGQEAGQIEDAETEEAGLMD 586