BLASTX nr result
ID: Sinomenium21_contig00009088
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00009088 (2641 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value
ref|XP_006847924.1| hypothetical protein AMTR_s00029p00122290 [A...    926   0.0
ref|XP_002264619.2| PREDICTED: THO complex subunit 1-like [Vitis...    885   0.0
emb|CBI35079.3| unnamed protein product [Vitis vinifera]               885   0.0
emb|CBI35093.3| unnamed protein product [Vitis vinifera]               882   0.0
ref|XP_002263874.1| PREDICTED: THO complex subunit 1-like [Vitis...    882   0.0
ref|XP_006432406.1| hypothetical protein CICLE_v10000631mg [Citr...    867   0.0
ref|XP_006465777.1| PREDICTED: THO complex subunit 1-like isofor...    866   0.0
ref|XP_007204592.1| hypothetical protein PRUPE_ppa003099mg [Prun...    861   0.0
ref|XP_004307195.1| PREDICTED: THO complex subunit 1-like [Fraga...    861   0.0
ref|XP_006465778.1| PREDICTED: THO complex subunit 1-like isofor...    859   0.0
ref|XP_002529986.1| nuclear matrix protein, putative [Ricinus co...    857   0.0
ref|XP_007010828.1| Nuclear matrix protein-related isoform 1 [Th...    855   0.0
ref|XP_002468664.1| hypothetical protein SORBIDRAFT_01g049910 [S...    851   0.0
ref|XP_002299188.1| hypothetical protein POPTR_0001s06900g [Popu...    846   0.0
ref|NP_001159168.1| hypothetical protein [Zea mays] gi|223942431...    846   0.0
ref|XP_006649251.1| PREDICTED: THO complex subunit 1-like [Oryza...    845   0.0
ref|XP_003558999.1| PREDICTED: THO complex subunit 1-like [Brach...    844   0.0
ref|NP_001048715.1| Os03g0110400 [Oryza sativa Japonica Group] g...    839   0.0
ref|XP_004986033.1| PREDICTED: THO complex subunit 1-like [Setar...
838 0.0 ref|XP_007148665.1| hypothetical protein PHAVU_005G004500g [Phas... 835 0.0 >ref|XP_006847924.1| hypothetical protein AMTR_s00029p00122290 [Amborella trichopoda] gi|548851229|gb|ERN09505.1| hypothetical protein AMTR_s00029p00122290 [Amborella trichopoda] Length = 667 Score = 926 bits (2394), Expect = 0.0 Identities = 474/670 (70%), Positives = 526/670 (78%), Gaps = 17/670 (2%) Frame = -3 Query: 2540 MAEATPGLRILLHQHQ--KERAPVHISSHADRDRVLEIFKGALSQPGPPSDFALQTMQDA 2367 MAEATP LRILLHQ Q KER+P+ +SSHADR+RVLE+F+ ALSQ GPP++FALQT+Q+A Sbjct: 1 MAEATPQLRILLHQQQPQKERSPITVSSHADRNRVLEVFRRALSQVGPPANFALQTVQEA 60 Query: 2366 IKPQKQTVLVQDENQSLENALRTLLQELASSAVQSGERIMRFGQSIDDSESALGHIPRLL 2187 IKPQKQTVLVQDENQSLENALR LLQELASSAVQ GER M++GQSID + S G IPRLL Sbjct: 61 IKPQKQTVLVQDENQSLENALRALLQELASSAVQLGERTMQYGQSIDGAGSMPGLIPRLL 120 Query: 2186 DIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKEVFGYIESKQDILGKQELFGRGKLVM 2007 DIVLYLCE+ HVEGGMIFQLLEDLTEMST+RDCKEVFGYIESKQDILGKQELFGRGKLVM Sbjct: 121 DIVLYLCEQSHVEGGMIFQLLEDLTEMSTIRDCKEVFGYIESKQDILGKQELFGRGKLVM 180 Query: 2006 LRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKDAPDG 1827 LRTCNQLLRRLSKANDVVFCGRILMFLAH FPLSERSA+N+KGVFNTSN+TKYE++ P+G Sbjct: 181 LRTCNQLLRRLSKANDVVFCGRILMFLAHVFPLSERSALNVKGVFNTSNQTKYEQEPPEG 240 Query: 1826 ISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFSSNLMVVLDAFEAQPLSDDDGNANNL 1647 ISVDFNFYKTFWSLQEHFCNP S T+A AKW F+S+LMVV+D FEAQPL +DDG+AN L Sbjct: 241 ISVDFNFYKTFWSLQEHFCNPTSMTLASAKWQNFTSSLMVVMDTFEAQPLHEDDGSANIL 300 Query: 1646 DEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQCLILFDYLKAPGKNDKDAPSETIKE 1467 DEEEA FSIKYLTSSKLMGLELKDP FRRHIL QCLILFDYLKAPGKNDK+ P E ++E Sbjct: 301 DEEEAVAFSIKYLTSSKLMGLELKDPNFRRHILVQCLILFDYLKAPGKNDKEGPKEIMRE 360 Query: 1466 EIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREKNWVWWKRDGCPPFEKHPTEKKAGQD 1287 EIKS EERVKKLLEMIP KGKEFL +EHILEREKNWVWWKRDGCPPFEK TE+K QD Sbjct: 361 EIKSYEERVKKLLEMIPSKGKEFLERVEHILEREKNWVWWKRDGCPPFEKQATERKTNQD 420 Query: 1286 GVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQRVRTPSVMEYWKPLAEDMDFSAGIEE 1107 G 
+KR+PRWRLGNKELSQLWKWADQNPNALTD QRVRTPS+ EYWK LAEDMD SAGIE Sbjct: 421 GAKKRKPRWRLGNKELSQLWKWADQNPNALTDAQRVRTPSITEYWKALAEDMDTSAGIEA 480 Query: 1106 EYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIEGVVPPDLLPPEVRSKHHNKPXXXXX 927 EYHHKNN+VYCWKGLRFSARQDLEGFSRF++ G+EGVVPP+LLPP++RSK+H K Sbjct: 481 EYHHKNNRVYCWKGLRFSARQDLEGFSRFTDHGVEGVVPPELLPPDIRSKYHAKAGDKSK 540 Query: 926 XXXXXXXXXEPTQQVEDNQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDISQSGAP 747 VEDNQ + SG P Sbjct: 541 RAKKEEEVKGNAPLVEDNQ--NAGATTELEGSGSGAELEDSAAPMDTDVGAVGATNSGGP 598 Query: 746 TPE--QKQSPSGELGQEVGQSXXXXXXXXXXXAGIIDG--ETDLEAE-----------PD 612 +P+ QKQSP E+GQEV Q I+D E +L+AE P Sbjct: 599 SPDEAQKQSPDDEVGQEVVQP-------------ILDSEPEPELDAEGKPEQMLEPELPK 645 Query: 611 ATSLDLQGGV 582 ++DLQ GV Sbjct: 646 PATIDLQDGV 655 >ref|XP_002264619.2| PREDICTED: THO complex subunit 1-like [Vitis vinifera] Length = 601 Score = 885 bits (2286), Expect = 0.0 Identities = 449/615 (73%), Positives = 490/615 (79%), Gaps = 4/615 (0%) Frame = -3 Query: 2441 LEIFKGALSQPGPPSDFALQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQS 2262 +EIFK AL +PGPP FALQ +Q+AIKPQKQT L QDENQ LEN LR LLQEL S AVQS Sbjct: 1 MEIFKQALLKPGPPESFALQVVQEAIKPQKQTKLAQDENQLLENILRKLLQELVSCAVQS 60 Query: 2261 GERIMRFGQSIDDSESALGHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKE 2082 GE+IM++GQSIDD E+ IPRLLDIVLYLCE+ HVEGGMIFQLLEDLTEMSTMR+CK+ Sbjct: 61 GEKIMQYGQSIDDEEAIQSQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 120 Query: 2081 VFGYIESKQDILGKQELFGRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 1902 +F YIESKQDILGKQELF RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE Sbjct: 121 IFAYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1901 RSAVNIKGVFNTSNETKYEKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFS 1722 RSAVNIKGVFNTSNETKYEKDAP+GIS+DFNFYKTFWSLQEHFCNPAS ++AP KW KF+ Sbjct: 181 RSAVNIKGVFNTSNETKYEKDAPEGISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKFT 240 Query: 1721 SNLMVVLDAFEAQPLSDDDGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQ 1542 SNLMVVL+ FEAQPLSD++GNANNL EEEA TFSIKYLTSSKLMGLELKDP FRRHIL Q Sbjct: 241 
SNLMVVLNTFEAQPLSDEEGNANNL-EEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQ 299 Query: 1541 CLILFDYLKAPGKNDKDAPSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREK 1362 CLILFDYLKAPGKNDKD PS+++KEEIKSCEERVKKLLEM PPKGKEFL++IEHILEREK Sbjct: 300 CLILFDYLKAPGKNDKDLPSDSMKEEIKSCEERVKKLLEMTPPKGKEFLHNIEHILEREK 359 Query: 1361 NWVWWKRDGCPPFEKHPTEKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQR 1182 NWVWWKRDGCPPFE+ P EKKA QDG +KRRPRWR+GNKELSQLWKWADQNPNALTD QR Sbjct: 360 NWVWWKRDGCPPFERQPIEKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALTDPQR 419 Query: 1181 VRTPSVMEYWKPLAEDMDFSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIE 1002 RTP+V EYWKPLAEDMD SAGIE EYHHKNN+VYCWKGLRF+ARQDL+GFSRF+E GIE Sbjct: 420 ARTPAVSEYWKPLAEDMDLSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTEYGIE 479 Query: 1001 GVVPPDLLPPEVRSKHHNKPXXXXXXXXXXXXXXEPTQQVEDNQXXXXXXXXXXXXXXXX 822 GVVP +LLP +VRSK+ KP QQ E+NQ Sbjct: 480 GVVPMELLPSDVRSKYQAKP-SDRSKRAKKEETKGAAQQAEENQIATPASEIDGEGTRVD 538 Query: 821 XXXXXXXXXXXXXXXXXDISQSGAPTP----EQKQSPSGELGQEVGQSXXXXXXXXXXXA 654 + A TP QKQS + GQE GQS A Sbjct: 539 LEASAAPMD----------TDVTATTPTADENQKQSSDTDAGQEAGQS----EADAEAEA 584 Query: 653 GIIDGETDLEAEPDA 609 G+IDGETD E + DA Sbjct: 585 GMIDGETDAEVDLDA 599 >emb|CBI35079.3| unnamed protein product [Vitis vinifera] Length = 613 Score = 885 bits (2286), Expect = 0.0 Identities = 449/616 (72%), Positives = 491/616 (79%), Gaps = 4/616 (0%) Frame = -3 Query: 2444 VLEIFKGALSQPGPPSDFALQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQ 2265 ++EIFK AL +PGPP FALQ +Q+AIKPQKQT L QDENQ LEN LR LLQEL S AVQ Sbjct: 12 LVEIFKQALLKPGPPESFALQVVQEAIKPQKQTKLAQDENQLLENILRKLLQELVSCAVQ 71 Query: 2264 SGERIMRFGQSIDDSESALGHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCK 2085 SGE+IM++GQSIDD E+ IPRLLDIVLYLCE+ HVEGGMIFQLLEDLTEMSTMR+CK Sbjct: 72 SGEKIMQYGQSIDDEEAIQSQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCK 131 Query: 2084 EVFGYIESKQDILGKQELFGRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLS 1905 ++F YIESKQDILGKQELF RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLS Sbjct: 132 DIFAYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLS 191 Query: 1904 
ERSAVNIKGVFNTSNETKYEKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKF 1725 ERSAVNIKGVFNTSNETKYEKDAP+GIS+DFNFYKTFWSLQEHFCNPAS ++AP KW KF Sbjct: 192 ERSAVNIKGVFNTSNETKYEKDAPEGISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKF 251 Query: 1724 SSNLMVVLDAFEAQPLSDDDGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILF 1545 +SNLMVVL+ FEAQPLSD++GNANNL EEEA TFSIKYLTSSKLMGLELKDP FRRHIL Sbjct: 252 TSNLMVVLNTFEAQPLSDEEGNANNL-EEEAATFSIKYLTSSKLMGLELKDPSFRRHILV 310 Query: 1544 QCLILFDYLKAPGKNDKDAPSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILERE 1365 QCLILFDYLKAPGKNDKD PS+++KEEIKSCEERVKKLLEM PPKGKEFL++IEHILERE Sbjct: 311 QCLILFDYLKAPGKNDKDLPSDSMKEEIKSCEERVKKLLEMTPPKGKEFLHNIEHILERE 370 Query: 1364 KNWVWWKRDGCPPFEKHPTEKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQ 1185 KNWVWWKRDGCPPFE+ P EKKA QDG +KRRPRWR+GNKELSQLWKWADQNPNALTD Q Sbjct: 371 KNWVWWKRDGCPPFERQPIEKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALTDPQ 430 Query: 1184 RVRTPSVMEYWKPLAEDMDFSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGI 1005 R RTP+V EYWKPLAEDMD SAGIE EYHHKNN+VYCWKGLRF+ARQDL+GFSRF+E GI Sbjct: 431 RARTPAVSEYWKPLAEDMDLSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTEYGI 490 Query: 1004 EGVVPPDLLPPEVRSKHHNKPXXXXXXXXXXXXXXEPTQQVEDNQXXXXXXXXXXXXXXX 825 EGVVP +LLP +VRSK+ KP QQ E+NQ Sbjct: 491 EGVVPMELLPSDVRSKYQAKP-SDRSKRAKKEETKGAAQQAEENQIATPASEIDGEGTRV 549 Query: 824 XXXXXXXXXXXXXXXXXXDISQSGAPTP----EQKQSPSGELGQEVGQSXXXXXXXXXXX 657 + A TP QKQS + GQE GQS Sbjct: 550 DLEASAAPMD----------TDVTATTPTADENQKQSSDTDAGQEAGQS----EADAEAE 595 Query: 656 AGIIDGETDLEAEPDA 609 AG+IDGETD E + DA Sbjct: 596 AGMIDGETDAEVDLDA 611 >emb|CBI35093.3| unnamed protein product [Vitis vinifera] Length = 613 Score = 882 bits (2280), Expect = 0.0 Identities = 449/615 (73%), Positives = 489/615 (79%), Gaps = 4/615 (0%) Frame = -3 Query: 2441 LEIFKGALSQPGPPSDFALQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQS 2262 +EIFK AL +PGPP FALQ +Q+AIKPQKQT L QDENQ LEN LR LLQEL S AVQS Sbjct: 13 VEIFKQALLKPGPPESFALQVVQEAIKPQKQTKLAQDENQLLENILRKLLQELVSCAVQS 72 Query: 2261 GERIMRFGQSIDDSESALGHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKE 2082 GE+IM +GQSIDD E+ 
IPRLLDIVLYLCE+ HVEGGMIFQLLEDLTEMSTMR+CK+ Sbjct: 73 GEKIMHYGQSIDDEEAIQSQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKD 132 Query: 2081 VFGYIESKQDILGKQELFGRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 1902 +F YIESKQDILGKQELF RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE Sbjct: 133 IFAYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 192 Query: 1901 RSAVNIKGVFNTSNETKYEKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFS 1722 RSAVNIKGVFNTSNETKYEKDAP+GIS+DFNFYKTFWSLQEHFCNPAS ++AP KW KF+ Sbjct: 193 RSAVNIKGVFNTSNETKYEKDAPEGISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKFT 252 Query: 1721 SNLMVVLDAFEAQPLSDDDGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQ 1542 SNLMVVL+ FEAQPLSD++GNANNL EEEA TFSIKYLTSSKLMGLELKDP FRRHIL Q Sbjct: 253 SNLMVVLNTFEAQPLSDEEGNANNL-EEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQ 311 Query: 1541 CLILFDYLKAPGKNDKDAPSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREK 1362 CLILFDYLKAPGKNDKD PS+++KEEIKSCEERVKKLLE PPKGKEFL++IEHILEREK Sbjct: 312 CLILFDYLKAPGKNDKDLPSDSMKEEIKSCEERVKKLLETTPPKGKEFLHNIEHILEREK 371 Query: 1361 NWVWWKRDGCPPFEKHPTEKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQR 1182 NWVWWKRDGCPPFE+ P EKKA QDG +KRRPRWR+GNKELSQLWKWADQNPNALTD QR Sbjct: 372 NWVWWKRDGCPPFERQPIEKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALTDPQR 431 Query: 1181 VRTPSVMEYWKPLAEDMDFSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIE 1002 VRTP+V EYWKPLAEDMD SAGIE EYHHKNN+VYCWKGLRF+ARQDL+GFSRF+E GIE Sbjct: 432 VRTPAVSEYWKPLAEDMDSSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTEYGIE 491 Query: 1001 GVVPPDLLPPEVRSKHHNKPXXXXXXXXXXXXXXEPTQQVEDNQXXXXXXXXXXXXXXXX 822 GVVP +LLP +VRSK+ KP QQ E+NQ Sbjct: 492 GVVPMELLPSDVRSKYQAKP-SDRSKRAKKEETKGAAQQAEENQIATPASEIDGEGTRVD 550 Query: 821 XXXXXXXXXXXXXXXXXDISQSGAPTP----EQKQSPSGELGQEVGQSXXXXXXXXXXXA 654 + A TP QKQS + GQE GQS A Sbjct: 551 LEASAAPMD----------TDVTATTPTADENQKQSSDTDAGQEAGQS----EADAEAEA 596 Query: 653 GIIDGETDLEAEPDA 609 G+IDGETD E + DA Sbjct: 597 GMIDGETDAEVDLDA 611 >ref|XP_002263874.1| PREDICTED: THO complex subunit 1-like [Vitis vinifera] Length = 607 Score = 882 bits (2279), Expect = 0.0 Identities = 449/614 (73%), 
Positives = 488/614 (79%), Gaps = 4/614 (0%) Frame = -3 Query: 2438 EIFKGALSQPGPPSDFALQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQSG 2259 EIFK AL +PGPP FALQ +Q+AIKPQKQT L QDENQ LEN LR LLQEL S AVQSG Sbjct: 8 EIFKQALLKPGPPESFALQVVQEAIKPQKQTKLAQDENQLLENILRKLLQELVSCAVQSG 67 Query: 2258 ERIMRFGQSIDDSESALGHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKEV 2079 E+IM +GQSIDD E+ IPRLLDIVLYLCE+ HVEGGMIFQLLEDLTEMSTMR+CK++ Sbjct: 68 EKIMHYGQSIDDEEAIQSQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKDI 127 Query: 2078 FGYIESKQDILGKQELFGRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSER 1899 F YIESKQDILGKQELF RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSER Sbjct: 128 FAYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSER 187 Query: 1898 SAVNIKGVFNTSNETKYEKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFSS 1719 SAVNIKGVFNTSNETKYEKDAP+GIS+DFNFYKTFWSLQEHFCNPAS ++AP KW KF+S Sbjct: 188 SAVNIKGVFNTSNETKYEKDAPEGISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKFTS 247 Query: 1718 NLMVVLDAFEAQPLSDDDGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQC 1539 NLMVVL+ FEAQPLSD++GNANNL EEEA TFSIKYLTSSKLMGLELKDP FRRHIL QC Sbjct: 248 NLMVVLNTFEAQPLSDEEGNANNL-EEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQC 306 Query: 1538 LILFDYLKAPGKNDKDAPSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREKN 1359 LILFDYLKAPGKNDKD PS+++KEEIKSCEERVKKLLE PPKGKEFL++IEHILEREKN Sbjct: 307 LILFDYLKAPGKNDKDLPSDSMKEEIKSCEERVKKLLETTPPKGKEFLHNIEHILEREKN 366 Query: 1358 WVWWKRDGCPPFEKHPTEKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQRV 1179 WVWWKRDGCPPFE+ P EKKA QDG +KRRPRWR+GNKELSQLWKWADQNPNALTD QRV Sbjct: 367 WVWWKRDGCPPFERQPIEKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALTDPQRV 426 Query: 1178 RTPSVMEYWKPLAEDMDFSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIEG 999 RTP+V EYWKPLAEDMD SAGIE EYHHKNN+VYCWKGLRF+ARQDL+GFSRF+E GIEG Sbjct: 427 RTPAVSEYWKPLAEDMDSSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTEYGIEG 486 Query: 998 VVPPDLLPPEVRSKHHNKPXXXXXXXXXXXXXXEPTQQVEDNQXXXXXXXXXXXXXXXXX 819 VVP +LLP +VRSK+ KP QQ E+NQ Sbjct: 487 VVPMELLPSDVRSKYQAKP-SDRSKRAKKEETKGAAQQAEENQIATPASEIDGEGTRVDL 545 Query: 818 
XXXXXXXXXXXXXXXXDISQSGAPTP----EQKQSPSGELGQEVGQSXXXXXXXXXXXAG 651 + A TP QKQS + GQE GQS AG Sbjct: 546 EASAAPMD----------TDVTATTPTADENQKQSSDTDAGQEAGQS----EADAEAEAG 591 Query: 650 IIDGETDLEAEPDA 609 +IDGETD E + DA Sbjct: 592 MIDGETDAEVDLDA 605 >ref|XP_006432406.1| hypothetical protein CICLE_v10000631mg [Citrus clementina] gi|557534528|gb|ESR45646.1| hypothetical protein CICLE_v10000631mg [Citrus clementina] Length = 608 Score = 867 bits (2241), Expect = 0.0 Identities = 439/613 (71%), Positives = 490/613 (79%), Gaps = 2/613 (0%) Frame = -3 Query: 2441 LEIFKGALSQPGPPSDFALQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQS 2262 +E+F+ A+ GPP +FALQT+Q+ IKPQKQT L QDENQ LEN LRTLLQEL SSAVQS Sbjct: 1 MEVFRRAILHAGPPENFALQTVQEVIKPQKQTKLAQDENQLLENMLRTLLQELVSSAVQS 60 Query: 2261 GERIMRFGQSIDDSESALGHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKE 2082 GE IM +GQSIDD E++ IPRLLDIVLYLCE+ HVEGGMIFQLLEDLTEMSTM++CK+ Sbjct: 61 GEPIMHYGQSIDDGETSQAQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKD 120 Query: 2081 VFGYIESKQDILGKQELFGRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 1902 +FGYIESKQDILGK ELF RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKLELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1901 RSAVNIKGVFNTSNETKYEKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFS 1722 RSAVNIKGVFNTSNETKYEKD PDGI VDFNFYKTFWSLQE+FCNPA T+AP KW KF+ Sbjct: 181 RSAVNIKGVFNTSNETKYEKDPPDGIPVDFNFYKTFWSLQEYFCNPA-LTLAPTKWQKFT 239 Query: 1721 SNLMVVLDAFEAQPLSDDDGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQ 1542 S+LMVVL+ F+AQPLSD+ G+AN L EEEA TF+IKYLTSSKLMGLELKDP FRRH+L Q Sbjct: 240 SSLMVVLNTFDAQPLSDEVGDANVL-EEEAATFNIKYLTSSKLMGLELKDPSFRRHVLVQ 298 Query: 1541 CLILFDYLKAPGKNDKDAPSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREK 1362 CLILFDYLKAPGKNDKD PSE++KEE+KSCEERVKKLLEM PPKGK+FL+SIEHILEREK Sbjct: 299 CLILFDYLKAPGKNDKDLPSESMKEEMKSCEERVKKLLEMTPPKGKDFLHSIEHILEREK 358 Query: 1361 NWVWWKRDGCPPFEKHPTEKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQR 1182 NWVWWKRDGCPPFEK EKKA QDG +KRRPRWRLGNKELSQLWKWADQNPNALTD QR Sbjct: 359 
NWVWWKRDGCPPFEKQSMEKKAVQDGPKKRRPRWRLGNKELSQLWKWADQNPNALTDPQR 418 Query: 1181 VRTPSVMEYWKPLAEDMDFSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIE 1002 VRTP++ EYWKPLAEDMD SAGIE EYHHKN++VYCWKGLRFSARQDL+GFSRF++ GIE Sbjct: 419 VRTPAITEYWKPLAEDMDPSAGIEAEYHHKNSRVYCWKGLRFSARQDLDGFSRFTDHGIE 478 Query: 1001 GVVPPDLLPPEVRSKHHNKPXXXXXXXXXXXXXXEPTQQVEDNQXXXXXXXXXXXXXXXX 822 GVVP +LLPP VRS++ K P+ Q E+NQ Sbjct: 479 GVVPLELLPPHVRSRYEGKANDRSKRAKKEDSKVAPS-QAEENQIAASASENDGDGIRAD 537 Query: 821 XXXXXXXXXXXXXXXXXDISQSGAPTPE--QKQSPSGELGQEVGQSXXXXXXXXXXXAGI 648 +ISQSG TP+ QKQS ++GQE GQ AG+ Sbjct: 538 LEASATPVETDVTAGTGNISQSGTATPDEHQKQSSDTDMGQEAGQ----LDADAEADAGM 593 Query: 647 IDGETDLEAEPDA 609 +DGETD E + +A Sbjct: 594 MDGETDAEVDLEA 606 >ref|XP_006465777.1| PREDICTED: THO complex subunit 1-like isoform X1 [Citrus sinensis] Length = 608 Score = 866 bits (2237), Expect = 0.0 Identities = 438/613 (71%), Positives = 490/613 (79%), Gaps = 2/613 (0%) Frame = -3 Query: 2441 LEIFKGALSQPGPPSDFALQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQS 2262 +E+F+ A+ Q GPP +FALQT+Q+ IKPQKQT L QDENQ LEN LRTLLQEL SSAVQS Sbjct: 1 MEVFRRAILQAGPPENFALQTVQEVIKPQKQTKLAQDENQLLENMLRTLLQELVSSAVQS 60 Query: 2261 GERIMRFGQSIDDSESALGHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKE 2082 GE IM +GQSIDD E++ IPRLLDIVLYLCE+ HVEGGMIFQLLEDLTEMSTM++CK+ Sbjct: 61 GEPIMHYGQSIDDGETSQAQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKD 120 Query: 2081 VFGYIESKQDILGKQELFGRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 1902 +FGYIESKQDILGK ELF RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKLELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1901 RSAVNIKGVFNTSNETKYEKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFS 1722 RSAVNIKGVFNTSNETKYEKD PDGI VDFNFYKTFWSLQE+FCNPA T+AP KW KF+ Sbjct: 181 RSAVNIKGVFNTSNETKYEKDPPDGIPVDFNFYKTFWSLQEYFCNPA-LTLAPTKWQKFT 239 Query: 1721 SNLMVVLDAFEAQPLSDDDGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQ 1542 S+LMVVL+ F+AQPLSD+ G+AN L EEEA TF+IKYLTSSKLMGLELKDP FRRH+L Q Sbjct: 240 
SSLMVVLNTFDAQPLSDEVGDANVL-EEEAATFNIKYLTSSKLMGLELKDPSFRRHVLVQ 298 Query: 1541 CLILFDYLKAPGKNDKDAPSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREK 1362 CLILFDYLKAPGKNDKD PSE++KEE+KSCEERVKKLLE PPKGK+FL+SIEHILEREK Sbjct: 299 CLILFDYLKAPGKNDKDLPSESMKEEMKSCEERVKKLLETTPPKGKDFLHSIEHILEREK 358 Query: 1361 NWVWWKRDGCPPFEKHPTEKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQR 1182 NWVWWKRDGCPPFEK EKKA QDG +KRRPRWRLGNKELSQLWKWADQNPNALTD QR Sbjct: 359 NWVWWKRDGCPPFEKQSMEKKAVQDGPKKRRPRWRLGNKELSQLWKWADQNPNALTDPQR 418 Query: 1181 VRTPSVMEYWKPLAEDMDFSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIE 1002 VRTP++ EYWKPLA+DMD SAGIE EYHHKN++VYCWKGLRFSARQDL+GFSRF++ GIE Sbjct: 419 VRTPAITEYWKPLADDMDPSAGIEAEYHHKNSRVYCWKGLRFSARQDLDGFSRFTDHGIE 478 Query: 1001 GVVPPDLLPPEVRSKHHNKPXXXXXXXXXXXXXXEPTQQVEDNQXXXXXXXXXXXXXXXX 822 GVVP +LLPP VRS++ K P+ Q E+NQ Sbjct: 479 GVVPLELLPPHVRSRYEGKANDRSKRAKKEDSKVAPS-QAEENQIAASASENDGEGIRAD 537 Query: 821 XXXXXXXXXXXXXXXXXDISQSGAPTPE--QKQSPSGELGQEVGQSXXXXXXXXXXXAGI 648 +ISQSG TP+ QKQS ++GQE GQ AG+ Sbjct: 538 LEASATPVETDVTAGTGNISQSGTATPDEHQKQSSDTDMGQEAGQ----LDADAEADAGM 593 Query: 647 IDGETDLEAEPDA 609 +DGETD E + +A Sbjct: 594 MDGETDAEVDLEA 606 >ref|XP_007204592.1| hypothetical protein PRUPE_ppa003099mg [Prunus persica] gi|462400123|gb|EMJ05791.1| hypothetical protein PRUPE_ppa003099mg [Prunus persica] Length = 604 Score = 861 bits (2225), Expect = 0.0 Identities = 432/608 (71%), Positives = 480/608 (78%), Gaps = 2/608 (0%) Frame = -3 Query: 2441 LEIFKGALSQPGPPSDFALQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQS 2262 +E+F+ A+ QPGPP +FALQT+Q IKPQKQT LVQDENQ LEN LRTLLQEL S Sbjct: 1 MEVFRRAILQPGPPENFALQTVQQVIKPQKQTKLVQDENQLLENILRTLLQELVS----- 55 Query: 2261 GERIMRFGQSIDDSESALGHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKE 2082 GE+IM++GQSIDD E+ GHIPRLLDIVLYLCE H+EGGMIFQLLEDLTEMSTMR+CK+ Sbjct: 56 GEQIMQYGQSIDDGETTQGHIPRLLDIVLYLCENEHIEGGMIFQLLEDLTEMSTMRNCKD 115 Query: 2081 VFGYIESKQDILGKQELFGRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 1902 VFGYIESKQDILGK ELF RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 
Sbjct: 116 VFGYIESKQDILGKPELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 175 Query: 1901 RSAVNIKGVFNTSNETKYEKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFS 1722 RSAVNIKGVFNTSNETKYEKD PDGIS+DFNFYKTFWSLQEHFCNP S T+AP KW KF+ Sbjct: 176 RSAVNIKGVFNTSNETKYEKDPPDGISIDFNFYKTFWSLQEHFCNPPSLTLAPTKWKKFT 235 Query: 1721 SNLMVVLDAFEAQPLSDDDGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQ 1542 S LMVVL+ FEAQPLSD++G+AN+L EEEA FSIKYLTSSKLMGLELKDP FRRHIL Q Sbjct: 236 SGLMVVLNTFEAQPLSDEEGDANSL-EEEAANFSIKYLTSSKLMGLELKDPSFRRHILVQ 294 Query: 1541 CLILFDYLKAPGKNDKDAPSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREK 1362 CLILFDYLKAPGK++KD PS+++KEEIKSCEERVKKLLEM PPKG+ FL+ IEHILEREK Sbjct: 295 CLILFDYLKAPGKSEKDLPSDSMKEEIKSCEERVKKLLEMTPPKGENFLHKIEHILEREK 354 Query: 1361 NWVWWKRDGCPPFEKHPTEKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQR 1182 NWVWWKRDGCPPFEK P EKK Q+G +KRRPRWR+GNKELS LWKWADQNPNALTD QR Sbjct: 355 NWVWWKRDGCPPFEKQPAEKKVVQEGAKKRRPRWRMGNKELSLLWKWADQNPNALTDPQR 414 Query: 1181 VRTPSVMEYWKPLAEDMDFSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIE 1002 VRTP++ +YWKPLA+DMD +AGIE EYHHKNN+VYCWKGLRFSARQDLEGFSRF+E GIE Sbjct: 415 VRTPAITDYWKPLADDMDPAAGIEAEYHHKNNRVYCWKGLRFSARQDLEGFSRFTEFGIE 474 Query: 1001 GVVPPDLLPPEVRSKHHNKPXXXXXXXXXXXXXXEPTQQVEDNQXXXXXXXXXXXXXXXX 822 GVVP +LL PE RSK+ KP QVE+NQ Sbjct: 475 GVVPLELLTPEERSKYQAKP-NDKSKRAKKEETKGAAHQVEENQIATAANEIDGEGIRAV 533 Query: 821 XXXXXXXXXXXXXXXXXDISQSGAPTPE--QKQSPSGELGQEVGQSXXXXXXXXXXXAGI 648 D+SQ G+P P+ QKQS ++GQE GQ G Sbjct: 534 LEASVTPTDTDATVATGDMSQGGSPIPDEHQKQSSDTDVGQEAGQMEADAEVEAGMIDGG 593 Query: 647 IDGETDLE 624 +D E DL+ Sbjct: 594 MDTEVDLD 601 >ref|XP_004307195.1| PREDICTED: THO complex subunit 1-like [Fragaria vesca subsp. 
vesca] Length = 611 Score = 861 bits (2224), Expect = 0.0 Identities = 436/615 (70%), Positives = 482/615 (78%), Gaps = 2/615 (0%) Frame = -3 Query: 2441 LEIFKGALSQPGPPSDFALQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQS 2262 +E+F+ A+ QPGPP FALQT+Q IKPQK T LVQDENQ LEN LRTLLQEL SSAVQS Sbjct: 1 MEVFRSAILQPGPPETFALQTVQQVIKPQKGTKLVQDENQLLENILRTLLQELVSSAVQS 60 Query: 2261 GERIMRFGQSIDDSESALGHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKE 2082 GE+IM++GQSIDD E+ GHIPRLLD+VLYLCE HVEGGMIFQLLEDLTEMSTMR+CK+ Sbjct: 61 GEQIMQYGQSIDDGEATRGHIPRLLDVVLYLCENEHVEGGMIFQLLEDLTEMSTMRNCKD 120 Query: 2081 VFGYIESKQDILGKQELFGRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 1902 VFGYIESKQDILGKQELF RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE Sbjct: 121 VFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1901 RSAVNIKGVFNTSNETKYEKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFS 1722 RSAVNIKGVFNTSNETKYEKDAPDGIS+DFNFYKTFWSLQE+FCNPA T+AP KW KF+ Sbjct: 181 RSAVNIKGVFNTSNETKYEKDAPDGISIDFNFYKTFWSLQEYFCNPAPLTVAPTKWQKFT 240 Query: 1721 SNLMVVLDAFEAQPLSDDDGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQ 1542 S+L VVL+ FEAQPLSD++G ANNL EE+ FSIKYLTSSKLMGLELKDP FRRHIL Q Sbjct: 241 SSLKVVLNTFEAQPLSDEEGEANNL--EESANFSIKYLTSSKLMGLELKDPSFRRHILVQ 298 Query: 1541 CLILFDYLKAPGKNDKDAPSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREK 1362 CLILFDYLKAPGK++KD PSE++KEEI S EE VKKLLEM PPKG+ FL+ IEHILEREK Sbjct: 299 CLILFDYLKAPGKSEKDLPSESMKEEINSYEEHVKKLLEMTPPKGESFLHKIEHILEREK 358 Query: 1361 NWVWWKRDGCPPFEKHPTEKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQR 1182 NWVWWKRDGCPPFEK P EKK QDG +KR+PRWRLGNKELSQLWKWADQNPNALTDTQR Sbjct: 359 NWVWWKRDGCPPFEKQPIEKKTVQDGAKKRKPRWRLGNKELSQLWKWADQNPNALTDTQR 418 Query: 1181 VRTPSVMEYWKPLAEDMDFSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIE 1002 +RTPS+ EYWKPLAEDMD +AGIE EYHHKNN+VYCWKGLRFSARQDLEGFS+F+E GIE Sbjct: 419 LRTPSITEYWKPLAEDMDPAAGIEAEYHHKNNRVYCWKGLRFSARQDLEGFSKFTEFGIE 478 Query: 1001 GVVPPDLLPPEVRSKHHNKPXXXXXXXXXXXXXXEPTQQVEDNQXXXXXXXXXXXXXXXX 822 GVVP +LLPPE R+K+ K VE+NQ Sbjct: 479 
GVVPLELLPPEERAKYAPK-TNEKSKRAKKEDAKAAVHHVEENQ-VATAATDVDGEVLRT 536 Query: 821 XXXXXXXXXXXXXXXXXDISQSGAPTPE--QKQSPSGELGQEVGQSXXXXXXXXXXXAGI 648 + SQ +P + QKQS + GQE GQ AG+ Sbjct: 537 DVGALVAPLDTDNTMVCNTSQGNSPMADEHQKQSSDTDGGQEAGQLEDDAEVDAEGDAGM 596 Query: 647 IDGETDLEAEPDATS 603 IDGE + E + D S Sbjct: 597 IDGEIEPEVDLDPAS 611 >ref|XP_006465778.1| PREDICTED: THO complex subunit 1-like isoform X2 [Citrus sinensis] Length = 607 Score = 859 bits (2220), Expect = 0.0 Identities = 437/613 (71%), Positives = 489/613 (79%), Gaps = 2/613 (0%) Frame = -3 Query: 2441 LEIFKGALSQPGPPSDFALQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQS 2262 +E+F+ A+ Q GPP +FALQT+Q+ IKPQKQT L QDENQ LEN LRTLLQEL SSAVQS Sbjct: 1 MEVFRRAILQAGPPENFALQTVQEVIKPQKQTKLAQDENQLLENMLRTLLQELVSSAVQS 60 Query: 2261 GERIMRFGQSIDDSESALGHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKE 2082 GE IM +GQSIDD E++ IPRLLDIVLYLCE+ HVEGGMIFQLLEDLTEMSTM++CK+ Sbjct: 61 GEPIMHYGQSIDDGETSQAQIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKD 120 Query: 2081 VFGYIESKQDILGKQELFGRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 1902 +FGYIESKQDILGK ELF RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKLELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1901 RSAVNIKGVFNTSNETKYEKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFS 1722 RSAVNIKGVFNTSNETKYEKD PDGI VDFNFYKTFWSLQE+FCNPA T+AP KW KF+ Sbjct: 181 RSAVNIKGVFNTSNETKYEKDPPDGIPVDFNFYKTFWSLQEYFCNPA-LTLAPTKWQKFT 239 Query: 1721 SNLMVVLDAFEAQPLSDDDGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQ 1542 S+LMVVL+ F+AQPLSD+ G+AN L EEEA TF+IKYLTSSKLMGLELKDP FRRH+L Q Sbjct: 240 SSLMVVLNTFDAQPLSDEVGDANVL-EEEAATFNIKYLTSSKLMGLELKDPSFRRHVLVQ 298 Query: 1541 CLILFDYLKAPGKNDKDAPSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREK 1362 CLILFDYLKAPGKNDKD PSE++KEE+KSCEERVKKLLE PPKGK+FL+SIEHILEREK Sbjct: 299 CLILFDYLKAPGKNDKDLPSESMKEEMKSCEERVKKLLETTPPKGKDFLHSIEHILEREK 358 Query: 1361 NWVWWKRDGCPPFEKHPTEKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQR 1182 NWVWWKRDGCPPFEK EKKA QDG K+RPRWRLGNKELSQLWKWADQNPNALTD QR Sbjct: 359 
NWVWWKRDGCPPFEKQSMEKKAVQDG-PKKRPRWRLGNKELSQLWKWADQNPNALTDPQR 417 Query: 1181 VRTPSVMEYWKPLAEDMDFSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIE 1002 VRTP++ EYWKPLA+DMD SAGIE EYHHKN++VYCWKGLRFSARQDL+GFSRF++ GIE Sbjct: 418 VRTPAITEYWKPLADDMDPSAGIEAEYHHKNSRVYCWKGLRFSARQDLDGFSRFTDHGIE 477 Query: 1001 GVVPPDLLPPEVRSKHHNKPXXXXXXXXXXXXXXEPTQQVEDNQXXXXXXXXXXXXXXXX 822 GVVP +LLPP VRS++ K P+ Q E+NQ Sbjct: 478 GVVPLELLPPHVRSRYEGKANDRSKRAKKEDSKVAPS-QAEENQIAASASENDGEGIRAD 536 Query: 821 XXXXXXXXXXXXXXXXXDISQSGAPTPE--QKQSPSGELGQEVGQSXXXXXXXXXXXAGI 648 +ISQSG TP+ QKQS ++GQE GQ AG+ Sbjct: 537 LEASATPVETDVTAGTGNISQSGTATPDEHQKQSSDTDMGQEAGQ----LDADAEADAGM 592 Query: 647 IDGETDLEAEPDA 609 +DGETD E + +A Sbjct: 593 MDGETDAEVDLEA 605 >ref|XP_002529986.1| nuclear matrix protein, putative [Ricinus communis] gi|223530509|gb|EEF32391.1| nuclear matrix protein, putative [Ricinus communis] Length = 608 Score = 857 bits (2213), Expect = 0.0 Identities = 431/615 (70%), Positives = 488/615 (79%), Gaps = 2/615 (0%) Frame = -3 Query: 2441 LEIFKGALSQPGPPSDFALQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQS 2262 +E FK A+ QPGPP +FALQT+Q+ IKPQ+QT L QDENQ LEN LRTLLQEL +SAV S Sbjct: 1 MEEFKNAILQPGPPENFALQTVQEFIKPQRQTKLAQDENQLLENMLRTLLQELVASAVHS 60 Query: 2261 GERIMRFGQSIDDSESALGHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKE 2082 GE+IM +GQS+D+ E + G IPRLLD+VL+LCER HVEGGMIFQLLEDLTEMSTM++C++ Sbjct: 61 GEQIMLYGQSVDEGEKSQGQIPRLLDVVLHLCEREHVEGGMIFQLLEDLTEMSTMKNCQD 120 Query: 2081 VFGYIESKQDILGKQELFGRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 1902 +FGYIESKQDILGKQELF RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1901 RSAVNIKGVFNTSNETKYEKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFS 1722 RSAVNIKGVFNTSNETKYEKD P GISVDFNFYKT WSLQE+FCNPA T+AP KWHKF+ Sbjct: 181 RSAVNIKGVFNTSNETKYEKDPPAGISVDFNFYKTLWSLQENFCNPAPLTLAPTKWHKFT 240 Query: 1721 SNLMVVLDAFEAQPLSDDDGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQ 1542 S+LMVVL+ FEAQPLS+++G+ANNL EEEA TF+IKYLTSSKLMGLELKDP FRRHIL Q 
Sbjct: 241 SSLMVVLNTFEAQPLSEEEGDANNL-EEEAATFNIKYLTSSKLMGLELKDPSFRRHILVQ 299 Query: 1541 CLILFDYLKAPGKNDKDAPSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREK 1362 CLILFDYLKAPGKNDKD+ SE++KE+I++CEERVKKLLEM PPKGK+FL IEH+LEREK Sbjct: 300 CLILFDYLKAPGKNDKDSTSESMKEDIRTCEERVKKLLEMTPPKGKDFLQKIEHVLEREK 359 Query: 1361 NWVWWKRDGCPPFEKHPTEKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQR 1182 NWV WKRDGC PFEK P E K Q+G +KR+PRWRLGNKELSQLWKWADQNPNALTD QR Sbjct: 360 NWVCWKRDGCQPFEKQPIENKTIQEGSKKRKPRWRLGNKELSQLWKWADQNPNALTDPQR 419 Query: 1181 VRTPSVMEYWKPLAEDMDFSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIE 1002 VRTP++ EYWKPLAEDMD SAGIE EYHHKNN+VYCWKGLRFSARQDL+GFSRF++ GIE Sbjct: 420 VRTPAITEYWKPLAEDMDPSAGIEAEYHHKNNRVYCWKGLRFSARQDLDGFSRFTDHGIE 479 Query: 1001 GVVPPDLLPPEVRSKHHNKPXXXXXXXXXXXXXXEPTQQVEDNQXXXXXXXXXXXXXXXX 822 GVVP +LLPP+VRSK+ KP + Q E+NQ Sbjct: 480 GVVPLELLPPDVRSKYQAKPNDRSKRAKKDDIKGG-SNQTEENQ-IATPASEIDGEGIRA 537 Query: 821 XXXXXXXXXXXXXXXXXDISQSGAPTPE--QKQSPSGELGQEVGQSXXXXXXXXXXXAGI 648 SQ G PTP+ Q+QSP + GQE G AG+ Sbjct: 538 DEAAAAPMDTDAMATAGSTSQGGTPTPDERQRQSPDADDGQEAGH----LEADGEVEAGM 593 Query: 647 IDGETDLEAEPDATS 603 IDGETD E + +A S Sbjct: 594 IDGETDAEVDLEAIS 608 >ref|XP_007010828.1| Nuclear matrix protein-related isoform 1 [Theobroma cacao] gi|508727741|gb|EOY19638.1| Nuclear matrix protein-related isoform 1 [Theobroma cacao] Length = 602 Score = 855 bits (2210), Expect = 0.0 Identities = 433/610 (70%), Positives = 483/610 (79%), Gaps = 2/610 (0%) Frame = -3 Query: 2444 VLEIFKGALSQPGPPSDFALQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQ 2265 ++E F+ A+ QPGPP FAL+ +Q+ IKPQKQT L QDENQ LEN LRTLLQEL SS+V Sbjct: 1 MMEAFRRAILQPGPPETFALKIVQEVIKPQKQTKLAQDENQLLENMLRTLLQELVSSSVP 60 Query: 2264 SGERIMRFGQSIDDSESALGHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCK 2085 SGE IM++G+SIDD G IPRLLD VLYLCE+ HVEGGMIFQLLEDL EMSTMR+CK Sbjct: 61 SGEEIMQYGKSIDDESDTQGVIPRLLDFVLYLCEKEHVEGGMIFQLLEDLNEMSTMRNCK 120 Query: 2084 EVFGYIESKQDILGKQELFGRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLS 1905 ++F YIESKQDILGKQELF 
RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLS Sbjct: 121 DIFRYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLS 180 Query: 1904 ERSAVNIKGVFNTSNETKYEKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKF 1725 ERSAVNIKGVFNTSNETKYEKD P+GISVDFNFYKTFWSLQ++FCNPAS + AP KW KF Sbjct: 181 ERSAVNIKGVFNTSNETKYEKDPPEGISVDFNFYKTFWSLQDYFCNPASLSTAPVKWQKF 240 Query: 1724 SSNLMVVLDAFEAQPLSDDDGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILF 1545 +S+LMVVL+ FEAQPLS+++G NNL EEEATTF+IKYLTSSKLMGLELKDP FRRHIL Sbjct: 241 TSSLMVVLNTFEAQPLSEEEGADNNL-EEEATTFNIKYLTSSKLMGLELKDPSFRRHILL 299 Query: 1544 QCLILFDYLKAPGKNDKDAPSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILERE 1365 QCLILFDYLKAPGKNDKD+ SE++KEEIKSCE+RVKKLLE+ PPKGK+FL SIEHILERE Sbjct: 300 QCLILFDYLKAPGKNDKDS-SESMKEEIKSCEDRVKKLLEVTPPKGKDFLCSIEHILERE 358 Query: 1364 KNWVWWKRDGCPPFEKHPTEKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQ 1185 KNWVWWKRDGCPPFEK P EKK Q+G +KRRPRWRLGNKELSQLWKWADQNPNALTD Q Sbjct: 359 KNWVWWKRDGCPPFEKQPIEKKPVQNGAKKRRPRWRLGNKELSQLWKWADQNPNALTDPQ 418 Query: 1184 RVRTPSVMEYWKPLAEDMDFSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGI 1005 RVRTP++ +YWKPLAEDMD SAGIE EYHHKNN+VYCWKGLRF+ARQDLEGFS+F+E GI Sbjct: 419 RVRTPAITDYWKPLAEDMDESAGIEAEYHHKNNRVYCWKGLRFAARQDLEGFSKFTEHGI 478 Query: 1004 EGVVPPDLLPPEVRSKHHNKPXXXXXXXXXXXXXXEPTQQVEDNQXXXXXXXXXXXXXXX 825 EGVVP +LLPP+VRSK KP + QVE++Q Sbjct: 479 EGVVPLELLPPDVRSKFQGKP-SDRSKRAKKEETKTSSHQVEESQIATPASEVDGEGMRA 537 Query: 824 XXXXXXXXXXXXXXXXXXDISQSGAPTPE--QKQSPSGELGQEVGQSXXXXXXXXXXXAG 651 + SQ G PTP+ QKQSP ++GQE GQ Sbjct: 538 DMEASAALMDADVTAGTGNNSQGGTPTPDEHQKQSPDTDVGQEAGQLEADAEVEAG---- 593 Query: 650 IIDGETDLEA 621 IDGETD EA Sbjct: 594 -IDGETDPEA 602 >ref|XP_002468664.1| hypothetical protein SORBIDRAFT_01g049910 [Sorghum bicolor] gi|241922518|gb|EER95662.1| hypothetical protein SORBIDRAFT_01g049910 [Sorghum bicolor] Length = 637 Score = 851 bits (2198), Expect = 0.0 Identities = 419/542 (77%), Positives = 467/542 (86%), Gaps = 9/542 (1%) Frame = -3 Query: 2540 MAEATP------GLRILLHQHQKERAP---VHISSHADRDRVLEIFKGALSQPGPPSDFA 2388 MAE 
+P GLRILL + + +P +SSHADRDR++ +F+ ALS+ PP F+ Sbjct: 1 MAEPSPPPASNAGLRILLSKDRPAPSPPPTAAVSSHADRDRIIGVFRSALSRNEPPETFS 60 Query: 2387 LQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQSGERIMRFGQSIDDSESAL 2208 LQT+Q+AIKPQK+TVLV +ENQSLENALRTLLQEL SSAVQSG++IM++G S+D ES Sbjct: 61 LQTVQEAIKPQKETVLVLEENQSLENALRTLLQELVSSAVQSGKKIMQYGNSLDSGESNC 120 Query: 2207 GHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKEVFGYIESKQDILGKQELF 2028 I RLLDIVLYLCERGHVEGGM+FQLLEDLTEMST++DCK++FGYIES+QD+LGKQELF Sbjct: 121 P-ITRLLDIVLYLCERGHVEGGMVFQLLEDLTEMSTIKDCKDIFGYIESQQDVLGKQELF 179 Query: 2027 GRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY 1848 GRGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSERSA+NIKGVFNTSN TKY Sbjct: 180 GRGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSERSALNIKGVFNTSNVTKY 239 Query: 1847 EKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFSSNLMVVLDAFEAQPLSDD 1668 EKDA DGISVDFNFYKT WSLQEHF NPA T PAKW KFSSNL VVL FEAQPLSDD Sbjct: 240 EKDAMDGISVDFNFYKTLWSLQEHFSNPALTNTNPAKWQKFSSNLAVVLSTFEAQPLSDD 299 Query: 1667 DGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQCLILFDYLKAPGKNDKDA 1488 DG NNL+EEE F+IKYLTSSKLMGLELKDP FRRHIL QCLI FDYLKAPGKNDK+ Sbjct: 300 DGKLNNLNEEEDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLIFFDYLKAPGKNDKEG 359 Query: 1487 PSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREKNWVWWKRDGCPPFEKHPT 1308 P+ ++KEEIKSCEE VKKLLE+IPPKGKEFL SIEHILEREKNWVWWKRDGC FEK P Sbjct: 360 PTGSMKEEIKSCEEHVKKLLEIIPPKGKEFLKSIEHILEREKNWVWWKRDGCLAFEKPPF 419 Query: 1307 EKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQRVRTPSVMEYWKPLAEDMD 1128 EKK GQ G RKR+PRWRLGNKELSQLWKWA+QNPN LTD RVR PS+ EYWKPLAEDMD Sbjct: 420 EKKPGQAGGRKRKPRWRLGNKELSQLWKWAEQNPNVLTDPDRVRMPSITEYWKPLAEDMD 479 Query: 1127 FSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIEGVVPPDLLPPEVRSKHHN 948 SAGIEEEYHHK+N+VYCWKGLRFSARQDL+GF+RFS+ GIEGVVP +LLPPEV ++ + Sbjct: 480 PSAGIEEEYHHKSNRVYCWKGLRFSARQDLDGFARFSDYGIEGVVPSELLPPEVNARFSS 539 Query: 947 KP 942 KP Sbjct: 540 KP 541 >ref|XP_002299188.1| hypothetical protein POPTR_0001s06900g [Populus trichocarpa] gi|222846446|gb|EEE83993.1| hypothetical protein 
POPTR_0001s06900g [Populus trichocarpa] Length = 608 Score = 846 bits (2186), Expect = 0.0 Identities = 431/613 (70%), Positives = 481/613 (78%), Gaps = 2/613 (0%) Frame = -3 Query: 2441 LEIFKGALSQPGPPSDFALQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQS 2262 +E F+ A+ QPGP FAL+T+Q+ IKPQKQT LVQDENQ LEN LRTLLQEL SSA QS Sbjct: 1 MEEFRRAILQPGPVETFALKTVQEFIKPQKQTKLVQDENQLLENMLRTLLQELVSSAAQS 60 Query: 2261 GERIMRFGQSIDDSESALGHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKE 2082 GE IM G+SIDD E++ G IPRLLD VLYLCER H+EGGMIFQLLEDLTEMSTMR+CK+ Sbjct: 61 GEEIMLSGKSIDDEENSQGQIPRLLDAVLYLCEREHIEGGMIFQLLEDLTEMSTMRNCKD 120 Query: 2081 VFGYIESKQDILGKQELFGRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 1902 +FGYIESKQDILGKQELF RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE Sbjct: 121 IFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 180 Query: 1901 RSAVNIKGVFNTSNETKYEKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFS 1722 RSAVNIKGVFNTSNETKYEK+ P IS+DFNFYKT WSLQE+FC+P S T++P KW KFS Sbjct: 181 RSAVNIKGVFNTSNETKYEKEPPAAISLDFNFYKTLWSLQEYFCDP-SLTLSPIKWQKFS 239 Query: 1721 SNLMVVLDAFEAQPLSDDDGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQ 1542 S+LMVVL+ FEAQPLS+++G+ANNL EEEA F+IKYLTSSKLMGLELKDP FRRH+L Q Sbjct: 240 SSLMVVLNTFEAQPLSEEEGDANNL-EEEAAAFNIKYLTSSKLMGLELKDPSFRRHVLVQ 298 Query: 1541 CLILFDYLKAPGKNDKDAPSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREK 1362 CLILFDYLKAPGKNDKD SE++KEEI+S EE VKKLLEM PPKGK+FL+ +EHILEREK Sbjct: 299 CLILFDYLKAPGKNDKDLTSESMKEEIRSREEHVKKLLEMTPPKGKDFLHMVEHILEREK 358 Query: 1361 NWVWWKRDGCPPFEKHPTEKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQR 1182 NW+WWKRDGCPPFEK P E K QDG +KRRPRWRLGNKELSQLWKWADQNPNALTD QR Sbjct: 359 NWLWWKRDGCPPFEKQPIENKTVQDGGKKRRPRWRLGNKELSQLWKWADQNPNALTDPQR 418 Query: 1181 VRTPSVMEYWKPLAEDMDFSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIE 1002 VRTP + +YWKPLAEDMD SAGI+ EYHHKNN+VYCWKGLRFSARQDL+GFSRF++ GIE Sbjct: 419 VRTPIITDYWKPLAEDMDPSAGIDAEYHHKNNRVYCWKGLRFSARQDLDGFSRFTDHGIE 478 Query: 1001 GVVPPDLLPPEVRSKHHNKPXXXXXXXXXXXXXXEPTQQVEDNQXXXXXXXXXXXXXXXX 822 GVVP +LLPP+VRSK+ KP QVEDNQ Sbjct: 479 
GVVPLELLPPDVRSKYQAKP-NDRSKRAKKDEPKGALHQVEDNQISTPASEIDGEGIRID 537 Query: 821 XXXXXXXXXXXXXXXXXDISQSGAPTPE--QKQSPSGELGQEVGQSXXXXXXXXXXXAGI 648 ISQSG PTP+ QKQ + GQE GQ AG+ Sbjct: 538 LEASAAPMDTDVTATTGSISQSGTPTPDEHQKQGSDTDGGQEAGQ----LEADAEAEAGM 593 Query: 647 IDGETDLEAEPDA 609 IDGETD E + +A Sbjct: 594 IDGETDAEVDLEA 606 >ref|NP_001159168.1| hypothetical protein [Zea mays] gi|223942431|gb|ACN25299.1| unknown [Zea mays] gi|414864321|tpg|DAA42878.1| TPA: hypothetical protein ZEAMMB73_799316 [Zea mays] Length = 638 Score = 846 bits (2185), Expect = 0.0 Identities = 417/542 (76%), Positives = 467/542 (86%), Gaps = 9/542 (1%) Frame = -3 Query: 2540 MAEATP------GLRILLHQHQKERAP---VHISSHADRDRVLEIFKGALSQPGPPSDFA 2388 MAE +P GLRILL + + +P +SSHADRDR++ +F+ ALS+ PP F+ Sbjct: 1 MAEPSPPPVSNAGLRILLSKDRPAPSPPPTAAVSSHADRDRIIGVFRSALSRNEPPETFS 60 Query: 2387 LQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQSGERIMRFGQSIDDSESAL 2208 LQT+Q+AIKPQK+TVLV +ENQSLENALRTLLQEL SSAVQS ++IM++G S+D ES Sbjct: 61 LQTVQEAIKPQKETVLVLEENQSLENALRTLLQELVSSAVQSDKKIMQYGNSLDSGESNC 120 Query: 2207 GHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKEVFGYIESKQDILGKQELF 2028 I RLLDIVLYLCERGHVEGGM+FQLLEDLTEMST++DCK++FGYIES+QD+LGKQELF Sbjct: 121 -LITRLLDIVLYLCERGHVEGGMVFQLLEDLTEMSTIKDCKDIFGYIESQQDVLGKQELF 179 Query: 2027 GRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY 1848 GRGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSERSA+NIKGVFNTSN TKY Sbjct: 180 GRGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSERSALNIKGVFNTSNVTKY 239 Query: 1847 EKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFSSNLMVVLDAFEAQPLSDD 1668 EKDA DGISVDFNFYKT WSLQEHF NPA T+ PAKW KFSSNL VVL+ FEAQPLSDD Sbjct: 240 EKDAMDGISVDFNFYKTLWSLQEHFSNPALTSTNPAKWQKFSSNLAVVLNTFEAQPLSDD 299 Query: 1667 DGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQCLILFDYLKAPGKNDKDA 1488 DG NNL+EEE F+IKYLTSSKLMGLELKDP FRRHIL QCLI FDYLKAPGKNDK+ Sbjct: 300 DGKLNNLNEEEDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLIFFDYLKAPGKNDKEG 359 Query: 1487 PSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREKNWVWWKRDGCPPFEKHPT 1308 P+ ++ EEIKSCEE 
VKKLLE+IPPKGKEFL SIEHILEREKNWVWWKRDGC FEK P Sbjct: 360 PTGSMIEEIKSCEEHVKKLLEIIPPKGKEFLKSIEHILEREKNWVWWKRDGCLAFEKPPF 419 Query: 1307 EKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQRVRTPSVMEYWKPLAEDMD 1128 EKK GQ G RKR+PRWRLG+KELSQLWKWA+QNPN LTD RVR PS+ EYWKPLAEDMD Sbjct: 420 EKKPGQAGARKRKPRWRLGSKELSQLWKWAEQNPNVLTDPDRVRMPSITEYWKPLAEDMD 479 Query: 1127 FSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIEGVVPPDLLPPEVRSKHHN 948 SAGIEEEYHHK+N+VYCWKGLRFSARQDL+GF+RFS+ GIEGVVP +LLPPEV +K + Sbjct: 480 PSAGIEEEYHHKSNRVYCWKGLRFSARQDLDGFARFSDYGIEGVVPSELLPPEVNAKFSS 539 Query: 947 KP 942 KP Sbjct: 540 KP 541 >ref|XP_006649251.1| PREDICTED: THO complex subunit 1-like [Oryza brachyantha] Length = 644 Score = 845 bits (2182), Expect = 0.0 Identities = 417/542 (76%), Positives = 464/542 (85%), Gaps = 12/542 (2%) Frame = -3 Query: 2531 ATPGLRILLHQHQKERAPVH------------ISSHADRDRVLEIFKGALSQPGPPSDFA 2388 +T GLRILL K+R P +SSH DRDR++ +F+ ALS+ P FA Sbjct: 10 STAGLRILL---SKDRPPASSSSSAPATAAAAVSSHTDRDRIIGVFRDALSRTESPEAFA 66 Query: 2387 LQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQSGERIMRFGQSIDDSESAL 2208 LQ +Q+AIKPQKQTVLV +ENQSLENALR LLQELASSAVQSG+RIM++G S+D+ ES Sbjct: 67 LQAVQEAIKPQKQTVLVLEENQSLENALRALLQELASSAVQSGKRIMQYGNSLDNGESNC 126 Query: 2207 GHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKEVFGYIESKQDILGKQELF 2028 I RLLDIVLYLCERGHVEGGM+FQLLEDLTEMST++DCK+VFGYIESKQD+LGKQELF Sbjct: 127 P-ITRLLDIVLYLCERGHVEGGMVFQLLEDLTEMSTIKDCKDVFGYIESKQDVLGKQELF 185 Query: 2027 GRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY 1848 GRGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSERSA+NIKGVFNTSNETKY Sbjct: 186 GRGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSERSALNIKGVFNTSNETKY 245 Query: 1847 EKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFSSNLMVVLDAFEAQPLSDD 1668 EKDA DGISVDFNFY T WSLQEHF NPA T KW F+SNL VVL FEAQPLS+D Sbjct: 246 EKDATDGISVDFNFYNTLWSLQEHFSNPALTAANLTKWQNFASNLTVVLSTFEAQPLSED 305 Query: 1667 DGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQCLILFDYLKAPGKNDKDA 1488 DG NNLD+EE F+IKYLTSSKLMGLELKDP FRRHIL QCLILFD+LKAPGK DK+ Sbjct: 306 
DGKLNNLDQEEDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLILFDFLKAPGKTDKEG 365 Query: 1487 PSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREKNWVWWKRDGCPPFEKHPT 1308 P+ ++KEEI SCEERVKKLLE+IPPKGKEFL SIEHILEREKNWVWWKRDGC FEK P Sbjct: 366 PTGSMKEEIDSCEERVKKLLEIIPPKGKEFLQSIEHILEREKNWVWWKRDGCLAFEKQPF 425 Query: 1307 EKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQRVRTPSVMEYWKPLAEDMD 1128 EKK+GQ GV+KR+PRWRLGNKEL+QLWKWA+QNPNALTD++R+ PSV EYWKPLAEDMD Sbjct: 426 EKKSGQAGVKKRKPRWRLGNKELAQLWKWAEQNPNALTDSERICMPSVTEYWKPLAEDMD 485 Query: 1127 FSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIEGVVPPDLLPPEVRSKHHN 948 SAGIE+EYHHKNN+VYCWKGLRFSARQDLEGFSRF + GIEGVVP +LLPPEVRSK ++ Sbjct: 486 PSAGIEDEYHHKNNRVYCWKGLRFSARQDLEGFSRFCDYGIEGVVPQELLPPEVRSKFYS 545 Query: 947 KP 942 KP Sbjct: 546 KP 547 >ref|XP_003558999.1| PREDICTED: THO complex subunit 1-like [Brachypodium distachyon] Length = 630 Score = 844 bits (2180), Expect = 0.0 Identities = 417/537 (77%), Positives = 463/537 (86%), Gaps = 7/537 (1%) Frame = -3 Query: 2531 ATPGLRILLHQHQKERAPVH-------ISSHADRDRVLEIFKGALSQPGPPSDFALQTMQ 2373 + PGLRILL K+R P +SSHADRDR++ +F+ ALS+ P FALQ +Q Sbjct: 10 SNPGLRILL---AKDRPPTSSPSTLAAVSSHADRDRIIGVFRNALSRTESPEVFALQAVQ 66 Query: 2372 DAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQSGERIMRFGQSIDDSESALGHIPR 2193 +AIKPQKQTVLV +ENQSLENALR LLQEL SSAVQSG+ IM++G S+D ES I Sbjct: 67 EAIKPQKQTVLVLEENQSLENALRRLLQELVSSAVQSGKGIMQYGNSLDSGESNC-LITH 125 Query: 2192 LLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKEVFGYIESKQDILGKQELFGRGKL 2013 LLDI+LYLCERGHVEGGM+FQLLEDLT+MST++DCK+VFGYIESKQD+LGKQELFGRGKL Sbjct: 126 LLDIMLYLCERGHVEGGMVFQLLEDLTDMSTIKDCKDVFGYIESKQDVLGKQELFGRGKL 185 Query: 2012 VMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKDAP 1833 VMLRTCNQLLRRLSK+NDVVFCGRI+MFLAHFFPLSERSA+NIKGVFNTSNETKYEKDA Sbjct: 186 VMLRTCNQLLRRLSKSNDVVFCGRIIMFLAHFFPLSERSALNIKGVFNTSNETKYEKDAT 245 Query: 1832 DGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFSSNLMVVLDAFEAQPLSDDDGNAN 1653 GISVDFNFY+T WSLQEHF NPA TT P KW KF+SNL VVL+ FEAQPL DDDG N Sbjct: 246 
GGISVDFNFYQTLWSLQEHFRNPALTTTNPTKWQKFASNLTVVLNTFEAQPLCDDDGKHN 305 Query: 1652 NLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQCLILFDYLKAPGKNDKDAPSETI 1473 NL++EE F+IKYLTSSKLMGLELKD FRRHIL QCLI FDYLKAPGK+DK+ PSE++ Sbjct: 306 NLEQEEDAAFNIKYLTSSKLMGLELKDASFRRHILVQCLIFFDYLKAPGKSDKEGPSESM 365 Query: 1472 KEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREKNWVWWKRDGCPPFEKHPTEKKAG 1293 KEEIKSCEERVK LLEMIPPKGKEFL SIEHILEREKNWVWWKRDGCP FEK P EKK G Sbjct: 366 KEEIKSCEERVKNLLEMIPPKGKEFLQSIEHILEREKNWVWWKRDGCPAFEKQPFEKKPG 425 Query: 1292 QDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQRVRTPSVMEYWKPLAEDMDFSAGI 1113 GVRKR+PRWRLGNKEL+QLWKWA+ NPNALTD RVRTPSV EYWKPLA+DMD SAGI Sbjct: 426 --GVRKRKPRWRLGNKELAQLWKWAELNPNALTDPDRVRTPSVTEYWKPLADDMDASAGI 483 Query: 1112 EEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIEGVVPPDLLPPEVRSKHHNKP 942 EEEYHHKNN+VYCWKGLRFSARQDL+GFSRF E GIEGVVP +LLPPEVR+K+++KP Sbjct: 484 EEEYHHKNNRVYCWKGLRFSARQDLDGFSRFCEYGIEGVVPTELLPPEVRAKYNSKP 540 >ref|NP_001048715.1| Os03g0110400 [Oryza sativa Japonica Group] gi|108705792|gb|ABF93587.1| THO complex subunit 1, putative, expressed [Oryza sativa Japonica Group] gi|113547186|dbj|BAF10629.1| Os03g0110400 [Oryza sativa Japonica Group] gi|218191944|gb|EEC74371.1| hypothetical protein OsI_09688 [Oryza sativa Indica Group] Length = 638 Score = 839 bits (2167), Expect = 0.0 Identities = 420/551 (76%), Positives = 462/551 (83%), Gaps = 18/551 (3%) Frame = -3 Query: 2540 MAEATP------GLRILLHQHQKERAPVH------------ISSHADRDRVLEIFKGALS 2415 MAE TP GLRILL K+R P +SSH DRDR++ +F+ ALS Sbjct: 1 MAEPTPPLPSNAGLRILL---SKDRPPASSSSALAAATSAAVSSHTDRDRIIGVFRDALS 57 Query: 2414 QPGPPSDFALQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQSGERIMRFGQ 2235 + P FALQ +QDAIKPQKQTVLV +ENQSLENALR LLQELASSAVQSG+RIM++G Sbjct: 58 RTESPEAFALQAVQDAIKPQKQTVLVLEENQSLENALRKLLQELASSAVQSGKRIMQYG- 116 Query: 2234 SIDDSESALGHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKEVFGYIESKQ 2055 D+E I RLLDIVLYLCERGHVEGGM+FQLLEDLTEMST++DCK+VFGYIESKQ Sbjct: 117 ---DNEENNCPITRLLDIVLYLCERGHVEGGMVFQLLEDLTEMSTIKDCKDVFGYIESKQ 173 Query: 2054 
DILGKQELFGRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGV 1875 D+LGKQELFGRGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSERSA+NIKGV Sbjct: 174 DVLGKQELFGRGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSERSALNIKGV 233 Query: 1874 FNTSNETKYEKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFSSNLMVVLDA 1695 FNTSNETKYEKDA DGISVDFNFY T WSLQEHF NPA T +W KF SNL VVL Sbjct: 234 FNTSNETKYEKDATDGISVDFNFYNTLWSLQEHFSNPALTAANLTRWQKFVSNLTVVLST 293 Query: 1694 FEAQPLSDDDGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQCLILFDYLK 1515 FEAQPLSDDDG NNLD+EE F+IKYLTSSKLMGLELKDP FRRHIL QCLI FD+LK Sbjct: 294 FEAQPLSDDDGKLNNLDQEEDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLIFFDFLK 353 Query: 1514 APGKNDKDAPSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREKNWVWWKRDG 1335 APGK DK+ P+ ++KEEI SCEERVKKLLE+IPPKGK+FL SIEHILEREKNWVWWKRDG Sbjct: 354 APGKTDKEGPTGSMKEEIDSCEERVKKLLEIIPPKGKDFLQSIEHILEREKNWVWWKRDG 413 Query: 1334 CPPFEKHPTEKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQRVRTPSVMEY 1155 C FEK P EKK GQ GVRKR+PRWRLGNKEL+QLWKWA+QNPNALTD++R+ PSV EY Sbjct: 414 CLAFEKQPFEKKPGQAGVRKRKPRWRLGNKELAQLWKWAEQNPNALTDSERICMPSVTEY 473 Query: 1154 WKPLAEDMDFSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIEGVVPPDLLP 975 WKPLAEDMD SAGIE+EYHHKNN+VYCWKGLRFSARQDLEGFSRF + GIEGVVP +LLP Sbjct: 474 WKPLAEDMDPSAGIEDEYHHKNNRVYCWKGLRFSARQDLEGFSRFCDYGIEGVVPQELLP 533 Query: 974 PEVRSKHHNKP 942 PEVRSK ++KP Sbjct: 534 PEVRSKFYSKP 544 >ref|XP_004986033.1| PREDICTED: THO complex subunit 1-like [Setaria italica] Length = 638 Score = 838 bits (2166), Expect = 0.0 Identities = 415/543 (76%), Positives = 463/543 (85%), Gaps = 10/543 (1%) Frame = -3 Query: 2540 MAEATP-----GLRILLHQHQKERAPVH-----ISSHADRDRVLEIFKGALSQPGPPSDF 2391 MAE +P GLRILL + + +P +SSH DRDR++ +F+ ALS+ PP F Sbjct: 1 MAEPSPPASNAGLRILLSKDRPSPSPPSTAGSTVSSHTDRDRIIGVFRSALSRTEPPETF 60 Query: 2390 ALQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQSGERIMRFGQSIDDSESA 2211 ALQT+Q+AIKPQK+TVLV +ENQSLENALRTLLQEL SSAVQSG++IM++G S+D ES Sbjct: 61 ALQTVQEAIKPQKETVLVLEENQSLENALRTLLQELVSSAVQSGKKIMQYGNSLDSGESN 120 Query: 2210 
LGHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKEVFGYIESKQDILGKQEL 2031 I RLLDIVLYLCERGHVEGGM+FQLLEDLTEMST++DCK++FGYIES+QD+LGKQEL Sbjct: 121 C-LITRLLDIVLYLCERGHVEGGMVFQLLEDLTEMSTIKDCKDIFGYIESQQDVLGKQEL 179 Query: 2030 FGRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK 1851 FGRGKLVMLRTCNQLLRRLSKANDVVFCGRI+MFLAHFFPLSERSA+NIKGVFNTSN TK Sbjct: 180 FGRGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSERSALNIKGVFNTSNVTK 239 Query: 1850 YEKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFSSNLMVVLDAFEAQPLSD 1671 YEKDA DGISVDFNFYKT WSLQEHF NPA T AKW KFSSNL VVL FEA PLSD Sbjct: 240 YEKDATDGISVDFNFYKTLWSLQEHFSNPALTNTNLAKWQKFSSNLAVVLSTFEALPLSD 299 Query: 1670 DDGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQCLILFDYLKAPGKNDKD 1491 DDG NNLD+EE F+IKYLTSSKLMGLELKDP FRRHIL QCLI FDYLKAPGKN+K+ Sbjct: 300 DDGKLNNLDQEEDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLIFFDYLKAPGKNEKE 359 Query: 1490 APSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREKNWVWWKRDGCPPFEKHP 1311 P+ +K+EIKSCEERVKKLLE+IPPKGKEFL SIEHILEREKNWVWWKRDGC FEK P Sbjct: 360 GPTGGMKDEIKSCEERVKKLLEVIPPKGKEFLKSIEHILEREKNWVWWKRDGCLAFEKAP 419 Query: 1310 TEKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQRVRTPSVMEYWKPLAEDM 1131 EKK Q G RKR+PRWRLGNKELSQLWKWA+QNPN LT+ RVR PS+ EYWKPLAEDM Sbjct: 420 FEKKPVQAGGRKRKPRWRLGNKELSQLWKWAEQNPNVLTNPDRVRMPSITEYWKPLAEDM 479 Query: 1130 DFSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIEGVVPPDLLPPEVRSKHH 951 D SAGIEEEYHHK+N+VYCWKGLRFSARQDL+GF+RFS+ GIEGVVP +LLPPEV +K Sbjct: 480 DPSAGIEEEYHHKSNRVYCWKGLRFSARQDLDGFARFSDYGIEGVVPSELLPPEVNAKFS 539 Query: 950 NKP 942 +KP Sbjct: 540 SKP 542 >ref|XP_007148665.1| hypothetical protein PHAVU_005G004500g [Phaseolus vulgaris] gi|561021929|gb|ESW20659.1| hypothetical protein PHAVU_005G004500g [Phaseolus vulgaris] Length = 604 Score = 835 bits (2158), Expect = 0.0 Identities = 423/612 (69%), Positives = 478/612 (78%), Gaps = 2/612 (0%) Frame = -3 Query: 2441 LEIFKGALSQPGPPSDFALQTMQDAIKPQKQTVLVQDENQSLENALRTLLQELASSAVQS 2262 +E+FK A+ QPGPP +FAL+T+Q+ IKPQKQT L QDENQ LEN LR LLQE S+AV S Sbjct: 1 
MEVFKRAILQPGPPENFALKTVQEVIKPQKQTKLAQDENQFLENILRMLLQEFVSAAV-S 59 Query: 2261 GERIMRFGQSIDDSESALGHIPRLLDIVLYLCERGHVEGGMIFQLLEDLTEMSTMRDCKE 2082 E+IM+FGQSID +E+ GHIPRLLDIVLYLCE+ H+EGGMIFQLLEDLTEMSTM++CK+ Sbjct: 60 AEKIMQFGQSIDSNETTQGHIPRLLDIVLYLCEKEHIEGGMIFQLLEDLTEMSTMKNCKD 119 Query: 2081 VFGYIESKQDILGKQELFGRGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 1902 VFGYIESKQDILGKQELF RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE Sbjct: 120 VFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSE 179 Query: 1901 RSAVNIKGVFNTSNETKYEKDAPDGISVDFNFYKTFWSLQEHFCNPASTTIAPAKWHKFS 1722 RSA+NIKGVFNTSNETK+EK+ +GI +DFNFY+TFW LQE F NP S + AP KW KF+ Sbjct: 180 RSALNIKGVFNTSNETKFEKEPLEGICIDFNFYQTFWGLQEFFSNPTSISHAPVKWQKFT 239 Query: 1721 SNLMVVLDAFEAQPLSDDDGNANNLDEEEATTFSIKYLTSSKLMGLELKDPGFRRHILFQ 1542 S+L VVL+ FEAQPLSD++G+ANNL EEEA FSIKYLTSSKLMGLELKDP FRRH+L Q Sbjct: 240 SSLSVVLNTFEAQPLSDEEGDANNL-EEEAVNFSIKYLTSSKLMGLELKDPSFRRHVLVQ 298 Query: 1541 CLILFDYLKAPGKNDKDAPSETIKEEIKSCEERVKKLLEMIPPKGKEFLYSIEHILEREK 1362 CLILFDYLKAPGK DKD PSE +KEEI SCEERVKKLLE+ PPKG EFL+ IEHILEREK Sbjct: 299 CLILFDYLKAPGKGDKDLPSENMKEEITSCEERVKKLLELTPPKGSEFLHKIEHILEREK 358 Query: 1361 NWVWWKRDGCPPFEKHPTEKKAGQDGVRKRRPRWRLGNKELSQLWKWADQNPNALTDTQR 1182 NWVWWKRDGC P+EK P EKKA +G +KRRPRWRLGNKELSQLWKWADQNPNALTD QR Sbjct: 359 NWVWWKRDGCLPYEKQPIEKKAVPEGSKKRRPRWRLGNKELSQLWKWADQNPNALTDPQR 418 Query: 1181 VRTPSVMEYWKPLAEDMDFSAGIEEEYHHKNNKVYCWKGLRFSARQDLEGFSRFSESGIE 1002 V+TPS+MEYWKPLA+DMD SAGIE EYHHKNN+VYCWKGLR +ARQDLEGFS+F++ GIE Sbjct: 419 VQTPSIMEYWKPLADDMDPSAGIEAEYHHKNNRVYCWKGLRLAARQDLEGFSKFTDHGIE 478 Query: 1001 GVVPPDLLPPEVRSKHHNKPXXXXXXXXXXXXXXEPTQQVEDNQXXXXXXXXXXXXXXXX 822 GVVP +LLPP+VRSK+ KP QVE+NQ Sbjct: 479 GVVPLELLPPDVRSKYQAKP-NDRSKRSKKEETKGSAHQVEENQIATTATELDGDGIRTD 537 Query: 821 XXXXXXXXXXXXXXXXXDISQSGAPTPEQ--KQSPSGELGQEVGQSXXXXXXXXXXXAGI 648 +Q G PTPE+ K S ++GQE GQ AGI Sbjct: 538 TTATPMEFDGASVPG----TQGGTPTPEELHKHSSDTDVGQEAGQ----LEAEAEVEAGI 589 Query: 647 IDGETDLEAEPD 612 IDGETD + + D Sbjct: 590 IDGETDADVDLD 601