BLASTX nr result
ID: Cinnamomum24_contig00012774
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum24_contig00012774
         (2531 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_010656678.1| PREDICTED: SART-1 family protein DOT2 [Vitis...   910   0.0
ref|XP_010256356.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   894   0.0
ref|XP_012077379.1| PREDICTED: SART-1 family protein DOT2 isofor...   838   0.0
ref|XP_008806835.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   836   0.0
ref|XP_008806833.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   836   0.0
ref|XP_012441144.1| PREDICTED: SART-1 family protein DOT2 [Gossy...   834   0.0
ref|XP_006836392.1| PREDICTED: SART-1 family protein DOT2 [Ambor...   831   0.0
ref|XP_011094061.1| PREDICTED: SART-1 family protein DOT2 [Sesam...   828   0.0
ref|XP_011011622.1| PREDICTED: SART-1 family protein DOT2 isofor...   825   0.0
ref|XP_010926911.1| PREDICTED: SART-1 family protein DOT2 [Elaei...   824   0.0
ref|XP_002297938.2| hypothetical protein POPTR_0001s11550g [Popu...   823   0.0
ref|XP_002516516.1| conserved hypothetical protein [Ricinus comm...   821   0.0
ref|XP_011011623.1| PREDICTED: SART-1 family protein DOT2 isofor...   818   0.0
ref|XP_007022025.1| U4/U6.U5 tri-snRNP-associated protein 1 isof...   809   0.0
ref|XP_010102332.1| hypothetical protein L484_015280 [Morus nota...   806   0.0
ref|XP_010033990.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   805   0.0
ref|XP_009405353.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   803   0.0
gb|KJB61483.1| hypothetical protein B456_009G361400 [Gossypium r...   801   0.0
ref|XP_012077380.1| PREDICTED: SART-1 family protein DOT2 isofor...
801 0.0 gb|KHG25959.1| U4/U6.U5 tri-snRNP-associated 1 [Gossypium arboreum] 800 0.0 >ref|XP_010656678.1| PREDICTED: SART-1 family protein DOT2 [Vitis vinifera] gi|296090475|emb|CBI40671.3| unnamed protein product [Vitis vinifera] Length = 944 Score = 910 bits (2352), Expect = 0.0 Identities = 472/713 (66%), Positives = 560/713 (78%), Gaps = 10/713 (1%) Frame = +2 Query: 14 KNRDQGYDKEMLRSKDGERDEKLKLDYEDTRD--ITMQVKEVQNGESDNPEKLSTKDHKQ 187 KNRD+G+D RSKDG +D+KLKLD D RD +T Q + + E D+ +H++ Sbjct: 241 KNRDEGHD----RSKDGGKDDKLKLDGGDNRDRDVTKQGRGSHHDEDDS----RAIEHEK 292 Query: 188 ITESAADGSHPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEEKLNAEKVKAL 367 E A+ G T+ L+ RI++MKEER+K+KSEG SE+LAWV++SRKVEE+ NAEK KAL Sbjct: 293 NAEGAS-GPQSSTAQLQERILRMKEERVKRKSEGSSEVLAWVNRSRKVEEQRNAEKEKAL 351 Query: 368 HLSKVFEEQDNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVLTLRDQNILTD 547 LSK+FEEQDN+ QGESDDE+ +H ++DLAGVK+LHGLDKVIEGGAVVLTL+DQ+IL + Sbjct: 352 QLSKIFEEQDNIDQGESDDEKPTRHSSQDLAGVKVLHGLDKVIEGGAVVLTLKDQDILAN 411 Query: 548 GDINEEADMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKILPQYDDPADEE 727 GDINE+ DMLENVEIGEQKRRDEAYKA+KKKTG Y+DKFN++PGS+KKILPQYDDP +E Sbjct: 412 GDINEDVDMLENVEIGEQKRRDEAYKAAKKKTGIYEDKFNDEPGSEKKILPQYDDPVTDE 471 Query: 728 GVTLDGSGHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTHEEMVQFXXXX 907 G+ LD SG F+GEA +QG +N+F+DL T GK SSDYYTHEEM+QF Sbjct: 472 GLALDASGRFTGEAEKKLEELRRRLQGVSTNNRFEDLNTYGKNSSDYYTHEEMLQFKKPK 531 Query: 908 XXXXXXXXXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVEADMRTEXXXX 1087 L++DALEAEA+SAGLG GDLGSRN+ +RQS + E+ER EA+MR Sbjct: 532 KKKSLRKKEK-LNIDALEAEAVSAGLGVGDLGSRNDGKRQSIREEQERSEAEMRNSAYQL 590 Query: 1088 XXXXXXXXXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALKRQDEAGASGM 1267 LR +QTL +QLEE+EN VFG DDE+L KSL++ARKL L++QDEA SG Sbjct: 591 AYAKADEASKALRLDQTLPVQLEENENQVFGEDDEELQKSLQRARKLVLQKQDEAATSGP 650 Query: 1268 QVVARLAETNNESE--ETQNHVSGG-----IVITEMEEFVSKIHLDEEIHKPEADDVFKD 1426 Q +A LA T S+ + QN +SG +V TEMEEFV + L++E HKP+ +DVF D Sbjct: 651 QAIALLASTTTSSQNVDNQNPISGESQENRVVFTEMEEFVWGLQLEDEAHKPDGEDVFMD 710 
Query: 1427 E-EVPKSFEKEMEDEAGGWMEVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSGALQL 1603 E E PK+ ++E +DEAGGW EVKDT DELP+NE KE++VPD IHE AVGKGLSGALQL Sbjct: 711 EDEAPKASDQERKDEAGGWTEVKDTDKDELPVNENKEEMVPDDTIHEVAVGKGLSGALQL 770 Query: 1604 LKERGTLKETIDWGGRNMDKKKSKLVGIYENDGTKEIRIERTDEFGRIMTPKEAFRIISH 1783 LKERGTLKE I+WGGRNMDKKKSKLVGIY+N GTKEIRIERTDEFGRIMTPKEAFR+ISH Sbjct: 771 LKERGTLKEGIEWGGRNMDKKKSKLVGIYDNTGTKEIRIERTDEFGRIMTPKEAFRMISH 830 Query: 1784 KFHGKGPGKMKLEKKIKKYQEELKSKQMKASDTPSESMERMREAQARLKTPYLVLSGHVK 1963 KFHGKGPGKMK EK++K+YQEELK KQMK SDTPS+S+ERMREAQARLKTPYLVLSGHVK Sbjct: 831 KFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSQSVERMREAQARLKTPYLVLSGHVK 890 Query: 1964 PGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 2122 PGQTSDPRSGFATVEKD PGSLTPMLGDRKVEHFLGIKRKAEPS+MGPPKK K Sbjct: 891 PGQTSDPRSGFATVEKDVPGSLTPMLGDRKVEHFLGIKRKAEPSNMGPPKKPK 943 >ref|XP_010256356.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001422|ref|XP_010256357.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001427|ref|XP_010256358.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001430|ref|XP_010256359.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001433|ref|XP_010256360.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001436|ref|XP_010256361.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] Length = 851 Score = 894 bits (2309), Expect = 0.0 Identities = 463/703 (65%), Positives = 555/703 (78%), Gaps = 7/703 (0%) Frame = +2 Query: 35 DKEMLRSKDGERDEKLKLDYEDTRDITMQVKEVQNGESDNPEKLSTKDHKQITESAADGS 214 D+ R KD +DEKL LD + RD+ QVKEVQ+ D +S ++ K++ + A GS Sbjct: 153 DESQGRGKDVGKDEKLDLDGGNDRDVVKQVKEVQH---DVVVDMSVENKKKV-DGAMGGS 208 Query: 215 HPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEEKLNAEKVKALHLSKVFEEQ 394 P T +LE RI+KM+EER KKKSEGVSE+L+WV+KSRK+EEK NAEK KAL LSKVFEEQ Sbjct: 209 QPSTGELEERILKMREERSKKKSEGVSEVLSWVNKSRKLEEKRNAEKQKALQLSKVFEEQ 268 Query: 
395 DNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVLTLRDQNILTDGDINEEADM 574 D + QGES+DE+ +H +KDLAGVKILHG+DKVIEGGAVVLTL+DQNIL + D+NEEAD+ Sbjct: 269 DKIDQGESEDEDTARHTSKDLAGVKILHGIDKVIEGGAVVLTLKDQNILANDDVNEEADV 328 Query: 575 LENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKILPQYDDPADEEGVTLDGSGH 754 LENVEIGEQK+RD AYKA+KKKTG Y+DKF+ + G+QKKILPQYDDP ++EG+ LD SG Sbjct: 329 LENVEIGEQKQRDAAYKAAKKKTGIYEDKFSGEDGAQKKILPQYDDPVEDEGLVLDESGR 388 Query: 755 FSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTHEEMVQFXXXXXXXXXXXXX 934 F+GEA +QG SN F+DL ++ K++SD+YTHEEM+QF Sbjct: 389 FAGEAEKKLEELRKRLQGVSASNHFEDLNSSAKITSDFYTHEEMLQFKKPKKKKSLRKKV 448 Query: 935 XXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVEADMRTEXXXXXXXXXXXXX 1114 LDLDALEAEAISAG G GDLGSR + +RQ+ K ++ER EA+MR+ Sbjct: 449 K-LDLDALEAEAISAGFGVGDLGSRKDGQRQATKEQQERSEAEMRSNAYQSAFAKAEEAS 507 Query: 1115 XXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALKRQDEAGASGMQVVARLAET 1294 LRQEQTLT+Q+EE+E+ VFG D+EDLYKSLEKARKLALK Q+EA ASG Q VA LA T Sbjct: 508 KTLRQEQTLTVQVEENESPVFGDDEEDLYKSLEKARKLALKTQNEAAASGPQAVALLAST 567 Query: 1295 -NNESEETQNHVSGG-----IVITEMEEFVSKIHLDEEIHKPEADDVFKDEE-VPKSFEK 1453 +N+ ++ +N SG +V TEMEEFV + L+EE K E++DVF DE+ VPK+ ++ Sbjct: 568 VSNQPKDEENLTSGEPQENKVVFTEMEEFVWGLQLNEEARKLESEDVFMDEDNVPKASDQ 627 Query: 1454 EMEDEAGGWMEVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSGALQLLKERGTLKET 1633 E++DEAGGW EV D +E P+ EEKE++VPD+ IHE A+GKGLSGAL+LLKERGTLKET Sbjct: 628 EIKDEAGGWTEVNDIDENEHPVEEEKEEVVPDETIHEVAIGKGLSGALKLLKERGTLKET 687 Query: 1634 IDWGGRNMDKKKSKLVGIYENDGTKEIRIERTDEFGRIMTPKEAFRIISHKFHGKGPGKM 1813 +DWGGRNMDKKKSKLVGIY++ G KEIRIERTDEFGRIMTPKEAFR+ISHKFHGKGPGKM Sbjct: 688 VDWGGRNMDKKKSKLVGIYDDGGPKEIRIERTDEFGRIMTPKEAFRVISHKFHGKGPGKM 747 Query: 1814 KLEKKIKKYQEELKSKQMKASDTPSESMERMREAQARLKTPYLVLSGHVKPGQTSDPRSG 1993 K EK++K+YQEELK KQMK SDTPS+SMERMREAQARLKTPYLVLSGHVKPGQTSDPRSG Sbjct: 748 KQEKRMKQYQEELKLKQMKNSDTPSQSMERMREAQARLKTPYLVLSGHVKPGQTSDPRSG 807 Query: 1994 FATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 2122 FATVEKD PG LTPMLGD+KVEHFLGIKRKAEPS+MGPPKK K Sbjct: 808 
FATVEKDIPGGLTPMLGDKKVEHFLGIKRKAEPSNMGPPKKSK 850 >ref|XP_012077379.1| PREDICTED: SART-1 family protein DOT2 isoform X1 [Jatropha curcas] gi|643724962|gb|KDP34163.1| hypothetical protein JCGZ_07734 [Jatropha curcas] Length = 864 Score = 838 bits (2164), Expect = 0.0 Identities = 445/721 (61%), Positives = 534/721 (74%), Gaps = 18/721 (2%) Frame = +2 Query: 14 KNRDQGYDKEMLRSKDGERDEKLKLDYEDTRDITMQVKEVQNGESD----NPEKLSTKDH 181 + RD YDKE LR ++ + DY+ ++D +++ N +S + KD Sbjct: 146 RERDSDYDKERLRDREKVSKRSHEEDYDRSKDDVVEMDYENNKDSSVLKQSKVSFDNKDE 205 Query: 182 KQITESAADGSHPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEEKLNAEKVK 361 ++ E++ GS P S LE RI+KMKEERLKK SE E+LAWV++SRK+EEK NAEK K Sbjct: 206 QKAEETSRGGS-APVSQLEERILKMKEERLKKNSEPGDEVLAWVNRSRKLEEKKNAEKQK 264 Query: 362 ALHLSKVFEEQDNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVLTLRDQNIL 541 A LSK+FEEQDN VQGES+DE+ +H T DLAGVK+LHGL+KV+EGGAVVLTL+DQ+IL Sbjct: 265 AKQLSKIFEEQDNNVQGESEDEDSGEHTTHDLAGVKVLHGLEKVMEGGAVVLTLKDQSIL 324 Query: 542 TDGDINEEADMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKILPQYDDPAD 721 DGDINEE DMLENVEIGEQKRRD+AYKA+KKKTG YDDKFN+DP S+KKILPQYDD A Sbjct: 325 ADGDINEEVDMLENVEIGEQKRRDDAYKAAKKKTGIYDDKFNDDPASEKKILPQYDDSAA 384 Query: 722 EEGVTLDGSGHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTHEEMVQFXX 901 +EGV LD G F+GEA +QG +N+F+DL+++GK+SSDYYTHEE++QF Sbjct: 385 DEGVALDERGRFTGEAEKKLEELRRRLQGVSTNNRFEDLSSSGKISSDYYTHEELLQF-K 443 Query: 902 XXXXXXXXXXXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVEADMRTEXX 1081 LD+DALEAEA+SAGLG GDLGSRN RRQ+ + E+ER EA+MR+ Sbjct: 444 KPKKKKSLRKKEKLDIDALEAEAVSAGLGVGDLGSRNNGRRQAIRQEQERSEAEMRSSAY 503 Query: 1082 XXXXXXXXXXXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALKRQDEAGAS 1261 LRQEQTL +L+EDEN VF DDEDLYKSLE+ARKLALK+Q+E AS Sbjct: 504 QAAYDKADEASKSLRQEQTLHAKLDEDENPVFAEDDEDLYKSLERARKLALKKQEEK-AS 562 Query: 1262 GMQVVARLA----ETNNESEETQNHVSG-----GIVITEMEEFVSKIHLDEEIHKPEADD 1414 G Q +ARLA T++++ + QN +G IV TEMEEFV + LDEE HK DD Sbjct: 563 GPQAIARLAAATTTTSSQTTDDQNPTTGESQENKIVFTEMEEFVWGLQLDEESHKHGNDD 622 Query: 1415 
VFKDE-EVPKSFEKEMEDEAGGWMEVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSG 1591 VF DE E P ++E +DE GGW EV+D DE P+NE ED+VPD+ IHE VGKGLS Sbjct: 623 VFMDEDEAPIVSDQEKKDETGGWTEVQDIDKDENPVNENNEDIVPDETIHEVPVGKGLSA 682 Query: 1592 ALQLLKERGTLKETIDWGGRNMDKKKSKLVGI----YENDGTKEIRIERTDEFGRIMTPK 1759 AL+LLKERGTLKE+ +WGGRNMDKKKSKLVGI +N+ K+IRI+RTDE+GR +TPK Sbjct: 683 ALKLLKERGTLKESTEWGGRNMDKKKSKLVGIVDSDVDNERFKDIRIDRTDEYGRTLTPK 742 Query: 1760 EAFRIISHKFHGKGPGKMKLEKKIKKYQEELKSKQMKASDTPSESMERMREAQARLKTPY 1939 EAFRIISHKFHGKGPGKMK EK++K+Y EELK KQMK SDTPS S+ERMREAQA+LKTPY Sbjct: 743 EAFRIISHKFHGKGPGKMKQEKRMKQYLEELKMKQMKNSDTPSLSVERMREAQAQLKTPY 802 Query: 1940 LVLSGHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQ 2119 LVLSGHVKPGQTSDPRSGFATVEKD PG LTPMLGD+KVEHFLGIKRKAEP + PKK Sbjct: 803 LVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDKKVEHFLGIKRKAEPGNSNAPKKP 862 Query: 2120 K 2122 K Sbjct: 863 K 863 >ref|XP_008806835.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 isoform X2 [Phoenix dactylifera] Length = 1013 Score = 836 bits (2160), Expect = 0.0 Identities = 440/705 (62%), Positives = 534/705 (75%), Gaps = 6/705 (0%) Frame = +2 Query: 26 QGYDKEMLRSKDGERDEKLKLDYEDTRDITMQVKEVQNGESDNPEKLSTKDHKQITESAA 205 +G ++E+ R+++GE+DEK+K D D+R I + +EVQ+ E D +T + Sbjct: 321 RGKEREIGRAREGEKDEKVKGDGGDSR-IARKGQEVQDDEGD------------LTHNEK 367 Query: 206 DGSHPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEEKLNAEKVKALHLSKVF 385 S TS LE R++KMKEERLK+KS+G SEI +WV+KSRK+EEK AEK KAL LSK Sbjct: 368 PLSSISTSKLEERVVKMKEERLKRKSDGASEISSWVNKSRKLEEKWTAEKEKALRLSKAL 427 Query: 386 EEQDNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVLTLRDQNILTDGDINEE 565 EEQDN++ ES+DEE H DLAG KILHGLDKV+EGGAVVLTL+DQ+IL DGDINEE Sbjct: 428 EEQDNIL-AESEDEEATGHSGNDLAGAKILHGLDKVMEGGAVVLTLKDQSILADGDINEE 486 Query: 566 ADMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKILPQYDDPADEEGVTLDG 745 ADMLENVEIGEQK+RDEAY+A+KK+TG YDDKF++D GSQK ILPQYD+ ++EGVTLD Sbjct: 487 ADMLENVEIGEQKQRDEAYRAAKKRTGLYDDKFSDDIGSQKTILPQYDNQNEDEGVTLDE 546 Query: 746 
SGHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTHEEMVQFXXXXXXXXXX 925 SG F+GEA I+G + +DLT++GK+SSDYYT +EM+QF Sbjct: 547 SGRFTGEAEKKLEELRKRIEGGAIKKSNEDLTSSGKISSDYYTPDEMLQFKKPKKKKSLR 606 Query: 926 XXXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVEADMRTEXXXXXXXXXX 1105 LDLDALEAEAISAGLGAGDLGSRN+ RRQ+AK E+E+ EA+ R+ Sbjct: 607 KKEK-LDLDALEAEAISAGLGAGDLGSRNDLRRQTAKEEQEKAEAEKRSHAYQSAIAKAE 665 Query: 1106 XXXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALKRQDEAGASGMQVVARL 1285 LRQEQT T++ ED+NLVFG D ED+++S+ +ARKLALK+QDE SG + VA + Sbjct: 666 EASKALRQEQTSTVKSVEDDNLVFGEDYEDVHRSIGQARKLALKKQDETAVSGPEAVALV 725 Query: 1286 AETNNESEETQNHVSGG-----IVITEMEEFVSKIHLDEEIHKPEADDVFKDEE-VPKSF 1447 A T E E+ G ++ITEMEEFV + + E+ HKPE++DVFKDEE +PK Sbjct: 726 ATTKKEQEDASPTEGGEPQENKVIITEMEEFVLGLQITEDTHKPESEDVFKDEEDIPKPL 785 Query: 1448 EKEMEDEAGGWMEVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSGALQLLKERGTLK 1627 E E E E GGW EV +T E +NEEKED+ PD+IIHET++GKGLSGAL+LLKERGTL Sbjct: 786 ELETEAEVGGWTEVMETDDTEAAVNEEKEDINPDEIIHETSMGKGLSGALKLLKERGTLN 845 Query: 1628 ETIDWGGRNMDKKKSKLVGIYENDGTKEIRIERTDEFGRIMTPKEAFRIISHKFHGKGPG 1807 E+IDWGGRNMDKKKSKLVGI +N+G KEIRIERTDEFGRIMTPKEAFR++SHKFHGKGPG Sbjct: 846 ESIDWGGRNMDKKKSKLVGINDNEGPKEIRIERTDEFGRIMTPKEAFRMLSHKFHGKGPG 905 Query: 1808 KMKLEKKIKKYQEELKSKQMKASDTPSESMERMREAQARLKTPYLVLSGHVKPGQTSDPR 1987 KMK EK++K+YQE+LK+KQMKASDTP +ME+MREAQARLKTPYLVLSGHVKPGQTSDPR Sbjct: 906 KMKQEKRMKQYQEDLKTKQMKASDTPLLAMEKMREAQARLKTPYLVLSGHVKPGQTSDPR 965 Query: 1988 SGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 2122 SGFATVEKDH GSLTPMLGD+KVEHFLGI RK + SMGPP +K Sbjct: 966 SGFATVEKDHLGSLTPMLGDKKVEHFLGINRKPDARSMGPPPPKK 1010 >ref|XP_008806833.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 isoform X1 [Phoenix dactylifera] Length = 1040 Score = 836 bits (2160), Expect = 0.0 Identities = 440/705 (62%), Positives = 534/705 (75%), Gaps = 6/705 (0%) Frame = +2 Query: 26 QGYDKEMLRSKDGERDEKLKLDYEDTRDITMQVKEVQNGESDNPEKLSTKDHKQITESAA 205 +G ++E+ R+++GE+DEK+K D D+R I + +EVQ+ E D +T + Sbjct: 348 
RGKEREIGRAREGEKDEKVKGDGGDSR-IARKGQEVQDDEGD------------LTHNEK 394 Query: 206 DGSHPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEEKLNAEKVKALHLSKVF 385 S TS LE R++KMKEERLK+KS+G SEI +WV+KSRK+EEK AEK KAL LSK Sbjct: 395 PLSSISTSKLEERVVKMKEERLKRKSDGASEISSWVNKSRKLEEKWTAEKEKALRLSKAL 454 Query: 386 EEQDNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVLTLRDQNILTDGDINEE 565 EEQDN++ ES+DEE H DLAG KILHGLDKV+EGGAVVLTL+DQ+IL DGDINEE Sbjct: 455 EEQDNIL-AESEDEEATGHSGNDLAGAKILHGLDKVMEGGAVVLTLKDQSILADGDINEE 513 Query: 566 ADMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKILPQYDDPADEEGVTLDG 745 ADMLENVEIGEQK+RDEAY+A+KK+TG YDDKF++D GSQK ILPQYD+ ++EGVTLD Sbjct: 514 ADMLENVEIGEQKQRDEAYRAAKKRTGLYDDKFSDDIGSQKTILPQYDNQNEDEGVTLDE 573 Query: 746 SGHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTHEEMVQFXXXXXXXXXX 925 SG F+GEA I+G + +DLT++GK+SSDYYT +EM+QF Sbjct: 574 SGRFTGEAEKKLEELRKRIEGGAIKKSNEDLTSSGKISSDYYTPDEMLQFKKPKKKKSLR 633 Query: 926 XXXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVEADMRTEXXXXXXXXXX 1105 LDLDALEAEAISAGLGAGDLGSRN+ RRQ+AK E+E+ EA+ R+ Sbjct: 634 KKEK-LDLDALEAEAISAGLGAGDLGSRNDLRRQTAKEEQEKAEAEKRSHAYQSAIAKAE 692 Query: 1106 XXXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALKRQDEAGASGMQVVARL 1285 LRQEQT T++ ED+NLVFG D ED+++S+ +ARKLALK+QDE SG + VA + Sbjct: 693 EASKALRQEQTSTVKSVEDDNLVFGEDYEDVHRSIGQARKLALKKQDETAVSGPEAVALV 752 Query: 1286 AETNNESEETQNHVSGG-----IVITEMEEFVSKIHLDEEIHKPEADDVFKDEE-VPKSF 1447 A T E E+ G ++ITEMEEFV + + E+ HKPE++DVFKDEE +PK Sbjct: 753 ATTKKEQEDASPTEGGEPQENKVIITEMEEFVLGLQITEDTHKPESEDVFKDEEDIPKPL 812 Query: 1448 EKEMEDEAGGWMEVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSGALQLLKERGTLK 1627 E E E E GGW EV +T E +NEEKED+ PD+IIHET++GKGLSGAL+LLKERGTL Sbjct: 813 ELETEAEVGGWTEVMETDDTEAAVNEEKEDINPDEIIHETSMGKGLSGALKLLKERGTLN 872 Query: 1628 ETIDWGGRNMDKKKSKLVGIYENDGTKEIRIERTDEFGRIMTPKEAFRIISHKFHGKGPG 1807 E+IDWGGRNMDKKKSKLVGI +N+G KEIRIERTDEFGRIMTPKEAFR++SHKFHGKGPG Sbjct: 873 ESIDWGGRNMDKKKSKLVGINDNEGPKEIRIERTDEFGRIMTPKEAFRMLSHKFHGKGPG 932 Query: 1808 KMKLEKKIKKYQEELKSKQMKASDTPSESMERMREAQARLKTPYLVLSGHVKPGQTSDPR 1987 KMK 
EK++K+YQE+LK+KQMKASDTP +ME+MREAQARLKTPYLVLSGHVKPGQTSDPR Sbjct: 933 KMKQEKRMKQYQEDLKTKQMKASDTPLLAMEKMREAQARLKTPYLVLSGHVKPGQTSDPR 992 Query: 1988 SGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 2122 SGFATVEKDH GSLTPMLGD+KVEHFLGI RK + SMGPP +K Sbjct: 993 SGFATVEKDHLGSLTPMLGDKKVEHFLGINRKPDARSMGPPPPKK 1037 >ref|XP_012441144.1| PREDICTED: SART-1 family protein DOT2 [Gossypium raimondii] gi|823216924|ref|XP_012441145.1| PREDICTED: SART-1 family protein DOT2 [Gossypium raimondii] gi|763794483|gb|KJB61479.1| hypothetical protein B456_009G361400 [Gossypium raimondii] gi|763794484|gb|KJB61480.1| hypothetical protein B456_009G361400 [Gossypium raimondii] gi|763794485|gb|KJB61481.1| hypothetical protein B456_009G361400 [Gossypium raimondii] gi|763794488|gb|KJB61484.1| hypothetical protein B456_009G361400 [Gossypium raimondii] Length = 900 Score = 834 bits (2155), Expect = 0.0 Identities = 448/730 (61%), Positives = 530/730 (72%), Gaps = 27/730 (3%) Frame = +2 Query: 14 KNRDQGYDKEMLRSKD-----------GERDEKLKLDYEDTRDITMQVKEVQNGESDNPE 160 KNR+ +KE R +D G +D +L LDYED RD Sbjct: 194 KNREADLEKERSRDRDNVGKNHEEDYEGSKDGELALDYEDRRD----------------- 236 Query: 161 KLSTKDHKQITE-SAADGSHPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEE 337 KD ++ S A +S+LE RI++MKE+RLKKKSEG+SE+ AWVS+SRK+E+ Sbjct: 237 ----KDEAELNAGSNASLVQASSSELEERIVRMKEDRLKKKSEGLSEVSAWVSRSRKLED 292 Query: 338 KLNAEKVKALHLSKVFEEQDNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVL 517 K NAEK KAL LSK+FEEQDN VQGE +DEE T DL GVK+LHGLDKV++GGAVVL Sbjct: 293 KRNAEKEKALQLSKIFEEQDNFVQGEDEDEEADNRPTHDLGGVKVLHGLDKVMDGGAVVL 352 Query: 518 TLRDQNILTDGDINEEADMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKIL 697 TL+DQ+IL DGD+NE+ DMLEN+EIGEQK+RDEAYKA+KKKTG YDDKFNEDPGS+KKIL Sbjct: 353 TLKDQSILADGDLNEDVDMLENIEIGEQKQRDEAYKAAKKKTGVYDDKFNEDPGSEKKIL 412 Query: 698 PQYDDPADEEGVTLDGSGHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTH 877 PQYDDP +EGVTLD G F+GEA + G P +N+ +DL GK+SSDYYT Sbjct: 413 PQYDDPVADEGVTLDERGRFTGEAEKKLEELRKRLLGVPTNNRVEDLNNVGKISSDYYTQ 472 Query: 878 
EEMVQFXXXXXXXXXXXXXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVE 1057 EEM++F LD+DALEAEA+SAGLGAGDLGSR + RRQ+ K EE R E Sbjct: 473 EEMLRF-KKPKKKKALRKKEKLDIDALEAEAVSAGLGAGDLGSRKDSRRQAIKEEEARSE 531 Query: 1058 ADMRTEXXXXXXXXXXXXXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALK 1237 A+ R LR EQT T++ EEDEN VF D+EDLYKSLEKAR+LALK Sbjct: 532 AEKRKNAYQAAFAKADEASKSLRLEQTHTVKPEEDENQVFADDEEDLYKSLEKARRLALK 591 Query: 1238 RQDEAGASGMQVVARLAETNNESEETQNHVSGG------IVITEMEEFVSKIHLDEEIHK 1399 +Q+E SG Q +A LA T+ ++ T +H S G +VITEMEEFV + LDEE HK Sbjct: 592 KQEE--KSGPQAIALLATTSASNQTTDDHTSTGEAQENKVVITEMEEFVWGLQLDEEAHK 649 Query: 1400 PEADDVFKDE-EVPKSFE---KEMEDEAGGWMEVKDTGADELPINEEKEDLVPDKIIHET 1567 P+++DVF DE EVP + E K E+E GGW EV DT ADE P NE+ +++VPD+ IHE Sbjct: 650 PDSEDVFMDEDEVPGASEQDRKNGENEVGGWTEVIDTSADEKPANEDNDEVVPDETIHEI 709 Query: 1568 AVGKGLSGALQLLKERGTLKETIDWGGRNMDKKKSKLVGIYENDGT-----KEIRIERTD 1732 AVGKGLSGAL+LLK+RGTLKETI+WGGRNMDKKKSKLVGI ++D K+IRIERTD Sbjct: 710 AVGKGLSGALKLLKDRGTLKETIEWGGRNMDKKKSKLVGIVDDDHQTDNRFKDIRIERTD 769 Query: 1733 EFGRIMTPKEAFRIISHKFHGKGPGKMKLEKKIKKYQEELKSKQMKASDTPSESMERMRE 1912 EFGRI+TPKEAFR++SHKFHGKGPGKMK EK++K+YQEELK KQMK SDTPS S+ERMRE Sbjct: 770 EFGRIVTPKEAFRMLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERMRE 829 Query: 1913 AQARLKTPYLVLSGHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEP 2092 AQA+LKTPYLVLSGHVKPGQTSDP SGFATVEKD PG LTPMLGDRKVEHFLGIKRKAE Sbjct: 830 AQAQLKTPYLVLSGHVKPGQTSDPASGFATVEKDFPGGLTPMLGDRKVEHFLGIKRKAEA 889 Query: 2093 SSMGPPKKQK 2122 + G PKK K Sbjct: 890 GNSGTPKKPK 899 >ref|XP_006836392.1| PREDICTED: SART-1 family protein DOT2 [Amborella trichopoda] gi|548838910|gb|ERM99245.1| hypothetical protein AMTR_s00092p00135160 [Amborella trichopoda] Length = 1028 Score = 831 bits (2147), Expect = 0.0 Identities = 437/712 (61%), Positives = 531/712 (74%), Gaps = 5/712 (0%) Frame = +2 Query: 2 KMIGKNRDQGYDKEMLRSKDGERDEKLKLDYEDTRDITMQVKEVQNGESDNPEKLSTKDH 181 K+ GK++D G DKE R K+GE++ K K+D D RDIT Q VQ+ + + ++ DH Sbjct: 319 
KVKGKSKDHGRDKEFDRGKEGEKEAKPKIDAWDGRDITEQEDNVQDDKDNTYDRTGAMDH 378 Query: 182 KQITESAADGSHPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEEKLNAEKVK 361 K+ E A S P TS++E R+ KM+EER+KKK+EGVSE+ +WV+KSRK+EEKL++EK K Sbjct: 379 KEKNEIQAGVSRPSTSEIEERLAKMREERMKKKNEGVSEVSSWVNKSRKIEEKLSSEKEK 438 Query: 362 ALHLSKVFEEQDNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVLTLRDQNIL 541 ALHL+KVF EQD+VVQ ESD+EE QH KDLAGVK+LHGL++VI GGAVVLTL+DQNIL Sbjct: 439 ALHLAKVFAEQDSVVQ-ESDEEEEAQHSGKDLAGVKVLHGLEQVIVGGAVVLTLKDQNIL 497 Query: 542 TDGDINEEADMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKILPQYDDPAD 721 DGD+N E DMLENVE+GEQKRRDEAYKA+KKK G Y+DKF +D GSQKKILPQYDD + Sbjct: 498 ADGDLNNEVDMLENVELGEQKRRDEAYKAAKKKPGIYEDKFADDDGSQKKILPQYDDTSK 557 Query: 722 EEGVTLDGSGHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTHEEMVQFXX 901 +EGV LD SGH + EA +QGA F+DLT TGK+SSDYYT EEM+QF Sbjct: 558 DEGVALDESGHITREAQKKLEELRKRLQGASTGQHFEDLTATGKVSSDYYTQEEMLQFKK 617 Query: 902 XXXXXXXXXXXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVEADMRTEXX 1081 LDLDALEAEAI++GLG GD GSR + +RQ AK EEE EA+ R E Sbjct: 618 PKKKKALRKKVK-LDLDALEAEAIASGLGVGDRGSRADAQRQRAKEEEEWAEAETRKEAY 676 Query: 1082 XXXXXXXXXXXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALKRQDEAGAS 1261 LR+EQTL ++ +EDENL FG DDEDL+KS+E+ARKLA K+QDE AS Sbjct: 677 QSAFAKANESTKALREEQTLKVEGDEDENLAFG-DDEDLHKSIEEARKLARKKQDEGAAS 735 Query: 1262 GMQVVARLAETNNESEETQ---NHVSGGIVITEMEEFVSKIHLDEEIHKPEADDVFK-DE 1429 G VA+LA + +ES++ + +V TE++EFV + DE P+A+DVFK D+ Sbjct: 736 GPLAVAQLAVSASESKDAEASGEPQENRLVFTEVDEFVLGLQHDEGAQNPDAEDVFKEDD 795 Query: 1430 EVPKSFEK-EMEDEAGGWMEVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSGALQLL 1606 EV ++ E ++ GGW +V ++ DE EE E++VPD I E VGKGLSGALQLL Sbjct: 796 EVQNPIKQDEPMEQVGGWTDVIESEKDEQMKTEEDEEVVPDATIQEAVVGKGLSGALQLL 855 Query: 1607 KERGTLKETIDWGGRNMDKKKSKLVGIYENDGTKEIRIERTDEFGRIMTPKEAFRIISHK 1786 KERGTLKE IDWGGRNMDKKKSKLVG+ ENDG KEI ++R DEFGRIMTPKEAFR +SHK Sbjct: 856 KERGTLKEAIDWGGRNMDKKKSKLVGVRENDGAKEIVLDRLDEFGRIMTPKEAFRKLSHK 915 Query: 1787 FHGKGPGKMKLEKKIKKYQEELKSKQMKASDTPSESMERMREAQARLKTPYLVLSGHVKP 1966 
FHGKGPGKMK EK++K++ EELK KQMKASDTP SME+MREAQA+ ++PY+VLSG +KP Sbjct: 916 FHGKGPGKMKQEKRMKQFMEELKLKQMKASDTPLLSMEKMREAQAKTRSPYIVLSGQIKP 975 Query: 1967 GQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 2122 GQTSDPRSGFATVEKD PGSLTPMLGDRKVEHFLGIKRKAEPS+MGPPKK K Sbjct: 976 GQTSDPRSGFATVEKDQPGSLTPMLGDRKVEHFLGIKRKAEPSNMGPPKKPK 1027 >ref|XP_011094061.1| PREDICTED: SART-1 family protein DOT2 [Sesamum indicum] Length = 942 Score = 828 bits (2140), Expect = 0.0 Identities = 429/711 (60%), Positives = 529/711 (74%), Gaps = 8/711 (1%) Frame = +2 Query: 14 KNRDQGYDKEMLRSKDGERDEKLKLDYEDTRDITMQVKEVQNGESDNPEKLSTKDHKQIT 193 K +D+ +D RSKD ++D +L+ + +RD + N + +N K+ H++ Sbjct: 238 KQKDESHD----RSKDTDKDGHSRLENDYSRDKQSTKELADNSDDENDSKILK--HQEKA 291 Query: 194 ESAADGSHPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEEKLNAEKVKALHL 373 ++A GS S+LE RI KM+EERLKK SEG SE+LAWV++SRK+EEK AEK KAL L Sbjct: 292 DTAIAGSRQSASELEDRISKMREERLKKPSEGASEVLAWVNRSRKLEEKRTAEKEKALQL 351 Query: 374 SKVFEEQDNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVLTLRDQNILTDGD 553 SK+FEEQDN+ GESD+E +H T+DL GVKILHGLDKV+EGGAVVLTL+DQ+IL DGD Sbjct: 352 SKIFEEQDNMNGGESDEEAAAEHTTQDLGGVKILHGLDKVLEGGAVVLTLKDQSILADGD 411 Query: 554 INEEADMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKILPQYDDPADEEGV 733 INEE DMLENVEIGEQKRRDEAYKA+KKKTG YDDKF+++PG++KKILPQYDDP +EGV Sbjct: 412 INEEVDMLENVEIGEQKRRDEAYKAAKKKTGIYDDKFSDEPGAEKKILPQYDDPVADEGV 471 Query: 734 TLDGSGHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTHEEMVQFXXXXXX 913 TLD SG F+GEA IQG S + +DL +T K+ +DYYT +EM +F Sbjct: 472 TLDSSGRFTGEAERKLEELRRRIQGVSTSTRGEDLNSTAKILTDYYTQDEMTKFKKPKKK 531 Query: 914 XXXXXXXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVEADMRTEXXXXXX 1093 LDLDALEAEA SAGLGAGDLGSRN+ RRQ+ + E+E++EA+MR Sbjct: 532 KSLRKKEK-LDLDALEAEARSAGLGAGDLGSRNDGRRQNLREEQEKIEAEMRRNAYESAY 590 Query: 1094 XXXXXXXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALKRQDEAGASGMQV 1273 LRQEQ +Q EED+ VFG DD++L KSLE+ARK+ALK+QDE S QV Sbjct: 591 AKADEASKALRQEQVPAMQTEEDDAPVFGDDDDELRKSLERARKIALKKQDEEEKSAPQV 650 Query: 1274 
VARLAETNNESEETQNHVSGGI-------VITEMEEFVSKIHLDEEIHKPEADDVFKDEE 1432 + LA ++ T+N SG + + TEMEEFV + LDEE PE++DVF +E+ Sbjct: 651 ITLLATSSANDSTTENPNSGSVDQQENKVIFTEMEEFVWGLQLDEEEKNPESEDVFMEED 710 Query: 1433 V-PKSFEKEMEDEAGGWMEVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSGALQLLK 1609 V P + ++EM+DEAGGW EVK+T DE P EEKE++VPD+ IHE+AVGKGL+GAL+LLK Sbjct: 711 VAPSTSDQEMKDEAGGWAEVKETMKDETPAKEEKEEVVPDETIHESAVGKGLAGALKLLK 770 Query: 1610 ERGTLKETIDWGGRNMDKKKSKLVGIYENDGTKEIRIERTDEFGRIMTPKEAFRIISHKF 1789 +RGTLKETI+WGGRNMDKKKSKLVGIY+ND KEIRIERTDE+GRI+TPKEAFR++SHKF Sbjct: 771 DRGTLKETIEWGGRNMDKKKSKLVGIYDNDAAKEIRIERTDEYGRILTPKEAFRLLSHKF 830 Query: 1790 HGKGPGKMKLEKKIKKYQEELKSKQMKASDTPSESMERMREAQARLKTPYLVLSGHVKPG 1969 HGKGPGKMK EK++++YQEELK KQMK +DTPS S+ERMREAQA+L+TPYLVLSGHVKPG Sbjct: 831 HGKGPGKMKQEKRMRQYQEELKVKQMKNADTPSLSVERMREAQAKLQTPYLVLSGHVKPG 890 Query: 1970 QTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 2122 Q+SDPR+ FATVEKD G LTPMLGD+KVEHFL IKRK EP KK K Sbjct: 891 QSSDPRNTFATVEKDFAGGLTPMLGDKKVEHFLNIKRKPEPGDTASQKKPK 941 >ref|XP_011011622.1| PREDICTED: SART-1 family protein DOT2 isoform X1 [Populus euphratica] Length = 860 Score = 825 bits (2130), Expect = 0.0 Identities = 440/718 (61%), Positives = 531/718 (73%), Gaps = 15/718 (2%) Frame = +2 Query: 14 KNRDQGYDKEMLRSKDGERDEKLKLDYEDTRDITMQVKEVQNGESDNPEKLSTKDHKQIT 193 ++R+ DKE R KD + + DY+D + M ++ + ++ K+S +D Sbjct: 150 RDREADQDKERSREKDRASRKGNEEDYDDK--VQMDYEDEVDKDNRKQGKVSFRDEG--- 204 Query: 194 ESAADGSHPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEEKLNAEKVKALHL 373 E +A+G+H S+LE RI+KMKEER KKKSE S+ILAWV +SRK+EE +A K +A HL Sbjct: 205 EQSAEGAHSSASELEQRILKMKEERTKKKSEAGSDILAWVGRSRKIEENKHAAKARAKHL 264 Query: 374 SKVFEEQDNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVLTLRDQNILTDGD 553 SK+FEEQDN+ QG SDDEE QH +LAG+K+L GLDKV+EGGAVVLTL+DQNIL DGD Sbjct: 265 SKIFEEQDNIGQGGSDDEEADQHNAYNLAGIKVLDGLDKVLEGGAVVLTLKDQNILADGD 324 Query: 554 INEEADMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKILPQYDDPADEEGV 733 INEE DMLENVEIGEQKRRDEAYKA+KKKTG YDDKFN+DP S+KK+LPQYDD +EG+ Sbjct: 325 
INEEVDMLENVEIGEQKRRDEAYKAAKKKTGIYDDKFNDDPASEKKMLPQYDDANADEGI 384 Query: 734 TLDGSGHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTHEEMVQFXXXXXX 913 TLD G F+GEA +QG S + +DL ++GK+SSDY+THEEM++F Sbjct: 385 TLDERGRFTGEAEKKLEELRRRLQGTSTSARLEDLNSSGKISSDYFTHEEMLKF-KKPKK 443 Query: 914 XXXXXXXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVEADMRTEXXXXXX 1093 LD+DALEAEA+SAGLG GDLGSR + RRQ+ + E+ER A+MR Sbjct: 444 KKSLRKKDKLDIDALEAEAVSAGLGIGDLGSRKDGRRQAIREEQERSAAEMRNNAYQSAY 503 Query: 1094 XXXXXXXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALKRQDEAGASGMQV 1273 LR +QTL ++EE+ENLVF D+EDLYKSLE+ARKLALK+Q EA ASG Sbjct: 504 AKADEASKSLRLDQTLQTKVEEEENLVFADDEEDLYKSLERARKLALKKQ-EAEASGPLA 562 Query: 1274 VARLAET-------NNESEETQNHVSGGIVITEMEEFVSKIHLDEEIHKPEADDVFKDE- 1429 +A LA T ++++ ET +V TEMEEFVS I L EE+HKP+ +DVF DE Sbjct: 563 IAHLASTTLSSQIADDKNPETGESHENKLVFTEMEEFVSAIQLAEEVHKPDNEDVFMDED 622 Query: 1430 EVPKSFEKEMEDEAGGWMEVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSGALQLLK 1609 E P+ ++E +DEAGGWMEV D DE P+NE+ E++VPD+ IHE AVGKGLSGAL+LLK Sbjct: 623 EPPRVSDEEQKDEAGGWMEVPDNSKDENPVNED-EEIVPDETIHEVAVGKGLSGALKLLK 681 Query: 1610 ERGTLKETIDWGGRNMDKKKSKLVGIYEND-GT------KEIRIERTDEFGRIMTPKEAF 1768 ERGTLKE+IDWGGRNMDKKKSKLVGI ++D GT K+IRIERTDEFGRIMTPKEAF Sbjct: 682 ERGTLKESIDWGGRNMDKKKSKLVGIVDDDVGTNNDNKFKDIRIERTDEFGRIMTPKEAF 741 Query: 1769 RIISHKFHGKGPGKMKLEKKIKKYQEELKSKQMKASDTPSESMERMREAQARLKTPYLVL 1948 R+ISHKFHGKGPGKMK EK++K+YQEELK KQMK SDTPS S+ERMR AQA+LKTPYLVL Sbjct: 742 RMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERMRGAQAQLKTPYLVL 801 Query: 1949 SGHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 2122 SGHVKPGQTSDPRSGFATVEKD PG LTPMLGD+KVEHFLGIKRK E G PKK K Sbjct: 802 SGHVKPGQTSDPRSGFATVEKDFPGGLTPMLGDKKVEHFLGIKRKPETGFSGAPKKPK 859 >ref|XP_010926911.1| PREDICTED: SART-1 family protein DOT2 [Elaeis guineensis] Length = 1017 Score = 824 bits (2129), Expect = 0.0 Identities = 432/696 (62%), Positives = 528/696 (75%), Gaps = 5/696 (0%) Frame = +2 Query: 50 RSKDGERDEKLKLDYEDTRDITMQVKEVQNGESDNPEKLSTKDHKQITESAADGSHPPTS 229 
R+++GE+DEK+K D ++R I + +E+Q+ E D T + K I+ ++ TS Sbjct: 334 RAREGEKDEKVKADGGNSR-IARKGEEIQDNEGD-----LTHNEKSISSTS-------TS 380 Query: 230 DLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEEKLNAEKVKALHLSKVFEEQDNVVQ 409 +LE R+ KMKEERLK+K +G SEI +WV+KSRK+EEK NAEK KAL LSK EEQDN++ Sbjct: 381 ELEERVTKMKEERLKRKPDGASEISSWVNKSRKLEEKRNAEKEKALRLSKALEEQDNIL- 439 Query: 410 GESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVLTLRDQNILTDGDINEEADMLENVE 589 ES+DEE H DLAGVKILHGLDKV+EGGAVVLTL+DQ+IL DGDINE+ADMLENVE Sbjct: 440 AESEDEEATGHSGNDLAGVKILHGLDKVMEGGAVVLTLKDQSILADGDINEDADMLENVE 499 Query: 590 IGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKILPQYDDPADEEGVTLDGSGHFSGEA 769 IGEQK+RDEAY+A+KK+TG YDDKF++D GS+K ILPQYD+ ++EGVTLD SG F+GEA Sbjct: 500 IGEQKQRDEAYRAAKKRTGLYDDKFSDDMGSRKPILPQYDNEIEDEGVTLDESGRFTGEA 559 Query: 770 XXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTHEEMVQFXXXXXXXXXXXXXXXLDL 949 I+G + ++DLT++GK SSDYYT +EM+QF LDL Sbjct: 560 EKKLEELRKRIEGGIIKQNYEDLTSSGKSSSDYYTPDEMLQFKKPKKKKSLRKKEK-LDL 618 Query: 950 DALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVEADMRTEXXXXXXXXXXXXXXXLRQ 1129 DALEAEAISAGLGAGDLGSRN+ RRQ+AK E+ + +A+MR+ LRQ Sbjct: 619 DALEAEAISAGLGAGDLGSRNDLRRQTAKEEQVKADAEMRSNAYQSAIAKAEEASKALRQ 678 Query: 1130 EQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALKRQDEAGASGMQVVARLAETNNESE 1309 EQTLT++ ED+NLVFG D EDL +S+ +ARKLALK+QDE SG + VA +A T E E Sbjct: 679 EQTLTVKSVEDDNLVFGEDFEDLQRSIGQARKLALKKQDETPVSGPEAVALVATTKKEQE 738 Query: 1310 ETQ----NHVSGGIVITEMEEFVSKIHLDEEIHKPEADDVFKDEE-VPKSFEKEMEDEAG 1474 + ++ITEMEEFV + E+ HKPE++DVFKDEE +PKS E E E E G Sbjct: 739 DASPTEGEPQENKVIITEMEEFVLGLQFTEDTHKPESEDVFKDEEDIPKSLELETEAEVG 798 Query: 1475 GWMEVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSGALQLLKERGTLKETIDWGGRN 1654 GW EV +T E ++EEKED+ PD+I HETA+GKGLSG L+LLK+RGTL E +D GGRN Sbjct: 799 GWAEVMETDKTEAAVSEEKEDINPDEINHETAIGKGLSGVLKLLKDRGTLNEGVDLGGRN 858 Query: 1655 MDKKKSKLVGIYENDGTKEIRIERTDEFGRIMTPKEAFRIISHKFHGKGPGKMKLEKKIK 1834 MDKKKSKLVGIY+N+G KEIRIERTDEFGRIMTPKEAFR++SHKFHGKGPGKMK EK++K Sbjct: 859 MDKKKSKLVGIYDNEGQKEIRIERTDEFGRIMTPKEAFRMLSHKFHGKGPGKMKQEKRMK 918 Query: 1835 
KYQEELKSKQMKASDTPSESMERMREAQARLKTPYLVLSGHVKPGQTSDPRSGFATVEKD 2014 +YQE+LK+KQMKASDTP +ME+MREAQARLKTPYLVLSGHVKPGQTSDPRSGFATVEKD Sbjct: 919 QYQEDLKTKQMKASDTPLLAMEKMREAQARLKTPYLVLSGHVKPGQTSDPRSGFATVEKD 978 Query: 2015 HPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 2122 H GSLTPMLGD+KVEHFLGI R+ + SMGPP +K Sbjct: 979 HLGSLTPMLGDKKVEHFLGINRRPDAGSMGPPPPKK 1014 >ref|XP_002297938.2| hypothetical protein POPTR_0001s11550g [Populus trichocarpa] gi|550347020|gb|EEE82743.2| hypothetical protein POPTR_0001s11550g [Populus trichocarpa] Length = 862 Score = 823 bits (2126), Expect = 0.0 Identities = 444/721 (61%), Positives = 530/721 (73%), Gaps = 18/721 (2%) Frame = +2 Query: 14 KNRDQGYDKEMLRSKDGERDEKLKLDYEDTRDITMQVKEVQNGESDNPE--KLSTKDHK- 184 K R + D+ +S + + D+K+++DYED D DN + K+S +D Sbjct: 156 KERSREKDRASRKSNEEDYDDKVQMDYEDEVD------------KDNRKQGKVSFRDEDD 203 Query: 185 QITESAADGSHPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEEKLNAEKVKA 364 Q E A+ G+H S+L RI+KMKEER KKKSE S+ILAWV KSRK+EE A K +A Sbjct: 204 QSAEGASAGAHSSASELGQRILKMKEERTKKKSEPGSDILAWVGKSRKIEENKYAAKKRA 263 Query: 365 LHLSKVFEEQDNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVLTLRDQNILT 544 HLSK+FEEQDN+ QG SDDEE QH +LAG+K+L GLDKV+EGGAVVLTL+DQNIL Sbjct: 264 KHLSKIFEEQDNIGQGGSDDEEADQHNAYNLAGIKVLDGLDKVLEGGAVVLTLKDQNILA 323 Query: 545 DGDINEEADMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKILPQYDDPADE 724 DGDINEE DMLENVEIGEQKRRDEAYKA+KKKTG Y+DKFN+DP S+KK+LPQYDD + Sbjct: 324 DGDINEEVDMLENVEIGEQKRRDEAYKAAKKKTGIYEDKFNDDPASEKKMLPQYDDANAD 383 Query: 725 EGVTLDGSGHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTHEEMVQFXXX 904 EGVTLD G F+GEA +QG S + +DL ++GK+SSDY+THEEM+QF Sbjct: 384 EGVTLDERGRFTGEAEKKLEELRRRLQGTSTSARLEDLNSSGKISSDYFTHEEMLQF-KK 442 Query: 905 XXXXXXXXXXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVEADMRTEXXX 1084 LD+DALEAEA+SAGLG GDLGSR + RRQ+ + E+ER EA+MR Sbjct: 443 PKKKKSLRKKDKLDIDALEAEAVSAGLGIGDLGSRKDGRRQAIREEQERSEAEMRNNAYQ 502 Query: 1085 XXXXXXXXXXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALKRQDEAGASG 1264 LR ++TL ++EE+ENLVF D+EDLYKSLE+ARKLALK+Q EA ASG Sbjct: 503 
SAYAKADEASKSLRLDRTLQTKVEEEENLVFADDEEDLYKSLERARKLALKKQ-EAEASG 561 Query: 1265 MQVVARLAET-------NNESEETQNHVSGGIVITEMEEFVSKIHLDEEIHKPEADDVFK 1423 +A LA T ++++ ET +V TEMEEFVS I L EE+HKP+ +DVF Sbjct: 562 PLAIAHLASTTLSSQIADDKNPETGESHENKLVFTEMEEFVSAIQLAEEVHKPDNEDVFM 621 Query: 1424 DE-EVPKSFEKEMEDEAGGWMEVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSGALQ 1600 DE E P+ ++E +DEAGGWMEV D DE P+NE+ E++VPD+ IHE AVGKGLSGAL+ Sbjct: 622 DEDEPPRVSDEEQKDEAGGWMEVPDNSKDENPVNED-EEIVPDETIHEVAVGKGLSGALK 680 Query: 1601 LLKERGTLKETIDWGGRNMDKKKSKLVGIYEND-GT------KEIRIERTDEFGRIMTPK 1759 LLKERGTLKE+IDWGGRNMDKKKSKLVGI ++D GT K+IRIERTDEFGRIMTPK Sbjct: 681 LLKERGTLKESIDWGGRNMDKKKSKLVGIVDDDVGTNNDNKFKDIRIERTDEFGRIMTPK 740 Query: 1760 EAFRIISHKFHGKGPGKMKLEKKIKKYQEELKSKQMKASDTPSESMERMREAQARLKTPY 1939 EAFR+ISHKFHGKGPGKMK EK++K+YQEELK KQMK SDTPS S+ERMR AQA+LKTPY Sbjct: 741 EAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERMRGAQAQLKTPY 800 Query: 1940 LVLSGHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQ 2119 LVLSGHVKPGQTSDPRSGFATVEKD PG LTPMLGD+KVEHFLGIKRK E G PKK Sbjct: 801 LVLSGHVKPGQTSDPRSGFATVEKDFPGGLTPMLGDKKVEHFLGIKRKPETGFSGAPKKP 860 Query: 2120 K 2122 K Sbjct: 861 K 861 >ref|XP_002516516.1| conserved hypothetical protein [Ricinus communis] gi|223544336|gb|EEF45857.1| conserved hypothetical protein [Ricinus communis] Length = 873 Score = 821 bits (2120), Expect = 0.0 Identities = 434/717 (60%), Positives = 527/717 (73%), Gaps = 14/717 (1%) Frame = +2 Query: 14 KNRDQGYDKEMLRSKDGERDEKLKLDYEDTRDITMQVKEVQNGESDNPEKLS---TKDHK 184 K +++ +DK+ LR +R + + D I M + +N + +K+S D + Sbjct: 159 KEKEEFHDKDRLRDGVSKRSHEEENDRSKNDTIEMGYERERNSDVGKQKKVSFDDDNDDE 218 Query: 185 QITESAADGSHPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEEKLNAEKVKA 364 Q E + G + + E RI+K++EERLKK S+ SE+L+WV++SRK+ EK NAEK KA Sbjct: 219 QKVERTSGGGLASSLEFEERILKVREERLKKNSDAGSEVLSWVNRSRKLAEKKNAEKKKA 278 Query: 365 LHLSKVFEEQDNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVLTLRDQNILT 544 LSKVFEEQD +VQGES+DEE + T DLAGVK+LHGL+KV+EGGAVVLTL+DQ+IL Sbjct: 279 
KQLSKVFEEQDKIVQGESEDEEAGELATNDLAGVKVLHGLEKVMEGGAVVLTLKDQSILV 338 Query: 545 DGDINEEADMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKILPQYDDPADE 724 DGDINEE DMLEN+EIGEQKRR+EAYKA+KKKTG YDDKFN+DP S++KILPQYDDP + Sbjct: 339 DGDINEEVDMLENIEIGEQKRRNEAYKAAKKKTGIYDDKFNDDPASERKILPQYDDPTTD 398 Query: 725 EGVTLDGSGHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTHEEMVQFXXX 904 EGVTLD G F+GEA +QGA N F+DL ++GK+SSD+YTHEEM+QF Sbjct: 399 EGVTLDERGRFTGEAEKKLEELRRRLQGALTDNCFEDLNSSGKMSSDFYTHEEMLQF-KK 457 Query: 905 XXXXXXXXXXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVEADMRTEXXX 1084 LD+DALEAEA+SAGLG GDLGSR++ RRQ+ + E+ER EA+ R+ Sbjct: 458 PKKKKSLRKKEKLDIDALEAEAVSAGLGVGDLGSRSDGRRQAIREEQERSEAERRSSAYQ 517 Query: 1085 XXXXXXXXXXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALKRQDEAGASG 1264 LR EQTL ++ E+EN VF DDEDL+KSLE+ARKLALK+Q+E ASG Sbjct: 518 SAYAKADEASKSLRLEQTLPAKVNEEENPVFADDDEDLFKSLERARKLALKKQEE--ASG 575 Query: 1265 MQVVARLA-ETNNESEETQNHVSG-----GIVITEMEEFVSKIHLDEEIHKPEADDVFKD 1426 Q +ARLA TNN+ + QN G +V TEMEEFV + LDEE HKP ++DVF D Sbjct: 576 PQAIARLATATNNQIADDQNPADGESQENKVVFTEMEEFVWGLQLDEESHKPGSEDVFMD 635 Query: 1427 EE-VPKSFEKEMEDEAGGWMEVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSGALQL 1603 E+ P+ ++EM+DEAG W EV D D+ +NE KED+VPD+ IHE AVGKGLSGAL+L Sbjct: 636 EDAAPRVSDQEMKDEAGRWTEVNDAAEDDNSVNENKEDVVPDETIHEVAVGKGLSGALKL 695 Query: 1604 LKERGTLKETIDWGGRNMDKKKSKLVGIYENDGT----KEIRIERTDEFGRIMTPKEAFR 1771 LKERGTLKET+DWGGRNMDKKKSKLVGI ++D KEIRIER DEFGRIMTPKEAFR Sbjct: 696 LKERGTLKETVDWGGRNMDKKKSKLVGIVDSDADNEKFKEIRIERMDEFGRIMTPKEAFR 755 Query: 1772 IISHKFHGKGPGKMKLEKKIKKYQEELKSKQMKASDTPSESMERMREAQARLKTPYLVLS 1951 +ISHKFHGKGPGKMK EK++K+YQEELK KQMK SDTPSES+ERMREAQ +LKTPYLVLS Sbjct: 756 MISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSESVERMREAQKKLKTPYLVLS 815 Query: 1952 GHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 2122 GHVK GQ SDPRS FATVEKD PG LTPMLGD+KVEHFLGIKRKAE + P KK K Sbjct: 816 GHVKSGQASDPRSSFATVEKDLPGGLTPMLGDKKVEHFLGIKRKAEHENSSPSKKPK 872 >ref|XP_011011623.1| PREDICTED: SART-1 family protein DOT2 isoform X2 [Populus 
euphratica] Length = 859 Score = 818 bits (2114), Expect = 0.0 Identities = 439/718 (61%), Positives = 530/718 (73%), Gaps = 15/718 (2%) Frame = +2 Query: 14 KNRDQGYDKEMLRSKDGERDEKLKLDYEDTRDITMQVKEVQNGESDNPEKLSTKDHKQIT 193 ++R+ DKE R KD + + DY+D + M ++ + ++ K+S +D Sbjct: 150 RDREADQDKERSREKDRASRKGNEEDYDDK--VQMDYEDEVDKDNRKQGKVSFRDEG--- 204 Query: 194 ESAADGSHPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEEKLNAEKVKALHL 373 E +A+G+H S+LE RI+KMKEER KKKSE S+ILAWV +SRK+EE +A K +A HL Sbjct: 205 EQSAEGAHSSASELEQRILKMKEERTKKKSEAGSDILAWVGRSRKIEENKHAAKARAKHL 264 Query: 374 SKVFEEQDNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVLTLRDQNILTDGD 553 SK+FEEQDN+ QG SDDEE QH +LAG+K+L GLDKV+EGGAVVLTL+DQNIL DGD Sbjct: 265 SKIFEEQDNIGQGGSDDEEADQHNAYNLAGIKVLDGLDKVLEGGAVVLTLKDQNILADGD 324 Query: 554 INEEADMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKILPQYDDPADEEGV 733 INEE DMLENVEIGEQKRRDEAYKA+KKKTG YDDKFN+DP S+KK+LPQYDD +EG+ Sbjct: 325 INEEVDMLENVEIGEQKRRDEAYKAAKKKTGIYDDKFNDDPASEKKMLPQYDDANADEGI 384 Query: 734 TLDGSGHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTHEEMVQFXXXXXX 913 TLD G F+GEA +QG S + +DL ++GK+SSDY+THEEM++F Sbjct: 385 TLDERGRFTGEAEKKLEELRRRLQGTSTSARLEDLNSSGKISSDYFTHEEMLKF-KKPKK 443 Query: 914 XXXXXXXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVEADMRTEXXXXXX 1093 LD+DALEAEA+SAGLG GDLGSR + RRQ+ + E+ER A+MR Sbjct: 444 KKSLRKKDKLDIDALEAEAVSAGLGIGDLGSRKDGRRQAIREEQERSAAEMRNNAYQSAY 503 Query: 1094 XXXXXXXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALKRQDEAGASGMQV 1273 LR +QTL ++EE+ENLVF D+EDLYKSLE+ARKLALK+Q EA ASG Sbjct: 504 AKADEASKSLRLDQTLQTKVEEEENLVFADDEEDLYKSLERARKLALKKQ-EAEASGPLA 562 Query: 1274 VARLAET-------NNESEETQNHVSGGIVITEMEEFVSKIHLDEEIHKPEADDVFKDE- 1429 +A LA T ++++ ET +V TEMEEFVS I L E+HKP+ +DVF DE Sbjct: 563 IAHLASTTLSSQIADDKNPETGESHENKLVFTEMEEFVSAIQL-AEVHKPDNEDVFMDED 621 Query: 1430 EVPKSFEKEMEDEAGGWMEVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSGALQLLK 1609 E P+ ++E +DEAGGWMEV D DE P+NE+ E++VPD+ IHE AVGKGLSGAL+LLK Sbjct: 622 EPPRVSDEEQKDEAGGWMEVPDNSKDENPVNED-EEIVPDETIHEVAVGKGLSGALKLLK 680 Query: 1610 
ERGTLKETIDWGGRNMDKKKSKLVGIYEND-GT------KEIRIERTDEFGRIMTPKEAF 1768 ERGTLKE+IDWGGRNMDKKKSKLVGI ++D GT K+IRIERTDEFGRIMTPKEAF Sbjct: 681 ERGTLKESIDWGGRNMDKKKSKLVGIVDDDVGTNNDNKFKDIRIERTDEFGRIMTPKEAF 740 Query: 1769 RIISHKFHGKGPGKMKLEKKIKKYQEELKSKQMKASDTPSESMERMREAQARLKTPYLVL 1948 R+ISHKFHGKGPGKMK EK++K+YQEELK KQMK SDTPS S+ERMR AQA+LKTPYLVL Sbjct: 741 RMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERMRGAQAQLKTPYLVL 800 Query: 1949 SGHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 2122 SGHVKPGQTSDPRSGFATVEKD PG LTPMLGD+KVEHFLGIKRK E G PKK K Sbjct: 801 SGHVKPGQTSDPRSGFATVEKDFPGGLTPMLGDKKVEHFLGIKRKPETGFSGAPKKPK 858 >ref|XP_007022025.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao] gi|590611175|ref|XP_007022026.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao] gi|508721653|gb|EOY13550.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao] gi|508721654|gb|EOY13551.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao] Length = 907 Score = 809 bits (2090), Expect = 0.0 Identities = 432/719 (60%), Positives = 518/719 (72%), Gaps = 16/719 (2%) Frame = +2 Query: 14 KNRDQGYDKEMLRSKDGERDEKLKLDYEDTRDITMQVKEVQNGESDNPEKLSTKDHKQIT 193 ++RD K +G +D +L LDY D+RD KD ++ Sbjct: 211 RDRDNAIKKNHEEDYEGSKDGELALDYGDSRD---------------------KDEAELN 249 Query: 194 ESAADG-SHPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEEKLNAEKVKALH 370 + G + +S+LE RI +MKEERLKKKSEGVSE+L WV RK+EEK NAEK KAL Sbjct: 250 AGSNAGVAQASSSELEERIARMKEERLKKKSEGVSEVLEWVGNFRKLEEKRNAEKEKALQ 309 Query: 371 LSKVFEEQDNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVLTLRDQNILTDG 550 SK+FEEQD+ VQGE++DEE +H DLAGVK+LHGLDKV++GGAVVLTL+DQ+IL +G Sbjct: 310 RSKIFEEQDDFVQGENEDEEAVRHAAHDLAGVKVLHGLDKVMDGGAVVLTLKDQSILANG 369 Query: 551 DINEEADMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKILPQYDDPADEEG 730 DINE+ DMLENVEIGEQ+RRDEAYKA+KKKTG YDDKFN++PGS+KKILPQYD+P +EG Sbjct: 370 DINEDVDMLENVEIGEQRRRDEAYKAAKKKTGVYDDKFNDEPGSEKKILPQYDNPVADEG 429 Query: 731 
VTLDGSGHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTHEEMVQFXXXXX 910 VTLD G F+GEA +QG P +N+ +DL GK++SDYYT EEM++F Sbjct: 430 VTLDERGRFTGEAEKKLQELRKRLQGVPTNNRVEDLNNAGKIASDYYTQEEMLKF-KKPK 488 Query: 911 XXXXXXXXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVEADMRTEXXXXX 1090 LD+DALEAEAIS+GLGAGDLGSRN+ RRQ+ + EE R EA+ R Sbjct: 489 KKKALRKKEKLDIDALEAEAISSGLGAGDLGSRNDARRQAIREEEARSEAEKRNSAYQSA 548 Query: 1091 XXXXXXXXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALKRQDEAGASGMQ 1270 L EQTL ++ EEDEN VF DD+DLYKS+E++RKLA K+Q++ SG Q Sbjct: 549 YAKADEASKSLWLEQTLIVKPEEDENQVFADDDDDLYKSIERSRKLAFKKQEDE-KSGPQ 607 Query: 1271 VVARLAET-------NNESEETQNHVSGGIVITEMEEFVSKIHLDEEIHKPEADDVFKDE 1429 +A A T ++++ T +VITEMEEFV + DEE HKP+++DVF DE Sbjct: 608 AIALRATTAAISQTADDQTTTTGEAQENKLVITEMEEFVWGLQHDEEAHKPDSEDVFMDE 667 Query: 1430 -EVPKSFE---KEMEDEAGGWMEVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSGAL 1597 EVP E K E+E GGW EV D DE P NE+K+D+VPD+ IHE AVGKGLSGAL Sbjct: 668 DEVPGVSEHDGKSGENEVGGWTEVVDASTDENPSNEDKDDIVPDETIHEVAVGKGLSGAL 727 Query: 1598 QLLKERGTLKETIDWGGRNMDKKKSKLVGIY----ENDGTKEIRIERTDEFGRIMTPKEA 1765 +LLK+RGTLKE+I+WGGRNMDKKKSKLVGI END K+IRIERTDEFGRI+TPKEA Sbjct: 728 KLLKDRGTLKESIEWGGRNMDKKKSKLVGIVDDDRENDRFKDIRIERTDEFGRIITPKEA 787 Query: 1766 FRIISHKFHGKGPGKMKLEKKIKKYQEELKSKQMKASDTPSESMERMREAQARLKTPYLV 1945 FR++SHKFHGKGPGKMK EK+ K+YQEELK KQMK SDTPS S+ERMREAQA+LKTPYLV Sbjct: 788 FRVLSHKFHGKGPGKMKQEKRQKQYQEELKLKQMKNSDTPSLSVERMREAQAQLKTPYLV 847 Query: 1946 LSGHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 2122 LSGHVKPGQTSDPRSGFATVEKD PG LTPMLGDRKVEHFLGIKRKAEP + PKK K Sbjct: 848 LSGHVKPGQTSDPRSGFATVEKDFPGGLTPMLGDRKVEHFLGIKRKAEPGNSSTPKKPK 906 >ref|XP_010102332.1| hypothetical protein L484_015280 [Morus notabilis] gi|587905102|gb|EXB93293.1| hypothetical protein L484_015280 [Morus notabilis] Length = 952 Score = 806 bits (2081), Expect = 0.0 Identities = 439/759 (57%), Positives = 538/759 (70%), Gaps = 52/759 (6%) Frame = +2 Query: 2 KMIGKNRDQGYDKEMLRS--------------KDGERDEKLKLDYEDTRDITMQVKEVQN 139 K+ K R+ DKE R 
KDG RD+K KLD ++ +D +E + Sbjct: 206 KIKEKEREADQDKEKSRDRVSKKSVEEDYELGKDGGRDDKTKLDDDNKKD-----REAKQ 260 Query: 140 GESDNPEKLSTKDHKQITESAADGSHPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSK 319 G D +QIT + +H T++LE RI+KMK+ER KKK+E V E+LAWV+K Sbjct: 261 GNVSQ-----YIDGEQITHDISHKAHLTTTELEKRILKMKQERSKKKTEDVPEVLAWVNK 315 Query: 320 SRKVEEKLNAEKVKALHLSKVFEEQDNVVQGESDDEEVP-QHLTKDLAGVKILHGLDKVI 496 SRK+EEK N EK KAL LSK+FEEQDN+VQ +S+DEE QH +LAGVK+LHG+DKV+ Sbjct: 316 SRKLEEKKNDEKEKALQLSKIFEEQDNIVQEDSEDEETTTQHY--NLAGVKVLHGIDKVM 373 Query: 497 EGGAVVLTLRDQNILTDGDINEEADMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDP 676 EGGAVVLTL+DQNIL DGDIN E DMLENVEIGEQKRRDEAYKA+KKK G Y DKFN+DP Sbjct: 374 EGGAVVLTLKDQNILADGDINLEIDMLENVEIGEQKRRDEAYKAAKKKVGIYVDKFNDDP 433 Query: 677 GSQKKILPQYDDPADEEGVTLDGSGHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKL 856 S++K+LPQYDDP+ + GVT+D G + EA +QGA +++F+DL+ GK+ Sbjct: 434 NSERKMLPQYDDPSTDVGVTIDERGRITSEAEKKLEELRRRLQGASTNSRFEDLSFPGKV 493 Query: 857 SSDYYTHEEMVQFXXXXXXXXXXXXXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAK 1036 SSDYYT EEM+QF LD+DALEAEA+SAGLG GDLGSRN+ +RQ + Sbjct: 494 SSDYYTSEEMMQFKKPKKKKSLRKKDK-LDIDALEAEAVSAGLGVGDLGSRNDPKRQVIR 552 Query: 1037 AEEERVEADMRTEXXXXXXXXXXXXXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEK 1216 E++R EA+ R LR EQTL ++LEE+ENLVF DDED +K++E+ Sbjct: 553 EEQDRAEAERRNNAYKTAFAKADEASKSLRLEQTLPVKLEEEENLVFADDDEDFHKAVER 612 Query: 1217 ARKLALKRQDEAGASGMQVVARLAET--NNESEETQN----HVSGGIVITEMEEFVSKIH 1378 ARK+A+K++D+ SG + VA LA T N++ + QN +V TEMEEFV + Sbjct: 613 ARKIAVKKEDKETPSGPEAVALLAATIANSQPADEQNPSGESQENKVVFTEMEEFVWGLQ 672 Query: 1379 LDEEIHKPEADDVFKDE-EVPKSFEKEMEDEAGGWMEVKDTGADELPINEEKEDLVPDKI 1555 L+EE KP+ +DVF DE E PK++ +E+++E GGW EVK+T DE P EE+E++VPD I Sbjct: 673 LEEEAQKPDNEDVFMDEDEEPKAYNEEIKNEPGGWTEVKETNNDEHPSKEEEEEIVPDGI 732 Query: 1556 IHETAVGKGLSGALQLLKERGTLKETIDWGGRNMDKKKSKLVGIYEND-----------G 1702 IHE AVGKGLSGAL+LLKERGTLKE+IDWGGRNMDKKKSKLVGI ++D G Sbjct: 733 IHEVAVGKGLSGALKLLKERGTLKESIDWGGRNMDKKKSKLVGIVDDDEPGQQVHPKKDG 792 Query: 1703 
T-------------------KEIRIERTDEFGRIMTPKEAFRIISHKFHGKGPGKMKLEK 1825 T K+IRIERTDEFGRI+TPKEAFRIISHKFHGKGPGKMK EK Sbjct: 793 TRTSSSSYSKETRASKVYEEKDIRIERTDEFGRILTPKEAFRIISHKFHGKGPGKMKQEK 852 Query: 1826 KIKKYQEELKSKQMKASDTPSESMERMREAQARLKTPYLVLSGHVKPGQTSDPRSGFATV 2005 ++K+YQEELK KQMK+SDTPS+S+ERMREAQA+LKTPYLVLSGHVKPGQTSDPRSGFATV Sbjct: 853 RMKQYQEELKLKQMKSSDTPSQSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATV 912 Query: 2006 EKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 2122 EKD PG LTPMLGDRKVEHFLGIKRK EP++ G PKK K Sbjct: 913 EKDPPGGLTPMLGDRKVEHFLGIKRKPEPANSGRPKKPK 951 >ref|XP_010033990.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Eucalyptus grandis] gi|629087518|gb|KCW53875.1| hypothetical protein EUGRSUZ_J03092 [Eucalyptus grandis] Length = 900 Score = 805 bits (2078), Expect = 0.0 Identities = 424/706 (60%), Positives = 521/706 (73%), Gaps = 10/706 (1%) Frame = +2 Query: 35 DKEMLRSKDGERDEKLKLDYEDTRDITMQVKEVQNGESDNPEKLSTKDHKQITESAADGS 214 D+E +D RD++ + D D ++K+ E D E+ H Q +SA DG+ Sbjct: 199 DREEDHDRDRSRDKERVIRKGDAHDYD-RIKD-NRVEFDIAEEKEDVGHGQNPDSALDGT 256 Query: 215 HPPTSDLESRIMKMKEERLKKK--SEGVSEILAWVSKSRKVEEKLNAEKVKALHLSKVFE 388 TS+L+ RI K KEERLK++ SEG SEILAWV++SRK+E+K NAEK K + LSKVFE Sbjct: 257 RLSTSNLQDRISKAKEERLKRQPESEGASEILAWVNRSRKLEQKRNAEKEKVMRLSKVFE 316 Query: 389 EQDNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVLTLRDQNILTDGDINEEA 568 EQD++ GES+DE+ DLAGVK+LHGLDKV+EGGAVVLTL+DQNIL DGDINEE Sbjct: 317 EQDDIGHGESEDEQEVPRNAHDLAGVKVLHGLDKVVEGGAVVLTLKDQNILADGDINEEV 376 Query: 569 DMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKILPQYDDPADEEGVTLDGS 748 DMLENVEIGEQK RDEAYKA+KKK+G YDDKF++DP S+KK+LPQYDDPA +EGVTLD S Sbjct: 377 DMLENVEIGEQKHRDEAYKAAKKKSGIYDDKFSDDPASEKKMLPQYDDPAQDEGVTLDSS 436 Query: 749 GHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTHEEMVQFXXXXXXXXXXX 928 G + EA +QG S+ ++DLT++ K SSDYYT EE+++F Sbjct: 437 GRLTNEAEKKLEELRRRLQGVSSSSHYEDLTSSAKTSSDYYTQEELLRF-RKPKKKKSLR 495 Query: 929 XXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVEADMRTEXXXXXXXXXXX 1108 LDLDALEAEA+SAGLG GDLGSR + RRQ+++ E+E++EA+MR 
Sbjct: 496 KKEKLDLDALEAEAVSAGLGVGDLGSRKDGRRQASREEQEKIEAEMRKNAFQLAYAKAEE 555 Query: 1109 XXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALKRQDEAGASGMQVVARLA 1288 LR EQTL ++ E DEN+V DDEDLYKSLE+ARKLALK+Q+E GASG + +A A Sbjct: 556 ASRLLRVEQTLPVKTENDENMVIADDDEDLYKSLERARKLALKKQEEKGASGPKAIALRA 615 Query: 1289 ET-------NNESEETQNHVSGGIVITEMEEFVSKIHLDEEIHKPEADDVFKDE-EVPKS 1444 + N+S T +V+TE+E FVS + +DE KP+ +DVF DE E P + Sbjct: 616 SSIPSTHNAENQSVTTGESQESRVVMTEIEGFVSGLEVDEVSRKPDTEDVFMDEDEAPVT 675 Query: 1445 FEKEMEDEAGGWMEVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSGALQLLKERGTL 1624 + E++DE GGW E K+ G DE +NE++E++VPD+ IHE AVGKGLSGAL+LLK+RGTL Sbjct: 676 SDNEVKDEPGGWTEFKEFGNDEGSVNEDEEEVVPDETIHEAAVGKGLSGALKLLKDRGTL 735 Query: 1625 KETIDWGGRNMDKKKSKLVGIYENDGTKEIRIERTDEFGRIMTPKEAFRIISHKFHGKGP 1804 KET++WGGRNMDKKKSKLVGI + G KEIRIERTDEFGRI+TPKEAFR++SHKFHGKGP Sbjct: 736 KETVEWGGRNMDKKKSKLVGIADG-GQKEIRIERTDEFGRILTPKEAFRLLSHKFHGKGP 794 Query: 1805 GKMKLEKKIKKYQEELKSKQMKASDTPSESMERMREAQARLKTPYLVLSGHVKPGQTSDP 1984 GKMK EK++K+Y EELK KQMK SDTPS S ERMREAQA++KTPYLVLSGHVKPGQ SDP Sbjct: 795 GKMKQEKRMKQYHEELKLKQMKNSDTPSSSAERMREAQAQMKTPYLVLSGHVKPGQNSDP 854 Query: 1985 RSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 2122 RSGFAT+EKD PGSLTPMLGDRKVEHFLGIKRK EPS++G KK K Sbjct: 855 RSGFATIEKD-PGSLTPMLGDRKVEHFLGIKRKPEPSNLGASKKPK 899 >ref|XP_009405353.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Musa acuminata subsp. malaccensis] gi|695035842|ref|XP_009405354.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Musa acuminata subsp. malaccensis] gi|695035844|ref|XP_009405355.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Musa acuminata subsp. 
malaccensis] Length = 996 Score = 803 bits (2075), Expect = 0.0 Identities = 431/709 (60%), Positives = 538/709 (75%), Gaps = 6/709 (0%) Frame = +2 Query: 14 KNRDQGYDKEMLRSKDGERDEKLKLDYEDTRDITMQVKEVQNGESDNPEKLSTKDHKQIT 193 ++R + +K +K+ E+DE+ D+ED R + + +E ++G SD+ EK + K+ Q + Sbjct: 295 RSRTRDREKGPAGAKESEKDERTLSDFEDGR-LDSREEEARDG-SDSHEKSTLKN--QQS 350 Query: 194 ESAADGSHPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEEKLNAEKVKALHL 373 E D S+LE R+ + KEER+KKKS+G EI +WV+KSR++EE+ NAEK +AL L Sbjct: 351 EKHTDSLL--ASELEERLARTKEERMKKKSDGAFEISSWVNKSRRLEERKNAEK-EALRL 407 Query: 374 SKVFEEQDNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVLTLRDQNILTDGD 553 SK FEEQDN++ + DDE V H KDLAGVKILHGLDKVIEGGAVVLTL+DQ+IL DGD Sbjct: 408 SKAFEEQDNML-ADGDDETVG-HTQKDLAGVKILHGLDKVIEGGAVVLTLKDQDILKDGD 465 Query: 554 INEEADMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKILPQYDDPADEEGV 733 INEE DMLENVEIGEQK+RDEAYKA+KK+TG YDDKFN++ GSQK ILPQYDDP ++EGV Sbjct: 466 INEEIDMLENVEIGEQKQRDEAYKAAKKRTGLYDDKFNDETGSQKTILPQYDDPVEDEGV 525 Query: 734 TLDGSGHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTHEEMVQFXXXXXX 913 LD SGHF+GEA I+G+ V ++DLT++ K SSDYYT EEM++F Sbjct: 526 ALDESGHFTGEAEKKLEELRRRIEGSFVPKSYEDLTSSAKNSSDYYTAEEMLRFKKPKKK 585 Query: 914 XXXXXXXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVEADMRTEXXXXXX 1093 LDLDA+EAEA SAGLGA DLGSRN+ RRQ + E+E++EA+ R++ Sbjct: 586 KSLRKKEK-LDLDAMEAEARSAGLGASDLGSRNDMRRQIEREEQEKIEAERRSKAYQTAY 644 Query: 1094 XXXXXXXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALKRQDEAGASGMQV 1273 + QEQTL ++ ED+++VFG D EDL SLE+ARKLAL++ DEAGA+G Q Sbjct: 645 EKAEEASKVMLQEQTLRLKSFEDDDIVFGEDYEDLQMSLEQARKLALRKHDEAGATGPQA 704 Query: 1274 VARLAETNNESEETQNHVSGG-----IVITEMEEFVSKIHLDEEIHKPEADDVFKDEE-V 1435 VA LA + E E +Q+ +G +VITE+EEFV + L+E KPE++DVF DEE Sbjct: 705 VALLATSIKEQENSQSQSTGELQEEKVVITEVEEFVLGLQLNEGAQKPESEDVFMDEEDS 764 Query: 1436 PKSFEKEMEDEAGGWMEVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSGALQLLKER 1615 PKS E E++ + GW EV++T E PI+E+K+D+ PD+IIHE AVGKGLSGAL+LLKER Sbjct: 765 PKSLEPEIKVDVTGWTEVEETSKSEDPISEKKDDVSPDEIIHEVAVGKGLSGALKLLKER 824 Query: 
1616 GTLKETIDWGGRNMDKKKSKLVGIYENDGTKEIRIERTDEFGRIMTPKEAFRIISHKFHG 1795 G LKET+DWGGR MDKKKSKLVG+Y++ GTKEIRIERTDEFGRIMTPKEAFR++SHKFHG Sbjct: 825 GALKETVDWGGRTMDKKKSKLVGLYDDGGTKEIRIERTDEFGRIMTPKEAFRMLSHKFHG 884 Query: 1796 KGPGKMKLEKKIKKYQEELKSKQMKASDTPSESMERMREAQARLKTPYLVLSGHVKPGQT 1975 KGPGKMK EK++K+YQE+LK+KQMKASDTP ++E+MREAQA+LKTPYLVLSGHVKPGQT Sbjct: 885 KGPGKMKQEKRMKQYQEDLKTKQMKASDTPLLAVEKMREAQAQLKTPYLVLSGHVKPGQT 944 Query: 1976 SDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 2122 SDPRSGFATVEKDH GSLTPMLGD+KVEHFLGIKRK E SMGPP +K Sbjct: 945 SDPRSGFATVEKDHLGSLTPMLGDKKVEHFLGIKRKPEIGSMGPPLPKK 993 >gb|KJB61483.1| hypothetical protein B456_009G361400 [Gossypium raimondii] Length = 878 Score = 801 bits (2069), Expect = 0.0 Identities = 431/707 (60%), Positives = 512/707 (72%), Gaps = 27/707 (3%) Frame = +2 Query: 14 KNRDQGYDKEMLRSKD-----------GERDEKLKLDYEDTRDITMQVKEVQNGESDNPE 160 KNR+ +KE R +D G +D +L LDYED RD Sbjct: 194 KNREADLEKERSRDRDNVGKNHEEDYEGSKDGELALDYEDRRD----------------- 236 Query: 161 KLSTKDHKQITE-SAADGSHPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEE 337 KD ++ S A +S+LE RI++MKE+RLKKKSEG+SE+ AWVS+SRK+E+ Sbjct: 237 ----KDEAELNAGSNASLVQASSSELEERIVRMKEDRLKKKSEGLSEVSAWVSRSRKLED 292 Query: 338 KLNAEKVKALHLSKVFEEQDNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVL 517 K NAEK KAL LSK+FEEQDN VQGE +DEE T DL GVK+LHGLDKV++GGAVVL Sbjct: 293 KRNAEKEKALQLSKIFEEQDNFVQGEDEDEEADNRPTHDLGGVKVLHGLDKVMDGGAVVL 352 Query: 518 TLRDQNILTDGDINEEADMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKIL 697 TL+DQ+IL DGD+NE+ DMLEN+EIGEQK+RDEAYKA+KKKTG YDDKFNEDPGS+KKIL Sbjct: 353 TLKDQSILADGDLNEDVDMLENIEIGEQKQRDEAYKAAKKKTGVYDDKFNEDPGSEKKIL 412 Query: 698 PQYDDPADEEGVTLDGSGHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTH 877 PQYDDP +EGVTLD G F+GEA + G P +N+ +DL GK+SSDYYT Sbjct: 413 PQYDDPVADEGVTLDERGRFTGEAEKKLEELRKRLLGVPTNNRVEDLNNVGKISSDYYTQ 472 Query: 878 EEMVQFXXXXXXXXXXXXXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVE 1057 EEM++F LD+DALEAEA+SAGLGAGDLGSR + RRQ+ K EE R E Sbjct: 473 
EEMLRF-KKPKKKKALRKKEKLDIDALEAEAVSAGLGAGDLGSRKDSRRQAIKEEEARSE 531 Query: 1058 ADMRTEXXXXXXXXXXXXXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALK 1237 A+ R LR EQT T++ EEDEN VF D+EDLYKSLEKAR+LALK Sbjct: 532 AEKRKNAYQAAFAKADEASKSLRLEQTHTVKPEEDENQVFADDEEDLYKSLEKARRLALK 591 Query: 1238 RQDEAGASGMQVVARLAETNNESEETQNHVSGG------IVITEMEEFVSKIHLDEEIHK 1399 +Q+E SG Q +A LA T+ ++ T +H S G +VITEMEEFV + LDEE HK Sbjct: 592 KQEE--KSGPQAIALLATTSASNQTTDDHTSTGEAQENKVVITEMEEFVWGLQLDEEAHK 649 Query: 1400 PEADDVFKDE-EVPKSFE---KEMEDEAGGWMEVKDTGADELPINEEKEDLVPDKIIHET 1567 P+++DVF DE EVP + E K E+E GGW EV DT ADE P NE+ +++VPD+ IHE Sbjct: 650 PDSEDVFMDEDEVPGASEQDRKNGENEVGGWTEVIDTSADEKPANEDNDEVVPDETIHEI 709 Query: 1568 AVGKGLSGALQLLKERGTLKETIDWGGRNMDKKKSKLVGIYENDGT-----KEIRIERTD 1732 AVGKGLSGAL+LLK+RGTLKETI+WGGRNMDKKKSKLVGI ++D K+IRIERTD Sbjct: 710 AVGKGLSGALKLLKDRGTLKETIEWGGRNMDKKKSKLVGIVDDDHQTDNRFKDIRIERTD 769 Query: 1733 EFGRIMTPKEAFRIISHKFHGKGPGKMKLEKKIKKYQEELKSKQMKASDTPSESMERMRE 1912 EFGRI+TPKEAFR++SHKFHGKGPGKMK EK++K+YQEELK KQMK SDTPS S+ERMRE Sbjct: 770 EFGRIVTPKEAFRMLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERMRE 829 Query: 1913 AQARLKTPYLVLSGHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRK 2053 AQA+LKTPYLVLSGHVKPGQTSDP SGFATVEKD PG LTPMLGDRK Sbjct: 830 AQAQLKTPYLVLSGHVKPGQTSDPASGFATVEKDFPGGLTPMLGDRK 876 >ref|XP_012077380.1| PREDICTED: SART-1 family protein DOT2 isoform X2 [Jatropha curcas] Length = 636 Score = 801 bits (2068), Expect = 0.0 Identities = 420/637 (65%), Positives = 492/637 (77%), Gaps = 14/637 (2%) Frame = +2 Query: 254 MKEERLKKKSEGVSEILAWVSKSRKVEEKLNAEKVKALHLSKVFEEQDNVVQGESDDEEV 433 MKEERLKK SE E+LAWV++SRK+EEK NAEK KA LSK+FEEQDN VQGES+DE+ Sbjct: 1 MKEERLKKNSEPGDEVLAWVNRSRKLEEKKNAEKQKAKQLSKIFEEQDNNVQGESEDEDS 60 Query: 434 PQHLTKDLAGVKILHGLDKVIEGGAVVLTLRDQNILTDGDINEEADMLENVEIGEQKRRD 613 +H T DLAGVK+LHGL+KV+EGGAVVLTL+DQ+IL DGDINEE DMLENVEIGEQKRRD Sbjct: 61 GEHTTHDLAGVKVLHGLEKVMEGGAVVLTLKDQSILADGDINEEVDMLENVEIGEQKRRD 120 Query: 614 EAYKASKKKTGTYDDKFNEDPGSQKKILPQYDDPADEEGVTLDGSGHFSGEAXXXXXXXX 793 
+AYKA+KKKTG YDDKFN+DP S+KKILPQYDD A +EGV LD G F+GEA Sbjct: 121 DAYKAAKKKTGIYDDKFNDDPASEKKILPQYDDSAADEGVALDERGRFTGEAEKKLEELR 180 Query: 794 XXIQGAPVSNQFQDLTTTGKLSSDYYTHEEMVQFXXXXXXXXXXXXXXXLDLDALEAEAI 973 +QG +N+F+DL+++GK+SSDYYTHEE++QF LD+DALEAEA+ Sbjct: 181 RRLQGVSTNNRFEDLSSSGKISSDYYTHEELLQF-KKPKKKKSLRKKEKLDIDALEAEAV 239 Query: 974 SAGLGAGDLGSRNEERRQSAKAEEERVEADMRTEXXXXXXXXXXXXXXXLRQEQTLTIQL 1153 SAGLG GDLGSRN RRQ+ + E+ER EA+MR+ LRQEQTL +L Sbjct: 240 SAGLGVGDLGSRNNGRRQAIRQEQERSEAEMRSSAYQAAYDKADEASKSLRQEQTLHAKL 299 Query: 1154 EEDENLVFGGDDEDLYKSLEKARKLALKRQDEAGASGMQVVARLA----ETNNESEETQN 1321 +EDEN VF DDEDLYKSLE+ARKLALK+Q+E ASG Q +ARLA T++++ + QN Sbjct: 300 DEDENPVFAEDDEDLYKSLERARKLALKKQEEK-ASGPQAIARLAAATTTTSSQTTDDQN 358 Query: 1322 HVSG-----GIVITEMEEFVSKIHLDEEIHKPEADDVFKDE-EVPKSFEKEMEDEAGGWM 1483 +G IV TEMEEFV + LDEE HK DDVF DE E P ++E +DE GGW Sbjct: 359 PTTGESQENKIVFTEMEEFVWGLQLDEESHKHGNDDVFMDEDEAPIVSDQEKKDETGGWT 418 Query: 1484 EVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSGALQLLKERGTLKETIDWGGRNMDK 1663 EV+D DE P+NE ED+VPD+ IHE VGKGLS AL+LLKERGTLKE+ +WGGRNMDK Sbjct: 419 EVQDIDKDENPVNENNEDIVPDETIHEVPVGKGLSAALKLLKERGTLKESTEWGGRNMDK 478 Query: 1664 KKSKLVGI----YENDGTKEIRIERTDEFGRIMTPKEAFRIISHKFHGKGPGKMKLEKKI 1831 KKSKLVGI +N+ K+IRI+RTDE+GR +TPKEAFRIISHKFHGKGPGKMK EK++ Sbjct: 479 KKSKLVGIVDSDVDNERFKDIRIDRTDEYGRTLTPKEAFRIISHKFHGKGPGKMKQEKRM 538 Query: 1832 KKYQEELKSKQMKASDTPSESMERMREAQARLKTPYLVLSGHVKPGQTSDPRSGFATVEK 2011 K+Y EELK KQMK SDTPS S+ERMREAQA+LKTPYLVLSGHVKPGQTSDPRSGFATVEK Sbjct: 539 KQYLEELKMKQMKNSDTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEK 598 Query: 2012 DHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 2122 D PG LTPMLGD+KVEHFLGIKRKAEP + PKK K Sbjct: 599 DLPGGLTPMLGDKKVEHFLGIKRKAEPGNSNAPKKPK 635 >gb|KHG25959.1| U4/U6.U5 tri-snRNP-associated 1 [Gossypium arboreum] Length = 955 Score = 800 bits (2066), Expect = 0.0 Identities = 449/779 (57%), Positives = 532/779 (68%), Gaps = 76/779 (9%) Frame = +2 Query: 14 KNRDQGYDKEMLRSKD-----------GERDEKLKLDYEDTRDITMQVKEVQNGESDNPE 160 KNR+ +KE 
R +D G +D +L LDYED RD Sbjct: 200 KNRETDLEKERSRDRDNVVKNHEEDYEGSKDGELALDYEDRRD----------------- 242 Query: 161 KLSTKDHKQITE-SAADGSHPPTSDLESRIMKMKEERLKKKSEGVSEILAWVSKSRKVEE 337 KD ++ S A +S+LE RI++MKE RLKKKSEG+SE+ AWVS+SRK+E+ Sbjct: 243 ----KDEAELNAGSNASLVQASSSELEERIVRMKEVRLKKKSEGLSEVSAWVSRSRKLED 298 Query: 338 KLNAEKVKALHLSKVFEEQDNVVQGESDDEEVPQHLTKDLAGVKILHGLDKVIEGGAVVL 517 K NAEK KAL LSK+FEEQDN VQGE +DEE + DL GVK+LHGLDKV++GGAVVL Sbjct: 299 KRNAEKEKALQLSKIFEEQDNFVQGEDEDEEADNRPSHDLGGVKVLHGLDKVMDGGAVVL 358 Query: 518 TLRDQNILTDGDINEEADMLENVEIGEQKRRDEAYKASKKKTGTYDDKFNEDPGSQKKIL 697 TL+DQ+IL DGD+NE+ DMLEN+EIGEQK+RDEAYKA+KKKTG YDDKFNEDPGS+KKIL Sbjct: 359 TLKDQSILADGDLNEDVDMLENIEIGEQKQRDEAYKAAKKKTGVYDDKFNEDPGSEKKIL 418 Query: 698 PQYDDPADEEGVTLDGSGHFSGEAXXXXXXXXXXIQGAPVSNQFQDLTTTGKLSSDYYTH 877 PQYDDP +EGVTLD G F+GEA + G P +N+ +DL GK+SSDYYT Sbjct: 419 PQYDDPVADEGVTLDERGRFTGEAEKKLDELRKRLLGVPTNNRVEDLNNVGKVSSDYYTQ 478 Query: 878 EEMVQFXXXXXXXXXXXXXXXLDLDALEAEAISAGLGAGDLGSRNEERRQSAKAEEERVE 1057 EEM++F LD+DALEAEA+SAGLGAGDLGSRN+ RRQ+ K EE R E Sbjct: 479 EEMLRF-KKPKKKKALRKKEKLDIDALEAEAVSAGLGAGDLGSRNDSRRQAIKEEEARSE 537 Query: 1058 ADMRTEXXXXXXXXXXXXXXXLRQEQTLTIQLEEDENLVFGGDDEDLYKSLEKARKLALK 1237 A+ R LR EQTLT++ EEDEN VF D+EDLYKSLEKAR+LALK Sbjct: 538 AEKRNNAYQAAFAKADEASKSLRLEQTLTVKPEEDENQVFADDEEDLYKSLEKARRLALK 597 Query: 1238 RQDEAGASGMQVVARLAET--NNESEETQNHVSG-----GIVITEMEEFVSKIHLDE--- 1387 +Q+E SG Q VA LA T +N++ + QN +G +VITEMEEFV + LDE Sbjct: 598 KQEE--KSGPQAVALLAATSASNQTTDDQNTSTGEAQENKVVITEMEEFVWGLQLDEATK 655 Query: 1388 ------------------------EIHKPEADDVFKDE-EVPKSFEKEM---EDEAGGWM 1483 E HKP+++DVF DE EVP + E++ E+E GGW Sbjct: 656 SSAKIWNIFSFMGSCVRLMLIWSSEAHKPDSEDVFMDEDEVPGASEQDRENGENEVGGWT 715 Query: 1484 EVKDTGADELPINEEKEDLVPDKIIHETAVGKGLSGALQLLKERGTLKETIDWGGRNMDK 1663 EV DT ADE P NE+ ++VPD+ IHE AVGKGLSGAL+LLK+RGTLKETI+WGGRNMDK Sbjct: 716 EVVDTSADEKPANEDNNEVVPDETIHEIAVGKGLSGALKLLKDRGTLKETIEWGGRNMDK 775 Query: 1664 KKSKLVGIYENDGT-----KEIRIERTDEFGRIMTPKEAFRIISHKFHGKGPGKMKLEKK 1828 
KKSKLVGI ++D K+IRIERTDEFGRI+TPKEAFR++SHKFHGKGPGKMK EK+ Sbjct: 776 KKSKLVGIVDDDHQTDNRFKDIRIERTDEFGRIVTPKEAFRMLSHKFHGKGPGKMKQEKR 835 Query: 1829 IKKYQEELKSKQMKASDTPSESMERMREAQARLKTPYLVLSGHVKPG------------- 1969 +K+YQEELK KQMK SDTPS S+ERMREAQA+LKTPYLVLSGHVKPG Sbjct: 836 MKQYQEELKLKQMKNSDTPSLSVERMREAQAQLKTPYLVLSGHVKPGYRDLTLCKMKLGL 895 Query: 1970 -----QTSDPRSGFATVEKDHPGSLTPMLGDR---KVEHFLGIKRKAEPSSMGPPKKQK 2122 QTSDP SGFATVEKD PG LTPMLGDR KVEHFLGIKRKAE + G PKK K Sbjct: 896 PFYAMQTSDPASGFATVEKDFPGGLTPMLGDRKAMKVEHFLGIKRKAEAGNSGTPKKPK 954