BLASTX nr result
ID: Cinnamomum23_contig00000958
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum23_contig00000958 (3352 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_010656678.1| PREDICTED: SART-1 family protein DOT2 [Vitis...   826   0.0
ref|XP_010256356.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   800   0.0
ref|XP_006836392.1| PREDICTED: SART-1 family protein DOT2 [Ambor...   756   0.0
ref|XP_012077379.1| PREDICTED: SART-1 family protein DOT2 isofor...   749   0.0
ref|XP_008806835.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   748   0.0
ref|XP_008806833.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   748   0.0
ref|XP_011011622.1| PREDICTED: SART-1 family protein DOT2 isofor...   743   0.0
ref|XP_012441144.1| PREDICTED: SART-1 family protein DOT2 [Gossy...   741   0.0
ref|XP_002297938.2| hypothetical protein POPTR_0001s11550g [Popu...   741   0.0
ref|XP_011011623.1| PREDICTED: SART-1 family protein DOT2 isofor...   736   0.0
ref|XP_010926911.1| PREDICTED: SART-1 family protein DOT2 [Elaei...   735   0.0
ref|XP_011094061.1| PREDICTED: SART-1 family protein DOT2 [Sesam...   734   0.0
ref|XP_002516516.1| conserved hypothetical protein [Ricinus comm...   728   0.0
ref|XP_010033990.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   728   0.0
ref|XP_007022025.1| U4/U6.U5 tri-snRNP-associated protein 1 isof...   721   0.0
ref|XP_010102332.1| hypothetical protein L484_015280 [Morus nota...   719   0.0
ref|XP_012077380.1| PREDICTED: SART-1 family protein DOT2 isofor...   716   0.0
ref|XP_003530377.1| PREDICTED: zinc finger CCCH domain-containin...   711   0.0
gb|KJB61483.1| hypothetical protein B456_009G361400 [Gossypium r... 
708 0.0 gb|KHN38139.1| U4/U6.U5 tri-snRNP-associated protein 1 [Glycine ... 708 0.0 >ref|XP_010656678.1| PREDICTED: SART-1 family protein DOT2 [Vitis vinifera] gi|296090475|emb|CBI40671.3| unnamed protein product [Vitis vinifera] Length = 944 Score = 826 bits (2134), Expect = 0.0 Identities = 434/712 (60%), Positives = 522/712 (73%), Gaps = 10/712 (1%) Frame = +3 Query: 1005 KNRDQGYDKEMLRSTNGERDEKLKLDYEDTRD--ITMQGKEVQYGEGDNPEKLSTMDYKQ 1178 KNRD+G+D RS +G +D+KLKLD D RD +T QG+ + E D+ +++++ Sbjct: 241 KNRDEGHD----RSKDGGKDDKLKLDGGDNRDRDVTKQGRGSHHDEDDS----RAIEHEK 292 Query: 1179 RTESTADGSHPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIGEKLNADKEKAL 1358 E A G ST++L+ RI++MKEER+K+KSEG SEVLAWVN+SRK+ E+ NA+KEKAL Sbjct: 293 NAEG-ASGPQSSTAQLQERILRMKEERVKRKSEGSSEVLAWVNRSRKVEEQRNAEKEKAL 351 Query: 1359 HLSKVFEEQDNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVVLTLKDQNILAD 1538 LSK+FEEQDN+DQGESDDE+ +H ++DLAGVK+LHGLDKVIEGGAVVLTLKDQ+ILA+ Sbjct: 352 QLSKIFEEQDNIDQGESDDEKPTRHSSQDLAGVKVLHGLDKVIEGGAVVLTLKDQDILAN 411 Query: 1539 GDINEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKILPQYDDPADEE 1718 GDINE+VDMLENVEIGEQ FN++P S+KKILPQYDDP +E Sbjct: 412 GDINEDVDMLENVEIGEQKRRDEAYKAAKKKTGIYEDKFNDEPGSEKKILPQYDDPVTDE 471 Query: 1719 GVTLDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYTHEEMVQFXXXX 1898 G+ LD SG F+GEA +Q S +N+F+DL T GK SSDYYTHEEM+QF Sbjct: 472 GLALDASGRFTGEAEKKLEELRRRLQGVSTNNRFEDLNTYGKNSSDYYTHEEMLQFKKPK 531 Query: 1899 XXXXXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAKADMRTXXXXXX 2078 ++DALEAEA+SAGLGVGDLGSRN+ +RQS + E+ER++A+MR Sbjct: 532 KKKSLRKKEKLNIDALEAEAVSAGLGVGDLGSRNDGKRQSIREEQERSEAEMRNSAYQLA 591 Query: 2079 XXXXXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALKRQDDAGASGMH 2258 LR QTL +Q EE+EN VFG DDE+L KSL++ARKL L++QD+A SG Sbjct: 592 YAKADEASKALRLDQTLPVQLEENENQVFGEDDEELQKSLQRARKLVLQKQDEAATSGPQ 651 Query: 2259 VVARLAETNNESE--ETQKPVSGG-----IVITEMEEFVSKIHLDEEINKPEADDVFEDE 2417 +A LA T S+ + Q P+SG +V TEMEEFV + L++E +KP+ +DVF DE Sbjct: 652 AIALLASTTTSSQNVDNQNPISGESQENRVVFTEMEEFVWGLQLEDEAHKPDGEDVFMDE 711 Query: 
2418 -EVPKSFEKEMEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEPAVGKGLSGALQLL 2594 E PK+ ++E +DE GGW EVKDT DELP+NE KE+++PD IHE AVGKGLSGALQLL Sbjct: 712 DEAPKASDQERKDEAGGWTEVKDTDKDELPVNENKEEMVPDDTIHEVAVGKGLSGALQLL 771 Query: 2595 KERGTLKETINWGGRNMDKKKSKLVGIYENDGTKEIRIERTDEFGRIMTPKEAFRIISHK 2774 KERGTLKE I WGGRNMDKKKSKLVGIY+N GTKEIRIERTDEFGRIMTPKEAFR+ISHK Sbjct: 772 KERGTLKEGIEWGGRNMDKKKSKLVGIYDNTGTKEIRIERTDEFGRIMTPKEAFRMISHK 831 Query: 2775 FHGKGPGXXXXXXXXXXXXXXXXXXXXXASDTPSESMERMREAQARLKTPYLVLSGHVKP 2954 FHGKGPG SDTPS+S+ERMREAQARLKTPYLVLSGHVKP Sbjct: 832 FHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSQSVERMREAQARLKTPYLVLSGHVKP 891 Query: 2955 GQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 3110 GQTSDPRSGFATVEKD PGSLTPMLGDRKVEHFLGIKRKAEPS+MGPPKK K Sbjct: 892 GQTSDPRSGFATVEKDVPGSLTPMLGDRKVEHFLGIKRKAEPSNMGPPKKPK 943 Score = 74.7 bits (182), Expect = 5e-10 Identities = 49/117 (41%), Positives = 59/117 (50%), Gaps = 4/117 (3%) Frame = +3 Query: 132 MDMEWSESRYEHKQDSRCESSSPNAVYREEKMDDFEDELSTVIDTKDKEKSRDSGKH*SK 311 MDM+WSE + E + R SP Y + DD E+ S KH SK Sbjct: 1 MDMDWSEPKPERSDELRDRDDSPTRDYHDGAYDDLEEN-----------GIEKSSKHRSK 49 Query: 312 DRKKERR---DHGSKDRERAKI-DLLKESEENQNELRENDHIGSRERRKEEHKENTK 470 DRKK RR DH KDRER+K D LKE E+ + E D + SRERRKE+ E K Sbjct: 50 DRKKSRREEKDHRGKDRERSKAGDGLKEREKETKD-SEKDRVTSRERRKEDRDEREK 105 >ref|XP_010256356.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001422|ref|XP_010256357.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001427|ref|XP_010256358.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001430|ref|XP_010256359.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001433|ref|XP_010256360.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001436|ref|XP_010256361.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] Length = 851 Score = 800 bits (2067), Expect = 0.0 Identities = 423/702 (60%), 
Positives = 510/702 (72%), Gaps = 7/702 (0%) Frame = +3 Query: 1026 DKEMLRSTNGERDEKLKLDYEDTRDITMQGKEVQYGEGDNPEKLSTMDYKQRTESTADGS 1205 D+ R + +DEKL LD + RD+ Q KEVQ+ D +S ++ K++ + GS Sbjct: 153 DESQGRGKDVGKDEKLDLDGGNDRDVVKQVKEVQH---DVVVDMS-VENKKKVDGAMGGS 208 Query: 1206 HPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIGEKLNADKEKALHLSKVFEEQ 1385 PST ELE RI+KM+EER KKKSEGVSEVL+WVNKSRK+ EK NA+K+KAL LSKVFEEQ Sbjct: 209 QPSTGELEERILKMREERSKKKSEGVSEVLSWVNKSRKLEEKRNAEKQKALQLSKVFEEQ 268 Query: 1386 DNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVVLTLKDQNILADGDINEEVDM 1565 D +DQGES+DE+ +H +KDLAGVKILHG+DKVIEGGAVVLTLKDQNILA+ D+NEE D+ Sbjct: 269 DKIDQGESEDEDTARHTSKDLAGVKILHGIDKVIEGGAVVLTLKDQNILANDDVNEEADV 328 Query: 1566 LENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKILPQYDDPADEEGVTLDVSGH 1745 LENVEIGEQ F+ + +QKKILPQYDDP ++EG+ LD SG Sbjct: 329 LENVEIGEQKQRDAAYKAAKKKTGIYEDKFSGEDGAQKKILPQYDDPVEDEGLVLDESGR 388 Query: 1746 FSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYTHEEMVQFXXXXXXXXXXXXX 1925 F+GEA +Q S SN F+DL ++ KI+SD+YTHEEM+QF Sbjct: 389 FAGEAEKKLEELRKRLQGVSASNHFEDLNSSAKITSDFYTHEEMLQFKKPKKKKSLRKKV 448 Query: 1926 XXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAKADMRTXXXXXXXXXXXXXXX 2105 DLDALEAEAISAG GVGDLGSR + +RQ+ K ++ER++A+MR+ Sbjct: 449 KLDLDALEAEAISAGFGVGDLGSRKDGQRQATKEQQERSEAEMRSNAYQSAFAKAEEASK 508 Query: 2106 XLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALKRQDDAGASGMHVVARLAET- 2282 LRQ QTLT+Q EE+E+ VFG D+EDLYKSLEKARKLALK Q++A ASG VA LA T Sbjct: 509 TLRQEQTLTVQVEENESPVFGDDEEDLYKSLEKARKLALKTQNEAAASGPQAVALLASTV 568 Query: 2283 -----NNESEETQKPVSGGIVITEMEEFVSKIHLDEEINKPEADDVFEDEE-VPKSFEKE 2444 + E+ + +P +V TEMEEFV + L+EE K E++DVF DE+ VPK+ ++E Sbjct: 569 SNQPKDEENLTSGEPQENKVVFTEMEEFVWGLQLNEEARKLESEDVFMDEDNVPKASDQE 628 Query: 2445 MEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEPAVGKGLSGALQLLKERGTLKETI 2624 ++DE GGW EV D +E P+ EEKE+++PD+ IHE A+GKGLSGAL+LLKERGTLKET+ Sbjct: 629 IKDEAGGWTEVNDIDENEHPVEEEKEEVVPDETIHEVAIGKGLSGALKLLKERGTLKETV 688 Query: 2625 NWGGRNMDKKKSKLVGIYENDGTKEIRIERTDEFGRIMTPKEAFRIISHKFHGKGPGXXX 2804 +WGGRNMDKKKSKLVGIY++ G 
KEIRIERTDEFGRIMTPKEAFR+ISHKFHGKGPG Sbjct: 689 DWGGRNMDKKKSKLVGIYDDGGPKEIRIERTDEFGRIMTPKEAFRVISHKFHGKGPGKMK 748 Query: 2805 XXXXXXXXXXXXXXXXXXASDTPSESMERMREAQARLKTPYLVLSGHVKPGQTSDPRSGF 2984 SDTPS+SMERMREAQARLKTPYLVLSGHVKPGQTSDPRSGF Sbjct: 749 QEKRMKQYQEELKLKQMKNSDTPSQSMERMREAQARLKTPYLVLSGHVKPGQTSDPRSGF 808 Query: 2985 ATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 3110 ATVEKD PG LTPMLGD+KVEHFLGIKRKAEPS+MGPPKK K Sbjct: 809 ATVEKDIPGGLTPMLGDKKVEHFLGIKRKAEPSNMGPPKKSK 850 >ref|XP_006836392.1| PREDICTED: SART-1 family protein DOT2 [Amborella trichopoda] gi|548838910|gb|ERM99245.1| hypothetical protein AMTR_s00092p00135160 [Amborella trichopoda] Length = 1028 Score = 756 bits (1951), Expect = 0.0 Identities = 406/708 (57%), Positives = 494/708 (69%), Gaps = 5/708 (0%) Frame = +3 Query: 1002 GKNRDQGYDKEMLRSTNGERDEKLKLDYEDTRDITMQGKEVQYGEGDNPEKLSTMDYKQR 1181 GK++D G DKE R GE++ K K+D D RDIT Q VQ + + ++ MD+K++ Sbjct: 322 GKSKDHGRDKEFDRGKEGEKEAKPKIDAWDGRDITEQEDNVQDDKDNTYDRTGAMDHKEK 381 Query: 1182 TESTADGSHPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIGEKLNADKEKALH 1361 E A S PSTSE+E R+ KM+EER+KKK+EGVSEV +WVNKSRKI EKL+++KEKALH Sbjct: 382 NEIQAGVSRPSTSEIEERLAKMREERMKKKNEGVSEVSSWVNKSRKIEEKLSSEKEKALH 441 Query: 1362 LSKVFEEQDNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVVLTLKDQNILADG 1541 L+KVF EQD+V Q ESD+EEE QH KDLAGVK+LHGL++VI GGAVVLTLKDQNILADG Sbjct: 442 LAKVFAEQDSVVQ-ESDEEEEAQHSGKDLAGVKVLHGLEQVIVGGAVVLTLKDQNILADG 500 Query: 1542 DINEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKILPQYDDPADEEG 1721 D+N EVDMLENVE+GEQ F +D SQKKILPQYDD + +EG Sbjct: 501 DLNNEVDMLENVELGEQKRRDEAYKAAKKKPGIYEDKFADDDGSQKKILPQYDDTSKDEG 560 Query: 1722 VTLDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYTHEEMVQFXXXXX 1901 V LD SGH + EA +Q S F+DLT TGK+SSDYYT EEM+QF Sbjct: 561 VALDESGHITREAQKKLEELRKRLQGASTGQHFEDLTATGKVSSDYYTQEEMLQFKKPKK 620 Query: 1902 XXXXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAKADMRTXXXXXXX 2081 DLDALEAEAI++GLGVGD GSR + +RQ AK EEE A+A+ R Sbjct: 621 KKALRKKVKLDLDALEAEAIASGLGVGDRGSRADAQRQRAKEEEEWAEAETRKEAYQSAF 680 
Query: 2082 XXXXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALKRQDDAGASGMHV 2261 LR+ QTL ++ +EDENL FG DDEDL+KS+E+ARKLA K+QD+ ASG Sbjct: 681 AKANESTKALREEQTLKVEGDEDENLAFG-DDEDLHKSIEEARKLARKKQDEGAASGPLA 739 Query: 2262 VARLAETNNESEETQ---KPVSGGIVITEMEEFVSKIHLDEEINKPEADDVF-EDEEVPK 2429 VA+LA + +ES++ + +P +V TE++EFV + DE P+A+DVF ED+EV Sbjct: 740 VAQLAVSASESKDAEASGEPQENRLVFTEVDEFVLGLQHDEGAQNPDAEDVFKEDDEVQN 799 Query: 2430 SFEK-EMEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEPAVGKGLSGALQLLKERG 2606 ++ E ++ GGW +V ++ DE EE E+++PD I E VGKGLSGALQLLKERG Sbjct: 800 PIKQDEPMEQVGGWTDVIESEKDEQMKTEEDEEVVPDATIQEAVVGKGLSGALQLLKERG 859 Query: 2607 TLKETINWGGRNMDKKKSKLVGIYENDGTKEIRIERTDEFGRIMTPKEAFRIISHKFHGK 2786 TLKE I+WGGRNMDKKKSKLVG+ ENDG KEI ++R DEFGRIMTPKEAFR +SHKFHGK Sbjct: 860 TLKEAIDWGGRNMDKKKSKLVGVRENDGAKEIVLDRLDEFGRIMTPKEAFRKLSHKFHGK 919 Query: 2787 GPGXXXXXXXXXXXXXXXXXXXXXASDTPSESMERMREAQARLKTPYLVLSGHVKPGQTS 2966 GPG ASDTP SME+MREAQA+ ++PY+VLSG +KPGQTS Sbjct: 920 GPGKMKQEKRMKQFMEELKLKQMKASDTPLLSMEKMREAQAKTRSPYIVLSGQIKPGQTS 979 Query: 2967 DPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 3110 DPRSGFATVEKD PGSLTPMLGDRKVEHFLGIKRKAEPS+MGPPKK K Sbjct: 980 DPRSGFATVEKDQPGSLTPMLGDRKVEHFLGIKRKAEPSNMGPPKKPK 1027 >ref|XP_012077379.1| PREDICTED: SART-1 family protein DOT2 isoform X1 [Jatropha curcas] gi|643724962|gb|KDP34163.1| hypothetical protein JCGZ_07734 [Jatropha curcas] Length = 864 Score = 749 bits (1933), Expect = 0.0 Identities = 410/729 (56%), Positives = 499/729 (68%), Gaps = 27/729 (3%) Frame = +3 Query: 1005 KNRDQGYDKEMLRST------------NGERDEKLKLDYEDTRDIT-MQGKEVQYGEGDN 1145 + RD YDKE LR + +D+ +++DYE+ +D + ++ +V + D Sbjct: 146 RERDSDYDKERLRDREKVSKRSHEEDYDRSKDDVVEMDYENNKDSSVLKQSKVSFDNKD- 204 Query: 1146 PEKLSTMDYKQRTESTADGSHPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIG 1325 +Q+ E T+ G S+LE RI+KMKEERLKK SE EVLAWVN+SRK+ Sbjct: 205 ---------EQKAEETSRGGSAPVSQLEERILKMKEERLKKNSEPGDEVLAWVNRSRKLE 255 Query: 1326 EKLNADKEKALHLSKVFEEQDNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVV 1505 EK NA+K+KA LSK+FEEQDN 
QGES+DE+ +H T DLAGVK+LHGL+KV+EGGAVV Sbjct: 256 EKKNAEKQKAKQLSKIFEEQDNNVQGESEDEDSGEHTTHDLAGVKVLHGLEKVMEGGAVV 315 Query: 1506 LTLKDQNILADGDINEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKI 1685 LTLKDQ+ILADGDINEEVDMLENVEIGEQ FN+DP+S+KKI Sbjct: 316 LTLKDQSILADGDINEEVDMLENVEIGEQKRRDDAYKAAKKKTGIYDDKFNDDPASEKKI 375 Query: 1686 LPQYDDPADEEGVTLDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYT 1865 LPQYDD A +EGV LD G F+GEA +Q S +N+F+DL+++GKISSDYYT Sbjct: 376 LPQYDDSAADEGVALDERGRFTGEAEKKLEELRRRLQGVSTNNRFEDLSSSGKISSDYYT 435 Query: 1866 HEEMVQFXXXXXXXXXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAK 2045 HEE++QF D+DALEAEA+SAGLGVGDLGSRN RRQ+ + E+ER++ Sbjct: 436 HEELLQFKKPKKKKSLRKKEKLDIDALEAEAVSAGLGVGDLGSRNNGRRQAIRQEQERSE 495 Query: 2046 ADMRTXXXXXXXXXXXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALK 2225 A+MR+ LRQ QTL + +EDEN VF DDEDLYKSLE+ARKLALK Sbjct: 496 AEMRSSAYQAAYDKADEASKSLRQEQTLHAKLDEDENPVFAEDDEDLYKSLERARKLALK 555 Query: 2226 RQDDAGASGMHVVARLA----ETNNESEETQKPVSG-----GIVITEMEEFVSKIHLDEE 2378 +Q++ ASG +ARLA T++++ + Q P +G IV TEMEEFV + LDEE Sbjct: 556 KQEEK-ASGPQAIARLAAATTTTSSQTTDDQNPTTGESQENKIVFTEMEEFVWGLQLDEE 614 Query: 2379 INKPEADDVFEDE-EVPKSFEKEMEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEP 2555 +K DDVF DE E P ++E +DETGGW EV+D DE P+NE ED++PD+ IHE Sbjct: 615 SHKHGNDDVFMDEDEAPIVSDQEKKDETGGWTEVQDIDKDENPVNENNEDIVPDETIHEV 674 Query: 2556 AVGKGLSGALQLLKERGTLKETINWGGRNMDKKKSKLVGI----YENDGTKEIRIERTDE 2723 VGKGLS AL+LLKERGTLKE+ WGGRNMDKKKSKLVGI +N+ K+IRI+RTDE Sbjct: 675 PVGKGLSAALKLLKERGTLKESTEWGGRNMDKKKSKLVGIVDSDVDNERFKDIRIDRTDE 734 Query: 2724 FGRIMTPKEAFRIISHKFHGKGPGXXXXXXXXXXXXXXXXXXXXXASDTPSESMERMREA 2903 +GR +TPKEAFRIISHKFHGKGPG SDTPS S+ERMREA Sbjct: 735 YGRTLTPKEAFRIISHKFHGKGPGKMKQEKRMKQYLEELKMKQMKNSDTPSLSVERMREA 794 Query: 2904 QARLKTPYLVLSGHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPS 3083 QA+LKTPYLVLSGHVKPGQTSDPRSGFATVEKD PG LTPMLGD+KVEHFLGIKRKAEP Sbjct: 795 QAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDKKVEHFLGIKRKAEPG 854 Query: 3084 SMGPPKKQK 3110 + PKK K Sbjct: 855 NSNAPKKPK 863 
>ref|XP_008806835.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 isoform X2 [Phoenix dactylifera] Length = 1013 Score = 748 bits (1932), Expect = 0.0 Identities = 408/708 (57%), Positives = 494/708 (69%), Gaps = 10/708 (1%) Frame = +3 Query: 1017 QGYDKEMLRSTNGERDEKLKLDYEDTRDITMQGKEVQYGEGD---NPEKLSTMDYKQRTE 1187 +G ++E+ R+ GE+DEK+K D D+R I +G+EVQ EGD N + LS++ Sbjct: 321 RGKEREIGRAREGEKDEKVKGDGGDSR-IARKGQEVQDDEGDLTHNEKPLSSI------- 372 Query: 1188 STADGSHPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIGEKLNADKEKALHLS 1367 STS+LE R+VKMKEERLK+KS+G SE+ +WVNKSRK+ EK A+KEKAL LS Sbjct: 373 --------STSKLEERVVKMKEERLKRKSDGASEISSWVNKSRKLEEKWTAEKEKALRLS 424 Query: 1368 KVFEEQDNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVVLTLKDQNILADGDI 1547 K EEQDN+ ES+DEE H DLAG KILHGLDKV+EGGAVVLTLKDQ+ILADGDI Sbjct: 425 KALEEQDNI-LAESEDEEATGHSGNDLAGAKILHGLDKVMEGGAVVLTLKDQSILADGDI 483 Query: 1548 NEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKILPQYDDPADEEGVT 1727 NEE DMLENVEIGEQ F++D SQK ILPQYD+ ++EGVT Sbjct: 484 NEEADMLENVEIGEQKQRDEAYRAAKKRTGLYDDKFSDDIGSQKTILPQYDNQNEDEGVT 543 Query: 1728 LDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYTHEEMVQFXXXXXXX 1907 LD SG F+GEA I+ ++ +DLT++GKISSDYYT +EM+QF Sbjct: 544 LDESGRFTGEAEKKLEELRKRIEGGAIKKSNEDLTSSGKISSDYYTPDEMLQFKKPKKKK 603 Query: 1908 XXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAKADMRTXXXXXXXXX 2087 DLDALEAEAISAGLG GDLGSRN+ RRQ+AK E+E+A+A+ R+ Sbjct: 604 SLRKKEKLDLDALEAEAISAGLGAGDLGSRNDLRRQTAKEEQEKAEAEKRSHAYQSAIAK 663 Query: 2088 XXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALKRQDDAGASGMHVVA 2267 LRQ QT T++ ED+NLVFG D ED+++S+ +ARKLALK+QD+ SG VA Sbjct: 664 AEEASKALRQEQTSTVKSVEDDNLVFGEDYEDVHRSIGQARKLALKKQDETAVSGPEAVA 723 Query: 2268 RLAETNNESEETQKPVSGG------IVITEMEEFVSKIHLDEEINKPEADDVFEDEE-VP 2426 +A T E E+ P GG ++ITEMEEFV + + E+ +KPE++DVF+DEE +P Sbjct: 724 LVATTKKEQEDAS-PTEGGEPQENKVIITEMEEFVLGLQITEDTHKPESEDVFKDEEDIP 782 Query: 2427 KSFEKEMEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEPAVGKGLSGALQLLKERG 2606 K E E E E GGW EV +T E +NEEKED+ PD+IIHE ++GKGLSGAL+LLKERG Sbjct: 783 
KPLELETEAEVGGWTEVMETDDTEAAVNEEKEDINPDEIIHETSMGKGLSGALKLLKERG 842 Query: 2607 TLKETINWGGRNMDKKKSKLVGIYENDGTKEIRIERTDEFGRIMTPKEAFRIISHKFHGK 2786 TL E+I+WGGRNMDKKKSKLVGI +N+G KEIRIERTDEFGRIMTPKEAFR++SHKFHGK Sbjct: 843 TLNESIDWGGRNMDKKKSKLVGINDNEGPKEIRIERTDEFGRIMTPKEAFRMLSHKFHGK 902 Query: 2787 GPGXXXXXXXXXXXXXXXXXXXXXASDTPSESMERMREAQARLKTPYLVLSGHVKPGQTS 2966 GPG ASDTP +ME+MREAQARLKTPYLVLSGHVKPGQTS Sbjct: 903 GPGKMKQEKRMKQYQEDLKTKQMKASDTPLLAMEKMREAQARLKTPYLVLSGHVKPGQTS 962 Query: 2967 DPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 3110 DPRSGFATVEKDH GSLTPMLGD+KVEHFLGI RK + SMGPP +K Sbjct: 963 DPRSGFATVEKDHLGSLTPMLGDKKVEHFLGINRKPDARSMGPPPPKK 1010 Score = 68.6 bits (166), Expect = 3e-08 Identities = 49/128 (38%), Positives = 69/128 (53%), Gaps = 15/128 (11%) Frame = +3 Query: 132 MDMEWSESRYEHKQDSRCESSSPNAVYREEKMDDFED-----------ELSTVIDTKDKE 278 MDME E R E + ++R + S R+E+ D+ ED + +KDK Sbjct: 1 MDMELGEIRME-QDEARFGNGSLERAARDEETDELEDYGGDMETDGIGKEDNDSGSKDKG 59 Query: 279 KSRDSGKH*SKDRKKERRD---HGSKDRERAKIDLLKESEENQNELRENDHI-GSRERRK 446 KSR+SGKH SK+ KK RR+ HGS+D ER+K + + ++ + R+ D SRERRK Sbjct: 60 KSRESGKHKSKEGKKRRREEKVHGSRDGERSKEREKENRDLDRYDTRDKDRFESSRERRK 119 Query: 447 EEHKENTK 470 EE E K Sbjct: 120 EERLEFNK 127 >ref|XP_008806833.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 isoform X1 [Phoenix dactylifera] Length = 1040 Score = 748 bits (1932), Expect = 0.0 Identities = 408/708 (57%), Positives = 494/708 (69%), Gaps = 10/708 (1%) Frame = +3 Query: 1017 QGYDKEMLRSTNGERDEKLKLDYEDTRDITMQGKEVQYGEGD---NPEKLSTMDYKQRTE 1187 +G ++E+ R+ GE+DEK+K D D+R I +G+EVQ EGD N + LS++ Sbjct: 348 RGKEREIGRAREGEKDEKVKGDGGDSR-IARKGQEVQDDEGDLTHNEKPLSSI------- 399 Query: 1188 STADGSHPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIGEKLNADKEKALHLS 1367 STS+LE R+VKMKEERLK+KS+G SE+ +WVNKSRK+ EK A+KEKAL LS Sbjct: 400 --------STSKLEERVVKMKEERLKRKSDGASEISSWVNKSRKLEEKWTAEKEKALRLS 451 Query: 1368 KVFEEQDNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVVLTLKDQNILADGDI 1547 K EEQDN+ ES+DEE H DLAG 
KILHGLDKV+EGGAVVLTLKDQ+ILADGDI Sbjct: 452 KALEEQDNI-LAESEDEEATGHSGNDLAGAKILHGLDKVMEGGAVVLTLKDQSILADGDI 510 Query: 1548 NEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKILPQYDDPADEEGVT 1727 NEE DMLENVEIGEQ F++D SQK ILPQYD+ ++EGVT Sbjct: 511 NEEADMLENVEIGEQKQRDEAYRAAKKRTGLYDDKFSDDIGSQKTILPQYDNQNEDEGVT 570 Query: 1728 LDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYTHEEMVQFXXXXXXX 1907 LD SG F+GEA I+ ++ +DLT++GKISSDYYT +EM+QF Sbjct: 571 LDESGRFTGEAEKKLEELRKRIEGGAIKKSNEDLTSSGKISSDYYTPDEMLQFKKPKKKK 630 Query: 1908 XXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAKADMRTXXXXXXXXX 2087 DLDALEAEAISAGLG GDLGSRN+ RRQ+AK E+E+A+A+ R+ Sbjct: 631 SLRKKEKLDLDALEAEAISAGLGAGDLGSRNDLRRQTAKEEQEKAEAEKRSHAYQSAIAK 690 Query: 2088 XXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALKRQDDAGASGMHVVA 2267 LRQ QT T++ ED+NLVFG D ED+++S+ +ARKLALK+QD+ SG VA Sbjct: 691 AEEASKALRQEQTSTVKSVEDDNLVFGEDYEDVHRSIGQARKLALKKQDETAVSGPEAVA 750 Query: 2268 RLAETNNESEETQKPVSGG------IVITEMEEFVSKIHLDEEINKPEADDVFEDEE-VP 2426 +A T E E+ P GG ++ITEMEEFV + + E+ +KPE++DVF+DEE +P Sbjct: 751 LVATTKKEQEDAS-PTEGGEPQENKVIITEMEEFVLGLQITEDTHKPESEDVFKDEEDIP 809 Query: 2427 KSFEKEMEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEPAVGKGLSGALQLLKERG 2606 K E E E E GGW EV +T E +NEEKED+ PD+IIHE ++GKGLSGAL+LLKERG Sbjct: 810 KPLELETEAEVGGWTEVMETDDTEAAVNEEKEDINPDEIIHETSMGKGLSGALKLLKERG 869 Query: 2607 TLKETINWGGRNMDKKKSKLVGIYENDGTKEIRIERTDEFGRIMTPKEAFRIISHKFHGK 2786 TL E+I+WGGRNMDKKKSKLVGI +N+G KEIRIERTDEFGRIMTPKEAFR++SHKFHGK Sbjct: 870 TLNESIDWGGRNMDKKKSKLVGINDNEGPKEIRIERTDEFGRIMTPKEAFRMLSHKFHGK 929 Query: 2787 GPGXXXXXXXXXXXXXXXXXXXXXASDTPSESMERMREAQARLKTPYLVLSGHVKPGQTS 2966 GPG ASDTP +ME+MREAQARLKTPYLVLSGHVKPGQTS Sbjct: 930 GPGKMKQEKRMKQYQEDLKTKQMKASDTPLLAMEKMREAQARLKTPYLVLSGHVKPGQTS 989 Query: 2967 DPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 3110 DPRSGFATVEKDH GSLTPMLGD+KVEHFLGI RK + SMGPP +K Sbjct: 990 DPRSGFATVEKDHLGSLTPMLGDKKVEHFLGINRKPDARSMGPPPPKK 1037 Score = 68.6 bits (166), Expect = 3e-08 Identities = 49/128 (38%), Positives = 69/128 (53%), Gaps = 
15/128 (11%) Frame = +3 Query: 132 MDMEWSESRYEHKQDSRCESSSPNAVYREEKMDDFED-----------ELSTVIDTKDKE 278 MDME E R E + ++R + S R+E+ D+ ED + +KDK Sbjct: 28 MDMELGEIRME-QDEARFGNGSLERAARDEETDELEDYGGDMETDGIGKEDNDSGSKDKG 86 Query: 279 KSRDSGKH*SKDRKKERRD---HGSKDRERAKIDLLKESEENQNELRENDHI-GSRERRK 446 KSR+SGKH SK+ KK RR+ HGS+D ER+K + + ++ + R+ D SRERRK Sbjct: 87 KSRESGKHKSKEGKKRRREEKVHGSRDGERSKEREKENRDLDRYDTRDKDRFESSRERRK 146 Query: 447 EEHKENTK 470 EE E K Sbjct: 147 EERLEFNK 154 >ref|XP_011011622.1| PREDICTED: SART-1 family protein DOT2 isoform X1 [Populus euphratica] Length = 860 Score = 743 bits (1917), Expect = 0.0 Identities = 406/717 (56%), Positives = 493/717 (68%), Gaps = 15/717 (2%) Frame = +3 Query: 1005 KNRDQGYDKEMLRSTNGERDEKLKLDYEDTRDITMQGKEVQYGEGDNPEKLSTMDYKQRT 1184 K R + D+ + + D+K+++DYED D DN K + ++ Sbjct: 158 KERSREKDRASRKGNEEDYDDKVQMDYEDEVD------------KDN-RKQGKVSFRDEG 204 Query: 1185 ESTADGSHPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIGEKLNADKEKALHL 1364 E +A+G+H S SELE RI+KMKEER KKKSE S++LAWV +SRKI E +A K +A HL Sbjct: 205 EQSAEGAHSSASELEQRILKMKEERTKKKSEAGSDILAWVGRSRKIEENKHAAKARAKHL 264 Query: 1365 SKVFEEQDNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVVLTLKDQNILADGD 1544 SK+FEEQDN+ QG SDDEE QH +LAG+K+L GLDKV+EGGAVVLTLKDQNILADGD Sbjct: 265 SKIFEEQDNIGQGGSDDEEADQHNAYNLAGIKVLDGLDKVLEGGAVVLTLKDQNILADGD 324 Query: 1545 INEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKILPQYDDPADEEGV 1724 INEEVDMLENVEIGEQ FN+DP+S+KK+LPQYDD +EG+ Sbjct: 325 INEEVDMLENVEIGEQKRRDEAYKAAKKKTGIYDDKFNDDPASEKKMLPQYDDANADEGI 384 Query: 1725 TLDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYTHEEMVQFXXXXXX 1904 TLD G F+GEA +Q TS S + +DL ++GKISSDY+THEEM++F Sbjct: 385 TLDERGRFTGEAEKKLEELRRRLQGTSTSARLEDLNSSGKISSDYFTHEEMLKFKKPKKK 444 Query: 1905 XXXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAKADMRTXXXXXXXX 2084 D+DALEAEA+SAGLG+GDLGSR + RRQ+ + E+ER+ A+MR Sbjct: 445 KSLRKKDKLDIDALEAEAVSAGLGIGDLGSRKDGRRQAIREEQERSAAEMRNNAYQSAYA 504 Query: 2085 XXXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALKRQDDAGASGMHVV 2264 LR QTL + EE+ENLVF 
D+EDLYKSLE+ARKLALK+Q +A ASG + Sbjct: 505 KADEASKSLRLDQTLQTKVEEEENLVFADDEEDLYKSLERARKLALKKQ-EAEASGPLAI 563 Query: 2265 ARLAET-------NNESEETQKPVSGGIVITEMEEFVSKIHLDEEINKPEADDVFEDE-E 2420 A LA T ++++ ET + +V TEMEEFVS I L EE++KP+ +DVF DE E Sbjct: 564 AHLASTTLSSQIADDKNPETGESHENKLVFTEMEEFVSAIQLAEEVHKPDNEDVFMDEDE 623 Query: 2421 VPKSFEKEMEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEPAVGKGLSGALQLLKE 2600 P+ ++E +DE GGWMEV D S DE P+NE+ E+++PD+ IHE AVGKGLSGAL+LLKE Sbjct: 624 PPRVSDEEQKDEAGGWMEVPDNSKDENPVNED-EEIVPDETIHEVAVGKGLSGALKLLKE 682 Query: 2601 RGTLKETINWGGRNMDKKKSKLVGIYEND-GT------KEIRIERTDEFGRIMTPKEAFR 2759 RGTLKE+I+WGGRNMDKKKSKLVGI ++D GT K+IRIERTDEFGRIMTPKEAFR Sbjct: 683 RGTLKESIDWGGRNMDKKKSKLVGIVDDDVGTNNDNKFKDIRIERTDEFGRIMTPKEAFR 742 Query: 2760 IISHKFHGKGPGXXXXXXXXXXXXXXXXXXXXXASDTPSESMERMREAQARLKTPYLVLS 2939 +ISHKFHGKGPG SDTPS S+ERMR AQA+LKTPYLVLS Sbjct: 743 MISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERMRGAQAQLKTPYLVLS 802 Query: 2940 GHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 3110 GHVKPGQTSDPRSGFATVEKD PG LTPMLGD+KVEHFLGIKRK E G PKK K Sbjct: 803 GHVKPGQTSDPRSGFATVEKDFPGGLTPMLGDKKVEHFLGIKRKPETGFSGAPKKPK 859 >ref|XP_012441144.1| PREDICTED: SART-1 family protein DOT2 [Gossypium raimondii] gi|823216924|ref|XP_012441145.1| PREDICTED: SART-1 family protein DOT2 [Gossypium raimondii] gi|763794483|gb|KJB61479.1| hypothetical protein B456_009G361400 [Gossypium raimondii] gi|763794484|gb|KJB61480.1| hypothetical protein B456_009G361400 [Gossypium raimondii] gi|763794485|gb|KJB61481.1| hypothetical protein B456_009G361400 [Gossypium raimondii] gi|763794488|gb|KJB61484.1| hypothetical protein B456_009G361400 [Gossypium raimondii] Length = 900 Score = 741 bits (1914), Expect = 0.0 Identities = 407/719 (56%), Positives = 485/719 (67%), Gaps = 15/719 (2%) Frame = +3 Query: 999 IGKNRDQGYDKEMLRSTNGERDEKLKLDYEDTRDITMQGKEVQYGEGDNPEKLSTMDYKQ 1178 +GKN ++ Y+ G +D +L LDYED RD E + G N + Sbjct: 211 VGKNHEEDYE--------GSKDGELALDYEDRRD----KDEAELNAGSNASLVQA----- 253 Query: 1179 
RTESTADGSHPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIGEKLNADKEKAL 1358 S+SELE RIV+MKE+RLKKKSEG+SEV AWV++SRK+ +K NA+KEKAL Sbjct: 254 -----------SSSELEERIVRMKEDRLKKKSEGLSEVSAWVSRSRKLEDKRNAEKEKAL 302 Query: 1359 HLSKVFEEQDNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVVLTLKDQNILAD 1538 LSK+FEEQDN QGE +DEE PT DL GVK+LHGLDKV++GGAVVLTLKDQ+ILAD Sbjct: 303 QLSKIFEEQDNFVQGEDEDEEADNRPTHDLGGVKVLHGLDKVMDGGAVVLTLKDQSILAD 362 Query: 1539 GDINEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKILPQYDDPADEE 1718 GD+NE+VDMLEN+EIGEQ FNEDP S+KKILPQYDDP +E Sbjct: 363 GDLNEDVDMLENIEIGEQKQRDEAYKAAKKKTGVYDDKFNEDPGSEKKILPQYDDPVADE 422 Query: 1719 GVTLDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYTHEEMVQFXXXX 1898 GVTLD G F+GEA + +N+ +DL GKISSDYYT EEM++F Sbjct: 423 GVTLDERGRFTGEAEKKLEELRKRLLGVPTNNRVEDLNNVGKISSDYYTQEEMLRFKKPK 482 Query: 1899 XXXXXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAKADMRTXXXXXX 2078 D+DALEAEA+SAGLG GDLGSR + RRQ+ K EE R++A+ R Sbjct: 483 KKKALRKKEKLDIDALEAEAVSAGLGAGDLGSRKDSRRQAIKEEEARSEAEKRKNAYQAA 542 Query: 2079 XXXXXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALKRQDDAGASGMH 2258 LR QT T++PEEDEN VF D+EDLYKSLEKAR+LALK+Q++ SG Sbjct: 543 FAKADEASKSLRLEQTHTVKPEEDENQVFADDEEDLYKSLEKARRLALKKQEE--KSGPQ 600 Query: 2259 VVARLAETNNESEETQKPVSGG------IVITEMEEFVSKIHLDEEINKPEADDVFEDE- 2417 +A LA T+ ++ T S G +VITEMEEFV + LDEE +KP+++DVF DE Sbjct: 601 AIALLATTSASNQTTDDHTSTGEAQENKVVITEMEEFVWGLQLDEEAHKPDSEDVFMDED 660 Query: 2418 EVPKSFE---KEMEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEPAVGKGLSGALQ 2588 EVP + E K E+E GGW EV DTSADE P NE+ ++++PD+ IHE AVGKGLSGAL+ Sbjct: 661 EVPGASEQDRKNGENEVGGWTEVIDTSADEKPANEDNDEVVPDETIHEIAVGKGLSGALK 720 Query: 2589 LLKERGTLKETINWGGRNMDKKKSKLVGIYENDGT-----KEIRIERTDEFGRIMTPKEA 2753 LLK+RGTLKETI WGGRNMDKKKSKLVGI ++D K+IRIERTDEFGRI+TPKEA Sbjct: 721 LLKDRGTLKETIEWGGRNMDKKKSKLVGIVDDDHQTDNRFKDIRIERTDEFGRIVTPKEA 780 Query: 2754 FRIISHKFHGKGPGXXXXXXXXXXXXXXXXXXXXXASDTPSESMERMREAQARLKTPYLV 2933 FR++SHKFHGKGPG SDTPS S+ERMREAQA+LKTPYLV Sbjct: 781 
FRMLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERMREAQAQLKTPYLV 840 Query: 2934 LSGHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 3110 LSGHVKPGQTSDP SGFATVEKD PG LTPMLGDRKVEHFLGIKRKAE + G PKK K Sbjct: 841 LSGHVKPGQTSDPASGFATVEKDFPGGLTPMLGDRKVEHFLGIKRKAEAGNSGTPKKPK 899 >ref|XP_002297938.2| hypothetical protein POPTR_0001s11550g [Populus trichocarpa] gi|550347020|gb|EEE82743.2| hypothetical protein POPTR_0001s11550g [Populus trichocarpa] Length = 862 Score = 741 bits (1914), Expect = 0.0 Identities = 410/719 (57%), Positives = 496/719 (68%), Gaps = 17/719 (2%) Frame = +3 Query: 1005 KNRDQGYDKEMLRSTNGERDEKLKLDYEDT--RDITMQGKEVQYGEGDNPEKLSTMDYKQ 1178 K R + D+ +S + D+K+++DYED +D QGK V + + D+ Q Sbjct: 156 KERSREKDRASRKSNEEDYDDKVQMDYEDEVDKDNRKQGK-VSFRDEDD----------Q 204 Query: 1179 RTESTADGSHPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIGEKLNADKEKAL 1358 E + G+H S SEL RI+KMKEER KKKSE S++LAWV KSRKI E A K++A Sbjct: 205 SAEGASAGAHSSASELGQRILKMKEERTKKKSEPGSDILAWVGKSRKIEENKYAAKKRAK 264 Query: 1359 HLSKVFEEQDNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVVLTLKDQNILAD 1538 HLSK+FEEQDN+ QG SDDEE QH +LAG+K+L GLDKV+EGGAVVLTLKDQNILAD Sbjct: 265 HLSKIFEEQDNIGQGGSDDEEADQHNAYNLAGIKVLDGLDKVLEGGAVVLTLKDQNILAD 324 Query: 1539 GDINEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKILPQYDDPADEE 1718 GDINEEVDMLENVEIGEQ FN+DP+S+KK+LPQYDD +E Sbjct: 325 GDINEEVDMLENVEIGEQKRRDEAYKAAKKKTGIYEDKFNDDPASEKKMLPQYDDANADE 384 Query: 1719 GVTLDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYTHEEMVQFXXXX 1898 GVTLD G F+GEA +Q TS S + +DL ++GKISSDY+THEEM+QF Sbjct: 385 GVTLDERGRFTGEAEKKLEELRRRLQGTSTSARLEDLNSSGKISSDYFTHEEMLQFKKPK 444 Query: 1899 XXXXXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAKADMRTXXXXXX 2078 D+DALEAEA+SAGLG+GDLGSR + RRQ+ + E+ER++A+MR Sbjct: 445 KKKSLRKKDKLDIDALEAEAVSAGLGIGDLGSRKDGRRQAIREEQERSEAEMRNNAYQSA 504 Query: 2079 XXXXXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALKRQDDAGASGMH 2258 LR +TL + EE+ENLVF D+EDLYKSLE+ARKLALK+Q +A ASG Sbjct: 505 YAKADEASKSLRLDRTLQTKVEEEENLVFADDEEDLYKSLERARKLALKKQ-EAEASGPL 563 Query: 
2259 VVARLAET-------NNESEETQKPVSGGIVITEMEEFVSKIHLDEEINKPEADDVFEDE 2417 +A LA T ++++ ET + +V TEMEEFVS I L EE++KP+ +DVF DE Sbjct: 564 AIAHLASTTLSSQIADDKNPETGESHENKLVFTEMEEFVSAIQLAEEVHKPDNEDVFMDE 623 Query: 2418 -EVPKSFEKEMEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEPAVGKGLSGALQLL 2594 E P+ ++E +DE GGWMEV D S DE P+NE+ E+++PD+ IHE AVGKGLSGAL+LL Sbjct: 624 DEPPRVSDEEQKDEAGGWMEVPDNSKDENPVNED-EEIVPDETIHEVAVGKGLSGALKLL 682 Query: 2595 KERGTLKETINWGGRNMDKKKSKLVGIYEND-GT------KEIRIERTDEFGRIMTPKEA 2753 KERGTLKE+I+WGGRNMDKKKSKLVGI ++D GT K+IRIERTDEFGRIMTPKEA Sbjct: 683 KERGTLKESIDWGGRNMDKKKSKLVGIVDDDVGTNNDNKFKDIRIERTDEFGRIMTPKEA 742 Query: 2754 FRIISHKFHGKGPGXXXXXXXXXXXXXXXXXXXXXASDTPSESMERMREAQARLKTPYLV 2933 FR+ISHKFHGKGPG SDTPS S+ERMR AQA+LKTPYLV Sbjct: 743 FRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERMRGAQAQLKTPYLV 802 Query: 2934 LSGHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 3110 LSGHVKPGQTSDPRSGFATVEKD PG LTPMLGD+KVEHFLGIKRK E G PKK K Sbjct: 803 LSGHVKPGQTSDPRSGFATVEKDFPGGLTPMLGDKKVEHFLGIKRKPETGFSGAPKKPK 861 >ref|XP_011011623.1| PREDICTED: SART-1 family protein DOT2 isoform X2 [Populus euphratica] Length = 859 Score = 736 bits (1901), Expect = 0.0 Identities = 405/717 (56%), Positives = 492/717 (68%), Gaps = 15/717 (2%) Frame = +3 Query: 1005 KNRDQGYDKEMLRSTNGERDEKLKLDYEDTRDITMQGKEVQYGEGDNPEKLSTMDYKQRT 1184 K R + D+ + + D+K+++DYED D DN K + ++ Sbjct: 158 KERSREKDRASRKGNEEDYDDKVQMDYEDEVD------------KDN-RKQGKVSFRDEG 204 Query: 1185 ESTADGSHPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIGEKLNADKEKALHL 1364 E +A+G+H S SELE RI+KMKEER KKKSE S++LAWV +SRKI E +A K +A HL Sbjct: 205 EQSAEGAHSSASELEQRILKMKEERTKKKSEAGSDILAWVGRSRKIEENKHAAKARAKHL 264 Query: 1365 SKVFEEQDNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVVLTLKDQNILADGD 1544 SK+FEEQDN+ QG SDDEE QH +LAG+K+L GLDKV+EGGAVVLTLKDQNILADGD Sbjct: 265 SKIFEEQDNIGQGGSDDEEADQHNAYNLAGIKVLDGLDKVLEGGAVVLTLKDQNILADGD 324 Query: 1545 INEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKILPQYDDPADEEGV 1724 INEEVDMLENVEIGEQ FN+DP+S+KK+LPQYDD +EG+ Sbjct: 325 
INEEVDMLENVEIGEQKRRDEAYKAAKKKTGIYDDKFNDDPASEKKMLPQYDDANADEGI 384 Query: 1725 TLDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYTHEEMVQFXXXXXX 1904 TLD G F+GEA +Q TS S + +DL ++GKISSDY+THEEM++F Sbjct: 385 TLDERGRFTGEAEKKLEELRRRLQGTSTSARLEDLNSSGKISSDYFTHEEMLKFKKPKKK 444 Query: 1905 XXXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAKADMRTXXXXXXXX 2084 D+DALEAEA+SAGLG+GDLGSR + RRQ+ + E+ER+ A+MR Sbjct: 445 KSLRKKDKLDIDALEAEAVSAGLGIGDLGSRKDGRRQAIREEQERSAAEMRNNAYQSAYA 504 Query: 2085 XXXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALKRQDDAGASGMHVV 2264 LR QTL + EE+ENLVF D+EDLYKSLE+ARKLALK+Q +A ASG + Sbjct: 505 KADEASKSLRLDQTLQTKVEEEENLVFADDEEDLYKSLERARKLALKKQ-EAEASGPLAI 563 Query: 2265 ARLAET-------NNESEETQKPVSGGIVITEMEEFVSKIHLDEEINKPEADDVFEDE-E 2420 A LA T ++++ ET + +V TEMEEFVS I L E++KP+ +DVF DE E Sbjct: 564 AHLASTTLSSQIADDKNPETGESHENKLVFTEMEEFVSAIQL-AEVHKPDNEDVFMDEDE 622 Query: 2421 VPKSFEKEMEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEPAVGKGLSGALQLLKE 2600 P+ ++E +DE GGWMEV D S DE P+NE+ E+++PD+ IHE AVGKGLSGAL+LLKE Sbjct: 623 PPRVSDEEQKDEAGGWMEVPDNSKDENPVNED-EEIVPDETIHEVAVGKGLSGALKLLKE 681 Query: 2601 RGTLKETINWGGRNMDKKKSKLVGIYEND-GT------KEIRIERTDEFGRIMTPKEAFR 2759 RGTLKE+I+WGGRNMDKKKSKLVGI ++D GT K+IRIERTDEFGRIMTPKEAFR Sbjct: 682 RGTLKESIDWGGRNMDKKKSKLVGIVDDDVGTNNDNKFKDIRIERTDEFGRIMTPKEAFR 741 Query: 2760 IISHKFHGKGPGXXXXXXXXXXXXXXXXXXXXXASDTPSESMERMREAQARLKTPYLVLS 2939 +ISHKFHGKGPG SDTPS S+ERMR AQA+LKTPYLVLS Sbjct: 742 MISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERMRGAQAQLKTPYLVLS 801 Query: 2940 GHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 3110 GHVKPGQTSDPRSGFATVEKD PG LTPMLGD+KVEHFLGIKRK E G PKK K Sbjct: 802 GHVKPGQTSDPRSGFATVEKDFPGGLTPMLGDKKVEHFLGIKRKPETGFSGAPKKPK 858 >ref|XP_010926911.1| PREDICTED: SART-1 family protein DOT2 [Elaeis guineensis] Length = 1017 Score = 735 bits (1898), Expect = 0.0 Identities = 397/695 (57%), Positives = 479/695 (68%), Gaps = 5/695 (0%) Frame = +3 Query: 1041 RSTNGERDEKLKLDYEDTRDITMQGKEVQYGEGDNPEKLSTMDYKQRTESTADGSHPSTS 1220 R+ GE+DEK+K D ++R I 
+G+E+Q EGD T + S STS Sbjct: 334 RAREGEKDEKVKADGGNSR-IARKGEEIQDNEGD------------LTHNEKSISSTSTS 380 Query: 1221 ELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIGEKLNADKEKALHLSKVFEEQDNVDQ 1400 ELE R+ KMKEERLK+K +G SE+ +WVNKSRK+ EK NA+KEKAL LSK EEQDN+ Sbjct: 381 ELEERVTKMKEERLKRKPDGASEISSWVNKSRKLEEKRNAEKEKALRLSKALEEQDNI-L 439 Query: 1401 GESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVVLTLKDQNILADGDINEEVDMLENVE 1580 ES+DEE H DLAGVKILHGLDKV+EGGAVVLTLKDQ+ILADGDINE+ DMLENVE Sbjct: 440 AESEDEEATGHSGNDLAGVKILHGLDKVMEGGAVVLTLKDQSILADGDINEDADMLENVE 499 Query: 1581 IGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKILPQYDDPADEEGVTLDVSGHFSGEA 1760 IGEQ F++D S+K ILPQYD+ ++EGVTLD SG F+GEA Sbjct: 500 IGEQKQRDEAYRAAKKRTGLYDDKFSDDMGSRKPILPQYDNEIEDEGVTLDESGRFTGEA 559 Query: 1761 XXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYTHEEMVQFXXXXXXXXXXXXXXXDLD 1940 I+ + ++DLT++GK SSDYYT +EM+QF DLD Sbjct: 560 EKKLEELRKRIEGGIIKQNYEDLTSSGKSSSDYYTPDEMLQFKKPKKKKSLRKKEKLDLD 619 Query: 1941 ALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAKADMRTXXXXXXXXXXXXXXXXLRQG 2120 ALEAEAISAGLG GDLGSRN+ RRQ+AK E+ +A A+MR+ LRQ Sbjct: 620 ALEAEAISAGLGAGDLGSRNDLRRQTAKEEQVKADAEMRSNAYQSAIAKAEEASKALRQE 679 Query: 2121 QTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALKRQDDAGASGMHVVARLAETNNESEE 2300 QTLT++ ED+NLVFG D EDL +S+ +ARKLALK+QD+ SG VA +A T E E+ Sbjct: 680 QTLTVKSVEDDNLVFGEDFEDLQRSIGQARKLALKKQDETPVSGPEAVALVATTKKEQED 739 Query: 2301 TQ----KPVSGGIVITEMEEFVSKIHLDEEINKPEADDVFEDEE-VPKSFEKEMEDETGG 2465 +P ++ITEMEEFV + E+ +KPE++DVF+DEE +PKS E E E E GG Sbjct: 740 ASPTEGEPQENKVIITEMEEFVLGLQFTEDTHKPESEDVFKDEEDIPKSLELETEAEVGG 799 Query: 2466 WMEVKDTSADELPINEEKEDLIPDKIIHEPAVGKGLSGALQLLKERGTLKETINWGGRNM 2645 W EV +T E ++EEKED+ PD+I HE A+GKGLSG L+LLK+RGTL E ++ GGRNM Sbjct: 800 WAEVMETDKTEAAVSEEKEDINPDEINHETAIGKGLSGVLKLLKDRGTLNEGVDLGGRNM 859 Query: 2646 DKKKSKLVGIYENDGTKEIRIERTDEFGRIMTPKEAFRIISHKFHGKGPGXXXXXXXXXX 2825 DKKKSKLVGIY+N+G KEIRIERTDEFGRIMTPKEAFR++SHKFHGKGPG Sbjct: 860 DKKKSKLVGIYDNEGQKEIRIERTDEFGRIMTPKEAFRMLSHKFHGKGPGKMKQEKRMKQ 919 Query: 2826 
XXXXXXXXXXXASDTPSESMERMREAQARLKTPYLVLSGHVKPGQTSDPRSGFATVEKDH 3005 ASDTP +ME+MREAQARLKTPYLVLSGHVKPGQTSDPRSGFATVEKDH Sbjct: 920 YQEDLKTKQMKASDTPLLAMEKMREAQARLKTPYLVLSGHVKPGQTSDPRSGFATVEKDH 979 Query: 3006 PGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 3110 GSLTPMLGD+KVEHFLGI R+ + SMGPP +K Sbjct: 980 LGSLTPMLGDKKVEHFLGINRRPDAGSMGPPPPKK 1014 Score = 72.4 bits (176), Expect = 2e-09 Identities = 48/128 (37%), Positives = 74/128 (57%), Gaps = 15/128 (11%) Frame = +3 Query: 132 MDMEWSESRYEHKQDSRCESSSPNAVYREEKMDDFEDELSTV-ID----------TKDKE 278 MDME E R E + ++R S R+E+MD+ ED + ID ++DK Sbjct: 1 MDMELGEIRME-QDEARFGKGSLERAARDEEMDELEDYGGDMGIDGIGKEDNDNGSRDKG 59 Query: 279 KSRDSGKH*SKD---RKKERRDHGSKDRERAKIDLLKESEENQNELRENDHI-GSRERRK 446 KSR+SGKH SK+ R++E ++HG++D ER+K+ +E + ++ + ++ D SRERRK Sbjct: 60 KSRESGKHRSKEGRKRRREEKEHGNRDGERSKVKEKEERDSDRYDAKDKDRFENSRERRK 119 Query: 447 EEHKENTK 470 EE E K Sbjct: 120 EERLEFNK 127 >ref|XP_011094061.1| PREDICTED: SART-1 family protein DOT2 [Sesamum indicum] Length = 942 Score = 734 bits (1894), Expect = 0.0 Identities = 389/710 (54%), Positives = 485/710 (68%), Gaps = 8/710 (1%) Frame = +3 Query: 1005 KNRDQGYDKEMLRSTNGERDEKLKLDYEDTRDITMQGKEVQYGEGDNPEKLSTMDYKQRT 1184 K +D+ +D RS + ++D +L+ + +RD + + +N K+ + ++++ Sbjct: 238 KQKDESHD----RSKDTDKDGHSRLENDYSRDKQSTKELADNSDDENDSKI--LKHQEKA 291 Query: 1185 ESTADGSHPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIGEKLNADKEKALHL 1364 ++ GS S SELE RI KM+EERLKK SEG SEVLAWVN+SRK+ EK A+KEKAL L Sbjct: 292 DTAIAGSRQSASELEDRISKMREERLKKPSEGASEVLAWVNRSRKLEEKRTAEKEKALQL 351 Query: 1365 SKVFEEQDNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVVLTLKDQNILADGD 1544 SK+FEEQDN++ GESD+E +H T+DL GVKILHGLDKV+EGGAVVLTLKDQ+ILADGD Sbjct: 352 SKIFEEQDNMNGGESDEEAAAEHTTQDLGGVKILHGLDKVLEGGAVVLTLKDQSILADGD 411 Query: 1545 INEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKILPQYDDPADEEGV 1724 INEEVDMLENVEIGEQ F+++P ++KKILPQYDDP +EGV Sbjct: 412 INEEVDMLENVEIGEQKRRDEAYKAAKKKTGIYDDKFSDEPGAEKKILPQYDDPVADEGV 471 Query: 1725 
TLDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYTHEEMVQFXXXXXX 1904 TLD SG F+GEA IQ S S + +DL +T KI +DYYT +EM +F Sbjct: 472 TLDSSGRFTGEAERKLEELRRRIQGVSTSTRGEDLNSTAKILTDYYTQDEMTKFKKPKKK 531 Query: 1905 XXXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAKADMRTXXXXXXXX 2084 DLDALEAEA SAGLG GDLGSRN+ RRQ+ + E+E+ +A+MR Sbjct: 532 KSLRKKEKLDLDALEAEARSAGLGAGDLGSRNDGRRQNLREEQEKIEAEMRRNAYESAYA 591 Query: 2085 XXXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALKRQDDAGASGMHVV 2264 LRQ Q +Q EED+ VFG DD++L KSLE+ARK+ALK+QD+ S V+ Sbjct: 592 KADEASKALRQEQVPAMQTEEDDAPVFGDDDDELRKSLERARKIALKKQDEEEKSAPQVI 651 Query: 2265 ARLAETNNESEETQKPVSGGI-------VITEMEEFVSKIHLDEEINKPEADDVFEDEEV 2423 LA ++ T+ P SG + + TEMEEFV + LDEE PE++DVF +E+V Sbjct: 652 TLLATSSANDSTTENPNSGSVDQQENKVIFTEMEEFVWGLQLDEEEKNPESEDVFMEEDV 711 Query: 2424 -PKSFEKEMEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEPAVGKGLSGALQLLKE 2600 P + ++EM+DE GGW EVK+T DE P EEKE+++PD+ IHE AVGKGL+GAL+LLK+ Sbjct: 712 APSTSDQEMKDEAGGWAEVKETMKDETPAKEEKEEVVPDETIHESAVGKGLAGALKLLKD 771 Query: 2601 RGTLKETINWGGRNMDKKKSKLVGIYENDGTKEIRIERTDEFGRIMTPKEAFRIISHKFH 2780 RGTLKETI WGGRNMDKKKSKLVGIY+ND KEIRIERTDE+GRI+TPKEAFR++SHKFH Sbjct: 772 RGTLKETIEWGGRNMDKKKSKLVGIYDNDAAKEIRIERTDEYGRILTPKEAFRLLSHKFH 831 Query: 2781 GKGPGXXXXXXXXXXXXXXXXXXXXXASDTPSESMERMREAQARLKTPYLVLSGHVKPGQ 2960 GKGPG +DTPS S+ERMREAQA+L+TPYLVLSGHVKPGQ Sbjct: 832 GKGPGKMKQEKRMRQYQEELKVKQMKNADTPSLSVERMREAQAKLQTPYLVLSGHVKPGQ 891 Query: 2961 TSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 3110 +SDPR+ FATVEKD G LTPMLGD+KVEHFL IKRK EP KK K Sbjct: 892 SSDPRNTFATVEKDFAGGLTPMLGDKKVEHFLNIKRKPEPGDTASQKKPK 941 >ref|XP_002516516.1| conserved hypothetical protein [Ricinus communis] gi|223544336|gb|EEF45857.1| conserved hypothetical protein [Ricinus communis] Length = 873 Score = 728 bits (1879), Expect = 0.0 Identities = 395/716 (55%), Positives = 489/716 (68%), Gaps = 14/716 (1%) Frame = +3 Query: 1005 KNRDQGYDKEMLRSTNGERDEKLKLDYEDTRDITMQGKEVQYGEGDNPEKLS---TMDYK 1175 K +++ +DK+ LR +R + + D I M + + + +K+S D + Sbjct: 159 
KEKEEFHDKDRLRDGVSKRSHEEENDRSKNDTIEMGYERERNSDVGKQKKVSFDDDNDDE 218 Query: 1176 QRTESTADGSHPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIGEKLNADKEKA 1355 Q+ E T+ G S+ E E RI+K++EERLKK S+ SEVL+WVN+SRK+ EK NA+K+KA Sbjct: 219 QKVERTSGGGLASSLEFEERILKVREERLKKNSDAGSEVLSWVNRSRKLAEKKNAEKKKA 278 Query: 1356 LHLSKVFEEQDNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVVLTLKDQNILA 1535 LSKVFEEQD + QGES+DEE + T DLAGVK+LHGL+KV+EGGAVVLTLKDQ+IL Sbjct: 279 KQLSKVFEEQDKIVQGESEDEEAGELATNDLAGVKVLHGLEKVMEGGAVVLTLKDQSILV 338 Query: 1536 DGDINEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKILPQYDDPADE 1715 DGDINEEVDMLEN+EIGEQ FN+DP+S++KILPQYDDP + Sbjct: 339 DGDINEEVDMLENIEIGEQKRRNEAYKAAKKKTGIYDDKFNDDPASERKILPQYDDPTTD 398 Query: 1716 EGVTLDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYTHEEMVQFXXX 1895 EGVTLD G F+GEA +Q N F+DL ++GK+SSD+YTHEEM+QF Sbjct: 399 EGVTLDERGRFTGEAEKKLEELRRRLQGALTDNCFEDLNSSGKMSSDFYTHEEMLQFKKP 458 Query: 1896 XXXXXXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAKADMRTXXXXX 2075 D+DALEAEA+SAGLGVGDLGSR++ RRQ+ + E+ER++A+ R+ Sbjct: 459 KKKKSLRKKEKLDIDALEAEAVSAGLGVGDLGSRSDGRRQAIREEQERSEAERRSSAYQS 518 Query: 2076 XXXXXXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALKRQDDAGASGM 2255 LR QTL + E+EN VF DDEDL+KSLE+ARKLALK+Q++ ASG Sbjct: 519 AYAKADEASKSLRLEQTLPAKVNEEENPVFADDDEDLFKSLERARKLALKKQEE--ASGP 576 Query: 2256 HVVARLA-ETNNESEETQKPVSG-----GIVITEMEEFVSKIHLDEEINKPEADDVFEDE 2417 +ARLA TNN+ + Q P G +V TEMEEFV + LDEE +KP ++DVF DE Sbjct: 577 QAIARLATATNNQIADDQNPADGESQENKVVFTEMEEFVWGLQLDEESHKPGSEDVFMDE 636 Query: 2418 E-VPKSFEKEMEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEPAVGKGLSGALQLL 2594 + P+ ++EM+DE G W EV D + D+ +NE KED++PD+ IHE AVGKGLSGAL+LL Sbjct: 637 DAAPRVSDQEMKDEAGRWTEVNDAAEDDNSVNENKEDVVPDETIHEVAVGKGLSGALKLL 696 Query: 2595 KERGTLKETINWGGRNMDKKKSKLVGIYENDGT----KEIRIERTDEFGRIMTPKEAFRI 2762 KERGTLKET++WGGRNMDKKKSKLVGI ++D KEIRIER DEFGRIMTPKEAFR+ Sbjct: 697 KERGTLKETVDWGGRNMDKKKSKLVGIVDSDADNEKFKEIRIERMDEFGRIMTPKEAFRM 756 Query: 2763 ISHKFHGKGPGXXXXXXXXXXXXXXXXXXXXXASDTPSESMERMREAQARLKTPYLVLSG 2942 ISHKFHGKGPG 
SDTPSES+ERMREAQ +LKTPYLVLSG Sbjct: 757 ISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSESVERMREAQKKLKTPYLVLSG 816 Query: 2943 HVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 3110 HVK GQ SDPRS FATVEKD PG LTPMLGD+KVEHFLGIKRKAE + P KK K Sbjct: 817 HVKSGQASDPRSSFATVEKDLPGGLTPMLGDKKVEHFLGIKRKAEHENSSPSKKPK 872 >ref|XP_010033990.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Eucalyptus grandis] gi|629087518|gb|KCW53875.1| hypothetical protein EUGRSUZ_J03092 [Eucalyptus grandis] Length = 900 Score = 728 bits (1878), Expect = 0.0 Identities = 398/729 (54%), Positives = 493/729 (67%), Gaps = 27/729 (3%) Frame = +3 Query: 1005 KNRDQGYDKEMLRST-------NGERDEKLKLDYEDTRD---ITMQGKEVQYG------- 1133 K RD+G +KE R T N +RD + D + +RD + +G Y Sbjct: 173 KYRDKGREKEKDRVTDEAKEKSNRQRDREEDHDRDRSRDKERVIRKGDAHDYDRIKDNRV 232 Query: 1134 EGDNPEKLSTMDYKQRTESTADGSHPSTSELESRIVKMKEERLKKK--SEGVSEVLAWVN 1307 E D E+ + + Q +S DG+ STS L+ RI K KEERLK++ SEG SE+LAWVN Sbjct: 233 EFDIAEEKEDVGHGQNPDSALDGTRLSTSNLQDRISKAKEERLKRQPESEGASEILAWVN 292 Query: 1308 KSRKIGEKLNADKEKALHLSKVFEEQDNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVI 1487 +SRK+ +K NA+KEK + LSKVFEEQD++ GES+DE+E DLAGVK+LHGLDKV+ Sbjct: 293 RSRKLEQKRNAEKEKVMRLSKVFEEQDDIGHGESEDEQEVPRNAHDLAGVKVLHGLDKVV 352 Query: 1488 EGGAVVLTLKDQNILADGDINEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDP 1667 EGGAVVLTLKDQNILADGDINEEVDMLENVEIGEQ F++DP Sbjct: 353 EGGAVVLTLKDQNILADGDINEEVDMLENVEIGEQKHRDEAYKAAKKKSGIYDDKFSDDP 412 Query: 1668 SSQKKILPQYDDPADEEGVTLDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKI 1847 +S+KK+LPQYDDPA +EGVTLD SG + EA +Q S S+ ++DLT++ K Sbjct: 413 ASEKKMLPQYDDPAQDEGVTLDSSGRLTNEAEKKLEELRRRLQGVSSSSHYEDLTSSAKT 472 Query: 1848 SSDYYTHEEMVQFXXXXXXXXXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKA 2027 SSDYYT EE+++F DLDALEAEA+SAGLGVGDLGSR + RRQ+++ Sbjct: 473 SSDYYTQEELLRFRKPKKKKSLRKKEKLDLDALEAEAVSAGLGVGDLGSRKDGRRQASRE 532 Query: 2028 EEERAKADMRTXXXXXXXXXXXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKA 2207 E+E+ +A+MR LR QTL ++ E DEN+V DDEDLYKSLE+A Sbjct: 533 
EQEKIEAEMRKNAFQLAYAKAEEASRLLRVEQTLPVKTENDENMVIADDDEDLYKSLERA 592 Query: 2208 RKLALKRQDDAGASGMHVVARLAET-------NNESEETQKPVSGGIVITEMEEFVSKIH 2366 RKLALK+Q++ GASG +A A + N+S T + +V+TE+E FVS + Sbjct: 593 RKLALKKQEEKGASGPKAIALRASSIPSTHNAENQSVTTGESQESRVVMTEIEGFVSGLE 652 Query: 2367 LDEEINKPEADDVFEDE-EVPKSFEKEMEDETGGWMEVKDTSADELPINEEKEDLIPDKI 2543 +DE KP+ +DVF DE E P + + E++DE GGW E K+ DE +NE++E+++PD+ Sbjct: 653 VDEVSRKPDTEDVFMDEDEAPVTSDNEVKDEPGGWTEFKEFGNDEGSVNEDEEEVVPDET 712 Query: 2544 IHEPAVGKGLSGALQLLKERGTLKETINWGGRNMDKKKSKLVGIYENDGTKEIRIERTDE 2723 IHE AVGKGLSGAL+LLK+RGTLKET+ WGGRNMDKKKSKLVGI + G KEIRIERTDE Sbjct: 713 IHEAAVGKGLSGALKLLKDRGTLKETVEWGGRNMDKKKSKLVGIADG-GQKEIRIERTDE 771 Query: 2724 FGRIMTPKEAFRIISHKFHGKGPGXXXXXXXXXXXXXXXXXXXXXASDTPSESMERMREA 2903 FGRI+TPKEAFR++SHKFHGKGPG SDTPS S ERMREA Sbjct: 772 FGRILTPKEAFRLLSHKFHGKGPGKMKQEKRMKQYHEELKLKQMKNSDTPSSSAERMREA 831 Query: 2904 QARLKTPYLVLSGHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPS 3083 QA++KTPYLVLSGHVKPGQ SDPRSGFAT+EKD PGSLTPMLGDRKVEHFLGIKRK EPS Sbjct: 832 QAQMKTPYLVLSGHVKPGQNSDPRSGFATIEKD-PGSLTPMLGDRKVEHFLGIKRKPEPS 890 Query: 3084 SMGPPKKQK 3110 ++G KK K Sbjct: 891 NLGASKKPK 899 >ref|XP_007022025.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao] gi|590611175|ref|XP_007022026.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao] gi|508721653|gb|EOY13550.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao] gi|508721654|gb|EOY13551.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao] Length = 907 Score = 721 bits (1862), Expect = 0.0 Identities = 398/717 (55%), Positives = 477/717 (66%), Gaps = 15/717 (2%) Frame = +3 Query: 1005 KNRDQGYDKEMLRSTNGERDEKLKLDYEDTRDITMQGKEVQYGEGDNPEKLSTMDYKQRT 1184 ++RD K G +D +L LDY D+RD E + G N Sbjct: 211 RDRDNAIKKNHEEDYEGSKDGELALDYGDSRD----KDEAELNAGSN------------- 253 Query: 1185 ESTADGSHPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIGEKLNADKEKALHL 1364 A + S+SELE RI +MKEERLKKKSEGVSEVL WV RK+ EK NA+KEKAL Sbjct: 
254 ---AGVAQASSSELEERIARMKEERLKKKSEGVSEVLEWVGNFRKLEEKRNAEKEKALQR 310 Query: 1365 SKVFEEQDNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVVLTLKDQNILADGD 1544 SK+FEEQD+ QGE++DEE +H DLAGVK+LHGLDKV++GGAVVLTLKDQ+ILA+GD Sbjct: 311 SKIFEEQDDFVQGENEDEEAVRHAAHDLAGVKVLHGLDKVMDGGAVVLTLKDQSILANGD 370 Query: 1545 INEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKILPQYDDPADEEGV 1724 INE+VDMLENVEIGEQ FN++P S+KKILPQYD+P +EGV Sbjct: 371 INEDVDMLENVEIGEQRRRDEAYKAAKKKTGVYDDKFNDEPGSEKKILPQYDNPVADEGV 430 Query: 1725 TLDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYTHEEMVQFXXXXXX 1904 TLD G F+GEA +Q +N+ +DL GKI+SDYYT EEM++F Sbjct: 431 TLDERGRFTGEAEKKLQELRKRLQGVPTNNRVEDLNNAGKIASDYYTQEEMLKFKKPKKK 490 Query: 1905 XXXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAKADMRTXXXXXXXX 2084 D+DALEAEAIS+GLG GDLGSRN+ RRQ+ + EE R++A+ R Sbjct: 491 KALRKKEKLDIDALEAEAISSGLGAGDLGSRNDARRQAIREEEARSEAEKRNSAYQSAYA 550 Query: 2085 XXXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALKRQDDAGASGMHVV 2264 L QTL ++PEEDEN VF DD+DLYKS+E++RKLA K+Q+D SG + Sbjct: 551 KADEASKSLWLEQTLIVKPEEDENQVFADDDDDLYKSIERSRKLAFKKQEDE-KSGPQAI 609 Query: 2265 ARLAET-------NNESEETQKPVSGGIVITEMEEFVSKIHLDEEINKPEADDVFEDE-E 2420 A A T ++++ T + +VITEMEEFV + DEE +KP+++DVF DE E Sbjct: 610 ALRATTAAISQTADDQTTTTGEAQENKLVITEMEEFVWGLQHDEEAHKPDSEDVFMDEDE 669 Query: 2421 VPKSFE---KEMEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEPAVGKGLSGALQL 2591 VP E K E+E GGW EV D S DE P NE+K+D++PD+ IHE AVGKGLSGAL+L Sbjct: 670 VPGVSEHDGKSGENEVGGWTEVVDASTDENPSNEDKDDIVPDETIHEVAVGKGLSGALKL 729 Query: 2592 LKERGTLKETINWGGRNMDKKKSKLVGIY----ENDGTKEIRIERTDEFGRIMTPKEAFR 2759 LK+RGTLKE+I WGGRNMDKKKSKLVGI END K+IRIERTDEFGRI+TPKEAFR Sbjct: 730 LKDRGTLKESIEWGGRNMDKKKSKLVGIVDDDRENDRFKDIRIERTDEFGRIITPKEAFR 789 Query: 2760 IISHKFHGKGPGXXXXXXXXXXXXXXXXXXXXXASDTPSESMERMREAQARLKTPYLVLS 2939 ++SHKFHGKGPG SDTPS S+ERMREAQA+LKTPYLVLS Sbjct: 790 VLSHKFHGKGPGKMKQEKRQKQYQEELKLKQMKNSDTPSLSVERMREAQAQLKTPYLVLS 849 Query: 2940 GHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 3110 GHVKPGQTSDPRSGFATVEKD PG 
LTPMLGDRKVEHFLGIKRKAEP + PKK K Sbjct: 850 GHVKPGQTSDPRSGFATVEKDFPGGLTPMLGDRKVEHFLGIKRKAEPGNSSTPKKPK 906 >ref|XP_010102332.1| hypothetical protein L484_015280 [Morus notabilis] gi|587905102|gb|EXB93293.1| hypothetical protein L484_015280 [Morus notabilis] Length = 952 Score = 719 bits (1856), Expect = 0.0 Identities = 403/755 (53%), Positives = 499/755 (66%), Gaps = 53/755 (7%) Frame = +3 Query: 1005 KNRDQGYDKEMLRST--------------NGERDEKLKLDYEDTRDI-TMQGKEVQYGEG 1139 K R+ DKE R +G RD+K KLD ++ +D QG QY +G Sbjct: 210 KEREADQDKEKSRDRVSKKSVEEDYELGKDGGRDDKTKLDDDNKKDREAKQGNVSQYIDG 269 Query: 1140 DNPEKLSTMDYKQRTESTADGSHPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRK 1319 + Q T + +H +T+ELE RI+KMK+ER KKK+E V EVLAWVNKSRK Sbjct: 270 E-----------QITHDISHKAHLTTTELEKRILKMKQERSKKKTEDVPEVLAWVNKSRK 318 Query: 1320 IGEKLNADKEKALHLSKVFEEQDNVDQGESDDEEEP-QHPTKDLAGVKILHGLDKVIEGG 1496 + EK N +KEKAL LSK+FEEQDN+ Q +S+DEE QH +LAGVK+LHG+DKV+EGG Sbjct: 319 LEEKKNDEKEKALQLSKIFEEQDNIVQEDSEDEETTTQH--YNLAGVKVLHGIDKVMEGG 376 Query: 1497 AVVLTLKDQNILADGDINEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQ 1676 AVVLTLKDQNILADGDIN E+DMLENVEIGEQ FN+DP+S+ Sbjct: 377 AVVLTLKDQNILADGDINLEIDMLENVEIGEQKRRDEAYKAAKKKVGIYVDKFNDDPNSE 436 Query: 1677 KKILPQYDDPADEEGVTLDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSD 1856 +K+LPQYDDP+ + GVT+D G + EA +Q S +++F+DL+ GK+SSD Sbjct: 437 RKMLPQYDDPSTDVGVTIDERGRITSEAEKKLEELRRRLQGASTNSRFEDLSFPGKVSSD 496 Query: 1857 YYTHEEMVQFXXXXXXXXXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEE 2036 YYT EEM+QF D+DALEAEA+SAGLGVGDLGSRN+ +RQ + E++ Sbjct: 497 YYTSEEMMQFKKPKKKKSLRKKDKLDIDALEAEAVSAGLGVGDLGSRNDPKRQVIREEQD 556 Query: 2037 RAKADMRTXXXXXXXXXXXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKL 2216 RA+A+ R LR QTL ++ EE+ENLVF DDED +K++E+ARK+ Sbjct: 557 RAEAERRNNAYKTAFAKADEASKSLRLEQTLPVKLEEEENLVFADDDEDFHKAVERARKI 616 Query: 2217 ALKRQDDAGASGMHVVARLAET--NNESEETQKPVSGG----IVITEMEEFVSKIHLDEE 2378 A+K++D SG VA LA T N++ + Q P +V TEMEEFV + L+EE Sbjct: 617 AVKKEDKETPSGPEAVALLAATIANSQPADEQNPSGESQENKVVFTEMEEFVWGLQLEEE 676 Query: 
2379 INKPEADDVFEDE-EVPKSFEKEMEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEP 2555 KP+ +DVF DE E PK++ +E+++E GGW EVK+T+ DE P EE+E+++PD IIHE Sbjct: 677 AQKPDNEDVFMDEDEEPKAYNEEIKNEPGGWTEVKETNNDEHPSKEEEEEIVPDGIIHEV 736 Query: 2556 AVGKGLSGALQLLKERGTLKETINWGGRNMDKKKSKLVGIYEND-----------GT--- 2693 AVGKGLSGAL+LLKERGTLKE+I+WGGRNMDKKKSKLVGI ++D GT Sbjct: 737 AVGKGLSGALKLLKERGTLKESIDWGGRNMDKKKSKLVGIVDDDEPGQQVHPKKDGTRTS 796 Query: 2694 ----------------KEIRIERTDEFGRIMTPKEAFRIISHKFHGKGPGXXXXXXXXXX 2825 K+IRIERTDEFGRI+TPKEAFRIISHKFHGKGPG Sbjct: 797 SSSYSKETRASKVYEEKDIRIERTDEFGRILTPKEAFRIISHKFHGKGPGKMKQEKRMKQ 856 Query: 2826 XXXXXXXXXXXASDTPSESMERMREAQARLKTPYLVLSGHVKPGQTSDPRSGFATVEKDH 3005 +SDTPS+S+ERMREAQA+LKTPYLVLSGHVKPGQTSDPRSGFATVEKD Sbjct: 857 YQEELKLKQMKSSDTPSQSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDP 916 Query: 3006 PGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 3110 PG LTPMLGDRKVEHFLGIKRK EP++ G PKK K Sbjct: 917 PGGLTPMLGDRKVEHFLGIKRKPEPANSGRPKKPK 951 >ref|XP_012077380.1| PREDICTED: SART-1 family protein DOT2 isoform X2 [Jatropha curcas] Length = 636 Score = 716 bits (1848), Expect = 0.0 Identities = 385/636 (60%), Positives = 455/636 (71%), Gaps = 14/636 (2%) Frame = +3 Query: 1245 MKEERLKKKSEGVSEVLAWVNKSRKIGEKLNADKEKALHLSKVFEEQDNVDQGESDDEEE 1424 MKEERLKK SE EVLAWVN+SRK+ EK NA+K+KA LSK+FEEQDN QGES+DE+ Sbjct: 1 MKEERLKKNSEPGDEVLAWVNRSRKLEEKKNAEKQKAKQLSKIFEEQDNNVQGESEDEDS 60 Query: 1425 PQHPTKDLAGVKILHGLDKVIEGGAVVLTLKDQNILADGDINEEVDMLENVEIGEQXXXX 1604 +H T DLAGVK+LHGL+KV+EGGAVVLTLKDQ+ILADGDINEEVDMLENVEIGEQ Sbjct: 61 GEHTTHDLAGVKVLHGLEKVMEGGAVVLTLKDQSILADGDINEEVDMLENVEIGEQKRRD 120 Query: 1605 XXXXXXXXXXXXXXXXFNEDPSSQKKILPQYDDPADEEGVTLDVSGHFSGEAXXXXXXXX 1784 FN+DP+S+KKILPQYDD A +EGV LD G F+GEA Sbjct: 121 DAYKAAKKKTGIYDDKFNDDPASEKKILPQYDDSAADEGVALDERGRFTGEAEKKLEELR 180 Query: 1785 XXIQVTSVSNQFQDLTTTGKISSDYYTHEEMVQFXXXXXXXXXXXXXXXDLDALEAEAIS 1964 +Q S +N+F+DL+++GKISSDYYTHEE++QF D+DALEAEA+S Sbjct: 181 RRLQGVSTNNRFEDLSSSGKISSDYYTHEELLQFKKPKKKKSLRKKEKLDIDALEAEAVS 240 Query: 1965 
AGLGVGDLGSRNEERRQSAKAEEERAKADMRTXXXXXXXXXXXXXXXXLRQGQTLTIQPE 2144 AGLGVGDLGSRN RRQ+ + E+ER++A+MR+ LRQ QTL + + Sbjct: 241 AGLGVGDLGSRNNGRRQAIRQEQERSEAEMRSSAYQAAYDKADEASKSLRQEQTLHAKLD 300 Query: 2145 EDENLVFGGDDEDLYKSLEKARKLALKRQDDAGASGMHVVARLA----ETNNESEETQKP 2312 EDEN VF DDEDLYKSLE+ARKLALK+Q++ ASG +ARLA T++++ + Q P Sbjct: 301 EDENPVFAEDDEDLYKSLERARKLALKKQEEK-ASGPQAIARLAAATTTTSSQTTDDQNP 359 Query: 2313 VSG-----GIVITEMEEFVSKIHLDEEINKPEADDVFEDE-EVPKSFEKEMEDETGGWME 2474 +G IV TEMEEFV + LDEE +K DDVF DE E P ++E +DETGGW E Sbjct: 360 TTGESQENKIVFTEMEEFVWGLQLDEESHKHGNDDVFMDEDEAPIVSDQEKKDETGGWTE 419 Query: 2475 VKDTSADELPINEEKEDLIPDKIIHEPAVGKGLSGALQLLKERGTLKETINWGGRNMDKK 2654 V+D DE P+NE ED++PD+ IHE VGKGLS AL+LLKERGTLKE+ WGGRNMDKK Sbjct: 420 VQDIDKDENPVNENNEDIVPDETIHEVPVGKGLSAALKLLKERGTLKESTEWGGRNMDKK 479 Query: 2655 KSKLVGI----YENDGTKEIRIERTDEFGRIMTPKEAFRIISHKFHGKGPGXXXXXXXXX 2822 KSKLVGI +N+ K+IRI+RTDE+GR +TPKEAFRIISHKFHGKGPG Sbjct: 480 KSKLVGIVDSDVDNERFKDIRIDRTDEYGRTLTPKEAFRIISHKFHGKGPGKMKQEKRMK 539 Query: 2823 XXXXXXXXXXXXASDTPSESMERMREAQARLKTPYLVLSGHVKPGQTSDPRSGFATVEKD 3002 SDTPS S+ERMREAQA+LKTPYLVLSGHVKPGQTSDPRSGFATVEKD Sbjct: 540 QYLEELKMKQMKNSDTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKD 599 Query: 3003 HPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPKKQK 3110 PG LTPMLGD+KVEHFLGIKRKAEP + PKK K Sbjct: 600 LPGGLTPMLGDKKVEHFLGIKRKAEPGNSNAPKKPK 635 >ref|XP_003530377.1| PREDICTED: zinc finger CCCH domain-containing protein 13-like [Glycine max] Length = 882 Score = 711 bits (1834), Expect = 0.0 Identities = 401/723 (55%), Positives = 492/723 (68%), Gaps = 21/723 (2%) Frame = +3 Query: 1005 KNRDQGYDKEMLRST----NGERDEKL-----KLDYEDTRDITMQGKEVQYGEGDNPEKL 1157 K R+ DKE R E D +L K+DY+D RD + GK+ EK Sbjct: 179 KERETDRDKERTRDRVSRKTHEEDYELDNVDDKVDYQDKRDEEI-GKQ---------EKD 228 Query: 1158 STMDYKQRTESTADGSHPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIGEKLN 1337 S +D + T+ +H S++ELE RI+KMKE R KK+ E SE+ AWVNKSRKI Sbjct: 229 SKLDNDNQDGQTS--AHLSSTELEDRILKMKESRTKKQPEADSEISAWVNKSRKI----- 281 Query: 1338 
ADKEKALHLSKVFEEQDNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVVLTLK 1517 +K++A LSK+FEEQDN+ SDDE+ QH T +LAGVK+LHGLDKV+EGG VVLT+K Sbjct: 282 -EKKRAFQLSKIFEEQDNIAVEGSDDEDTAQH-TDNLAGVKVLHGLDKVMEGGTVVLTIK 339 Query: 1518 DQNILADGDINEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKILPQY 1697 DQ ILADGD+NE+VDMLEN+EIGEQ F++DPS++KK+LPQY Sbjct: 340 DQPILADGDVNEDVDMLENIEIGEQKRRDEAYKAAKKKTGVYDDKFHDDPSTEKKMLPQY 399 Query: 1698 DDPADEEGVTLDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYTHEEM 1877 DDPA EEG+TLD G FSGEA + S +N F+DLT++GK+SSDYYTHEEM Sbjct: 400 DDPAAEEGLTLDGKGRFSGEAEKKLEELRRRLTGVS-TNTFEDLTSSGKVSSDYYTHEEM 458 Query: 1878 VQFXXXXXXXXXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAKADMR 2057 ++F D++ALEAEA+S+GLGVGDLGSR + RRQ+ K E+ER +A+MR Sbjct: 459 LKFKKPKKKKSLRKKDKLDINALEAEAVSSGLGVGDLGSRKDVRRQAIKDEQERLEAEMR 518 Query: 2058 TXXXXXXXXXXXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALKRQDD 2237 + LR QTL ++ EEDE VF DDEDL KSLEKAR+LALK+++ Sbjct: 519 SNAYQSAYAKADEASKLLRLEQTLNVKTEEDETPVFVDDDEDLRKSLEKARRLALKKKEG 578 Query: 2238 AGASGMHVVARLAETNNESE-ETQKPVSGG-----IVITEMEEFVSKIHLDEEINKPEAD 2399 GASG +A LA +N+ +E + Q P +G +V TEMEEFV +H+DEE KPE++ Sbjct: 579 EGASGPQAIALLATSNHNNETDDQNPTAGESRENKVVFTEMEEFVWGLHIDEEARKPESE 638 Query: 2400 DVF-EDEEVPKSFEKEMEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEPAVGKGLS 2576 DVF D+E ++E +E GGW EV++TS DE E+KE++IPD+ IHE AVGKGLS Sbjct: 639 DVFMHDDEEANVPDEEKINEVGGWTEVQETSEDEQRNTEDKEEIIPDETIHEVAVGKGLS 698 Query: 2577 GALQLLKERGTLKETINWGGRNMDKKKSKLVGIYENDG-----TKEIRIERTDEFGRIMT 2741 GAL+LLKERGTLKE+I WGGRNMDKKKSKLVGI +++ T+EIRIERTDEFGRI+T Sbjct: 699 GALKLLKERGTLKESIEWGGRNMDKKKSKLVGIVDDEEKEAQKTREIRIERTDEFGRILT 758 Query: 2742 PKEAFRIISHKFHGKGPGXXXXXXXXXXXXXXXXXXXXXASDTPSESMERMREAQARLKT 2921 PKEAFR+ISHKFHGKGPG +SDTPS S+ERMREAQARL+T Sbjct: 759 PKEAFRMISHKFHGKGPGKMKQEKRMKQYYEELKMKQMKSSDTPSLSVERMREAQARLQT 818 Query: 2922 PYLVLSGHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPK 3101 PYLVLSGHVKPGQTSDP+SGFATVEKD PG LTPMLGDRKVEHFLGIKRKAEPSS PK Sbjct: 819 
PYLVLSGHVKPGQTSDPKSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAEPSSSDTPK 878 Query: 3102 KQK 3110 K K Sbjct: 879 KPK 881 >gb|KJB61483.1| hypothetical protein B456_009G361400 [Gossypium raimondii] Length = 878 Score = 708 bits (1828), Expect = 0.0 Identities = 390/696 (56%), Positives = 467/696 (67%), Gaps = 15/696 (2%) Frame = +3 Query: 999 IGKNRDQGYDKEMLRSTNGERDEKLKLDYEDTRDITMQGKEVQYGEGDNPEKLSTMDYKQ 1178 +GKN ++ Y+ G +D +L LDYED RD E + G N + Sbjct: 211 VGKNHEEDYE--------GSKDGELALDYEDRRD----KDEAELNAGSNASLVQA----- 253 Query: 1179 RTESTADGSHPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIGEKLNADKEKAL 1358 S+SELE RIV+MKE+RLKKKSEG+SEV AWV++SRK+ +K NA+KEKAL Sbjct: 254 -----------SSSELEERIVRMKEDRLKKKSEGLSEVSAWVSRSRKLEDKRNAEKEKAL 302 Query: 1359 HLSKVFEEQDNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVVLTLKDQNILAD 1538 LSK+FEEQDN QGE +DEE PT DL GVK+LHGLDKV++GGAVVLTLKDQ+ILAD Sbjct: 303 QLSKIFEEQDNFVQGEDEDEEADNRPTHDLGGVKVLHGLDKVMDGGAVVLTLKDQSILAD 362 Query: 1539 GDINEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKILPQYDDPADEE 1718 GD+NE+VDMLEN+EIGEQ FNEDP S+KKILPQYDDP +E Sbjct: 363 GDLNEDVDMLENIEIGEQKQRDEAYKAAKKKTGVYDDKFNEDPGSEKKILPQYDDPVADE 422 Query: 1719 GVTLDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYTHEEMVQFXXXX 1898 GVTLD G F+GEA + +N+ +DL GKISSDYYT EEM++F Sbjct: 423 GVTLDERGRFTGEAEKKLEELRKRLLGVPTNNRVEDLNNVGKISSDYYTQEEMLRFKKPK 482 Query: 1899 XXXXXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAKADMRTXXXXXX 2078 D+DALEAEA+SAGLG GDLGSR + RRQ+ K EE R++A+ R Sbjct: 483 KKKALRKKEKLDIDALEAEAVSAGLGAGDLGSRKDSRRQAIKEEEARSEAEKRKNAYQAA 542 Query: 2079 XXXXXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALKRQDDAGASGMH 2258 LR QT T++PEEDEN VF D+EDLYKSLEKAR+LALK+Q++ SG Sbjct: 543 FAKADEASKSLRLEQTHTVKPEEDENQVFADDEEDLYKSLEKARRLALKKQEE--KSGPQ 600 Query: 2259 VVARLAETNNESEETQKPVSGG------IVITEMEEFVSKIHLDEEINKPEADDVFEDE- 2417 +A LA T+ ++ T S G +VITEMEEFV + LDEE +KP+++DVF DE Sbjct: 601 AIALLATTSASNQTTDDHTSTGEAQENKVVITEMEEFVWGLQLDEEAHKPDSEDVFMDED 660 Query: 2418 EVPKSFE---KEMEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEPAVGKGLSGALQ 2588 EVP + E 
K E+E GGW EV DTSADE P NE+ ++++PD+ IHE AVGKGLSGAL+ Sbjct: 661 EVPGASEQDRKNGENEVGGWTEVIDTSADEKPANEDNDEVVPDETIHEIAVGKGLSGALK 720 Query: 2589 LLKERGTLKETINWGGRNMDKKKSKLVGIYENDGT-----KEIRIERTDEFGRIMTPKEA 2753 LLK+RGTLKETI WGGRNMDKKKSKLVGI ++D K+IRIERTDEFGRI+TPKEA Sbjct: 721 LLKDRGTLKETIEWGGRNMDKKKSKLVGIVDDDHQTDNRFKDIRIERTDEFGRIVTPKEA 780 Query: 2754 FRIISHKFHGKGPGXXXXXXXXXXXXXXXXXXXXXASDTPSESMERMREAQARLKTPYLV 2933 FR++SHKFHGKGPG SDTPS S+ERMREAQA+LKTPYLV Sbjct: 781 FRMLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERMREAQAQLKTPYLV 840 Query: 2934 LSGHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRK 3041 LSGHVKPGQTSDP SGFATVEKD PG LTPMLGDRK Sbjct: 841 LSGHVKPGQTSDPASGFATVEKDFPGGLTPMLGDRK 876 >gb|KHN38139.1| U4/U6.U5 tri-snRNP-associated protein 1 [Glycine soja] Length = 882 Score = 708 bits (1828), Expect = 0.0 Identities = 400/723 (55%), Positives = 491/723 (67%), Gaps = 21/723 (2%) Frame = +3 Query: 1005 KNRDQGYDKEMLRST----NGERDEKL-----KLDYEDTRDITMQGKEVQYGEGDNPEKL 1157 K R+ DKE R E D +L K+DY+D RD + GK+ EK Sbjct: 179 KERETDRDKERTRDRVSRKTHEEDYELDNVDDKVDYQDKRDEEI-GKQ---------EKD 228 Query: 1158 STMDYKQRTESTADGSHPSTSELESRIVKMKEERLKKKSEGVSEVLAWVNKSRKIGEKLN 1337 S +D + T+ +H S++ELE RI+KMKE R KK+ E SE+ AWVNKSRKI Sbjct: 229 SKLDNDNQDGQTS--AHLSSTELEDRILKMKESRTKKQPEADSEISAWVNKSRKI----- 281 Query: 1338 ADKEKALHLSKVFEEQDNVDQGESDDEEEPQHPTKDLAGVKILHGLDKVIEGGAVVLTLK 1517 +K++A LSK+FEEQDN+ SDDE+ QH T +LAGVK+LHGLDKV+ GG VVLT+K Sbjct: 282 -EKKRAFQLSKIFEEQDNIAVEGSDDEDTAQH-TDNLAGVKVLHGLDKVMAGGTVVLTIK 339 Query: 1518 DQNILADGDINEEVDMLENVEIGEQXXXXXXXXXXXXXXXXXXXXFNEDPSSQKKILPQY 1697 DQ ILADGD+NE+VDMLEN+EIGEQ F++DPS++KK+LPQY Sbjct: 340 DQPILADGDVNEDVDMLENIEIGEQKRRDEAYKAAKKKTGVYDDKFHDDPSTEKKMLPQY 399 Query: 1698 DDPADEEGVTLDVSGHFSGEAXXXXXXXXXXIQVTSVSNQFQDLTTTGKISSDYYTHEEM 1877 DDPA EEG+TLD G FSGEA + S +N F+DLT++GK+SSDYYTHEEM Sbjct: 400 DDPAAEEGLTLDGKGRFSGEAEKKLEELRRRLTGVS-TNTFEDLTSSGKVSSDYYTHEEM 458 Query: 1878 VQFXXXXXXXXXXXXXXXDLDALEAEAISAGLGVGDLGSRNEERRQSAKAEEERAKADMR 2057 ++F D++ALEAEA+S+GLGVGDLGSR + RRQ+ K 
E+ER +A+MR Sbjct: 459 LKFKKPKKKKSLRKKDKLDINALEAEAVSSGLGVGDLGSRKDVRRQAIKDEQERLEAEMR 518 Query: 2058 TXXXXXXXXXXXXXXXXLRQGQTLTIQPEEDENLVFGGDDEDLYKSLEKARKLALKRQDD 2237 + LR QTL ++ EEDE VF DDEDL KSLEKAR+LALK+++ Sbjct: 519 SNAYQSAYAKADEASKLLRLEQTLNVKTEEDETPVFVDDDEDLRKSLEKARRLALKKKEG 578 Query: 2238 AGASGMHVVARLAETNNESE-ETQKPVSGG-----IVITEMEEFVSKIHLDEEINKPEAD 2399 GASG +A LA +N+ +E + Q P +G +V TEMEEFV +H+DEE KPE++ Sbjct: 579 EGASGPQAIALLATSNHNNETDDQNPTAGESRENKVVFTEMEEFVWGLHIDEEARKPESE 638 Query: 2400 DVF-EDEEVPKSFEKEMEDETGGWMEVKDTSADELPINEEKEDLIPDKIIHEPAVGKGLS 2576 DVF D+E ++E +E GGW EV++TS DE E+KE++IPD+ IHE AVGKGLS Sbjct: 639 DVFMHDDEEANVPDEEKINEVGGWTEVQETSEDEQRNTEDKEEIIPDETIHEVAVGKGLS 698 Query: 2577 GALQLLKERGTLKETINWGGRNMDKKKSKLVGIYENDG-----TKEIRIERTDEFGRIMT 2741 GAL+LLKERGTLKE+I WGGRNMDKKKSKLVGI +++ T+EIRIERTDEFGRI+T Sbjct: 699 GALKLLKERGTLKESIEWGGRNMDKKKSKLVGIVDDEEKEAQKTREIRIERTDEFGRILT 758 Query: 2742 PKEAFRIISHKFHGKGPGXXXXXXXXXXXXXXXXXXXXXASDTPSESMERMREAQARLKT 2921 PKEAFR+ISHKFHGKGPG +SDTPS S+ERMREAQARL+T Sbjct: 759 PKEAFRMISHKFHGKGPGKMKQEKRMKQYYEELKMKQMKSSDTPSLSVERMREAQARLQT 818 Query: 2922 PYLVLSGHVKPGQTSDPRSGFATVEKDHPGSLTPMLGDRKVEHFLGIKRKAEPSSMGPPK 3101 PYLVLSGHVKPGQTSDP+SGFATVEKD PG LTPMLGDRKVEHFLGIKRKAEPSS PK Sbjct: 819 PYLVLSGHVKPGQTSDPKSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAEPSSSDTPK 878 Query: 3102 KQK 3110 K K Sbjct: 879 KPK 881