BLASTX nr result
ID: Anemarrhena21_contig00000261
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Anemarrhena21_contig00000261 (2960 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

ref|XP_008806835.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   805   0.0
ref|XP_008806833.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   805   0.0
ref|XP_010926911.1| PREDICTED: SART-1 family protein DOT2 [Elaei...   785   0.0
ref|XP_009405353.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   746   0.0
ref|XP_010256356.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   716   0.0
ref|XP_010656678.1| PREDICTED: SART-1 family protein DOT2 [Vitis...   698   0.0
gb|KJB61483.1| hypothetical protein B456_009G361400 [Gossypium r...   672   0.0
ref|XP_012441144.1| PREDICTED: SART-1 family protein DOT2 [Gossy...   672   0.0
ref|XP_012077379.1| PREDICTED: SART-1 family protein DOT2 isofor...   671   0.0
ref|XP_011094061.1| PREDICTED: SART-1 family protein DOT2 [Sesam...   667   0.0
ref|XP_007022029.1| U4/U6.U5 tri-snRNP-associated protein 1 isof...   664   0.0
ref|XP_007022027.1| U4/U6.U5 tri-snRNP-associated protein 1 isof...   664   0.0
ref|XP_007022025.1| U4/U6.U5 tri-snRNP-associated protein 1 isof...   664   0.0
ref|XP_002516516.1| conserved hypothetical protein [Ricinus comm...   662   0.0
ref|XP_006836392.1| PREDICTED: SART-1 family protein DOT2 [Ambor...   662   0.0
gb|KHG25959.1| U4/U6.U5 tri-snRNP-associated 1 [Gossypium arboreum]   661   0.0
ref|XP_007225495.1| hypothetical protein PRUPE_ppa000914mg [Prun...   658   0.0
ref|XP_008390895.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   648   0.0
ref|XP_002297938.2| hypothetical protein POPTR_0001s11550g [Popu...
646 0.0 ref|XP_010102332.1| hypothetical protein L484_015280 [Morus nota... 645 0.0 >ref|XP_008806835.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 isoform X2 [Phoenix dactylifera] Length = 1013 Score = 805 bits (2080), Expect = 0.0 Identities = 440/760 (57%), Positives = 535/760 (70%), Gaps = 5/760 (0%) Frame = -1 Query: 2267 KERDLSREYEHGQDQESMSEHSXXXXXXXXXGQDKNKGHDKS----RVREKDYEKQLERE 2100 +ERDL REY+ G+++E + +H+ +D++K H++S R REKD EK+L+RE Sbjct: 207 RERDLMREYDRGKEREKVHDHA----------RDRDKDHERSKFRERDREKDVEKELDRE 256 Query: 2099 KVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKENDREKNRLERAKDKEKS 1920 + KG EKE ++EK +LE+ + KEKS Sbjct: 257 RGKEKDRERGKDRDREREKDRDRLKEKEREKEKIKG---REKEKEKEKGKLEKDRAKEKS 313 Query: 1919 REESVRNRETDLDKSRTKDKERDAGRAKGGEENERTKVGGGGIDIVRXXXXXXXXXXXNH 1740 RE +E D+ + KER+ GRA+ GE++E+ K GG I R Sbjct: 314 RE-----KEIDI-----RGKEREIGRAREGEKDEKVKGDGGDSRIARKGQEVQDDEGDL- 362 Query: 1739 ESFSLNDHDERPVGSQSTSELGECXXXXXXXXXXXRADDALEISSWVNKSRRLEEKRNTE 1560 H+E+P+ S STS+L E ++D A EISSWVNKSR+LEEK E Sbjct: 363 ------THNEKPLSSISTSKLEERVVKMKEERLKRKSDGASEISSWVNKSRKLEEKWTAE 416 Query: 1559 SEKAERTSRILDEQDNVGEESDDETA-GHSANDLAGIKILHGLEKVIEGGNVVLTLKDQS 1383 EKA R S+ L+EQDN+ ES+DE A GHS NDLAG KILHGL+KV+EGG VVLTLKDQS Sbjct: 417 KEKALRLSKALEEQDNILAESEDEEATGHSGNDLAGAKILHGLDKVMEGGAVVLTLKDQS 476 Query: 1382 ILTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSSHKTILPQYDDP 1203 IL DGDINEE DMLENVEIGEQ++RD+AY+AAKK TGLY+DKF+DD+ S KTILPQYD+ Sbjct: 477 ILADGDINEEADMLENVEIGEQKQRDEAYRAAKKRTGLYDDKFSDDIGSQKTILPQYDNQ 536 Query: 1202 VEDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTSDYFTPDEMLQF 1023 EDEGVTLDE+GRFTGEA RI+GG+ ++ EDL S+ K +SDY+TPDEMLQF Sbjct: 537 NEDEGVTLDESGRFTGEAEKKLEELRKRIEGGAIKKSNEDLTSSGKISSDYYTPDEMLQF 596 Query: 1022 XXXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXXXXXXXEMRSNA 843 LDLDALEAEAIS+GLG GDLGSR D +R + E RS+A Sbjct: 597 KKPKKKKSLRKKEKLDLDALEAEAISAGLGAGDLGSRNDLRRQTAKEEQEKAEAEKRSHA 656 Query: 842 YQSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARKLARKRKEEAAP 663 
YQSA+AKAEEASK LRQEQ TVK+VEDD +VFGEDYED+ S+ QARKLA K+++E A Sbjct: 657 YQSAIAKAEEASKALRQEQTSTVKSVEDDNLVFGEDYEDVHRSIGQARKLALKKQDETAV 716 Query: 662 SGPQAVALLATTNKEQEDTQGSTVGEPQENKVVITEMEEFVLGLQLNEETHKPESEDVFK 483 SGP+AVAL+ATT KEQED + GEPQENKV+ITEMEEFVLGLQ+ E+THKPESEDVFK Sbjct: 717 SGPEAVALVATTKKEQEDASPTEGGEPQENKVIITEMEEFVLGLQITEDTHKPESEDVFK 776 Query: 482 DEDDIPKPVEHEMDSQIGGWTEIKETNKNQQPPSEENDDVAPDEIIHEVAVXXXXXXXXX 303 DE+DIPKP+E E ++++GGWTE+ ET+ + +EE +D+ PDEIIHE ++ Sbjct: 777 DEEDIPKPLELETEAEVGGWTEVMETDDTEAAVNEEKEDINPDEIIHETSMGKGLSGALK 836 Query: 302 XXKDRGTLKESIDWGGRNMDKKKSKLVGINDNDGPKEIRIERTDEFGRIMTPKEAFRVIS 123 K+RGTL ESIDWGGRNMDKKKSKLVGINDN+GPKEIRIERTDEFGRIMTPKEAFR++S Sbjct: 837 LLKERGTLNESIDWGGRNMDKKKSKLVGINDNEGPKEIRIERTDEFGRIMTPKEAFRMLS 896 Query: 122 HKFHGKGPGKMKQEKRMKQFQEDLKTKQMKASDTPLLAME 3 HKFHGKGPGKMKQEKRMKQ+QEDLKTKQMKASDTPLLAME Sbjct: 897 HKFHGKGPGKMKQEKRMKQYQEDLKTKQMKASDTPLLAME 936 >ref|XP_008806833.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 isoform X1 [Phoenix dactylifera] Length = 1040 Score = 805 bits (2080), Expect = 0.0 Identities = 440/760 (57%), Positives = 535/760 (70%), Gaps = 5/760 (0%) Frame = -1 Query: 2267 KERDLSREYEHGQDQESMSEHSXXXXXXXXXGQDKNKGHDKS----RVREKDYEKQLERE 2100 +ERDL REY+ G+++E + +H+ +D++K H++S R REKD EK+L+RE Sbjct: 234 RERDLMREYDRGKEREKVHDHA----------RDRDKDHERSKFRERDREKDVEKELDRE 283 Query: 2099 KVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKENDREKNRLERAKDKEKS 1920 + KG EKE ++EK +LE+ + KEKS Sbjct: 284 RGKEKDRERGKDRDREREKDRDRLKEKEREKEKIKG---REKEKEKEKGKLEKDRAKEKS 340 Query: 1919 REESVRNRETDLDKSRTKDKERDAGRAKGGEENERTKVGGGGIDIVRXXXXXXXXXXXNH 1740 RE +E D+ + KER+ GRA+ GE++E+ K GG I R Sbjct: 341 RE-----KEIDI-----RGKEREIGRAREGEKDEKVKGDGGDSRIARKGQEVQDDEGDL- 389 Query: 1739 ESFSLNDHDERPVGSQSTSELGECXXXXXXXXXXXRADDALEISSWVNKSRRLEEKRNTE 1560 H+E+P+ S STS+L E ++D A EISSWVNKSR+LEEK E Sbjct: 390 ------THNEKPLSSISTSKLEERVVKMKEERLKRKSDGASEISSWVNKSRKLEEKWTAE 443 Query: 1559 SEKAERTSRILDEQDNVGEESDDETA-GHSANDLAGIKILHGLEKVIEGGNVVLTLKDQS 1383 EKA 
R S+ L+EQDN+ ES+DE A GHS NDLAG KILHGL+KV+EGG VVLTLKDQS Sbjct: 444 KEKALRLSKALEEQDNILAESEDEEATGHSGNDLAGAKILHGLDKVMEGGAVVLTLKDQS 503 Query: 1382 ILTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSSHKTILPQYDDP 1203 IL DGDINEE DMLENVEIGEQ++RD+AY+AAKK TGLY+DKF+DD+ S KTILPQYD+ Sbjct: 504 ILADGDINEEADMLENVEIGEQKQRDEAYRAAKKRTGLYDDKFSDDIGSQKTILPQYDNQ 563 Query: 1202 VEDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTSDYFTPDEMLQF 1023 EDEGVTLDE+GRFTGEA RI+GG+ ++ EDL S+ K +SDY+TPDEMLQF Sbjct: 564 NEDEGVTLDESGRFTGEAEKKLEELRKRIEGGAIKKSNEDLTSSGKISSDYYTPDEMLQF 623 Query: 1022 XXXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXXXXXXXEMRSNA 843 LDLDALEAEAIS+GLG GDLGSR D +R + E RS+A Sbjct: 624 KKPKKKKSLRKKEKLDLDALEAEAISAGLGAGDLGSRNDLRRQTAKEEQEKAEAEKRSHA 683 Query: 842 YQSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARKLARKRKEEAAP 663 YQSA+AKAEEASK LRQEQ TVK+VEDD +VFGEDYED+ S+ QARKLA K+++E A Sbjct: 684 YQSAIAKAEEASKALRQEQTSTVKSVEDDNLVFGEDYEDVHRSIGQARKLALKKQDETAV 743 Query: 662 SGPQAVALLATTNKEQEDTQGSTVGEPQENKVVITEMEEFVLGLQLNEETHKPESEDVFK 483 SGP+AVAL+ATT KEQED + GEPQENKV+ITEMEEFVLGLQ+ E+THKPESEDVFK Sbjct: 744 SGPEAVALVATTKKEQEDASPTEGGEPQENKVIITEMEEFVLGLQITEDTHKPESEDVFK 803 Query: 482 DEDDIPKPVEHEMDSQIGGWTEIKETNKNQQPPSEENDDVAPDEIIHEVAVXXXXXXXXX 303 DE+DIPKP+E E ++++GGWTE+ ET+ + +EE +D+ PDEIIHE ++ Sbjct: 804 DEEDIPKPLELETEAEVGGWTEVMETDDTEAAVNEEKEDINPDEIIHETSMGKGLSGALK 863 Query: 302 XXKDRGTLKESIDWGGRNMDKKKSKLVGINDNDGPKEIRIERTDEFGRIMTPKEAFRVIS 123 K+RGTL ESIDWGGRNMDKKKSKLVGINDN+GPKEIRIERTDEFGRIMTPKEAFR++S Sbjct: 864 LLKERGTLNESIDWGGRNMDKKKSKLVGINDNEGPKEIRIERTDEFGRIMTPKEAFRMLS 923 Query: 122 HKFHGKGPGKMKQEKRMKQFQEDLKTKQMKASDTPLLAME 3 HKFHGKGPGKMKQEKRMKQ+QEDLKTKQMKASDTPLLAME Sbjct: 924 HKFHGKGPGKMKQEKRMKQYQEDLKTKQMKASDTPLLAME 963 >ref|XP_010926911.1| PREDICTED: SART-1 family protein DOT2 [Elaeis guineensis] Length = 1017 Score = 785 bits (2026), Expect = 0.0 Identities = 442/799 (55%), Positives = 536/799 (67%), Gaps = 44/799 (5%) Frame = -1 Query: 2267 
KERDLSREYEHGQDQESMSEHSXXXXXXXXXGQDKNKGHDKSRVREKD------------ 2124 KERDL R + G++ + E S + +++G +K + REKD Sbjct: 159 KERDLERGKDRGKELDKERERSTDREHD----RGRDRGKEKGKEREKDREGERERDLMRE 214 Query: 2123 YEKQLEREKVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERD-----EKENDRE 1959 Y++ EREKV +G E+D +++ +RE Sbjct: 215 YDRGKEREKVHDHARDRDKDRERSKIRERDHEKDVEKELDRERGKEKDHERGKDRDRERE 274 Query: 1958 KNR-----LERAKDKEKSRE-ESVRNRETDLDKSR----------------TKDKERDAG 1845 K+R ER KDK K RE E +++RE + +K + +DKERD G Sbjct: 275 KDRDRLKDKEREKDKIKDREKEKIKDREKEKEKGKLEKVRAKEKSREKEIDIRDKERD-G 333 Query: 1844 RAKGGEENERTKVGGGGIDIVRXXXXXXXXXXXNHESFSLND----HDERPVGSQSTSEL 1677 RA+ GE++E+ K GG I R E N+ H+E+ + S STSEL Sbjct: 334 RAREGEKDEKVKADGGNSRIARKG-----------EEIQDNEGDLTHNEKSISSTSTSEL 382 Query: 1676 GECXXXXXXXXXXXRADDALEISSWVNKSRRLEEKRNTESEKAERTSRILDEQDNVGEES 1497 E + D A EISSWVNKSR+LEEKRN E EKA R S+ L+EQDN+ ES Sbjct: 383 EERVTKMKEERLKRKPDGASEISSWVNKSRKLEEKRNAEKEKALRLSKALEEQDNILAES 442 Query: 1496 DDETA-GHSANDLAGIKILHGLEKVIEGGNVVLTLKDQSILTDGDINEEVDMLENVEIGE 1320 +DE A GHS NDLAG+KILHGL+KV+EGG VVLTLKDQSIL DGDINE+ DMLENVEIGE Sbjct: 443 EDEEATGHSGNDLAGVKILHGLDKVMEGGAVVLTLKDQSILADGDINEDADMLENVEIGE 502 Query: 1319 QRRRDDAYKAAKKTTGLYEDKFNDDLSSHKTILPQYDDPVEDEGVTLDETGRFTGEAXXX 1140 Q++RD+AY+AAKK TGLY+DKF+DD+ S K ILPQYD+ +EDEGVTLDE+GRFTGEA Sbjct: 503 QKQRDEAYRAAKKRTGLYDDKFSDDMGSRKPILPQYDNEIEDEGVTLDESGRFTGEAEKK 562 Query: 1139 XXXXXXRIQGGSTSRAFEDLNSTAKNTSDYFTPDEMLQFXXXXXXXXXXXXXXLDLDALE 960 RI+GG + +EDL S+ K++SDY+TPDEMLQF LDLDALE Sbjct: 563 LEELRKRIEGGIIKQNYEDLTSSGKSSSDYYTPDEMLQFKKPKKKKSLRKKEKLDLDALE 622 Query: 959 AEAISSGLGVGDLGSRKDSKRLSXXXXXXXXXXEMRSNAYQSALAKAEEASKVLRQEQIL 780 AEAIS+GLG GDLGSR D +R + EMRSNAYQSA+AKAEEASK LRQEQ L Sbjct: 623 AEAISAGLGAGDLGSRNDLRRQTAKEEQVKADAEMRSNAYQSAIAKAEEASKALRQEQTL 682 Query: 779 TVKAVEDDGMVFGEDYEDLENSLEQARKLARKRKEEAAPSGPQAVALLATTNKEQEDTQG 600 TVK+VEDD +VFGED+EDL+ S+ QARKLA K+++E SGP+AVAL+ATT KEQED Sbjct: 683 TVKSVEDDNLVFGEDFEDLQRSIGQARKLALKKQDETPVSGPEAVALVATTKKEQEDA-S 741 Query: 599 
STVGEPQENKVVITEMEEFVLGLQLNEETHKPESEDVFKDEDDIPKPVEHEMDSQIGGWT 420 T GEPQENKV+ITEMEEFVLGLQ E+THKPESEDVFKDE+DIPK +E E ++++GGW Sbjct: 742 PTEGEPQENKVIITEMEEFVLGLQFTEDTHKPESEDVFKDEEDIPKSLELETEAEVGGWA 801 Query: 419 EIKETNKNQQPPSEENDDVAPDEIIHEVAVXXXXXXXXXXXKDRGTLKESIDWGGRNMDK 240 E+ ET+K + SEE +D+ PDEI HE A+ KDRGTL E +D GGRNMDK Sbjct: 802 EVMETDKTEAAVSEEKEDINPDEINHETAIGKGLSGVLKLLKDRGTLNEGVDLGGRNMDK 861 Query: 239 KKSKLVGINDNDGPKEIRIERTDEFGRIMTPKEAFRVISHKFHGKGPGKMKQEKRMKQFQ 60 KKSKLVGI DN+G KEIRIERTDEFGRIMTPKEAFR++SHKFHGKGPGKMKQEKRMKQ+Q Sbjct: 862 KKSKLVGIYDNEGQKEIRIERTDEFGRIMTPKEAFRMLSHKFHGKGPGKMKQEKRMKQYQ 921 Query: 59 EDLKTKQMKASDTPLLAME 3 EDLKTKQMKASDTPLLAME Sbjct: 922 EDLKTKQMKASDTPLLAME 940 >ref|XP_009405353.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Musa acuminata subsp. malaccensis] gi|695035842|ref|XP_009405354.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Musa acuminata subsp. malaccensis] gi|695035844|ref|XP_009405355.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Musa acuminata subsp. 
malaccensis] Length = 996 Score = 746 bits (1926), Expect = 0.0 Identities = 415/762 (54%), Positives = 513/762 (67%), Gaps = 8/762 (1%) Frame = -1 Query: 2264 ERDLSREYEHGQDQESMSEHSXXXXXXXXXGQDKNKGHDKSRVREKDYEKQLEREKVXXX 2085 + +SR E G+D E + + K + D+S++RE+D+EK ++RE Sbjct: 166 DHGISRVKERGKDSEIEKDRDLARKHD----RGKERDRDRSKIRERDHEKDVQRESERER 221 Query: 2084 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERD-EKENDREKNRL---ERAKDKEKSR 1917 + +D EKE ++EK+R+ ER K KE R Sbjct: 222 RKEKDHEKGTDKNREREKDRDMVKDREREREKTKDREKEKEKEKDRVRDKEREKTKENFR 281 Query: 1916 EESV-RNRETDLDKSRTKDKERDAGRAKGGEENERT--KVGGGGIDIVRXXXXXXXXXXX 1746 ++ + R+ E D D+SRT+D+E+ AK E++ERT G +D Sbjct: 282 QKEIDRSLEADRDRSRTRDREKGPAGAKESEKDERTLSDFEDGRLD---SREEEARDGSD 338 Query: 1745 NHESFSL-NDHDERPVGSQSTSELGECXXXXXXXXXXXRADDALEISSWVNKSRRLEEKR 1569 +HE +L N E+ S SEL E ++D A EISSWVNKSRRLEE++ Sbjct: 339 SHEKSTLKNQQSEKHTDSLLASELEERLARTKEERMKKKSDGAFEISSWVNKSRRLEERK 398 Query: 1568 NTESEKAERTSRILDEQDNVGEESDDETAGHSANDLAGIKILHGLEKVIEGGNVVLTLKD 1389 N E E A R S+ +EQDN+ + DDET GH+ DLAG+KILHGL+KVIEGG VVLTLKD Sbjct: 399 NAEKE-ALRLSKAFEEQDNMLADGDDETVGHTQKDLAGVKILHGLDKVIEGGAVVLTLKD 457 Query: 1388 QSILTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSSHKTILPQYD 1209 Q IL DGDINEE+DMLENVEIGEQ++RD+AYKAAKK TGLY+DKFND+ S KTILPQYD Sbjct: 458 QDILKDGDINEEIDMLENVEIGEQKQRDEAYKAAKKRTGLYDDKFNDETGSQKTILPQYD 517 Query: 1208 DPVEDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTSDYFTPDEML 1029 DPVEDEGV LDE+G FTGEA RI+G +++EDL S+AKN+SDY+T +EML Sbjct: 518 DPVEDEGVALDESGHFTGEAEKKLEELRRRIEGSFVPKSYEDLTSSAKNSSDYYTAEEML 577 Query: 1028 QFXXXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXXXXXXXEMRS 849 +F LDLDA+EAEA S+GLG DLGSR D +R E RS Sbjct: 578 RFKKPKKKKSLRKKEKLDLDAMEAEARSAGLGASDLGSRNDMRRQIEREEQEKIEAERRS 637 Query: 848 NAYQSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARKLARKRKEEA 669 AYQ+A KAEEASKV+ QEQ L +K+ EDD +VFGEDYEDL+ SLEQARKLA ++ +EA Sbjct: 638 KAYQTAYEKAEEASKVMLQEQTLRLKSFEDDDIVFGEDYEDLQMSLEQARKLALRKHDEA 697 Query: 668 
APSGPQAVALLATTNKEQEDTQGSTVGEPQENKVVITEMEEFVLGLQLNEETHKPESEDV 489 +GPQAVALLAT+ KEQE++Q + GE QE KVVITE+EEFVLGLQLNE KPESEDV Sbjct: 698 GATGPQAVALLATSIKEQENSQSQSTGELQEEKVVITEVEEFVLGLQLNEGAQKPESEDV 757 Query: 488 FKDEDDIPKPVEHEMDSQIGGWTEIKETNKNQQPPSEENDDVAPDEIIHEVAVXXXXXXX 309 F DE+D PK +E E+ + GWTE++ET+K++ P SE+ DDV+PDEIIHEVAV Sbjct: 758 FMDEEDSPKSLEPEIKVDVTGWTEVEETSKSEDPISEKKDDVSPDEIIHEVAVGKGLSGA 817 Query: 308 XXXXKDRGTLKESIDWGGRNMDKKKSKLVGINDNDGPKEIRIERTDEFGRIMTPKEAFRV 129 K+RG LKE++DWGGR MDKKKSKLVG+ D+ G KEIRIERTDEFGRIMTPKEAFR+ Sbjct: 818 LKLLKERGALKETVDWGGRTMDKKKSKLVGLYDDGGTKEIRIERTDEFGRIMTPKEAFRM 877 Query: 128 ISHKFHGKGPGKMKQEKRMKQFQEDLKTKQMKASDTPLLAME 3 +SHKFHGKGPGKMKQEKRMKQ+QEDLKTKQMKASDTPLLA+E Sbjct: 878 LSHKFHGKGPGKMKQEKRMKQYQEDLKTKQMKASDTPLLAVE 919 >ref|XP_010256356.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001422|ref|XP_010256357.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001427|ref|XP_010256358.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001430|ref|XP_010256359.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001433|ref|XP_010256360.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001436|ref|XP_010256361.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] Length = 851 Score = 716 bits (1848), Expect = 0.0 Identities = 403/771 (52%), Positives = 506/771 (65%), Gaps = 15/771 (1%) Frame = -1 Query: 2270 HKERDLSREYEHGQDQESMSEHSXXXXXXXXXGQDKNKGHDKSRVREKDYEKQLEREKVX 2091 H+ +D + + +E + EH +++ K +K R REK+ E+ EREK Sbjct: 44 HRSKDRKKSRREEKARERVKEHDRGLE------REREKEKEKDRDREKEKERGREREK-- 95 Query: 2090 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKENDREKNRLERAKDKEKSREE 1911 G + ++E +REK++ +R ++K K RE+ Sbjct: 96 -----------------------------DRDGSKDRDREKEREKHK-DREREKVKDREK 125 Query: 1910 SVRNRETDLDKSRTKDKERDA----------GRAKGGEENERTKVGGGGI-DIVRXXXXX 1764 R++ + DK R+KDKERDA GR K ++E+ + GG 
D+V+ Sbjct: 126 LERDKSKEKDKERSKDKERDARNGKLDDESQGRGKDVGKDEKLDLDGGNDRDVVKQVKEV 185 Query: 1763 XXXXXXNHESFSLNDHDERPVGSQ-STSELGECXXXXXXXXXXXRADDALEISSWVNKSR 1587 + + D GSQ ST EL E +++ E+ SWVNKSR Sbjct: 186 QHDVVVDMSVENKKKVDGAMGGSQPSTGELEERILKMREERSKKKSEGVSEVLSWVNKSR 245 Query: 1586 RLEEKRNTESEKAERTSRILDEQDNV--GEESDDETAGHSANDLAGIKILHGLEKVIEGG 1413 +LEEKRN E +KA + S++ +EQD + GE D++TA H++ DLAG+KILHG++KVIEGG Sbjct: 246 KLEEKRNAEKQKALQLSKVFEEQDKIDQGESEDEDTARHTSKDLAGVKILHGIDKVIEGG 305 Query: 1412 NVVLTLKDQSILTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSSH 1233 VVLTLKDQ+IL + D+NEE D+LENVEIGEQ++RD AYKAAKK TG+YEDKF+ + + Sbjct: 306 AVVLTLKDQNILANDDVNEEADVLENVEIGEQKQRDAAYKAAKKKTGIYEDKFSGEDGAQ 365 Query: 1232 KTILPQYDDPVEDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTSD 1053 K ILPQYDDPVEDEG+ LDE+GRF GEA R+QG S S FEDLNS+AK TSD Sbjct: 366 KKILPQYDDPVEDEGLVLDESGRFAGEAEKKLEELRKRLQGVSASNHFEDLNSSAKITSD 425 Query: 1052 YFTPDEMLQFXXXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXXX 873 ++T +EMLQF LDLDALEAEAIS+G GVGDLGSRKD +R + Sbjct: 426 FYTHEEMLQFKKPKKKKSLRKKVKLDLDALEAEAISAGFGVGDLGSRKDGQRQATKEQQE 485 Query: 872 XXXXEMRSNAYQSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARKL 693 EMRSNAYQSA AKAEEASK LRQEQ LTV+ E++ VFG+D EDL SLE+ARKL Sbjct: 486 RSEAEMRSNAYQSAFAKAEEASKTLRQEQTLTVQVEENESPVFGDDEEDLYKSLEKARKL 545 Query: 692 ARKRKEEAAPSGPQAVALLATT-NKEQEDTQGSTVGEPQENKVVITEMEEFVLGLQLNEE 516 A K + EAA SGPQAVALLA+T + + +D + T GEPQENKVV TEMEEFV GLQLNEE Sbjct: 546 ALKTQNEAAASGPQAVALLASTVSNQPKDEENLTSGEPQENKVVFTEMEEFVWGLQLNEE 605 Query: 515 THKPESEDVFKDEDDIPKPVEHEMDSQIGGWTEIKETNKNQQPPSEENDDVAPDEIIHEV 336 K ESEDVF DED++PK + E+ + GGWTE+ + ++N+ P EE ++V PDE IHEV Sbjct: 606 ARKLESEDVFMDEDNVPKASDQEIKDEAGGWTEVNDIDENEHPVEEEKEEVVPDETIHEV 665 Query: 335 AVXXXXXXXXXXXKDRGTLKESIDWGGRNMDKKKSKLVGINDNDGPKEIRIERTDEFGRI 156 A+ K+RGTLKE++DWGGRNMDKKKSKLVGI D+ GPKEIRIERTDEFGRI Sbjct: 666 AIGKGLSGALKLLKERGTLKETVDWGGRNMDKKKSKLVGIYDDGGPKEIRIERTDEFGRI 725 Query: 155 MTPKEAFRVISHKFHGKGPGKMKQEKRMKQFQEDLKTKQMKASDTPLLAME 3 
MTPKEAFRVISHKFHGKGPGKMKQEKRMKQ+QE+LK KQMK SDTP +ME Sbjct: 726 MTPKEAFRVISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSQSME 776 >ref|XP_010656678.1| PREDICTED: SART-1 family protein DOT2 [Vitis vinifera] gi|296090475|emb|CBI40671.3| unnamed protein product [Vitis vinifera] Length = 944 Score = 698 bits (1802), Expect = 0.0 Identities = 386/772 (50%), Positives = 505/772 (65%), Gaps = 17/772 (2%) Frame = -1 Query: 2267 KERDLSREYEHGQDQESMSEHSXXXXXXXXXGQDKNKGHDKSRVREKDYEKQLEREKVXX 2088 +ER++ +E + G+D+E E + +D++K +K R R KD +++ E+EK Sbjct: 144 REREVDKESDRGRDKERGKEKN----------RDRDKEREKERDRTKDRDREKEKEKSKD 193 Query: 2087 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKENDREKNRLERAKDKEKSREES 1908 EKE + +K+R A DKEK +E Sbjct: 194 R-----------------------------------EKERENDKDRDRDAIDKEKGKER- 217 Query: 1907 VRNRETDLDKSRTKDKERDAGRAKGGEE-NERTKVGG--------GGIDIVRXXXXXXXX 1755 +R++E + D+ R + K+RD G K +E ++R+K GG GG + R Sbjct: 218 IRDKEREADQDRDRYKDRDKGSRKNRDEGHDRSKDGGKDDKLKLDGGDNRDRDVTKQGRG 277 Query: 1754 XXXNHESFSLNDHDERPVGSQ----STSELGECXXXXXXXXXXXRADDALEISSWVNKSR 1587 + + +H++ G+ ST++L E +++ + E+ +WVN+SR Sbjct: 278 SHHDEDDSRAIEHEKNAEGASGPQSSTAQLQERILRMKEERVKRKSEGSSEVLAWVNRSR 337 Query: 1586 RLEEKRNTESEKAERTSRILDEQDNV--GEESDDETAGHSANDLAGIKILHGLEKVIEGG 1413 ++EE+RN E EKA + S+I +EQDN+ GE D++ HS+ DLAG+K+LHGL+KVIEGG Sbjct: 338 KVEEQRNAEKEKALQLSKIFEEQDNIDQGESDDEKPTRHSSQDLAGVKVLHGLDKVIEGG 397 Query: 1412 NVVLTLKDQSILTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSSH 1233 VVLTLKDQ IL +GDINE+VDMLENVEIGEQ+RRD+AYKAAKK TG+YEDKFND+ S Sbjct: 398 AVVLTLKDQDILANGDINEDVDMLENVEIGEQKRRDEAYKAAKKKTGIYEDKFNDEPGSE 457 Query: 1232 KTILPQYDDPVEDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTSD 1053 K ILPQYDDPV DEG+ LD +GRFTGEA R+QG ST+ FEDLN+ KN+SD Sbjct: 458 KKILPQYDDPVTDEGLALDASGRFTGEAEKKLEELRRRLQGVSTNNRFEDLNTYGKNSSD 517 Query: 1052 YFTPDEMLQFXXXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXXX 873 Y+T +EMLQF L++DALEAEA+S+GLGVGDLGSR D KR S Sbjct: 518 YYTHEEMLQFKKPKKKKSLRKKEKLNIDALEAEAVSAGLGVGDLGSRNDGKRQSIREEQE 577 Query: 872 
XXXXEMRSNAYQSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARKL 693 EMR++AYQ A AKA+EASK LR +Q L V+ E++ VFGED E+L+ SL++ARKL Sbjct: 578 RSEAEMRNSAYQLAYAKADEASKALRLDQTLPVQLEENENQVFGEDDEELQKSLQRARKL 637 Query: 692 ARKRKEEAAPSGPQAVALLA--TTNKEQEDTQGSTVGEPQENKVVITEMEEFVLGLQLNE 519 ++++EAA SGPQA+ALLA TT+ + D Q GE QEN+VV TEMEEFV GLQL + Sbjct: 638 VLQKQDEAATSGPQAIALLASTTTSSQNVDNQNPISGESQENRVVFTEMEEFVWGLQLED 697 Query: 518 ETHKPESEDVFKDEDDIPKPVEHEMDSQIGGWTEIKETNKNQQPPSEENDDVAPDEIIHE 339 E HKP+ EDVF DED+ PK + E + GGWTE+K+T+K++ P +E +++ PD+ IHE Sbjct: 698 EAHKPDGEDVFMDEDEAPKASDQERKDEAGGWTEVKDTDKDELPVNENKEEMVPDDTIHE 757 Query: 338 VAVXXXXXXXXXXXKDRGTLKESIDWGGRNMDKKKSKLVGINDNDGPKEIRIERTDEFGR 159 VAV K+RGTLKE I+WGGRNMDKKKSKLVGI DN G KEIRIERTDEFGR Sbjct: 758 VAVGKGLSGALQLLKERGTLKEGIEWGGRNMDKKKSKLVGIYDNTGTKEIRIERTDEFGR 817 Query: 158 IMTPKEAFRVISHKFHGKGPGKMKQEKRMKQFQEDLKTKQMKASDTPLLAME 3 IMTPKEAFR+ISHKFHGKGPGKMKQEKRMKQ+QE+LK KQMK SDTP ++E Sbjct: 818 IMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSQSVE 869 >gb|KJB61483.1| hypothetical protein B456_009G361400 [Gossypium raimondii] Length = 878 Score = 672 bits (1734), Expect = 0.0 Identities = 384/767 (50%), Positives = 497/767 (64%), Gaps = 13/767 (1%) Frame = -1 Query: 2264 ERDLSREYEHGQDQES-MSEHSXXXXXXXXXGQDKNKGHDKSRVREKDYEKQLEREKVXX 2088 +RD RE EH +++E + +K +G DKSR R+++ EK+ ++ K Sbjct: 110 DRDKHREKEHEREREKDRKDRGKEKDRERDRESEKERGKDKSRDRDREKEKERDKAK--- 166 Query: 2087 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKENDREKNRLERAKDKEKSREES 1908 ER EKE D+ K+R E+ ++ EK ++ S Sbjct: 167 ---------------------------------ER-EKERDKLKDR-EKEREGEKGKDRS 191 Query: 1907 V-RNRETDLDKSRTKDKERDAGRAKGGEENERTKVGGGGIDIVRXXXXXXXXXXXNHESF 1731 +NRE DL+K R++D RD E+ E +K G +D + + Sbjct: 192 KQKNREADLEKERSRD--RDNVGKNHEEDYEGSKDGELALDY---------EDRRDKDEA 240 Query: 1730 SLNDHDERPVGSQSTSELGECXXXXXXXXXXXRADDALEISSWVNKSRRLEEKRNTESEK 1551 LN + S+SEL E +++ E+S+WV++SR+LE+KRN E EK Sbjct: 241 ELNAGSNASLVQASSSELEERIVRMKEDRLKKKSEGLSEVSAWVSRSRKLEDKRNAEKEK 300 Query: 1550 
AERTSRILDEQDNV--GEESDDETAGHSANDLAGIKILHGLEKVIEGGNVVLTLKDQSIL 1377 A + S+I +EQDN GE+ D+E +DL G+K+LHGL+KV++GG VVLTLKDQSIL Sbjct: 301 ALQLSKIFEEQDNFVQGEDEDEEADNRPTHDLGGVKVLHGLDKVMDGGAVVLTLKDQSIL 360 Query: 1376 TDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSSHKTILPQYDDPVE 1197 DGD+NE+VDMLEN+EIGEQ++RD+AYKAAKK TG+Y+DKFN+D S K ILPQYDDPV Sbjct: 361 ADGDLNEDVDMLENIEIGEQKQRDEAYKAAKKKTGVYDDKFNEDPGSEKKILPQYDDPVA 420 Query: 1196 DEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTSDYFTPDEMLQFXX 1017 DEGVTLDE GRFTGEA R+ G T+ EDLN+ K +SDY+T +EML+F Sbjct: 421 DEGVTLDERGRFTGEAEKKLEELRKRLLGVPTNNRVEDLNNVGKISSDYYTQEEMLRFKK 480 Query: 1016 XXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXXXXXXXEMRSNAYQ 837 LD+DALEAEA+S+GLG GDLGSRKDS+R + E R NAYQ Sbjct: 481 PKKKKALRKKEKLDIDALEAEAVSAGLGAGDLGSRKDSRRQAIKEEEARSEAEKRKNAYQ 540 Query: 836 SALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARKLARKRKEEAAPSG 657 +A AKA+EASK LR EQ TVK ED+ VF +D EDL SLE+AR+LA K++EE SG Sbjct: 541 AAFAKADEASKSLRLEQTHTVKPEEDENQVFADDEEDLYKSLEKARRLALKKQEE--KSG 598 Query: 656 PQAVALLATTNKEQEDTQGST-VGEPQENKVVITEMEEFVLGLQLNEETHKPESEDVFKD 480 PQA+ALLATT+ + T T GE QENKVVITEMEEFV GLQL+EE HKP+SEDVF D Sbjct: 599 PQAIALLATTSASNQTTDDHTSTGEAQENKVVITEMEEFVWGLQLDEEAHKPDSEDVFMD 658 Query: 479 EDDIPKPVEHEM---DSQIGGWTEIKETNKNQQPPSEENDDVAPDEIIHEVAVXXXXXXX 309 ED++P E + ++++GGWTE+ +T+ +++P +E+ND+V PDE IHE+AV Sbjct: 659 EDEVPGASEQDRKNGENEVGGWTEVIDTSADEKPANEDNDEVVPDETIHEIAVGKGLSGA 718 Query: 308 XXXXKDRGTLKESIDWGGRNMDKKKSKLVGINDNDGP-----KEIRIERTDEFGRIMTPK 144 KDRGTLKE+I+WGGRNMDKKKSKLVGI D+D K+IRIERTDEFGRI+TPK Sbjct: 719 LKLLKDRGTLKETIEWGGRNMDKKKSKLVGIVDDDHQTDNRFKDIRIERTDEFGRIVTPK 778 Query: 143 EAFRVISHKFHGKGPGKMKQEKRMKQFQEDLKTKQMKASDTPLLAME 3 EAFR++SHKFHGKGPGKMKQEKRMKQ+QE+LK KQMK SDTP L++E Sbjct: 779 EAFRMLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVE 825 >ref|XP_012441144.1| PREDICTED: SART-1 family protein DOT2 [Gossypium raimondii] gi|823216924|ref|XP_012441145.1| PREDICTED: SART-1 family protein DOT2 [Gossypium raimondii] gi|763794483|gb|KJB61479.1| 
hypothetical protein B456_009G361400 [Gossypium raimondii] gi|763794484|gb|KJB61480.1| hypothetical protein B456_009G361400 [Gossypium raimondii] gi|763794485|gb|KJB61481.1| hypothetical protein B456_009G361400 [Gossypium raimondii] gi|763794488|gb|KJB61484.1| hypothetical protein B456_009G361400 [Gossypium raimondii] Length = 900 Score = 672 bits (1734), Expect = 0.0 Identities = 384/767 (50%), Positives = 497/767 (64%), Gaps = 13/767 (1%) Frame = -1 Query: 2264 ERDLSREYEHGQDQES-MSEHSXXXXXXXXXGQDKNKGHDKSRVREKDYEKQLEREKVXX 2088 +RD RE EH +++E + +K +G DKSR R+++ EK+ ++ K Sbjct: 110 DRDKHREKEHEREREKDRKDRGKEKDRERDRESEKERGKDKSRDRDREKEKERDKAK--- 166 Query: 2087 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKENDREKNRLERAKDKEKSREES 1908 ER EKE D+ K+R E+ ++ EK ++ S Sbjct: 167 ---------------------------------ER-EKERDKLKDR-EKEREGEKGKDRS 191 Query: 1907 V-RNRETDLDKSRTKDKERDAGRAKGGEENERTKVGGGGIDIVRXXXXXXXXXXXNHESF 1731 +NRE DL+K R++D RD E+ E +K G +D + + Sbjct: 192 KQKNREADLEKERSRD--RDNVGKNHEEDYEGSKDGELALDY---------EDRRDKDEA 240 Query: 1730 SLNDHDERPVGSQSTSELGECXXXXXXXXXXXRADDALEISSWVNKSRRLEEKRNTESEK 1551 LN + S+SEL E +++ E+S+WV++SR+LE+KRN E EK Sbjct: 241 ELNAGSNASLVQASSSELEERIVRMKEDRLKKKSEGLSEVSAWVSRSRKLEDKRNAEKEK 300 Query: 1550 AERTSRILDEQDNV--GEESDDETAGHSANDLAGIKILHGLEKVIEGGNVVLTLKDQSIL 1377 A + S+I +EQDN GE+ D+E +DL G+K+LHGL+KV++GG VVLTLKDQSIL Sbjct: 301 ALQLSKIFEEQDNFVQGEDEDEEADNRPTHDLGGVKVLHGLDKVMDGGAVVLTLKDQSIL 360 Query: 1376 TDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSSHKTILPQYDDPVE 1197 DGD+NE+VDMLEN+EIGEQ++RD+AYKAAKK TG+Y+DKFN+D S K ILPQYDDPV Sbjct: 361 ADGDLNEDVDMLENIEIGEQKQRDEAYKAAKKKTGVYDDKFNEDPGSEKKILPQYDDPVA 420 Query: 1196 DEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTSDYFTPDEMLQFXX 1017 DEGVTLDE GRFTGEA R+ G T+ EDLN+ K +SDY+T +EML+F Sbjct: 421 DEGVTLDERGRFTGEAEKKLEELRKRLLGVPTNNRVEDLNNVGKISSDYYTQEEMLRFKK 480 Query: 1016 XXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXXXXXXXEMRSNAYQ 837 LD+DALEAEA+S+GLG GDLGSRKDS+R + E R NAYQ Sbjct: 481 
PKKKKALRKKEKLDIDALEAEAVSAGLGAGDLGSRKDSRRQAIKEEEARSEAEKRKNAYQ 540 Query: 836 SALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARKLARKRKEEAAPSG 657 +A AKA+EASK LR EQ TVK ED+ VF +D EDL SLE+AR+LA K++EE SG Sbjct: 541 AAFAKADEASKSLRLEQTHTVKPEEDENQVFADDEEDLYKSLEKARRLALKKQEE--KSG 598 Query: 656 PQAVALLATTNKEQEDTQGST-VGEPQENKVVITEMEEFVLGLQLNEETHKPESEDVFKD 480 PQA+ALLATT+ + T T GE QENKVVITEMEEFV GLQL+EE HKP+SEDVF D Sbjct: 599 PQAIALLATTSASNQTTDDHTSTGEAQENKVVITEMEEFVWGLQLDEEAHKPDSEDVFMD 658 Query: 479 EDDIPKPVEHEM---DSQIGGWTEIKETNKNQQPPSEENDDVAPDEIIHEVAVXXXXXXX 309 ED++P E + ++++GGWTE+ +T+ +++P +E+ND+V PDE IHE+AV Sbjct: 659 EDEVPGASEQDRKNGENEVGGWTEVIDTSADEKPANEDNDEVVPDETIHEIAVGKGLSGA 718 Query: 308 XXXXKDRGTLKESIDWGGRNMDKKKSKLVGINDNDGP-----KEIRIERTDEFGRIMTPK 144 KDRGTLKE+I+WGGRNMDKKKSKLVGI D+D K+IRIERTDEFGRI+TPK Sbjct: 719 LKLLKDRGTLKETIEWGGRNMDKKKSKLVGIVDDDHQTDNRFKDIRIERTDEFGRIVTPK 778 Query: 143 EAFRVISHKFHGKGPGKMKQEKRMKQFQEDLKTKQMKASDTPLLAME 3 EAFR++SHKFHGKGPGKMKQEKRMKQ+QE+LK KQMK SDTP L++E Sbjct: 779 EAFRMLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVE 825 >ref|XP_012077379.1| PREDICTED: SART-1 family protein DOT2 isoform X1 [Jatropha curcas] gi|643724962|gb|KDP34163.1| hypothetical protein JCGZ_07734 [Jatropha curcas] Length = 864 Score = 671 bits (1730), Expect = 0.0 Identities = 386/773 (49%), Positives = 499/773 (64%), Gaps = 19/773 (2%) Frame = -1 Query: 2264 ERDLSREYEHGQDQESMSEHSXXXXXXXXXGQDKNKGHDKSRVREKDYEKQLEREKVXXX 2085 +++ SRE E +D+E K++G +KSR R++D E++ ERE+V Sbjct: 76 DKEKSREKERERDKER-----------------KDRGKEKSRDRDRDKEREKERERV--- 115 Query: 2084 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKENDREKNR-LERAKDKEKSREES 1908 + EK DREK R +E+ +D+EK RE++ Sbjct: 116 --------------------------------KEKEKYKDREKEREVEKDRDREKGREKT 143 Query: 1907 V-RNRETDLDKSRTKDKERDAGRAKGGEENERTKVGGGGIDIVRXXXXXXXXXXXNHES- 1734 R R++D DK R +D+E+ + R+ E+ +R+K D+V +S Sbjct: 144 KERERDSDYDKERLRDREKVSKRSHE-EDYDRSKD-----DVVEMDYENNKDSSVLKQSK 197 Query: 1733 FSLNDHDERPV------GSQSTSELGECXXXXXXXXXXXRADDALEISSWVNKSRRLEEK 
1572 S ++ DE+ GS S+L E ++ E+ +WVN+SR+LEEK Sbjct: 198 VSFDNKDEQKAEETSRGGSAPVSQLEERILKMKEERLKKNSEPGDEVLAWVNRSRKLEEK 257 Query: 1571 RNTESEKAERTSRILDEQDN--VGEESDDETAGHSANDLAGIKILHGLEKVIEGGNVVLT 1398 +N E +KA++ S+I +EQDN GE D+++ H+ +DLAG+K+LHGLEKV+EGG VVLT Sbjct: 258 KNAEKQKAKQLSKIFEEQDNNVQGESEDEDSGEHTTHDLAGVKVLHGLEKVMEGGAVVLT 317 Query: 1397 LKDQSILTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSSHKTILP 1218 LKDQSIL DGDINEEVDMLENVEIGEQ+RRDDAYKAAKK TG+Y+DKFNDD +S K ILP Sbjct: 318 LKDQSILADGDINEEVDMLENVEIGEQKRRDDAYKAAKKKTGIYDDKFNDDPASEKKILP 377 Query: 1217 QYDDPVEDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTSDYFTPD 1038 QYDD DEGV LDE GRFTGEA R+QG ST+ FEDL+S+ K +SDY+T + Sbjct: 378 QYDDSAADEGVALDERGRFTGEAEKKLEELRRRLQGVSTNNRFEDLSSSGKISSDYYTHE 437 Query: 1037 EMLQFXXXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXXXXXXXE 858 E+LQF LD+DALEAEA+S+GLGVGDLGSR + +R + E Sbjct: 438 ELLQFKKPKKKKSLRKKEKLDIDALEAEAVSAGLGVGDLGSRNNGRRQAIRQEQERSEAE 497 Query: 857 MRSNAYQSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARKLARKRK 678 MRS+AYQ+A KA+EASK LRQEQ L K ED+ VF ED EDL SLE+ARKLA K++ Sbjct: 498 MRSSAYQAAYDKADEASKSLRQEQTLHAKLDEDENPVFAEDDEDLYKSLERARKLALKKQ 557 Query: 677 EEAAPSGPQAVALLA----TTNKEQEDTQGSTVGEPQENKVVITEMEEFVLGLQLNEETH 510 EE A SGPQA+A LA TT+ + D Q T GE QENK+V TEMEEFV GLQL+EE+H Sbjct: 558 EEKA-SGPQAIARLAAATTTTSSQTTDDQNPTTGESQENKIVFTEMEEFVWGLQLDEESH 616 Query: 509 KPESEDVFKDEDDIPKPVEHEMDSQIGGWTEIKETNKNQQPPSEENDDVAPDEIIHEVAV 330 K ++DVF DED+ P + E + GGWTE+++ +K++ P +E N+D+ PDE IHEV V Sbjct: 617 KHGNDDVFMDEDEAPIVSDQEKKDETGGWTEVQDIDKDENPVNENNEDIVPDETIHEVPV 676 Query: 329 XXXXXXXXXXXKDRGTLKESIDWGGRNMDKKKSKLVGINDNDGP----KEIRIERTDEFG 162 K+RGTLKES +WGGRNMDKKKSKLVGI D+D K+IRI+RTDE+G Sbjct: 677 GKGLSAALKLLKERGTLKESTEWGGRNMDKKKSKLVGIVDSDVDNERFKDIRIDRTDEYG 736 Query: 161 RIMTPKEAFRVISHKFHGKGPGKMKQEKRMKQFQEDLKTKQMKASDTPLLAME 3 R +TPKEAFR+ISHKFHGKGPGKMKQEKRMKQ+ E+LK KQMK SDTP L++E Sbjct: 737 RTLTPKEAFRIISHKFHGKGPGKMKQEKRMKQYLEELKMKQMKNSDTPSLSVE 789 >ref|XP_011094061.1| PREDICTED: SART-1 family 
protein DOT2 [Sesamum indicum] Length = 942 Score = 667 bits (1722), Expect = 0.0 Identities = 381/773 (49%), Positives = 485/773 (62%), Gaps = 18/773 (2%) Frame = -1 Query: 2267 KERDLSREYEHGQDQESMSEHSXXXXXXXXXGQDKNKGHDKSRVREKDYEKQLEREKVXX 2088 KE+D R+ + +++E E G++K +G +KSR REK+ ++ +RE+ Sbjct: 130 KEKDKERK-DRAKEKERERERDKELEKDADKGREKERGKEKSRDREKERDRTKDREREKH 188 Query: 2087 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKEN--DREKNRLERAKDKEKSRE 1914 ER EKEN DR K+ +R K KE++RE Sbjct: 189 RDR------------------------------ER-EKENGRDRGKDTADREKGKERNRE 217 Query: 1913 ESVRNRETDLDKSRTKDKERDAGRAKGGEENERTKVGGGGIDIVRXXXXXXXXXXXNHES 1734 + ++ D +K R +D+ER + + K + G + Sbjct: 218 ---KEKQADQEKDRARDRERSSRKQKDESHDRSKDTDKDGHSRLENDYSRDKQSTKELAD 274 Query: 1733 FSLNDHDERPV------------GSQSTSELGECXXXXXXXXXXXRADDALEISSWVNKS 1590 S +++D + + QS SEL + ++ A E+ +WVN+S Sbjct: 275 NSDDENDSKILKHQEKADTAIAGSRQSASELEDRISKMREERLKKPSEGASEVLAWVNRS 334 Query: 1589 RRLEEKRNTESEKAERTSRILDEQDNV-GEESDDETAG-HSANDLAGIKILHGLEKVIEG 1416 R+LEEKR E EKA + S+I +EQDN+ G ESD+E A H+ DL G+KILHGL+KV+EG Sbjct: 335 RKLEEKRTAEKEKALQLSKIFEEQDNMNGGESDEEAAAEHTTQDLGGVKILHGLDKVLEG 394 Query: 1415 GNVVLTLKDQSILTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSS 1236 G VVLTLKDQSIL DGDINEEVDMLENVEIGEQ+RRD+AYKAAKK TG+Y+DKF+D+ + Sbjct: 395 GAVVLTLKDQSILADGDINEEVDMLENVEIGEQKRRDEAYKAAKKKTGIYDDKFSDEPGA 454 Query: 1235 HKTILPQYDDPVEDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTS 1056 K ILPQYDDPV DEGVTLD +GRFTGEA RIQG STS EDLNSTAK + Sbjct: 455 EKKILPQYDDPVADEGVTLDSSGRFTGEAERKLEELRRRIQGVSTSTRGEDLNSTAKILT 514 Query: 1055 DYFTPDEMLQFXXXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXX 876 DY+T DEM +F LDLDALEAEA S+GLG GDLGSR D +R + Sbjct: 515 DYYTQDEMTKFKKPKKKKSLRKKEKLDLDALEAEARSAGLGAGDLGSRNDGRRQNLREEQ 574 Query: 875 XXXXXEMRSNAYQSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARK 696 EMR NAY+SA AKA+EASK LRQEQ+ ++ EDD VFG+D ++L SLE+ARK Sbjct: 575 EKIEAEMRRNAYESAYAKADEASKALRQEQVPAMQTEEDDAPVFGDDDDELRKSLERARK 634 Query: 695 
LARKRKEEAAPSGPQAVALLATTNKEQEDTQGSTVG--EPQENKVVITEMEEFVLGLQLN 522 +A K+++E S PQ + LLAT++ T+ G + QENKV+ TEMEEFV GLQL+ Sbjct: 635 IALKKQDEEEKSAPQVITLLATSSANDSTTENPNSGSVDQQENKVIFTEMEEFVWGLQLD 694 Query: 521 EETHKPESEDVFKDEDDIPKPVEHEMDSQIGGWTEIKETNKNQQPPSEENDDVAPDEIIH 342 EE PESEDVF +ED P + EM + GGW E+KET K++ P EE ++V PDE IH Sbjct: 695 EEEKNPESEDVFMEEDVAPSTSDQEMKDEAGGWAEVKETMKDETPAKEEKEEVVPDETIH 754 Query: 341 EVAVXXXXXXXXXXXKDRGTLKESIDWGGRNMDKKKSKLVGINDNDGPKEIRIERTDEFG 162 E AV KDRGTLKE+I+WGGRNMDKKKSKLVGI DND KEIRIERTDE+G Sbjct: 755 ESAVGKGLAGALKLLKDRGTLKETIEWGGRNMDKKKSKLVGIYDNDAAKEIRIERTDEYG 814 Query: 161 RIMTPKEAFRVISHKFHGKGPGKMKQEKRMKQFQEDLKTKQMKASDTPLLAME 3 RI+TPKEAFR++SHKFHGKGPGKMKQEKRM+Q+QE+LK KQMK +DTP L++E Sbjct: 815 RILTPKEAFRLLSHKFHGKGPGKMKQEKRMRQYQEELKVKQMKNADTPSLSVE 867 >ref|XP_007022029.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 5, partial [Theobroma cacao] gi|508721657|gb|EOY13554.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 5, partial [Theobroma cacao] Length = 807 Score = 664 bits (1713), Expect = 0.0 Identities = 381/768 (49%), Positives = 492/768 (64%), Gaps = 14/768 (1%) Frame = -1 Query: 2264 ERDLSREYEHGQDQES-MSEHSXXXXXXXXXGQDKNKGHDKSRVREKDYEKQLEREKVXX 2088 +RD RE EH +++E + +K +G DK R R+++ EK+ ++ K Sbjct: 6 DRDKYREKEHEREREKDRKDRGKEKDRERGRDSEKERGKDKGRDRDREKEKERDKAK--- 62 Query: 2087 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKENDREKN-RLERAKDKEKSREE 1911 ER++K+ ++E+ +R +D+EK +E Sbjct: 63 ---------------------------------EREKKDREKEREGEKDRDRDREKGKER 89 Query: 1910 SV-RNRETDLDKSRTKDKERDAGRAKGGEENERTKVGGGGIDIVRXXXXXXXXXXXNHES 1734 S ++RE DL+K R++D++ +A + E+ E +K G +D + + Sbjct: 90 SKQKSREADLEKERSRDRD-NAIKKNHEEDYEGSKDGELALDY---------GDSRDKDE 139 Query: 1733 FSLNDHDERPVGSQSTSELGECXXXXXXXXXXXRADDALEISSWVNKSRRLEEKRNTESE 1554 LN V S+SEL E +++ E+ WV R+LEEKRN E E Sbjct: 140 AELNAGSNAGVAQASSSELEERIARMKEERLKKKSEGVSEVLEWVGNFRKLEEKRNAEKE 199 Query: 1553 KAERTSRILDEQDNV--GEESDDETAGHSANDLAGIKILHGLEKVIEGGNVVLTLKDQSI 1380 KA + S+I +EQD+ GE D+E 
H+A+DLAG+K+LHGL+KV++GG VVLTLKDQSI Sbjct: 200 KALQRSKIFEEQDDFVQGENEDEEAVRHAAHDLAGVKVLHGLDKVMDGGAVVLTLKDQSI 259 Query: 1379 LTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSSHKTILPQYDDPV 1200 L +GDINE+VDMLENVEIGEQRRRD+AYKAAKK TG+Y+DKFND+ S K ILPQYD+PV Sbjct: 260 LANGDINEDVDMLENVEIGEQRRRDEAYKAAKKKTGVYDDKFNDEPGSEKKILPQYDNPV 319 Query: 1199 EDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTSDYFTPDEMLQFX 1020 DEGVTLDE GRFTGEA R+QG T+ EDLN+ K SDY+T +EML+F Sbjct: 320 ADEGVTLDERGRFTGEAEKKLQELRKRLQGVPTNNRVEDLNNAGKIASDYYTQEEMLKFK 379 Query: 1019 XXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXXXXXXXEMRSNAY 840 LD+DALEAEAISSGLG GDLGSR D++R + E R++AY Sbjct: 380 KPKKKKALRKKEKLDIDALEAEAISSGLGAGDLGSRNDARRQAIREEEARSEAEKRNSAY 439 Query: 839 QSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARKLARKRKEEAAPS 660 QSA AKA+EASK L EQ L VK ED+ VF +D +DL S+E++RKLA K K+E S Sbjct: 440 QSAYAKADEASKSLWLEQTLIVKPEEDENQVFADDDDDLYKSIERSRKLAFK-KQEDEKS 498 Query: 659 GPQAVALLATTN--KEQEDTQGSTVGEPQENKVVITEMEEFVLGLQLNEETHKPESEDVF 486 GPQA+AL ATT + D Q +T GE QENK+VITEMEEFV GLQ +EE HKP+SEDVF Sbjct: 499 GPQAIALRATTAAISQTADDQTTTTGEAQENKLVITEMEEFVWGLQHDEEAHKPDSEDVF 558 Query: 485 KDEDDIPKPVEHE---MDSQIGGWTEIKETNKNQQPPSEENDDVAPDEIIHEVAVXXXXX 315 DED++P EH+ ++++GGWTE+ + + ++ P +E+ DD+ PDE IHEVAV Sbjct: 559 MDEDEVPGVSEHDGKSGENEVGGWTEVVDASTDENPSNEDKDDIVPDETIHEVAVGKGLS 618 Query: 314 XXXXXXKDRGTLKESIDWGGRNMDKKKSKLVGIND----NDGPKEIRIERTDEFGRIMTP 147 KDRGTLKESI+WGGRNMDKKKSKLVGI D ND K+IRIERTDEFGRI+TP Sbjct: 619 GALKLLKDRGTLKESIEWGGRNMDKKKSKLVGIVDDDRENDRFKDIRIERTDEFGRIITP 678 Query: 146 KEAFRVISHKFHGKGPGKMKQEKRMKQFQEDLKTKQMKASDTPLLAME 3 KEAFRV+SHKFHGKGPGKMKQEKR KQ+QE+LK KQMK SDTP L++E Sbjct: 679 KEAFRVLSHKFHGKGPGKMKQEKRQKQYQEELKLKQMKNSDTPSLSVE 726 >ref|XP_007022027.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 3, partial [Theobroma cacao] gi|508721655|gb|EOY13552.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 3, partial [Theobroma cacao] Length = 864 Score = 664 bits (1713), Expect = 0.0 Identities = 381/768 (49%), 
Positives = 492/768 (64%), Gaps = 14/768 (1%) Frame = -1 Query: 2264 ERDLSREYEHGQDQES-MSEHSXXXXXXXXXGQDKNKGHDKSRVREKDYEKQLEREKVXX 2088 +RD RE EH +++E + +K +G DK R R+++ EK+ ++ K Sbjct: 112 DRDKYREKEHEREREKDRKDRGKEKDRERGRDSEKERGKDKGRDRDREKEKERDKAK--- 168 Query: 2087 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKENDREKN-RLERAKDKEKSREE 1911 ER++K+ ++E+ +R +D+EK +E Sbjct: 169 ---------------------------------EREKKDREKEREGEKDRDRDREKGKER 195 Query: 1910 SV-RNRETDLDKSRTKDKERDAGRAKGGEENERTKVGGGGIDIVRXXXXXXXXXXXNHES 1734 S ++RE DL+K R++D++ +A + E+ E +K G +D + + Sbjct: 196 SKQKSREADLEKERSRDRD-NAIKKNHEEDYEGSKDGELALDY---------GDSRDKDE 245 Query: 1733 FSLNDHDERPVGSQSTSELGECXXXXXXXXXXXRADDALEISSWVNKSRRLEEKRNTESE 1554 LN V S+SEL E +++ E+ WV R+LEEKRN E E Sbjct: 246 AELNAGSNAGVAQASSSELEERIARMKEERLKKKSEGVSEVLEWVGNFRKLEEKRNAEKE 305 Query: 1553 KAERTSRILDEQDNV--GEESDDETAGHSANDLAGIKILHGLEKVIEGGNVVLTLKDQSI 1380 KA + S+I +EQD+ GE D+E H+A+DLAG+K+LHGL+KV++GG VVLTLKDQSI Sbjct: 306 KALQRSKIFEEQDDFVQGENEDEEAVRHAAHDLAGVKVLHGLDKVMDGGAVVLTLKDQSI 365 Query: 1379 LTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSSHKTILPQYDDPV 1200 L +GDINE+VDMLENVEIGEQRRRD+AYKAAKK TG+Y+DKFND+ S K ILPQYD+PV Sbjct: 366 LANGDINEDVDMLENVEIGEQRRRDEAYKAAKKKTGVYDDKFNDEPGSEKKILPQYDNPV 425 Query: 1199 EDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTSDYFTPDEMLQFX 1020 DEGVTLDE GRFTGEA R+QG T+ EDLN+ K SDY+T +EML+F Sbjct: 426 ADEGVTLDERGRFTGEAEKKLQELRKRLQGVPTNNRVEDLNNAGKIASDYYTQEEMLKFK 485 Query: 1019 XXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXXXXXXXEMRSNAY 840 LD+DALEAEAISSGLG GDLGSR D++R + E R++AY Sbjct: 486 KPKKKKALRKKEKLDIDALEAEAISSGLGAGDLGSRNDARRQAIREEEARSEAEKRNSAY 545 Query: 839 QSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARKLARKRKEEAAPS 660 QSA AKA+EASK L EQ L VK ED+ VF +D +DL S+E++RKLA K K+E S Sbjct: 546 QSAYAKADEASKSLWLEQTLIVKPEEDENQVFADDDDDLYKSIERSRKLAFK-KQEDEKS 604 Query: 659 GPQAVALLATTN--KEQEDTQGSTVGEPQENKVVITEMEEFVLGLQLNEETHKPESEDVF 486 GPQA+AL ATT + D Q +T GE QENK+VITEMEEFV GLQ +EE HKP+SEDVF Sbjct: 605 
GPQAIALRATTAAISQTADDQTTTTGEAQENKLVITEMEEFVWGLQHDEEAHKPDSEDVF 664 Query: 485 KDEDDIPKPVEHE---MDSQIGGWTEIKETNKNQQPPSEENDDVAPDEIIHEVAVXXXXX 315 DED++P EH+ ++++GGWTE+ + + ++ P +E+ DD+ PDE IHEVAV Sbjct: 665 MDEDEVPGVSEHDGKSGENEVGGWTEVVDASTDENPSNEDKDDIVPDETIHEVAVGKGLS 724 Query: 314 XXXXXXKDRGTLKESIDWGGRNMDKKKSKLVGIND----NDGPKEIRIERTDEFGRIMTP 147 KDRGTLKESI+WGGRNMDKKKSKLVGI D ND K+IRIERTDEFGRI+TP Sbjct: 725 GALKLLKDRGTLKESIEWGGRNMDKKKSKLVGIVDDDRENDRFKDIRIERTDEFGRIITP 784 Query: 146 KEAFRVISHKFHGKGPGKMKQEKRMKQFQEDLKTKQMKASDTPLLAME 3 KEAFRV+SHKFHGKGPGKMKQEKR KQ+QE+LK KQMK SDTP L++E Sbjct: 785 KEAFRVLSHKFHGKGPGKMKQEKRQKQYQEELKLKQMKNSDTPSLSVE 832 >ref|XP_007022025.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao] gi|590611175|ref|XP_007022026.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao] gi|508721653|gb|EOY13550.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao] gi|508721654|gb|EOY13551.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao] Length = 907 Score = 664 bits (1713), Expect = 0.0 Identities = 381/768 (49%), Positives = 492/768 (64%), Gaps = 14/768 (1%) Frame = -1 Query: 2264 ERDLSREYEHGQDQES-MSEHSXXXXXXXXXGQDKNKGHDKSRVREKDYEKQLEREKVXX 2088 +RD RE EH +++E + +K +G DK R R+++ EK+ ++ K Sbjct: 112 DRDKYREKEHEREREKDRKDRGKEKDRERGRDSEKERGKDKGRDRDREKEKERDKAK--- 168 Query: 2087 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKENDREKN-RLERAKDKEKSREE 1911 ER++K+ ++E+ +R +D+EK +E Sbjct: 169 ---------------------------------EREKKDREKEREGEKDRDRDREKGKER 195 Query: 1910 SV-RNRETDLDKSRTKDKERDAGRAKGGEENERTKVGGGGIDIVRXXXXXXXXXXXNHES 1734 S ++RE DL+K R++D++ +A + E+ E +K G +D + + Sbjct: 196 SKQKSREADLEKERSRDRD-NAIKKNHEEDYEGSKDGELALDY---------GDSRDKDE 245 Query: 1733 FSLNDHDERPVGSQSTSELGECXXXXXXXXXXXRADDALEISSWVNKSRRLEEKRNTESE 1554 LN V S+SEL E +++ E+ WV R+LEEKRN E E Sbjct: 246 AELNAGSNAGVAQASSSELEERIARMKEERLKKKSEGVSEVLEWVGNFRKLEEKRNAEKE 305 Query: 1553 KAERTSRILDEQDNV--GEESDDETAGHSANDLAGIKILHGLEKVIEGGNVVLTLKDQSI 1380 
KA + S+I +EQD+ GE D+E H+A+DLAG+K+LHGL+KV++GG VVLTLKDQSI Sbjct: 306 KALQRSKIFEEQDDFVQGENEDEEAVRHAAHDLAGVKVLHGLDKVMDGGAVVLTLKDQSI 365 Query: 1379 LTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSSHKTILPQYDDPV 1200 L +GDINE+VDMLENVEIGEQRRRD+AYKAAKK TG+Y+DKFND+ S K ILPQYD+PV Sbjct: 366 LANGDINEDVDMLENVEIGEQRRRDEAYKAAKKKTGVYDDKFNDEPGSEKKILPQYDNPV 425 Query: 1199 EDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTSDYFTPDEMLQFX 1020 DEGVTLDE GRFTGEA R+QG T+ EDLN+ K SDY+T +EML+F Sbjct: 426 ADEGVTLDERGRFTGEAEKKLQELRKRLQGVPTNNRVEDLNNAGKIASDYYTQEEMLKFK 485 Query: 1019 XXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXXXXXXXEMRSNAY 840 LD+DALEAEAISSGLG GDLGSR D++R + E R++AY Sbjct: 486 KPKKKKALRKKEKLDIDALEAEAISSGLGAGDLGSRNDARRQAIREEEARSEAEKRNSAY 545 Query: 839 QSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARKLARKRKEEAAPS 660 QSA AKA+EASK L EQ L VK ED+ VF +D +DL S+E++RKLA K K+E S Sbjct: 546 QSAYAKADEASKSLWLEQTLIVKPEEDENQVFADDDDDLYKSIERSRKLAFK-KQEDEKS 604 Query: 659 GPQAVALLATTN--KEQEDTQGSTVGEPQENKVVITEMEEFVLGLQLNEETHKPESEDVF 486 GPQA+AL ATT + D Q +T GE QENK+VITEMEEFV GLQ +EE HKP+SEDVF Sbjct: 605 GPQAIALRATTAAISQTADDQTTTTGEAQENKLVITEMEEFVWGLQHDEEAHKPDSEDVF 664 Query: 485 KDEDDIPKPVEHE---MDSQIGGWTEIKETNKNQQPPSEENDDVAPDEIIHEVAVXXXXX 315 DED++P EH+ ++++GGWTE+ + + ++ P +E+ DD+ PDE IHEVAV Sbjct: 665 MDEDEVPGVSEHDGKSGENEVGGWTEVVDASTDENPSNEDKDDIVPDETIHEVAVGKGLS 724 Query: 314 XXXXXXKDRGTLKESIDWGGRNMDKKKSKLVGIND----NDGPKEIRIERTDEFGRIMTP 147 KDRGTLKESI+WGGRNMDKKKSKLVGI D ND K+IRIERTDEFGRI+TP Sbjct: 725 GALKLLKDRGTLKESIEWGGRNMDKKKSKLVGIVDDDRENDRFKDIRIERTDEFGRIITP 784 Query: 146 KEAFRVISHKFHGKGPGKMKQEKRMKQFQEDLKTKQMKASDTPLLAME 3 KEAFRV+SHKFHGKGPGKMKQEKR KQ+QE+LK KQMK SDTP L++E Sbjct: 785 KEAFRVLSHKFHGKGPGKMKQEKRQKQYQEELKLKQMKNSDTPSLSVE 832 >ref|XP_002516516.1| conserved hypothetical protein [Ricinus communis] gi|223544336|gb|EEF45857.1| conserved hypothetical protein [Ricinus communis] Length = 873 Score = 662 bits (1709), Expect = 0.0 Identities = 394/782 (50%), Positives = 492/782 (62%), Gaps = 
29/782 (3%) Frame = -1 Query: 2261 RDLSREYEHGQDQESMSEHSXXXXXXXXXGQDKNKGHDKSRVREKDYEKQLEREKVXXXX 2082 +D + + D+E + + S +D+ K +K R R + EK+LERE+ Sbjct: 58 KDSDKNQDEYMDRECVKDRSSRDSKVRDKDKDREKTREKDRER-RGKEKELERERER--- 113 Query: 2081 XXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKENDREKNRLERAKDKEKSRE-ESV 1905 ERD KE D+E+ + E+++D+ K RE E Sbjct: 114 -------------------------------ERD-KEVDKERGK-EKSRDRNKDREREKY 140 Query: 1904 RNRETDLD------KSRTKDKER--DAGRAKGG-------EENERTKVGGGGIDIVRXXX 1770 ++RE D D K +TK+KE D R + G EEN+R+K I++ Sbjct: 141 KDREVDKDRDVQKGKEKTKEKEEFHDKDRLRDGVSKRSHEEENDRSK--NDTIEMGYERE 198 Query: 1769 XXXXXXXXNHESFSLNDHDERPV------GSQSTSELGECXXXXXXXXXXXRADDALEIS 1608 SF ++ DE+ V G S+ E E +D E+ Sbjct: 199 RNSDVGKQKKVSFDDDNDDEQKVERTSGGGLASSLEFEERILKVREERLKKNSDAGSEVL 258 Query: 1607 SWVNKSRRLEEKRNTESEKAERTSRILDEQDNVGE-ESDDETAGHSA-NDLAGIKILHGL 1434 SWVN+SR+L EK+N E +KA++ S++ +EQD + + ES+DE AG A NDLAG+K+LHGL Sbjct: 259 SWVNRSRKLAEKKNAEKKKAKQLSKVFEEQDKIVQGESEDEEAGELATNDLAGVKVLHGL 318 Query: 1433 EKVIEGGNVVLTLKDQSILTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKF 1254 EKV+EGG VVLTLKDQSIL DGDINEEVDMLEN+EIGEQ+RR++AYKAAKK TG+Y+DKF Sbjct: 319 EKVMEGGAVVLTLKDQSILVDGDINEEVDMLENIEIGEQKRRNEAYKAAKKKTGIYDDKF 378 Query: 1253 NDDLSSHKTILPQYDDPVEDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNS 1074 NDD +S + ILPQYDDP DEGVTLDE GRFTGEA R+QG T FEDLNS Sbjct: 379 NDDPASERKILPQYDDPTTDEGVTLDERGRFTGEAEKKLEELRRRLQGALTDNCFEDLNS 438 Query: 1073 TAKNTSDYFTPDEMLQFXXXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRL 894 + K +SD++T +EMLQF LD+DALEAEA+S+GLGVGDLGSR D +R Sbjct: 439 SGKMSSDFYTHEEMLQFKKPKKKKSLRKKEKLDIDALEAEAVSAGLGVGDLGSRSDGRRQ 498 Query: 893 SXXXXXXXXXXEMRSNAYQSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENS 714 + E RS+AYQSA AKA+EASK LR EQ L K E++ VF +D EDL S Sbjct: 499 AIREEQERSEAERRSSAYQSAYAKADEASKSLRLEQTLPAKVNEEENPVFADDDEDLFKS 558 Query: 713 LEQARKLARKRKEEAAPSGPQAVALLAT-TNKEQEDTQGSTVGEPQENKVVITEMEEFVL 537 LE+ARKLA K++EEA SGPQA+A LAT TN + D Q GE QENKVV TEMEEFV Sbjct: 559 
LERARKLALKKQEEA--SGPQAIARLATATNNQIADDQNPADGESQENKVVFTEMEEFVW 616 Query: 536 GLQLNEETHKPESEDVFKDEDDIPKPVEHEMDSQIGGWTEIKETNKNQQPPSEENDDVAP 357 GLQL+EE+HKP SEDVF DED P+ + EM + G WTE+ + ++ +E +DV P Sbjct: 617 GLQLDEESHKPGSEDVFMDEDAAPRVSDQEMKDEAGRWTEVNDAAEDDNSVNENKEDVVP 676 Query: 356 DEIIHEVAVXXXXXXXXXXXKDRGTLKESIDWGGRNMDKKKSKLVGINDNDGP----KEI 189 DE IHEVAV K+RGTLKE++DWGGRNMDKKKSKLVGI D+D KEI Sbjct: 677 DETIHEVAVGKGLSGALKLLKERGTLKETVDWGGRNMDKKKSKLVGIVDSDADNEKFKEI 736 Query: 188 RIERTDEFGRIMTPKEAFRVISHKFHGKGPGKMKQEKRMKQFQEDLKTKQMKASDTPLLA 9 RIER DEFGRIMTPKEAFR+ISHKFHGKGPGKMKQEKRMKQ+QE+LK KQMK SDTP + Sbjct: 737 RIERMDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSES 796 Query: 8 ME 3 +E Sbjct: 797 VE 798 >ref|XP_006836392.1| PREDICTED: SART-1 family protein DOT2 [Amborella trichopoda] gi|548838910|gb|ERM99245.1| hypothetical protein AMTR_s00092p00135160 [Amborella trichopoda] Length = 1028 Score = 662 bits (1707), Expect = 0.0 Identities = 375/767 (48%), Positives = 486/767 (63%), Gaps = 12/767 (1%) Frame = -1 Query: 2267 KERDLSREYEHGQDQESMSEHSXXXXXXXXXGQDKNKGHDKSRVREKDYEKQLEREKVXX 2088 +E+D +E EH +D+E EH G++K ++ R ++KD EK+ +E+ Sbjct: 204 REKDREKEREHDRDREKEREHDRDRERTRERGKEKEIEKEREREKDKDREKEKNKEREKE 263 Query: 2087 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKENDREKNRL-ERAKDKEKSRE- 1914 ERD E DR K +L E+ K+++K RE Sbjct: 264 KDRSKDREKLRDRQREKDI--------------ERDGLEKDRMKEKLREKEKERDKYREK 309 Query: 1913 ESVRNRETDLDKSRTKDKERDA--GRAKGGEENERTKVGG-GGIDIVRXXXXXXXXXXXN 1743 E + ++E D K ++KD RD R K GE+ + K+ G DI Sbjct: 310 ERISDKERDKVKGKSKDHGRDKEFDRGKEGEKEAKPKIDAWDGRDITEQEDNVQDDKDNT 369 Query: 1742 HESFSLNDHDERP-----VGSQSTSELGECXXXXXXXXXXXRADDALEISSWVNKSRRLE 1578 ++ DH E+ V STSE+ E + + E+SSWVNKSR++E Sbjct: 370 YDRTGAMDHKEKNEIQAGVSRPSTSEIEERLAKMREERMKKKNEGVSEVSSWVNKSRKIE 429 Query: 1577 EKRNTESEKAERTSRILDEQDNVGEESDDET-AGHSANDLAGIKILHGLEKVIEGGNVVL 1401 EK ++E EKA +++ EQD+V +ESD+E A HS DLAG+K+LHGLE+VI GG VVL Sbjct: 430 EKLSSEKEKALHLAKVFAEQDSVVQESDEEEEAQHSGKDLAGVKVLHGLEQVIVGGAVVL 489 
Query: 1400 TLKDQSILTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSSHKTIL 1221 TLKDQ+IL DGD+N EVDMLENVE+GEQ+RRD+AYKAAKK G+YEDKF DD S K IL Sbjct: 490 TLKDQNILADGDLNNEVDMLENVELGEQKRRDEAYKAAKKKPGIYEDKFADDDGSQKKIL 549 Query: 1220 PQYDDPVEDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTSDYFTP 1041 PQYDD +DEGV LDE+G T EA R+QG ST + FEDL +T K +SDY+T Sbjct: 550 PQYDDTSKDEGVALDESGHITREAQKKLEELRKRLQGASTGQHFEDLTATGKVSSDYYTQ 609 Query: 1040 DEMLQFXXXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXXXXXXX 861 +EMLQF LDLDALEAEAI+SGLGVGD GSR D++R Sbjct: 610 EEMLQFKKPKKKKALRKKVKLDLDALEAEAIASGLGVGDRGSRADAQRQRAKEEEEWAEA 669 Query: 860 EMRSNAYQSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARKLARKR 681 E R AYQSA AKA E++K LR+EQ L V+ ED+ + FG+D EDL S+E+ARKLARK+ Sbjct: 670 ETRKEAYQSAFAKANESTKALREEQTLKVEGDEDENLAFGDD-EDLHKSIEEARKLARKK 728 Query: 680 KEEAAPSGPQAVALLATTNKEQEDTQGSTVGEPQENKVVITEMEEFVLGLQLNEETHKPE 501 ++E A SGP AVA LA + E +D + S GEPQEN++V TE++EFVLGLQ +E P+ Sbjct: 729 QDEGAASGPLAVAQLAVSASESKDAEAS--GEPQENRLVFTEVDEFVLGLQHDEGAQNPD 786 Query: 500 SEDVFKDEDDIPKPV-EHEMDSQIGGWTEIKETNKNQQPPSEENDDVAPDEIIHEVAVXX 324 +EDVFK++D++ P+ + E Q+GGWT++ E+ K++Q +EE+++V PD I E V Sbjct: 787 AEDVFKEDDEVQNPIKQDEPMEQVGGWTDVIESEKDEQMKTEEDEEVVPDATIQEAVVGK 846 Query: 323 XXXXXXXXXKDRGTLKESIDWGGRNMDKKKSKLVGINDNDGPKEIRIERTDEFGRIMTPK 144 K+RGTLKE+IDWGGRNMDKKKSKLVG+ +NDG KEI ++R DEFGRIMTPK Sbjct: 847 GLSGALQLLKERGTLKEAIDWGGRNMDKKKSKLVGVRENDGAKEIVLDRLDEFGRIMTPK 906 Query: 143 EAFRVISHKFHGKGPGKMKQEKRMKQFQEDLKTKQMKASDTPLLAME 3 EAFR +SHKFHGKGPGKMKQEKRMKQF E+LK KQMKASDTPLL+ME Sbjct: 907 EAFRKLSHKFHGKGPGKMKQEKRMKQFMEELKLKQMKASDTPLLSME 953 >gb|KHG25959.1| U4/U6.U5 tri-snRNP-associated 1 [Gossypium arboreum] Length = 955 Score = 661 bits (1706), Expect = 0.0 Identities = 384/796 (48%), Positives = 502/796 (63%), Gaps = 42/796 (5%) Frame = -1 Query: 2264 ERDLSREYEHGQDQES-MSEHSXXXXXXXXXGQDKNKGHDKSRVREKDYEKQLEREKVXX 2088 +RD RE EH +++E + +K +G DKSR R+++ EK+ ++ K Sbjct: 110 
DRDKHREKEHEREREKDRKDRGKDKDRERDRESEKERGKDKSRDRDREKEKERDKAKERE 169 Query: 2087 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKENDREKNRL-ERAKDKEKSREE 1911 ERD K DREK R E+ +D+EK ++ Sbjct: 170 K--------------------------------ERD-KLKDREKEREGEKDRDREKGKDR 196 Query: 1910 SV-RNRETDLDKSRTKDKERDAGRAKGGEENERTKVGGGGIDIVRXXXXXXXXXXXNHES 1734 S +NRETDL+K R++D++ + E+ E +K G +D + + Sbjct: 197 SKQKNRETDLEKERSRDRDNVVKNHE--EDYEGSKDGELALDY---------EDRRDKDE 245 Query: 1733 FSLNDHDERPVGSQSTSELGECXXXXXXXXXXXRADDALEISSWVNKSRRLEEKRNTESE 1554 LN + S+SEL E +++ E+S+WV++SR+LE+KRN E E Sbjct: 246 AELNAGSNASLVQASSSELEERIVRMKEVRLKKKSEGLSEVSAWVSRSRKLEDKRNAEKE 305 Query: 1553 KAERTSRILDEQDNV--GEESDDETAGHSANDLAGIKILHGLEKVIEGGNVVLTLKDQSI 1380 KA + S+I +EQDN GE+ D+E ++DL G+K+LHGL+KV++GG VVLTLKDQSI Sbjct: 306 KALQLSKIFEEQDNFVQGEDEDEEADNRPSHDLGGVKVLHGLDKVMDGGAVVLTLKDQSI 365 Query: 1379 LTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSSHKTILPQYDDPV 1200 L DGD+NE+VDMLEN+EIGEQ++RD+AYKAAKK TG+Y+DKFN+D S K ILPQYDDPV Sbjct: 366 LADGDLNEDVDMLENIEIGEQKQRDEAYKAAKKKTGVYDDKFNEDPGSEKKILPQYDDPV 425 Query: 1199 EDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTSDYFTPDEMLQFX 1020 DEGVTLDE GRFTGEA R+ G T+ EDLN+ K +SDY+T +EML+F Sbjct: 426 ADEGVTLDERGRFTGEAEKKLDELRKRLLGVPTNNRVEDLNNVGKVSSDYYTQEEMLRFK 485 Query: 1019 XXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXXXXXXXEMRSNAY 840 LD+DALEAEA+S+GLG GDLGSR DS+R + E R+NAY Sbjct: 486 KPKKKKALRKKEKLDIDALEAEAVSAGLGAGDLGSRNDSRRQAIKEEEARSEAEKRNNAY 545 Query: 839 QSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARKLARKRKEEAAPS 660 Q+A AKA+EASK LR EQ LTVK ED+ VF +D EDL SLE+AR+LA K++EE S Sbjct: 546 QAAFAKADEASKSLRLEQTLTVKPEEDENQVFADDEEDLYKSLEKARRLALKKQEE--KS 603 Query: 659 GPQAVALLATTNKEQE--DTQGSTVGEPQENKVVITEMEEFVLGLQLNE----------- 519 GPQAVALLA T+ + D Q ++ GE QENKVVITEMEEFV GLQL+E Sbjct: 604 GPQAVALLAATSASNQTTDDQNTSTGEAQENKVVITEMEEFVWGLQLDEATKSSAKIWNI 663 Query: 518 ----------------ETHKPESEDVFKDEDDIPKPVEHEM---DSQIGGWTEIKETNKN 396 E HKP+SEDVF DED++P E + ++++GGWTE+ +T+ + Sbjct: 664 
FSFMGSCVRLMLIWSSEAHKPDSEDVFMDEDEVPGASEQDRENGENEVGGWTEVVDTSAD 723 Query: 395 QQPPSEENDDVAPDEIIHEVAVXXXXXXXXXXXKDRGTLKESIDWGGRNMDKKKSKLVGI 216 ++P +E+N++V PDE IHE+AV KDRGTLKE+I+WGGRNMDKKKSKLVGI Sbjct: 724 EKPANEDNNEVVPDETIHEIAVGKGLSGALKLLKDRGTLKETIEWGGRNMDKKKSKLVGI 783 Query: 215 NDNDGP-----KEIRIERTDEFGRIMTPKEAFRVISHKFHGKGPGKMKQEKRMKQFQEDL 51 D+D K+IRIERTDEFGRI+TPKEAFR++SHKFHGKGPGKMKQEKRMKQ+QE+L Sbjct: 784 VDDDHQTDNRFKDIRIERTDEFGRIVTPKEAFRMLSHKFHGKGPGKMKQEKRMKQYQEEL 843 Query: 50 KTKQMKASDTPLLAME 3 K KQMK SDTP L++E Sbjct: 844 KLKQMKNSDTPSLSVE 859 >ref|XP_007225495.1| hypothetical protein PRUPE_ppa000914mg [Prunus persica] gi|596285693|ref|XP_007225496.1| hypothetical protein PRUPE_ppa000914mg [Prunus persica] gi|462422431|gb|EMJ26694.1| hypothetical protein PRUPE_ppa000914mg [Prunus persica] gi|462422432|gb|EMJ26695.1| hypothetical protein PRUPE_ppa000914mg [Prunus persica] Length = 963 Score = 658 bits (1697), Expect = 0.0 Identities = 394/816 (48%), Positives = 494/816 (60%), Gaps = 62/816 (7%) Frame = -1 Query: 2264 ERDLSREYEH--GQDQESMSEHSXXXXXXXXXGQDKNKGHDKSRVREK--DYEKQLEREK 2097 +R+ RE EH G+D++ + +D ++G DK R +EK D +K EREK Sbjct: 111 DRESHRETEHERGKDRKDRGKEKEREKEREVE-KDSDRGRDKERGKEKIKDRDKDKEREK 169 Query: 2096 VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDE-KENDREKNRLERAKDKEKS 1920 ERD KE +REK R E+ KD+EK Sbjct: 170 ------------------------------------ERDRAKEKEREKER-EKHKDREKG 192 Query: 1919 RE-------ESVRN--RETDLDKSRTKDKERDAGRAKGGEENERTKVGGGGIDIVRXXXX 1767 RE E V++ RE + + KDK RD + +EN GG D + Sbjct: 193 RENYKDTDRERVKDKYREKEREVDHDKDKSRDRVSRRSLDENYEWSKDGGRDDKAKLNEE 252 Query: 1766 XXXXXXXNHESFSLNDHDERPV------GSQSTSELGECXXXXXXXXXXXRADDALEISS 1605 S N DER S EL E + +D E+ + Sbjct: 253 YTGDKDIKQGKVSHNAEDERKAEGLSGGAHLSALELEERIMKTKEERLKKKKEDVPEVLA 312 Query: 1604 WVNKSRRLEEKRNTESEKAERTSRILDEQDNVG--EESDDETAGHSANDLAGIKILHGLE 1431 WV++SR+LE+KRN E +KA + S+I +EQDN+G E D+ETA + +DLAG+K+LHGL+ Sbjct: 313 WVSRSRKLEDKRNAEKQKALQLSKIFEEQDNIGQGESEDEETAQDTTHDLAGVKVLHGLD 372 Query: 1430 
KVIEGGNVVLTLKDQSILTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFN 1251 KV+EGG VVLTLKDQ+IL DG +NE++DMLENVEIGEQ++RDDAYKAAKK TG+Y DKFN Sbjct: 373 KVMEGGAVVLTLKDQNILADGGVNEDIDMLENVEIGEQKQRDDAYKAAKKKTGIYVDKFN 432 Query: 1250 DDLSSHKTILPQYDDPVEDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNST 1071 DDL++ K ILPQYDDPV DEG+TLDE GRFTGEA RIQG T+ FEDLN + Sbjct: 433 DDLNTEKKILPQYDDPVPDEGLTLDERGRFTGEAEKKLEELRKRIQGVPTNNRFEDLNMS 492 Query: 1070 AKNTSDYFTPDEMLQF--XXXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKR 897 TSD++T +EMLQF LDLDALEAEA+S+GLGV DLGSR D+KR Sbjct: 493 GNITSDFYTQEEMLQFKKPKKGKKKSLRKKEKLDLDALEAEAVSAGLGVADLGSRNDAKR 552 Query: 896 LSXXXXXXXXXXEMRSNAYQSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLEN 717 + E R++AYQ A AKA+EASK LR EQILTV ED+ F +D +DL Sbjct: 553 QANKEEQERLEAERRNSAYQLAYAKADEASKSLRLEQILTVIPEEDETPAFADDDDDLYK 612 Query: 716 SLEQARKLARKRKEEAAPSGPQAVALLATT--NKEQEDTQGSTVGEPQENKVVITEMEEF 543 SLE+ARKLA K+KEE SGPQA+ALLATT + + D Q + GE Q+NKVV TEMEEF Sbjct: 613 SLERARKLALKKKEEETASGPQAIALLATTTASSQTADNQIPSTGESQDNKVVFTEMEEF 672 Query: 542 VLGLQLNEETHKPESEDVFKDEDDIPKPVEHEMDSQIGGWTEIKETNKNQQPPSEENDDV 363 V GLQL+EE+HKPESEDVF ED+ PKP E ++ GGWTE+K+ +++++P +E+ +++ Sbjct: 673 VWGLQLDEESHKPESEDVFMQEDEEPKPSHEERMNEPGGWTEVKDMDEDEKPATEDKEEI 732 Query: 362 APDEIIHEVAVXXXXXXXXXXXKDRGTLKESIDWGGRNMDKKKSKLVGI-NDNDGPKE-- 192 PDE IHEVAV KDRGTLKE I+WGGRNMDKKKSKL+GI +D+D PKE Sbjct: 733 VPDETIHEVAVGKGLSGVLKLLKDRGTLKEGIEWGGRNMDKKKSKLLGIVDDDDEPKEPH 792 Query: 191 ---------------------------------IRIERTDEFGRIMTPKEAFRVISHKFH 111 I IERTDEFGR +TPKEAFR +SHKFH Sbjct: 793 TSRQKKDEHKDTRPSSSSHQKETRPSKVYQEKDIHIERTDEFGRTLTPKEAFRTLSHKFH 852 Query: 110 GKGPGKMKQEKRMKQFQEDLKTKQMKASDTPLLAME 3 GKGPGKMKQEKRMKQ+QE+LK KQMK+SDTP L+ E Sbjct: 853 GKGPGKMKQEKRMKQYQEELKLKQMKSSDTPSLSAE 888 >ref|XP_008390895.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1-like [Malus domestica] gi|657997037|ref|XP_008390896.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1-like [Malus domestica] gi|657997039|ref|XP_008390897.1| PREDICTED: U4/U6.U5 
tri-snRNP-associated protein 1-like [Malus domestica] Length = 946 Score = 648 bits (1671), Expect = 0.0 Identities = 388/803 (48%), Positives = 485/803 (60%), Gaps = 48/803 (5%) Frame = -1 Query: 2267 KERDLSREYEHGQDQESMSEHSXXXXXXXXXGQDKNKGHDKSRVREKDYEKQLEREKVXX 2088 KER+ RE E D+ E N+ D+ R +E+D K+ EREK Sbjct: 130 KEREKEREAEKDSDRGREKERG-------------NRDKDREREKERDRAKEKEREK--- 173 Query: 2087 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKENDREKNRLERAKDKEKSR-EE 1911 ER EK DREK R E KD ++ R ++ Sbjct: 174 ---------------------------------ER-EKHKDREKGR-ESYKDTDRERVKD 198 Query: 1910 SVRNRETDLDKSRTKDKERDAGRAKGGEENERTKVGGGGI---DIVRXXXXXXXXXXXNH 1740 R +E ++D+ KDK RD G + E +++ K+ G DI++ H Sbjct: 199 KYREKEREVDQD--KDKSRDRGSRRSVERDDKLKLNGDDNRDKDILKQGKVSHNAEDERH 256 Query: 1739 -ESFSLNDHDERPVGSQSTSELGECXXXXXXXXXXXRADDALEISSWVNKSRRLEEKRNT 1563 + S H S SEL E + +D E+ +WV+KSR++EEKRN Sbjct: 257 ADGLSSGTH-------LSASELEERILKTKEERLKKKTEDVPEVLAWVSKSRKIEEKRNA 309 Query: 1562 ESEKAERTSRILDEQDNVG--EESDDETAGHSANDLAGIKILHGLEKVIEGGNVVLTLKD 1389 E +KA + S+I +EQDN+G E D+ETA +DLAG+K+LHGL+KV+EGG VVLTLKD Sbjct: 310 EKQKALQLSKIFEEQDNIGQGESEDEETAQDPTHDLAGVKVLHGLDKVMEGGAVVLTLKD 369 Query: 1388 QSILTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSSHKTILPQYD 1209 Q+IL DGDINE++DMLENVEIGEQ++RDDAYKAAKK G Y DKFNDD + K +LPQYD Sbjct: 370 QNILADGDINEDIDMLENVEIGEQKQRDDAYKAAKKKRGAYVDKFNDDPGTEKKMLPQYD 429 Query: 1208 DPVEDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTSDYFTPDEML 1029 DP DEG+TLDE GRFTGEA RIQG T FEDLN + K +SD++T DEML Sbjct: 430 DPTPDEGLTLDERGRFTGEAEKKLEELRKRIQGVPTKDRFEDLNMSGKISSDFYTQDEML 489 Query: 1028 QFXXXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXXXXXXXEMRS 849 QF LDLDALEAEA+S+GLGV DLGSR D+KR + E R+ Sbjct: 490 QFKKPKKKKSLRKREKLDLDALEAEAVSAGLGVEDLGSRNDAKRRASKEEQERLEAERRN 549 Query: 848 NAYQSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARKLA-RKRKEE 672 +AYQ A A+A+EASK LR EQ L+VK ED+ VF +D +DL SLE+ARKLA +K++EE Sbjct: 550 SAYQLAYARADEASKSLRLEQTLSVKREEDENPVFADDDDDLYKSLEKARKLALKKKEEE 609 Query: 671 
AAPSGPQAVALLATT--NKEQEDTQGSTVGEPQENKVVITEMEEFVLGLQLNEETHKPES 498 SGPQA+ALLATT + + D Q + GE Q+NKVV TEMEEFV GLQL+EE+HKPES Sbjct: 610 KTVSGPQAIALLATTTASSQTADDQIPSTGESQDNKVVFTEMEEFVWGLQLDEESHKPES 669 Query: 497 EDVFKDEDDIPKPVEHEMDSQIGGWTEIKETNKNQQPPSEENDDVAPDEIIHEVAVXXXX 318 EDVF ED+ P E +MD + GGWTE+ + ++++QP +E+ D+V PDE IHEVAV Sbjct: 670 EDVFMQEDEPEVPHEEKMD-EPGGWTEVNDMDEDKQPENEDKDEVVPDETIHEVAVGKGL 728 Query: 317 XXXXXXXKDRGTLKESIDWGGRNMDKKKSKLVGINDND---------------------- 204 KDRGTLKE IDWGGRNMDKKKSKL GI D+D Sbjct: 729 SGVLKLLKDRGTLKEGIDWGGRNMDKKKSKLFGIVDDDEEEQPKETHTSRQKKDEPRDTR 788 Query: 203 ----------------GPKEIRIERTDEFGRIMTPKEAFRVISHKFHGKGPGKMKQEKRM 72 K+IRIERTDEFGR +TPKEAFR++SHKFHGKGPGKMKQEKRM Sbjct: 789 SSSSSHQKDTRAPKVYQEKDIRIERTDEFGRTLTPKEAFRILSHKFHGKGPGKMKQEKRM 848 Query: 71 KQFQEDLKTKQMKASDTPLLAME 3 KQ+QE+LK KQMK+SDTP L+ E Sbjct: 849 KQYQEELKLKQMKSSDTPSLSAE 871 >ref|XP_002297938.2| hypothetical protein POPTR_0001s11550g [Populus trichocarpa] gi|550347020|gb|EEE82743.2| hypothetical protein POPTR_0001s11550g [Populus trichocarpa] Length = 862 Score = 646 bits (1666), Expect = 0.0 Identities = 382/775 (49%), Positives = 488/775 (62%), Gaps = 20/775 (2%) Frame = -1 Query: 2267 KERDLSREYEHGQDQESMSEHSXXXXXXXXXGQDKNKGHDKSR---VREKDYEKQLEREK 2097 +ER+ R+ +H S+ + D + G DKSR V++K+Y+++ REK Sbjct: 41 EERERERDRDHKSKDRERSKKT----------SDNDVGKDKSRDSKVKDKEYDREKSREK 90 Query: 2096 VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKENDREKNRLERAKDKEKSR 1917 K E+D ++ND+E+ ER ++K K R Sbjct: 91 DKDRKDRGKEKERERDREKKEKERERVKEKEKHKDREKD-RDNDKER---ERGREKTKER 146 Query: 1916 EESVRNRETDLDKSRTKDKERDAGRAKGGEENERTKVGGGGIDIVRXXXXXXXXXXXNHE 1737 E R+RE D DK R+++K+R + K EE+ KV D V Sbjct: 147 E---RDREADQDKERSREKDRAS--RKSNEEDYDDKVQMDYEDEV-------DKDNRKQG 194 Query: 1736 SFSLNDHDERPV------GSQSTSELGECXXXXXXXXXXXRADDALEISSWVNKSRRLEE 1575 S D D++ S SELG+ +++ +I +WV KSR++EE Sbjct: 195 KVSFRDEDDQSAEGASAGAHSSASELGQRILKMKEERTKKKSEPGSDILAWVGKSRKIEE 254 Query: 1574 
KRNTESEKAERTSRILDEQDNVGEE-SDDETAG-HSANDLAGIKILHGLEKVIEGGNVVL 1401 + ++A+ S+I +EQDN+G+ SDDE A H+A +LAGIK+L GL+KV+EGG VVL Sbjct: 255 NKYAAKKRAKHLSKIFEEQDNIGQGGSDDEEADQHNAYNLAGIKVLDGLDKVLEGGAVVL 314 Query: 1400 TLKDQSILTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSSHKTIL 1221 TLKDQ+IL DGDINEEVDMLENVEIGEQ+RRD+AYKAAKK TG+YEDKFNDD +S K +L Sbjct: 315 TLKDQNILADGDINEEVDMLENVEIGEQKRRDEAYKAAKKKTGIYEDKFNDDPASEKKML 374 Query: 1220 PQYDDPVEDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTSDYFTP 1041 PQYDD DEGVTLDE GRFTGEA R+QG STS EDLNS+ K +SDYFT Sbjct: 375 PQYDDANADEGVTLDERGRFTGEAEKKLEELRRRLQGTSTSARLEDLNSSGKISSDYFTH 434 Query: 1040 DEMLQFXXXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXXXXXXX 861 +EMLQF LD+DALEAEA+S+GLG+GDLGSRKD +R + Sbjct: 435 EEMLQFKKPKKKKSLRKKDKLDIDALEAEAVSAGLGIGDLGSRKDGRRQAIREEQERSEA 494 Query: 860 EMRSNAYQSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARKLARKR 681 EMR+NAYQSA AKA+EASK LR ++ L K E++ +VF +D EDL SLE+ARKLA K Sbjct: 495 EMRNNAYQSAYAKADEASKSLRLDRTLQTKVEEEENLVFADDEEDLYKSLERARKLALK- 553 Query: 680 KEEAAPSGPQAVALLATTNKEQE--DTQGSTVGEPQENKVVITEMEEFVLGLQLNEETHK 507 K+EA SGP A+A LA+T + D + GE ENK+V TEMEEFV +QL EE HK Sbjct: 554 KQEAEASGPLAIAHLASTTLSSQIADDKNPETGESHENKLVFTEMEEFVSAIQLAEEVHK 613 Query: 506 PESEDVFKDEDDIPKPVEHEMDSQIGGWTEIKETNKNQQPPSEENDDVAPDEIIHEVAVX 327 P++EDVF DED+ P+ + E + GGW E+ + +K++ P +E+ +++ PDE IHEVAV Sbjct: 614 PDNEDVFMDEDEPPRVSDEEQKDEAGGWMEVPDNSKDENPVNED-EEIVPDETIHEVAVG 672 Query: 326 XXXXXXXXXXKDRGTLKESIDWGGRNMDKKKSKLVGINDNDGP-------KEIRIERTDE 168 K+RGTLKESIDWGGRNMDKKKSKLVGI D+D K+IRIERTDE Sbjct: 673 KGLSGALKLLKERGTLKESIDWGGRNMDKKKSKLVGIVDDDVGTNNDNKFKDIRIERTDE 732 Query: 167 FGRIMTPKEAFRVISHKFHGKGPGKMKQEKRMKQFQEDLKTKQMKASDTPLLAME 3 FGRIMTPKEAFR+ISHKFHGKGPGKMKQEKRMKQ+QE+LK KQMK SDTP L++E Sbjct: 733 FGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVE 787 >ref|XP_010102332.1| hypothetical protein L484_015280 [Morus notabilis] gi|587905102|gb|EXB93293.1| hypothetical protein L484_015280 [Morus notabilis] Length = 952 Score = 645 bits 
(1665), Expect = 0.0 Identities = 377/798 (47%), Positives = 485/798 (60%), Gaps = 43/798 (5%) Frame = -1 Query: 2267 KERDLSREYEHGQDQESMSEHSXXXXXXXXXGQDKNKGHDKSRVREKDYEKQLEREKVXX 2088 KERD+ ++ + G+D+E E KN DK R +E+D ++ +RE+ Sbjct: 136 KERDVEKDSDRGRDKERGKE--------------KNNDRDKEREKERDKGREKDRER--- 178 Query: 2087 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGWERDEKENDREKNR---LERAKDKEKSR 1917 ER EK DREK R + K+KEK++ Sbjct: 179 ---------------------------------ER-EKHRDREKGRENYKDTDKEKEKAK 204 Query: 1916 EE-SVRNRETDLDKSRTKDK------ERDAGRAKGGEENERTKVGGGGIDIVRXXXXXXX 1758 E+ + RE D DK +++D+ E D K G +++TK+ D + Sbjct: 205 EKIKEKEREADQDKEKSRDRVSKKSVEEDYELGKDGGRDDKTKLD----DDNKKDREAKQ 260 Query: 1757 XXXXNHESFSLNDHDERPVGSQSTSELGECXXXXXXXXXXXRADDALEISSWVNKSRRLE 1578 + HD +T+EL + + +D E+ +WVNKSR+LE Sbjct: 261 GNVSQYIDGEQITHDISHKAHLTTTELEKRILKMKQERSKKKTEDVPEVLAWVNKSRKLE 320 Query: 1577 EKRNTESEKAERTSRILDEQDN-VGEESDDETAGHSANDLAGIKILHGLEKVIEGGNVVL 1401 EK+N E EKA + S+I +EQDN V E+S+DE +LAG+K+LHG++KV+EGG VVL Sbjct: 321 EKKNDEKEKALQLSKIFEEQDNIVQEDSEDEETTTQHYNLAGVKVLHGIDKVMEGGAVVL 380 Query: 1400 TLKDQSILTDGDINEEVDMLENVEIGEQRRRDDAYKAAKKTTGLYEDKFNDDLSSHKTIL 1221 TLKDQ+IL DGDIN E+DMLENVEIGEQ+RRD+AYKAAKK G+Y DKFNDD +S + +L Sbjct: 381 TLKDQNILADGDINLEIDMLENVEIGEQKRRDEAYKAAKKKVGIYVDKFNDDPNSERKML 440 Query: 1220 PQYDDPVEDEGVTLDETGRFTGEAXXXXXXXXXRIQGGSTSRAFEDLNSTAKNTSDYFTP 1041 PQYDDP D GVT+DE GR T EA R+QG ST+ FEDL+ K +SDY+T Sbjct: 441 PQYDDPSTDVGVTIDERGRITSEAEKKLEELRRRLQGASTNSRFEDLSFPGKVSSDYYTS 500 Query: 1040 DEMLQFXXXXXXXXXXXXXXLDLDALEAEAISSGLGVGDLGSRKDSKRLSXXXXXXXXXX 861 +EM+QF LD+DALEAEA+S+GLGVGDLGSR D KR Sbjct: 501 EEMMQFKKPKKKKSLRKKDKLDIDALEAEAVSAGLGVGDLGSRNDPKRQVIREEQDRAEA 560 Query: 860 EMRSNAYQSALAKAEEASKVLRQEQILTVKAVEDDGMVFGEDYEDLENSLEQARKLARKR 681 E R+NAY++A AKA+EASK LR EQ L VK E++ +VF +D ED ++E+ARK+A K+ Sbjct: 561 ERRNNAYKTAFAKADEASKSLRLEQTLPVKLEEEENLVFADDDEDFHKAVERARKIAVKK 620 Query: 680 KEEAAPSGPQAVALLATT--NKEQEDTQGSTVGEPQENKVVITEMEEFVLGLQLNEETHK 507 +++ PSGP+AVALLA T N + D Q + GE QENKVV 
TEMEEFV GLQL EE K Sbjct: 621 EDKETPSGPEAVALLAATIANSQPADEQNPS-GESQENKVVFTEMEEFVWGLQLEEEAQK 679 Query: 506 PESEDVFKDEDDIPKPVEHEMDSQIGGWTEIKETNKNQQPPSEENDDVAPDEIIHEVAVX 327 P++EDVF DED+ PK E+ ++ GGWTE+KETN ++ P EE +++ PD IIHEVAV Sbjct: 680 PDNEDVFMDEDEEPKAYNEEIKNEPGGWTEVKETNNDEHPSKEEEEEIVPDGIIHEVAVG 739 Query: 326 XXXXXXXXXXKDRGTLKESIDWGGRNMDKKKSKLVGINDNDGP----------------- 198 K+RGTLKESIDWGGRNMDKKKSKLVGI D+D P Sbjct: 740 KGLSGALKLLKERGTLKESIDWGGRNMDKKKSKLVGIVDDDEPGQQVHPKKDGTRTSSSS 799 Query: 197 -------------KEIRIERTDEFGRIMTPKEAFRVISHKFHGKGPGKMKQEKRMKQFQE 57 K+IRIERTDEFGRI+TPKEAFR+ISHKFHGKGPGKMKQEKRMKQ+QE Sbjct: 800 YSKETRASKVYEEKDIRIERTDEFGRILTPKEAFRIISHKFHGKGPGKMKQEKRMKQYQE 859 Query: 56 DLKTKQMKASDTPLLAME 3 +LK KQMK+SDTP ++E Sbjct: 860 ELKLKQMKSSDTPSQSVE 877