BLASTX nr result
ID: Perilla23_contig00002244
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00002244 (2205 letters)

Database: ./nr
77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_011094061.1| PREDICTED: SART-1 family protein DOT2 [Sesam...   981   0.0
ref|XP_012851195.1| PREDICTED: SART-1 family protein DOT2 [Eryth...   915   0.0
ref|XP_010656678.1| PREDICTED: SART-1 family protein DOT2 [Vitis...   835   0.0
ref|XP_004250062.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   822   0.0
ref|XP_009630824.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   820   0.0
ref|XP_006361674.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   820   0.0
ref|XP_010256356.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   811   0.0
ref|XP_012077379.1| PREDICTED: SART-1 family protein DOT2 isofor...   793   0.0
ref|XP_002516516.1| conserved hypothetical protein [Ricinus comm...   792   0.0
ref|XP_012441144.1| PREDICTED: SART-1 family protein DOT2 [Gossy...   784   0.0
ref|XP_006471158.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   772   0.0
ref|XP_007022025.1| U4/U6.U5 tri-snRNP-associated protein 1 isof...   767   0.0
ref|XP_011011622.1| PREDICTED: SART-1 family protein DOT2 isofor...   766   0.0
ref|XP_006431678.1| hypothetical protein CICLE_v10000233mg [Citr...   764   0.0
ref|XP_003530377.1| PREDICTED: zinc finger CCCH domain-containin...   764   0.0
ref|XP_007225495.1| hypothetical protein PRUPE_ppa000914mg [Prun...   763   0.0
gb|KHN38139.1| U4/U6.U5 tri-snRNP-associated protein 1 [Glycine ...   762   0.0
ref|XP_010033990.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   762   0.0
ref|XP_011011623.1| PREDICTED: SART-1 family protein DOT2 isofor...
760 0.0 ref|XP_002297938.2| hypothetical protein POPTR_0001s11550g [Popu... 760 0.0 >ref|XP_011094061.1| PREDICTED: SART-1 family protein DOT2 [Sesamum indicum] Length = 942 Score = 981 bits (2537), Expect = 0.0 Identities = 518/721 (71%), Positives = 571/721 (79%), Gaps = 11/721 (1%) Frame = -1 Query: 2160 ADQEKDRTSDRDKSIRKQKDESNDMSKD----GHSRFENDYTHNSQASKEKVVNFDEKSG 1993 ADQEKDR DR++S RKQKDES+D SKD GHSR ENDY+ + Q++KE N D+++ Sbjct: 222 ADQEKDRARDRERSSRKQKDESHDRSKDTDKDGHSRLENDYSRDKQSTKELADNSDDEND 281 Query: 1992 SKILDQTEKAE-----NRESTYELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXX 1828 SKIL EKA+ +R+S EL+ RISKMRE+RL K SEGA E+L+WVNRS Sbjct: 282 SKILKHQEKADTAIAGSRQSASELEDRISKMREERLKKPSEGASEVLAWVNRSRKLEEKR 341 Query: 1827 XXXXXXALQLSRIFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTL 1648 ALQLS+IFEEQDNMN ESD+E A + T++ LGGVK+LHGLDKVLEGGAVVLTL Sbjct: 342 TAEKEKALQLSKIFEEQDNMNGGESDEEAAAEHTTQDLGGVKILHGLDKVLEGGAVVLTL 401 Query: 1647 KDQSILANGDINEEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQ 1468 KDQSILA+GDINEEVDMLENVEIGEQKRRDEAY+A+KKK G+YDDKF+DE GAEKK+LPQ Sbjct: 402 KDQSILADGDINEEVDMLENVEIGEQKRRDEAYKAAKKKTGIYDDKFSDEPGAEKKILPQ 461 Query: 1467 YDDPVTDEGLILDSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEE 1288 YDDPV DEG+ LDSSGRF+GEA RIQGVS S+ GEDLNS KI TDYYTQ+E Sbjct: 462 YDDPVADEGVTLDSSGRFTGEAERKLEELRRRIQGVSTSTRGEDLNSTAKILTDYYTQDE 521 Query: 1287 MTXXXXXXXXXXXXXXXXXXXKXXXXXXXXXXXXXXXXGSRNDGRRQNLKDEQERIDAEM 1108 MT GSRNDGRRQNL++EQE+I+AEM Sbjct: 522 MTKFKKPKKKKSLRKKEKLDLDALEAEARSAGLGAGDLGSRNDGRRQNLREEQEKIEAEM 581 Query: 1107 RSNAYRSALAKADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQ- 931 R NAY SA AKADEASKALRQEQV MQTEE+DAP FGDDDDELRKSLERARKIALKKQ Sbjct: 582 RRNAYESAYAKADEASKALRQEQVPAMQTEEDDAPVFGDDDDELRKSLERARKIALKKQD 641 Query: 930 -EEKSGSFVIKLLASSNANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKPE 754 EEKS VI LLA+S+AN+ ++NPN S DQ ENKV+FTEMEEFVWGLQLDEEEK PE Sbjct: 642 EEEKSAPQVITLLATSSANDSTTENPNSGSVDQQENKVIFTEMEEFVWGLQLDEEEKNPE 701 Query: 753 
SEDVFMEEDVAPSTSDQEMRDEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGKG 574 SEDVFMEEDVAPSTSDQEM+DE GGW EV+E M DE +E +EEV PDETIHE AVGKG Sbjct: 702 SEDVFMEEDVAPSTSDQEMKDEAGGWAEVKETMKDETPAKEEKEEVVPDETIHESAVGKG 761 Query: 573 LAATLNLLKGRGTLKETIEWGGRNMDKKKSKLVGIVGEGGTKEIRIERTDEYGRILTPKE 394 LA L LLK RGTLKETIEWGGRNMDKKKSKLVGI KEIRIERTDEYGRILTPKE Sbjct: 762 LAGALKLLKDRGTLKETIEWGGRNMDKKKSKLVGIYDNDAAKEIRIERTDEYGRILTPKE 821 Query: 393 AFRLLSHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARMREAQAKLKAPYL 214 AFRLLSHKFHGKGPGKMKQEKRMRQYQEELK+KQMKNADTPSLSV RMREAQAKL+ PYL Sbjct: 822 AFRLLSHKFHGKGPGKMKQEKRMRQYQEELKVKQMKNADTPSLSVERMREAQAKLQTPYL 881 Query: 213 VLSGHVKPGQSSDPRSGFATVEKDLAGGLTPMSGDKKVEHFLNIKRKYENEDSSSQKKPK 34 VLSGHVKPGQSSDPR+ FATVEKD AGGLTPM GDKKVEHFLNIKRK E D++SQKKPK Sbjct: 882 VLSGHVKPGQSSDPRNTFATVEKDFAGGLTPMLGDKKVEHFLNIKRKPEPGDTASQKKPK 941 Query: 33 T 31 T Sbjct: 942 T 942 >ref|XP_012851195.1| PREDICTED: SART-1 family protein DOT2 [Erythranthe guttatus] gi|604311746|gb|EYU25740.1| hypothetical protein MIMGU_mgv1a000914mg [Erythranthe guttata] Length = 944 Score = 915 bits (2364), Expect = 0.0 Identities = 488/722 (67%), Positives = 563/722 (77%), Gaps = 13/722 (1%) Frame = -1 Query: 2157 DQEKDRTSDRDKSIRKQKDESNDM----SKDGHSRFENDYTHNSQASKEKVVNFDEKSGS 1990 DQEK+R DRD+S RKQKDES DM KDGH R ENDY+ ++Q++K +V N D ++ S Sbjct: 225 DQEKERARDRDRSSRKQKDESYDMVKDTEKDGHLRLENDYSRDNQSNKVRVDNSDGENDS 284 Query: 1989 KILDQTEKAE-----NRESTYELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXXX 1825 KIL Q ++AE N +S +L +RISKMR++RL+KSSEGA E+L+WVNRS Sbjct: 285 KILKQQDRAEKSVDGNSQSASDLGERISKMRQERLVKSSEGASEVLAWVNRSRKLEDKRT 344 Query: 1824 XXXXXALQLSRIFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTLK 1645 LQLS++FEEQDNMND +SDDE ATQ ++ LGGVKVLHGL+KVLEGGA+VLTLK Sbjct: 345 EKEKA-LQLSKVFEEQDNMNDGDSDDEAATQAVTESLGGVKVLHGLEKVLEGGAIVLTLK 403 Query: 1644 DQSILANGDINEEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQY 1465 DQSILA+GD+N+EVDMLENVEIGEQKRR+EAY A+KKK GVY DKF+DE G EKKMLPQY Sbjct: 404 
DQSILADGDVNQEVDMLENVEIGEQKRRNEAYGAAKKKTGVYVDKFSDEPGTEKKMLPQY 463 Query: 1464 DDPVTDEGLILDSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEEM 1285 DDPV DEGL LDS+GRF+GEA RIQGV AS++GEDLNS KISTDYYTQEEM Sbjct: 464 DDPVADEGLTLDSTGRFTGEAERKLEELRKRIQGVPASTYGEDLNSTLKISTDYYTQEEM 523 Query: 1284 TXXXXXXXXXXXXXXXXXXXKXXXXXXXXXXXXXXXXGSRNDGRRQNLKDEQERIDAEMR 1105 T GSRNDGR+QNLK EQER+DAEMR Sbjct: 524 TKFKKPKKKKSLRKREKLDIDALEAEAVTAGLGAGDLGSRNDGRKQNLKKEQERVDAEMR 583 Query: 1104 SNAYRSALAKADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQEE 925 SNA++SA AKA+EASKALR +V+ M+TE++D FGDDDDELRKSLERARKIA KKQ+E Sbjct: 584 SNAFQSAYAKAEEASKALRPGKVNIMRTEDDDT-VFGDDDDELRKSLERARKIAFKKQDE 642 Query: 924 KS--GSFVIKLLASSNANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKPES 751 K G +I LLASS AN+ ++NPN+SS DQSENKVVFTEMEEFVWGLQLDEEEK PE+ Sbjct: 643 KEKPGPQMITLLASSTANDSTAENPNLSSVDQSENKVVFTEMEEFVWGLQLDEEEKNPEN 702 Query: 750 EDVFMEEDVAPSTSDQEMRDEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGKGL 571 E V MEED+APSTSD EM + DGGW+EV+E + + ++E EEEV PDETIHE +VGKGL Sbjct: 703 EGVCMEEDLAPSTSDHEMTEVDGGWSEVKEAVEEVAPLKEEEEEVVPDETIHETSVGKGL 762 Query: 570 AATLNLLKGRGTLKETIEWGGRNMDKKKSKLVGIVGEGGTKEIRIERTDEYGRILTPKEA 391 A L LLK RG+LKET EWGGRNMDKKKSKLVGI G KEIRIERTDE+GRILTPKE+ Sbjct: 763 ANALKLLKDRGSLKETTEWGGRNMDKKKSKLVGINDNDGGKEIRIERTDEFGRILTPKES 822 Query: 390 FRLLSHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARMREAQAKLKAPYLV 211 FRLLSHKFHGKGPGKMKQEKRMRQYQEELK+KQMKN+DTPS SV+RM+EAQ KL+ PYLV Sbjct: 823 FRLLSHKFHGKGPGKMKQEKRMRQYQEELKVKQMKNSDTPSSSVSRMKEAQEKLQTPYLV 882 Query: 210 LSGHVKPGQSSDPRSGFATVEKDLAGGLTPMSGDKKVEHFLNIKRKYENEDS--SSQKKP 37 LSG+VKPGQ+SDPRSGFATVEK L GGLTPM GDKKVEHFLNIKR + +S SS KKP Sbjct: 883 LSGNVKPGQTSDPRSGFATVEKSLTGGLTPMLGDKKVEHFLNIKRMPDPGESGASSSKKP 942 Query: 36 KT 31 KT Sbjct: 943 KT 944 >ref|XP_010656678.1| PREDICTED: SART-1 family protein DOT2 [Vitis vinifera] gi|296090475|emb|CBI40671.3| unnamed protein product [Vitis vinifera] Length = 944 Score = 835 bits (2156), Expect = 0.0 Identities = 434/720 (60%), Positives = 530/720 
(73%), Gaps = 10/720 (1%) Frame = -1 Query: 2160 ADQEKDRTSDRDKSIRKQKDESNDMSKDGHS----RFENDYTHNSQASKEKVVNFDEKSG 1993 ADQ++DR DRDK RK +DE +D SKDG + + + +K+ + ++ Sbjct: 225 ADQDRDRYKDRDKGSRKNRDEGHDRSKDGGKDDKLKLDGGDNRDRDVTKQGRGSHHDEDD 284 Query: 1992 SKILDQTEKAEN----RESTYELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXXX 1825 S+ ++ + AE + ST +L +RI +M+E+R+ + SEG+ E+L+WVNRS Sbjct: 285 SRAIEHEKNAEGASGPQSSTAQLQERILRMKEERVKRKSEGSSEVLAWVNRSRKVEEQRN 344 Query: 1824 XXXXXALQLSRIFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTLK 1645 ALQLS+IFEEQDN++ ESDDE T+ +S+ L GVKVLHGLDKV+EGGAVVLTLK Sbjct: 345 AEKEKALQLSKIFEEQDNIDQGESDDEKPTRHSSQDLAGVKVLHGLDKVIEGGAVVLTLK 404 Query: 1644 DQSILANGDINEEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQY 1465 DQ ILANGDINE+VDMLENVEIGEQKRRDEAY+A+KKK G+Y+DKFNDE G+EKK+LPQY Sbjct: 405 DQDILANGDINEDVDMLENVEIGEQKRRDEAYKAAKKKTGIYEDKFNDEPGSEKKILPQY 464 Query: 1464 DDPVTDEGLILDSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEEM 1285 DDPVTDEGL LD+SGRF+GEA R+QGVS ++ EDLN+ GK S+DYYT EEM Sbjct: 465 DDPVTDEGLALDASGRFTGEAEKKLEELRRRLQGVSTNNRFEDLNTYGKNSSDYYTHEEM 524 Query: 1284 TXXXXXXXXXXXXXXXXXXXKXXXXXXXXXXXXXXXXGSRNDGRRQNLKDEQERIDAEMR 1105 GSRNDG+RQ++++EQER +AEMR Sbjct: 525 LQFKKPKKKKSLRKKEKLNIDALEAEAVSAGLGVGDLGSRNDGKRQSIREEQERSEAEMR 584 Query: 1104 SNAYRSALAKADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQEE 925 ++AY+ A AKADEASKALR +Q +Q EE + FG+DD+EL+KSL+RARK+ L+KQ+E Sbjct: 585 NSAYQLAYAKADEASKALRLDQTLPVQLEENENQVFGEDDEELQKSLQRARKLVLQKQDE 644 Query: 924 K--SGSFVIKLLASSNANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKPES 751 SG I LLAS+ + DN N SG+ EN+VVFTEMEEFVWGLQL++E KP+ Sbjct: 645 AATSGPQAIALLASTTTSSQNVDNQNPISGESQENRVVFTEMEEFVWGLQLEDEAHKPDG 704 Query: 750 EDVFMEEDVAPSTSDQEMRDEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGKGL 571 EDVFM+ED AP SDQE +DE GGWTEV++ DE+ + EN+EE+ PD+TIHE AVGKGL Sbjct: 705 EDVFMDEDEAPKASDQERKDEAGGWTEVKDTDKDELPVNENKEEMVPDDTIHEVAVGKGL 764 Query: 570 AATLNLLKGRGTLKETIEWGGRNMDKKKSKLVGIVGEGGTKEIRIERTDEYGRILTPKEA 391 + L LLK RGTLKE IEWGGRNMDKKKSKLVGI GTKEIRIERTDE+GRI+TPKEA 
Sbjct: 765 SGALQLLKERGTLKEGIEWGGRNMDKKKSKLVGIYDNTGTKEIRIERTDEFGRIMTPKEA 824 Query: 390 FRLLSHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARMREAQAKLKAPYLV 211 FR++SHKFHGKGPGKMKQEKRM+QYQEELK+KQMKN+DTPS SV RMREAQA+LK PYLV Sbjct: 825 FRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSQSVERMREAQARLKTPYLV 884 Query: 210 LSGHVKPGQSSDPRSGFATVEKDLAGGLTPMSGDKKVEHFLNIKRKYENEDSSSQKKPKT 31 LSGHVKPGQ+SDPRSGFATVEKD+ G LTPM GD+KVEHFL IKRK E + KKPKT Sbjct: 885 LSGHVKPGQTSDPRSGFATVEKDVPGSLTPMLGDRKVEHFLGIKRKAEPSNMGPPKKPKT 944 >ref|XP_004250062.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Solanum lycopersicum] Length = 898 Score = 822 bits (2124), Expect = 0.0 Identities = 436/711 (61%), Positives = 525/711 (73%), Gaps = 3/711 (0%) Frame = -1 Query: 2157 DQEKDRTSDRDKSIRKQKDESNDMSKDGHSRFENDYTHNSQASKEKVVNF-DEKSGSKIL 1981 + +K+R+ D+D+S R+Q+DE +D SKD R + D + A +E VV+ DE+ Sbjct: 188 EDDKERSRDKDRSSRRQRDEGHDRSKDKDRRKDEDSDYRYAAKQEIVVSHEDEERSHNNA 247 Query: 1980 DQTEKAENRESTYELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXXXXXXXXALQ 1801 +T A++ + EL++RI KM+E+RL K SEGA E+L+WV++S ALQ Sbjct: 248 VETGGAQSAAAASELEERILKMKEERLKKKSEGASEVLAWVSKSRKIEEIRNAEKEKALQ 307 Query: 1800 LSRIFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTLKDQSILANG 1621 LS+IFEEQD MN+EESDDE + +K LGG+KVLHGLDKV+EGGAVVLTLKDQSILA Sbjct: 308 LSKIFEEQDKMNEEESDDEENARLAAKELGGMKVLHGLDKVVEGGAVVLTLKDQSILAGD 367 Query: 1620 DINEEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQYDDPVTDEG 1441 D+N+EVD+LENVEIGEQKRRD+AY+A+K K G+YDDKFNDE G E+K+LP+YDDP +EG Sbjct: 368 DVNQEVDVLENVEIGEQKRRDDAYKAAKNKTGIYDDKFNDEPGFERKILPKYDDPAEEEG 427 Query: 1440 LILDSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEEMTXXXXXXX 1261 +ILD++G FS +A RIQG S+ + EDLNS GK+ +DYYTQEEM Sbjct: 428 VILDATGGFSLDAEKKLEELRRRIQGPSSINRMEDLNSSGKLLSDYYTQEEMVQFKKPKK 487 Query: 1260 XXXXXXXXXXXXKXXXXXXXXXXXXXXXXGSRNDGRRQNLKDEQERIDAEMRSNAYRSAL 1081 GSRND RQ LK+E+ER DAE RSNAY++A Sbjct: 488 KKSLRKKEKMDLDALEAEAKSAGLGVSDLGSRNDKTRQVLKEEKERADAETRSNAYQAAY 547 Query: 1080 
AKADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQEEKSGSFV-- 907 AKA+EASKALR ++ + Q EE+DA F DDD+ELRKSLERARK+AL+KQE + +F Sbjct: 548 AKAEEASKALRPDKTNNNQREEDDA-VFDDDDEELRKSLERARKLALRKQEGLAKTFPES 606 Query: 906 IKLLASSNANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKPESEDVFMEED 727 I LA+S AN+ DN + +SG+ ENKVVFTEMEEFVWGLQLDEEE+KP S+DVFMEED Sbjct: 607 IASLAASRANDSMVDNSSSASGEAQENKVVFTEMEEFVWGLQLDEEEQKPGSDDVFMEED 666 Query: 726 VAPSTSDQEMRDEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGKGLAATLNLLK 547 V P SD+E++ EDGGWTEV+E +E ++E E EV PD+TI E VGKGL+ L LL+ Sbjct: 667 VLPKPSDEELKSEDGGWTEVKETKEEEPSVKEEEMEVTPDDTIREVPVGKGLSGVLKLLQ 726 Query: 546 GRGTLKETIEWGGRNMDKKKSKLVGIVGEGGTKEIRIERTDEYGRILTPKEAFRLLSHKF 367 RGTLKE IEWGGRNMDKKKSKLVGI E G KEI IERTDEYGRILTPKEAFRLLSHKF Sbjct: 727 ERGTLKEDIEWGGRNMDKKKSKLVGIRSEDGKKEINIERTDEYGRILTPKEAFRLLSHKF 786 Query: 366 HGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARMREAQAKLKAPYLVLSGHVKPG 187 HGKGPGKMKQEKRMRQYQEELKIKQMKN+DTPS SV RMRE A+ + PY+VLSGHVKPG Sbjct: 787 HGKGPGKMKQEKRMRQYQEELKIKQMKNSDTPSQSVERMRETHAQTRTPYIVLSGHVKPG 846 Query: 186 QSSDPRSGFATVEKDLAGGLTPMSGDKKVEHFLNIKRKYENEDSSSQKKPK 34 Q+SDPRSGFATVEKDL GGLTPM GDKKVEHFL IKRK+E + SSQKKPK Sbjct: 847 QTSDPRSGFATVEKDLPGGLTPMLGDKKVEHFLGIKRKFEPGEGSSQKKPK 897 >ref|XP_009630824.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nicotiana tomentosiformis] gi|697153160|ref|XP_009630825.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nicotiana tomentosiformis] Length = 922 Score = 820 bits (2119), Expect = 0.0 Identities = 434/716 (60%), Positives = 530/716 (74%), Gaps = 6/716 (0%) Frame = -1 Query: 2160 ADQEKDRTSDRDKSIRKQKDESNDMSKDGHS----RFENDYTHNSQASKEKVVNFDEKSG 1993 AD++K+R+ D+D+ R+Q+DE +D SKD R +++ + +K+++V++++ Sbjct: 209 ADEDKERSRDKDRGNRRQRDEGHDRSKDRRKDDVQRVDDEDSDYQDVAKQEIVSYEDDDR 268 Query: 1992 SKILDQTEKAENRESTYELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXXXXXXX 1813 ++ + E A ++ S +L++RI KM+E+RL K SEGA E+++WV++S Sbjct: 269 ARN-NAVETAGSQSSASKLEERILKMKEERLKKKSEGASEVMTWVSKSRKIEEKRTAEKE 327 Query: 1812 
XALQLSRIFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTLKDQSI 1633 ALQLS+IFEEQD +NDEESDDE + +K LGG+KVLHGLDKV+EGGAVVLTLKDQSI Sbjct: 328 RALQLSKIFEEQDKINDEESDDEEKARLAAKELGGMKVLHGLDKVVEGGAVVLTLKDQSI 387 Query: 1632 LANGDINEEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQYDDPV 1453 LA DIN+EVD+LENVEIGEQK+RD+AY+A+KKK G+YDDKFND+ G E+K+LPQYDDP Sbjct: 388 LAGDDINQEVDVLENVEIGEQKKRDDAYKAAKKKTGIYDDKFNDDPGFERKILPQYDDPA 447 Query: 1452 TDEGLILDSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEEMTXXX 1273 +EG+ LD++G FS +A RIQG S+ + EDLNS GK+ +DYYTQEEM Sbjct: 448 EEEGVTLDATGGFSVDAEKKLEELRKRIQGSSSKTLAEDLNSSGKLLSDYYTQEEMLQFK 507 Query: 1272 XXXXXXXXXXXXXXXXKXXXXXXXXXXXXXXXXGSRNDGRRQNLKDEQERIDAEMRSNAY 1093 GSRND RQ L++E ER +AE +S +Y Sbjct: 508 KPKKKKSLRKKEKMDLDALEVEAKSSGLGVGDLGSRNDKTRQALREEMERAEAETKSKSY 567 Query: 1092 RSALAKADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQEEKSGS 913 ++A AKA+EASKALR E+ + Q EE+D F DDD+ELRKSLERARK+ALKKQE + + Sbjct: 568 QAAYAKAEEASKALRPEKTNNNQREEDDT-VFDDDDEELRKSLERARKLALKKQEGLAKT 626 Query: 912 FV--IKLLASSNANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKPESEDVF 739 F I LA S AN+ DNP+ SG+ ENKVVFTEMEEFVWGLQLDEEE+KP S+DVF Sbjct: 627 FPESIASLAISRANDSTVDNPSSVSGESQENKVVFTEMEEFVWGLQLDEEEQKPGSDDVF 686 Query: 738 MEEDVAPSTSDQEMRDEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGKGLAATL 559 MEE+V P SD+EM+ EDGGWTEV+E +E ++E E EV PD TIHE VGKGL+ L Sbjct: 687 MEEEVLPKPSDEEMKTEDGGWTEVKETEEEEPSVKEEEMEVTPDATIHEVPVGKGLSGAL 746 Query: 558 NLLKGRGTLKETIEWGGRNMDKKKSKLVGIVGEGGTKEIRIERTDEYGRILTPKEAFRLL 379 LL+ RGTLKE IEWGGRNMDKKKSKLVGI GE G KEIRIERTDEYGRILTPKEAFRLL Sbjct: 747 KLLQERGTLKEDIEWGGRNMDKKKSKLVGIRGEDGKKEIRIERTDEYGRILTPKEAFRLL 806 Query: 378 SHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARMREAQAKLKAPYLVLSGH 199 SHKFHGKGPGKMKQEKRMRQYQEELKIKQMKN+DTPSLSV RMREAQA+ K PYLVLSG+ Sbjct: 807 SHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNSDTPSLSVERMREAQAQFKTPYLVLSGN 866 Query: 198 VKPGQSSDPRSGFATVEKDLAGGLTPMSGDKKVEHFLNIKRKYENEDSSSQKKPKT 31 VKPGQ+SDPRSGFATVEK L GGLTPM GDKKVEHFL IKRK E + +SQKKPKT Sbjct: 867 
VKPGQTSDPRSGFATVEKALPGGLTPMLGDKKVEHFLGIKRKSEPGEGTSQKKPKT 922 >ref|XP_006361674.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1-like [Solanum tuberosum] Length = 880 Score = 820 bits (2117), Expect = 0.0 Identities = 434/713 (60%), Positives = 526/713 (73%), Gaps = 3/713 (0%) Frame = -1 Query: 2163 LADQEKDRTSDRDKSIRKQKDESNDMSKDGHSRFENDYTHNSQASKEKVVNF-DEKSGSK 1987 +A+ +K+R+ D+D+S R+Q+DES+D SKD R + D + A +E VV+ DE+ Sbjct: 168 VAEDDKERSRDKDRSSRRQRDESHDRSKDKDRRKDEDSDYRDSAKQEIVVSHEDEERSHN 227 Query: 1986 ILDQTEKAENRESTYELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXXXXXXXXA 1807 +T +++ + EL++RI KM+E+RL K SEGA E+L+WV++S A Sbjct: 228 NAVETGGSQSAAAASELEERILKMKEERLKKKSEGASEVLTWVSKSRKIEEIRNAEKEKA 287 Query: 1806 LQLSRIFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTLKDQSILA 1627 LQLS+IFEEQD MN EESD+E + +K LGG+KVLHGLDKV+EGGAVVLTLKDQSILA Sbjct: 288 LQLSKIFEEQDKMNGEESDEEENARLAAKELGGMKVLHGLDKVVEGGAVVLTLKDQSILA 347 Query: 1626 NGDINEEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQYDDPVTD 1447 D+N+EVD+LENVEIGEQKRRD+AY+A+K K G+YDDKFNDE G E+K+LP+YDDP + Sbjct: 348 GDDVNQEVDVLENVEIGEQKRRDDAYKAAKNKTGIYDDKFNDEPGFERKILPKYDDPAEE 407 Query: 1446 EGLILDSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEEMTXXXXX 1267 EG+ILD++G F+ +A RIQG S+ + EDLNS GK+ +DYYTQEEM Sbjct: 408 EGVILDATGGFNIDAEKKLEELRRRIQGPSSINRSEDLNSSGKLLSDYYTQEEMVQFKKP 467 Query: 1266 XXXXXXXXXXXXXXKXXXXXXXXXXXXXXXXGSRNDGRRQNLKDEQERIDAEMRSNAYRS 1087 GSRND RQ LK+E+ER D EMRSNAY++ Sbjct: 468 KKKKSLRKKEKMDLDALEAEAKSAGLGVSDLGSRNDKTRQVLKEEKERADTEMRSNAYQA 527 Query: 1086 ALAKADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQEEKSGSFV 907 A AKA+EASKALR E+ Q EE+DA F DDD+ELRKSLERARK+AL+KQE + +F Sbjct: 528 AYAKAEEASKALRPEKTKNNQREEDDA-VFDDDDEELRKSLERARKLALRKQEGLAKTFP 586 Query: 906 --IKLLASSNANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKPESEDVFME 733 I LA+S AN+ DN + +SG+ ENKVVFTEMEEFVWGLQLDEEE+KP S+DVFME Sbjct: 587 ESIASLAASRANDSTVDNTSSASGEAQENKVVFTEMEEFVWGLQLDEEEQKPGSDDVFME 646 Query: 732 EDVAPSTSDQEMRDEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGKGLAATLNL 553 EDV P 
SD+EM++EDGGWTEV+EI +E ++E E EV PD TI E VGKGL+ L L Sbjct: 647 EDVLPKPSDEEMKNEDGGWTEVKEIKEEEPSVKEEEMEVTPDNTIREVPVGKGLSGVLKL 706 Query: 552 LKGRGTLKETIEWGGRNMDKKKSKLVGIVGEGGTKEIRIERTDEYGRILTPKEAFRLLSH 373 L+ RGTLKE IEWGGRNMDKKKSKLVGI E G KEI IERTDEYGRILTPKEAFRL+SH Sbjct: 707 LQERGTLKEDIEWGGRNMDKKKSKLVGIRSEDGKKEIHIERTDEYGRILTPKEAFRLISH 766 Query: 372 KFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARMREAQAKLKAPYLVLSGHVK 193 KFHGKGPGKMKQEKRMRQYQEELKIKQM+N+DTPS SV RMRE A+ + PY+VLSG+VK Sbjct: 767 KFHGKGPGKMKQEKRMRQYQEELKIKQMRNSDTPSQSVERMRETHAQTRVPYIVLSGNVK 826 Query: 192 PGQSSDPRSGFATVEKDLAGGLTPMSGDKKVEHFLNIKRKYENEDSSSQKKPK 34 PGQ+SDPRSGFATVEKDL GGLTPM GDKKVEHFL IKRK+E + SSQKK K Sbjct: 827 PGQTSDPRSGFATVEKDLPGGLTPMLGDKKVEHFLGIKRKFEPGEGSSQKKTK 879 >ref|XP_010256356.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001422|ref|XP_010256357.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001427|ref|XP_010256358.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001430|ref|XP_010256359.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001433|ref|XP_010256360.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] gi|720001436|ref|XP_010256361.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera] Length = 851 Score = 811 bits (2094), Expect = 0.0 Identities = 427/739 (57%), Positives = 528/739 (71%), Gaps = 30/739 (4%) Frame = -1 Query: 2157 DQEKDRTSDR-----DKSIRKQKDESNDMSKDGHSRFENDYTHN--SQASKEKVVNFDEK 1999 D+E+++ DR DKS K K+ S D +D + +D + K++ ++ D Sbjct: 114 DREREKVKDREKLERDKSKEKDKERSKDKERDARNGKLDDESQGRGKDVGKDEKLDLDGG 173 Query: 1998 SGSKILDQTEKAEN---------------------RESTYELDQRISKMREQRLMKSSEG 1882 + ++ Q ++ ++ + ST EL++RI KMRE+R K SEG Sbjct: 174 NDRDVVKQVKEVQHDVVVDMSVENKKKVDGAMGGSQPSTGELEERILKMREERSKKKSEG 233 Query: 1881 APEILSWVNRSXXXXXXXXXXXXXALQLSRIFEEQDNMNDEESDDEPATQDTSKHLGGVK 1702 E+LSWVN+S ALQLS++FEEQD ++ ES+DE + TSK L GVK Sbjct: 
234 VSEVLSWVNKSRKLEEKRNAEKQKALQLSKVFEEQDKIDQGESEDEDTARHTSKDLAGVK 293 Query: 1701 VLHGLDKVLEGGAVVLTLKDQSILANGDINEEVDMLENVEIGEQKRRDEAYRASKKKVGV 1522 +LHG+DKV+EGGAVVLTLKDQ+ILAN D+NEE D+LENVEIGEQK+RD AY+A+KKK G+ Sbjct: 294 ILHGIDKVIEGGAVVLTLKDQNILANDDVNEEADVLENVEIGEQKQRDAAYKAAKKKTGI 353 Query: 1521 YDDKFNDELGAEKKMLPQYDDPVTDEGLILDSSGRFSGEAXXXXXXXXXRIQGVSASSHG 1342 Y+DKF+ E GA+KK+LPQYDDPV DEGL+LD SGRF+GEA R+QGVSAS+H Sbjct: 354 YEDKFSGEDGAQKKILPQYDDPVEDEGLVLDESGRFAGEAEKKLEELRKRLQGVSASNHF 413 Query: 1341 EDLNSIGKISTDYYTQEEMTXXXXXXXXXXXXXXXXXXXKXXXXXXXXXXXXXXXXGSRN 1162 EDLNS KI++D+YT EEM GSR Sbjct: 414 EDLNSSAKITSDFYTHEEMLQFKKPKKKKSLRKKVKLDLDALEAEAISAGFGVGDLGSRK 473 Query: 1161 DGRRQNLKDEQERIDAEMRSNAYRSALAKADEASKALRQEQVHTMQTEEEDAPAFGDDDD 982 DG+RQ K++QER +AEMRSNAY+SA AKA+EASK LRQEQ T+Q EE ++P FGDD++ Sbjct: 474 DGQRQATKEQQERSEAEMRSNAYQSAFAKAEEASKTLRQEQTLTVQVEENESPVFGDDEE 533 Query: 981 ELRKSLERARKIALKKQEEK--SGSFVIKLLASSNANEPASDNPNISSGDQSENKVVFTE 808 +L KSLE+ARK+ALK Q E SG + LLAS+ +N+P D N++SG+ ENKVVFTE Sbjct: 534 DLYKSLEKARKLALKTQNEAAASGPQAVALLASTVSNQP-KDEENLTSGEPQENKVVFTE 592 Query: 807 MEEFVWGLQLDEEEKKPESEDVFMEEDVAPSTSDQEMRDEDGGWTEVQEIMPDEIHMQEN 628 MEEFVWGLQL+EE +K ESEDVFM+ED P SDQE++DE GGWTEV +I +E ++E Sbjct: 593 MEEFVWGLQLNEEARKLESEDVFMDEDNVPKASDQEIKDEAGGWTEVNDIDENEHPVEEE 652 Query: 627 EEEVAPDETIHEPAVGKGLAATLNLLKGRGTLKETIEWGGRNMDKKKSKLVGIVGEGGTK 448 +EEV PDETIHE A+GKGL+ L LLK RGTLKET++WGGRNMDKKKSKLVGI +GG K Sbjct: 653 KEEVVPDETIHEVAIGKGLSGALKLLKERGTLKETVDWGGRNMDKKKSKLVGIYDDGGPK 712 Query: 447 EIRIERTDEYGRILTPKEAFRLLSHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPS 268 EIRIERTDE+GRI+TPKEAFR++SHKFHGKGPGKMKQEKRM+QYQEELK+KQMKN+DTPS Sbjct: 713 EIRIERTDEFGRIMTPKEAFRVISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPS 772 Query: 267 LSVARMREAQAKLKAPYLVLSGHVKPGQSSDPRSGFATVEKDLAGGLTPMSGDKKVEHFL 88 S+ RMREAQA+LK PYLVLSGHVKPGQ+SDPRSGFATVEKD+ GGLTPM GDKKVEHFL Sbjct: 773 QSMERMREAQARLKTPYLVLSGHVKPGQTSDPRSGFATVEKDIPGGLTPMLGDKKVEHFL 832 Query: 87 NIKRKYENEDSSSQKKPKT 31 IKRK E + KK KT Sbjct: 833 
GIKRKAEPSNMGPPKKSKT 851 >ref|XP_012077379.1| PREDICTED: SART-1 family protein DOT2 isoform X1 [Jatropha curcas] gi|643724962|gb|KDP34163.1| hypothetical protein JCGZ_07734 [Jatropha curcas] Length = 864 Score = 793 bits (2047), Expect = 0.0 Identities = 427/719 (59%), Positives = 521/719 (72%), Gaps = 9/719 (1%) Frame = -1 Query: 2160 ADQEKDRTSDRDKSIRKQKDESNDMSKDGHSRFENDYTHNSQAS--KEKVVNFDEKSGSK 1987 +D +K+R DR+K ++ +E D SKD E DY +N +S K+ V+FD K K Sbjct: 150 SDYDKERLRDREKVSKRSHEEDYDRSKD--DVVEMDYENNKDSSVLKQSKVSFDNKDEQK 207 Query: 1986 ILDQTEKAENRESTYELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXXXXXXXXA 1807 ++T + + + +L++RI KM+E+RL K+SE E+L+WVNRS A Sbjct: 208 A-EETSRGGSAPVS-QLEERILKMKEERLKKNSEPGDEVLAWVNRSRKLEEKKNAEKQKA 265 Query: 1806 LQLSRIFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTLKDQSILA 1627 QLS+IFEEQDN ES+DE + + T+ L GVKVLHGL+KV+EGGAVVLTLKDQSILA Sbjct: 266 KQLSKIFEEQDNNVQGESEDEDSGEHTTHDLAGVKVLHGLEKVMEGGAVVLTLKDQSILA 325 Query: 1626 NGDINEEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQYDDPVTD 1447 +GDINEEVDMLENVEIGEQKRRD+AY+A+KKK G+YDDKFND+ +EKK+LPQYDD D Sbjct: 326 DGDINEEVDMLENVEIGEQKRRDDAYKAAKKKTGIYDDKFNDDPASEKKILPQYDDSAAD 385 Query: 1446 EGLILDSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEEMTXXXXX 1267 EG+ LD GRF+GEA R+QGVS ++ EDL+S GKIS+DYYT EE+ Sbjct: 386 EGVALDERGRFTGEAEKKLEELRRRLQGVSTNNRFEDLSSSGKISSDYYTHEELLQFKKP 445 Query: 1266 XXXXXXXXXXXXXXKXXXXXXXXXXXXXXXXGSRNDGRRQNLKDEQERIDAEMRSNAYRS 1087 GSRN+GRRQ ++ EQER +AEMRS+AY++ Sbjct: 446 KKKKSLRKKEKLDIDALEAEAVSAGLGVGDLGSRNNGRRQAIRQEQERSEAEMRSSAYQA 505 Query: 1086 ALAKADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQEEK-SGSF 910 A KADEASK+LRQEQ + +E++ P F +DD++L KSLERARK+ALKKQEEK SG Sbjct: 506 AYDKADEASKSLRQEQTLHAKLDEDENPVFAEDDEDLYKSLERARKLALKKQEEKASGPQ 565 Query: 909 VIKLLASSNA--NEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKPESEDVFM 736 I LA++ + +D+ N ++G+ ENK+VFTEMEEFVWGLQLDEE K ++DVFM Sbjct: 566 AIARLAAATTTTSSQTTDDQNPTTGESQENKIVFTEMEEFVWGLQLDEESHKHGNDDVFM 625 Query: 735 
EEDVAPSTSDQEMRDEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGKGLAATLN 556 +ED AP SDQE +DE GGWTEVQ+I DE + EN E++ PDETIHE VGKGL+A L Sbjct: 626 DEDEAPIVSDQEKKDETGGWTEVQDIDKDENPVNENNEDIVPDETIHEVPVGKGLSAALK 685 Query: 555 LLKGRGTLKETIEWGGRNMDKKKSKLVGIVGEGGT----KEIRIERTDEYGRILTPKEAF 388 LLK RGTLKE+ EWGGRNMDKKKSKLVGIV K+IRI+RTDEYGR LTPKEAF Sbjct: 686 LLKERGTLKESTEWGGRNMDKKKSKLVGIVDSDVDNERFKDIRIDRTDEYGRTLTPKEAF 745 Query: 387 RLLSHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARMREAQAKLKAPYLVL 208 R++SHKFHGKGPGKMKQEKRM+QY EELK+KQMKN+DTPSLSV RMREAQA+LK PYLVL Sbjct: 746 RIISHKFHGKGPGKMKQEKRMKQYLEELKMKQMKNSDTPSLSVERMREAQAQLKTPYLVL 805 Query: 207 SGHVKPGQSSDPRSGFATVEKDLAGGLTPMSGDKKVEHFLNIKRKYENEDSSSQKKPKT 31 SGHVKPGQ+SDPRSGFATVEKDL GGLTPM GDKKVEHFL IKRK E +S++ KKPKT Sbjct: 806 SGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDKKVEHFLGIKRKAEPGNSNAPKKPKT 864 >ref|XP_002516516.1| conserved hypothetical protein [Ricinus communis] gi|223544336|gb|EEF45857.1| conserved hypothetical protein [Ricinus communis] Length = 873 Score = 792 bits (2046), Expect = 0.0 Identities = 421/711 (59%), Positives = 505/711 (71%), Gaps = 4/711 (0%) Frame = -1 Query: 2151 EKDRTSDRDKSIRKQKDESNDMSKDGHSRFENDYTHNSQASKEKVVNFDEKSGSKILDQT 1972 +KDR RD ++ +E ND SK+ + NS K+K V+FD+ + + + Sbjct: 166 DKDRL--RDGVSKRSHEEENDRSKNDTIEMGYERERNSDVGKQKKVSFDDDNDDEQKVER 223 Query: 1971 EKAENRESTYELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXXXXXXXXALQLSR 1792 S+ E ++RI K+RE+RL K+S+ E+LSWVNRS A QLS+ Sbjct: 224 TSGGGLASSLEFEERILKVREERLKKNSDAGSEVLSWVNRSRKLAEKKNAEKKKAKQLSK 283 Query: 1791 IFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTLKDQSILANGDIN 1612 +FEEQD + ES+DE A + + L GVKVLHGL+KV+EGGAVVLTLKDQSIL +GDIN Sbjct: 284 VFEEQDKIVQGESEDEEAGELATNDLAGVKVLHGLEKVMEGGAVVLTLKDQSILVDGDIN 343 Query: 1611 EEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQYDDPVTDEGLIL 1432 EEVDMLEN+EIGEQKRR+EAY+A+KKK G+YDDKFND+ +E+K+LPQYDDP TDEG+ L Sbjct: 344 EEVDMLENIEIGEQKRRNEAYKAAKKKTGIYDDKFNDDPASERKILPQYDDPTTDEGVTL 403 Query: 1431 DSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEEMTXXXXXXXXXX 1252 
D GRF+GEA R+QG + EDLNS GK+S+D+YT EEM Sbjct: 404 DERGRFTGEAEKKLEELRRRLQGALTDNCFEDLNSSGKMSSDFYTHEEMLQFKKPKKKKS 463 Query: 1251 XXXXXXXXXKXXXXXXXXXXXXXXXXGSRNDGRRQNLKDEQERIDAEMRSNAYRSALAKA 1072 GSR+DGRRQ +++EQER +AE RS+AY+SA AKA Sbjct: 464 LRKKEKLDIDALEAEAVSAGLGVGDLGSRSDGRRQAIREEQERSEAERRSSAYQSAYAKA 523 Query: 1071 DEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQEEKSGSFVIKLLA 892 DEASK+LR EQ + EE+ P F DDD++L KSLERARK+ALKKQEE SG I LA Sbjct: 524 DEASKSLRLEQTLPAKVNEEENPVFADDDEDLFKSLERARKLALKKQEEASGPQAIARLA 583 Query: 891 SSNANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKPESEDVFMEEDVAPST 712 ++ N+ A D N + G+ ENKVVFTEMEEFVWGLQLDEE KP SEDVFM+ED AP Sbjct: 584 TATNNQIADDQ-NPADGESQENKVVFTEMEEFVWGLQLDEESHKPGSEDVFMDEDAAPRV 642 Query: 711 SDQEMRDEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGKGLAATLNLLKGRGTL 532 SDQEM+DE G WTEV + D+ + EN+E+V PDETIHE AVGKGL+ L LLK RGTL Sbjct: 643 SDQEMKDEAGRWTEVNDAAEDDNSVNENKEDVVPDETIHEVAVGKGLSGALKLLKERGTL 702 Query: 531 KETIEWGGRNMDKKKSKLVGIVGEGGT----KEIRIERTDEYGRILTPKEAFRLLSHKFH 364 KET++WGGRNMDKKKSKLVGIV KEIRIER DE+GRI+TPKEAFR++SHKFH Sbjct: 703 KETVDWGGRNMDKKKSKLVGIVDSDADNEKFKEIRIERMDEFGRIMTPKEAFRMISHKFH 762 Query: 363 GKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARMREAQAKLKAPYLVLSGHVKPGQ 184 GKGPGKMKQEKRM+QYQEELK+KQMKN+DTPS SV RMREAQ KLK PYLVLSGHVK GQ Sbjct: 763 GKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSESVERMREAQKKLKTPYLVLSGHVKSGQ 822 Query: 183 SSDPRSGFATVEKDLAGGLTPMSGDKKVEHFLNIKRKYENEDSSSQKKPKT 31 +SDPRS FATVEKDL GGLTPM GDKKVEHFL IKRK E+E+SS KKPK+ Sbjct: 823 ASDPRSSFATVEKDLPGGLTPMLGDKKVEHFLGIKRKAEHENSSPSKKPKS 873 >ref|XP_012441144.1| PREDICTED: SART-1 family protein DOT2 [Gossypium raimondii] gi|823216924|ref|XP_012441145.1| PREDICTED: SART-1 family protein DOT2 [Gossypium raimondii] gi|763794483|gb|KJB61479.1| hypothetical protein B456_009G361400 [Gossypium raimondii] gi|763794484|gb|KJB61480.1| hypothetical protein B456_009G361400 [Gossypium raimondii] gi|763794485|gb|KJB61481.1| hypothetical protein B456_009G361400 [Gossypium raimondii] gi|763794488|gb|KJB61484.1| 
hypothetical protein B456_009G361400 [Gossypium raimondii] Length = 900 Score = 784 bits (2025), Expect = 0.0 Identities = 422/733 (57%), Positives = 516/733 (70%), Gaps = 22/733 (3%) Frame = -1 Query: 2163 LADQEKDRTSDRDKSIRKQKDESNDMSKDGHSRFENDYTHNSQASKEKVVNFDEKSGSKI 1984 L D+EK+R ++ K KQK+ D+ K+ SR ++ N + E K G Sbjct: 175 LKDREKEREGEKGKDRSKQKNREADLEKE-RSRDRDNVGKNHEEDYE-----GSKDGELA 228 Query: 1983 LDQTEKAENRE--------------STYELDQRISKMREQRLMKSSEGAPEILSWVNRSX 1846 LD ++ + E S+ EL++RI +M+E RL K SEG E+ +WV+RS Sbjct: 229 LDYEDRRDKDEAELNAGSNASLVQASSSELEERIVRMKEDRLKKKSEGLSEVSAWVSRSR 288 Query: 1845 XXXXXXXXXXXXALQLSRIFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGG 1666 ALQLS+IFEEQDN E +DE A + LGGVKVLHGLDKV++GG Sbjct: 289 KLEDKRNAEKEKALQLSKIFEEQDNFVQGEDEDEEADNRPTHDLGGVKVLHGLDKVMDGG 348 Query: 1665 AVVLTLKDQSILANGDINEEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAE 1486 AVVLTLKDQSILA+GD+NE+VDMLEN+EIGEQK+RDEAY+A+KKK GVYDDKFN++ G+E Sbjct: 349 AVVLTLKDQSILADGDLNEDVDMLENIEIGEQKQRDEAYKAAKKKTGVYDDKFNEDPGSE 408 Query: 1485 KKMLPQYDDPVTDEGLILDSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTD 1306 KK+LPQYDDPV DEG+ LD GRF+GEA R+ GV ++ EDLN++GKIS+D Sbjct: 409 KKILPQYDDPVADEGVTLDERGRFTGEAEKKLEELRKRLLGVPTNNRVEDLNNVGKISSD 468 Query: 1305 YYTQEEMTXXXXXXXXXXXXXXXXXXXKXXXXXXXXXXXXXXXXGSRNDGRRQNLKDEQE 1126 YYTQEEM GSR D RRQ +K+E+ Sbjct: 469 YYTQEEMLRFKKPKKKKALRKKEKLDIDALEAEAVSAGLGAGDLGSRKDSRRQAIKEEEA 528 Query: 1125 RIDAEMRSNAYRSALAKADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKI 946 R +AE R NAY++A AKADEASK+LR EQ HT++ EE++ F DD+++L KSLE+AR++ Sbjct: 529 RSEAEKRKNAYQAAFAKADEASKSLRLEQTHTVKPEEDENQVFADDEEDLYKSLEKARRL 588 Query: 945 ALKKQEEKSGSFVIKLLASSNANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEE 766 ALKKQEEKSG I LLA+++A+ +D+ + S+G+ ENKVV TEMEEFVWGLQLDEE Sbjct: 589 ALKKQEEKSGPQAIALLATTSASNQTTDD-HTSTGEAQENKVVITEMEEFVWGLQLDEEA 647 Query: 765 KKPESEDVFMEEDVAPSTSDQEMR---DEDGGWTEVQEIMPDEIHMQENEEEVAPDETIH 595 KP+SEDVFM+ED P S+Q+ + +E GGWTEV + DE E+ +EV PDETIH Sbjct: 648 HKPDSEDVFMDEDEVPGASEQDRKNGENEVGGWTEVIDTSADEKPANEDNDEVVPDETIH 
707 Query: 594 EPAVGKGLAATLNLLKGRGTLKETIEWGGRNMDKKKSKLVGIVG-----EGGTKEIRIER 430 E AVGKGL+ L LLK RGTLKETIEWGGRNMDKKKSKLVGIV + K+IRIER Sbjct: 708 EIAVGKGLSGALKLLKDRGTLKETIEWGGRNMDKKKSKLVGIVDDDHQTDNRFKDIRIER 767 Query: 429 TDEYGRILTPKEAFRLLSHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARM 250 TDE+GRI+TPKEAFR+LSHKFHGKGPGKMKQEKRM+QYQEELK+KQMKN+DTPSLSV RM Sbjct: 768 TDEFGRIVTPKEAFRMLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERM 827 Query: 249 REAQAKLKAPYLVLSGHVKPGQSSDPRSGFATVEKDLAGGLTPMSGDKKVEHFLNIKRKY 70 REAQA+LK PYLVLSGHVKPGQ+SDP SGFATVEKD GGLTPM GD+KVEHFL IKRK Sbjct: 828 REAQAQLKTPYLVLSGHVKPGQTSDPASGFATVEKDFPGGLTPMLGDRKVEHFLGIKRKA 887 Query: 69 ENEDSSSQKKPKT 31 E +S + KKPKT Sbjct: 888 EAGNSGTPKKPKT 900 >ref|XP_006471158.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1-like [Citrus sinensis] Length = 878 Score = 772 bits (1994), Expect = 0.0 Identities = 413/715 (57%), Positives = 511/715 (71%), Gaps = 8/715 (1%) Frame = -1 Query: 2151 EKDRTSDRDKSIRKQKDESNDMSKDGHSRFENDYTHNSQASKEKVVNFDEKSGSKILDQT 1972 +K+R+ +RD+ RK +E S D + +N+ N +K V++D+ +D Sbjct: 175 DKERSRERDRVSRKAHEEDCARSNDNMPKLDNEGNMNRDINKHGKVSYDD------IDDQ 228 Query: 1971 EKAENRESTYELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXXXXXXXXALQLSR 1792 + + ST L RI KM+E+RL K+SEGAPEILSWVNRS ALQLS+ Sbjct: 229 DNEDAHVSTSGLGDRILKMKEERLKKNSEGAPEILSWVNRSRKIEQIKNVEKKKALQLSK 288 Query: 1791 IFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTLKDQSILANGDIN 1612 IFEEQDN+ ES+DE A Q S L GVKVLHGLDKV+EGGAVVLTLKDQ ILA+GDIN Sbjct: 289 IFEEQDNIVQGESEDEEAGQHNSHDLAGVKVLHGLDKVMEGGAVVLTLKDQQILADGDIN 348 Query: 1611 EEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQYDDPVTDEGLIL 1432 E+VDMLEN+EIGEQKRRDEAY+A+KKK G+YDDKFND+ +EKK+LPQYD+P TDEGL L Sbjct: 349 EDVDMLENIEIGEQKRRDEAYKAAKKKTGIYDDKFNDDPSSEKKILPQYDEPATDEGLTL 408 Query: 1431 DSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEEMTXXXXXXXXXX 1252 D+ GRF+GEA RIQGV A++ EDLN I++DY+TQEEM Sbjct: 409 DARGRFTGEAEKKLEELRRRIQGVQANNSTEDLNLSANITSDYFTQEEMLQFKKPKKKKK 468 Query: 1251 
XXXXXXXXXKXXXXXXXXXXXXXXXXG-SRNDGRRQNLKDEQERIDAEMRSNAYRSALAK 1075 SR DGRRQ +++EQE+ +AEM++ AY+SA AK Sbjct: 469 SIRKKEKLDLDALEAEALSAGLGVEDLGSRKDGRRQAIREEQEKSEAEMKNKAYQSAYAK 528 Query: 1074 ADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQEEKSGSFVIKLL 895 A+EA K+LR EQ ++ EEE+ DD+D+L KSLERARK+ALKKQE SG I L Sbjct: 529 AEEAVKSLRMEQTRPVKLEEENEEPIADDEDDLYKSLERARKLALKKQEASSGPEAIARL 588 Query: 894 ASSN-ANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKPESEDVFMEEDVAP 718 A+S ANE ++ N + E KVV TE++EFVWGL + EE +K + +DVFM+ED P Sbjct: 589 ATSQTANEQSTTNE-----ESEEKKVVITELQEFVWGLPVGEEVQKQDRQDVFMDEDEGP 643 Query: 717 STSDQEMRDEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGKGLAATLNLLKGRG 538 TSD EM+DE GGWTEV+EI +E +E++EE+ PDETIHE AVGKGLA L+LLK RG Sbjct: 644 RTSDLEMKDEPGGWTEVKEIGEEENPSKEDKEEIVPDETIHELAVGKGLAGALSLLKDRG 703 Query: 537 TLKETIEWGGRNMDKKKSKLVGIVGEGGT-----KEIRIERTDEYGRILTPKEAFRLLSH 373 TLKE I+WGGRNMDKKKSKL+G+V + K+IRIERTDE+GRI+TPKEAFR++SH Sbjct: 704 TLKEGIDWGGRNMDKKKSKLIGVVDDNPNVDNRFKDIRIERTDEFGRIMTPKEAFRMISH 763 Query: 372 KFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARMREAQAKLKAPYLVLSGHVK 193 KFHGKGPGKMKQEKRM+QYQEELK+KQMKN+DTP+ SV RMREAQA+LK PYLVLSGHVK Sbjct: 764 KFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPTESVERMREAQARLKTPYLVLSGHVK 823 Query: 192 PGQSSDPRSGFATVEKDL-AGGLTPMSGDKKVEHFLNIKRKYENEDSSSQKKPKT 31 PGQ+SDPRSGFATVEKDL AGGLTPM G++KVEHFL IKRK ++E+++S K P+T Sbjct: 824 PGQTSDPRSGFATVEKDLPAGGLTPMLGNRKVEHFLGIKRKGDSENTNSPKNPRT 878 >ref|XP_007022025.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao] gi|590611175|ref|XP_007022026.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao] gi|508721653|gb|EOY13550.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao] gi|508721654|gb|EOY13551.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao] Length = 907 Score = 767 bits (1981), Expect = 0.0 Identities = 414/718 (57%), Positives = 508/718 (70%), Gaps = 8/718 (1%) Frame = -1 Query: 2160 ADQEKDRTSDRDKSIRKQKDESNDMSKDGHSRFENDYTHNSQASKEKVVNFDEKSGSKIL 
1981 AD EK+R+ DRD +I+K +E + SKDG DY +S+ E +N +G Sbjct: 203 ADLEKERSRDRDNAIKKNHEEDYEGSKDGELAL--DYG-DSRDKDEAELNAGSNAGVA-- 257 Query: 1980 DQTEKAENRESTYELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXXXXXXXXALQ 1801 + S+ EL++RI++M+E+RL K SEG E+L WV ALQ Sbjct: 258 --------QASSSELEERIARMKEERLKKKSEGVSEVLEWVGNFRKLEEKRNAEKEKALQ 309 Query: 1800 LSRIFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTLKDQSILANG 1621 S+IFEEQD+ E++DE A + + L GVKVLHGLDKV++GGAVVLTLKDQSILANG Sbjct: 310 RSKIFEEQDDFVQGENEDEEAVRHAAHDLAGVKVLHGLDKVMDGGAVVLTLKDQSILANG 369 Query: 1620 DINEEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQYDDPVTDEG 1441 DINE+VDMLENVEIGEQ+RRDEAY+A+KKK GVYDDKFNDE G+EKK+LPQYD+PV DEG Sbjct: 370 DINEDVDMLENVEIGEQRRRDEAYKAAKKKTGVYDDKFNDEPGSEKKILPQYDNPVADEG 429 Query: 1440 LILDSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEEMTXXXXXXX 1261 + LD GRF+GEA R+QGV ++ EDLN+ GKI++DYYTQEEM Sbjct: 430 VTLDERGRFTGEAEKKLQELRKRLQGVPTNNRVEDLNNAGKIASDYYTQEEMLKFKKPKK 489 Query: 1260 XXXXXXXXXXXXKXXXXXXXXXXXXXXXXGSRNDGRRQNLKDEQERIDAEMRSNAYRSAL 1081 GSRND RRQ +++E+ R +AE R++AY+SA Sbjct: 490 KKALRKKEKLDIDALEAEAISSGLGAGDLGSRNDARRQAIREEEARSEAEKRNSAYQSAY 549 Query: 1080 AKADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQE-EKSGSFVI 904 AKADEASK+L EQ ++ EE++ F DDDD+L KS+ER+RK+A KKQE EKSG I Sbjct: 550 AKADEASKSLWLEQTLIVKPEEDENQVFADDDDDLYKSIERSRKLAFKKQEDEKSGPQAI 609 Query: 903 KLLASSNANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKPESEDVFMEEDV 724 L A++ A +D+ ++G+ ENK+V TEMEEFVWGLQ DEE KP+SEDVFM+ED Sbjct: 610 ALRATTAAISQTADDQTTTTGEAQENKLVITEMEEFVWGLQHDEEAHKPDSEDVFMDEDE 669 Query: 723 APSTSDQEMR---DEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGKGLAATLNL 553 P S+ + + +E GGWTEV + DE E+++++ PDETIHE AVGKGL+ L L Sbjct: 670 VPGVSEHDGKSGENEVGGWTEVVDASTDENPSNEDKDDIVPDETIHEVAVGKGLSGALKL 729 Query: 552 LKGRGTLKETIEWGGRNMDKKKSKLVGIVGEGGT----KEIRIERTDEYGRILTPKEAFR 385 LK RGTLKE+IEWGGRNMDKKKSKLVGIV + K+IRIERTDE+GRI+TPKEAFR Sbjct: 730 LKDRGTLKESIEWGGRNMDKKKSKLVGIVDDDRENDRFKDIRIERTDEFGRIITPKEAFR 789 Query: 384 
LLSHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARMREAQAKLKAPYLVLS 205 +LSHKFHGKGPGKMKQEKR +QYQEELK+KQMKN+DTPSLSV RMREAQA+LK PYLVLS Sbjct: 790 VLSHKFHGKGPGKMKQEKRQKQYQEELKLKQMKNSDTPSLSVERMREAQAQLKTPYLVLS 849 Query: 204 GHVKPGQSSDPRSGFATVEKDLAGGLTPMSGDKKVEHFLNIKRKYENEDSSSQKKPKT 31 GHVKPGQ+SDPRSGFATVEKD GGLTPM GD+KVEHFL IKRK E +SS+ KKPKT Sbjct: 850 GHVKPGQTSDPRSGFATVEKDFPGGLTPMLGDRKVEHFLGIKRKAEPGNSSTPKKPKT 907 >ref|XP_011011622.1| PREDICTED: SART-1 family protein DOT2 isoform X1 [Populus euphratica] Length = 860 Score = 766 bits (1978), Expect = 0.0 Identities = 420/720 (58%), Positives = 505/720 (70%), Gaps = 12/720 (1%) Frame = -1 Query: 2157 DQEKDRTSDRDKSIRKQKDESNDMSKDGHSRFENDYTHNSQASKEKVVNFDEKSGSKILD 1978 ++E+DR +D+DK ++KD + S+ G+ E DY Q E V+ D + K+ Sbjct: 147 ERERDREADQDKERSREKDRA---SRKGN---EEDYDDKVQMDYEDEVDKDNRKQGKVSF 200 Query: 1977 QTEKAENRESTY----ELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXXXXXXXX 1810 + E ++ E + EL+QRI KM+E+R K SE +IL+WV RS Sbjct: 201 RDEGEQSAEGAHSSASELEQRILKMKEERTKKKSEAGSDILAWVGRSRKIEENKHAAKAR 260 Query: 1809 ALQLSRIFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTLKDQSIL 1630 A LS+IFEEQDN+ SDDE A Q + +L G+KVL GLDKVLEGGAVVLTLKDQ+IL Sbjct: 261 AKHLSKIFEEQDNIGQGGSDDEEADQHNAYNLAGIKVLDGLDKVLEGGAVVLTLKDQNIL 320 Query: 1629 ANGDINEEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQYDDPVT 1450 A+GDINEEVDMLENVEIGEQKRRDEAY+A+KKK G+YDDKFND+ +EKKMLPQYDD Sbjct: 321 ADGDINEEVDMLENVEIGEQKRRDEAYKAAKKKTGIYDDKFNDDPASEKKMLPQYDDANA 380 Query: 1449 DEGLILDSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEEMTXXXX 1270 DEG+ LD GRF+GEA R+QG S S+ EDLNS GKIS+DY+T EEM Sbjct: 381 DEGITLDERGRFTGEAEKKLEELRRRLQGTSTSARLEDLNSSGKISSDYFTHEEMLKFKK 440 Query: 1269 XXXXXXXXXXXXXXXKXXXXXXXXXXXXXXXXGSRNDGRRQNLKDEQERIDAEMRSNAYR 1090 GSR DGRRQ +++EQER AEMR+NAY+ Sbjct: 441 PKKKKSLRKKDKLDIDALEAEAVSAGLGIGDLGSRKDGRRQAIREEQERSAAEMRNNAYQ 500 Query: 1089 SALAKADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQE-EKSGS 913 SA AKADEASK+LR +Q + EEE+ F DD+++L KSLERARK+ALKKQE E SG Sbjct: 501 
SAYAKADEASKSLRLDQTLQTKVEEEENLVFADDEEDLYKSLERARKLALKKQEAEASGP 560 Query: 912 FVIKLLASSNANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKPESEDVFME 733 I LAS+ + +D+ N +G+ ENK+VFTEMEEFV +QL EE KP++EDVFM+ Sbjct: 561 LAIAHLASTTLSSQIADDKNPETGESHENKLVFTEMEEFVSAIQLAEEVHKPDNEDVFMD 620 Query: 732 EDVAPSTSDQEMRDEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGKGLAATLNL 553 ED P SD+E +DE GGW EV + DE + E +EE+ PDETIHE AVGKGL+ L L Sbjct: 621 EDEPPRVSDEEQKDEAGGWMEVPDNSKDENPVNE-DEEIVPDETIHEVAVGKGLSGALKL 679 Query: 552 LKGRGTLKETIEWGGRNMDKKKSKLVGIVGEG-GT------KEIRIERTDEYGRILTPKE 394 LK RGTLKE+I+WGGRNMDKKKSKLVGIV + GT K+IRIERTDE+GRI+TPKE Sbjct: 680 LKERGTLKESIDWGGRNMDKKKSKLVGIVDDDVGTNNDNKFKDIRIERTDEFGRIMTPKE 739 Query: 393 AFRLLSHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARMREAQAKLKAPYL 214 AFR++SHKFHGKGPGKMKQEKRM+QYQEELK+KQMKN+DTPSLSV RMR AQA+LK PYL Sbjct: 740 AFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERMRGAQAQLKTPYL 799 Query: 213 VLSGHVKPGQSSDPRSGFATVEKDLAGGLTPMSGDKKVEHFLNIKRKYENEDSSSQKKPK 34 VLSGHVKPGQ+SDPRSGFATVEKD GGLTPM GDKKVEHFL IKRK E S + KKPK Sbjct: 800 VLSGHVKPGQTSDPRSGFATVEKDFPGGLTPMLGDKKVEHFLGIKRKPETGFSGAPKKPK 859 >ref|XP_006431678.1| hypothetical protein CICLE_v10000233mg [Citrus clementina] gi|567878241|ref|XP_006431679.1| hypothetical protein CICLE_v10000233mg [Citrus clementina] gi|557533800|gb|ESR44918.1| hypothetical protein CICLE_v10000233mg [Citrus clementina] gi|557533801|gb|ESR44919.1| hypothetical protein CICLE_v10000233mg [Citrus clementina] Length = 878 Score = 764 bits (1974), Expect = 0.0 Identities = 410/715 (57%), Positives = 509/715 (71%), Gaps = 8/715 (1%) Frame = -1 Query: 2151 EKDRTSDRDKSIRKQKDESNDMSKDGHSRFENDYTHNSQASKEKVVNFDEKSGSKILDQT 1972 +K+R+ +RD+ RK +E S D + +N+ N +K V++D+ D Sbjct: 175 DKERSRERDRVSRKAHEEDCARSNDNMPKLDNEDNMNRDINKHGKVSYDDT------DDQ 228 Query: 1971 EKAENRESTYELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXXXXXXXXALQLSR 1792 + + ST L RI KM+E+RL K+SEGAPEILSWVNRS ALQLS+ Sbjct: 229 DNEDAHVSTSGLGDRILKMKEERLKKNSEGAPEILSWVNRSRKIEQIKNVEKKKALQLSK 288 Query: 1791 
IFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTLKDQSILANGDIN 1612 IFEEQDN+ ES+DE A Q +S L GVKVLHGLDKV+ GGAVVLTLKDQ ILA+GDIN Sbjct: 289 IFEEQDNIVQGESEDEEAGQHSSHDLAGVKVLHGLDKVMGGGAVVLTLKDQQILADGDIN 348 Query: 1611 EEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQYDDPVTDEGLIL 1432 E+VDMLEN+EIGEQKRRDEAY+A+KKK G+YDDKFND+ +EKK+LPQYD+P TDEGL L Sbjct: 349 EDVDMLENIEIGEQKRRDEAYKAAKKKTGIYDDKFNDDPSSEKKILPQYDEPATDEGLTL 408 Query: 1431 DSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEEMTXXXXXXXXXX 1252 D+ GRF+GEA RIQGV A++ DLN KI++DY+TQEEM Sbjct: 409 DARGRFTGEAEKKLEELRRRIQGVQANNSTGDLNLSAKITSDYFTQEEMLQFKKPKKKKK 468 Query: 1251 XXXXXXXXXKXXXXXXXXXXXXXXXXG-SRNDGRRQNLKDEQERIDAEMRSNAYRSALAK 1075 SR DGRRQ +++EQE+ +AEM++ AY+SA AK Sbjct: 469 SIRKKEKLDLDALEAEALSAGLGVEDLGSRKDGRRQAIREEQEKSEAEMKNKAYQSAYAK 528 Query: 1074 ADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQEEKSGSFVIKLL 895 A+EA K+LR EQ ++ EEE+ DD+D+L KSLERARK+ALKKQE SG I L Sbjct: 529 AEEAIKSLRMEQTRPVKLEEENEEPIADDEDDLYKSLERARKLALKKQEASSGPEAIARL 588 Query: 894 ASSN-ANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKPESEDVFMEEDVAP 718 A+S ANE ++ N + E KVV TE++EFVWGL + EE +K + +DVFM+ED P Sbjct: 589 ATSQTANEQSTTNE-----ESEEKKVVITELQEFVWGLPVGEEVQKQDRQDVFMDEDEGP 643 Query: 717 STSDQEMRDEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGKGLAATLNLLKGRG 538 T+D EM+DE GGWTEV+E +E +E++EE+ PDETIHE AVGKGLA L+LLK RG Sbjct: 644 RTTDHEMKDEPGGWTEVKETGEEENPSKEDKEEIVPDETIHELAVGKGLAGALSLLKDRG 703 Query: 537 TLKETIEWGGRNMDKKKSKLVGIVGEGGT-----KEIRIERTDEYGRILTPKEAFRLLSH 373 TLKE I+WGGRNMDKKKSKLVG+V + K++RIERTDE+GRI+TPKEAFR++SH Sbjct: 704 TLKEGIDWGGRNMDKKKSKLVGVVDDTPNVDNRFKDLRIERTDEFGRIMTPKEAFRMISH 763 Query: 372 KFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARMREAQAKLKAPYLVLSGHVK 193 KFHGKGPGKMKQEKRM+QYQEELK+KQMKN+DTP+ SV RMREAQA+LK PYLVLSGHVK Sbjct: 764 KFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPTESVERMREAQARLKTPYLVLSGHVK 823 Query: 192 PGQSSDPRSGFATVEKDL-AGGLTPMSGDKKVEHFLNIKRKYENEDSSSQKKPKT 31 PGQ+SDPRSGFATVEKDL AGGLTPM G++KVEHFL IKRK ++E+++S K P+T Sbjct: 824 
PGQTSDPRSGFATVEKDLPAGGLTPMLGNRKVEHFLGIKRKGDSENTNSPKNPRT 878 >ref|XP_003530377.1| PREDICTED: zinc finger CCCH domain-containing protein 13-like [Glycine max] gi|947096175|gb|KRH44760.1| hypothetical protein GLYMA_08G229600 [Glycine max] Length = 882 Score = 764 bits (1974), Expect = 0.0 Identities = 420/726 (57%), Positives = 509/726 (70%), Gaps = 17/726 (2%) Frame = -1 Query: 2157 DQEKDRTSDRDKSIRKQKDESNDMSKDGHSR--FENDYTHNSQASKEKVVNF-DEKSGSK 1987 D +KD+ D+ + ++ D + ++D SR E DY ++ K + DE+ G + Sbjct: 166 DGDKDKGKDKIREKERETDRDKERTRDRVSRKTHEEDYELDNVDDKVDYQDKRDEEIGKQ 225 Query: 1986 ILDQTEKAENRE-------STYELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXX 1828 D +N++ S+ EL+ RI KM+E R K E EI +WVN+S Sbjct: 226 EKDSKLDNDNQDGQTSAHLSSTELEDRILKMKESRTKKQPEADSEISAWVNKSRKIEKKR 285 Query: 1827 XXXXXXALQLSRIFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTL 1648 QLS+IFEEQDN+ E SDDE Q T +L GVKVLHGLDKV+EGG VVLT+ Sbjct: 286 A------FQLSKIFEEQDNIAVEGSDDEDTAQHTD-NLAGVKVLHGLDKVMEGGTVVLTI 338 Query: 1647 KDQSILANGDINEEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQ 1468 KDQ ILA+GD+NE+VDMLEN+EIGEQKRRDEAY+A+KKK GVYDDKF+D+ EKKMLPQ Sbjct: 339 KDQPILADGDVNEDVDMLENIEIGEQKRRDEAYKAAKKKTGVYDDKFHDDPSTEKKMLPQ 398 Query: 1467 YDDPVTDEGLILDSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEE 1288 YDDP +EGL LD GRFSGEA R+ GVS ++ EDL S GK+S+DYYT EE Sbjct: 399 YDDPAAEEGLTLDGKGRFSGEAEKKLEELRRRLTGVSTNTF-EDLTSSGKVSSDYYTHEE 457 Query: 1287 MTXXXXXXXXXXXXXXXXXXXKXXXXXXXXXXXXXXXXGSRNDGRRQNLKDEQERIDAEM 1108 M GSR D RRQ +KDEQER++AEM Sbjct: 458 MLKFKKPKKKKSLRKKDKLDINALEAEAVSSGLGVGDLGSRKDVRRQAIKDEQERLEAEM 517 Query: 1107 RSNAYRSALAKADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQE 928 RSNAY+SA AKADEASK LR EQ ++TEE++ P F DDD++LRKSLE+AR++ALKK+E Sbjct: 518 RSNAYQSAYAKADEASKLLRLEQTLNVKTEEDETPVFVDDDEDLRKSLEKARRLALKKKE 577 Query: 927 EK--SGSFVIKLLASSNANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKPE 754 + SG I LLA+SN N +D+ N ++G+ ENKVVFTEMEEFVWGL +DEE +KPE Sbjct: 578 GEGASGPQAIALLATSNHNNE-TDDQNPTAGESRENKVVFTEMEEFVWGLHIDEEARKPE 636 Query: 753 
SEDVFMEEDVAPSTSDQEMRDEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGKG 574 SEDVFM +D + D+E +E GGWTEVQE DE E++EE+ PDETIHE AVGKG Sbjct: 637 SEDVFMHDDEEANVPDEEKINEVGGWTEVQETSEDEQRNTEDKEEIIPDETIHEVAVGKG 696 Query: 573 LAATLNLLKGRGTLKETIEWGGRNMDKKKSKLVGIVGEGG-----TKEIRIERTDEYGRI 409 L+ L LLK RGTLKE+IEWGGRNMDKKKSKLVGIV + T+EIRIERTDE+GRI Sbjct: 697 LSGALKLLKERGTLKESIEWGGRNMDKKKSKLVGIVDDEEKEAQKTREIRIERTDEFGRI 756 Query: 408 LTPKEAFRLLSHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARMREAQAKL 229 LTPKEAFR++SHKFHGKGPGKMKQEKRM+QY EELK+KQMK++DTPSLSV RMREAQA+L Sbjct: 757 LTPKEAFRMISHKFHGKGPGKMKQEKRMKQYYEELKMKQMKSSDTPSLSVERMREAQARL 816 Query: 228 KAPYLVLSGHVKPGQSSDPRSGFATVEKDLAGGLTPMSGDKKVEHFLNIKRKYENEDSSS 49 + PYLVLSGHVKPGQ+SDP+SGFATVEKDL GGLTPM GD+KVEHFL IKRK E S + Sbjct: 817 QTPYLVLSGHVKPGQTSDPKSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAEPSSSDT 876 Query: 48 QKKPKT 31 KKPK+ Sbjct: 877 PKKPKS 882 >ref|XP_007225495.1| hypothetical protein PRUPE_ppa000914mg [Prunus persica] gi|596285693|ref|XP_007225496.1| hypothetical protein PRUPE_ppa000914mg [Prunus persica] gi|462422431|gb|EMJ26694.1| hypothetical protein PRUPE_ppa000914mg [Prunus persica] gi|462422432|gb|EMJ26695.1| hypothetical protein PRUPE_ppa000914mg [Prunus persica] Length = 963 Score = 763 bits (1971), Expect = 0.0 Identities = 418/758 (55%), Positives = 510/758 (67%), Gaps = 49/758 (6%) Frame = -1 Query: 2157 DQEKDRTSDRDKSIRKQKDESNDMSKDG----HSRFENDYTHNSQASKEKVVNFDEKSGS 1990 D +KD++ RD+ R+ DE+ + SKDG ++ +YT + + KV S Sbjct: 216 DHDKDKS--RDRVSRRSLDENYEWSKDGGRDDKAKLNEEYTGDKDIKQGKV--------S 265 Query: 1989 KILDQTEKAENRE-----STYELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXXX 1825 + KAE S EL++RI K +E+RL K E PE+L+WV+RS Sbjct: 266 HNAEDERKAEGLSGGAHLSALELEERIMKTKEERLKKKKEDVPEVLAWVSRSRKLEDKRN 325 Query: 1824 XXXXXALQLSRIFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTLK 1645 ALQLS+IFEEQDN+ ES+DE QDT+ L GVKVLHGLDKV+EGGAVVLTLK Sbjct: 326 AEKQKALQLSKIFEEQDNIGQGESEDEETAQDTTHDLAGVKVLHGLDKVMEGGAVVLTLK 385 Query: 1644 
DQSILANGDINEEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQY 1465 DQ+ILA+G +NE++DMLENVEIGEQK+RD+AY+A+KKK G+Y DKFND+L EKK+LPQY Sbjct: 386 DQNILADGGVNEDIDMLENVEIGEQKQRDDAYKAAKKKTGIYVDKFNDDLNTEKKILPQY 445 Query: 1464 DDPVTDEGLILDSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEEM 1285 DDPV DEGL LD GRF+GEA RIQGV ++ EDLN G I++D+YTQEEM Sbjct: 446 DDPVPDEGLTLDERGRFTGEAEKKLEELRKRIQGVPTNNRFEDLNMSGNITSDFYTQEEM 505 Query: 1284 TXXXXXXXXXXXXXXXXXXXKXXXXXXXXXXXXXXXXG--SRNDGRRQNLKDEQERIDAE 1111 SRND +RQ K+EQER++AE Sbjct: 506 LQFKKPKKGKKKSLRKKEKLDLDALEAEAVSAGLGVADLGSRNDAKRQANKEEQERLEAE 565 Query: 1110 MRSNAYRSALAKADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQ 931 R++AY+ A AKADEASK+LR EQ+ T+ EE++ PAF DDDD+L KSLERARK+ALKK+ Sbjct: 566 RRNSAYQLAYAKADEASKSLRLEQILTVIPEEDETPAFADDDDDLYKSLERARKLALKKK 625 Query: 930 EEK--SGSFVIKLLASSNANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKP 757 EE+ SG I LLA++ A+ +DN S+G+ +NKVVFTEMEEFVWGLQLDEE KP Sbjct: 626 EEETASGPQAIALLATTTASSQTADNQIPSTGESQDNKVVFTEMEEFVWGLQLDEESHKP 685 Query: 756 ESEDVFMEEDVAPSTSDQEMRDEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGK 577 ESEDVFM+ED P S +E +E GGWTEV+++ DE E++EE+ PDETIHE AVGK Sbjct: 686 ESEDVFMQEDEEPKPSHEERMNEPGGWTEVKDMDEDEKPATEDKEEIVPDETIHEVAVGK 745 Query: 576 GLAATLNLLKGRGTLKETIEWGGRNMDKKKSKLVGIVGEGG------------------- 454 GL+ L LLK RGTLKE IEWGGRNMDKKKSKL+GIV + Sbjct: 746 GLSGVLKLLKDRGTLKEGIEWGGRNMDKKKSKLLGIVDDDDEPKEPHTSRQKKDEHKDTR 805 Query: 453 -----------------TKEIRIERTDEYGRILTPKEAFRLLSHKFHGKGPGKMKQEKRM 325 K+I IERTDE+GR LTPKEAFR LSHKFHGKGPGKMKQEKRM Sbjct: 806 PSSSSHQKETRPSKVYQEKDIHIERTDEFGRTLTPKEAFRTLSHKFHGKGPGKMKQEKRM 865 Query: 324 RQYQEELKIKQMKNADTPSLSVARMREAQAKLKAPYLVLSGHVKPGQSSDPRSGFATVEK 145 +QYQEELK+KQMK++DTPSLS RMR+ QA+L+ PYLVLSGHVKPGQ+SDPRSGFATVEK Sbjct: 866 KQYQEELKLKQMKSSDTPSLSAERMRDTQARLQTPYLVLSGHVKPGQTSDPRSGFATVEK 925 Query: 144 DLAGGLTPMSGDKKVEHFLNIKRKYENEDSSSQKKPKT 31 D GGLTPM GD+KVE++L IKRK E E S + KKPKT Sbjct: 926 DFPGGLTPMLGDRKVENYLGIKRKAEPESSGTPKKPKT 963 >gb|KHN38139.1| U4/U6.U5 tri-snRNP-associated protein 1 
[Glycine soja] Length = 882 Score = 762 bits (1968), Expect = 0.0 Identities = 419/726 (57%), Positives = 508/726 (69%), Gaps = 17/726 (2%) Frame = -1 Query: 2157 DQEKDRTSDRDKSIRKQKDESNDMSKDGHSR--FENDYTHNSQASKEKVVNF-DEKSGSK 1987 D +KD+ D+ + ++ D + ++D SR E DY ++ K + DE+ G + Sbjct: 166 DGDKDKGKDKIREKERETDRDKERTRDRVSRKTHEEDYELDNVDDKVDYQDKRDEEIGKQ 225 Query: 1986 ILDQTEKAENRE-------STYELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXX 1828 D +N++ S+ EL+ RI KM+E R K E EI +WVN+S Sbjct: 226 EKDSKLDNDNQDGQTSAHLSSTELEDRILKMKESRTKKQPEADSEISAWVNKSRKIEKKR 285 Query: 1827 XXXXXXALQLSRIFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTL 1648 QLS+IFEEQDN+ E SDDE Q T +L GVKVLHGLDKV+ GG VVLT+ Sbjct: 286 A------FQLSKIFEEQDNIAVEGSDDEDTAQHTD-NLAGVKVLHGLDKVMAGGTVVLTI 338 Query: 1647 KDQSILANGDINEEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQ 1468 KDQ ILA+GD+NE+VDMLEN+EIGEQKRRDEAY+A+KKK GVYDDKF+D+ EKKMLPQ Sbjct: 339 KDQPILADGDVNEDVDMLENIEIGEQKRRDEAYKAAKKKTGVYDDKFHDDPSTEKKMLPQ 398 Query: 1467 YDDPVTDEGLILDSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEE 1288 YDDP +EGL LD GRFSGEA R+ GVS ++ EDL S GK+S+DYYT EE Sbjct: 399 YDDPAAEEGLTLDGKGRFSGEAEKKLEELRRRLTGVSTNTF-EDLTSSGKVSSDYYTHEE 457 Query: 1287 MTXXXXXXXXXXXXXXXXXXXKXXXXXXXXXXXXXXXXGSRNDGRRQNLKDEQERIDAEM 1108 M GSR D RRQ +KDEQER++AEM Sbjct: 458 MLKFKKPKKKKSLRKKDKLDINALEAEAVSSGLGVGDLGSRKDVRRQAIKDEQERLEAEM 517 Query: 1107 RSNAYRSALAKADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQE 928 RSNAY+SA AKADEASK LR EQ ++TEE++ P F DDD++LRKSLE+AR++ALKK+E Sbjct: 518 RSNAYQSAYAKADEASKLLRLEQTLNVKTEEDETPVFVDDDEDLRKSLEKARRLALKKKE 577 Query: 927 EK--SGSFVIKLLASSNANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKPE 754 + SG I LLA+SN N +D+ N ++G+ ENKVVFTEMEEFVWGL +DEE +KPE Sbjct: 578 GEGASGPQAIALLATSNHNNE-TDDQNPTAGESRENKVVFTEMEEFVWGLHIDEEARKPE 636 Query: 753 SEDVFMEEDVAPSTSDQEMRDEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGKG 574 SEDVFM +D + D+E +E GGWTEVQE DE E++EE+ PDETIHE AVGKG Sbjct: 637 SEDVFMHDDEEANVPDEEKINEVGGWTEVQETSEDEQRNTEDKEEIIPDETIHEVAVGKG 696 Query: 573 
LAATLNLLKGRGTLKETIEWGGRNMDKKKSKLVGIVGEGG-----TKEIRIERTDEYGRI 409 L+ L LLK RGTLKE+IEWGGRNMDKKKSKLVGIV + T+EIRIERTDE+GRI Sbjct: 697 LSGALKLLKERGTLKESIEWGGRNMDKKKSKLVGIVDDEEKEAQKTREIRIERTDEFGRI 756 Query: 408 LTPKEAFRLLSHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARMREAQAKL 229 LTPKEAFR++SHKFHGKGPGKMKQEKRM+QY EELK+KQMK++DTPSLSV RMREAQA+L Sbjct: 757 LTPKEAFRMISHKFHGKGPGKMKQEKRMKQYYEELKMKQMKSSDTPSLSVERMREAQARL 816 Query: 228 KAPYLVLSGHVKPGQSSDPRSGFATVEKDLAGGLTPMSGDKKVEHFLNIKRKYENEDSSS 49 + PYLVLSGHVKPGQ+SDP+SGFATVEKDL GGLTPM GD+KVEHFL IKRK E S + Sbjct: 817 QTPYLVLSGHVKPGQTSDPKSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKAEPSSSDT 876 Query: 48 QKKPKT 31 KKPK+ Sbjct: 877 PKKPKS 882 >ref|XP_010033990.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Eucalyptus grandis] gi|629087518|gb|KCW53875.1| hypothetical protein EUGRSUZ_J03092 [Eucalyptus grandis] Length = 900 Score = 762 bits (1968), Expect = 0.0 Identities = 417/736 (56%), Positives = 513/736 (69%), Gaps = 28/736 (3%) Frame = -1 Query: 2154 QEKDRTSDRDKSIRKQKDESNDMSKDGHSRF---ENDYTHNSQASKEKVV------NFDE 2002 +EK+R RDK K+KD D +K+ +R E D+ + KE+V+ ++D Sbjct: 167 KEKEREKYRDKGREKEKDRVTDEAKEKSNRQRDREEDHDRDRSRDKERVIRKGDAHDYDR 226 Query: 2001 KSGSKI-LDQTEKAEN--------------RESTYELDQRISKMREQRLMKS--SEGAPE 1873 +++ D E+ E+ R ST L RISK +E+RL + SEGA E Sbjct: 227 IKDNRVEFDIAEEKEDVGHGQNPDSALDGTRLSTSNLQDRISKAKEERLKRQPESEGASE 286 Query: 1872 ILSWVNRSXXXXXXXXXXXXXALQLSRIFEEQDNMNDEESDDEPATQDTSKHLGGVKVLH 1693 IL+WVNRS ++LS++FEEQD++ ES+DE + L GVKVLH Sbjct: 287 ILAWVNRSRKLEQKRNAEKEKVMRLSKVFEEQDDIGHGESEDEQEVPRNAHDLAGVKVLH 346 Query: 1692 GLDKVLEGGAVVLTLKDQSILANGDINEEVDMLENVEIGEQKRRDEAYRASKKKVGVYDD 1513 GLDKV+EGGAVVLTLKDQ+ILA+GDINEEVDMLENVEIGEQK RDEAY+A+KKK G+YDD Sbjct: 347 GLDKVVEGGAVVLTLKDQNILADGDINEEVDMLENVEIGEQKHRDEAYKAAKKKSGIYDD 406 Query: 1512 KFNDELGAEKKMLPQYDDPVTDEGLILDSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDL 1333 KF+D+ +EKKMLPQYDDP DEG+ LDSSGR + EA R+QGVS+SSH EDL Sbjct: 407 KFSDDPASEKKMLPQYDDPAQDEGVTLDSSGRLTNEAEKKLEELRRRLQGVSSSSHYEDL 466 Query: 1332 
NSIGKISTDYYTQEEMTXXXXXXXXXXXXXXXXXXXKXXXXXXXXXXXXXXXXGSRNDGR 1153 S K S+DYYTQEE+ GSR DGR Sbjct: 467 TSSAKTSSDYYTQEELLRFRKPKKKKSLRKKEKLDLDALEAEAVSAGLGVGDLGSRKDGR 526 Query: 1152 RQNLKDEQERIDAEMRSNAYRSALAKADEASKALRQEQVHTMQTEEEDAPAFGDDDDELR 973 RQ ++EQE+I+AEMR NA++ A AKA+EAS+ LR EQ ++TE ++ DDD++L Sbjct: 527 RQASREEQEKIEAEMRKNAFQLAYAKAEEASRLLRVEQTLPVKTENDENMVIADDDEDLY 586 Query: 972 KSLERARKIALKKQEEK--SGSFVIKLLASSNANEPASDNPNISSGDQSENKVVFTEMEE 799 KSLERARK+ALKKQEEK SG I L ASS + ++N ++++G+ E++VV TE+E Sbjct: 587 KSLERARKLALKKQEEKGASGPKAIALRASSIPSTHNAENQSVTTGESQESRVVMTEIEG 646 Query: 798 FVWGLQLDEEEKKPESEDVFMEEDVAPSTSDQEMRDEDGGWTEVQEIMPDEIHMQENEEE 619 FV GL++DE +KP++EDVFM+ED AP TSD E++DE GGWTE +E DE + E+EEE Sbjct: 647 FVSGLEVDEVSRKPDTEDVFMDEDEAPVTSDNEVKDEPGGWTEFKEFGNDEGSVNEDEEE 706 Query: 618 VAPDETIHEPAVGKGLAATLNLLKGRGTLKETIEWGGRNMDKKKSKLVGIVGEGGTKEIR 439 V PDETIHE AVGKGL+ L LLK RGTLKET+EWGGRNMDKKKSKLVGI +GG KEIR Sbjct: 707 VVPDETIHEAAVGKGLSGALKLLKDRGTLKETVEWGGRNMDKKKSKLVGI-ADGGQKEIR 765 Query: 438 IERTDEYGRILTPKEAFRLLSHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSV 259 IERTDE+GRILTPKEAFRLLSHKFHGKGPGKMKQEKRM+QY EELK+KQMKN+DTPS S Sbjct: 766 IERTDEFGRILTPKEAFRLLSHKFHGKGPGKMKQEKRMKQYHEELKLKQMKNSDTPSSSA 825 Query: 258 ARMREAQAKLKAPYLVLSGHVKPGQSSDPRSGFATVEKDLAGGLTPMSGDKKVEHFLNIK 79 RMREAQA++K PYLVLSGHVKPGQ+SDPRSGFAT+EKD G LTPM GD+KVEHFL IK Sbjct: 826 ERMREAQAQMKTPYLVLSGHVKPGQNSDPRSGFATIEKD-PGSLTPMLGDRKVEHFLGIK 884 Query: 78 RKYENEDSSSQKKPKT 31 RK E + + KKPK+ Sbjct: 885 RKPEPSNLGASKKPKS 900 >ref|XP_011011623.1| PREDICTED: SART-1 family protein DOT2 isoform X2 [Populus euphratica] Length = 859 Score = 760 bits (1962), Expect = 0.0 Identities = 419/720 (58%), Positives = 504/720 (70%), Gaps = 12/720 (1%) Frame = -1 Query: 2157 DQEKDRTSDRDKSIRKQKDESNDMSKDGHSRFENDYTHNSQASKEKVVNFDEKSGSKILD 1978 ++E+DR +D+DK ++KD + S+ G+ E DY Q E V+ D + K+ Sbjct: 147 ERERDREADQDKERSREKDRA---SRKGN---EEDYDDKVQMDYEDEVDKDNRKQGKVSF 200 Query: 1977 QTEKAENRESTY----ELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXXXXXXXX 1810 
+ E ++ E + EL+QRI KM+E+R K SE +IL+WV RS Sbjct: 201 RDEGEQSAEGAHSSASELEQRILKMKEERTKKKSEAGSDILAWVGRSRKIEENKHAAKAR 260 Query: 1809 ALQLSRIFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTLKDQSIL 1630 A LS+IFEEQDN+ SDDE A Q + +L G+KVL GLDKVLEGGAVVLTLKDQ+IL Sbjct: 261 AKHLSKIFEEQDNIGQGGSDDEEADQHNAYNLAGIKVLDGLDKVLEGGAVVLTLKDQNIL 320 Query: 1629 ANGDINEEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQYDDPVT 1450 A+GDINEEVDMLENVEIGEQKRRDEAY+A+KKK G+YDDKFND+ +EKKMLPQYDD Sbjct: 321 ADGDINEEVDMLENVEIGEQKRRDEAYKAAKKKTGIYDDKFNDDPASEKKMLPQYDDANA 380 Query: 1449 DEGLILDSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEEMTXXXX 1270 DEG+ LD GRF+GEA R+QG S S+ EDLNS GKIS+DY+T EEM Sbjct: 381 DEGITLDERGRFTGEAEKKLEELRRRLQGTSTSARLEDLNSSGKISSDYFTHEEMLKFKK 440 Query: 1269 XXXXXXXXXXXXXXXKXXXXXXXXXXXXXXXXGSRNDGRRQNLKDEQERIDAEMRSNAYR 1090 GSR DGRRQ +++EQER AEMR+NAY+ Sbjct: 441 PKKKKSLRKKDKLDIDALEAEAVSAGLGIGDLGSRKDGRRQAIREEQERSAAEMRNNAYQ 500 Query: 1089 SALAKADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQE-EKSGS 913 SA AKADEASK+LR +Q + EEE+ F DD+++L KSLERARK+ALKKQE E SG Sbjct: 501 SAYAKADEASKSLRLDQTLQTKVEEEENLVFADDEEDLYKSLERARKLALKKQEAEASGP 560 Query: 912 FVIKLLASSNANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKPESEDVFME 733 I LAS+ + +D+ N +G+ ENK+VFTEMEEFV +QL E K P++EDVFM+ Sbjct: 561 LAIAHLASTTLSSQIADDKNPETGESHENKLVFTEMEEFVSAIQLAEVHK-PDNEDVFMD 619 Query: 732 EDVAPSTSDQEMRDEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGKGLAATLNL 553 ED P SD+E +DE GGW EV + DE + E +EE+ PDETIHE AVGKGL+ L L Sbjct: 620 EDEPPRVSDEEQKDEAGGWMEVPDNSKDENPVNE-DEEIVPDETIHEVAVGKGLSGALKL 678 Query: 552 LKGRGTLKETIEWGGRNMDKKKSKLVGIVGEG-GT------KEIRIERTDEYGRILTPKE 394 LK RGTLKE+I+WGGRNMDKKKSKLVGIV + GT K+IRIERTDE+GRI+TPKE Sbjct: 679 LKERGTLKESIDWGGRNMDKKKSKLVGIVDDDVGTNNDNKFKDIRIERTDEFGRIMTPKE 738 Query: 393 AFRLLSHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARMREAQAKLKAPYL 214 AFR++SHKFHGKGPGKMKQEKRM+QYQEELK+KQMKN+DTPSLSV RMR AQA+LK PYL Sbjct: 739 AFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERMRGAQAQLKTPYL 798 Query: 213 
VLSGHVKPGQSSDPRSGFATVEKDLAGGLTPMSGDKKVEHFLNIKRKYENEDSSSQKKPK 34 VLSGHVKPGQ+SDPRSGFATVEKD GGLTPM GDKKVEHFL IKRK E S + KKPK Sbjct: 799 VLSGHVKPGQTSDPRSGFATVEKDFPGGLTPMLGDKKVEHFLGIKRKPETGFSGAPKKPK 858 >ref|XP_002297938.2| hypothetical protein POPTR_0001s11550g [Populus trichocarpa] gi|550347020|gb|EEE82743.2| hypothetical protein POPTR_0001s11550g [Populus trichocarpa] Length = 862 Score = 760 bits (1962), Expect = 0.0 Identities = 417/724 (57%), Positives = 503/724 (69%), Gaps = 16/724 (2%) Frame = -1 Query: 2157 DQEKDRTSDRDKSIRKQKDESNDMSKDGHSRFENDYTHNSQASKEKVVNFDEKSGSKILD 1978 ++E+DR +D+DK ++KD ++ S E DY Q E V+ D + K+ Sbjct: 145 ERERDREADQDKERSREKDRASRKSN------EEDYDDKVQMDYEDEVDKDNRKQGKVSF 198 Query: 1977 QTEKAENRE--------STYELDQRISKMREQRLMKSSEGAPEILSWVNRSXXXXXXXXX 1822 + E ++ E S EL QRI KM+E+R K SE +IL+WV +S Sbjct: 199 RDEDDQSAEGASAGAHSSASELGQRILKMKEERTKKKSEPGSDILAWVGKSRKIEENKYA 258 Query: 1821 XXXXALQLSRIFEEQDNMNDEESDDEPATQDTSKHLGGVKVLHGLDKVLEGGAVVLTLKD 1642 A LS+IFEEQDN+ SDDE A Q + +L G+KVL GLDKVLEGGAVVLTLKD Sbjct: 259 AKKRAKHLSKIFEEQDNIGQGGSDDEEADQHNAYNLAGIKVLDGLDKVLEGGAVVLTLKD 318 Query: 1641 QSILANGDINEEVDMLENVEIGEQKRRDEAYRASKKKVGVYDDKFNDELGAEKKMLPQYD 1462 Q+ILA+GDINEEVDMLENVEIGEQKRRDEAY+A+KKK G+Y+DKFND+ +EKKMLPQYD Sbjct: 319 QNILADGDINEEVDMLENVEIGEQKRRDEAYKAAKKKTGIYEDKFNDDPASEKKMLPQYD 378 Query: 1461 DPVTDEGLILDSSGRFSGEAXXXXXXXXXRIQGVSASSHGEDLNSIGKISTDYYTQEEMT 1282 D DEG+ LD GRF+GEA R+QG S S+ EDLNS GKIS+DY+T EEM Sbjct: 379 DANADEGVTLDERGRFTGEAEKKLEELRRRLQGTSTSARLEDLNSSGKISSDYFTHEEML 438 Query: 1281 XXXXXXXXXXXXXXXXXXXKXXXXXXXXXXXXXXXXGSRNDGRRQNLKDEQERIDAEMRS 1102 GSR DGRRQ +++EQER +AEMR+ Sbjct: 439 QFKKPKKKKSLRKKDKLDIDALEAEAVSAGLGIGDLGSRKDGRRQAIREEQERSEAEMRN 498 Query: 1101 NAYRSALAKADEASKALRQEQVHTMQTEEEDAPAFGDDDDELRKSLERARKIALKKQE-E 925 NAY+SA AKADEASK+LR ++ + EEE+ F DD+++L KSLERARK+ALKKQE E Sbjct: 499 NAYQSAYAKADEASKSLRLDRTLQTKVEEEENLVFADDEEDLYKSLERARKLALKKQEAE 558 Query: 924 KSGSFVIKLLASSNANEPASDNPNISSGDQSENKVVFTEMEEFVWGLQLDEEEKKPESED 745 SG I LAS+ + +D+ N 
+G+ ENK+VFTEMEEFV +QL EE KP++ED Sbjct: 559 ASGPLAIAHLASTTLSSQIADDKNPETGESHENKLVFTEMEEFVSAIQLAEEVHKPDNED 618 Query: 744 VFMEEDVAPSTSDQEMRDEDGGWTEVQEIMPDEIHMQENEEEVAPDETIHEPAVGKGLAA 565 VFM+ED P SD+E +DE GGW EV + DE + E +EE+ PDETIHE AVGKGL+ Sbjct: 619 VFMDEDEPPRVSDEEQKDEAGGWMEVPDNSKDENPVNE-DEEIVPDETIHEVAVGKGLSG 677 Query: 564 TLNLLKGRGTLKETIEWGGRNMDKKKSKLVGIVGEG-GT------KEIRIERTDEYGRIL 406 L LLK RGTLKE+I+WGGRNMDKKKSKLVGIV + GT K+IRIERTDE+GRI+ Sbjct: 678 ALKLLKERGTLKESIDWGGRNMDKKKSKLVGIVDDDVGTNNDNKFKDIRIERTDEFGRIM 737 Query: 405 TPKEAFRLLSHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNADTPSLSVARMREAQAKLK 226 TPKEAFR++SHKFHGKGPGKMKQEKRM+QYQEELK+KQMKN+DTPSLSV RMR AQA+LK Sbjct: 738 TPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERMRGAQAQLK 797 Query: 225 APYLVLSGHVKPGQSSDPRSGFATVEKDLAGGLTPMSGDKKVEHFLNIKRKYENEDSSSQ 46 PYLVLSGHVKPGQ+SDPRSGFATVEKD GGLTPM GDKKVEHFL IKRK E S + Sbjct: 798 TPYLVLSGHVKPGQTSDPRSGFATVEKDFPGGLTPMLGDKKVEHFLGIKRKPETGFSGAP 857 Query: 45 KKPK 34 KKPK Sbjct: 858 KKPK 861