BLASTX nr result

ID: Rehmannia27_contig00004960 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia27_contig00004960
         (2045 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011094061.1| PREDICTED: SART-1 family protein DOT2 [Sesam...   598   0.0  
ref|XP_012851195.1| PREDICTED: SART-1 family protein DOT2 [Eryth...   544   e-179
ref|XP_010656678.1| PREDICTED: SART-1 family protein DOT2 [Vitis...   464   e-148
emb|CAN77549.1| hypothetical protein VITISV_017244 [Vitis vinifera]   451   e-146
ref|XP_007022028.1| U4/U6.U5 tri-snRNP-associated protein 1 isof...   447   e-145
ref|XP_007022029.1| U4/U6.U5 tri-snRNP-associated protein 1 isof...   447   e-144
ref|XP_009782950.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   448   e-143
ref|XP_007022027.1| U4/U6.U5 tri-snRNP-associated protein 1 isof...   447   e-143
ref|XP_009630824.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   449   e-143
ref|XP_007022025.1| U4/U6.U5 tri-snRNP-associated protein 1 isof...   447   e-142
ref|XP_012077379.1| PREDICTED: SART-1 family protein DOT2 isofor...   444   e-141
gb|KJB61482.1| hypothetical protein B456_009G361400 [Gossypium r...   431   e-139
gb|KVH99952.1| SART-1 protein [Cynara cardunculus var. scolymus]      436   e-138
ref|XP_008390895.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   435   e-137
gb|KHG25959.1| U4/U6.U5 tri-snRNP-associated 1 [Gossypium arboreum]   435   e-137
gb|KJB61483.1| hypothetical protein B456_009G361400 [Gossypium r...   431   e-137
gb|EPS63268.1| hypothetical protein M569_11517, partial [Genlise...   428   e-136
ref|XP_012441144.1| PREDICTED: SART-1 family protein DOT2 [Gossy...   431   e-136
ref|XP_004250062.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   431   e-136
ref|XP_010256356.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   428   e-135

>ref|XP_011094061.1| PREDICTED: SART-1 family protein DOT2 [Sesamum indicum]
          Length = 942

 Score =  598 bits (1541), Expect = 0.0
 Identities = 323/454 (71%), Positives = 347/454 (76%), Gaps = 19/454 (4%)
 Frame = +1

Query: 739  NGKHRGKDTADRERVKDRSREKDKQADQEKDRTRDRERSSRKQKDDSHDMAKDIEK---- 906
            NG+ RGKDTADRE+ K+R+REK+KQADQEKDR RDRERSSRKQKD+SHD +KD +K    
Sbjct: 197  NGRDRGKDTADREKGKERNREKEKQADQEKDRARDRERSSRKQKDESHDRSKDTDKDGHS 256

Query: 907  --------------ERVDNSDDEYDSNILKQQEKDVIAGAGYHQSASELEERISKMREER 1044
                          E  DNSDDE DS ILK QEK   A AG  QSASELE+RISKMREER
Sbjct: 257  RLENDYSRDKQSTKELADNSDDENDSKILKHQEKADTAIAGSRQSASELEDRISKMREER 316

Query: 1045 LIKPSEGASEVLSWVNRSRKLEEKRSVEKEKALQRSKVFEEQDNMIDEESDDEPATQHTK 1224
            L KPSEGASEVL+WVNRSRKLEEKR+ EKEKALQ SK+FEEQDNM   ESD+E A +HT 
Sbjct: 317  LKKPSEGASEVLAWVNRSRKLEEKRTAEKEKALQLSKIFEEQDNMNGGESDEEAAAEHTT 376

Query: 1225 QHLGGVKILHGLDKVLEGGAVVLTLKDQSILADGDINQEVDMLENVEIGEQXXXXXXXXX 1404
            Q LGGVKILHGLDKVLEGGAVVLTLKDQSILADGDIN+EVDMLENVEIGEQ         
Sbjct: 377  QDLGGVKILHGLDKVLEGGAVVLTLKDQSILADGDINEEVDMLENVEIGEQKRRDEAYKA 436

Query: 1405 XXXXTGVYDDKFNDEPGAEKKILPQYDDPVADEGVTLDSSGRFTGEAXXXXXXXXXXING 1584
                TG+YDDKF+DEPGAEKKILPQYDDPVADEGVTLDSSGRFTGEA          I G
Sbjct: 437  AKKKTGIYDDKFSDEPGAEKKILPQYDDPVADEGVTLDSSGRFTGEAERKLEELRRRIQG 496

Query: 1585 VSASTRGEDLNSTGKISTDYYTQEEMTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLGS 1764
            VS STRGEDLNST KI TDYYTQ+EMT                             GLG+
Sbjct: 497  VSTSTRGEDLNSTAKILTDYYTQDEMTKFKKPKKKKSLRKKEKLDLDALEAEARSAGLGA 556

Query: 1765 EDLGSRNDGRRQNLREEQDRIEAEMRSNAYQSAYAKADEASKALRQEQVRSMQIEEDDTP 1944
             DLGSRNDGRRQNLREEQ++IEAEMR NAY+SAYAKADEASKALRQEQV +MQ EEDD P
Sbjct: 557  GDLGSRNDGRRQNLREEQEKIEAEMRRNAYESAYAKADEASKALRQEQVPAMQTEEDDAP 616

Query: 1945 VFGDDDDELRKSLERARKVALKKQ-EEEKSGPQV 2043
            VFGDDDDELRKSLERARK+ALKKQ EEEKS PQV
Sbjct: 617  VFGDDDDELRKSLERARKIALKKQDEEEKSAPQV 650



 Score = 80.9 bits (198), Expect = 3e-12
 Identities = 42/80 (52%), Positives = 46/80 (57%)
 Frame = +1

Query: 151 MGADLAESRRGRSVEKQKQDDMPMRERWNGEYDDFGKNESDEVQXXXXXXXXXXXXXXXX 330
           MGADLAESRRGRSVE   QDDMPMRERW GEYDD   NE DEV+                
Sbjct: 1   MGADLAESRRGRSVETSDQDDMPMRERWTGEYDDLEGNEQDEVRDSEKYRSKDKNKNSGR 60

Query: 331 XXXXXXXXXDRDRSKALDGV 390
                    D ++SKALDG+
Sbjct: 61  REEKEHRTKDHEKSKALDGL 80


>ref|XP_012851195.1| PREDICTED: SART-1 family protein DOT2 [Erythranthe guttata]
            gi|604311746|gb|EYU25740.1| hypothetical protein
            MIMGU_mgv1a000914mg [Erythranthe guttata]
          Length = 944

 Score =  544 bits (1401), Expect = e-179
 Identities = 295/454 (64%), Positives = 341/454 (75%), Gaps = 19/454 (4%)
 Frame = +1

Query: 739  NGKHRGKDTADRERVKDRSREKDKQADQEKDRTRDRERSSRKQKDDSHDMAKDIEKE--- 909
            NGKHRGK+T +RE+ K+++REK+KQ DQEK+R RDR+RSSRKQKD+S+DM KD EK+   
Sbjct: 199  NGKHRGKNTDEREKGKEKNREKEKQGDQEKERARDRDRSSRKQKDESYDMVKDTEKDGHL 258

Query: 910  ---------------RVDNSDDEYDSNILKQQEKDVIAGAGYHQSASELEERISKMREER 1044
                           RVDNSD E DS ILKQQ++   +  G  QSAS+L ERISKMR+ER
Sbjct: 259  RLENDYSRDNQSNKVRVDNSDGENDSKILKQQDRAEKSVDGNSQSASDLGERISKMRQER 318

Query: 1045 LIKPSEGASEVLSWVNRSRKLEEKRSVEKEKALQRSKVFEEQDNMIDEESDDEPATQHTK 1224
            L+K SEGASEVL+WVNRSRKLE+KR+ EKEKALQ SKVFEEQDNM D +SDDE ATQ   
Sbjct: 319  LVKSSEGASEVLAWVNRSRKLEDKRT-EKEKALQLSKVFEEQDNMNDGDSDDEAATQAVT 377

Query: 1225 QHLGGVKILHGLDKVLEGGAVVLTLKDQSILADGDINQEVDMLENVEIGEQXXXXXXXXX 1404
            + LGGVK+LHGL+KVLEGGA+VLTLKDQSILADGD+NQEVDMLENVEIGEQ         
Sbjct: 378  ESLGGVKVLHGLEKVLEGGAIVLTLKDQSILADGDVNQEVDMLENVEIGEQKRRNEAYGA 437

Query: 1405 XXXXTGVYDDKFNDEPGAEKKILPQYDDPVADEGVTLDSSGRFTGEAXXXXXXXXXXING 1584
                TGVY DKF+DEPG EKK+LPQYDDPVADEG+TLDS+GRFTGEA          I G
Sbjct: 438  AKKKTGVYVDKFSDEPGTEKKMLPQYDDPVADEGLTLDSTGRFTGEAERKLEELRKRIQG 497

Query: 1585 VSASTRGEDLNSTGKISTDYYTQEEMTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLGS 1764
            V AST GEDLNST KISTDYYTQEEMT                             GLG+
Sbjct: 498  VPASTYGEDLNSTLKISTDYYTQEEMTKFKKPKKKKSLRKREKLDIDALEAEAVTAGLGA 557

Query: 1765 EDLGSRNDGRRQNLREEQDRIEAEMRSNAYQSAYAKADEASKALRQEQVRSMQIEEDDTP 1944
             DLGSRNDGR+QNL++EQ+R++AEMRSNA+QSAYAKA+EASKALR  +V  M+ E+DDT 
Sbjct: 558  GDLGSRNDGRKQNLKKEQERVDAEMRSNAFQSAYAKAEEASKALRPGKVNIMRTEDDDT- 616

Query: 1945 VFGDDDDELRKSLERARKVALKKQEE-EKSGPQV 2043
            VFGDDDDELRKSLERARK+A KKQ+E EK GPQ+
Sbjct: 617  VFGDDDDELRKSLERARKIAFKKQDEKEKPGPQM 650



 Score = 71.6 bits (174), Expect = 2e-09
 Identities = 39/80 (48%), Positives = 43/80 (53%)
 Frame = +1

Query: 151 MGADLAESRRGRSVEKQKQDDMPMRERWNGEYDDFGKNESDEVQXXXXXXXXXXXXXXXX 330
           MG+D AE  RGRSVEK+ QDD P RERW+GEYDD  KN SDEV                 
Sbjct: 1   MGSDNAEPSRGRSVEKRNQDDSPTRERWSGEYDDAEKNGSDEVLVAGKHRSKDKSKSSGR 60

Query: 331 XXXXXXXXXDRDRSKALDGV 390
                    DR+RSKA D V
Sbjct: 61  REEKEHRSRDRERSKAFDSV 80


>ref|XP_010656678.1| PREDICTED: SART-1 family protein DOT2 [Vitis vinifera]
            gi|296090475|emb|CBI40671.3| unnamed protein product
            [Vitis vinifera]
          Length = 944

 Score =  464 bits (1193), Expect = e-148
 Identities = 245/453 (54%), Positives = 316/453 (69%), Gaps = 19/453 (4%)
 Frame = +1

Query: 739  NGKHRGKDTADRERVKDRSREKDKQADQEKDRTRDRERSSRKQKDDSHDMAKDIEKE--- 909
            N K R +D  D+E+ K+R R+K+++ADQ++DR +DR++ SRK +D+ HD +KD  K+   
Sbjct: 200  NDKDRDRDAIDKEKGKERIRDKEREADQDRDRYKDRDKGSRKNRDEGHDRSKDGGKDDKL 259

Query: 910  RVDNSD---------------DEYDSNILKQQEKDVIAGAGYHQSASELEERISKMREER 1044
            ++D  D               DE DS  + + EK+    +G   S ++L+ERI +M+EER
Sbjct: 260  KLDGGDNRDRDVTKQGRGSHHDEDDSRAI-EHEKNAEGASGPQSSTAQLQERILRMKEER 318

Query: 1045 LIKPSEGASEVLSWVNRSRKLEEKRSVEKEKALQRSKVFEEQDNMIDEESDDEPATQHTK 1224
            + + SEG+SEVL+WVNRSRK+EE+R+ EKEKALQ SK+FEEQDN+   ESDDE  T+H+ 
Sbjct: 319  VKRKSEGSSEVLAWVNRSRKVEEQRNAEKEKALQLSKIFEEQDNIDQGESDDEKPTRHSS 378

Query: 1225 QHLGGVKILHGLDKVLEGGAVVLTLKDQSILADGDINQEVDMLENVEIGEQXXXXXXXXX 1404
            Q L GVK+LHGLDKV+EGGAVVLTLKDQ ILA+GDIN++VDMLENVEIGEQ         
Sbjct: 379  QDLAGVKVLHGLDKVIEGGAVVLTLKDQDILANGDINEDVDMLENVEIGEQKRRDEAYKA 438

Query: 1405 XXXXTGVYDDKFNDEPGAEKKILPQYDDPVADEGVTLDSSGRFTGEAXXXXXXXXXXING 1584
                TG+Y+DKFNDEPG+EKKILPQYDDPV DEG+ LD+SGRFTGEA          + G
Sbjct: 439  AKKKTGIYEDKFNDEPGSEKKILPQYDDPVTDEGLALDASGRFTGEAEKKLEELRRRLQG 498

Query: 1585 VSASTRGEDLNSTGKISTDYYTQEEMTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLGS 1764
            VS + R EDLN+ GK S+DYYT EEM                              GLG 
Sbjct: 499  VSTNNRFEDLNTYGKNSSDYYTHEEMLQFKKPKKKKSLRKKEKLNIDALEAEAVSAGLGV 558

Query: 1765 EDLGSRNDGRRQNLREEQDRIEAEMRSNAYQSAYAKADEASKALRQEQVRSMQIEEDDTP 1944
             DLGSRNDG+RQ++REEQ+R EAEMR++AYQ AYAKADEASKALR +Q   +Q+EE++  
Sbjct: 559  GDLGSRNDGKRQSIREEQERSEAEMRNSAYQLAYAKADEASKALRLDQTLPVQLEENENQ 618

Query: 1945 VFGDDDDELRKSLERARKVALKKQEE-EKSGPQ 2040
            VFG+DD+EL+KSL+RARK+ L+KQ+E   SGPQ
Sbjct: 619  VFGEDDEELQKSLQRARKLVLQKQDEAATSGPQ 651


>emb|CAN77549.1| hypothetical protein VITISV_017244 [Vitis vinifera]
          Length = 710

 Score =  451 bits (1160), Expect = e-146
 Identities = 244/472 (51%), Positives = 316/472 (66%), Gaps = 38/472 (8%)
 Frame = +1

Query: 739  NGKHRGKDTADRERVKDRSREKDKQADQEKDRTRDRERSSRKQKDDSHDMAKDIEKE--- 909
            N K R +D  D+E+ K+R R+K+++ADQ++DR +DR++ SRK +D+ HD +KD  K+   
Sbjct: 188  NDKDRDRDAIDKEKGKERIRDKEREADQDRDRYKDRDKGSRKNRDEGHDRSKDGGKDDKL 247

Query: 910  RVDNSD---------------DEYDSNILKQQEKDVIAGAGYHQSASELEERISKMREER 1044
            ++D  D               DE DS  + + EK+    +G   S ++L+ERI +M+EER
Sbjct: 248  KLDGGDNRDRDVTKQGRGSHHDEDDSRAI-EHEKNAEGASGPQSSTAQLQERILRMKEER 306

Query: 1045 LIKPSEGASEVLSWVNRSRKLEEKRSVEKEKALQRSKVFEEQDNMIDEESDDEPATQHTK 1224
            + + SEG+SEVL+WVNRSRK+EE+R+ EKEKALQ SK+FEEQDN+   ESDDE  T+H+ 
Sbjct: 307  VKRKSEGSSEVLAWVNRSRKVEEQRNAEKEKALQLSKIFEEQDNIDQGESDDEKPTRHSS 366

Query: 1225 -------------------QHLGGVKILHGLDKVLEGGAVVLTLKDQSILADGDINQEVD 1347
                               + L GVK+LHGLDKV+EGGAVVLTLKDQ ILA+GDIN++VD
Sbjct: 367  RMKDSWPYRSHFYFEHLIPEDLAGVKVLHGLDKVIEGGAVVLTLKDQDILANGDINEDVD 426

Query: 1348 MLENVEIGEQXXXXXXXXXXXXXTGVYDDKFNDEPGAEKKILPQYDDPVADEGVTLDSSG 1527
            MLENVEIGEQ             TG+Y+DKFNDEPG+EKKILPQYDDPV DEG+ LD+SG
Sbjct: 427  MLENVEIGEQKRRDEAYKAAKKKTGIYEDKFNDEPGSEKKILPQYDDPVTDEGLALDASG 486

Query: 1528 RFTGEAXXXXXXXXXXINGVSASTRGEDLNSTGKISTDYYTQEEMTXXXXXXXXXXXXXX 1707
            RFTGEA          + GVS + R EDLN+ GK S+DYYT EEM               
Sbjct: 487  RFTGEAEKKLEELRRRLQGVSTNNRFEDLNTYGKNSSDYYTHEEMLQFKKPKKKKSLRKK 546

Query: 1708 XXXXXXXXXXXXXXXGLGSEDLGSRNDGRRQNLREEQDRIEAEMRSNAYQSAYAKADEAS 1887
                           GLG  DLGSRNDG+RQ++REEQ+R EAEMR++AYQ AYAKADEAS
Sbjct: 547  EKLNIDALEAEAVSAGLGVGDLGSRNDGKRQSIREEQERSEAEMRNSAYQLAYAKADEAS 606

Query: 1888 KALRQEQVRSMQIEEDDTPVFGDDDDELRKSLERARKVALKKQEE-EKSGPQ 2040
            KALR +Q   +Q+EE++  VFG+DD+EL+KSL+RARK+ L+KQ+E   SGPQ
Sbjct: 607  KALRLDQTLPVQLEENENQVFGEDDEELQKSLQRARKLVLQKQDEAATSGPQ 658


>ref|XP_007022028.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 4, partial [Theobroma
            cacao] gi|508721656|gb|EOY13553.1| U4/U6.U5
            tri-snRNP-associated protein 1 isoform 4, partial
            [Theobroma cacao]
          Length = 675

 Score =  447 bits (1151), Expect = e-145
 Identities = 240/425 (56%), Positives = 297/425 (69%), Gaps = 1/425 (0%)
 Frame = +1

Query: 769  DRERVKDRSREKDKQADQEKDRTRDRERSSRKQKDDSHDMAKDIEKERVDNSDDEYDSNI 948
            DRE+ K+RS++K ++AD EK+R+RDR+ + +K  ++ ++ +KD E        D  DS  
Sbjct: 82   DREKGKERSKQKSREADLEKERSRDRDNAIKKNHEEDYEGSKDGELAL-----DYGDSRD 136

Query: 949  LKQQEKDVIAGAGYHQ-SASELEERISKMREERLIKPSEGASEVLSWVNRSRKLEEKRSV 1125
              + E +  + AG  Q S+SELEERI++M+EERL K SEG SEVL WV   RKLEEKR+ 
Sbjct: 137  KDEAELNAGSNAGVAQASSSELEERIARMKEERLKKKSEGVSEVLEWVGNFRKLEEKRNA 196

Query: 1126 EKEKALQRSKVFEEQDNMIDEESDDEPATQHTKQHLGGVKILHGLDKVLEGGAVVLTLKD 1305
            EKEKALQRSK+FEEQD+ +  E++DE A +H    L GVK+LHGLDKV++GGAVVLTLKD
Sbjct: 197  EKEKALQRSKIFEEQDDFVQGENEDEEAVRHAAHDLAGVKVLHGLDKVMDGGAVVLTLKD 256

Query: 1306 QSILADGDINQEVDMLENVEIGEQXXXXXXXXXXXXXTGVYDDKFNDEPGAEKKILPQYD 1485
            QSILA+GDIN++VDMLENVEIGEQ             TGVYDDKFNDEPG+EKKILPQYD
Sbjct: 257  QSILANGDINEDVDMLENVEIGEQRRRDEAYKAAKKKTGVYDDKFNDEPGSEKKILPQYD 316

Query: 1486 DPVADEGVTLDSSGRFTGEAXXXXXXXXXXINGVSASTRGEDLNSTGKISTDYYTQEEMT 1665
            +PVADEGVTLD  GRFTGEA          + GV  + R EDLN+ GKI++DYYTQEEM 
Sbjct: 317  NPVADEGVTLDERGRFTGEAEKKLQELRKRLQGVPTNNRVEDLNNAGKIASDYYTQEEML 376

Query: 1666 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLGSEDLGSRNDGRRQNLREEQDRIEAEMRS 1845
                                         GLG+ DLGSRND RRQ +REE+ R EAE R+
Sbjct: 377  KFKKPKKKKALRKKEKLDIDALEAEAISSGLGAGDLGSRNDARRQAIREEEARSEAEKRN 436

Query: 1846 NAYQSAYAKADEASKALRQEQVRSMQIEEDDTPVFGDDDDELRKSLERARKVALKKQEEE 2025
            +AYQSAYAKADEASK+L  EQ   ++ EED+  VF DDDD+L KS+ER+RK+A KKQE+E
Sbjct: 437  SAYQSAYAKADEASKSLWLEQTLIVKPEEDENQVFADDDDDLYKSIERSRKLAFKKQEDE 496

Query: 2026 KSGPQ 2040
            KSGPQ
Sbjct: 497  KSGPQ 501


>ref|XP_007022029.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 5, partial [Theobroma
            cacao] gi|508721657|gb|EOY13554.1| U4/U6.U5
            tri-snRNP-associated protein 1 isoform 5, partial
            [Theobroma cacao]
          Length = 807

 Score =  447 bits (1151), Expect = e-144
 Identities = 240/425 (56%), Positives = 297/425 (69%), Gaps = 1/425 (0%)
 Frame = +1

Query: 769  DRERVKDRSREKDKQADQEKDRTRDRERSSRKQKDDSHDMAKDIEKERVDNSDDEYDSNI 948
            DRE+ K+RS++K ++AD EK+R+RDR+ + +K  ++ ++ +KD E        D  DS  
Sbjct: 82   DREKGKERSKQKSREADLEKERSRDRDNAIKKNHEEDYEGSKDGELAL-----DYGDSRD 136

Query: 949  LKQQEKDVIAGAGYHQ-SASELEERISKMREERLIKPSEGASEVLSWVNRSRKLEEKRSV 1125
              + E +  + AG  Q S+SELEERI++M+EERL K SEG SEVL WV   RKLEEKR+ 
Sbjct: 137  KDEAELNAGSNAGVAQASSSELEERIARMKEERLKKKSEGVSEVLEWVGNFRKLEEKRNA 196

Query: 1126 EKEKALQRSKVFEEQDNMIDEESDDEPATQHTKQHLGGVKILHGLDKVLEGGAVVLTLKD 1305
            EKEKALQRSK+FEEQD+ +  E++DE A +H    L GVK+LHGLDKV++GGAVVLTLKD
Sbjct: 197  EKEKALQRSKIFEEQDDFVQGENEDEEAVRHAAHDLAGVKVLHGLDKVMDGGAVVLTLKD 256

Query: 1306 QSILADGDINQEVDMLENVEIGEQXXXXXXXXXXXXXTGVYDDKFNDEPGAEKKILPQYD 1485
            QSILA+GDIN++VDMLENVEIGEQ             TGVYDDKFNDEPG+EKKILPQYD
Sbjct: 257  QSILANGDINEDVDMLENVEIGEQRRRDEAYKAAKKKTGVYDDKFNDEPGSEKKILPQYD 316

Query: 1486 DPVADEGVTLDSSGRFTGEAXXXXXXXXXXINGVSASTRGEDLNSTGKISTDYYTQEEMT 1665
            +PVADEGVTLD  GRFTGEA          + GV  + R EDLN+ GKI++DYYTQEEM 
Sbjct: 317  NPVADEGVTLDERGRFTGEAEKKLQELRKRLQGVPTNNRVEDLNNAGKIASDYYTQEEML 376

Query: 1666 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLGSEDLGSRNDGRRQNLREEQDRIEAEMRS 1845
                                         GLG+ DLGSRND RRQ +REE+ R EAE R+
Sbjct: 377  KFKKPKKKKALRKKEKLDIDALEAEAISSGLGAGDLGSRNDARRQAIREEEARSEAEKRN 436

Query: 1846 NAYQSAYAKADEASKALRQEQVRSMQIEEDDTPVFGDDDDELRKSLERARKVALKKQEEE 2025
            +AYQSAYAKADEASK+L  EQ   ++ EED+  VF DDDD+L KS+ER+RK+A KKQE+E
Sbjct: 437  SAYQSAYAKADEASKSLWLEQTLIVKPEEDENQVFADDDDDLYKSIERSRKLAFKKQEDE 496

Query: 2026 KSGPQ 2040
            KSGPQ
Sbjct: 497  KSGPQ 501


>ref|XP_009782950.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nicotiana
            sylvestris]
          Length = 873

 Score =  448 bits (1153), Expect = e-143
 Identities = 239/438 (54%), Positives = 306/438 (69%), Gaps = 13/438 (2%)
 Frame = +1

Query: 745  KHRGKDTADRERVKDRSREKDKQADQEKDRTRDRERSSRKQKDDSHDMAKDIEKE---RV 915
            + RG+D  D+E+ ++R++EK ++AD++K+R+RD++R +R+Q+D+ HD +KD  K+   RV
Sbjct: 184  RERGRDAVDKEKGRERTKEKGREADEDKERSRDKDRGNRRQRDEGHDRSKDRRKDDVQRV 243

Query: 916  DNSDDEYDSNILKQQ----------EKDVIAGAGYHQSASELEERISKMREERLIKPSEG 1065
            D+ D +Y  ++ KQ+            + +  AG   SASELEERI KM+EERL K SEG
Sbjct: 244  DDEDSDYQ-DVAKQEIVSYEDDDRARNNAVETAGSQSSASELEERILKMKEERLKKKSEG 302

Query: 1066 ASEVLSWVNRSRKLEEKRSVEKEKALQRSKVFEEQDNMIDEESDDEPATQHTKQHLGGVK 1245
            ASEV++WV++SRK+EEKR+ EKE+ALQ SK+FEEQD M DEESDDE   +   + LGG+K
Sbjct: 303  ASEVMTWVSKSRKIEEKRNAEKERALQLSKIFEEQDKMNDEESDDEEKARLAAKELGGMK 362

Query: 1246 ILHGLDKVLEGGAVVLTLKDQSILADGDINQEVDMLENVEIGEQXXXXXXXXXXXXXTGV 1425
            +LHGLDKV+EGGAVVLTLKDQSILA  DINQEVD+LENVEIGEQ             TG+
Sbjct: 363  VLHGLDKVVEGGAVVLTLKDQSILAGDDINQEVDVLENVEIGEQKKRDDAYKAAKKKTGI 422

Query: 1426 YDDKFNDEPGAEKKILPQYDDPVADEGVTLDSSGRFTGEAXXXXXXXXXXINGVSASTRG 1605
            YDDKFND+PG E+KILPQYDDP  +EGVTLD++G F+ +A          I G S+ T  
Sbjct: 423  YDDKFNDDPGFERKILPQYDDPTEEEGVTLDATGGFSVDAEKKLEELRKRIQGSSSKTLA 482

Query: 1606 EDLNSTGKISTDYYTQEEMTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLGSEDLGSRN 1785
            EDLNS+GK+ +DYYTQEEM                              GLG  DLGSRN
Sbjct: 483  EDLNSSGKLLSDYYTQEEMLQFKKPKKKKSLRKKEKMDLDALEAEARSSGLGVGDLGSRN 542

Query: 1786 DGRRQNLREEQDRIEAEMRSNAYQSAYAKADEASKALRQEQVRSMQIEEDDTPVFGDDDD 1965
            +  RQ LREE +R EAE +S +YQ+AYAKA+EASKALR E+  + Q EEDDT VF DD++
Sbjct: 543  NKTRQALREEMERAEAETKSKSYQAAYAKAEEASKALRPEKTNNNQREEDDT-VFDDDEE 601

Query: 1966 ELRKSLERARKVALKKQE 2019
            ELRKSLERARK+ALKKQE
Sbjct: 602  ELRKSLERARKLALKKQE 619


>ref|XP_007022027.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 3, partial [Theobroma
            cacao] gi|508721655|gb|EOY13552.1| U4/U6.U5
            tri-snRNP-associated protein 1 isoform 3, partial
            [Theobroma cacao]
          Length = 864

 Score =  447 bits (1151), Expect = e-143
 Identities = 240/425 (56%), Positives = 297/425 (69%), Gaps = 1/425 (0%)
 Frame = +1

Query: 769  DRERVKDRSREKDKQADQEKDRTRDRERSSRKQKDDSHDMAKDIEKERVDNSDDEYDSNI 948
            DRE+ K+RS++K ++AD EK+R+RDR+ + +K  ++ ++ +KD E        D  DS  
Sbjct: 188  DREKGKERSKQKSREADLEKERSRDRDNAIKKNHEEDYEGSKDGELAL-----DYGDSRD 242

Query: 949  LKQQEKDVIAGAGYHQ-SASELEERISKMREERLIKPSEGASEVLSWVNRSRKLEEKRSV 1125
              + E +  + AG  Q S+SELEERI++M+EERL K SEG SEVL WV   RKLEEKR+ 
Sbjct: 243  KDEAELNAGSNAGVAQASSSELEERIARMKEERLKKKSEGVSEVLEWVGNFRKLEEKRNA 302

Query: 1126 EKEKALQRSKVFEEQDNMIDEESDDEPATQHTKQHLGGVKILHGLDKVLEGGAVVLTLKD 1305
            EKEKALQRSK+FEEQD+ +  E++DE A +H    L GVK+LHGLDKV++GGAVVLTLKD
Sbjct: 303  EKEKALQRSKIFEEQDDFVQGENEDEEAVRHAAHDLAGVKVLHGLDKVMDGGAVVLTLKD 362

Query: 1306 QSILADGDINQEVDMLENVEIGEQXXXXXXXXXXXXXTGVYDDKFNDEPGAEKKILPQYD 1485
            QSILA+GDIN++VDMLENVEIGEQ             TGVYDDKFNDEPG+EKKILPQYD
Sbjct: 363  QSILANGDINEDVDMLENVEIGEQRRRDEAYKAAKKKTGVYDDKFNDEPGSEKKILPQYD 422

Query: 1486 DPVADEGVTLDSSGRFTGEAXXXXXXXXXXINGVSASTRGEDLNSTGKISTDYYTQEEMT 1665
            +PVADEGVTLD  GRFTGEA          + GV  + R EDLN+ GKI++DYYTQEEM 
Sbjct: 423  NPVADEGVTLDERGRFTGEAEKKLQELRKRLQGVPTNNRVEDLNNAGKIASDYYTQEEML 482

Query: 1666 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLGSEDLGSRNDGRRQNLREEQDRIEAEMRS 1845
                                         GLG+ DLGSRND RRQ +REE+ R EAE R+
Sbjct: 483  KFKKPKKKKALRKKEKLDIDALEAEAISSGLGAGDLGSRNDARRQAIREEEARSEAEKRN 542

Query: 1846 NAYQSAYAKADEASKALRQEQVRSMQIEEDDTPVFGDDDDELRKSLERARKVALKKQEEE 2025
            +AYQSAYAKADEASK+L  EQ   ++ EED+  VF DDDD+L KS+ER+RK+A KKQE+E
Sbjct: 543  SAYQSAYAKADEASKSLWLEQTLIVKPEEDENQVFADDDDDLYKSIERSRKLAFKKQEDE 602

Query: 2026 KSGPQ 2040
            KSGPQ
Sbjct: 603  KSGPQ 607


>ref|XP_009630824.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nicotiana
            tomentosiformis] gi|697153160|ref|XP_009630825.1|
            PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1
            [Nicotiana tomentosiformis]
          Length = 922

 Score =  449 bits (1154), Expect = e-143
 Identities = 239/438 (54%), Positives = 306/438 (69%), Gaps = 13/438 (2%)
 Frame = +1

Query: 745  KHRGKDTADRERVKDRSREKDKQADQEKDRTRDRERSSRKQKDDSHDMAKDIEKE---RV 915
            + RG+D  D+E+ ++R++EK ++AD++K+R+RD++R +R+Q+D+ HD +KD  K+   RV
Sbjct: 186  RERGRDAVDKEKGRERTKEKGREADEDKERSRDKDRGNRRQRDEGHDRSKDRRKDDVQRV 245

Query: 916  DNSDDEYDSNILKQQ----------EKDVIAGAGYHQSASELEERISKMREERLIKPSEG 1065
            D+ D +Y  ++ KQ+            + +  AG   SAS+LEERI KM+EERL K SEG
Sbjct: 246  DDEDSDYQ-DVAKQEIVSYEDDDRARNNAVETAGSQSSASKLEERILKMKEERLKKKSEG 304

Query: 1066 ASEVLSWVNRSRKLEEKRSVEKEKALQRSKVFEEQDNMIDEESDDEPATQHTKQHLGGVK 1245
            ASEV++WV++SRK+EEKR+ EKE+ALQ SK+FEEQD + DEESDDE   +   + LGG+K
Sbjct: 305  ASEVMTWVSKSRKIEEKRTAEKERALQLSKIFEEQDKINDEESDDEEKARLAAKELGGMK 364

Query: 1246 ILHGLDKVLEGGAVVLTLKDQSILADGDINQEVDMLENVEIGEQXXXXXXXXXXXXXTGV 1425
            +LHGLDKV+EGGAVVLTLKDQSILA  DINQEVD+LENVEIGEQ             TG+
Sbjct: 365  VLHGLDKVVEGGAVVLTLKDQSILAGDDINQEVDVLENVEIGEQKKRDDAYKAAKKKTGI 424

Query: 1426 YDDKFNDEPGAEKKILPQYDDPVADEGVTLDSSGRFTGEAXXXXXXXXXXINGVSASTRG 1605
            YDDKFND+PG E+KILPQYDDP  +EGVTLD++G F+ +A          I G S+ T  
Sbjct: 425  YDDKFNDDPGFERKILPQYDDPAEEEGVTLDATGGFSVDAEKKLEELRKRIQGSSSKTLA 484

Query: 1606 EDLNSTGKISTDYYTQEEMTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLGSEDLGSRN 1785
            EDLNS+GK+ +DYYTQEEM                              GLG  DLGSRN
Sbjct: 485  EDLNSSGKLLSDYYTQEEMLQFKKPKKKKSLRKKEKMDLDALEVEAKSSGLGVGDLGSRN 544

Query: 1786 DGRRQNLREEQDRIEAEMRSNAYQSAYAKADEASKALRQEQVRSMQIEEDDTPVFGDDDD 1965
            D  RQ LREE +R EAE +S +YQ+AYAKA+EASKALR E+  + Q EEDDT VF DDD+
Sbjct: 545  DKTRQALREEMERAEAETKSKSYQAAYAKAEEASKALRPEKTNNNQREEDDT-VFDDDDE 603

Query: 1966 ELRKSLERARKVALKKQE 2019
            ELRKSLERARK+ALKKQE
Sbjct: 604  ELRKSLERARKLALKKQE 621


>ref|XP_007022025.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao]
            gi|590611175|ref|XP_007022026.1| U4/U6.U5
            tri-snRNP-associated protein 1 isoform 1 [Theobroma
            cacao] gi|508721653|gb|EOY13550.1| U4/U6.U5
            tri-snRNP-associated protein 1 isoform 1 [Theobroma
            cacao] gi|508721654|gb|EOY13551.1| U4/U6.U5
            tri-snRNP-associated protein 1 isoform 1 [Theobroma
            cacao]
          Length = 907

 Score =  447 bits (1151), Expect = e-142
 Identities = 240/425 (56%), Positives = 297/425 (69%), Gaps = 1/425 (0%)
 Frame = +1

Query: 769  DRERVKDRSREKDKQADQEKDRTRDRERSSRKQKDDSHDMAKDIEKERVDNSDDEYDSNI 948
            DRE+ K+RS++K ++AD EK+R+RDR+ + +K  ++ ++ +KD E        D  DS  
Sbjct: 188  DREKGKERSKQKSREADLEKERSRDRDNAIKKNHEEDYEGSKDGELAL-----DYGDSRD 242

Query: 949  LKQQEKDVIAGAGYHQ-SASELEERISKMREERLIKPSEGASEVLSWVNRSRKLEEKRSV 1125
              + E +  + AG  Q S+SELEERI++M+EERL K SEG SEVL WV   RKLEEKR+ 
Sbjct: 243  KDEAELNAGSNAGVAQASSSELEERIARMKEERLKKKSEGVSEVLEWVGNFRKLEEKRNA 302

Query: 1126 EKEKALQRSKVFEEQDNMIDEESDDEPATQHTKQHLGGVKILHGLDKVLEGGAVVLTLKD 1305
            EKEKALQRSK+FEEQD+ +  E++DE A +H    L GVK+LHGLDKV++GGAVVLTLKD
Sbjct: 303  EKEKALQRSKIFEEQDDFVQGENEDEEAVRHAAHDLAGVKVLHGLDKVMDGGAVVLTLKD 362

Query: 1306 QSILADGDINQEVDMLENVEIGEQXXXXXXXXXXXXXTGVYDDKFNDEPGAEKKILPQYD 1485
            QSILA+GDIN++VDMLENVEIGEQ             TGVYDDKFNDEPG+EKKILPQYD
Sbjct: 363  QSILANGDINEDVDMLENVEIGEQRRRDEAYKAAKKKTGVYDDKFNDEPGSEKKILPQYD 422

Query: 1486 DPVADEGVTLDSSGRFTGEAXXXXXXXXXXINGVSASTRGEDLNSTGKISTDYYTQEEMT 1665
            +PVADEGVTLD  GRFTGEA          + GV  + R EDLN+ GKI++DYYTQEEM 
Sbjct: 423  NPVADEGVTLDERGRFTGEAEKKLQELRKRLQGVPTNNRVEDLNNAGKIASDYYTQEEML 482

Query: 1666 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLGSEDLGSRNDGRRQNLREEQDRIEAEMRS 1845
                                         GLG+ DLGSRND RRQ +REE+ R EAE R+
Sbjct: 483  KFKKPKKKKALRKKEKLDIDALEAEAISSGLGAGDLGSRNDARRQAIREEEARSEAEKRN 542

Query: 1846 NAYQSAYAKADEASKALRQEQVRSMQIEEDDTPVFGDDDDELRKSLERARKVALKKQEEE 2025
            +AYQSAYAKADEASK+L  EQ   ++ EED+  VF DDDD+L KS+ER+RK+A KKQE+E
Sbjct: 543  SAYQSAYAKADEASKSLWLEQTLIVKPEEDENQVFADDDDDLYKSIERSRKLAFKKQEDE 602

Query: 2026 KSGPQ 2040
            KSGPQ
Sbjct: 603  KSGPQ 607


>ref|XP_012077379.1| PREDICTED: SART-1 family protein DOT2 isoform X1 [Jatropha curcas]
            gi|643724962|gb|KDP34163.1| hypothetical protein
            JCGZ_07734 [Jatropha curcas]
          Length = 864

 Score =  444 bits (1141), Expect = e-141
 Identities = 234/433 (54%), Positives = 300/433 (69%), Gaps = 9/433 (2%)
 Frame = +1

Query: 769  DRERVKDRSREKDKQADQEKDRTRDRERSSRKQKDDSHDMAKDIEKERVDNSDDEYDSNI 948
            DRE+ +++++E+++ +D +K+R RDRE+ S++  ++ +D +KD   E   + ++  DS++
Sbjct: 135  DREKGREKTKERERDSDYDKERLRDREKVSKRSHEEDYDRSKDDVVEM--DYENNKDSSV 192

Query: 949  LKQ---------QEKDVIAGAGYHQSASELEERISKMREERLIKPSEGASEVLSWVNRSR 1101
            LKQ         ++K      G     S+LEERI KM+EERL K SE   EVL+WVNRSR
Sbjct: 193  LKQSKVSFDNKDEQKAEETSRGGSAPVSQLEERILKMKEERLKKNSEPGDEVLAWVNRSR 252

Query: 1102 KLEEKRSVEKEKALQRSKVFEEQDNMIDEESDDEPATQHTKQHLGGVKILHGLDKVLEGG 1281
            KLEEK++ EK+KA Q SK+FEEQDN +  ES+DE + +HT   L GVK+LHGL+KV+EGG
Sbjct: 253  KLEEKKNAEKQKAKQLSKIFEEQDNNVQGESEDEDSGEHTTHDLAGVKVLHGLEKVMEGG 312

Query: 1282 AVVLTLKDQSILADGDINQEVDMLENVEIGEQXXXXXXXXXXXXXTGVYDDKFNDEPGAE 1461
            AVVLTLKDQSILADGDIN+EVDMLENVEIGEQ             TG+YDDKFND+P +E
Sbjct: 313  AVVLTLKDQSILADGDINEEVDMLENVEIGEQKRRDDAYKAAKKKTGIYDDKFNDDPASE 372

Query: 1462 KKILPQYDDPVADEGVTLDSSGRFTGEAXXXXXXXXXXINGVSASTRGEDLNSTGKISTD 1641
            KKILPQYDD  ADEGV LD  GRFTGEA          + GVS + R EDL+S+GKIS+D
Sbjct: 373  KKILPQYDDSAADEGVALDERGRFTGEAEKKLEELRRRLQGVSTNNRFEDLSSSGKISSD 432

Query: 1642 YYTQEEMTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLGSEDLGSRNDGRRQNLREEQD 1821
            YYT EE+                              GLG  DLGSRN+GRRQ +R+EQ+
Sbjct: 433  YYTHEELLQFKKPKKKKSLRKKEKLDIDALEAEAVSAGLGVGDLGSRNNGRRQAIRQEQE 492

Query: 1822 RIEAEMRSNAYQSAYAKADEASKALRQEQVRSMQIEEDDTPVFGDDDDELRKSLERARKV 2001
            R EAEMRS+AYQ+AY KADEASK+LRQEQ    +++ED+ PVF +DD++L KSLERARK+
Sbjct: 493  RSEAEMRSSAYQAAYDKADEASKSLRQEQTLHAKLDEDENPVFAEDDEDLYKSLERARKL 552

Query: 2002 ALKKQEEEKSGPQ 2040
            ALKKQEE+ SGPQ
Sbjct: 553  ALKKQEEKASGPQ 565


>gb|KJB61482.1| hypothetical protein B456_009G361400 [Gossypium raimondii]
          Length = 655

 Score =  431 bits (1109), Expect = e-139
 Identities = 241/458 (52%), Positives = 305/458 (66%), Gaps = 26/458 (5%)
 Frame = +1

Query: 745  KHRGKDTA-DRERVKDRSREKDKQADQEKDRTRDRE--RSSRKQKDDSHDMAK--DIEKE 909
            K RGKD + DR+R K++ R+K K+ ++E+D+ +DRE  R   K KD S    +  D+EKE
Sbjct: 144  KERGKDKSRDRDREKEKERDKAKEREKERDKLKDREKEREGEKGKDRSKQKNREADLEKE 203

Query: 910  RV-------DNSDDEYDSNI-----------LKQQEKDVIAGAG---YHQSASELEERIS 1026
            R         N +++Y+ +              + E ++ AG+       S+SELEERI 
Sbjct: 204  RSRDRDNVGKNHEEDYEGSKDGELALDYEDRRDKDEAELNAGSNASLVQASSSELEERIV 263

Query: 1027 KMREERLIKPSEGASEVLSWVNRSRKLEEKRSVEKEKALQRSKVFEEQDNMIDEESDDEP 1206
            +M+E+RL K SEG SEV +WV+RSRKLE+KR+ EKEKALQ SK+FEEQDN +  E +DE 
Sbjct: 264  RMKEDRLKKKSEGLSEVSAWVSRSRKLEDKRNAEKEKALQLSKIFEEQDNFVQGEDEDEE 323

Query: 1207 ATQHTKQHLGGVKILHGLDKVLEGGAVVLTLKDQSILADGDINQEVDMLENVEIGEQXXX 1386
            A       LGGVK+LHGLDKV++GGAVVLTLKDQSILADGD+N++VDMLEN+EIGEQ   
Sbjct: 324  ADNRPTHDLGGVKVLHGLDKVMDGGAVVLTLKDQSILADGDLNEDVDMLENIEIGEQKQR 383

Query: 1387 XXXXXXXXXXTGVYDDKFNDEPGAEKKILPQYDDPVADEGVTLDSSGRFTGEAXXXXXXX 1566
                      TGVYDDKFN++PG+EKKILPQYDDPVADEGVTLD  GRFTGEA       
Sbjct: 384  DEAYKAAKKKTGVYDDKFNEDPGSEKKILPQYDDPVADEGVTLDERGRFTGEAEKKLEEL 443

Query: 1567 XXXINGVSASTRGEDLNSTGKISTDYYTQEEMTXXXXXXXXXXXXXXXXXXXXXXXXXXX 1746
               + GV  + R EDLN+ GKIS+DYYTQEEM                            
Sbjct: 444  RKRLLGVPTNNRVEDLNNVGKISSDYYTQEEMLRFKKPKKKKALRKKEKLDIDALEAEAV 503

Query: 1747 XXGLGSEDLGSRNDGRRQNLREEQDRIEAEMRSNAYQSAYAKADEASKALRQEQVRSMQI 1926
              GLG+ DLGSR D RRQ ++EE+ R EAE R NAYQ+A+AKADEASK+LR EQ  +++ 
Sbjct: 504  SAGLGAGDLGSRKDSRRQAIKEEEARSEAEKRKNAYQAAFAKADEASKSLRLEQTHTVKP 563

Query: 1927 EEDDTPVFGDDDDELRKSLERARKVALKKQEEEKSGPQ 2040
            EED+  VF DD+++L KSLE+AR++ALKKQ EEKSGPQ
Sbjct: 564  EEDENQVFADDEEDLYKSLEKARRLALKKQ-EEKSGPQ 600


>gb|KVH99952.1| SART-1 protein [Cynara cardunculus var. scolymus]
          Length = 915

 Score =  436 bits (1122), Expect = e-138
 Identities = 245/441 (55%), Positives = 291/441 (65%), Gaps = 19/441 (4%)
 Frame = +1

Query: 766  ADRERVKDRSREKDKQADQEKDRTRDRERSSRKQKDDSHDMAKDIEKERVDNSD------ 927
            ADR+R K R R +D   D +KDR+  RE+ S KQ DD H  +KD+ KE   NSD      
Sbjct: 181  ADRDREKTRERVRD--GDHDKDRSTGREKVSGKQHDDDHGGSKDLGKEDKLNSDSEDGQY 238

Query: 928  ------------DEYDSNILKQQEKDVIAGAGYHQSASELEERISKMREERLIKPSEGAS 1071
                        D+  + ILK +       AG  QSASEL++RI +M+EERL K SEGAS
Sbjct: 239  RDTSKHGIGSHRDKDATKILKHEADAEGEYAGSQQSASELQDRIMRMKEERLKKKSEGAS 298

Query: 1072 EVLSWVNRSRKLEEKRSVEKEKALQRSKVFEEQDNMIDEESDDEPAT-QHTKQHLGGVKI 1248
            +VLSWV++SRKLE++R+ EKEKALQRSK+FEEQDN+   E +DE A   HT   L G K+
Sbjct: 299  DVLSWVSKSRKLEDRRNAEKEKALQRSKMFEEQDNVTQGEDEDEVAACPHTSHDLAGFKV 358

Query: 1249 LHGLDKVLEGGAVVLTLKDQSILADGDINQEVDMLENVEIGEQXXXXXXXXXXXXXTGVY 1428
            LHGLDKV+EGG VVLTLKDQSILA GDINQE+DMLENVEIGEQ             +GVY
Sbjct: 359  LHGLDKVIEGGTVVLTLKDQSILAAGDINQEIDMLENVEIGEQKRRNEAYKAAKKKSGVY 418

Query: 1429 DDKFNDEPGAEKKILPQYDDPVADEGVTLDSSGRFTGEAXXXXXXXXXXINGVSASTRGE 1608
            DDKFN+EPG EK +LPQYDDPV DEGVTLD  G F GEA          I+G S +TR E
Sbjct: 419  DDKFNEEPGIEKIMLPQYDDPVVDEGVTLDERGSFGGEAEKKLEELRRRIDGASVNTRFE 478

Query: 1609 DLNSTGKISTDYYTQEEMTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLGSEDLGSRND 1788
            DL S+GK+STDYYT EEM                              GLG+ DLGSR D
Sbjct: 479  DLTSSGKVSTDYYTSEEMLRFKKPKKKKALRKKDKLDIDALEAEARSAGLGTGDLGSRAD 538

Query: 1789 GRRQNLREEQDRIEAEMRSNAYQSAYAKADEASKALRQEQVRSMQIEEDDTPVFGDDDDE 1968
            G+RQ L+EEQ+R EAE RSNA+QSAY KADEASKALR EQ  S+Q E++D  VFGDDDD+
Sbjct: 539  GKRQALKEEQERSEAEKRSNAFQSAYVKADEASKALRMEQTVSLQKEDEDNLVFGDDDDD 598

Query: 1969 LRKSLERARKVALKKQEEEKS 2031
            L KSL+RARKVALK+Q++  S
Sbjct: 599  LHKSLQRARKVALKRQDDGTS 619


>ref|XP_008390895.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1-like [Malus
            domestica] gi|657997037|ref|XP_008390896.1| PREDICTED:
            U4/U6.U5 tri-snRNP-associated protein 1-like [Malus
            domestica] gi|657997039|ref|XP_008390897.1| PREDICTED:
            U4/U6.U5 tri-snRNP-associated protein 1-like [Malus
            domestica]
          Length = 946

 Score =  435 bits (1118), Expect = e-137
 Identities = 240/444 (54%), Positives = 294/444 (66%), Gaps = 11/444 (2%)
 Frame = +1

Query: 742  GKHRGKDTADRERVKDRSREKDKQADQEKDRTRDRERSSRKQKDDSHDMAKDIEKERVDN 921
            G+   KDT DRERVKD+ REK+++ DQ+KD++RDR      ++DD   +          N
Sbjct: 184  GRESYKDT-DRERVKDKYREKEREVDQDKDKSRDRGSRRSVERDDKLKL----------N 232

Query: 922  SDDEYDSNILKQQEKDVIA---------GAGYHQSASELEERISKMREERLIKPSEGASE 1074
             DD  D +ILKQ +    A          +G H SASELEERI K +EERL K +E   E
Sbjct: 233  GDDNRDKDILKQGKVSHNAEDERHADGLSSGTHLSASELEERILKTKEERLKKKTEDVPE 292

Query: 1075 VLSWVNRSRKLEEKRSVEKEKALQRSKVFEEQDNMIDEESDDEPATQHTKQHLGGVKILH 1254
            VL+WV++SRK+EEKR+ EK+KALQ SK+FEEQDN+   ES+DE   Q     L GVK+LH
Sbjct: 293  VLAWVSKSRKIEEKRNAEKQKALQLSKIFEEQDNIGQGESEDEETAQDPTHDLAGVKVLH 352

Query: 1255 GLDKVLEGGAVVLTLKDQSILADGDINQEVDMLENVEIGEQXXXXXXXXXXXXXTGVYDD 1434
            GLDKV+EGGAVVLTLKDQ+ILADGDIN+++DMLENVEIGEQ              G Y D
Sbjct: 353  GLDKVMEGGAVVLTLKDQNILADGDINEDIDMLENVEIGEQKQRDDAYKAAKKKRGAYVD 412

Query: 1435 KFNDEPGAEKKILPQYDDPVADEGVTLDSSGRFTGEAXXXXXXXXXXINGVSASTRGEDL 1614
            KFND+PG EKK+LPQYDDP  DEG+TLD  GRFTGEA          I GV    R EDL
Sbjct: 413  KFNDDPGTEKKMLPQYDDPTPDEGLTLDERGRFTGEAEKKLEELRKRIQGVPTKDRFEDL 472

Query: 1615 NSTGKISTDYYTQEEMTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLGSEDLGSRNDGR 1794
            N +GKIS+D+YTQ+EM                              GLG EDLGSRND +
Sbjct: 473  NMSGKISSDFYTQDEMLQFKKPKKKKSLRKREKLDLDALEAEAVSAGLGVEDLGSRNDAK 532

Query: 1795 RQNLREEQDRIEAEMRSNAYQSAYAKADEASKALRQEQVRSMQIEEDDTPVFGDDDDELR 1974
            R+  +EEQ+R+EAE R++AYQ AYA+ADEASK+LR EQ  S++ EED+ PVF DDDD+L 
Sbjct: 533  RRASKEEQERLEAERRNSAYQLAYARADEASKSLRLEQTLSVKREEDENPVFADDDDDLY 592

Query: 1975 KSLERARKVALKKQEEEK--SGPQ 2040
            KSLE+ARK+ALKK+EEEK  SGPQ
Sbjct: 593  KSLEKARKLALKKKEEEKTVSGPQ 616


>gb|KHG25959.1| U4/U6.U5 tri-snRNP-associated 1 [Gossypium arboreum]
          Length = 955

 Score =  435 bits (1118), Expect = e-137
 Identities = 239/464 (51%), Positives = 305/464 (65%), Gaps = 32/464 (6%)
 Frame = +1

Query: 745  KHRGKDTA-DRERVKDRSREKDKQADQEKDRTRDRERSSRKQKDDSHDMAKDIEKERVDN 921
            K RGKD + DR+R K++ R+K K+ ++E+D+ +DRE+    +KD   +  KD  K++   
Sbjct: 144  KERGKDKSRDRDREKEKERDKAKEREKERDKLKDREKEREGEKDRDREKGKDRSKQKNRE 203

Query: 922  SDDEYD-----SNILKQQEKD-----------------------VIAGAG---YHQSASE 1008
            +D E +      N++K  E+D                       + AG+       S+SE
Sbjct: 204  TDLEKERSRDRDNVVKNHEEDYEGSKDGELALDYEDRRDKDEAELNAGSNASLVQASSSE 263

Query: 1009 LEERISKMREERLIKPSEGASEVLSWVNRSRKLEEKRSVEKEKALQRSKVFEEQDNMIDE 1188
            LEERI +M+E RL K SEG SEV +WV+RSRKLE+KR+ EKEKALQ SK+FEEQDN +  
Sbjct: 264  LEERIVRMKEVRLKKKSEGLSEVSAWVSRSRKLEDKRNAEKEKALQLSKIFEEQDNFVQG 323

Query: 1189 ESDDEPATQHTKQHLGGVKILHGLDKVLEGGAVVLTLKDQSILADGDINQEVDMLENVEI 1368
            E +DE A       LGGVK+LHGLDKV++GGAVVLTLKDQSILADGD+N++VDMLEN+EI
Sbjct: 324  EDEDEEADNRPSHDLGGVKVLHGLDKVMDGGAVVLTLKDQSILADGDLNEDVDMLENIEI 383

Query: 1369 GEQXXXXXXXXXXXXXTGVYDDKFNDEPGAEKKILPQYDDPVADEGVTLDSSGRFTGEAX 1548
            GEQ             TGVYDDKFN++PG+EKKILPQYDDPVADEGVTLD  GRFTGEA 
Sbjct: 384  GEQKQRDEAYKAAKKKTGVYDDKFNEDPGSEKKILPQYDDPVADEGVTLDERGRFTGEAE 443

Query: 1549 XXXXXXXXXINGVSASTRGEDLNSTGKISTDYYTQEEMTXXXXXXXXXXXXXXXXXXXXX 1728
                     + GV  + R EDLN+ GK+S+DYYTQEEM                      
Sbjct: 444  KKLDELRKRLLGVPTNNRVEDLNNVGKVSSDYYTQEEMLRFKKPKKKKALRKKEKLDIDA 503

Query: 1729 XXXXXXXXGLGSEDLGSRNDGRRQNLREEQDRIEAEMRSNAYQSAYAKADEASKALRQEQ 1908
                    GLG+ DLGSRND RRQ ++EE+ R EAE R+NAYQ+A+AKADEASK+LR EQ
Sbjct: 504  LEAEAVSAGLGAGDLGSRNDSRRQAIKEEEARSEAEKRNNAYQAAFAKADEASKSLRLEQ 563

Query: 1909 VRSMQIEEDDTPVFGDDDDELRKSLERARKVALKKQEEEKSGPQ 2040
              +++ EED+  VF DD+++L KSLE+AR++ALKKQ EEKSGPQ
Sbjct: 564  TLTVKPEEDENQVFADDEEDLYKSLEKARRLALKKQ-EEKSGPQ 606


>gb|KJB61483.1| hypothetical protein B456_009G361400 [Gossypium raimondii]
          Length = 878

 Score =  431 bits (1109), Expect = e-137
 Identities = 241/458 (52%), Positives = 305/458 (66%), Gaps = 26/458 (5%)
 Frame = +1

Query: 745  KHRGKDTA-DRERVKDRSREKDKQADQEKDRTRDRE--RSSRKQKDDSHDMAK--DIEKE 909
            K RGKD + DR+R K++ R+K K+ ++E+D+ +DRE  R   K KD S    +  D+EKE
Sbjct: 144  KERGKDKSRDRDREKEKERDKAKEREKERDKLKDREKEREGEKGKDRSKQKNREADLEKE 203

Query: 910  RV-------DNSDDEYDSNI-----------LKQQEKDVIAGAG---YHQSASELEERIS 1026
            R         N +++Y+ +              + E ++ AG+       S+SELEERI 
Sbjct: 204  RSRDRDNVGKNHEEDYEGSKDGELALDYEDRRDKDEAELNAGSNASLVQASSSELEERIV 263

Query: 1027 KMREERLIKPSEGASEVLSWVNRSRKLEEKRSVEKEKALQRSKVFEEQDNMIDEESDDEP 1206
            +M+E+RL K SEG SEV +WV+RSRKLE+KR+ EKEKALQ SK+FEEQDN +  E +DE 
Sbjct: 264  RMKEDRLKKKSEGLSEVSAWVSRSRKLEDKRNAEKEKALQLSKIFEEQDNFVQGEDEDEE 323

Query: 1207 ATQHTKQHLGGVKILHGLDKVLEGGAVVLTLKDQSILADGDINQEVDMLENVEIGEQXXX 1386
            A       LGGVK+LHGLDKV++GGAVVLTLKDQSILADGD+N++VDMLEN+EIGEQ   
Sbjct: 324  ADNRPTHDLGGVKVLHGLDKVMDGGAVVLTLKDQSILADGDLNEDVDMLENIEIGEQKQR 383

Query: 1387 XXXXXXXXXXTGVYDDKFNDEPGAEKKILPQYDDPVADEGVTLDSSGRFTGEAXXXXXXX 1566
                      TGVYDDKFN++PG+EKKILPQYDDPVADEGVTLD  GRFTGEA       
Sbjct: 384  DEAYKAAKKKTGVYDDKFNEDPGSEKKILPQYDDPVADEGVTLDERGRFTGEAEKKLEEL 443

Query: 1567 XXXINGVSASTRGEDLNSTGKISTDYYTQEEMTXXXXXXXXXXXXXXXXXXXXXXXXXXX 1746
               + GV  + R EDLN+ GKIS+DYYTQEEM                            
Sbjct: 444  RKRLLGVPTNNRVEDLNNVGKISSDYYTQEEMLRFKKPKKKKALRKKEKLDIDALEAEAV 503

Query: 1747 XXGLGSEDLGSRNDGRRQNLREEQDRIEAEMRSNAYQSAYAKADEASKALRQEQVRSMQI 1926
              GLG+ DLGSR D RRQ ++EE+ R EAE R NAYQ+A+AKADEASK+LR EQ  +++ 
Sbjct: 504  SAGLGAGDLGSRKDSRRQAIKEEEARSEAEKRKNAYQAAFAKADEASKSLRLEQTHTVKP 563

Query: 1927 EEDDTPVFGDDDDELRKSLERARKVALKKQEEEKSGPQ 2040
            EED+  VF DD+++L KSLE+AR++ALKKQ EEKSGPQ
Sbjct: 564  EEDENQVFADDEEDLYKSLEKARRLALKKQ-EEKSGPQ 600


>gb|EPS63268.1| hypothetical protein M569_11517, partial [Genlisea aurea]
          Length = 795

 Score =  428 bits (1101), Expect = e-136
 Identities = 241/444 (54%), Positives = 299/444 (67%), Gaps = 10/444 (2%)
 Frame = +1

Query: 742  GKHRGKDTADRERVKDRSREKDKQADQEKDRTRDRERSSRKQKDDSHDMAKDIEKERVDN 921
            G+ RGKD ADR+++K+++RE +    QE D+  +R  S+ +++D +H    DI+K+    
Sbjct: 191  GRERGKDAADRDKLKEKNRETEVPIHQETDQEGNRHGSNLRRRDGNHGRGNDIDKDGESL 250

Query: 922  SDDEYDSN------ILKQQEKDVIAGAGYHQSASELEERISKMREERLIKPSEGASEVLS 1083
            ++   D N      I+ ++E+    G   H +  ELEERI KM+EERL+K +EG+SEVL+
Sbjct: 251  TEPSGDVNLGSDFKIINKEERPETPGD--HMAVLELEERIHKMKEERLLKSTEGSSEVLA 308

Query: 1084 WVNRSRKLEEKRSVEKEKALQRSKVFEEQDNMIDEESDDEPATQHTKQHLGGVKIL-HGL 1260
            WVNRSRK+ EK++ EKEK  + S  FEEQDNM +EESDDE   QH+ +HLGGVKI  HGL
Sbjct: 309  WVNRSRKIAEKKNSEKEKTFKLSMNFEEQDNMNEEESDDE--NQHSSKHLGGVKIQQHGL 366

Query: 1261 DKVLEGGAVVLTLKDQSILADGDINQEVDMLENVEIGEQXXXXXXXXXXXXXTGVYDDKF 1440
            +KV EGGA+V+TLKD SILADGD+NQEVD+LEN+EIGEQ             TG+YDDKF
Sbjct: 367  EKVAEGGAIVMTLKDHSILADGDVNQEVDVLENLEIGEQRRRDEAYKAAKKKTGIYDDKF 426

Query: 1441 NDEPGAEKKILPQYDDPVADEGVTLDSSGRFTGEAXXXXXXXXXXINGVSASTRGEDLNS 1620
            + E  +EKKILPQYDDPVADE VTLDSSG F+ EA          I G S   +GEDLNS
Sbjct: 427  SAEAVSEKKILPQYDDPVADELVTLDSSGHFSAEAERKLKELRKRIQGASTRLQGEDLNS 486

Query: 1621 TGKISTDYYTQEEMTXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-GLGSEDLGSRNDGRR 1797
              K S+DYYT+EEM                               GLGS DLGSRN+G+R
Sbjct: 487  ASKASSDYYTEEEMIMKFNRPKKKKSLRKKEKLDLDVLEAEAKSAGLGSGDLGSRNNGKR 546

Query: 1798 QNLREEQDRIEAEMRSNAYQSAYAKADEASKALRQEQVRSMQI-EEDDTPVFGDDDDELR 1974
            QNLRE QDRIEAEMRSNAYQSAYAKA+EASKALR E+    Q  EEDD   FGDDDDEL+
Sbjct: 547  QNLREAQDRIEAEMRSNAYQSAYAKANEASKALRLEKENGAQSKEEDDIQAFGDDDDELQ 606

Query: 1975 KSLERARKVALK-KQEEEKSGPQV 2043
            KSL RARK+ALK K E+EKS P++
Sbjct: 607  KSLARARKIALKSKDEDEKSTPKL 630


>ref|XP_012441144.1| PREDICTED: SART-1 family protein DOT2 [Gossypium raimondii]
            gi|823216924|ref|XP_012441145.1| PREDICTED: SART-1 family
            protein DOT2 [Gossypium raimondii]
            gi|763794483|gb|KJB61479.1| hypothetical protein
            B456_009G361400 [Gossypium raimondii]
            gi|763794484|gb|KJB61480.1| hypothetical protein
            B456_009G361400 [Gossypium raimondii]
            gi|763794485|gb|KJB61481.1| hypothetical protein
            B456_009G361400 [Gossypium raimondii]
            gi|763794488|gb|KJB61484.1| hypothetical protein
            B456_009G361400 [Gossypium raimondii]
          Length = 900

 Score =  431 bits (1109), Expect = e-136
 Identities = 241/458 (52%), Positives = 305/458 (66%), Gaps = 26/458 (5%)
 Frame = +1

Query: 745  KHRGKDTA-DRERVKDRSREKDKQADQEKDRTRDRE--RSSRKQKDDSHDMAK--DIEKE 909
            K RGKD + DR+R K++ R+K K+ ++E+D+ +DRE  R   K KD S    +  D+EKE
Sbjct: 144  KERGKDKSRDRDREKEKERDKAKEREKERDKLKDREKEREGEKGKDRSKQKNREADLEKE 203

Query: 910  RV-------DNSDDEYDSNI-----------LKQQEKDVIAGAG---YHQSASELEERIS 1026
            R         N +++Y+ +              + E ++ AG+       S+SELEERI 
Sbjct: 204  RSRDRDNVGKNHEEDYEGSKDGELALDYEDRRDKDEAELNAGSNASLVQASSSELEERIV 263

Query: 1027 KMREERLIKPSEGASEVLSWVNRSRKLEEKRSVEKEKALQRSKVFEEQDNMIDEESDDEP 1206
            +M+E+RL K SEG SEV +WV+RSRKLE+KR+ EKEKALQ SK+FEEQDN +  E +DE 
Sbjct: 264  RMKEDRLKKKSEGLSEVSAWVSRSRKLEDKRNAEKEKALQLSKIFEEQDNFVQGEDEDEE 323

Query: 1207 ATQHTKQHLGGVKILHGLDKVLEGGAVVLTLKDQSILADGDINQEVDMLENVEIGEQXXX 1386
            A       LGGVK+LHGLDKV++GGAVVLTLKDQSILADGD+N++VDMLEN+EIGEQ   
Sbjct: 324  ADNRPTHDLGGVKVLHGLDKVMDGGAVVLTLKDQSILADGDLNEDVDMLENIEIGEQKQR 383

Query: 1387 XXXXXXXXXXTGVYDDKFNDEPGAEKKILPQYDDPVADEGVTLDSSGRFTGEAXXXXXXX 1566
                      TGVYDDKFN++PG+EKKILPQYDDPVADEGVTLD  GRFTGEA       
Sbjct: 384  DEAYKAAKKKTGVYDDKFNEDPGSEKKILPQYDDPVADEGVTLDERGRFTGEAEKKLEEL 443

Query: 1567 XXXINGVSASTRGEDLNSTGKISTDYYTQEEMTXXXXXXXXXXXXXXXXXXXXXXXXXXX 1746
               + GV  + R EDLN+ GKIS+DYYTQEEM                            
Sbjct: 444  RKRLLGVPTNNRVEDLNNVGKISSDYYTQEEMLRFKKPKKKKALRKKEKLDIDALEAEAV 503

Query: 1747 XXGLGSEDLGSRNDGRRQNLREEQDRIEAEMRSNAYQSAYAKADEASKALRQEQVRSMQI 1926
              GLG+ DLGSR D RRQ ++EE+ R EAE R NAYQ+A+AKADEASK+LR EQ  +++ 
Sbjct: 504  SAGLGAGDLGSRKDSRRQAIKEEEARSEAEKRKNAYQAAFAKADEASKSLRLEQTHTVKP 563

Query: 1927 EEDDTPVFGDDDDELRKSLERARKVALKKQEEEKSGPQ 2040
            EED+  VF DD+++L KSLE+AR++ALKKQ EEKSGPQ
Sbjct: 564  EEDENQVFADDEEDLYKSLEKARRLALKKQ-EEKSGPQ 600


>ref|XP_004250062.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Solanum
            lycopersicum]
          Length = 898

 Score =  431 bits (1107), Expect = e-136
 Identities = 235/437 (53%), Positives = 305/437 (69%), Gaps = 12/437 (2%)
 Frame = +1

Query: 745  KHRG-KDTADRERVKDRSREKDKQADQE-KDRTRDRERSSRKQKDDSHDMAKDIEKERVD 918
            K RG +D A++E+ +DR++EK K+ D++ K+R+RD++RSSR+Q+D+ HD +KD ++ + +
Sbjct: 162  KERGSRDGAEKEKGRDRAKEKGKEVDEDDKERSRDKDRSSRRQRDEGHDRSKDKDRRKDE 221

Query: 919  NSDDEYDSN---ILKQQEKD-------VIAGAGYHQSASELEERISKMREERLIKPSEGA 1068
            +SD  Y +    ++  ++++          GA    +ASELEERI KM+EERL K SEGA
Sbjct: 222  DSDYRYAAKQEIVVSHEDEERSHNNAVETGGAQSAAAASELEERILKMKEERLKKKSEGA 281

Query: 1069 SEVLSWVNRSRKLEEKRSVEKEKALQRSKVFEEQDNMIDEESDDEPATQHTKQHLGGVKI 1248
            SEVL+WV++SRK+EE R+ EKEKALQ SK+FEEQD M +EESDDE   +   + LGG+K+
Sbjct: 282  SEVLAWVSKSRKIEEIRNAEKEKALQLSKIFEEQDKMNEEESDDEENARLAAKELGGMKV 341

Query: 1249 LHGLDKVLEGGAVVLTLKDQSILADGDINQEVDMLENVEIGEQXXXXXXXXXXXXXTGVY 1428
            LHGLDKV+EGGAVVLTLKDQSILA  D+NQEVD+LENVEIGEQ             TG+Y
Sbjct: 342  LHGLDKVVEGGAVVLTLKDQSILAGDDVNQEVDVLENVEIGEQKRRDDAYKAAKNKTGIY 401

Query: 1429 DDKFNDEPGAEKKILPQYDDPVADEGVTLDSSGRFTGEAXXXXXXXXXXINGVSASTRGE 1608
            DDKFNDEPG E+KILP+YDDP  +EGV LD++G F+ +A          I G S+  R E
Sbjct: 402  DDKFNDEPGFERKILPKYDDPAEEEGVILDATGGFSLDAEKKLEELRRRIQGPSSINRME 461

Query: 1609 DLNSTGKISTDYYTQEEMTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLGSEDLGSRND 1788
            DLNS+GK+ +DYYTQEEM                              GLG  DLGSRND
Sbjct: 462  DLNSSGKLLSDYYTQEEMVQFKKPKKKKSLRKKEKMDLDALEAEAKSAGLGVSDLGSRND 521

Query: 1789 GRRQNLREEQDRIEAEMRSNAYQSAYAKADEASKALRQEQVRSMQIEEDDTPVFGDDDDE 1968
              RQ L+EE++R +AE RSNAYQ+AYAKA+EASKALR ++  + Q EEDD  VF DDD+E
Sbjct: 522  KTRQVLKEEKERADAETRSNAYQAAYAKAEEASKALRPDKTNNNQREEDDA-VFDDDDEE 580

Query: 1969 LRKSLERARKVALKKQE 2019
            LRKSLERARK+AL+KQE
Sbjct: 581  LRKSLERARKLALRKQE 597


>ref|XP_010256356.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1 [Nelumbo nucifera]
            gi|720001422|ref|XP_010256357.1| PREDICTED: U4/U6.U5
            tri-snRNP-associated protein 1 [Nelumbo nucifera]
            gi|720001427|ref|XP_010256358.1| PREDICTED: U4/U6.U5
            tri-snRNP-associated protein 1 [Nelumbo nucifera]
            gi|720001430|ref|XP_010256359.1| PREDICTED: U4/U6.U5
            tri-snRNP-associated protein 1 [Nelumbo nucifera]
            gi|720001433|ref|XP_010256360.1| PREDICTED: U4/U6.U5
            tri-snRNP-associated protein 1 [Nelumbo nucifera]
            gi|720001436|ref|XP_010256361.1| PREDICTED: U4/U6.U5
            tri-snRNP-associated protein 1 [Nelumbo nucifera]
          Length = 851

 Score =  428 bits (1100), Expect = e-135
 Identities = 237/452 (52%), Positives = 301/452 (66%), Gaps = 20/452 (4%)
 Frame = +1

Query: 745  KHRGKDTADRERVKDRSR-EKDKQADQEKDRTRDRERSSRKQK--DDSHDMAKDIEKERV 915
            KH+ ++   RE+VKDR + E+DK  +++K+R++D+ER +R  K  D+S    KD+ K+  
Sbjct: 111  KHKDRE---REKVKDREKLERDKSKEKDKERSKDKERDARNGKLDDESQGRGKDVGKDEK 167

Query: 916  DNSDDEYDSNILKQ--------------QEKDVIAGA--GYHQSASELEERISKMREERL 1047
             + D   D +++KQ              + K  + GA  G   S  ELEERI KMREER 
Sbjct: 168  LDLDGGNDRDVVKQVKEVQHDVVVDMSVENKKKVDGAMGGSQPSTGELEERILKMREERS 227

Query: 1048 IKPSEGASEVLSWVNRSRKLEEKRSVEKEKALQRSKVFEEQDNMIDEESDDEPATQHTKQ 1227
             K SEG SEVLSWVN+SRKLEEKR+ EK+KALQ SKVFEEQD +   ES+DE   +HT +
Sbjct: 228  KKKSEGVSEVLSWVNKSRKLEEKRNAEKQKALQLSKVFEEQDKIDQGESEDEDTARHTSK 287

Query: 1228 HLGGVKILHGLDKVLEGGAVVLTLKDQSILADGDINQEVDMLENVEIGEQXXXXXXXXXX 1407
             L GVKILHG+DKV+EGGAVVLTLKDQ+ILA+ D+N+E D+LENVEIGEQ          
Sbjct: 288  DLAGVKILHGIDKVIEGGAVVLTLKDQNILANDDVNEEADVLENVEIGEQKQRDAAYKAA 347

Query: 1408 XXXTGVYDDKFNDEPGAEKKILPQYDDPVADEGVTLDSSGRFTGEAXXXXXXXXXXINGV 1587
               TG+Y+DKF+ E GA+KKILPQYDDPV DEG+ LD SGRF GEA          + GV
Sbjct: 348  KKKTGIYEDKFSGEDGAQKKILPQYDDPVEDEGLVLDESGRFAGEAEKKLEELRKRLQGV 407

Query: 1588 SASTRGEDLNSTGKISTDYYTQEEMTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLGSE 1767
            SAS   EDLNS+ KI++D+YT EEM                              G G  
Sbjct: 408  SASNHFEDLNSSAKITSDFYTHEEMLQFKKPKKKKSLRKKVKLDLDALEAEAISAGFGVG 467

Query: 1768 DLGSRNDGRRQNLREEQDRIEAEMRSNAYQSAYAKADEASKALRQEQVRSMQIEEDDTPV 1947
            DLGSR DG+RQ  +E+Q+R EAEMRSNAYQSA+AKA+EASK LRQEQ  ++Q+EE+++PV
Sbjct: 468  DLGSRKDGQRQATKEQQERSEAEMRSNAYQSAFAKAEEASKTLRQEQTLTVQVEENESPV 527

Query: 1948 FGDDDDELRKSLERARKVALKKQEE-EKSGPQ 2040
            FGDD+++L KSLE+ARK+ALK Q E   SGPQ
Sbjct: 528  FGDDEEDLYKSLEKARKLALKTQNEAAASGPQ 559


Top