BLASTX nr result

ID: Cocculus22_contig00001312 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00001312
         (2721 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI40671.3| unnamed protein product [Vitis vinifera]              873   0.0  
ref|XP_002264268.1| PREDICTED: uncharacterized protein LOC100266...   857   0.0  
ref|XP_002516516.1| conserved hypothetical protein [Ricinus comm...   827   0.0  
ref|XP_002297938.2| hypothetical protein POPTR_0001s11550g [Popu...   818   0.0  
ref|XP_007225495.1| hypothetical protein PRUPE_ppa000914mg [Prun...   813   0.0  
ref|XP_007022025.1| U4/U6.U5 tri-snRNP-associated protein 1 isof...   812   0.0  
ref|XP_003530377.1| PREDICTED: zinc finger CCCH domain-containin...   787   0.0  
ref|XP_006583920.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   779   0.0  
ref|XP_006583919.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   779   0.0  
ref|XP_006583918.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   779   0.0  
ref|XP_006583917.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   779   0.0  
ref|XP_004250062.1| PREDICTED: uncharacterized protein LOC101246...   778   0.0  
gb|EXB93293.1| hypothetical protein L484_015280 [Morus notabilis]     776   0.0  
ref|XP_004141556.1| PREDICTED: uncharacterized protein LOC101207...   775   0.0  
ref|XP_006361674.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   773   0.0  
ref|XP_007133507.1| hypothetical protein PHAVU_011G184800g [Phas...   771   0.0  
ref|XP_006836392.1| hypothetical protein AMTR_s00092p00135160 [A...   766   0.0  
gb|EYU25740.1| hypothetical protein MIMGU_mgv1a000914mg [Mimulus...   765   0.0  
ref|XP_007022027.1| U4/U6.U5 tri-snRNP-associated protein 1 isof...   762   0.0  
ref|XP_006471158.1| PREDICTED: U4/U6.U5 tri-snRNP-associated pro...   757   0.0  

>emb|CBI40671.3| unnamed protein product [Vitis vinifera]
          Length = 944

 Score =  873 bits (2255), Expect = 0.0
 Identities = 448/697 (64%), Positives = 543/697 (77%), Gaps = 22/697 (3%)
 Frame = +2

Query: 170  KIRDEGHHKAIDSENSKEIE-------------------RDDTNGSTMEHKQKNEASNGQ 292
            K RDEGH ++ D     +++                    D+ +   +EH++  E ++G 
Sbjct: 241  KNRDEGHDRSKDGGKDDKLKLDGGDNRDRDVTKQGRGSHHDEDDSRAIEHEKNAEGASGP 300

Query: 293  QSSASELGVRISKMREERLKQKNDGVSEILSWVNKSRKIEKKTMSEKEKALHLSKAFEEQ 472
            QSS ++L  RI +M+EER+K+K++G SE+L+WVN+SRK+E++  +EKEKAL LSK FEEQ
Sbjct: 301  QSSTAQLQERILRMKEERVKRKSEGSSEVLAWVNRSRKVEEQRNAEKEKALQLSKIFEEQ 360

Query: 473  DNIVQGENEEDESTQQTTKDLAGIKILHGLDKVIEGGAVVLTLKDQNILADGDINDEMDM 652
            DNI QGE+++++ T+ +++DLAG+K+LHGLDKVIEGGAVVLTLKDQ+ILA+GDIN+++DM
Sbjct: 361  DNIDQGESDDEKPTRHSSQDLAGVKVLHGLDKVIEGGAVVLTLKDQDILANGDINEDVDM 420

Query: 653  LENVEIGQQKQRDAAYKAAKK-TGIYEDKFNDEAGAQKKILAQYDDPVEEEAVTLDESGR 829
            LENVEIG+QK+RD AYKAAKK TGIYEDKFNDE G++KKIL QYDDPV +E + LD SGR
Sbjct: 421  LENVEIGEQKRRDEAYKAAKKKTGIYEDKFNDEPGSEKKILPQYDDPVTDEGLALDASGR 480

Query: 830  FTGVAEKKLEELRKRLQGTSTSTKFEDLDSSRKIATDYYTQEEMXXXXXXXXXXXXXXXX 1009
            FTG AEKKLEELR+RLQG ST+ +FEDL++  K ++DYYT EEM                
Sbjct: 481  FTGEAEKKLEELRRRLQGVSTNNRFEDLNTYGKNSSDYYTHEEMLQFKKPKKKKSLRKKE 540

Query: 1010 XXXXDALEAEAISTGLGVGDLGSRNDGRRQAAKEERDRSEAQTXXXXXXXXXXXXXXXXX 1189
                DALEAEA+S GLGVGDLGSRNDG+RQ+ +EE++RSEA+                  
Sbjct: 541  KLNIDALEAEAVSAGLGVGDLGSRNDGKRQSIREEQERSEAEMRNSAYQLAYAKADEASK 600

Query: 1190 XLRQELSSTFKVEEDENPVFGDDDDLLYKSLEKARKLALTKQEEEAATGPQAIASLA-VS 1366
             LR + +   ++EE+EN VFG+DD+ L KSL++ARKL L KQ+E A +GPQAIA LA  +
Sbjct: 601  ALRLDQTLPVQLEENENQVFGEDDEELQKSLQRARKLVLQKQDEAATSGPQAIALLASTT 660

Query: 1367 TRNQSTEIQPPSSGDNQENRVVFTEMEEFVWGLQLNEEVHKPESEDVFMDEDEVVKPSDE 1546
            T +Q+ + Q P SG++QENRVVFTEMEEFVWGLQL +E HKP+ EDVFMDEDE  K SD+
Sbjct: 661  TSSQNVDNQNPISGESQENRVVFTEMEEFVWGLQLEDEAHKPDGEDVFMDEDEAPKASDQ 720

Query: 1547 EKKDDSGGWTEVMDTSKDEDPINVVKEEVVPDETIHEVPVGKGLSGALHLLKERGTLKES 1726
            E+KD++GGWTEV DT KDE P+N  KEE+VPD+TIHEV VGKGLSGAL LLKERGTLKE 
Sbjct: 721  ERKDEAGGWTEVKDTDKDELPVNENKEEMVPDDTIHEVAVGKGLSGALQLLKERGTLKEG 780

Query: 1727 IEWGGRNMDKKKSKLVGIYESDGPKEIHIERTDEFGRIMTPKEAFRMISHKFHGKGPGKM 1906
            IEWGGRNMDKKKSKLVGIY++ G KEI IERTDEFGRIMTPKEAFRMISHKFHGKGPGKM
Sbjct: 781  IEWGGRNMDKKKSKLVGIYDNTGTKEIRIERTDEFGRIMTPKEAFRMISHKFHGKGPGKM 840

Query: 1907 KQEKRMKQYQEELKLKQMKNSDTPSLSVERMREAQARLQIPYLVLSGHVKSGDNGDRRSG 2086
            KQEKRMKQYQEELKLKQMKNSDTPS SVERMREAQARL+ PYLVLSGHVK G   D RSG
Sbjct: 841  KQEKRMKQYQEELKLKQMKNSDTPSQSVERMREAQARLKTPYLVLSGHVKPGQTSDPRSG 900

Query: 2087 FAPGE-GLPGGLTPMLGDKKVEHFLGIKRKAEPGSMG 2194
            FA  E  +PG LTPMLGD+KVEHFLGIKRKAEP +MG
Sbjct: 901  FATVEKDVPGSLTPMLGDRKVEHFLGIKRKAEPSNMG 937


>ref|XP_002264268.1| PREDICTED: uncharacterized protein LOC100266959 [Vitis vinifera]
          Length = 902

 Score =  857 bits (2214), Expect = 0.0
 Identities = 441/675 (65%), Positives = 531/675 (78%), Gaps = 3/675 (0%)
 Frame = +2

Query: 179  DEGHHKAIDSENSKEIERDDTNGSTMEHKQKNEASNGQQSSASELGVRISKMREERLKQK 358
            D+   +  D +      RD+  G   +     + ++G QSS ++L  RI +M+EER+K+K
Sbjct: 226  DQDRDRYKDRDKGSRKNRDEDGGDNRDR----DGASGPQSSTAQLQERILRMKEERVKRK 281

Query: 359  NDGVSEILSWVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENEEDESTQQTTKDLA 538
            ++G SE+L+WVN+SRK+E++  +EKEKAL LSK FEEQDNI QGE+++++ T+ ++  LA
Sbjct: 282  SEGSSEVLAWVNRSRKVEEQRNAEKEKALQLSKIFEEQDNIDQGESDDEKPTRHSSH-LA 340

Query: 539  GIKILHGLDKVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQKQRDAAYKAAKK- 715
            G+K+LHGLDKVIEGGAVVLTLKDQ+ILA+GDIN+++DMLENVEIG+QK+RD AYKAAKK 
Sbjct: 341  GVKVLHGLDKVIEGGAVVLTLKDQDILANGDINEDVDMLENVEIGEQKRRDEAYKAAKKK 400

Query: 716  TGIYEDKFNDEAGAQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKLEELRKRLQGTSTS 895
            TGIYEDKFNDE G++KKIL QYDDPV +E + LD SGRFTG AEKKLEELR+RLQG ST+
Sbjct: 401  TGIYEDKFNDEPGSEKKILPQYDDPVTDEGLALDASGRFTGEAEKKLEELRRRLQGVSTN 460

Query: 896  TKFEDLDSSRKIATDYYTQEEMXXXXXXXXXXXXXXXXXXXXDALEAEAISTGLGVGDLG 1075
             +FEDL++  K ++DYYT EEM                    DALEAEA+S GLGVGDLG
Sbjct: 461  NRFEDLNTYGKNSSDYYTHEEMLQFKKPKKKKSLRKKEKLNIDALEAEAVSAGLGVGDLG 520

Query: 1076 SRNDGRRQAAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSSTFKVEEDENPVFGD 1255
            SRNDG+RQ+ +EE++RSEA+                   LR + +   ++EE+EN VFG+
Sbjct: 521  SRNDGKRQSIREEQERSEAEMRNSAYQLAYAKADEASKALRLDQTLPVQLEENENQVFGE 580

Query: 1256 DDDLLYKSLEKARKLALTKQEEEAATGPQAIASLA-VSTRNQSTEIQPPSSGDNQENRVV 1432
            DD+ L KSL++ARKL L KQ+E A +GPQAIA LA  +T +Q+ + Q P SG++QENRVV
Sbjct: 581  DDEELQKSLQRARKLVLQKQDEAATSGPQAIALLASTTTSSQNVDNQNPISGESQENRVV 640

Query: 1433 FTEMEEFVWGLQLNEEVHKPESEDVFMDEDEVVKPSDEEKKDDSGGWTEVMDTSKDEDPI 1612
            FTEMEEFVWGLQL +E HKP+ EDVFMDEDE  K SD+E+KD++GGWTEV DT KDE P+
Sbjct: 641  FTEMEEFVWGLQLEDEAHKPDGEDVFMDEDEAPKASDQERKDEAGGWTEVKDTDKDELPV 700

Query: 1613 NVVKEEVVPDETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMDKKKSKLVGIYESD 1792
            N  KEE+VPD+TIHEV VGKGLSGAL LLKERGTLKE IEWGGRNMDKKKSKLVGIY++ 
Sbjct: 701  NENKEEMVPDDTIHEVAVGKGLSGALQLLKERGTLKEGIEWGGRNMDKKKSKLVGIYDNT 760

Query: 1793 GPKEIHIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSD 1972
            G KEI IERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSD
Sbjct: 761  GTKEIRIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSD 820

Query: 1973 TPSLSVERMREAQARLQIPYLVLSGHVKSGDNGDRRSGFAPGE-GLPGGLTPMLGDKKVE 2149
            TPS SVERMREAQARL+ PYLVLSGHVK G   D RSGFA  E  +PG LTPMLGD+KVE
Sbjct: 821  TPSQSVERMREAQARLKTPYLVLSGHVKPGQTSDPRSGFATVEKDVPGSLTPMLGDRKVE 880

Query: 2150 HFLGIKRKAEPGSMG 2194
            HFLGIKRKAEP +MG
Sbjct: 881  HFLGIKRKAEPSNMG 895


>ref|XP_002516516.1| conserved hypothetical protein [Ricinus communis]
            gi|223544336|gb|EEF45857.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 873

 Score =  827 bits (2137), Expect = 0.0
 Identities = 434/665 (65%), Positives = 513/665 (77%), Gaps = 6/665 (0%)
 Frame = +2

Query: 203  DSENSKEIERDDTNGSTMEHKQKNEASNGQQSSASELGVRISKMREERLKQKNDGVSEIL 382
            D    K++  DD N    + ++    S G  +S+ E   RI K+REERLK+ +D  SE+L
Sbjct: 202  DVGKQKKVSFDDDND---DEQKVERTSGGGLASSLEFEERILKVREERLKKNSDAGSEVL 258

Query: 383  SWVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENEEDESTQQTTKDLAGIKILHGL 562
            SWVN+SRK+ +K  +EK+KA  LSK FEEQD IVQGE+E++E+ +  T DLAG+K+LHGL
Sbjct: 259  SWVNRSRKLAEKKNAEKKKAKQLSKVFEEQDKIVQGESEDEEAGELATNDLAGVKVLHGL 318

Query: 563  DKVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQKQRDAAYKAAKK-TGIYEDKF 739
            +KV+EGGAVVLTLKDQ+IL DGDIN+E+DMLEN+EIG+QK+R+ AYKAAKK TGIY+DKF
Sbjct: 319  EKVMEGGAVVLTLKDQSILVDGDINEEVDMLENIEIGEQKRRNEAYKAAKKKTGIYDDKF 378

Query: 740  NDEAGAQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKLEELRKRLQGTSTSTKFEDLDS 919
            ND+  +++KIL QYDDP  +E VTLDE GRFTG AEKKLEELR+RLQG  T   FEDL+S
Sbjct: 379  NDDPASERKILPQYDDPTTDEGVTLDERGRFTGEAEKKLEELRRRLQGALTDNCFEDLNS 438

Query: 920  SRKIATDYYTQEEMXXXXXXXXXXXXXXXXXXXXDALEAEAISTGLGVGDLGSRNDGRRQ 1099
            S K+++D+YT EEM                    DALEAEA+S GLGVGDLGSR+DGRRQ
Sbjct: 439  SGKMSSDFYTHEEMLQFKKPKKKKSLRKKEKLDIDALEAEAVSAGLGVGDLGSRSDGRRQ 498

Query: 1100 AAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSSTFKVEEDENPVFGDDDDLLYKS 1279
            A +EE++RSEA+                   LR E +   KV E+ENPVF DDD+ L+KS
Sbjct: 499  AIREEQERSEAERRSSAYQSAYAKADEASKSLRLEQTLPAKVNEEENPVFADDDEDLFKS 558

Query: 1280 LEKARKLALTKQEEEAATGPQAIASLAVSTRNQSTEIQPPSSGDNQENRVVFTEMEEFVW 1459
            LE+ARKLAL KQEE  A+GPQAIA LA +T NQ  + Q P+ G++QEN+VVFTEMEEFVW
Sbjct: 559  LERARKLALKKQEE--ASGPQAIARLATATNNQIADDQNPADGESQENKVVFTEMEEFVW 616

Query: 1460 GLQLNEEVHKPESEDVFMDEDEVVKPSDEEKKDDSGGWTEVMDTSKDEDPINVVKEEVVP 1639
            GLQL+EE HKP SEDVFMDED   + SD+E KD++G WTEV D ++D++ +N  KE+VVP
Sbjct: 617  GLQLDEESHKPGSEDVFMDEDAAPRVSDQEMKDEAGRWTEVNDAAEDDNSVNENKEDVVP 676

Query: 1640 DETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMDKKKSKLVGIYESDGP----KEI 1807
            DETIHEV VGKGLSGAL LLKERGTLKE+++WGGRNMDKKKSKLVGI +SD      KEI
Sbjct: 677  DETIHEVAVGKGLSGALKLLKERGTLKETVDWGGRNMDKKKSKLVGIVDSDADNEKFKEI 736

Query: 1808 HIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLS 1987
             IER DEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPS S
Sbjct: 737  RIERMDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSES 796

Query: 1988 VERMREAQARLQIPYLVLSGHVKSGDNGDRRSGFAPGE-GLPGGLTPMLGDKKVEHFLGI 2164
            VERMREAQ +L+ PYLVLSGHVKSG   D RS FA  E  LPGGLTPMLGDKKVEHFLGI
Sbjct: 797  VERMREAQKKLKTPYLVLSGHVKSGQASDPRSSFATVEKDLPGGLTPMLGDKKVEHFLGI 856

Query: 2165 KRKAE 2179
            KRKAE
Sbjct: 857  KRKAE 861


>ref|XP_002297938.2| hypothetical protein POPTR_0001s11550g [Populus trichocarpa]
            gi|550347020|gb|EEE82743.2| hypothetical protein
            POPTR_0001s11550g [Populus trichocarpa]
          Length = 862

 Score =  818 bits (2112), Expect = 0.0
 Identities = 439/690 (63%), Positives = 522/690 (75%), Gaps = 15/690 (2%)
 Frame = +2

Query: 170  KIRDEGHHKAIDSENSKEIERDDTNGSTMEHKQKNE-----ASNGQQSSASELGVRISKM 334
            K  +E +   +  +   E+++D+     +  + +++     AS G  SSASELG RI KM
Sbjct: 168  KSNEEDYDDKVQMDYEDEVDKDNRKQGKVSFRDEDDQSAEGASAGAHSSASELGQRILKM 227

Query: 335  REERLKQKNDGVSEILSWVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENEEDEST 514
            +EER K+K++  S+IL+WV KSRKIE+   + K++A HLSK FEEQDNI QG ++++E+ 
Sbjct: 228  KEERTKKKSEPGSDILAWVGKSRKIEENKYAAKKRAKHLSKIFEEQDNIGQGGSDDEEAD 287

Query: 515  QQTTKDLAGIKILHGLDKVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQKQRDA 694
            Q    +LAGIK+L GLDKV+EGGAVVLTLKDQNILADGDIN+E+DMLENVEIG+QK+RD 
Sbjct: 288  QHNAYNLAGIKVLDGLDKVLEGGAVVLTLKDQNILADGDINEEVDMLENVEIGEQKRRDE 347

Query: 695  AYKAA-KKTGIYEDKFNDEAGAQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKLEELRK 871
            AYKAA KKTGIYEDKFND+  ++KK+L QYDD   +E VTLDE GRFTG AEKKLEELR+
Sbjct: 348  AYKAAKKKTGIYEDKFNDDPASEKKMLPQYDDANADEGVTLDERGRFTGEAEKKLEELRR 407

Query: 872  RLQGTSTSTKFEDLDSSRKIATDYYTQEEMXXXXXXXXXXXXXXXXXXXXDALEAEAIST 1051
            RLQGTSTS + EDL+SS KI++DY+T EEM                    DALEAEA+S 
Sbjct: 408  RLQGTSTSARLEDLNSSGKISSDYFTHEEMLQFKKPKKKKSLRKKDKLDIDALEAEAVSA 467

Query: 1052 GLGVGDLGSRNDGRRQAAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSSTFKVEE 1231
            GLG+GDLGSR DGRRQA +EE++RSEA+                   LR + +   KVEE
Sbjct: 468  GLGIGDLGSRKDGRRQAIREEQERSEAEMRNNAYQSAYAKADEASKSLRLDRTLQTKVEE 527

Query: 1232 DENPVFGDDDDLLYKSLEKARKLALTKQEEEAATGPQAIASLAVST-RNQSTEIQPPSSG 1408
            +EN VF DD++ LYKSLE+ARKLAL KQE E A+GP AIA LA +T  +Q  + + P +G
Sbjct: 528  EENLVFADDEEDLYKSLERARKLALKKQEAE-ASGPLAIAHLASTTLSSQIADDKNPETG 586

Query: 1409 DNQENRVVFTEMEEFVWGLQLNEEVHKPESEDVFMDEDEVVKPSDEEKKDDSGGWTEVMD 1588
            ++ EN++VFTEMEEFV  +QL EEVHKP++EDVFMDEDE  + SDEE+KD++GGW EV D
Sbjct: 587  ESHENKLVFTEMEEFVSAIQLAEEVHKPDNEDVFMDEDEPPRVSDEEQKDEAGGWMEVPD 646

Query: 1589 TSKDEDPINVVKEEVVPDETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMDKKKSK 1768
             SKDE+P+N   EE+VPDETIHEV VGKGLSGAL LLKERGTLKESI+WGGRNMDKKKSK
Sbjct: 647  NSKDENPVN-EDEEIVPDETIHEVAVGKGLSGALKLLKERGTLKESIDWGGRNMDKKKSK 705

Query: 1769 LVGIYESDGP-------KEIHIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMK 1927
            LVGI + D         K+I IERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMK
Sbjct: 706  LVGIVDDDVGTNNDNKFKDIRIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMK 765

Query: 1928 QYQEELKLKQMKNSDTPSLSVERMREAQARLQIPYLVLSGHVKSGDNGDRRSGFAPGE-G 2104
            QYQEELKLKQMKNSDTPSLSVERMR AQA+L+ PYLVLSGHVK G   D RSGFA  E  
Sbjct: 766  QYQEELKLKQMKNSDTPSLSVERMRGAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKD 825

Query: 2105 LPGGLTPMLGDKKVEHFLGIKRKAEPGSMG 2194
             PGGLTPMLGDKKVEHFLGIKRK E G  G
Sbjct: 826  FPGGLTPMLGDKKVEHFLGIKRKPETGFSG 855


>ref|XP_007225495.1| hypothetical protein PRUPE_ppa000914mg [Prunus persica]
            gi|596285693|ref|XP_007225496.1| hypothetical protein
            PRUPE_ppa000914mg [Prunus persica]
            gi|462422431|gb|EMJ26694.1| hypothetical protein
            PRUPE_ppa000914mg [Prunus persica]
            gi|462422432|gb|EMJ26695.1| hypothetical protein
            PRUPE_ppa000914mg [Prunus persica]
          Length = 963

 Score =  813 bits (2099), Expect = 0.0
 Identities = 433/700 (61%), Positives = 507/700 (72%), Gaps = 41/700 (5%)
 Frame = +2

Query: 218  KEIERDDTNGSTMEHKQKNEASNGQQSSASELGVRISKMREERLKQKNDGVSEILSWVNK 397
            K+I++   + +  + ++    S G   SA EL  RI K +EERLK+K + V E+L+WV++
Sbjct: 257  KDIKQGKVSHNAEDERKAEGLSGGAHLSALELEERIMKTKEERLKKKKEDVPEVLAWVSR 316

Query: 398  SRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENEEDESTQQTTKDLAGIKILHGLDKVIE 577
            SRK+E K  +EK+KAL LSK FEEQDNI QGE+E++E+ Q TT DLAG+K+LHGLDKV+E
Sbjct: 317  SRKLEDKRNAEKQKALQLSKIFEEQDNIGQGESEDEETAQDTTHDLAGVKVLHGLDKVME 376

Query: 578  GGAVVLTLKDQNILADGDINDEMDMLENVEIGQQKQRDAAYKAAKK-TGIYEDKFNDEAG 754
            GGAVVLTLKDQNILADG +N+++DMLENVEIG+QKQRD AYKAAKK TGIY DKFND+  
Sbjct: 377  GGAVVLTLKDQNILADGGVNEDIDMLENVEIGEQKQRDDAYKAAKKKTGIYVDKFNDDLN 436

Query: 755  AQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKLEELRKRLQGTSTSTKFEDLDSSRKIA 934
             +KKIL QYDDPV +E +TLDE GRFTG AEKKLEELRKR+QG  T+ +FEDL+ S  I 
Sbjct: 437  TEKKILPQYDDPVPDEGLTLDERGRFTGEAEKKLEELRKRIQGVPTNNRFEDLNMSGNIT 496

Query: 935  TDYYTQEEMXXXXXXXXXXXXXXXXXXXXD--ALEAEAISTGLGVGDLGSRNDGRRQAAK 1108
            +D+YTQEEM                    D  ALEAEA+S GLGV DLGSRND +RQA K
Sbjct: 497  SDFYTQEEMLQFKKPKKGKKKSLRKKEKLDLDALEAEAVSAGLGVADLGSRNDAKRQANK 556

Query: 1109 EERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSSTFKVEEDENPVFGDDDDLLYKSLEK 1288
            EE++R EA+                   LR E   T   EEDE P F DDDD LYKSLE+
Sbjct: 557  EEQERLEAERRNSAYQLAYAKADEASKSLRLEQILTVIPEEDETPAFADDDDDLYKSLER 616

Query: 1289 ARKLALTKQEEEAATGPQAIASLAVSTRN-QSTEIQPPSSGDNQENRVVFTEMEEFVWGL 1465
            ARKLAL K+EEE A+GPQAIA LA +T + Q+ + Q PS+G++Q+N+VVFTEMEEFVWGL
Sbjct: 617  ARKLALKKKEEETASGPQAIALLATTTASSQTADNQIPSTGESQDNKVVFTEMEEFVWGL 676

Query: 1466 QLNEEVHKPESEDVFMDEDEVVKPSDEEKKDDSGGWTEVMDTSKDEDPINVVKEEVVPDE 1645
            QL+EE HKPESEDVFM EDE  KPS EE+ ++ GGWTEV D  +DE P    KEE+VPDE
Sbjct: 677  QLDEESHKPESEDVFMQEDEEPKPSHEERMNEPGGWTEVKDMDEDEKPATEDKEEIVPDE 736

Query: 1646 TIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMDKKKSKLVGIYESDG---------- 1795
            TIHEV VGKGLSG L LLK+RGTLKE IEWGGRNMDKKKSKL+GI + D           
Sbjct: 737  TIHEVAVGKGLSGVLKLLKDRGTLKEGIEWGGRNMDKKKSKLLGIVDDDDEPKEPHTSRQ 796

Query: 1796 --------------------------PKEIHIERTDEFGRIMTPKEAFRMISHKFHGKGP 1897
                                       K+IHIERTDEFGR +TPKEAFR +SHKFHGKGP
Sbjct: 797  KKDEHKDTRPSSSSHQKETRPSKVYQEKDIHIERTDEFGRTLTPKEAFRTLSHKFHGKGP 856

Query: 1898 GKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERMREAQARLQIPYLVLSGHVKSGDNGDR 2077
            GKMKQEKRMKQYQEELKLKQMK+SDTPSLS ERMR+ QARLQ PYLVLSGHVK G   D 
Sbjct: 857  GKMKQEKRMKQYQEELKLKQMKSSDTPSLSAERMRDTQARLQTPYLVLSGHVKPGQTSDP 916

Query: 2078 RSGFAPGE-GLPGGLTPMLGDKKVEHFLGIKRKAEPGSMG 2194
            RSGFA  E   PGGLTPMLGD+KVE++LGIKRKAEP S G
Sbjct: 917  RSGFATVEKDFPGGLTPMLGDRKVENYLGIKRKAEPESSG 956


>ref|XP_007022025.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 1 [Theobroma cacao]
            gi|590611175|ref|XP_007022026.1| U4/U6.U5
            tri-snRNP-associated protein 1 isoform 1 [Theobroma
            cacao] gi|508721653|gb|EOY13550.1| U4/U6.U5
            tri-snRNP-associated protein 1 isoform 1 [Theobroma
            cacao] gi|508721654|gb|EOY13551.1| U4/U6.U5
            tri-snRNP-associated protein 1 isoform 1 [Theobroma
            cacao]
          Length = 907

 Score =  812 bits (2097), Expect = 0.0
 Identities = 429/681 (62%), Positives = 517/681 (75%), Gaps = 12/681 (1%)
 Frame = +2

Query: 182  EGHHKAIDSENSKEIERDDTNGSTMEHKQKNEASNG--QQSSASELGVRISKMREERLKQ 355
            + H +  +     E+  D  +    +  + N  SN    Q+S+SEL  RI++M+EERLK+
Sbjct: 219  KNHEEDYEGSKDGELALDYGDSRDKDEAELNAGSNAGVAQASSSELEERIARMKEERLKK 278

Query: 356  KNDGVSEILSWVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENEEDESTQQTTKDL 535
            K++GVSE+L WV   RK+E+K  +EKEKAL  SK FEEQD+ VQGENE++E+ +    DL
Sbjct: 279  KSEGVSEVLEWVGNFRKLEEKRNAEKEKALQRSKIFEEQDDFVQGENEDEEAVRHAAHDL 338

Query: 536  AGIKILHGLDKVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQKQRDAAYKAA-K 712
            AG+K+LHGLDKV++GGAVVLTLKDQ+ILA+GDIN+++DMLENVEIG+Q++RD AYKAA K
Sbjct: 339  AGVKVLHGLDKVMDGGAVVLTLKDQSILANGDINEDVDMLENVEIGEQRRRDEAYKAAKK 398

Query: 713  KTGIYEDKFNDEAGAQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKLEELRKRLQGTST 892
            KTG+Y+DKFNDE G++KKIL QYD+PV +E VTLDE GRFTG AEKKL+ELRKRLQG  T
Sbjct: 399  KTGVYDDKFNDEPGSEKKILPQYDNPVADEGVTLDERGRFTGEAEKKLQELRKRLQGVPT 458

Query: 893  STKFEDLDSSRKIATDYYTQEEMXXXXXXXXXXXXXXXXXXXXDALEAEAISTGLGVGDL 1072
            + + EDL+++ KIA+DYYTQEEM                    DALEAEAIS+GLG GDL
Sbjct: 459  NNRVEDLNNAGKIASDYYTQEEMLKFKKPKKKKALRKKEKLDIDALEAEAISSGLGAGDL 518

Query: 1073 GSRNDGRRQAAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSSTFKVEEDENPVFG 1252
            GSRND RRQA +EE  RSEA+                   L  E +   K EEDEN VF 
Sbjct: 519  GSRNDARRQAIREEEARSEAEKRNSAYQSAYAKADEASKSLWLEQTLIVKPEEDENQVFA 578

Query: 1253 DDDDLLYKSLEKARKLALTKQEEEAATGPQAIASLAVSTR-NQSTEIQPPSSGDNQENRV 1429
            DDDD LYKS+E++RKLA  KQE+E  +GPQAIA  A +   +Q+ + Q  ++G+ QEN++
Sbjct: 579  DDDDDLYKSIERSRKLAFKKQEDE-KSGPQAIALRATTAAISQTADDQTTTTGEAQENKL 637

Query: 1430 VFTEMEEFVWGLQLNEEVHKPESEDVFMDEDEVVKPSDEEKK---DDSGGWTEVMDTSKD 1600
            V TEMEEFVWGLQ +EE HKP+SEDVFMDEDEV   S+ + K   ++ GGWTEV+D S D
Sbjct: 638  VITEMEEFVWGLQHDEEAHKPDSEDVFMDEDEVPGVSEHDGKSGENEVGGWTEVVDASTD 697

Query: 1601 EDPINVVKEEVVPDETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMDKKKSKLVGI 1780
            E+P N  K+++VPDETIHEV VGKGLSGAL LLK+RGTLKESIEWGGRNMDKKKSKLVGI
Sbjct: 698  ENPSNEDKDDIVPDETIHEVAVGKGLSGALKLLKDRGTLKESIEWGGRNMDKKKSKLVGI 757

Query: 1781 Y----ESDGPKEIHIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELK 1948
                 E+D  K+I IERTDEFGRI+TPKEAFR++SHKFHGKGPGKMKQEKR KQYQEELK
Sbjct: 758  VDDDRENDRFKDIRIERTDEFGRIITPKEAFRVLSHKFHGKGPGKMKQEKRQKQYQEELK 817

Query: 1949 LKQMKNSDTPSLSVERMREAQARLQIPYLVLSGHVKSGDNGDRRSGFAPGE-GLPGGLTP 2125
            LKQMKNSDTPSLSVERMREAQA+L+ PYLVLSGHVK G   D RSGFA  E   PGGLTP
Sbjct: 818  LKQMKNSDTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDFPGGLTP 877

Query: 2126 MLGDKKVEHFLGIKRKAEPGS 2188
            MLGD+KVEHFLGIKRKAEPG+
Sbjct: 878  MLGDRKVEHFLGIKRKAEPGN 898


>ref|XP_003530377.1| PREDICTED: zinc finger CCCH domain-containing protein 13-like
            [Glycine max]
          Length = 882

 Score =  787 bits (2033), Expect = 0.0
 Identities = 424/674 (62%), Positives = 501/674 (74%), Gaps = 11/674 (1%)
 Frame = +2

Query: 200  IDSENSKEIERDDTNGST-MEHKQKNEASNGQQS---SASELGVRISKMREERLKQKNDG 367
            +D +   + +RD+  G    + K  N+  +GQ S   S++EL  RI KM+E R K++ + 
Sbjct: 208  VDDKVDYQDKRDEEIGKQEKDSKLDNDNQDGQTSAHLSSTELEDRILKMKESRTKKQPEA 267

Query: 368  VSEILSWVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENEEDESTQQTTKDLAGIK 547
             SEI +WVNKSRKIEKK      +A  LSK FEEQDNI   E  +DE T Q T +LAG+K
Sbjct: 268  DSEISAWVNKSRKIEKK------RAFQLSKIFEEQDNIAV-EGSDDEDTAQHTDNLAGVK 320

Query: 548  ILHGLDKVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQKQRDAAYKAAKK-TGI 724
            +LHGLDKV+EGG VVLT+KDQ ILADGD+N+++DMLEN+EIG+QK+RD AYKAAKK TG+
Sbjct: 321  VLHGLDKVMEGGTVVLTIKDQPILADGDVNEDVDMLENIEIGEQKRRDEAYKAAKKKTGV 380

Query: 725  YEDKFNDEAGAQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKLEELRKRLQGTSTSTKF 904
            Y+DKF+D+   +KK+L QYDDP  EE +TLD  GRF+G AEKKLEELR+RL G ST+T F
Sbjct: 381  YDDKFHDDPSTEKKMLPQYDDPAAEEGLTLDGKGRFSGEAEKKLEELRRRLTGVSTNT-F 439

Query: 905  EDLDSSRKIATDYYTQEEMXXXXXXXXXXXXXXXXXXXXDALEAEAISTGLGVGDLGSRN 1084
            EDL SS K+++DYYT EEM                    +ALEAEA+S+GLGVGDLGSR 
Sbjct: 440  EDLTSSGKVSSDYYTHEEMLKFKKPKKKKSLRKKDKLDINALEAEAVSSGLGVGDLGSRK 499

Query: 1085 DGRRQAAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSSTFKVEEDENPVFGDDDD 1264
            D RRQA K+E++R EA+                   LR E +   K EEDE PVF DDD+
Sbjct: 500  DVRRQAIKDEQERLEAEMRSNAYQSAYAKADEASKLLRLEQTLNVKTEEDETPVFVDDDE 559

Query: 1265 LLYKSLEKARKLALTKQEEEAATGPQAIASLAVSTRNQSTEIQPPSSGDNQENRVVFTEM 1444
             L KSLEKAR+LAL K+E E A+GPQAIA LA S  N  T+ Q P++G+++EN+VVFTEM
Sbjct: 560  DLRKSLEKARRLALKKKEGEGASGPQAIALLATSNHNNETDDQNPTAGESRENKVVFTEM 619

Query: 1445 EEFVWGLQLNEEVHKPESEDVFMDEDEVVKPSDEEKKDDSGGWTEVMDTSKDEDPINVVK 1624
            EEFVWGL ++EE  KPESEDVFM +DE     DEEK ++ GGWTEV +TS+DE      K
Sbjct: 620  EEFVWGLHIDEEARKPESEDVFMHDDEEANVPDEEKINEVGGWTEVQETSEDEQRNTEDK 679

Query: 1625 EEVVPDETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMDKKKSKLVGIY-----ES 1789
            EE++PDETIHEV VGKGLSGAL LLKERGTLKESIEWGGRNMDKKKSKLVGI      E+
Sbjct: 680  EEIIPDETIHEVAVGKGLSGALKLLKERGTLKESIEWGGRNMDKKKSKLVGIVDDEEKEA 739

Query: 1790 DGPKEIHIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNS 1969
               +EI IERTDEFGRI+TPKEAFRMISHKFHGKGPGKMKQEKRMKQY EELK+KQMK+S
Sbjct: 740  QKTREIRIERTDEFGRILTPKEAFRMISHKFHGKGPGKMKQEKRMKQYYEELKMKQMKSS 799

Query: 1970 DTPSLSVERMREAQARLQIPYLVLSGHVKSGDNGDRRSGFAPGE-GLPGGLTPMLGDKKV 2146
            DTPSLSVERMREAQARLQ PYLVLSGHVK G   D +SGFA  E  LPGGLTPMLGD+KV
Sbjct: 800  DTPSLSVERMREAQARLQTPYLVLSGHVKPGQTSDPKSGFATVEKDLPGGLTPMLGDRKV 859

Query: 2147 EHFLGIKRKAEPGS 2188
            EHFLGIKRKAEP S
Sbjct: 860  EHFLGIKRKAEPSS 873


>ref|XP_006583920.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1-like isoform X4
            [Glycine max] gi|571467371|ref|XP_006583921.1| PREDICTED:
            U4/U6.U5 tri-snRNP-associated protein 1-like isoform X5
            [Glycine max]
          Length = 880

 Score =  779 bits (2012), Expect = 0.0
 Identities = 423/674 (62%), Positives = 501/674 (74%), Gaps = 11/674 (1%)
 Frame = +2

Query: 200  IDSENSKEIERDDTNGS-TMEHKQKNEASNGQQS---SASELGVRISKMREERLKQKNDG 367
            +D +     +RD+  G    + K  N+  +GQ S   S++EL  RI KM+E R K++ + 
Sbjct: 208  VDDKVDYHDKRDEEIGKQAKDSKLDNDNQDGQTSAHLSSTELEERILKMKESRTKKQPEA 267

Query: 368  VSEILSWVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENEEDESTQQTTKDLAGIK 547
             SEI +WVNKSRKIEKK      +A  LSK FEEQDNI   E  ++E T Q T +LAG+K
Sbjct: 268  DSEISTWVNKSRKIEKK------RAFQLSKIFEEQDNIAV-EGSDNEDTAQHTDNLAGVK 320

Query: 548  ILHGLDKVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQKQRDAAYKAAKK-TGI 724
            +LHGLDKV+EGG VVLT+KDQ ILADGD+N+++DMLEN+EIG+QK+RD AYKAAKK TG+
Sbjct: 321  VLHGLDKVMEGGTVVLTIKDQPILADGDVNEDVDMLENIEIGEQKRRDEAYKAAKKKTGV 380

Query: 725  YEDKFNDEAGAQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKLEELRKRLQGTSTSTKF 904
            Y+DKF D+   +KK+L QYDDP  EE +TLDE GRF+G AEKKLEELR+RL G ST+T F
Sbjct: 381  YDDKFTDDPSTEKKMLQQYDDPAAEEGLTLDEKGRFSGEAEKKLEELRRRLTGVSTNT-F 439

Query: 905  EDLDSSRKIATDYYTQEEMXXXXXXXXXXXXXXXXXXXXDALEAEAISTGLGVGDLGSRN 1084
            EDL SS K+++DYYT EEM                    +ALEAEA+S+GLGVGDLGSR 
Sbjct: 440  EDLTSSGKVSSDYYTHEEMLKFKKPKKKKSLRKKDRLDINALEAEAVSSGLGVGDLGSRK 499

Query: 1085 DGRRQAAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSSTFKVEEDENPVFGDDDD 1264
            D RRQA K+E++R EA+T                  LR E +   K EEDE PVF DDD+
Sbjct: 500  DVRRQAIKDEQERLEAETRSNAYQSAYAKADEASKLLRLEQTLNVK-EEDETPVFVDDDE 558

Query: 1265 LLYKSLEKARKLALTKQEEEAATGPQAIASLAVSTRNQSTEIQPPSSGDNQENRVVFTEM 1444
             L KSLEKAR+LAL K+E E A+GPQAIA LA S  N  T+ Q P++G+++EN+VVFTEM
Sbjct: 559  DLCKSLEKARRLAL-KKEGEGASGPQAIALLATSNHNNETDDQNPTAGESRENKVVFTEM 617

Query: 1445 EEFVWGLQLNEEVHKPESEDVFMDEDEVVKPSDEEKKDDSGGWTEVMDTSKDEDPINVVK 1624
            EEFVWGL ++EE  KPESEDVFM +DE     DEE  +++GGWTEV +T++DE      K
Sbjct: 618  EEFVWGLHIDEEARKPESEDVFMHDDEETNVPDEENSNEAGGWTEVQETNEDEQHNTEDK 677

Query: 1625 EEVVPDETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMDKKKSKLVGIY-----ES 1789
            EE+VPDETIHEV VGKGLSGAL LLKERGTLKESIEWGGR+MDKKKSKLVGI      E+
Sbjct: 678  EEIVPDETIHEVAVGKGLSGALKLLKERGTLKESIEWGGRSMDKKKSKLVGIVDDEEKEA 737

Query: 1790 DGPKEIHIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNS 1969
               +EI IERTDEFGRI+TPKEAFRMISHKFHGKGPGKMKQEKRMKQY EELK+KQMK+S
Sbjct: 738  QKTREIRIERTDEFGRILTPKEAFRMISHKFHGKGPGKMKQEKRMKQYHEELKMKQMKSS 797

Query: 1970 DTPSLSVERMREAQARLQIPYLVLSGHVKSGDNGDRRSGFAPGE-GLPGGLTPMLGDKKV 2146
            DTPSLSVERMREAQARLQ PYLVLSGHVK G   D +SGFA  E  LPGGLTPMLGD+KV
Sbjct: 798  DTPSLSVERMREAQARLQTPYLVLSGHVKPGQTSDPKSGFATVEKDLPGGLTPMLGDRKV 857

Query: 2147 EHFLGIKRKAEPGS 2188
            EHFLGIKRKAEP S
Sbjct: 858  EHFLGIKRKAEPSS 871


>ref|XP_006583919.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1-like isoform X3
            [Glycine max]
          Length = 909

 Score =  779 bits (2012), Expect = 0.0
 Identities = 423/674 (62%), Positives = 501/674 (74%), Gaps = 11/674 (1%)
 Frame = +2

Query: 200  IDSENSKEIERDDTNGS-TMEHKQKNEASNGQQS---SASELGVRISKMREERLKQKNDG 367
            +D +     +RD+  G    + K  N+  +GQ S   S++EL  RI KM+E R K++ + 
Sbjct: 237  VDDKVDYHDKRDEEIGKQAKDSKLDNDNQDGQTSAHLSSTELEERILKMKESRTKKQPEA 296

Query: 368  VSEILSWVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENEEDESTQQTTKDLAGIK 547
             SEI +WVNKSRKIEKK      +A  LSK FEEQDNI   E  ++E T Q T +LAG+K
Sbjct: 297  DSEISTWVNKSRKIEKK------RAFQLSKIFEEQDNIAV-EGSDNEDTAQHTDNLAGVK 349

Query: 548  ILHGLDKVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQKQRDAAYKAAKK-TGI 724
            +LHGLDKV+EGG VVLT+KDQ ILADGD+N+++DMLEN+EIG+QK+RD AYKAAKK TG+
Sbjct: 350  VLHGLDKVMEGGTVVLTIKDQPILADGDVNEDVDMLENIEIGEQKRRDEAYKAAKKKTGV 409

Query: 725  YEDKFNDEAGAQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKLEELRKRLQGTSTSTKF 904
            Y+DKF D+   +KK+L QYDDP  EE +TLDE GRF+G AEKKLEELR+RL G ST+T F
Sbjct: 410  YDDKFTDDPSTEKKMLQQYDDPAAEEGLTLDEKGRFSGEAEKKLEELRRRLTGVSTNT-F 468

Query: 905  EDLDSSRKIATDYYTQEEMXXXXXXXXXXXXXXXXXXXXDALEAEAISTGLGVGDLGSRN 1084
            EDL SS K+++DYYT EEM                    +ALEAEA+S+GLGVGDLGSR 
Sbjct: 469  EDLTSSGKVSSDYYTHEEMLKFKKPKKKKSLRKKDRLDINALEAEAVSSGLGVGDLGSRK 528

Query: 1085 DGRRQAAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSSTFKVEEDENPVFGDDDD 1264
            D RRQA K+E++R EA+T                  LR E +   K EEDE PVF DDD+
Sbjct: 529  DVRRQAIKDEQERLEAETRSNAYQSAYAKADEASKLLRLEQTLNVK-EEDETPVFVDDDE 587

Query: 1265 LLYKSLEKARKLALTKQEEEAATGPQAIASLAVSTRNQSTEIQPPSSGDNQENRVVFTEM 1444
             L KSLEKAR+LAL K+E E A+GPQAIA LA S  N  T+ Q P++G+++EN+VVFTEM
Sbjct: 588  DLCKSLEKARRLAL-KKEGEGASGPQAIALLATSNHNNETDDQNPTAGESRENKVVFTEM 646

Query: 1445 EEFVWGLQLNEEVHKPESEDVFMDEDEVVKPSDEEKKDDSGGWTEVMDTSKDEDPINVVK 1624
            EEFVWGL ++EE  KPESEDVFM +DE     DEE  +++GGWTEV +T++DE      K
Sbjct: 647  EEFVWGLHIDEEARKPESEDVFMHDDEETNVPDEENSNEAGGWTEVQETNEDEQHNTEDK 706

Query: 1625 EEVVPDETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMDKKKSKLVGIY-----ES 1789
            EE+VPDETIHEV VGKGLSGAL LLKERGTLKESIEWGGR+MDKKKSKLVGI      E+
Sbjct: 707  EEIVPDETIHEVAVGKGLSGALKLLKERGTLKESIEWGGRSMDKKKSKLVGIVDDEEKEA 766

Query: 1790 DGPKEIHIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNS 1969
               +EI IERTDEFGRI+TPKEAFRMISHKFHGKGPGKMKQEKRMKQY EELK+KQMK+S
Sbjct: 767  QKTREIRIERTDEFGRILTPKEAFRMISHKFHGKGPGKMKQEKRMKQYHEELKMKQMKSS 826

Query: 1970 DTPSLSVERMREAQARLQIPYLVLSGHVKSGDNGDRRSGFAPGE-GLPGGLTPMLGDKKV 2146
            DTPSLSVERMREAQARLQ PYLVLSGHVK G   D +SGFA  E  LPGGLTPMLGD+KV
Sbjct: 827  DTPSLSVERMREAQARLQTPYLVLSGHVKPGQTSDPKSGFATVEKDLPGGLTPMLGDRKV 886

Query: 2147 EHFLGIKRKAEPGS 2188
            EHFLGIKRKAEP S
Sbjct: 887  EHFLGIKRKAEPSS 900


>ref|XP_006583918.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1-like isoform X2
            [Glycine max]
          Length = 936

 Score =  779 bits (2012), Expect = 0.0
 Identities = 423/674 (62%), Positives = 501/674 (74%), Gaps = 11/674 (1%)
 Frame = +2

Query: 200  IDSENSKEIERDDTNGS-TMEHKQKNEASNGQQS---SASELGVRISKMREERLKQKNDG 367
            +D +     +RD+  G    + K  N+  +GQ S   S++EL  RI KM+E R K++ + 
Sbjct: 264  VDDKVDYHDKRDEEIGKQAKDSKLDNDNQDGQTSAHLSSTELEERILKMKESRTKKQPEA 323

Query: 368  VSEILSWVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENEEDESTQQTTKDLAGIK 547
             SEI +WVNKSRKIEKK      +A  LSK FEEQDNI   E  ++E T Q T +LAG+K
Sbjct: 324  DSEISTWVNKSRKIEKK------RAFQLSKIFEEQDNIAV-EGSDNEDTAQHTDNLAGVK 376

Query: 548  ILHGLDKVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQKQRDAAYKAAKK-TGI 724
            +LHGLDKV+EGG VVLT+KDQ ILADGD+N+++DMLEN+EIG+QK+RD AYKAAKK TG+
Sbjct: 377  VLHGLDKVMEGGTVVLTIKDQPILADGDVNEDVDMLENIEIGEQKRRDEAYKAAKKKTGV 436

Query: 725  YEDKFNDEAGAQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKLEELRKRLQGTSTSTKF 904
            Y+DKF D+   +KK+L QYDDP  EE +TLDE GRF+G AEKKLEELR+RL G ST+T F
Sbjct: 437  YDDKFTDDPSTEKKMLQQYDDPAAEEGLTLDEKGRFSGEAEKKLEELRRRLTGVSTNT-F 495

Query: 905  EDLDSSRKIATDYYTQEEMXXXXXXXXXXXXXXXXXXXXDALEAEAISTGLGVGDLGSRN 1084
            EDL SS K+++DYYT EEM                    +ALEAEA+S+GLGVGDLGSR 
Sbjct: 496  EDLTSSGKVSSDYYTHEEMLKFKKPKKKKSLRKKDRLDINALEAEAVSSGLGVGDLGSRK 555

Query: 1085 DGRRQAAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSSTFKVEEDENPVFGDDDD 1264
            D RRQA K+E++R EA+T                  LR E +   K EEDE PVF DDD+
Sbjct: 556  DVRRQAIKDEQERLEAETRSNAYQSAYAKADEASKLLRLEQTLNVK-EEDETPVFVDDDE 614

Query: 1265 LLYKSLEKARKLALTKQEEEAATGPQAIASLAVSTRNQSTEIQPPSSGDNQENRVVFTEM 1444
             L KSLEKAR+LAL K+E E A+GPQAIA LA S  N  T+ Q P++G+++EN+VVFTEM
Sbjct: 615  DLCKSLEKARRLAL-KKEGEGASGPQAIALLATSNHNNETDDQNPTAGESRENKVVFTEM 673

Query: 1445 EEFVWGLQLNEEVHKPESEDVFMDEDEVVKPSDEEKKDDSGGWTEVMDTSKDEDPINVVK 1624
            EEFVWGL ++EE  KPESEDVFM +DE     DEE  +++GGWTEV +T++DE      K
Sbjct: 674  EEFVWGLHIDEEARKPESEDVFMHDDEETNVPDEENSNEAGGWTEVQETNEDEQHNTEDK 733

Query: 1625 EEVVPDETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMDKKKSKLVGIY-----ES 1789
            EE+VPDETIHEV VGKGLSGAL LLKERGTLKESIEWGGR+MDKKKSKLVGI      E+
Sbjct: 734  EEIVPDETIHEVAVGKGLSGALKLLKERGTLKESIEWGGRSMDKKKSKLVGIVDDEEKEA 793

Query: 1790 DGPKEIHIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNS 1969
               +EI IERTDEFGRI+TPKEAFRMISHKFHGKGPGKMKQEKRMKQY EELK+KQMK+S
Sbjct: 794  QKTREIRIERTDEFGRILTPKEAFRMISHKFHGKGPGKMKQEKRMKQYHEELKMKQMKSS 853

Query: 1970 DTPSLSVERMREAQARLQIPYLVLSGHVKSGDNGDRRSGFAPGE-GLPGGLTPMLGDKKV 2146
            DTPSLSVERMREAQARLQ PYLVLSGHVK G   D +SGFA  E  LPGGLTPMLGD+KV
Sbjct: 854  DTPSLSVERMREAQARLQTPYLVLSGHVKPGQTSDPKSGFATVEKDLPGGLTPMLGDRKV 913

Query: 2147 EHFLGIKRKAEPGS 2188
            EHFLGIKRKAEP S
Sbjct: 914  EHFLGIKRKAEPSS 927


>ref|XP_006583917.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1-like isoform X1
            [Glycine max]
          Length = 971

 Score =  779 bits (2012), Expect = 0.0
 Identities = 423/674 (62%), Positives = 501/674 (74%), Gaps = 11/674 (1%)
 Frame = +2

Query: 200  IDSENSKEIERDDTNGS-TMEHKQKNEASNGQQS---SASELGVRISKMREERLKQKNDG 367
            +D +     +RD+  G    + K  N+  +GQ S   S++EL  RI KM+E R K++ + 
Sbjct: 299  VDDKVDYHDKRDEEIGKQAKDSKLDNDNQDGQTSAHLSSTELEERILKMKESRTKKQPEA 358

Query: 368  VSEILSWVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENEEDESTQQTTKDLAGIK 547
             SEI +WVNKSRKIEKK      +A  LSK FEEQDNI   E  ++E T Q T +LAG+K
Sbjct: 359  DSEISTWVNKSRKIEKK------RAFQLSKIFEEQDNIAV-EGSDNEDTAQHTDNLAGVK 411

Query: 548  ILHGLDKVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQKQRDAAYKAAKK-TGI 724
            +LHGLDKV+EGG VVLT+KDQ ILADGD+N+++DMLEN+EIG+QK+RD AYKAAKK TG+
Sbjct: 412  VLHGLDKVMEGGTVVLTIKDQPILADGDVNEDVDMLENIEIGEQKRRDEAYKAAKKKTGV 471

Query: 725  YEDKFNDEAGAQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKLEELRKRLQGTSTSTKF 904
            Y+DKF D+   +KK+L QYDDP  EE +TLDE GRF+G AEKKLEELR+RL G ST+T F
Sbjct: 472  YDDKFTDDPSTEKKMLQQYDDPAAEEGLTLDEKGRFSGEAEKKLEELRRRLTGVSTNT-F 530

Query: 905  EDLDSSRKIATDYYTQEEMXXXXXXXXXXXXXXXXXXXXDALEAEAISTGLGVGDLGSRN 1084
            EDL SS K+++DYYT EEM                    +ALEAEA+S+GLGVGDLGSR 
Sbjct: 531  EDLTSSGKVSSDYYTHEEMLKFKKPKKKKSLRKKDRLDINALEAEAVSSGLGVGDLGSRK 590

Query: 1085 DGRRQAAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSSTFKVEEDENPVFGDDDD 1264
            D RRQA K+E++R EA+T                  LR E +   K EEDE PVF DDD+
Sbjct: 591  DVRRQAIKDEQERLEAETRSNAYQSAYAKADEASKLLRLEQTLNVK-EEDETPVFVDDDE 649

Query: 1265 LLYKSLEKARKLALTKQEEEAATGPQAIASLAVSTRNQSTEIQPPSSGDNQENRVVFTEM 1444
             L KSLEKAR+LAL K+E E A+GPQAIA LA S  N  T+ Q P++G+++EN+VVFTEM
Sbjct: 650  DLCKSLEKARRLAL-KKEGEGASGPQAIALLATSNHNNETDDQNPTAGESRENKVVFTEM 708

Query: 1445 EEFVWGLQLNEEVHKPESEDVFMDEDEVVKPSDEEKKDDSGGWTEVMDTSKDEDPINVVK 1624
            EEFVWGL ++EE  KPESEDVFM +DE     DEE  +++GGWTEV +T++DE      K
Sbjct: 709  EEFVWGLHIDEEARKPESEDVFMHDDEETNVPDEENSNEAGGWTEVQETNEDEQHNTEDK 768

Query: 1625 EEVVPDETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMDKKKSKLVGIY-----ES 1789
            EE+VPDETIHEV VGKGLSGAL LLKERGTLKESIEWGGR+MDKKKSKLVGI      E+
Sbjct: 769  EEIVPDETIHEVAVGKGLSGALKLLKERGTLKESIEWGGRSMDKKKSKLVGIVDDEEKEA 828

Query: 1790 DGPKEIHIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNS 1969
               +EI IERTDEFGRI+TPKEAFRMISHKFHGKGPGKMKQEKRMKQY EELK+KQMK+S
Sbjct: 829  QKTREIRIERTDEFGRILTPKEAFRMISHKFHGKGPGKMKQEKRMKQYHEELKMKQMKSS 888

Query: 1970 DTPSLSVERMREAQARLQIPYLVLSGHVKSGDNGDRRSGFAPGE-GLPGGLTPMLGDKKV 2146
            DTPSLSVERMREAQARLQ PYLVLSGHVK G   D +SGFA  E  LPGGLTPMLGD+KV
Sbjct: 889  DTPSLSVERMREAQARLQTPYLVLSGHVKPGQTSDPKSGFATVEKDLPGGLTPMLGDRKV 948

Query: 2147 EHFLGIKRKAEPGS 2188
            EHFLGIKRKAEP S
Sbjct: 949  EHFLGIKRKAEPSS 962


>ref|XP_004250062.1| PREDICTED: uncharacterized protein LOC101246008 [Solanum
            lycopersicum]
          Length = 898

 Score =  778 bits (2009), Expect = 0.0
 Identities = 416/685 (60%), Positives = 506/685 (73%), Gaps = 15/685 (2%)
 Frame = +2

Query: 176  RDEGHHKAIDSENSKEIERDDTNGSTME----HKQKNEASN------GQQSSA--SELGV 319
            RDEGH ++ D +  K+ + D    +  E    H+ +  + N      G QS+A  SEL  
Sbjct: 205  RDEGHDRSKDKDRRKDEDSDYRYAAKQEIVVSHEDEERSHNNAVETGGAQSAAAASELEE 264

Query: 320  RISKMREERLKQKNDGVSEILSWVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENE 499
            RI KM+EERLK+K++G SE+L+WV+KSRKIE+   +EKEKAL LSK FEEQD + + E++
Sbjct: 265  RILKMKEERLKKKSEGASEVLAWVSKSRKIEEIRNAEKEKALQLSKIFEEQDKMNEEESD 324

Query: 500  EDESTQQTTKDLAGIKILHGLDKVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQ 679
            ++E+ +   K+L G+K+LHGLDKV+EGGAVVLTLKDQ+ILA  D+N E+D+LENVEIG+Q
Sbjct: 325  DEENARLAAKELGGMKVLHGLDKVVEGGAVVLTLKDQSILAGDDVNQEVDVLENVEIGEQ 384

Query: 680  KQRDAAYKAAK-KTGIYEDKFNDEAGAQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKL 856
            K+RD AYKAAK KTGIY+DKFNDE G ++KIL +YDDP EEE V LD +G F+  AEKKL
Sbjct: 385  KRRDDAYKAAKNKTGIYDDKFNDEPGFERKILPKYDDPAEEEGVILDATGGFSLDAEKKL 444

Query: 857  EELRKRLQGTSTSTKFEDLDSSRKIATDYYTQEEMXXXXXXXXXXXXXXXXXXXXDALEA 1036
            EELR+R+QG S+  + EDL+SS K+ +DYYTQEEM                    DALEA
Sbjct: 445  EELRRRIQGPSSINRMEDLNSSGKLLSDYYTQEEMVQFKKPKKKKSLRKKEKMDLDALEA 504

Query: 1037 EAISTGLGVGDLGSRNDGRRQAAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSST 1216
            EA S GLGV DLGSRND  RQ  KEE++R++A+T                  LR + ++ 
Sbjct: 505  EAKSAGLGVSDLGSRNDKTRQVLKEEKERADAETRSNAYQAAYAKAEEASKALRPDKTNN 564

Query: 1217 FKVEEDENPVFGDDDDLLYKSLEKARKLALTKQEEEAATGPQAIASLAVSTRNQS-TEIQ 1393
             + EED+  VF DDD+ L KSLE+ARKLAL KQE  A T P++IASLA S  N S  +  
Sbjct: 565  NQREEDD-AVFDDDDEELRKSLERARKLALRKQEGLAKTFPESIASLAASRANDSMVDNS 623

Query: 1394 PPSSGDNQENRVVFTEMEEFVWGLQLNEEVHKPESEDVFMDEDEVVKPSDEEKKDDSGGW 1573
              +SG+ QEN+VVFTEMEEFVWGLQL+EE  KP S+DVFM+ED + KPSDEE K + GGW
Sbjct: 624  SSASGEAQENKVVFTEMEEFVWGLQLDEEEQKPGSDDVFMEEDVLPKPSDEELKSEDGGW 683

Query: 1574 TEVMDTSKDEDPINVVKEEVVPDETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMD 1753
            TEV +T ++E  +   + EV PD+TI EVPVGKGLSG L LL+ERGTLKE IEWGGRNMD
Sbjct: 684  TEVKETKEEEPSVKEEEMEVTPDDTIREVPVGKGLSGVLKLLQERGTLKEDIEWGGRNMD 743

Query: 1754 KKKSKLVGIYESDGPKEIHIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQY 1933
            KKKSKLVGI   DG KEI+IERTDE+GRI+TPKEAFR++SHKFHGKGPGKMKQEKRM+QY
Sbjct: 744  KKKSKLVGIRSEDGKKEINIERTDEYGRILTPKEAFRLLSHKFHGKGPGKMKQEKRMRQY 803

Query: 1934 QEELKLKQMKNSDTPSLSVERMREAQARLQIPYLVLSGHVKSGDNGDRRSGFAPGE-GLP 2110
            QEELK+KQMKNSDTPS SVERMRE  A+ + PY+VLSGHVK G   D RSGFA  E  LP
Sbjct: 804  QEELKIKQMKNSDTPSQSVERMRETHAQTRTPYIVLSGHVKPGQTSDPRSGFATVEKDLP 863

Query: 2111 GGLTPMLGDKKVEHFLGIKRKAEPG 2185
            GGLTPMLGDKKVEHFLGIKRK EPG
Sbjct: 864  GGLTPMLGDKKVEHFLGIKRKFEPG 888


>gb|EXB93293.1| hypothetical protein L484_015280 [Morus notabilis]
          Length = 952

 Score =  776 bits (2005), Expect = 0.0
 Identities = 420/727 (57%), Positives = 509/727 (70%), Gaps = 52/727 (7%)
 Frame = +2

Query: 170  KIRDEGHHKAI--DSENSKEIERDDTNGSTMEHKQKNEASNGQQS--------------- 298
            K RD    K++  D E  K+  RDD      ++K+  EA  G  S               
Sbjct: 220  KSRDRVSKKSVEEDYELGKDGGRDDKTKLDDDNKKDREAKQGNVSQYIDGEQITHDISHK 279

Query: 299  ---SASELGVRISKMREERLKQKNDGVSEILSWVNKSRKIEKKTMSEKEKALHLSKAFEE 469
               + +EL  RI KM++ER K+K + V E+L+WVNKSRK+E+K   EKEKAL LSK FEE
Sbjct: 280  AHLTTTELEKRILKMKQERSKKKTEDVPEVLAWVNKSRKLEEKKNDEKEKALQLSKIFEE 339

Query: 470  QDNIVQGENEEDESTQQTTKDLAGIKILHGLDKVIEGGAVVLTLKDQNILADGDINDEMD 649
            QDNIVQ ++E++E+T Q   +LAG+K+LHG+DKV+EGGAVVLTLKDQNILADGDIN E+D
Sbjct: 340  QDNIVQEDSEDEETTTQHY-NLAGVKVLHGIDKVMEGGAVVLTLKDQNILADGDINLEID 398

Query: 650  MLENVEIGQQKQRDAAYKAAKK-TGIYEDKFNDEAGAQKKILAQYDDPVEEEAVTLDESG 826
            MLENVEIG+QK+RD AYKAAKK  GIY DKFND+  +++K+L QYDDP  +  VT+DE G
Sbjct: 399  MLENVEIGEQKRRDEAYKAAKKKVGIYVDKFNDDPNSERKMLPQYDDPSTDVGVTIDERG 458

Query: 827  RFTGVAEKKLEELRKRLQGTSTSTKFEDLDSSRKIATDYYTQEEMXXXXXXXXXXXXXXX 1006
            R T  AEKKLEELR+RLQG ST+++FEDL    K+++DYYT EEM               
Sbjct: 459  RITSEAEKKLEELRRRLQGASTNSRFEDLSFPGKVSSDYYTSEEMMQFKKPKKKKSLRKK 518

Query: 1007 XXXXXDALEAEAISTGLGVGDLGSRNDGRRQAAKEERDRSEAQTXXXXXXXXXXXXXXXX 1186
                 DALEAEA+S GLGVGDLGSRND +RQ  +EE+DR+EA+                 
Sbjct: 519  DKLDIDALEAEAVSAGLGVGDLGSRNDPKRQVIREEQDRAEAERRNNAYKTAFAKADEAS 578

Query: 1187 XXLRQELSSTFKVEEDENPVFGDDDDLLYKSLEKARKLALTKQEEEAATGPQAIASLAVS 1366
              LR E +   K+EE+EN VF DDD+  +K++E+ARK+A+ K+++E  +GP+A+A LA +
Sbjct: 579  KSLRLEQTLPVKLEEEENLVFADDDEDFHKAVERARKIAVKKEDKETPSGPEAVALLAAT 638

Query: 1367 TRNQSTEIQPPSSGDNQENRVVFTEMEEFVWGLQLNEEVHKPESEDVFMDEDEVVKPSDE 1546
              N     +   SG++QEN+VVFTEMEEFVWGLQL EE  KP++EDVFMDEDE  K  +E
Sbjct: 639  IANSQPADEQNPSGESQENKVVFTEMEEFVWGLQLEEEAQKPDNEDVFMDEDEEPKAYNE 698

Query: 1547 EKKDDSGGWTEVMDTSKDEDPINVVKEEVVPDETIHEVPVGKGLSGALHLLKERGTLKES 1726
            E K++ GGWTEV +T+ DE P    +EE+VPD  IHEV VGKGLSGAL LLKERGTLKES
Sbjct: 699  EIKNEPGGWTEVKETNNDEHPSKEEEEEIVPDGIIHEVAVGKGLSGALKLLKERGTLKES 758

Query: 1727 IEWGGRNMDKKKSKLVGIYESDGP------------------------------KEIHIE 1816
            I+WGGRNMDKKKSKLVGI + D P                              K+I IE
Sbjct: 759  IDWGGRNMDKKKSKLVGIVDDDEPGQQVHPKKDGTRTSSSSYSKETRASKVYEEKDIRIE 818

Query: 1817 RTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVER 1996
            RTDEFGRI+TPKEAFR+ISHKFHGKGPGKMKQEKRMKQYQEELKLKQMK+SDTPS SVER
Sbjct: 819  RTDEFGRILTPKEAFRIISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKSSDTPSQSVER 878

Query: 1997 MREAQARLQIPYLVLSGHVKSGDNGDRRSGFAPGE-GLPGGLTPMLGDKKVEHFLGIKRK 2173
            MREAQA+L+ PYLVLSGHVK G   D RSGFA  E   PGGLTPMLGD+KVEHFLGIKRK
Sbjct: 879  MREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDPPGGLTPMLGDRKVEHFLGIKRK 938

Query: 2174 AEPGSMG 2194
             EP + G
Sbjct: 939  PEPANSG 945


>ref|XP_004141556.1| PREDICTED: uncharacterized protein LOC101207335 [Cucumis sativus]
            gi|449522278|ref|XP_004168154.1| PREDICTED:
            uncharacterized LOC101207335 [Cucumis sativus]
          Length = 939

 Score =  775 bits (2001), Expect = 0.0
 Identities = 420/704 (59%), Positives = 514/704 (73%), Gaps = 28/704 (3%)
 Frame = +2

Query: 167  GKIRDEGHHKAIDSENSKEIERDDTNGSTMEHKQKNEASNG----QQSSASELGVRISKM 334
            G+I DEG    ++S+     +RD   G+ ++H    E  +G      +S++ L  RI  M
Sbjct: 234  GRIGDEGKDYMLESDGENNRDRDVNQGNMVQHLGVEENFDGLKVGSHASSTMLEERIRNM 293

Query: 335  REERLKQKNDGVSEILSWVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENEEDEST 514
            +E+RLK++ +  SE+LSWV +SRK+E+K +SEKEKAL LSK FEEQDNI QG +++D + 
Sbjct: 294  KEDRLKKQTEE-SEVLSWVKRSRKLEEKKLSEKEKALQLSKIFEEQDNIDQGVSDDDIAP 352

Query: 515  QQTTK--DLAGIKILHGLDKVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQKQR 688
            + TT   DLAG+K+LHG+DKV+EGGAVVLTLKDQ+ILADG++N+E+D+LENVEIG+QKQR
Sbjct: 353  EDTTNNHDLAGVKVLHGVDKVLEGGAVVLTLKDQSILADGNVNEELDVLENVEIGEQKQR 412

Query: 689  DAAYKAAKK-TGIYEDKFNDEAGAQKKILAQYDDPVE-EEAVTLDESGRFTGVAEKKLEE 862
            D AYKAAKK TGIY+DKFNDE   +KK+L QYDDP + +E +TLD  G F   AEKKLEE
Sbjct: 413  DIAYKAAKKKTGIYDDKFNDENYGEKKMLPQYDDPADADEGLTLDGRGGFNNDAEKKLEE 472

Query: 863  LRKRLQGTSTSTKFEDLDSSRKIATDYYTQEEMXXXXXXXXXXXXXXXXXXXXDALEAEA 1042
            LR+RLQG S+   FEDL+ S K++ DYYTQ+EM                    DALEAEA
Sbjct: 473  LRRRLQGASSVKHFEDLNVSTKVSHDYYTQDEMLKFKKPRKKKSLRKKEKLDIDALEAEA 532

Query: 1043 ISTGLGVGDLGSRNDGRRQAAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSSTFK 1222
            IS GLGVGDLGSRND RRQA KEE+++SEA+                   L+   +S+ +
Sbjct: 533  ISAGLGVGDLGSRNDSRRQAKKEEQEKSEAEMRLNAYQSAYAKADEASRSLQLVQNSSAR 592

Query: 1223 VEEDENPVFGDDDDLLYKSLEKARKLALTKQEEEAATGPQAIASLAVSTRN-QSTEIQPP 1399
            +E++++ +  DDD+  YKSLE+ARKLAL KQ+  AA+GP A+A LA +T + Q+T+ Q  
Sbjct: 593  LEDNDDALIADDDEDFYKSLERARKLALKKQD--AASGPGAVALLATATTSSQATDDQST 650

Query: 1400 SSGDNQENRVVFTEMEEFVWGLQLNEEVHKPESEDVFMDEDEVVKPS-DEEKKDDSGGWT 1576
             +G+ QEN+VVFTEMEEFVWGLQL+E+ HKPE +DVFMD+DE+ K    E+ KD  GGWT
Sbjct: 651  KAGELQENKVVFTEMEEFVWGLQLDEDAHKPEEDDVFMDDDEIPKEEYHEDVKDKDGGWT 710

Query: 1577 EVMDTSKDEDPINVVKEEVVPDETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMDK 1756
            EV DT+ +E       E V PDETIHEVPVGKGLS AL LLK+RGTLKESIEWGGRNMDK
Sbjct: 711  EVKDTAMEESTPEE-NEAVAPDETIHEVPVGKGLSSALKLLKDRGTLKESIEWGGRNMDK 769

Query: 1757 KKSKLVGIYESDGP-----------------KEIHIERTDEFGRIMTPKEAFRMISHKFH 1885
            +KSKLVGI + D P                 KEIHIERTDEFGRIMTPKE+FR +SHKFH
Sbjct: 770  RKSKLVGIVDEDEPKESKSKDSRLSSLVDYKKEIHIERTDEFGRIMTPKESFRQLSHKFH 829

Query: 1886 GKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERMREAQARLQIPYLVLSGHVKSGD 2065
            GKGPGKMKQEKRMKQYQEELKLKQMKN+DTPSLSVERMREAQA+L+ PYLVLSGHVK G 
Sbjct: 830  GKGPGKMKQEKRMKQYQEELKLKQMKNADTPSLSVERMREAQAQLKTPYLVLSGHVKPGQ 889

Query: 2066 NGDRRSGFAPGE-GLPGGLTPMLGDKKVEHFLGIKRKAEPGSMG 2194
              D RSGFA  E  LPGGLTPMLGD+KVEHFLGIKRK E  + G
Sbjct: 890  TSDPRSGFATVEKDLPGGLTPMLGDRKVEHFLGIKRKGEASNTG 933


>ref|XP_006361674.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1-like [Solanum
            tuberosum]
          Length = 880

 Score =  773 bits (1996), Expect = 0.0
 Identities = 416/685 (60%), Positives = 501/685 (73%), Gaps = 15/685 (2%)
 Frame = +2

Query: 176  RDEGHHKAIDSENSKEIERDDTNGSTME----HKQKNEASN------GQQSSA--SELGV 319
            RDE H ++ D +  K+ + D  + +  E    H+ +  + N      G QS+A  SEL  
Sbjct: 187  RDESHDRSKDKDRRKDEDSDYRDSAKQEIVVSHEDEERSHNNAVETGGSQSAAAASELEE 246

Query: 320  RISKMREERLKQKNDGVSEILSWVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENE 499
            RI KM+EERLK+K++G SE+L+WV+KSRKIE+   +EKEKAL LSK FEEQD +   E++
Sbjct: 247  RILKMKEERLKKKSEGASEVLTWVSKSRKIEEIRNAEKEKALQLSKIFEEQDKMNGEESD 306

Query: 500  EDESTQQTTKDLAGIKILHGLDKVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQ 679
            E+E+ +   K+L G+K+LHGLDKV+EGGAVVLTLKDQ+ILA  D+N E+D+LENVEIG+Q
Sbjct: 307  EEENARLAAKELGGMKVLHGLDKVVEGGAVVLTLKDQSILAGDDVNQEVDVLENVEIGEQ 366

Query: 680  KQRDAAYKAAK-KTGIYEDKFNDEAGAQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKL 856
            K+RD AYKAAK KTGIY+DKFNDE G ++KIL +YDDP EEE V LD +G F   AEKKL
Sbjct: 367  KRRDDAYKAAKNKTGIYDDKFNDEPGFERKILPKYDDPAEEEGVILDATGGFNIDAEKKL 426

Query: 857  EELRKRLQGTSTSTKFEDLDSSRKIATDYYTQEEMXXXXXXXXXXXXXXXXXXXXDALEA 1036
            EELR+R+QG S+  + EDL+SS K+ +DYYTQEEM                    DALEA
Sbjct: 427  EELRRRIQGPSSINRSEDLNSSGKLLSDYYTQEEMVQFKKPKKKKSLRKKEKMDLDALEA 486

Query: 1037 EAISTGLGVGDLGSRNDGRRQAAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSST 1216
            EA S GLGV DLGSRND  RQ  KEE++R++ +                   LR E +  
Sbjct: 487  EAKSAGLGVSDLGSRNDKTRQVLKEEKERADTEMRSNAYQAAYAKAEEASKALRPEKTKN 546

Query: 1217 FKVEEDENPVFGDDDDLLYKSLEKARKLALTKQEEEAATGPQAIASLAVSTRNQSTEIQP 1396
             + EED+  VF DDD+ L KSLE+ARKLAL KQE  A T P++IASLA S  N ST    
Sbjct: 547  NQREEDD-AVFDDDDEELRKSLERARKLALRKQEGLAKTFPESIASLAASRANDSTVDNT 605

Query: 1397 PS-SGDNQENRVVFTEMEEFVWGLQLNEEVHKPESEDVFMDEDEVVKPSDEEKKDDSGGW 1573
             S SG+ QEN+VVFTEMEEFVWGLQL+EE  KP S+DVFM+ED + KPSDEE K++ GGW
Sbjct: 606  SSASGEAQENKVVFTEMEEFVWGLQLDEEEQKPGSDDVFMEEDVLPKPSDEEMKNEDGGW 665

Query: 1574 TEVMDTSKDEDPINVVKEEVVPDETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMD 1753
            TEV +  ++E  +   + EV PD TI EVPVGKGLSG L LL+ERGTLKE IEWGGRNMD
Sbjct: 666  TEVKEIKEEEPSVKEEEMEVTPDNTIREVPVGKGLSGVLKLLQERGTLKEDIEWGGRNMD 725

Query: 1754 KKKSKLVGIYESDGPKEIHIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQY 1933
            KKKSKLVGI   DG KEIHIERTDE+GRI+TPKEAFR+ISHKFHGKGPGKMKQEKRM+QY
Sbjct: 726  KKKSKLVGIRSEDGKKEIHIERTDEYGRILTPKEAFRLISHKFHGKGPGKMKQEKRMRQY 785

Query: 1934 QEELKLKQMKNSDTPSLSVERMREAQARLQIPYLVLSGHVKSGDNGDRRSGFAPGE-GLP 2110
            QEELK+KQM+NSDTPS SVERMRE  A+ ++PY+VLSG+VK G   D RSGFA  E  LP
Sbjct: 786  QEELKIKQMRNSDTPSQSVERMRETHAQTRVPYIVLSGNVKPGQTSDPRSGFATVEKDLP 845

Query: 2111 GGLTPMLGDKKVEHFLGIKRKAEPG 2185
            GGLTPMLGDKKVEHFLGIKRK EPG
Sbjct: 846  GGLTPMLGDKKVEHFLGIKRKFEPG 870


>ref|XP_007133507.1| hypothetical protein PHAVU_011G184800g [Phaseolus vulgaris]
            gi|561006507|gb|ESW05501.1| hypothetical protein
            PHAVU_011G184800g [Phaseolus vulgaris]
          Length = 626

 Score =  771 bits (1992), Expect = 0.0
 Identities = 410/623 (65%), Positives = 479/623 (76%), Gaps = 7/623 (1%)
 Frame = +2

Query: 332  MREERLKQKNDGVSEILSWVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENEEDES 511
            M+E R K++++  SEI +WV KSRKIEKK      KAL LSK FEEQDNI   E  +DE 
Sbjct: 1    MKESRTKKQSEADSEISAWVTKSRKIEKK------KALQLSKIFEEQDNIAV-EGSDDED 53

Query: 512  TQQTTKDLAGIKILHGLDKVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQKQRD 691
            T Q T++LAG+K+LHGLDKV+EGG VVLT+KDQ ILADGD+N+++DMLEN+EIG+QKQRD
Sbjct: 54   TAQHTENLAGLKVLHGLDKVMEGGTVVLTIKDQPILADGDVNEDVDMLENIEIGEQKQRD 113

Query: 692  AAYKAAKK-TGIYEDKFNDEAGAQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKLEELR 868
             AYKAAKK TG+Y+DKFND+  ++KK+L QYDDPV EE VTLDE GRF+G AEKKLEELR
Sbjct: 114  EAYKAAKKKTGVYDDKFNDDPFSEKKMLPQYDDPVAEEGVTLDEKGRFSGEAEKKLEELR 173

Query: 869  KRLQGTSTSTKFEDLDSSRKIATDYYTQEEMXXXXXXXXXXXXXXXXXXXXDALEAEAIS 1048
            +RL G ST+T FEDL S  K+++DYYT EEM                     ALEAEA+S
Sbjct: 174  RRLSGVSTNT-FEDLTSYGKVSSDYYTHEEMLKFKKPKKKKSLRKKDKLDIKALEAEAVS 232

Query: 1049 TGLGVGDLGSRNDGRRQAAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSSTFKVE 1228
            +GLGVGDLGSR+  RRQA KEE++R +A+                   LR++  +  K E
Sbjct: 233  SGLGVGDLGSRSSVRRQAIKEEQERLDAKMRSNAYQSAYAKADEASKLLREQTLNV-KTE 291

Query: 1229 EDENPVFGDDDDLLYKSLEKARKLALTKQEEEAATGPQAIASLAVSTRNQSTEIQPPSSG 1408
            +DE P F DDD+ L KSLEKAR+LAL K EE  A+GPQAIA LA S  +  T+ Q P++G
Sbjct: 292  DDETPAFVDDDEDLRKSLEKARRLALKKHEEGGASGPQAIALLATSNHDNETDSQNPTAG 351

Query: 1409 DNQENRVVFTEMEEFVWGLQLNEEVHKPESEDVFMDEDEVVKPSDEEKKDDSGGWTEVMD 1588
            +++EN+VVFTEMEEFVWGL ++EE  KPESEDVFM +DE V   DEEK + +GGWTEV +
Sbjct: 352  ESRENKVVFTEMEEFVWGLHIDEEARKPESEDVFMHDDEEVIVPDEEKTNVAGGWTEVQE 411

Query: 1589 TSKDEDPINVVKEEVVPDETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMDKKKSK 1768
            T++DE P    KEE+VPDETIHEV VGKGLSGAL LLKERGTLKESIEWGGRNMDKKKSK
Sbjct: 412  TNEDEQPNTEDKEEIVPDETIHEVAVGKGLSGALKLLKERGTLKESIEWGGRNMDKKKSK 471

Query: 1769 LVGIYESD-----GPKEIHIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQY 1933
            LVGI + D       +EI IERTDEFGRI+TPKEAFRMISHKFHGKGPGKMKQEKRMKQY
Sbjct: 472  LVGIVDDDEKETQKKREIRIERTDEFGRILTPKEAFRMISHKFHGKGPGKMKQEKRMKQY 531

Query: 1934 QEELKLKQMKNSDTPSLSVERMREAQARLQIPYLVLSGHVKSGDNGDRRSGFAPGE-GLP 2110
            QEELK+KQMK+SDTPSLSVERMREAQARLQ PYLVLSGHVK G   D +SGFA  E  LP
Sbjct: 532  QEELKMKQMKSSDTPSLSVERMREAQARLQTPYLVLSGHVKPGQTSDPKSGFATVEKDLP 591

Query: 2111 GGLTPMLGDKKVEHFLGIKRKAE 2179
            GGLTPMLGD+KVEHFLGIKRKAE
Sbjct: 592  GGLTPMLGDRKVEHFLGIKRKAE 614


>ref|XP_006836392.1| hypothetical protein AMTR_s00092p00135160 [Amborella trichopoda]
            gi|548838910|gb|ERM99245.1| hypothetical protein
            AMTR_s00092p00135160 [Amborella trichopoda]
          Length = 1028

 Score =  766 bits (1978), Expect = 0.0
 Identities = 408/670 (60%), Positives = 502/670 (74%), Gaps = 6/670 (0%)
 Frame = +2

Query: 203  DSENSKEIERDDTNGST--MEHKQKNEASNG-QQSSASELGVRISKMREERLKQKNDGVS 373
            + E++ + ++D+T   T  M+HK+KNE   G  + S SE+  R++KMREER+K+KN+GVS
Sbjct: 357  EQEDNVQDDKDNTYDRTGAMDHKEKNEIQAGVSRPSTSEIEERLAKMREERMKKKNEGVS 416

Query: 374  EILSWVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENEEDESTQQTTKDLAGIKIL 553
            E+ SWVNKSRKIE+K  SEKEKALHL+K F EQD++VQ E++E+E  Q + KDLAG+K+L
Sbjct: 417  EVSSWVNKSRKIEEKLSSEKEKALHLAKVFAEQDSVVQ-ESDEEEEAQHSGKDLAGVKVL 475

Query: 554  HGLDKVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQKQRDAAYKAAKKT-GIYE 730
            HGL++VI GGAVVLTLKDQNILADGD+N+E+DMLENVE+G+QK+RD AYKAAKK  GIYE
Sbjct: 476  HGLEQVIVGGAVVLTLKDQNILADGDLNNEVDMLENVELGEQKRRDEAYKAAKKKPGIYE 535

Query: 731  DKFNDEAGAQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKLEELRKRLQGTSTSTKFED 910
            DKF D+ G+QKKIL QYDD  ++E V LDESG  T  A+KKLEELRKRLQG ST   FED
Sbjct: 536  DKFADDDGSQKKILPQYDDTSKDEGVALDESGHITREAQKKLEELRKRLQGASTGQHFED 595

Query: 911  LDSSRKIATDYYTQEEMXXXXXXXXXXXXXXXXXXXXDALEAEAISTGLGVGDLGSRNDG 1090
            L ++ K+++DYYTQEEM                    DALEAEAI++GLGVGD GSR D 
Sbjct: 596  LTATGKVSSDYYTQEEMLQFKKPKKKKALRKKVKLDLDALEAEAIASGLGVGDRGSRADA 655

Query: 1091 RRQAAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSSTFKVEEDENPVFGDDDDLL 1270
            +RQ AKEE + +EA+T                  LR+E +   + +EDEN  FGDD+DL 
Sbjct: 656  QRQRAKEEEEWAEAETRKEAYQSAFAKANESTKALREEQTLKVEGDEDENLAFGDDEDL- 714

Query: 1271 YKSLEKARKLALTKQEEEAATGPQAIASLAVSTRNQSTEIQPPSSGDNQENRVVFTEMEE 1450
            +KS+E+ARKLA  KQ+E AA+GP A+A LAVS    S      +SG+ QENR+VFTE++E
Sbjct: 715  HKSIEEARKLARKKQDEGAASGPLAVAQLAVSA---SESKDAEASGEPQENRLVFTEVDE 771

Query: 1451 FVWGLQLNEEVHKPESEDVFMDEDEVVKP-SDEEKKDDSGGWTEVMDTSKDEDPINVVKE 1627
            FV GLQ +E    P++EDVF ++DEV  P   +E  +  GGWT+V+++ KDE       E
Sbjct: 772  FVLGLQHDEGAQNPDAEDVFKEDDEVQNPIKQDEPMEQVGGWTDVIESEKDEQMKTEEDE 831

Query: 1628 EVVPDETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMDKKKSKLVGIYESDGPKEI 1807
            EVVPD TI E  VGKGLSGAL LLKERGTLKE+I+WGGRNMDKKKSKLVG+ E+DG KEI
Sbjct: 832  EVVPDATIQEAVVGKGLSGALQLLKERGTLKEAIDWGGRNMDKKKSKLVGVRENDGAKEI 891

Query: 1808 HIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLS 1987
             ++R DEFGRIMTPKEAFR +SHKFHGKGPGKMKQEKRMKQ+ EELKLKQMK SDTP LS
Sbjct: 892  VLDRLDEFGRIMTPKEAFRKLSHKFHGKGPGKMKQEKRMKQFMEELKLKQMKASDTPLLS 951

Query: 1988 VERMREAQARLQIPYLVLSGHVKSGDNGDRRSGFAPGE-GLPGGLTPMLGDKKVEHFLGI 2164
            +E+MREAQA+ + PY+VLSG +K G   D RSGFA  E   PG LTPMLGD+KVEHFLGI
Sbjct: 952  MEKMREAQAKTRSPYIVLSGQIKPGQTSDPRSGFATVEKDQPGSLTPMLGDRKVEHFLGI 1011

Query: 2165 KRKAEPGSMG 2194
            KRKAEP +MG
Sbjct: 1012 KRKAEPSNMG 1021


>gb|EYU25740.1| hypothetical protein MIMGU_mgv1a000914mg [Mimulus guttatus]
          Length = 944

 Score =  765 bits (1976), Expect = 0.0
 Identities = 402/682 (58%), Positives = 495/682 (72%), Gaps = 6/682 (0%)
 Frame = +2

Query: 167  GKIRDEGHHKAIDSENSKEIERDDTNGSTMEHKQKNEAS---NGQQSSASELGVRISKMR 337
            G +R E  +   +  N   ++  D    +   KQ++ A    +G   SAS+LG RISKMR
Sbjct: 256  GHLRLENDYSRDNQSNKVRVDNSDGENDSKILKQQDRAEKSVDGNSQSASDLGERISKMR 315

Query: 338  EERLKQKNDGVSEILSWVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENEEDESTQ 517
            +ERL + ++G SE+L+WVN+SRK+E K  +EKEKAL LSK FEEQDN+  G+++++ +TQ
Sbjct: 316  QERLVKSSEGASEVLAWVNRSRKLEDKR-TEKEKALQLSKVFEEQDNMNDGDSDDEAATQ 374

Query: 518  QTTKDLAGIKILHGLDKVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQKQRDAA 697
              T+ L G+K+LHGL+KV+EGGA+VLTLKDQ+ILADGD+N E+DMLENVEIG+QK+R+ A
Sbjct: 375  AVTESLGGVKVLHGLEKVLEGGAIVLTLKDQSILADGDVNQEVDMLENVEIGEQKRRNEA 434

Query: 698  YKAAKK-TGIYEDKFNDEAGAQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKLEELRKR 874
            Y AAKK TG+Y DKF+DE G +KK+L QYDDPV +E +TLD +GRFTG AE+KLEELRKR
Sbjct: 435  YGAAKKKTGVYVDKFSDEPGTEKKMLPQYDDPVADEGLTLDSTGRFTGEAERKLEELRKR 494

Query: 875  LQGTSTSTKFEDLDSSRKIATDYYTQEEMXXXXXXXXXXXXXXXXXXXXDALEAEAISTG 1054
            +QG   ST  EDL+S+ KI+TDYYTQEEM                    DALEAEA++ G
Sbjct: 495  IQGVPASTYGEDLNSTLKISTDYYTQEEMTKFKKPKKKKSLRKREKLDIDALEAEAVTAG 554

Query: 1055 LGVGDLGSRNDGRRQAAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSSTFKVEED 1234
            LG GDLGSRNDGR+Q  K+E++R +A+                   LR    +  + E+D
Sbjct: 555  LGAGDLGSRNDGRKQNLKKEQERVDAEMRSNAFQSAYAKAEEASKALRPGKVNIMRTEDD 614

Query: 1235 ENPVFGDDDDLLYKSLEKARKLALTKQEEEAATGPQAIASLAVSTRNQSTEIQPP-SSGD 1411
            +  VFGDDDD L KSLE+ARK+A  KQ+E+   GPQ I  LA ST N ST   P  SS D
Sbjct: 615  DT-VFGDDDDELRKSLERARKIAFKKQDEKEKPGPQMITLLASSTANDSTAENPNLSSVD 673

Query: 1412 NQENRVVFTEMEEFVWGLQLNEEVHKPESEDVFMDEDEVVKPSDEEKKDDSGGWTEVMDT 1591
              EN+VVFTEMEEFVWGLQL+EE   PE+E V M+ED     SD E  +  GGW+EV + 
Sbjct: 674  QSENKVVFTEMEEFVWGLQLDEEEKNPENEGVCMEEDLAPSTSDHEMTEVDGGWSEVKEA 733

Query: 1592 SKDEDPINVVKEEVVPDETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMDKKKSKL 1771
             ++  P+   +EEVVPDETIHE  VGKGL+ AL LLK+RG+LKE+ EWGGRNMDKKKSKL
Sbjct: 734  VEEVAPLKEEEEEVVPDETIHETSVGKGLANALKLLKDRGSLKETTEWGGRNMDKKKSKL 793

Query: 1772 VGIYESDGPKEIHIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKL 1951
            VGI ++DG KEI IERTDEFGRI+TPKE+FR++SHKFHGKGPGKMKQEKRM+QYQEELK+
Sbjct: 794  VGINDNDGGKEIRIERTDEFGRILTPKESFRLLSHKFHGKGPGKMKQEKRMRQYQEELKV 853

Query: 1952 KQMKNSDTPSLSVERMREAQARLQIPYLVLSGHVKSGDNGDRRSGFAPGE-GLPGGLTPM 2128
            KQMKNSDTPS SV RM+EAQ +LQ PYLVLSG+VK G   D RSGFA  E  L GGLTPM
Sbjct: 854  KQMKNSDTPSSSVSRMKEAQEKLQTPYLVLSGNVKPGQTSDPRSGFATVEKSLTGGLTPM 913

Query: 2129 LGDKKVEHFLGIKRKAEPGSMG 2194
            LGDKKVEHFL IKR  +PG  G
Sbjct: 914  LGDKKVEHFLNIKRMPDPGESG 935


>ref|XP_007022027.1| U4/U6.U5 tri-snRNP-associated protein 1 isoform 3, partial [Theobroma
            cacao] gi|508721655|gb|EOY13552.1| U4/U6.U5
            tri-snRNP-associated protein 1 isoform 3, partial
            [Theobroma cacao]
          Length = 864

 Score =  762 bits (1967), Expect = 0.0
 Identities = 402/647 (62%), Positives = 488/647 (75%), Gaps = 11/647 (1%)
 Frame = +2

Query: 182  EGHHKAIDSENSKEIERDDTNGSTMEHKQKNEASNG--QQSSASELGVRISKMREERLKQ 355
            + H +  +     E+  D  +    +  + N  SN    Q+S+SEL  RI++M+EERLK+
Sbjct: 219  KNHEEDYEGSKDGELALDYGDSRDKDEAELNAGSNAGVAQASSSELEERIARMKEERLKK 278

Query: 356  KNDGVSEILSWVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENEEDESTQQTTKDL 535
            K++GVSE+L WV   RK+E+K  +EKEKAL  SK FEEQD+ VQGENE++E+ +    DL
Sbjct: 279  KSEGVSEVLEWVGNFRKLEEKRNAEKEKALQRSKIFEEQDDFVQGENEDEEAVRHAAHDL 338

Query: 536  AGIKILHGLDKVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQKQRDAAYKAA-K 712
            AG+K+LHGLDKV++GGAVVLTLKDQ+ILA+GDIN+++DMLENVEIG+Q++RD AYKAA K
Sbjct: 339  AGVKVLHGLDKVMDGGAVVLTLKDQSILANGDINEDVDMLENVEIGEQRRRDEAYKAAKK 398

Query: 713  KTGIYEDKFNDEAGAQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKLEELRKRLQGTST 892
            KTG+Y+DKFNDE G++KKIL QYD+PV +E VTLDE GRFTG AEKKL+ELRKRLQG  T
Sbjct: 399  KTGVYDDKFNDEPGSEKKILPQYDNPVADEGVTLDERGRFTGEAEKKLQELRKRLQGVPT 458

Query: 893  STKFEDLDSSRKIATDYYTQEEMXXXXXXXXXXXXXXXXXXXXDALEAEAISTGLGVGDL 1072
            + + EDL+++ KIA+DYYTQEEM                    DALEAEAIS+GLG GDL
Sbjct: 459  NNRVEDLNNAGKIASDYYTQEEMLKFKKPKKKKALRKKEKLDIDALEAEAISSGLGAGDL 518

Query: 1073 GSRNDGRRQAAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSSTFKVEEDENPVFG 1252
            GSRND RRQA +EE  RSEA+                   L  E +   K EEDEN VF 
Sbjct: 519  GSRNDARRQAIREEEARSEAEKRNSAYQSAYAKADEASKSLWLEQTLIVKPEEDENQVFA 578

Query: 1253 DDDDLLYKSLEKARKLALTKQEEEAATGPQAIASLAVSTR-NQSTEIQPPSSGDNQENRV 1429
            DDDD LYKS+E++RKLA  KQE+E  +GPQAIA  A +   +Q+ + Q  ++G+ QEN++
Sbjct: 579  DDDDDLYKSIERSRKLAFKKQEDE-KSGPQAIALRATTAAISQTADDQTTTTGEAQENKL 637

Query: 1430 VFTEMEEFVWGLQLNEEVHKPESEDVFMDEDEVVKPSDEEKK---DDSGGWTEVMDTSKD 1600
            V TEMEEFVWGLQ +EE HKP+SEDVFMDEDEV   S+ + K   ++ GGWTEV+D S D
Sbjct: 638  VITEMEEFVWGLQHDEEAHKPDSEDVFMDEDEVPGVSEHDGKSGENEVGGWTEVVDASTD 697

Query: 1601 EDPINVVKEEVVPDETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMDKKKSKLVGI 1780
            E+P N  K+++VPDETIHEV VGKGLSGAL LLK+RGTLKESIEWGGRNMDKKKSKLVGI
Sbjct: 698  ENPSNEDKDDIVPDETIHEVAVGKGLSGALKLLKDRGTLKESIEWGGRNMDKKKSKLVGI 757

Query: 1781 Y----ESDGPKEIHIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELK 1948
                 E+D  K+I IERTDEFGRI+TPKEAFR++SHKFHGKGPGKMKQEKR KQYQEELK
Sbjct: 758  VDDDRENDRFKDIRIERTDEFGRIITPKEAFRVLSHKFHGKGPGKMKQEKRQKQYQEELK 817

Query: 1949 LKQMKNSDTPSLSVERMREAQARLQIPYLVLSGHVKSGDNGDRRSGF 2089
            LKQMKNSDTPSLSVERMREAQA+L+ PYLVLSGHVK G   D RSGF
Sbjct: 818  LKQMKNSDTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGF 864


>ref|XP_006471158.1| PREDICTED: U4/U6.U5 tri-snRNP-associated protein 1-like [Citrus
            sinensis]
          Length = 878

 Score =  757 bits (1955), Expect = 0.0
 Identities = 406/668 (60%), Positives = 492/668 (73%), Gaps = 11/668 (1%)
 Frame = +2

Query: 209  ENSKEIERDDTNGSTMEHKQKNEASNGQQS-SASELGVRISKMREERLKQKNDGVSEILS 385
            +N   + RD      + +   ++  N     S S LG RI KM+EERLK+ ++G  EILS
Sbjct: 205  DNEGNMNRDINKHGKVSYDDIDDQDNEDAHVSTSGLGDRILKMKEERLKKNSEGAPEILS 264

Query: 386  WVNKSRKIEKKTMSEKEKALHLSKAFEEQDNIVQGENEEDESTQQTTKDLAGIKILHGLD 565
            WVN+SRKIE+    EK+KAL LSK FEEQDNIVQGE+E++E+ Q  + DLAG+K+LHGLD
Sbjct: 265  WVNRSRKIEQIKNVEKKKALQLSKIFEEQDNIVQGESEDEEAGQHNSHDLAGVKVLHGLD 324

Query: 566  KVIEGGAVVLTLKDQNILADGDINDEMDMLENVEIGQQKQRDAAYKAA-KKTGIYEDKFN 742
            KV+EGGAVVLTLKDQ ILADGDIN+++DMLEN+EIG+QK+RD AYKAA KKTGIY+DKFN
Sbjct: 325  KVMEGGAVVLTLKDQQILADGDINEDVDMLENIEIGEQKRRDEAYKAAKKKTGIYDDKFN 384

Query: 743  DEAGAQKKILAQYDDPVEEEAVTLDESGRFTGVAEKKLEELRKRLQGTSTSTKFEDLDSS 922
            D+  ++KKIL QYD+P  +E +TLD  GRFTG AEKKLEELR+R+QG   +   EDL+ S
Sbjct: 385  DDPSSEKKILPQYDEPATDEGLTLDARGRFTGEAEKKLEELRRRIQGVQANNSTEDLNLS 444

Query: 923  RKIATDYYTQEEM-XXXXXXXXXXXXXXXXXXXXDALEAEAISTGLGVGDLGSRNDGRRQ 1099
              I +DY+TQEEM                     DALEAEA+S GLGV DLGSR DGRRQ
Sbjct: 445  ANITSDYFTQEEMLQFKKPKKKKKSIRKKEKLDLDALEAEALSAGLGVEDLGSRKDGRRQ 504

Query: 1100 AAKEERDRSEAQTXXXXXXXXXXXXXXXXXXLRQELSSTFKVEEDENPVFGDDDDLLYKS 1279
            A +EE+++SEA+                   LR E +   K+EE+      DD+D LYKS
Sbjct: 505  AIREEQEKSEAEMKNKAYQSAYAKAEEAVKSLRMEQTRPVKLEEENEEPIADDEDDLYKS 564

Query: 1280 LEKARKLALTKQEEEAATGPQAIASLAVSTRNQSTEIQPPSSGDNQENRVVFTEMEEFVW 1459
            LE+ARKLAL KQ  EA++GP+AIA LA S   Q+   Q  ++ +++E +VV TE++EFVW
Sbjct: 565  LERARKLALKKQ--EASSGPEAIARLATS---QTANEQSTTNEESEEKKVVITELQEFVW 619

Query: 1460 GLQLNEEVHKPESEDVFMDEDEVVKPSDEEKKDDSGGWTEVMDTSKDEDPINVVKEEVVP 1639
            GL + EEV K + +DVFMDEDE  + SD E KD+ GGWTEV +  ++E+P    KEE+VP
Sbjct: 620  GLPVGEEVQKQDRQDVFMDEDEGPRTSDLEMKDEPGGWTEVKEIGEEENPSKEDKEEIVP 679

Query: 1640 DETIHEVPVGKGLSGALHLLKERGTLKESIEWGGRNMDKKKSKLVGIYESDGP------K 1801
            DETIHE+ VGKGL+GAL LLK+RGTLKE I+WGGRNMDKKKSKL+G+ + D P      K
Sbjct: 680  DETIHELAVGKGLAGALSLLKDRGTLKEGIDWGGRNMDKKKSKLIGVVD-DNPNVDNRFK 738

Query: 1802 EIHIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPS 1981
            +I IERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTP+
Sbjct: 739  DIRIERTDEFGRIMTPKEAFRMISHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPT 798

Query: 1982 LSVERMREAQARLQIPYLVLSGHVKSGDNGDRRSGFAPGE-GLP-GGLTPMLGDKKVEHF 2155
             SVERMREAQARL+ PYLVLSGHVK G   D RSGFA  E  LP GGLTPMLG++KVEHF
Sbjct: 799  ESVERMREAQARLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPAGGLTPMLGNRKVEHF 858

Query: 2156 LGIKRKAE 2179
            LGIKRK +
Sbjct: 859  LGIKRKGD 866

