BLASTX nr result

ID: Zingiber24_contig00035242 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber24_contig00035242
         (2092 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_001062990.1| Os09g0363100 [Oryza sativa Japonica Group] g...   516   e-143
ref|XP_003578028.1| PREDICTED: chloroplastic group IIA intron sp...   500   e-138
tpg|DAA61175.1| TPA: hypothetical protein ZEAMMB73_652631 [Zea m...   499   e-138
ref|XP_002460114.1| hypothetical protein SORBIDRAFT_02g022940 [S...   491   e-136
ref|XP_004956664.1| PREDICTED: chloroplastic group IIA intron sp...   481   e-133
ref|XP_006661163.1| PREDICTED: chloroplastic group IIA intron sp...   466   e-128
gb|EMS55813.1| Chloroplastic group IIA intron splicing facilitat...   452   e-124
ref|XP_002514120.1| conserved hypothetical protein [Ricinus comm...   436   e-119
gb|EOY30435.1| CRS1 / YhbY domain-containing protein, putative i...   421   e-115
gb|EOY30434.1| CRS1 / YhbY domain-containing protein, putative i...   421   e-115
gb|EOY30431.1| CRS1 / YhbY domain-containing protein, putative i...   421   e-115
ref|XP_004171699.1| PREDICTED: chloroplastic group IIA intron sp...   420   e-114
ref|XP_004144114.1| PREDICTED: chloroplastic group IIA intron sp...   420   e-114
ref|XP_002309217.2| hypothetical protein POPTR_0006s15340g [Popu...   419   e-114
gb|EMT30138.1| Chloroplastic group IIA intron splicing facilitat...   416   e-113
ref|XP_006840356.1| hypothetical protein AMTR_s00045p00114550 [A...   413   e-112
ref|XP_006475470.1| PREDICTED: chloroplastic group IIA intron sp...   412   e-112
ref|XP_006475466.1| PREDICTED: chloroplastic group IIA intron sp...   412   e-112
ref|XP_006451488.1| hypothetical protein CICLE_v10007477mg [Citr...   412   e-112
ref|XP_006357699.1| PREDICTED: chloroplastic group IIA intron sp...   411   e-112

>ref|NP_001062990.1| Os09g0363100 [Oryza sativa Japonica Group]
            gi|48716728|dbj|BAD23409.1| putative CRS1 [Oryza sativa
            Japonica Group] gi|50726191|dbj|BAD33710.1| putative CRS1
            [Oryza sativa Japonica Group]
            gi|113631223|dbj|BAF24904.1| Os09g0363100 [Oryza sativa
            Japonica Group] gi|125591023|gb|EAZ31373.1| hypothetical
            protein OsJ_15500 [Oryza sativa Japonica Group]
          Length = 947

 Score =  516 bits (1330), Expect = e-143
 Identities = 328/714 (45%), Positives = 414/714 (57%), Gaps = 26/714 (3%)
 Frame = -1

Query: 2065 SPEDEDPLEVEKAGKDVKNTRKERKKRKLRPSFNAQTLERWSVKVSSSRSTFPWQDQKIE 1886
            SPE    L+V  +GK     R  +K+R L+PSF  Q + RWS +  S R++FPWQ Q+ +
Sbjct: 53   SPEKAPALDVA-SGK-----RGGKKRRSLKPSFEKQAIRRWSARAPSQRASFPWQQQQQQ 106

Query: 1885 SSLNLVSSGTPFDQSLDGIGAPMLEFGVPDENFDEDEKCSDS-------YLSDDSFENSV 1727
                        DQ     G+  L+  V   +FD D    D         +  ++ E   
Sbjct: 107  QQPGGGEGEAAGDQESGWSGSSTLQSIVDYFDFDYDSSDGDGDGDGDGVVVGGEAAEAQE 166

Query: 1726 DAIPPMANHIMGIPLGMQSKLAPWAHGGKHGEDGTRSRYKSEALVNNFEFGEDKVKPKET 1547
            D   P  + +    LG +   APW HG +  E  T      E  ++     ED++   + 
Sbjct: 167  DGPRPEPSFL----LGSRPVSAPWMHGEE--EPMTNQLVSDEEGLDGDGASEDEMGLVDG 220

Query: 1546 NFVDDKGVRSNYGDPIIASLKEECPILSSDMKARSCPTVTVSFGEERFSSQHSDLDSKVA 1367
            +  +D+ + S          +EE    SSD             GE  FS    D  +  A
Sbjct: 221  DGDEDEDLGS----------EEETLSESSD-------------GE--FS---EDYAAPAA 252

Query: 1366 GKSSVKDDAL----SGGR----------DNCVERYKDPKASSSSNVGIGFCVSSENSDLK 1229
              SS+ D  L    SGG           ++ V   ++    SS N  I  C  +E+   K
Sbjct: 253  NSSSMMDSVLDHVSSGGGFYRGTRRSSVNSIVNTMRNSMEESSRNAAIE-CPETEDFVQK 311

Query: 1228 LVPNSRNNDLSLASSVPFPWEREIDSMEGEQLQRSNTELAERTIPEPELQRLRNVALRMK 1049
            L P            V  PWERE D       +RSNTELAERTIPE EL+RLR+VALRMK
Sbjct: 312  LGP------------VLLPWEREGD--VDRPRKRSNTELAERTIPEHELRRLRDVALRMK 357

Query: 1048 ERMTVGPAGVTEAVVKHIHDKWKEVEVTKLRFEGPACLNMKRTHEILERKTGGVVIWRSG 869
            ERM VGP GVT+ +V+ IH KW+  EV KLRFEGP  LNMKRTH+ILE +TGG+VIWRSG
Sbjct: 358  ERMRVGPGGVTQLIVESIHQKWRVEEVVKLRFEGPPSLNMKRTHDILEERTGGIVIWRSG 417

Query: 868  RSVVLYRGMAYQLPCIQTYLQLSDANSFHNPSMGNCSNLIAGNQAETFETD----SAMNQ 701
            RSVVLYRGM Y L C+Q+Y Q ++ N     S  +   +   ++ +    D    SA   
Sbjct: 418  RSVVLYRGMNYNLRCVQSYTQTTEVNFDKRVSSNSVEPIHVEHKFQKSGADGLNRSAYIV 477

Query: 700  NLSKGFSSTAYIDRLLDQLGPRFKDWSGRNPIPVDADLLPGLVPGYKPPFRLLPYKTRTS 521
            N S+  + T  ID  LDQLGPR+KDWSGR PIPVDADLLPG+VPGYK PFRLLPY  +++
Sbjct: 478  NSSEKPTETFDIDSFLDQLGPRYKDWSGRGPIPVDADLLPGVVPGYKTPFRLLPYMVKST 537

Query: 520  LRDREMTVLRRLARTMPPHFVLGRNRQHQGLATAIVKLWDKSSIAKIAIKRGIPNTSNEI 341
            LR++EMT LRRLAR   PHF LGRNR+HQGLATAIVKLW+KSSIAKIAIKRG+PNT N+ 
Sbjct: 538  LRNKEMTALRRLARQTAPHFALGRNREHQGLATAIVKLWEKSSIAKIAIKRGVPNTCNDR 597

Query: 340  IAEEIKKLTGGVLLSRNKEYIVFYRGNDFVTPSIREVLVEKEKLATIKQDDEEVARLR-X 164
            +AEEI+KLTGGVLLSRNKEYIVFYRGNDF+TP +R+VLVEK++ A   QD+EE+ARL+  
Sbjct: 598  MAEEIRKLTGGVLLSRNKEYIVFYRGNDFITPKVRQVLVEKQEQAITWQDEEELARLKAS 657

Query: 163  XXXXXXXXXXXXSLVAGTLAETVEAKHRWGKPLNPEEREMAKKNMVLTKHASLV 2
                          VAGTLAET EAK RWG  +N E R+  K +M+LTKH SL+
Sbjct: 658  ASISVKPKVFKNPPVAGTLAETREAKSRWGDSINAELRKKEKNHMILTKHTSLL 711


>ref|XP_003578028.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Brachypodium distachyon]
          Length = 962

 Score =  500 bits (1287), Expect = e-138
 Identities = 308/717 (42%), Positives = 402/717 (56%), Gaps = 49/717 (6%)
 Frame = -1

Query: 2005 RKERKKRKLRPSFNAQTLERWSVKVSSSRSTFPWQDQK-----------------IESSL 1877
            R+++K+R L+PSF  Q L RWS +  S R++FPWQ Q+                    +L
Sbjct: 67   RRKKKRRNLKPSFEEQALRRWSARAPSQRASFPWQRQQQPQPAHRENEDAHDEDPSSDTL 126

Query: 1876 NLVSSGTPFDQSLDG-------------IGAPMLEFGVPDENFDEDEKCSDSYLSDDSFE 1736
              +     +D S DG             +G   +  G   ++ DE+     SYL      
Sbjct: 127  RSIVEYFDYDSSDDGDVGLDVRGHEGAGMGKDGVAHGEAAQDRDEESHSQPSYL------ 180

Query: 1735 NSVDAIPPMANHIMGIPLGMQSKLAPWAHGGKHGEDGTRSRYKSEALVNNFEFGEDKVKP 1556
                             +G +   APW HG +            + LV+    G D+ + 
Sbjct: 181  -----------------IGSRPVSAPWMHGEEEPS--------VDQLVSG-PVGGDEEEV 214

Query: 1555 KETNFVDDK------GVRSNYGDPIIASLKEECPILSSDMKARSCPTVTVSFGEERFSSQ 1394
                 VDD+           Y D +         +     +  + PT   SF  +    Q
Sbjct: 215  DTNGMVDDELGLVDGNEECAYNDDVFEEEPMNGNLEGELFEDSATPTANSSFLMDFVVDQ 274

Query: 1393 HS---DLDSKVAGKSSVKDDALSGGRDNCVERYKDPKASSSSNVGIGFCVSSENSDLKLV 1223
             S    +D  +  +SSV          + V   ++    S  N  IG C   E+   KL 
Sbjct: 275  GSRGGGIDRSIR-RSSVS---------SIVSTLRNSMEESGPNATIG-CSHEEDFVQKL- 322

Query: 1222 PNSRNNDLSLASSVPFPWEREID-SMEG-EQLQRSNTELAERTIPEPELQRLRNVALRMK 1049
                        SV  PWERE D + +G  Q  RSNTELAE+TIPEPEL+RLR+ ALRMK
Sbjct: 323  -----------GSVLLPWEREDDDAFDGVRQGNRSNTELAEKTIPEPELRRLRDAALRMK 371

Query: 1048 ERMTVGPAGVTEAVVKHIHDKWKEVEVTKLRFEGPACLNMKRTHEILERKTGGVVIWRSG 869
            ERM +GP GVT+A+VK IH KW   EV K+RFEGP  LNMKRTHEILE +TGG VIWRSG
Sbjct: 372  ERMRIGPGGVTQAIVKSIHSKWSVDEVVKMRFEGPPSLNMKRTHEILEDRTGGTVIWRSG 431

Query: 868  RSVVLYRGMAYQLPCIQTYLQLSDANSFHNPSMGNCSNLIAGNQAETFETDSAMNQNLS- 692
            RS+VLYRGM Y L C+Q+Y ++++ +S  +  + + S ++        +  SA   N S 
Sbjct: 432  RSIVLYRGMNYNLRCVQSYAKIAEVDS--SKKVSDVSTVVPSCVEHNLQKSSADGVNRST 489

Query: 691  ------KGFSSTAYIDRLLDQLGPRFKDWSGRNPIPVDADLLPGLVPGYKPPFRLLPYKT 530
                  +G + T  ID  LDQLGPR+KDWSGR+PIPVDADLLPG+VP YKPPFR LPY+T
Sbjct: 490  SIVSSSQGATETFDIDSFLDQLGPRYKDWSGRSPIPVDADLLPGVVPDYKPPFRQLPYRT 549

Query: 529  RTSLRDREMTVLRRLARTMPPHFVLGRNRQHQGLATAIVKLWDKSSIAKIAIKRGIPNTS 350
            + SLRD+EMT LRRLAR   PHF LGRNR+HQGLA+AIVKLW+KS+I KIAIKRG+PNT 
Sbjct: 550  KLSLRDKEMTALRRLARQTAPHFALGRNREHQGLASAIVKLWEKSTIVKIAIKRGVPNTC 609

Query: 349  NEIIAEEIKKLTGGVLLSRNKEYIVFYRGNDFVTPSIREVLVEKEKLATIKQDDEEVARL 170
            N+ +AEEIKKLTGGVL+SRNKEYI+FYRGNDF+TP IR+VLVE+++ A  +QD EE+ARL
Sbjct: 610  NDRMAEEIKKLTGGVLISRNKEYIIFYRGNDFMTPKIRQVLVEQQQQAITQQDQEELARL 669

Query: 169  R-XXXXXXXXXXXXXSLVAGTLAETVEAKHRWGKPLNPEEREMAKKNMVLTKHASLV 2
            +                VAGTLAET EA+ RWG  +N   R+  + +++L KH SL+
Sbjct: 670  KASASITLIPNALKNPQVAGTLAETREAESRWGDLINDGRRKKERNHLILAKHTSLL 726


>tpg|DAA61175.1| TPA: hypothetical protein ZEAMMB73_652631 [Zea mays]
          Length = 964

 Score =  499 bits (1284), Expect = e-138
 Identities = 313/720 (43%), Positives = 408/720 (56%), Gaps = 32/720 (4%)
 Frame = -1

Query: 2065 SPEDEDPLEVEKA-GKDVKNTRKERKKRKLRPSFNAQTLERWSVKVSSSRSTFPWQDQKI 1889
            SP    P+  + A G+  KN     K R L+PSF  Q L RWS +  S R++ PW+  + 
Sbjct: 50   SPSTSSPVASDGAVGRKSKN-----KSRPLKPSFEEQALRRWSARAPSQRASVPWEQPQQ 104

Query: 1888 ESSLNLV-------SSGTPFDQSLDGIGA-----PMLEFGVPDENFD-----EDEKCSDS 1760
            +S L           SG   DQ   G G+      ++++     + D     E+  C  +
Sbjct: 105  QSPLPPSLPHRAGRGSGDAGDQKRSGGGSSATLRSIVDYFAGGSSDDEGVRVEEGACDTT 164

Query: 1759 YLSDDSFENSVDAIPPMANHIMGIPLGMQSKLAPWAHGGKHGEDGTRSRYKSEALVNNFE 1580
             + D +     D      +++    LG     APW     H E+ T  R  S  +    E
Sbjct: 165  AVPDQAAREQDDGSHFRPSYL----LGSHPFSAPWI----HREESTNDRGVSGPVAEEEE 216

Query: 1579 FGEDK-VKPKETNFVDDKGVRSNYGDPIIASLKEECPILSSDMKARSCPTVTVSFGEERF 1403
              + +     E   VD+    ++ G+ ++    E+           + PT+  S+G +  
Sbjct: 217  RLDIRDASDDELGLVDEDKEETDNGEELLTGGLED-----EFYDDYATPTMNSSYGVD-- 269

Query: 1402 SSQHSDLDSKVAGKSSVKDDALSGGRDNCVERYKDPKASSSSNVGIGFCVSS-ENSDLKL 1226
                                 LS  +D    R+      SS N  +    +S E SD   
Sbjct: 270  ---------------------LSVDKDAYGSRFDRSMMQSSVNTIVKTLRNSMEESDPNA 308

Query: 1225 VPNSRNNDLSLASSVP--FPWEREIDSME----GEQLQRSNTELAERTIPEPELQRLRNV 1064
                 N +  +    P   PWERE +  E    G  ++RSNTELAER+IPEPEL+RLR+ 
Sbjct: 309  TVELSNAEDFVQKLGPALLPWEREEEDDEAFSGGRAVRRSNTELAERSIPEPELRRLRDT 368

Query: 1063 ALRMKERMTVGPAGVTEAVVKHIHDKWKEVEVTKLRFEGPACLNMKRTHEILERKTGGVV 884
            ALRMKER+ VGP GVT+ +V+ IH KWK  EV K+RFEGP  LNMKRTH++LE +TGGVV
Sbjct: 369  ALRMKERIKVGPGGVTQDIVESIHRKWKVDEVVKMRFEGPPSLNMKRTHDLLEDRTGGVV 428

Query: 883  IWRSGRSVVLYRGMAYQLPCIQTYLQLSDANSFHNPSMGNCSNLI-AGNQAETFETDS-- 713
            IWRSGRSVVLYRGM Y   C+Q+Y +  + +S    S  N + L   G+  +    D   
Sbjct: 429  IWRSGRSVVLYRGMNYNFQCVQSYAKFIEIDSGKGVSDANSAVLSHDGHNLQASRADGMK 488

Query: 712  --AMNQNLSKGFSSTAYIDRLLDQLGPRFKDWSGRNPIPVDADLLPGLVPGYKPPFRLLP 539
                  N S   S T  ID  LDQLGPR+KDWSGR PIPVDADLLPG+V GYKPPFR+LP
Sbjct: 489  SLTSTGNFSLESSETFDIDNFLDQLGPRYKDWSGRGPIPVDADLLPGVVHGYKPPFRVLP 548

Query: 538  YKTRTSLRDREMTVLRRLARTMPPHFVLGRNRQHQGLATAIVKLWDKSSIAKIAIKRGIP 359
            YK +++LRD+EMT LRRLAR   PHF LGRNR+HQGLA A+VKLW+KS+IAKIAIKRGIP
Sbjct: 549  YKIKSTLRDKEMTTLRRLARQTAPHFALGRNREHQGLAAAMVKLWEKSAIAKIAIKRGIP 608

Query: 358  NTSNEIIAEEIKKLTGGVLLSRNKEYIVFYRGNDFVTPSIREVLVEKEKLATIKQDDEEV 179
            NT N+ +AEEIKKLTGGVLLSRNKE+IVFYRGNDF+ P +R+VLVEK++ A  +QD+EE+
Sbjct: 609  NTCNDRMAEEIKKLTGGVLLSRNKEFIVFYRGNDFIAPKVRQVLVEKQEQAITQQDEEEL 668

Query: 178  ARLR-XXXXXXXXXXXXXSLVAGTLAETVEAKHRWGKPLNPEEREMAKKNMVLTKHASLV 2
            ARL+               LVAGTLAET EAK RWGK +N ++RE   K++ L KH SL+
Sbjct: 669  ARLKASASIITIPKDIKGPLVAGTLAETTEAKSRWGKSVNDKQREEEMKHLSLLKHTSLL 728


>ref|XP_002460114.1| hypothetical protein SORBIDRAFT_02g022940 [Sorghum bicolor]
            gi|241923491|gb|EER96635.1| hypothetical protein
            SORBIDRAFT_02g022940 [Sorghum bicolor]
          Length = 962

 Score =  491 bits (1265), Expect = e-136
 Identities = 310/715 (43%), Positives = 401/715 (56%), Gaps = 27/715 (3%)
 Frame = -1

Query: 2065 SPEDEDPLEVEKAGKDVKNTRKERKKRKLRPSFNAQTLERWSVKVSSSRSTFPW-QDQKI 1889
            SP    P+  + AG      + ++K R L+PSF    L RWS +  S R++ PW Q Q+ 
Sbjct: 50   SPSASSPVASDGAG----GRKSKKKSRPLKPSFEEHALRRWSSRAPSQRASVPWEQPQQQ 105

Query: 1888 ESSL---NLVSSGTPFDQSLDGIGAPML------EFGVPDENFD----EDEKCSDSYLSD 1748
              SL   +   SG    ++  G G+          FG    N D    E+     + +  
Sbjct: 106  SPSLPHRDSRESGGAGGRNRSGGGSSATLRSIVDYFGGVSSNDDGVGAEEGAWDTTAVQG 165

Query: 1747 DSFENSVDAIPPMANHIMGIPLGMQSKLAPWAHGGKHGEDGTRSRYKSEALVNNFEFGED 1568
            ++     D      +++    LG Q   APW HG    E+ T  R               
Sbjct: 166  EAAREQDDGSHFRPSYL----LGSQPVSAPWIHG----EESTSDRVSGPVAEGEEGMDMS 217

Query: 1567 KVKPKETNFVDDKGVRSNYGDPIIASLKEECPILSSDMKARSCPTVTVSFGEERFSSQHS 1388
             V   E +  D      + G+ +     EE   L  D    + PTV         SS   
Sbjct: 218  DVSDDELSLEDRDKEEIDDGEELPTGSSEEQ--LYDDY---ATPTVN--------SSYEV 264

Query: 1387 DLDSKVAGKSSVKDDALSGGRDNC-VERYKDPKASSSSNVGIGFCVSSENSDLKLVPNSR 1211
            DL +         D ++  G  N  V+  +     S  N  I    ++E+   KL P   
Sbjct: 265  DLSADRDSYGGRFDRSMRQGSVNTIVKTLRGSMEESDPNAAIELS-NAEDFVQKLGP--- 320

Query: 1210 NNDLSLASSVPFPWEREIDSME----GEQLQRSNTELAERTIPEPELQRLRNVALRMKER 1043
                     V  PWERE +  E    G   +RSNTELAERTIPEPEL+RLR+ ALRMKER
Sbjct: 321  ---------VLLPWEREEEDDEAFSGGRVGRRSNTELAERTIPEPELRRLRDTALRMKER 371

Query: 1042 MTVGPAGVTEAVVKHIHDKWKEVEVTKLRFEGPACLNMKRTHEILERKTGGVVIWRSGRS 863
            + VGP GVT+ +V+ IH KWK  EV K+RFEGP  LNMKRTH++LE +TGGVVIWRSGRS
Sbjct: 372  IKVGPGGVTQDIVESIHRKWKVDEVVKMRFEGPPSLNMKRTHDLLEDRTGGVVIWRSGRS 431

Query: 862  VVLYRGMAYQLPCIQTYLQLSDANSFHNPSMGNCSNLIAGNQAETFETDS-------AMN 704
            VVLYRGM Y L C+Q+Y +  + +S     + + S+ ++ +     +             
Sbjct: 432  VVLYRGMNYNLQCVQSYAKSIETDS--GKEVDDASSAVSSHGGHNLQDSREAGAKRLTST 489

Query: 703  QNLSKGFSSTAYIDRLLDQLGPRFKDWSGRNPIPVDADLLPGLVPGYKPPFRLLPYKTRT 524
            +N S   S T  ID  LDQLGPR++DWSGR P+PVDADLLPG+V GYKPPFR+LPYK ++
Sbjct: 490  ENFSLESSETFDIDNFLDQLGPRYRDWSGRGPVPVDADLLPGVVHGYKPPFRVLPYKIKS 549

Query: 523  SLRDREMTVLRRLARTMPPHFVLGRNRQHQGLATAIVKLWDKSSIAKIAIKRGIPNTSNE 344
            +LRD+EMT LRRL+R   PHF LGRNR+HQGLA A+VKLW+KS+IAKIAIKRG+PNT N+
Sbjct: 550  TLRDKEMTTLRRLSRQTAPHFALGRNREHQGLAAAMVKLWEKSAIAKIAIKRGVPNTCND 609

Query: 343  IIAEEIKKLTGGVLLSRNKEYIVFYRGNDFVTPSIREVLVEKEKLATIKQDDEEVARLR- 167
             +AEEIKKLTGGVLLSRNKEYIVFYRGNDF+ P +R+VLVEK++ A  +QD+EE+ARL+ 
Sbjct: 610  RMAEEIKKLTGGVLLSRNKEYIVFYRGNDFIAPKVRQVLVEKQEQAITQQDEEELARLKA 669

Query: 166  XXXXXXXXXXXXXSLVAGTLAETVEAKHRWGKPLNPEEREMAKKNMVLTKHASLV 2
                          LVAGTL ET EAK RWG  LN ++RE   K + L KH SL+
Sbjct: 670  SASIITVPKGIKGPLVAGTLTETTEAKSRWGMSLNDKQREEEMKRLSLLKHTSLL 724


>ref|XP_004956664.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Setaria italica]
          Length = 963

 Score =  481 bits (1239), Expect = e-133
 Identities = 300/695 (43%), Positives = 404/695 (58%), Gaps = 29/695 (4%)
 Frame = -1

Query: 1999 ERKKRKLRPSFNAQTLERWSVKVSSSRSTFPWQDQKIESS-----LNLVSSGTPFDQSLD 1835
            ++K+R L+PSF  Q L RWS +  S R++ PW+  + +S          S G+   ++ D
Sbjct: 67   KKKRRPLKPSFEEQALRRWSARAPSQRASVPWEQPQQQSPSPPHRAGRESVGSGGQKTTD 126

Query: 1834 GIGAPMLEF----------GVPDENFDEDEKCSDSYLS--DDSFENSVDAIPPMANHIMG 1691
            G  +  L            G   E  + +EK + +  +   ++  +  D      +++  
Sbjct: 127  GGSSKTLRSIVEYFAGGSSGDDGEGGEREEKGAGNAAAVRAEAARDQEDGSHFRPSYL-- 184

Query: 1690 IPLGMQSKLAPWAHGGKHGEDGTRSRYKSEALVNNFEFGEDKVKPKETNFVDDKGVRSNY 1511
              LG +   APW HG    E+ +  ++ S ++      GE+ V   + + + D  +    
Sbjct: 185  --LGNKPVSAPWMHG----EESSNDQWVSSSVAE----GEEGV---DMDDISDDELGLAE 231

Query: 1510 GDPIIASLKEECPILSSDMKARSCPTVTVSFGEERFSSQHSDLDSKVAGKSSVKDDALS- 1334
            GD       E+    SS+              EE +      + +   G   V D   + 
Sbjct: 232  GDDEELDSAEDLLNGSSE--------------EELYEDYAVQIANSSYGVDLVVDRGSNV 277

Query: 1333 GGRDNCVERYKDPKASSSSNVGIGFCVSSENSDLKLVPNSRNND-LSLASSVPFPWEREI 1157
            GG D  + R     +S +S V        E+S    +  S   D +     V  PWERE 
Sbjct: 278  GGFDRSMRR-----SSVNSIVKTLRSSMEESSPNVTIERSNAEDFVQKLGPVLLPWEREE 332

Query: 1156 DSME----GEQLQRSNTELAERTIPEPELQRLRNVALRMKERMTVGPAGVTEAVVKHIHD 989
            +  E    G+  +RSNTELAERTIPE EL+RLR+ ALRMKER+ VG  GVT+ +V+ IH 
Sbjct: 333  EDDEVFGGGKAGRRSNTELAERTIPENELRRLRDAALRMKERIKVGSGGVTQDIVESIHR 392

Query: 988  KWKEVEVTKLRFEGPACLNMKRTHEILERKTGGVVIWRSGRSVVLYRGMAYQLPCIQTYL 809
            KWK  EV K+RFEGP  LNMKRTH++LE +TGG+VIWRSGRSVVLYRGM Y L C+Q+Y 
Sbjct: 393  KWKVDEVVKMRFEGPPSLNMKRTHDLLEDRTGGIVIWRSGRSVVLYRGMNYNLQCVQSYA 452

Query: 808  QLSDANSFHNPSMGNCS-----NLIAGNQAETFETDSAMNQNLSKGFSSTAYIDRLLDQL 644
            + +  +S    +  N +     NL          + S+ N +L    +    ID  LDQL
Sbjct: 453  KSTQIDSDKEVADANSAIHGRHNLQKSRADGVKHSTSSGNFSLELEATEAFDIDSFLDQL 512

Query: 643  GPRFKDWSGRNPIPVDADLLPGLVPGYKPPFRLLPYKTRTSLRDREMTVLRRLARTMPPH 464
            GPR+KDWSGR+PIPVDADLLPG+VPGYK P+R+LPYK +++LRD+EMT LRRLAR   PH
Sbjct: 513  GPRYKDWSGRSPIPVDADLLPGVVPGYKQPYRVLPYKIKSTLRDKEMTALRRLARQTAPH 572

Query: 463  FVLGRNRQHQGLATAIVKLWDKSSIAKIAIKRGIPNTSNEIIAEEIKKLTGGVLLSRNKE 284
            F LGRNR+HQGLA A+VKLW+KS+IAKIAIKRG+PNT N+ +AEEIKKLTGGVLLSRNKE
Sbjct: 573  FALGRNREHQGLAAAMVKLWEKSAIAKIAIKRGVPNTCNDRMAEEIKKLTGGVLLSRNKE 632

Query: 283  YIVFYRGNDFVTPSIREVLVEKEKLATIKQDDEEVARLR-XXXXXXXXXXXXXSLVAGTL 107
            YI+FYRGNDF+ P +R+VLVEK++ A  + D+EE+ARL+               LVAGTL
Sbjct: 633  YIIFYRGNDFIAPKVRQVLVEKQEQAITQLDEEELARLKASASITTIPNELKGPLVAGTL 692

Query: 106  AETVEAKHRWGKPLNPEEREMAKKNMVLTKHASLV 2
            AET EAK RWG  LN ++RE   K + L KHASL+
Sbjct: 693  AETTEAKSRWGHSLNDKQREEEMKYLALMKHASLL 727


>ref|XP_006661163.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Oryza brachyantha]
          Length = 758

 Score =  466 bits (1200), Expect = e-128
 Identities = 243/399 (60%), Positives = 292/399 (73%), Gaps = 5/399 (1%)
 Frame = -1

Query: 1183 VPFPWEREIDSMEGEQL----QRSNTELAERTIPEPELQRLRNVALRMKERMTVGPAGVT 1016
            V  PWERE D      +    +RSNTELAERTIPEPEL+RLR+VALRMKERM VGP GVT
Sbjct: 126  VLLPWEREEDKEASSGVDRPRKRSNTELAERTIPEPELRRLRDVALRMKERMRVGPGGVT 185

Query: 1015 EAVVKHIHDKWKEVEVTKLRFEGPACLNMKRTHEILERKTGGVVIWRSGRSVVLYRGMAY 836
            + +V+ IH KW+  EV KLRFEGP  LNMKRTH+ILE +TGG+VIWRSGRSVVLYRGM Y
Sbjct: 186  QVLVESIHQKWRVDEVAKLRFEGPPSLNMKRTHDILEERTGGIVIWRSGRSVVLYRGMNY 245

Query: 835  QLPCIQTYLQLSDANSFHNPSMGNCSNLIAGNQAETFETDSAMNQNLSKGFSSTAYIDRL 656
             L C+Q+Y + ++ NS   P         +G         S  + + SK  + T  ID  
Sbjct: 246  NLRCVQSYTKTAEVNSDIEPIHVEHKFQKSGANGLNH---SGYSVSSSKKPTETFDIDSF 302

Query: 655  LDQLGPRFKDWSGRNPIPVDADLLPGLVPGYKPPFRLLPYKTRTSLRDREMTVLRRLART 476
            LDQLGPR+KDWSGR PIPVDADLLPG+V GY  PFRLLPYK +++LR++EMT LRRLAR 
Sbjct: 303  LDQLGPRYKDWSGRGPIPVDADLLPGVVHGYNTPFRLLPYKVKSTLRNKEMTALRRLARQ 362

Query: 475  MPPHFVLGRNRQHQGLATAIVKLWDKSSIAKIAIKRGIPNTSNEIIAEEIKKLTGGVLLS 296
              PHF LGRNR+HQGLA AIVKLW+KSSIAKIAIKRG+PNT N+ +AEE+KKLTGGVLLS
Sbjct: 363  TTPHFALGRNREHQGLAAAIVKLWEKSSIAKIAIKRGVPNTCNDRMAEELKKLTGGVLLS 422

Query: 295  RNKEYIVFYRGNDFVTPSIREVLVEKEKLATIKQDDEEVARLR-XXXXXXXXXXXXXSLV 119
            RNKEYIV YRGNDF+TP +R+VLVEK++ A   QD+EE+ARL+               L+
Sbjct: 423  RNKEYIVLYRGNDFITPKVRQVLVEKQEQAITWQDEEELARLKASASISSKPKVFKNPLI 482

Query: 118  AGTLAETVEAKHRWGKPLNPEEREMAKKNMVLTKHASLV 2
            AGTLAET EAK RWG  +N + R+  K +M++ KH SL+
Sbjct: 483  AGTLAETREAKSRWGDSINDDLRKKEKNHMIIAKHTSLL 521



 Score = 62.0 bits (149), Expect = 1e-06
 Identities = 81/330 (24%), Positives = 134/330 (40%), Gaps = 31/330 (9%)
 Frame = -1

Query: 1102 TIPEPELQRLRNVALRMKERMTVGPAGVTEAVVKHIHDKWKEVEVTKLRFEG--PACLNM 929
            T+   E+  LR +A +      +G     + +   I   W++  + K+  +   P   N 
Sbjct: 347  TLRNKEMTALRRLARQTTPHFALGRNREHQGLAAAIVKLWEKSSIAKIAIKRGVPNTCND 406

Query: 928  KRTHEILERKTGGVVIWRSGRSVVLYRGMAYQLPCIQTYL-----------------QLS 800
            +   E L++ TGGV++ R+   +VLYRG  +  P ++  L                 +L 
Sbjct: 407  RMAEE-LKKLTGGVLLSRNKEYIVLYRGNDFITPKVRQVLVEKQEQAITWQDEEELARLK 465

Query: 799  DANSFHNPSMGNCSNLIAGNQAETFETDS----AMNQNLSKGFSSTAYIDRLLDQLGPRF 632
             + S  +      + LIAG  AET E  S    ++N +L K   +   I +    L    
Sbjct: 466  ASASISSKPKVFKNPLIAGTLAETREAKSRWGDSINDDLRKKEKNHMIIAKHTSLLRNLK 525

Query: 631  KDWSGRNPIPVDADLLPGLVPGYKPPFRLLPYKTRTSLRDREMTVLRRLARTMPPHFVLG 452
            +           A+     V  Y  P  L P    T + D E  +LRR+   M    +LG
Sbjct: 526  RKLFLAKTKVTKAEEALAKVQEYLSPAEL-PTDLET-VTDEERFLLRRIGLKMKAFLMLG 583

Query: 451  RNRQHQGLATAIVKLWDKSSIAKIAIK-RGIPNTSNEIIAEEIKKLTGGVLLSRNKE--- 284
            R     G    +   W    + KI +K +  P   +  IA  ++  +GGVL+S +K    
Sbjct: 584  RREVFDGTVQNMHLHWKHRELVKILVKGKSFPQVKH--IAISLEAESGGVLISVDKTTKG 641

Query: 283  -YIVFYRGNDFVTPSI---REVLVEKEKLA 206
              I+ YRG ++  P I   R +L  ++ LA
Sbjct: 642  YAIILYRGKNYKRPQILKPRNLLSRRKALA 671


>gb|EMS55813.1| Chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic [Triticum urartu]
          Length = 913

 Score =  452 bits (1164), Expect = e-124
 Identities = 289/667 (43%), Positives = 381/667 (57%), Gaps = 50/667 (7%)
 Frame = -1

Query: 2065 SPEDEDPLEVEKAGKDVKNTRKERKKRKLRPSFNAQTLERWSVKVSSSRSTFPWQDQKIE 1886
            SP    P E        +  RK +K+R L+PSF  Q L RWS +  S R++FPWQ Q+ +
Sbjct: 45   SPSTSSPYEAPALDGGTE-IRKNKKRRNLKPSFEEQALRRWSARAPSLRASFPWQRQQSQ 103

Query: 1885 ----------------SSLNLVSSGTPFDQSLDGIGAPMLEFGVPDENFDEDEKCSDSYL 1754
                            ++L  +     +D S DG G      GV +      +K +D   
Sbjct: 104  LAHREDERAEDHEPSSATLRSIVEYFDYDSSEDGGG------GVGEGK----DKGNDGVA 153

Query: 1753 SDDSFENSVDAIPPMANHIMGIPLGMQSKLAPWAHGGKHGE------DGTRSRYKSEALV 1592
              ++ ++  +   P  N++    LG +   APW HG +          G     + EA+ 
Sbjct: 154  HGEAAQDRDEESRPQPNYL----LGTRPFSAPWMHGQEGPTVVDRPLSGPVGGDEEEAVR 209

Query: 1591 NNFEFGEDKVKPKETNFVDDKGVRSNYGDPIIASLKEECPILSSDMKARSCPTVTVSFGE 1412
            +     E   + ++  +VD+  V     +P+  +L+EE   L  D +  + PT + SF  
Sbjct: 210  SGVFDDELDSEDEDEEWVDNSEVLEE--EPMAVNLEEE---LYED-EDPAAPTASSSF-- 261

Query: 1411 ERFSSQHSDLDSKVAGKSSVKDDALSGGRDNCVERYKDPKASSSSNVGIGFCVSSENSDL 1232
                     LDS +  +SS        G D  + R      SS S++      S E S  
Sbjct: 262  --------PLDSILEDQSST-----GSGFDRSIRR------SSVSSIVNTLRNSMEES-- 300

Query: 1231 KLVPNSRNND-LSLASSVPFPWEREI-DSMEGEQLQR-SNTELAERTIPEPELQRLRNVA 1061
              + +S   D +    SV  PWERE  ++ +G++  R SNT+LAERTIPEPEL+RLR+ A
Sbjct: 301  ATIGSSEGEDFVQKLGSVLLPWEREEGNAFDGDKRGRHSNTKLAERTIPEPELRRLRDAA 360

Query: 1060 LRMKERMTVGPAGVTEAVVKHIHDKWKEVEVTKLRFEGPACLNMKRTHEILERKTGGVVI 881
            LRMKERM VGP GVT A+V++IH KWK  EV K+RFEGP  LNMKRTHEILE +TGG VI
Sbjct: 361  LRMKERMRVGPGGVTHAIVENIHSKWKVDEVVKMRFEGPPSLNMKRTHEILEDRTGGTVI 420

Query: 880  WRSGRSVVLYRGMAYQLPCIQTYLQLSDANSFHNPSMGNCSNLIAGN-----QAETFETD 716
            WRSGRS+VLYRGM Y L C+Q+Y ++++ +S  N   G+   ++  +     Q  T E D
Sbjct: 421  WRSGRSIVLYRGMNYNLRCVQSYAKIAEVDSSENA--GDAIGVVPSSEEHDLQKPTVEHD 478

Query: 715  --------------------SAMNQNLSKGFSSTAYIDRLLDQLGPRFKDWSGRNPIPVD 596
                                S    N S+  + T  ID  LDQLGPR+KDWSGR+P+PVD
Sbjct: 479  LQKPVVERNSQKSSAEDVKRSTSVMNFSQEATETFDIDSFLDQLGPRYKDWSGRSPVPVD 538

Query: 595  ADLLPGLVPGYKPPFRLLPYKTRTSLRDREMTVLRRLARTMPPHFVLGRNRQHQGLATAI 416
            ADLLPGLVPGYKPPFR LPY+T+ SL+D+EMT LRRLAR   PHF LGRNR+HQGLA AI
Sbjct: 539  ADLLPGLVPGYKPPFRQLPYRTKISLKDKEMTALRRLARQTAPHFALGRNREHQGLAAAI 598

Query: 415  VKLWDKSSIAKIAIKRGIPNTSNEIIAEEIKKLTGGVLLSRNKEYIVFYRGNDFVTPSIR 236
            VK+W+KSSI KIAIKRG+PNT N+ +AEEIKKLTGGVL+SRNKEYI FYRGNDFVTP  +
Sbjct: 599  VKVWEKSSIVKIAIKRGVPNTCNDRMAEEIKKLTGGVLVSRNKEYINFYRGNDFVTPKAK 658

Query: 235  EVLVEKE 215
              + + E
Sbjct: 659  TKVAKAE 665


>ref|XP_002514120.1| conserved hypothetical protein [Ricinus communis]
            gi|223546576|gb|EEF48074.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 930

 Score =  436 bits (1120), Expect = e-119
 Identities = 287/687 (41%), Positives = 393/687 (57%), Gaps = 22/687 (3%)
 Frame = -1

Query: 1996 RKKRKLRPSFNAQTLERWSVKVSSSRSTFPWQDQKIESSLNLVSSGTPFDQSLDGIGAPM 1817
            + KRK RPSF  Q  ++WS+KV S+R TFPWQ+                +Q  +  G   
Sbjct: 78   KTKRKPRPSFFEQIRDKWSLKVPSTRDTFPWQEP---------------EQQQEHQGQ-- 120

Query: 1816 LEFGVPDENFDEDEKCSDSYLSDDSFENSVDAIPP-MANHIMGIPLGMQSKLAPWAHGGK 1640
               G  DE  +E E+C  S ++    E  +DA P  + +  + + L      APW HG +
Sbjct: 121  ---GKNDE--EEIERCEISGVTLSKAE--IDANPSSIDDDSVSVSLPNHLTTAPWVHGTR 173

Query: 1639 HGEDGTRSRYKSEALVNNFEFGEDKVKPKETNFVDDKGVRSNYGDPIIASLKEECPILSS 1460
              ++   SR K          GE+ V+      VD           I+ +L++E  +  +
Sbjct: 174  PKKNHFSSRPK---------IGENVVQNDVHTVVD-----------IVENLEKE--VTCN 211

Query: 1459 DMKARSCPTVTVSFGEERFSSQHSDLDSKVAGKSSVKDDALSGGRDNCVERYKDPKASS- 1283
            D   +    + V   E      + D   K A K  V   ++   RDN + R K  K+ S 
Sbjct: 212  DKFKKEDNILHVDNAERLVKEVNYDKKFKEA-KVQVGGFSVELKRDNEIARAKYSKSPSY 270

Query: 1282 ------SSNVGIGFCVSSENSDLKLVPNSRNNDLSLASSVPFPWERE--IDSMEGE-QLQ 1130
                   +N G G  VS +++               +SS+  PWE+E  ++S+EG  + +
Sbjct: 271  INEKPFGANGGYGVQVSYDDN---------------SSSIELPWEKERVMESVEGYLRGK 315

Query: 1129 RSNTELAERTIPEPELQRLRNVALRMKERMTVGPAGVTEAVVKHIHDKWKEVEVTKLRFE 950
            RSNTELAER +PE EL+RLRNVALRM ER+ VG AG+ + +V  +H+KW+  EV KL+FE
Sbjct: 316  RSNTELAERMLPEHELKRLRNVALRMYERIKVGAAGINQDLVDAVHEKWRLDEVVKLKFE 375

Query: 949  GPACLNMKRTHEILERKTGGVVIWRSGRSVVLYRGMAYQLPCIQTYLQLSDANS---FHN 779
             P   NM+RTHEILE +TGG+VIWRSG SVVLYRG++Y+L C++++ +  +A      H 
Sbjct: 376  EPLSFNMRRTHEILENRTGGLVIWRSGSSVVLYRGISYKLHCVRSFSKQDEAGKEILAHP 435

Query: 778  PSMGNCSNLIAG-----NQAETFETDSAMN-QNLSKG-FSSTAYIDRLLDQLGPRFKDWS 620
              + + + L  G        E++  D A   ++LS+   +    +++ LD+LGPRF+DW 
Sbjct: 436  EEVTSNATLNIGVKHFIGTTESYIPDRAKYLKDLSREELTDFTELNQFLDELGPRFEDWC 495

Query: 619  GRNPIPVDADLLPGLVPGYKPPFRLLPYKTRTSLRDREMTVLRRLARTMPPHFVLGRNRQ 440
            GR P+PVDADLL  + PGYKPPFRLLPY  R  L D+EMT+ RRLART+PPHF LGRNRQ
Sbjct: 496  GREPLPVDADLLLAVDPGYKPPFRLLPYGVRHCLTDKEMTIFRRLARTVPPHFALGRNRQ 555

Query: 439  HQGLATAIVKLWDKSSIAKIAIKRGIPNTSNEIIAEEIKKLTGGVLLSRNKEYIVFYRGN 260
             QGLA AIVKLW++S+I KIAIKRG+ NT NE +AEE+K LTGG+LLSRNKEYIVFYRGN
Sbjct: 556  LQGLAKAIVKLWERSAIVKIAIKRGVQNTRNERMAEELKVLTGGILLSRNKEYIVFYRGN 615

Query: 259  DFVTPSIREVLVEKEKLATIKQDDEEVAR-LRXXXXXXXXXXXXXSLVAGTLAETVEAKH 83
            DF+ P+I + L E++KL  +KQD+EE AR +               LVAGTLAETV A  
Sbjct: 616  DFLPPAIVKTLKERKKLTYLKQDEEEQARQMALASVESSAKTSKVPLVAGTLAETVAATS 675

Query: 82   RWGKPLNPEEREMAKKNMVLTKHASLV 2
             W       + +   +  VL K ASLV
Sbjct: 676  HWRDQRGSPDIDEMLREAVLAKRASLV 702


>gb|EOY30435.1| CRS1 / YhbY domain-containing protein, putative isoform 5 [Theobroma
            cacao] gi|508783180|gb|EOY30436.1| CRS1 / YhbY
            domain-containing protein, putative isoform 5 [Theobroma
            cacao]
          Length = 822

 Score =  421 bits (1082), Expect = e-115
 Identities = 260/560 (46%), Positives = 336/560 (60%), Gaps = 33/560 (5%)
 Frame = -1

Query: 1582 EFGEDKVKPKETNFVDDKGVRSNYGDPIIASLKEECP-ILSSDMKARSCPTVTVS----F 1418
            EF E++V+ K++           +G  I  S ++E P +  SD  + S P+  +S     
Sbjct: 94   EFEEEEVERKQS-----------FGGAISESERDEDPQVEGSDPVSSSFPSRVISAPWSH 142

Query: 1417 GEERFSSQHSDLDSKVAGKSSVKDDALS---------GGRDNCVERYKDPKASSSSNVGI 1265
            G E F+  H D   +++   S  +D+ +         G +   V    D   S +  V I
Sbjct: 143  GSE-FNEPHFDFVPEISNFESKIEDSFASEKTIEFPGGNKAEVVGGLIDKSESLNEEVNI 201

Query: 1264 -----GFCVSSENS---DLKLVPNSRNNDLSLASSVPFPWEREIDSMEGEQLQRSNTELA 1109
                 G  V  E +    L  V +SR N   +++S       E DS  G   +RSNTE+ 
Sbjct: 202  NKQKIGLPVGKEVAAVEGLNDVVSSREN-FEVSNSDDEGGSVEGDS--GRSKKRSNTEMV 258

Query: 1108 ERTIPEPELQRLRNVALRMKERMTVGPAGVTEAVVKHIHDKWKEVEVTKLRFEGPACLNM 929
            +R IPE E QRLRNVALRM ER  VG AG+T+A+V++IH++WK  EV KL+FE P  LNM
Sbjct: 259  DRMIPEHESQRLRNVALRMVERTKVGVAGITQALVEYIHERWKMDEVVKLKFEEPLSLNM 318

Query: 928  KRTHEILERKTGGVVIWRSGRSVVLYRGMAYQLPCIQTYLQLS--DANSFH---NPSMGN 764
            KRTHEILE++TGG+VIWRSG S+VLYRGMAY+L C+Q+Y   +  D N+     N     
Sbjct: 319  KRTHEILEQRTGGLVIWRSGSSLVLYRGMAYKLHCVQSYTSQNKVDMNALDCSTNVESDT 378

Query: 763  CSNLIAGNQAETFETDSAMNQNLSKGFSSTAYID-----RLLDQLGPRFKDWSGRNPIPV 599
              N++      T E     +    K  S    +D      LLD+LGPR+KDWSGR P+PV
Sbjct: 379  TQNIVVKESVRTMECFMPSSSEYLKDLSKEELMDLCELNHLLDELGPRYKDWSGREPLPV 438

Query: 598  DADLLPGLVPGYKPPFRLLPYKTRTSLRDREMTVLRRLARTMPPHFVLGRNRQHQGLATA 419
            DADLLP +VPGY+PPFR LPY  R  L+D EMT  RRLART+PPHF LGRNR+ QGLA A
Sbjct: 439  DADLLPPVVPGYQPPFRRLPYGIRHCLKDHEMTTFRRLARTVPPHFALGRNRELQGLAEA 498

Query: 418  IVKLWDKSSIAKIAIKRGIPNTSNEIIAEEIKKLTGGVLLSRNKEYIVFYRGNDFVTPSI 239
            IVKLW+ S+IAKIAIKRG+ NT NE +AEE+K+LTGG LLSRNKE+IVFYRGNDF+ P +
Sbjct: 499  IVKLWESSAIAKIAIKRGVQNTRNERMAEELKQLTGGTLLSRNKEFIVFYRGNDFLPPVV 558

Query: 238  REVLVEKEKLATIKQDDEEVARLR-XXXXXXXXXXXXXSLVAGTLAETVEAKHRWGKPLN 62
             + L E++K   ++Q++EE AR R               LVAGTLAET  A  RWG   +
Sbjct: 559  TKTLKERQKSRNLQQEEEEKARERVLALVGSNAKASKLPLVAGTLAETTAATSRWGHQPS 618

Query: 61   PEEREMAKKNMVLTKHASLV 2
             EE E  KKN  LT+ ASLV
Sbjct: 619  IEEVEEMKKNSALTQQASLV 638


>gb|EOY30434.1| CRS1 / YhbY domain-containing protein, putative isoform 4 [Theobroma
            cacao]
          Length = 818

 Score =  421 bits (1082), Expect = e-115
 Identities = 260/560 (46%), Positives = 336/560 (60%), Gaps = 33/560 (5%)
 Frame = -1

Query: 1582 EFGEDKVKPKETNFVDDKGVRSNYGDPIIASLKEECP-ILSSDMKARSCPTVTVS----F 1418
            EF E++V+ K++           +G  I  S ++E P +  SD  + S P+  +S     
Sbjct: 94   EFEEEEVERKQS-----------FGGAISESERDEDPQVEGSDPVSSSFPSRVISAPWSH 142

Query: 1417 GEERFSSQHSDLDSKVAGKSSVKDDALS---------GGRDNCVERYKDPKASSSSNVGI 1265
            G E F+  H D   +++   S  +D+ +         G +   V    D   S +  V I
Sbjct: 143  GSE-FNEPHFDFVPEISNFESKIEDSFASEKTIEFPGGNKAEVVGGLIDKSESLNEEVNI 201

Query: 1264 -----GFCVSSENS---DLKLVPNSRNNDLSLASSVPFPWEREIDSMEGEQLQRSNTELA 1109
                 G  V  E +    L  V +SR N   +++S       E DS  G   +RSNTE+ 
Sbjct: 202  NKQKIGLPVGKEVAAVEGLNDVVSSREN-FEVSNSDDEGGSVEGDS--GRSKKRSNTEMV 258

Query: 1108 ERTIPEPELQRLRNVALRMKERMTVGPAGVTEAVVKHIHDKWKEVEVTKLRFEGPACLNM 929
            +R IPE E QRLRNVALRM ER  VG AG+T+A+V++IH++WK  EV KL+FE P  LNM
Sbjct: 259  DRMIPEHESQRLRNVALRMVERTKVGVAGITQALVEYIHERWKMDEVVKLKFEEPLSLNM 318

Query: 928  KRTHEILERKTGGVVIWRSGRSVVLYRGMAYQLPCIQTYLQLS--DANSFH---NPSMGN 764
            KRTHEILE++TGG+VIWRSG S+VLYRGMAY+L C+Q+Y   +  D N+     N     
Sbjct: 319  KRTHEILEQRTGGLVIWRSGSSLVLYRGMAYKLHCVQSYTSQNKVDMNALDCSTNVESDT 378

Query: 763  CSNLIAGNQAETFETDSAMNQNLSKGFSSTAYID-----RLLDQLGPRFKDWSGRNPIPV 599
              N++      T E     +    K  S    +D      LLD+LGPR+KDWSGR P+PV
Sbjct: 379  TQNIVVKESVRTMECFMPSSSEYLKDLSKEELMDLCELNHLLDELGPRYKDWSGREPLPV 438

Query: 598  DADLLPGLVPGYKPPFRLLPYKTRTSLRDREMTVLRRLARTMPPHFVLGRNRQHQGLATA 419
            DADLLP +VPGY+PPFR LPY  R  L+D EMT  RRLART+PPHF LGRNR+ QGLA A
Sbjct: 439  DADLLPPVVPGYQPPFRRLPYGIRHCLKDHEMTTFRRLARTVPPHFALGRNRELQGLAEA 498

Query: 418  IVKLWDKSSIAKIAIKRGIPNTSNEIIAEEIKKLTGGVLLSRNKEYIVFYRGNDFVTPSI 239
            IVKLW+ S+IAKIAIKRG+ NT NE +AEE+K+LTGG LLSRNKE+IVFYRGNDF+ P +
Sbjct: 499  IVKLWESSAIAKIAIKRGVQNTRNERMAEELKQLTGGTLLSRNKEFIVFYRGNDFLPPVV 558

Query: 238  REVLVEKEKLATIKQDDEEVARLR-XXXXXXXXXXXXXSLVAGTLAETVEAKHRWGKPLN 62
             + L E++K   ++Q++EE AR R               LVAGTLAET  A  RWG   +
Sbjct: 559  TKTLKERQKSRNLQQEEEEKARERVLALVGSNAKASKLPLVAGTLAETTAATSRWGHQPS 618

Query: 61   PEEREMAKKNMVLTKHASLV 2
             EE E  KKN  LT+ ASLV
Sbjct: 619  IEEVEEMKKNSALTQQASLV 638


>gb|EOY30431.1| CRS1 / YhbY domain-containing protein, putative isoform 1 [Theobroma
            cacao] gi|508783176|gb|EOY30432.1| CRS1 / YhbY
            domain-containing protein, putative isoform 1 [Theobroma
            cacao] gi|508783177|gb|EOY30433.1| CRS1 / YhbY
            domain-containing protein, putative isoform 1 [Theobroma
            cacao]
          Length = 873

 Score =  421 bits (1082), Expect = e-115
 Identities = 260/560 (46%), Positives = 336/560 (60%), Gaps = 33/560 (5%)
 Frame = -1

Query: 1582 EFGEDKVKPKETNFVDDKGVRSNYGDPIIASLKEECP-ILSSDMKARSCPTVTVS----F 1418
            EF E++V+ K++           +G  I  S ++E P +  SD  + S P+  +S     
Sbjct: 94   EFEEEEVERKQS-----------FGGAISESERDEDPQVEGSDPVSSSFPSRVISAPWSH 142

Query: 1417 GEERFSSQHSDLDSKVAGKSSVKDDALS---------GGRDNCVERYKDPKASSSSNVGI 1265
            G E F+  H D   +++   S  +D+ +         G +   V    D   S +  V I
Sbjct: 143  GSE-FNEPHFDFVPEISNFESKIEDSFASEKTIEFPGGNKAEVVGGLIDKSESLNEEVNI 201

Query: 1264 -----GFCVSSENS---DLKLVPNSRNNDLSLASSVPFPWEREIDSMEGEQLQRSNTELA 1109
                 G  V  E +    L  V +SR N   +++S       E DS  G   +RSNTE+ 
Sbjct: 202  NKQKIGLPVGKEVAAVEGLNDVVSSREN-FEVSNSDDEGGSVEGDS--GRSKKRSNTEMV 258

Query: 1108 ERTIPEPELQRLRNVALRMKERMTVGPAGVTEAVVKHIHDKWKEVEVTKLRFEGPACLNM 929
            +R IPE E QRLRNVALRM ER  VG AG+T+A+V++IH++WK  EV KL+FE P  LNM
Sbjct: 259  DRMIPEHESQRLRNVALRMVERTKVGVAGITQALVEYIHERWKMDEVVKLKFEEPLSLNM 318

Query: 928  KRTHEILERKTGGVVIWRSGRSVVLYRGMAYQLPCIQTYLQLS--DANSFH---NPSMGN 764
            KRTHEILE++TGG+VIWRSG S+VLYRGMAY+L C+Q+Y   +  D N+     N     
Sbjct: 319  KRTHEILEQRTGGLVIWRSGSSLVLYRGMAYKLHCVQSYTSQNKVDMNALDCSTNVESDT 378

Query: 763  CSNLIAGNQAETFETDSAMNQNLSKGFSSTAYID-----RLLDQLGPRFKDWSGRNPIPV 599
              N++      T E     +    K  S    +D      LLD+LGPR+KDWSGR P+PV
Sbjct: 379  TQNIVVKESVRTMECFMPSSSEYLKDLSKEELMDLCELNHLLDELGPRYKDWSGREPLPV 438

Query: 598  DADLLPGLVPGYKPPFRLLPYKTRTSLRDREMTVLRRLARTMPPHFVLGRNRQHQGLATA 419
            DADLLP +VPGY+PPFR LPY  R  L+D EMT  RRLART+PPHF LGRNR+ QGLA A
Sbjct: 439  DADLLPPVVPGYQPPFRRLPYGIRHCLKDHEMTTFRRLARTVPPHFALGRNRELQGLAEA 498

Query: 418  IVKLWDKSSIAKIAIKRGIPNTSNEIIAEEIKKLTGGVLLSRNKEYIVFYRGNDFVTPSI 239
            IVKLW+ S+IAKIAIKRG+ NT NE +AEE+K+LTGG LLSRNKE+IVFYRGNDF+ P +
Sbjct: 499  IVKLWESSAIAKIAIKRGVQNTRNERMAEELKQLTGGTLLSRNKEFIVFYRGNDFLPPVV 558

Query: 238  REVLVEKEKLATIKQDDEEVARLR-XXXXXXXXXXXXXSLVAGTLAETVEAKHRWGKPLN 62
             + L E++K   ++Q++EE AR R               LVAGTLAET  A  RWG   +
Sbjct: 559  TKTLKERQKSRNLQQEEEEKARERVLALVGSNAKASKLPLVAGTLAETTAATSRWGHQPS 618

Query: 61   PEEREMAKKNMVLTKHASLV 2
             EE E  KKN  LT+ ASLV
Sbjct: 619  IEEVEEMKKNSALTQQASLV 638


>ref|XP_004171699.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like, partial [Cucumis sativus]
          Length = 789

 Score =  420 bits (1080), Expect = e-114
 Identities = 270/686 (39%), Positives = 368/686 (53%), Gaps = 21/686 (3%)
 Frame = -1

Query: 1996 RKKRKLRPSFNAQTLERWSVKVSSSRSTFPWQDQKIESSLNLVSSGTPFDQSLDGIGAPM 1817
            +KKRK RPSF  Q   +WS K  SS  TFPWQ Q+ +             +  +G G   
Sbjct: 15   KKKRKPRPSFLEQIRHKWSTKPISSTHTFPWQQQEQDR----------HHKQDEGEG--- 61

Query: 1816 LEFGVPDENFDEDEKCSDSYLSDDSFENSVDAIPPMANHIMGIPLGMQSKLAPWAHGGKH 1637
                  +E  +E+E+ ++        + SV       +    +P+  +S  APWAHG   
Sbjct: 62   -----EEEEEEEEEQVAN--------QTSVSIPESTTDVTQAVPI-TRSISAPWAHG--- 104

Query: 1636 GEDGTRSRYKSEALVNNFEFGEDKVKPKETNFVDDKGVRSNYGDPIIASLKEECPILSSD 1457
                      S++    F+F     KPK  N           G+ I     E   I + D
Sbjct: 105  ----------SQSRNTQFDF-----KPKTPN-----------GEVI----NEISKISTDD 134

Query: 1456 MKARSCPTVTVSFGEERFSSQHSDLDSKVAGKSSVKDDALSGGRDNCVERYKDPKASSSS 1277
               R+  T+++    +  S   +++D+ V   +  +                     S+ 
Sbjct: 135  TSNRNASTISIDEISDDSSEDEAEIDTVVLPVTEKR---------------------STL 173

Query: 1276 NVGIGFCVSSENSDLKLVPNSRNNDLSLASSVPFPWERE--IDSMEGEQLQRSNTELAER 1103
            +  I   VSS+N D     N R         V  PW+RE   DS      +RS T LAE+
Sbjct: 174  SKKIVHSVSSDNDD-----NGR---------VDLPWKREPRRDSEVDAGQRRSKTLLAEQ 219

Query: 1102 TIPEPELQRLRNVALRMKERMTVGPAGVTEAVVKHIHDKWKEVEVTKLRFEGPACLNMKR 923
             +PE EL+RLRN++LRM ER+ VG  G+T+ ++  IH+KWK  EV KL+FEGP  +NMKR
Sbjct: 220  MLPEHELRRLRNISLRMVERIEVGVKGITQELLDSIHEKWKVDEVVKLKFEGPLTVNMKR 279

Query: 922  THEILERKTGGVVIWRSGRSVVLYRGMAYQLPCIQTYLQLSDA----------------- 794
             HE LE +TGG+VIWRSG  +VLYRGM Y LPC+Q+Y + + A                 
Sbjct: 280  AHEKLENRTGGLVIWRSGSLIVLYRGMTYHLPCVQSYAKQNQAKSNTLDVPNNVESDDIT 339

Query: 793  -NSFHNPSMGNCSNLIAGNQAETFETDSAMNQNLSKGFSSTAYIDRLLDQLGPRFKDWSG 617
             N   + ++G  S +++G    T          LS        ++ LLD++GPRFKDWSG
Sbjct: 340  RNEKLHTTVGTMSTIVSGASKHTKTLSKKELMELSD-------LNHLLDEIGPRFKDWSG 392

Query: 616  RNPIPVDADLLPGLVPGYKPPFRLLPYKTRTSLRDREMTVLRRLARTMPPHFVLGRNRQH 437
              P+PVDADLLPG+VPGYKPP R+LPY  R  LR++E+T+ RRLAR MPPHF LGRNRQ 
Sbjct: 393  CEPVPVDADLLPGIVPGYKPPTRILPYGVRHCLRNKEVTIFRRLARKMPPHFALGRNRQL 452

Query: 436  QGLATAIVKLWDKSSIAKIAIKRGIPNTSNEIIAEEIKKLTGGVLLSRNKEYIVFYRGND 257
            QGLA A+VKLW+K +IAKIAIKRG+ NT NE +AEE++ LTGG LLSRNKEYIVFYRGND
Sbjct: 453  QGLANAMVKLWEKCAIAKIAIKRGVENTRNERMAEELRILTGGTLLSRNKEYIVFYRGND 512

Query: 256  FVTPSIREVLVEKEKLATIKQD-DEEVARLRXXXXXXXXXXXXXSLVAGTLAETVEAKHR 80
            ++ P+I E L E+ KLA  +QD +E+V ++               LVAGTL ET+ A  R
Sbjct: 513  YLPPTITEALKERRKLADRQQDVEEQVRQVASAAIESKVKASNAPLVAGTLTETIAATSR 572

Query: 79   WGKPLNPEEREMAKKNMVLTKHASLV 2
            WG   +  + E  +++  L K  SL+
Sbjct: 573  WGSQPSGHDIENMREDSALAKLDSLI 598


>ref|XP_004144114.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Cucumis sativus]
          Length = 846

 Score =  420 bits (1080), Expect = e-114
 Identities = 270/686 (39%), Positives = 368/686 (53%), Gaps = 21/686 (3%)
 Frame = -1

Query: 1996 RKKRKLRPSFNAQTLERWSVKVSSSRSTFPWQDQKIESSLNLVSSGTPFDQSLDGIGAPM 1817
            +KKRK RPSF  Q   +WS K  SS  TFPWQ Q+ +             +  +G G   
Sbjct: 72   KKKRKPRPSFLEQIRHKWSTKPISSTHTFPWQQQEQDR----------HHKQDEGEG--- 118

Query: 1816 LEFGVPDENFDEDEKCSDSYLSDDSFENSVDAIPPMANHIMGIPLGMQSKLAPWAHGGKH 1637
                  +E  +E+E+ ++        + SV       +    +P+  +S  APWAHG   
Sbjct: 119  -----EEEEEEEEEQVAN--------QTSVSIPESTTDVTQAVPI-TRSISAPWAHG--- 161

Query: 1636 GEDGTRSRYKSEALVNNFEFGEDKVKPKETNFVDDKGVRSNYGDPIIASLKEECPILSSD 1457
                      S++    F+F     KPK  N           G+ I     E   I + D
Sbjct: 162  ----------SQSRNTQFDF-----KPKTPN-----------GEVI----NEISKISTDD 191

Query: 1456 MKARSCPTVTVSFGEERFSSQHSDLDSKVAGKSSVKDDALSGGRDNCVERYKDPKASSSS 1277
               R+  T+++    +  S   +++D+ V   +  +                     S+ 
Sbjct: 192  TSNRNASTISIDEISDDSSEDEAEIDTVVLPVTEKR---------------------STL 230

Query: 1276 NVGIGFCVSSENSDLKLVPNSRNNDLSLASSVPFPWERE--IDSMEGEQLQRSNTELAER 1103
            +  I   VSS+N D     N R         V  PW+RE   DS      +RS T LAE+
Sbjct: 231  SKKIVHSVSSDNDD-----NGR---------VDLPWKREPRRDSEVDAGQRRSKTLLAEQ 276

Query: 1102 TIPEPELQRLRNVALRMKERMTVGPAGVTEAVVKHIHDKWKEVEVTKLRFEGPACLNMKR 923
             +PE EL+RLRN++LRM ER+ VG  G+T+ ++  IH+KWK  EV KL+FEGP  +NMKR
Sbjct: 277  MLPEHELRRLRNISLRMVERIEVGVKGITQELLDSIHEKWKVDEVVKLKFEGPLTVNMKR 336

Query: 922  THEILERKTGGVVIWRSGRSVVLYRGMAYQLPCIQTYLQLSDA----------------- 794
             HE LE +TGG+VIWRSG  +VLYRGM Y LPC+Q+Y + + A                 
Sbjct: 337  AHEKLENRTGGLVIWRSGSLIVLYRGMTYHLPCVQSYAKQNQAKSNTLDVPNNVESDDIT 396

Query: 793  -NSFHNPSMGNCSNLIAGNQAETFETDSAMNQNLSKGFSSTAYIDRLLDQLGPRFKDWSG 617
             N   + ++G  S +++G    T          LS        ++ LLD++GPRFKDWSG
Sbjct: 397  RNEKLHTTVGTMSTIVSGASKHTKTLSKKELMELSD-------LNHLLDEIGPRFKDWSG 449

Query: 616  RNPIPVDADLLPGLVPGYKPPFRLLPYKTRTSLRDREMTVLRRLARTMPPHFVLGRNRQH 437
              P+PVDADLLPG+VPGYKPP R+LPY  R  LR++E+T+ RRLAR MPPHF LGRNRQ 
Sbjct: 450  CEPVPVDADLLPGIVPGYKPPTRILPYGVRHCLRNKEVTIFRRLARKMPPHFALGRNRQL 509

Query: 436  QGLATAIVKLWDKSSIAKIAIKRGIPNTSNEIIAEEIKKLTGGVLLSRNKEYIVFYRGND 257
            QGLA A+VKLW+K +IAKIAIKRG+ NT NE +AEE++ LTGG LLSRNKEYIVFYRGND
Sbjct: 510  QGLANAMVKLWEKCAIAKIAIKRGVENTRNERMAEELRILTGGTLLSRNKEYIVFYRGND 569

Query: 256  FVTPSIREVLVEKEKLATIKQD-DEEVARLRXXXXXXXXXXXXXSLVAGTLAETVEAKHR 80
            ++ P+I E L E+ KLA  +QD +E+V ++               LVAGTL ET+ A  R
Sbjct: 570  YLPPTITEALKERRKLADRQQDVEEQVRQVASAAIESKVKASNAPLVAGTLTETIAATSR 629

Query: 79   WGKPLNPEEREMAKKNMVLTKHASLV 2
            WG   +  + E  +++  L K  SL+
Sbjct: 630  WGSQPSGHDIENMREDSALAKLDSLI 655


>ref|XP_002309217.2| hypothetical protein POPTR_0006s15340g [Populus trichocarpa]
            gi|550336383|gb|EEE92740.2| hypothetical protein
            POPTR_0006s15340g [Populus trichocarpa]
          Length = 977

 Score =  419 bits (1078), Expect = e-114
 Identities = 277/731 (37%), Positives = 382/731 (52%), Gaps = 66/731 (9%)
 Frame = -1

Query: 1996 RKKRKLRPSFNAQTLERWSVKVSSSRSTFPWQDQKIESSLNLVSSGTPFDQSLDGIGAPM 1817
            + KRK +PSF  Q   +WS+K++S+R  FPWQ+Q+ +                       
Sbjct: 45   KSKRKPKPSFFEQIHHKWSLKLTSTRDKFPWQEQEQQQQQQQEEE--------------- 89

Query: 1816 LEFGVPDENFDEDEKCSDSYLSDDSFENSVDAIPPMANHIMGIPLGMQSKLAPWAHGG-- 1643
                  +E  +ED K              VDA+P +++ +    L  +    PW HG   
Sbjct: 90   ------EEEEEEDIK-------------EVDAVPSVSDTV-SFNLPNRLTTPPWIHGATP 129

Query: 1642 KHGEDGTRSRYKSEALVNNFEFGEDKVK----PKETNFVDDKGVRSNYGDPIIASLKEEC 1475
            K      + R    ++   FE  ED V      KE     +  + +N+ + ++       
Sbjct: 130  KQAHFDYQPRKGDNSIHGVFENREDNVVNGVIDKEERIEKEVNLDNNFKEQVVDFDDASV 189

Query: 1474 PILSSDMKARSCPTVTVSFGEERFSSQHSDLDSKVAGK------------SSVKDD---- 1343
              L    + + C     +   E  +++    +  VA K            +  KD     
Sbjct: 190  FQLPEAKEIKDCSVHRYAENREEDNAEEDSREDNVANKKESVGKKINCNLNKFKDKHYYN 249

Query: 1342 --ALSGGRDNCV-----------ERYKDPKASSSSNVGI---GFCVSSEN------SDLK 1229
               L G ++  +           E+  D       N+ +   G C S EN      +D+ 
Sbjct: 250  SVELPGDKEKSIVTDLNDVVSLTEKPFDGDDGDFGNIEVCNDGHCDSFENLSCKDSNDVV 309

Query: 1228 LVPNSRNNDLS--------LASSVPFPWERE--IDSM-EGEQLQRSNTELAERTIPEPEL 1082
             V   +  D          +++S   PW+R   +DS+ E +  ++SNT+LAER +PE EL
Sbjct: 310  SVSKKQLGDFENVEVSNNGVSNSNELPWKRTSGLDSLGEDKSRKKSNTDLAERMLPEHEL 369

Query: 1081 QRLRNVALRMKERMTVGPAGVTEAVVKHIHDKWKEVEVTKLRFEGPACLNMKRTHEILER 902
            +RLRNVALRM ER+ VG  G+T+ +V  IH+KWK  EV KL+FE P   NMKRTHEILE 
Sbjct: 370  KRLRNVALRMLERIKVGATGITQDLVDAIHEKWKLDEVVKLKFEWPLSCNMKRTHEILES 429

Query: 901  KTGGVVIWRSGRSVVLYRGMAYQLPCIQTYLQLSDANSF---HNPSMGNCSNLIAGNQ-- 737
            +TGG++IWRSG SVV+YRG  Y+  C+Q+Y + ++A      +     N +   AG +  
Sbjct: 430  RTGGLIIWRSGSSVVMYRGTTYKFQCVQSYTKQNEAGMDVLQYAEEATNSATSSAGMKDL 489

Query: 736  AETFETDSAMNQNLSKGFSSTAYID-----RLLDQLGPRFKDWSGRNPIPVDADLLPGLV 572
            A T E+         K  S    +D      LLD+LGPR+KDW GR P+PVDADLLP +V
Sbjct: 490  ARTMESIIPDAAKYLKDLSQEELMDFSELNHLLDELGPRYKDWCGREPLPVDADLLPAVV 549

Query: 571  PGYKPPFRLLPYKTRTSLRDREMTVLRRLARTMPPHFVLGRNRQHQGLATAIVKLWDKSS 392
            PGYK P RLLPY  +  L ++  T  RRLART PPHFVLGRNR+ QGLA A+VKLW++S+
Sbjct: 550  PGYKSPLRLLPYGVKPCLSNKNTTNFRRLARTTPPHFVLGRNRELQGLANAMVKLWERSA 609

Query: 391  IAKIAIKRGIPNTSNEIIAEEIKKLTGGVLLSRNKEYIVFYRGNDFVTPSIREVLVEKEK 212
            IAKIAIKRG+  T NEI+AEE+K+LTGG LLSRNKEYIVFYRGNDF+ P I E L E+ K
Sbjct: 610  IAKIAIKRGVQYTRNEIMAEELKRLTGGTLLSRNKEYIVFYRGNDFLPPVINETLKERRK 669

Query: 211  LATIKQDDEEVAR-LRXXXXXXXXXXXXXSLVAGTLAETVEAKHRWGKPLNPEEREMAKK 35
            LA + QD+E+ AR +               LVAGTL ETV A  RWG   + E+ E   +
Sbjct: 670  LAFLYQDEEDQARQMTSAFIGSSVKTTKGPLVAGTLVETVAAISRWGNQPSSEDVEEMIR 729

Query: 34   NMVLTKHASLV 2
            +  L +HASLV
Sbjct: 730  DSALARHASLV 740


>gb|EMT30138.1| Chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic [Aegilops tauschii]
          Length = 1128

 Score =  416 bits (1068), Expect = e-113
 Identities = 249/524 (47%), Positives = 320/524 (61%), Gaps = 34/524 (6%)
 Frame = -1

Query: 1684 LGMQSKLAPWAHGGKHGE------DGTRSRYKSEALVNNFEFGEDKVKPKETNFVDDKGV 1523
            LG +   APW HG +          G     + EA+ N     E   + ++  +VD+  V
Sbjct: 438  LGSRPVSAPWMHGQEGPTVVGRLLSGPVGGDEEEAVRNGVFDDELDSEDEDEEWVDNSEV 497

Query: 1522 RSNYGDPIIASLKEECPILSSDMKARSCPTVTVSFGEERFSSQHSDLDSKVAGKSSVKDD 1343
                 +P+  +L+EE   L  D  + + PT + SF           LDS +  ++S    
Sbjct: 498  LEE--EPMTVNLEEE---LYEDEDS-AAPTTSSSF----------PLDSILEDQAST--- 538

Query: 1342 ALSGGRDNCVERYKDPKASSSSNVGIGFCVSSENSDLKLVPNSRNND-LSLASSVPFPWE 1166
               GG D  + R      SS S++      S E S    + +S   D +    SV  PWE
Sbjct: 539  --GGGFDRNMRR------SSVSSIVNTLRNSMEES--ATIGSSEGEDFVQKLGSVLLPWE 588

Query: 1165 REI-DSMEGEQLQR-SNTELAERTIPEPELQRLRNVALRMKERMTVGPAGVTEAVVKHIH 992
            RE  ++ +G+   R SNT+LAERTIPEPEL+RLR+ ALRMKERM VGP GVT A+V+ IH
Sbjct: 589  REEGNAFDGDNRGRHSNTKLAERTIPEPELRRLRDAALRMKERMRVGPGGVTHAIVESIH 648

Query: 991  DKWKEVEVTKLRFEGPACLNMKRTHEILERKTGGVVIWRSGRSVVLYRGMAYQLPCIQTY 812
             KWK  EV K+RFEGP  LNMKRTHEILE +TGG VIWRSGRS+VLYRGM Y L C+Q+Y
Sbjct: 649  SKWKVDEVVKMRFEGPPSLNMKRTHEILEDRTGGTVIWRSGRSIVLYRGMNYNLRCVQSY 708

Query: 811  LQLSDANSFHNPSMGNCSNLIAGNQAETFETDSAMNQ----------------------- 701
             ++++ +S  N   G+   ++  ++    +  +  +                        
Sbjct: 709  AKIAEVDSSENA--GDAIGVVPSSEEHNLQKPTVEHDLQKPIVERNSQKSSADDVKRLKS 766

Query: 700  --NLSKGFSSTAYIDRLLDQLGPRFKDWSGRNPIPVDADLLPGLVPGYKPPFRLLPYKTR 527
              N S+  + T  ID  LDQLGPR+KDWSGR+PIPVDADLLPGLVPGYKPPFR LPY+T+
Sbjct: 767  IMNFSQEATETFDIDSFLDQLGPRYKDWSGRSPIPVDADLLPGLVPGYKPPFRQLPYRTK 826

Query: 526  TSLRDREMTVLRRLARTMPPHFVLGRNRQHQGLATAIVKLWDKSSIAKIAIKRGIPNTSN 347
             SL+D+EMT LRRLAR   PHF LGRNR+HQGLA AIVK+W+KSS  KIAIKRG+PNT N
Sbjct: 827  ISLKDKEMTALRRLARQTAPHFALGRNREHQGLAAAIVKVWEKSSTVKIAIKRGVPNTCN 886

Query: 346  EIIAEEIKKLTGGVLLSRNKEYIVFYRGNDFVTPSIREVLVEKE 215
            + +AEEIKKLTGGVL+SRNKEYI+FYRGNDFVTP  +  + + E
Sbjct: 887  DRMAEEIKKLTGGVLVSRNKEYIIFYRGNDFVTPKAKTKVAKAE 930


>ref|XP_006840356.1| hypothetical protein AMTR_s00045p00114550 [Amborella trichopoda]
            gi|548842074|gb|ERN02031.1| hypothetical protein
            AMTR_s00045p00114550 [Amborella trichopoda]
          Length = 1059

 Score =  413 bits (1061), Expect = e-112
 Identities = 217/408 (53%), Positives = 282/408 (69%), Gaps = 14/408 (3%)
 Frame = -1

Query: 1183 VPFPWEREIDSMEG--EQLQRSNTELAERTIPEPELQRLRNVALRMKERMTVGPAGVTEA 1010
            + FPW    +      ++  RS T LAE TIPEPEL RLR++AL MKER+ +G AGVT+A
Sbjct: 442  IEFPWVARAEERGNVEQRRSRSTTALAESTIPEPELLRLRSLALHMKERINIGVAGVTQA 501

Query: 1009 VVKHIHDKWKEVEVTKLRFEGPACLNMKRTHEILERKTGGVVIWRSGRSVVLYRGMAYQL 830
            +V  IHDKW+ VEV K++FEGP  +NMKRTHEILERKTGG+VI R G  VVLYRGM Y+L
Sbjct: 502  IVAAIHDKWRHVEVVKIKFEGPPAMNMKRTHEILERKTGGLVILRCGSFVVLYRGMGYEL 561

Query: 829  PCIQTYLQLSDANSFHNPSMGNCSNLIAGNQAETFETDSAMNQNLSKGFSSTAYIDR--- 659
            PC+Q+Y Q    +  H+    +   + A +     + ++ +   +S G SS    D+   
Sbjct: 562  PCVQSYRQ--HLHIIHDTLPHDM--IPATDNIGDTKVNALVRATVSSGTSSPTNYDKCES 617

Query: 658  --------LLDQLGPRFKDWSGRNPIPVDADLLPGLVPGYKPPFRLLPYKTRTSLRDREM 503
                    +L+ LGPRF+DWSG  P+PVDADLLP ++PGYKPPFR LP+  R  L++++M
Sbjct: 618  PHETDIEIILESLGPRFRDWSGCAPLPVDADLLPPVLPGYKPPFRFLPHGMRHCLKNKDM 677

Query: 502  TVLRRLARTMPPHFVLGRNRQHQGLATAIVKLWDKSSIAKIAIKRGIPNTSNEIIAEEIK 323
            T LRRLAR MPPHF LGRNR  QGLA A+V LW+ S IAKIAIKRG+ NT NE +AEE++
Sbjct: 678  TALRRLARQMPPHFALGRNRVLQGLAAAMVNLWETSVIAKIAIKRGVQNTCNERMAEELE 737

Query: 322  KLTGGVLLSRNKEYIVFYRGNDFVTPSIREVLVEKEKLATIKQDDEEVARLR-XXXXXXX 146
            KLTGG+L+SRNKEYIVFYRGNDF++PS++EVLV +EKLA    D+EE AR++        
Sbjct: 738  KLTGGILVSRNKEYIVFYRGNDFLSPSVKEVLVNREKLAKSLLDEEEKARMKAHASTLSN 797

Query: 145  XXXXXXSLVAGTLAETVEAKHRWGKPLNPEEREMAKKNMVLTKHASLV 2
                   LVAGTL ET+EAK RWG   +  ER+  K++M L++HA+L+
Sbjct: 798  TSTARGPLVAGTLEETLEAKSRWGMQPSTHERDEMKRDMTLSRHAALI 845


>ref|XP_006475470.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X5 [Citrus sinensis]
          Length = 803

 Score =  412 bits (1060), Expect = e-112
 Identities = 222/397 (55%), Positives = 276/397 (69%), Gaps = 6/397 (1%)
 Frame = -1

Query: 1174 PWEREIDSMEGEQLQRSNTELAERTIPEPELQRLRNVALRMKERMTVGPAGVTEAVVKHI 995
            PW+R  D     + +RSNTELAE+ IPE ELQRLRN++LRM ER  VG AG+T+A+V  I
Sbjct: 232  PWKRNTD-----RRRRSNTELAEKMIPEHELQRLRNISLRMLERTKVGSAGITQALVDSI 286

Query: 994  HDKWKEVEVTKLRFEGPACLNMKRTHEILERKTGGVVIWRSGRSVVLYRGMAYQLPCIQT 815
            H+KWK  EV KL+FE P  L MKRTHEILER+TGG+VIWRSG SVVL+RGMAY+LPC+Q+
Sbjct: 287  HEKWKLDEVVKLKFEEPHSLQMKRTHEILERRTGGLVIWRSGSSVVLFRGMAYKLPCVQS 346

Query: 814  YLQ---LSDANSFHNPSMGNCSNLIAGNQAETFETDSAMN-QNLSKG-FSSTAYIDRLLD 650
            + +           N  M N       +  E++  DSA N +NLSK        ++ LLD
Sbjct: 347  FTKHNHTQQTQDVTNEVMRNVGEHPPRSAMESYVPDSANNLENLSKEELMDLCELNYLLD 406

Query: 649  QLGPRFKDWSGRNPIPVDADLLPGLVPGYKPPFRLLPYKTRTSLRDREMTVLRRLARTMP 470
            +LGPRFKDW GR P+PVDADLLP +VP YKPP RLLPY  +  LRD E T  RRLAR  P
Sbjct: 407  ELGPRFKDWPGREPLPVDADLLPPVVPDYKPPLRLLPYGIKPGLRDCETTEFRRLARKTP 466

Query: 469  PHFVLGRNRQHQGLATAIVKLWDKSSIAKIAIKRGIPNTSNEIIAEEIKKLTGGVLLSRN 290
            PHF LGRNR+ QGLA A+VKLW+KS+IAKIAIKR + NT NE +AEE+KKLTGG LL RN
Sbjct: 467  PHFALGRNRELQGLAKAMVKLWEKSAIAKIAIKRDVMNTRNERMAEELKKLTGGTLLCRN 526

Query: 289  KEYIVFYRGNDFVTPSIREVLVEKEKLATIKQDDEEVAR-LRXXXXXXXXXXXXXSLVAG 113
            K+YIVFYRGNDF+ P + + + E+ KL  I+QD+EE AR +              SLVAG
Sbjct: 527  KDYIVFYRGNDFLPPVVTDAVKERSKLTDIRQDEEEQARHVASALIELKAKGFVGSLVAG 586

Query: 112  TLAETVEAKHRWGKPLNPEEREMAKKNMVLTKHASLV 2
            TLAET+ A  RWG+  + E+ E   ++  L++HASL+
Sbjct: 587  TLAETLAATSRWGRQPSYEDVEKMMRDSTLSRHASLL 623


>ref|XP_006475466.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X1 [Citrus sinensis]
            gi|568843115|ref|XP_006475467.1| PREDICTED: chloroplastic
            group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X2 [Citrus sinensis]
            gi|568843117|ref|XP_006475468.1| PREDICTED: chloroplastic
            group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X3 [Citrus sinensis]
            gi|568843119|ref|XP_006475469.1| PREDICTED: chloroplastic
            group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X4 [Citrus sinensis]
          Length = 812

 Score =  412 bits (1060), Expect = e-112
 Identities = 222/397 (55%), Positives = 276/397 (69%), Gaps = 6/397 (1%)
 Frame = -1

Query: 1174 PWEREIDSMEGEQLQRSNTELAERTIPEPELQRLRNVALRMKERMTVGPAGVTEAVVKHI 995
            PW+R  D     + +RSNTELAE+ IPE ELQRLRN++LRM ER  VG AG+T+A+V  I
Sbjct: 232  PWKRNTD-----RRRRSNTELAEKMIPEHELQRLRNISLRMLERTKVGSAGITQALVDSI 286

Query: 994  HDKWKEVEVTKLRFEGPACLNMKRTHEILERKTGGVVIWRSGRSVVLYRGMAYQLPCIQT 815
            H+KWK  EV KL+FE P  L MKRTHEILER+TGG+VIWRSG SVVL+RGMAY+LPC+Q+
Sbjct: 287  HEKWKLDEVVKLKFEEPHSLQMKRTHEILERRTGGLVIWRSGSSVVLFRGMAYKLPCVQS 346

Query: 814  YLQ---LSDANSFHNPSMGNCSNLIAGNQAETFETDSAMN-QNLSKG-FSSTAYIDRLLD 650
            + +           N  M N       +  E++  DSA N +NLSK        ++ LLD
Sbjct: 347  FTKHNHTQQTQDVTNEVMRNVGEHPPRSAMESYVPDSANNLENLSKEELMDLCELNYLLD 406

Query: 649  QLGPRFKDWSGRNPIPVDADLLPGLVPGYKPPFRLLPYKTRTSLRDREMTVLRRLARTMP 470
            +LGPRFKDW GR P+PVDADLLP +VP YKPP RLLPY  +  LRD E T  RRLAR  P
Sbjct: 407  ELGPRFKDWPGREPLPVDADLLPPVVPDYKPPLRLLPYGIKPGLRDCETTEFRRLARKTP 466

Query: 469  PHFVLGRNRQHQGLATAIVKLWDKSSIAKIAIKRGIPNTSNEIIAEEIKKLTGGVLLSRN 290
            PHF LGRNR+ QGLA A+VKLW+KS+IAKIAIKR + NT NE +AEE+KKLTGG LL RN
Sbjct: 467  PHFALGRNRELQGLAKAMVKLWEKSAIAKIAIKRDVMNTRNERMAEELKKLTGGTLLCRN 526

Query: 289  KEYIVFYRGNDFVTPSIREVLVEKEKLATIKQDDEEVAR-LRXXXXXXXXXXXXXSLVAG 113
            K+YIVFYRGNDF+ P + + + E+ KL  I+QD+EE AR +              SLVAG
Sbjct: 527  KDYIVFYRGNDFLPPVVTDAVKERSKLTDIRQDEEEQARHVASALIELKAKGFVGSLVAG 586

Query: 112  TLAETVEAKHRWGKPLNPEEREMAKKNMVLTKHASLV 2
            TLAET+ A  RWG+  + E+ E   ++  L++HASL+
Sbjct: 587  TLAETLAATSRWGRQPSYEDVEKMMRDSTLSRHASLL 623


>ref|XP_006451488.1| hypothetical protein CICLE_v10007477mg [Citrus clementina]
            gi|557554714|gb|ESR64728.1| hypothetical protein
            CICLE_v10007477mg [Citrus clementina]
          Length = 810

 Score =  412 bits (1060), Expect = e-112
 Identities = 222/397 (55%), Positives = 276/397 (69%), Gaps = 6/397 (1%)
 Frame = -1

Query: 1174 PWEREIDSMEGEQLQRSNTELAERTIPEPELQRLRNVALRMKERMTVGPAGVTEAVVKHI 995
            PW+R  D     + +RSNTELAE+ IPE ELQRLRN++LRM ER  VG AG+T+A+V  I
Sbjct: 230  PWKRNTD-----RRRRSNTELAEKMIPEHELQRLRNISLRMLERTKVGSAGITQALVDSI 284

Query: 994  HDKWKEVEVTKLRFEGPACLNMKRTHEILERKTGGVVIWRSGRSVVLYRGMAYQLPCIQT 815
            H+KWK  EV KL+FE P  L MKRTHEILER+TGG+VIWRSG SVVL+RGMAY+LPC+Q+
Sbjct: 285  HEKWKLDEVVKLKFEEPHSLQMKRTHEILERRTGGLVIWRSGSSVVLFRGMAYKLPCVQS 344

Query: 814  YLQ---LSDANSFHNPSMGNCSNLIAGNQAETFETDSAMN-QNLSKG-FSSTAYIDRLLD 650
            + +           N  M N       +  E++  DSA N +NLSK        ++ LLD
Sbjct: 345  FTKHNHTQQTQDVTNEVMRNVGEHPPRSAMESYVPDSANNLENLSKEELMDLCELNYLLD 404

Query: 649  QLGPRFKDWSGRNPIPVDADLLPGLVPGYKPPFRLLPYKTRTSLRDREMTVLRRLARTMP 470
            +LGPRFKDW GR P+PVDADLLP +VP YKPP RLLPY  +  LRD E T  RRLAR  P
Sbjct: 405  ELGPRFKDWPGREPLPVDADLLPPVVPDYKPPLRLLPYGIKPGLRDCETTEFRRLARKTP 464

Query: 469  PHFVLGRNRQHQGLATAIVKLWDKSSIAKIAIKRGIPNTSNEIIAEEIKKLTGGVLLSRN 290
            PHF LGRNR+ QGLA A+VKLW+KS+IAKIAIKR + NT NE +AEE+KKLTGG LL RN
Sbjct: 465  PHFALGRNRELQGLAKAMVKLWEKSAIAKIAIKRDVMNTRNERMAEELKKLTGGTLLCRN 524

Query: 289  KEYIVFYRGNDFVTPSIREVLVEKEKLATIKQDDEEVAR-LRXXXXXXXXXXXXXSLVAG 113
            K+YIVFYRGNDF+ P + + + E+ KL  I+QD+EE AR +              SLVAG
Sbjct: 525  KDYIVFYRGNDFLPPVVTDAVKERSKLTDIRQDEEEQARHVASALIELKAKGFVGSLVAG 584

Query: 112  TLAETVEAKHRWGKPLNPEEREMAKKNMVLTKHASLV 2
            TLAET+ A  RWG+  + E+ E   ++  L++HASL+
Sbjct: 585  TLAETLAATSRWGRQPSYEDVEKMMRDSTLSRHASLL 621


>ref|XP_006357699.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X1 [Solanum tuberosum]
            gi|565382761|ref|XP_006357700.1| PREDICTED: chloroplastic
            group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X2 [Solanum tuberosum]
          Length = 820

 Score =  411 bits (1057), Expect = e-112
 Identities = 215/398 (54%), Positives = 277/398 (69%), Gaps = 3/398 (0%)
 Frame = -1

Query: 1186 SVPFPWEREIDSMEGEQLQRSNTELAERTIPEPELQRLRNVALRMKERMTVGPAGVTEAV 1007
            SV  PWE       G++L++SN ELAE+ IPE +L+RLRN ALRM ER+ VG  GVT+ +
Sbjct: 214  SVRLPWE-------GDKLRKSNAELAEKLIPEAQLKRLRNAALRMVERIKVGSGGVTQEL 266

Query: 1006 VKHIHDKWKEVEVTKLRFEGPACLNMKRTHEILERKTGGVVIWRSGRSVVLYRGMAYQLP 827
            V  I DKWK  E+ KLRFEGP   NMKRTH+ILE +TGG+VIWRSG S+VLYRG++Y+LP
Sbjct: 267  VDSIQDKWKVDEIVKLRFEGPPSHNMKRTHDILEHRTGGLVIWRSGSSIVLYRGISYKLP 326

Query: 826  CIQTYLQLS-DANSFHNPSMGNCSNLIAGNQAETFETDSAMNQNLS-KGFSSTAYIDRLL 653
            C+Q++   + D +    P+  +C +L      E  E     + +LS +     + ++ +L
Sbjct: 327  CVQSFTSKNHDVDESEYPNNDSCQSLGVKCLNEAAERPRNGSTDLSSEEIVDLSELNMIL 386

Query: 652  DQLGPRFKDWSGRNPIPVDADLLPGLVPGYKPPFRLLPYKTRTSLRDREMTVLRRLARTM 473
            D++GPRFKDWSGR P+PVDADLLP +VPGY+PPFR LPY  + +L+++EMT LRR AR M
Sbjct: 387  DEVGPRFKDWSGREPLPVDADLLPAVVPGYRPPFRRLPYGAKLNLKNKEMTYLRRTARIM 446

Query: 472  PPHFVLGRNRQHQGLATAIVKLWDKSSIAKIAIKRGIPNTSNEIIAEEIKKLTGGVLLSR 293
            PPHF LGRNRQ QGLA A+VKLW +S+IAKIAIKRG+ NTSNE ++EE+K LTGG LLSR
Sbjct: 447  PPHFALGRNRQLQGLAAAMVKLWRRSAIAKIAIKRGVLNTSNERMSEELKVLTGGTLLSR 506

Query: 292  NKEYIVFYRGNDFVTPSIREVLVEKEKLATIKQDDEEVARLR-XXXXXXXXXXXXXSLVA 116
            NK+YIVFYRGNDF+ P + E L E E+ +   QD EE AR R               LVA
Sbjct: 507  NKDYIVFYRGNDFLPPRVTEALEEAERKSDFLQDQEEQARQRAVTSIDSDTRAPKRPLVA 566

Query: 115  GTLAETVEAKHRWGKPLNPEEREMAKKNMVLTKHASLV 2
            GTL+ET+ A  RWG   + EERE   ++  + +HASLV
Sbjct: 567  GTLSETMAATSRWGNQPSIEEREKMMRDAAVARHASLV 604

