BLASTX nr result

ID: Mentha22_contig00034907 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00034907
         (1120 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU44617.1| hypothetical protein MIMGU_mgv1a026522mg, partial...   389   e-106
ref|XP_006357840.1| PREDICTED: chloroplastic group IIA intron sp...   287   5e-75
ref|XP_002280704.1| PREDICTED: chloroplastic group IIA intron sp...   281   4e-73
ref|XP_002516757.1| conserved hypothetical protein [Ricinus comm...   276   1e-71
ref|XP_004233710.1| PREDICTED: chloroplastic group IIA intron sp...   270   8e-70
ref|XP_007217313.1| hypothetical protein PRUPE_ppa016241mg [Prun...   256   9e-66
gb|EXC20503.1| Chloroplastic group IIA intron splicing facilitat...   248   3e-63
gb|EPS58217.1| hypothetical protein M569_16596, partial [Genlise...   245   2e-62
ref|XP_007033221.1| maize chloroplast splicing factor CRS1, puta...   245   3e-62
ref|XP_007033220.1| maize chloroplast splicing factor CRS1, puta...   245   3e-62
ref|XP_007033219.1| maize chloroplast splicing factor CRS1, puta...   245   3e-62
ref|XP_007033218.1| maize chloroplast splicing factor CRS1, puta...   245   3e-62
ref|XP_007033217.1| maize chloroplast splicing factor CRS1, puta...   245   3e-62
ref|XP_007139175.1| hypothetical protein PHAVU_008G007700g [Phas...   239   1e-60
ref|XP_006430740.1| hypothetical protein CICLE_v10013368mg [Citr...   238   3e-60
ref|XP_006603058.1| PREDICTED: chloroplastic group IIA intron sp...   238   3e-60
ref|XP_006603055.1| PREDICTED: chloroplastic group IIA intron sp...   238   3e-60
ref|XP_006603054.1| PREDICTED: chloroplastic group IIA intron sp...   238   3e-60
ref|XP_006482225.1| PREDICTED: chloroplastic group IIA intron sp...   234   4e-59
ref|XP_004158502.1| PREDICTED: LOW QUALITY PROTEIN: chloroplasti...   232   2e-58

>gb|EYU44617.1| hypothetical protein MIMGU_mgv1a026522mg, partial [Mimulus guttatus]
          Length = 702

 Score =  389 bits (1000), Expect = e-106
 Identities = 213/374 (56%), Positives = 257/374 (68%), Gaps = 8/374 (2%)
 Frame = -2

Query: 1098 RIERGSEDSSAQSSKSIPHSRSKLKAPTAPWMAEPLFVKPNE-MESMKRRNNKGLELDRI 922
            +IE  + DS  +S + IPHSRS +KAPTAPWM  PL VKP+E +ES + R  K     R 
Sbjct: 4    KIEHENRDSRKESPEHIPHSRSTIKAPTAPWMNGPLLVKPSEILESRRTRTRKHFAAGRN 63

Query: 921  ------GEHPDEDLTGKVGGARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPG 760
                  G HPD DLTGKVGGARG++AMKKI+K  EKLQ+T ++E   KN E+ KFKFAPG
Sbjct: 64   DGEHTGGGHPDVDLTGKVGGARGKVAMKKIYKGIEKLQDTQNVEEPGKNLENLKFKFAPG 123

Query: 759  ALCGNGDYGGDXXXXXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPWEKEEKMVIRR 580
            AL G+     +        +K A  NL  ++FD+P  +A  E KSKK+PWE +E +VIRR
Sbjct: 124  ALWGDKGEVEEN-------TKEARWNLKIDDFDLPFGEAENEAKSKKMPWESDETVVIRR 176

Query: 579  AKKEKVVTAAESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALL 400
             +KEKVVT+AESSLD +LLERL+ EAAL++KWVKV KAGVTQ+VVDQV   WRN+ELAL+
Sbjct: 177  VQKEKVVTSAESSLDPVLLERLKEEAALIRKWVKVKKAGVTQSVVDQVSLFWRNNELALV 236

Query: 399  KFDLPLSRNMDRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGDQG 220
             FDLPL RNMDRAREI+E+KTGGLVVW NK+ LAVYRGCNY       RN          
Sbjct: 237  NFDLPLCRNMDRAREIIEMKTGGLVVWSNKEFLAVYRGCNYKSGPKQFRN---------- 286

Query: 219  NSSSNIIYQNTRTVARETSGESNPHELIHGRDGKLE-NLEMASLYEREADRLLDELGPRF 43
                  IY+NT  +A+E+           GRD + E ++ M SLYEREADRLLD LGPRF
Sbjct: 287  ------IYRNTTAIAQES---------CDGRDSEWESSIHMTSLYEREADRLLDGLGPRF 331

Query: 42   VDWWMQKPLPVDGD 1
            VDWWMQKPLPVDGD
Sbjct: 332  VDWWMQKPLPVDGD 345


>ref|XP_006357840.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Solanum tuberosum]
          Length = 802

 Score =  287 bits (735), Expect = 5e-75
 Identities = 163/353 (46%), Positives = 221/353 (62%), Gaps = 6/353 (1%)
 Frame = -2

Query: 1041 SRSKLKAPTAPWMAEPLFVKPNE-MESMKRRNNKGLELDRIGEHPDEDLTGKVGGARGRL 865
            S S +K PTAPWM  PL ++PN+ ++  K R  K     +  ++P++ L+GKV G RG+ 
Sbjct: 70   SSSGIKGPTAPWMRGPLLLEPNQFLDLSKSRKKKDANFAKT-QNPNDALSGKVSGGRGKK 128

Query: 864  AMKKIFKSFEKLQETHDLEAFSKNHESR-KFKFAPGALCGNGDYGGDXXXXXXECSKAAE 688
            AMK I++  +KLQET   E      +++ +F+F PG+L   GD   +         +   
Sbjct: 129  AMKMIYQGIDKLQETQIGEGTQVETDAKVEFQFPPGSLSEWGDVSYEIEEKNPYGEEDNV 188

Query: 687  ENLNGNEFDIPLFDA---GKEVKSKKLPWEKEEKMVIRRAKKEKVVTAAESSLDEMLLER 517
            E+L G EF +   +    G      K+PWE E ++V RR KKEKVV  AES+LD MLLER
Sbjct: 189  ESLEGVEFGVLSREGEGRGSRKIGVKMPWESEVRIVYRRMKKEKVVMTAESNLDAMLLER 248

Query: 516  LRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAREIVELKT 337
            LR EAA ++KWVKV KAGVT+ VVDQ+HF W+N+ELA+LKFDLPL RNMDRAREIVE+KT
Sbjct: 249  LRGEAARIQKWVKVKKAGVTRTVVDQIHFIWKNNELAMLKFDLPLCRNMDRAREIVEMKT 308

Query: 336  GGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGDQGNSSSNIIYQNTRTVARETSGE 157
            GG VVW  ++ L VYRGC+Y  +    + + +       NSS     + T   +   S  
Sbjct: 309  GGFVVWMKQNALVVYRGCSYTLQ---QKELQHDFLCSHQNSSFTENIKQTSIFSPLNSSG 365

Query: 156  SNPHELIHGRDGKLENLEM-ASLYEREADRLLDELGPRFVDWWMQKPLPVDGD 1
            S+  E+I   + + ++L M  SLY REA+RLLD+LGPR+VDWW  KPLPV+ D
Sbjct: 366  SSEDEMISVGNSEEDSLAMNESLYVREANRLLDDLGPRYVDWWWPKPLPVNAD 418


>ref|XP_002280704.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Vitis vinifera]
          Length = 1184

 Score =  281 bits (719), Expect = 4e-73
 Identities = 166/374 (44%), Positives = 211/374 (56%), Gaps = 20/374 (5%)
 Frame = -2

Query: 1062 SSKSIPHSRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVG 883
            SS+ +  + + +K PTAPWM  PL ++PNE+  + +   K +      E PD  LT KV 
Sbjct: 62   SSQPVSGTDAAIKMPTAPWMKGPLLLQPNEVLDLSKARPKKVAGSAGAEKPDRSLTEKVS 121

Query: 882  GARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYGGDXXXXXXEC 703
            G RG  AMKKI +S  KLQETH                                      
Sbjct: 122  GGRGAKAMKKIMQSIVKLQETHT------------------------------------- 144

Query: 702  SKAAEENLNGNEFDIPLFDAGKEVKSK---KLPWEKEEKMVIRRAKKEKVVTAAESSLDE 532
            S   +EN    EF + L   G +  S+   K+PW K EK+V RR KKEKVVTAAE +LD 
Sbjct: 145  SDETQENTEEFEFGVSLEGIGGDENSRIGGKMPWLKTEKVVFRRTKKEKVVTAAELTLDP 204

Query: 531  MLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAREI 352
            MLLERLR EA  M+KWVKV KAGVT++VVDQ+H  W++DELA++KFD+PL RNMDRAREI
Sbjct: 205  MLLERLRGEAVKMRKWVKVKKAGVTESVVDQIHMVWKSDELAMVKFDMPLCRNMDRAREI 264

Query: 351  VELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNM--NYSSAGDQGNSSSN-IIYQNTRT 181
            +E+KT GLV+W  KD L VYRG NY     + + M     +  D  NS  N   +++  T
Sbjct: 265  LEIKTRGLVIWSKKDTLVVYRGSNYQSTSKHFQKMRPGLVAGADASNSKLNQSNFEDDLT 324

Query: 180  VARETSGESNPHELIHGRDGKLENL-------EM-------ASLYEREADRLLDELGPRF 43
            ++     ES   E +  +DG+ ++        EM        SLYEREADRLLD LGPRF
Sbjct: 325  ISEIKFHESTTGEKMGRKDGEEDSSPTGIFMEEMVDSQPVNGSLYEREADRLLDGLGPRF 384

Query: 42   VDWWMQKPLPVDGD 1
            +DWW  KPLPVD D
Sbjct: 385  IDWWRPKPLPVDAD 398


>ref|XP_002516757.1| conserved hypothetical protein [Ricinus communis]
            gi|223544130|gb|EEF45655.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 742

 Score =  276 bits (705), Expect = 1e-71
 Identities = 163/363 (44%), Positives = 208/363 (57%), Gaps = 12/363 (3%)
 Frame = -2

Query: 1053 SIPHSRSK--LKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVGG 880
            S+P+S+S   +K PTAPWM  PL ++P+E+ ++ +  NK    +   E  D+ LTGK  G
Sbjct: 48   SVPNSQSNAPIKVPTAPWMKGPLLLQPHELINLSKPRNKNSSNNANIEKSDKVLTGKESG 107

Query: 879  ARGRLAMKKIFKSFEKLQETHDLE-------AFSKNH-ESRKFKFAP--GALCGNGDYGG 730
             RG+ AM+KI KS E+LQE   LE       A+ K   +S  F+     G +  +GD+G 
Sbjct: 108  VRGKKAMEKIVKSIEQLQENQALEKTQCDSQAYEKTQLDSEAFEIGEKLGLIREHGDFG- 166

Query: 729  DXXXXXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPWEKEEKMVIRRAKKEKVVTAA 550
                                            V  K  PWE+EEK V  R KKEK VT A
Sbjct: 167  --------------------------------VNKKLKPWEREEKFVYWRIKKEKAVTKA 194

Query: 549  ESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNM 370
            E  L++ LLE LR EA+ M+KWVKVMKAGVTQ+VVDQ+ + WRN+ELA++KFDLPL RNM
Sbjct: 195  ELILEKELLEILRTEASKMRKWVKVMKAGVTQSVVDQIRYAWRNNELAMVKFDLPLCRNM 254

Query: 369  DRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGDQGNSSSNIIYQN 190
            DRAREIVELKTGGLVVW  KD L +YRGCNY                     SS++   +
Sbjct: 255  DRAREIVELKTGGLVVWTRKDSLVIYRGCNY-----------------HLTKSSHVSTMD 297

Query: 189  TRTVARETSGESNPHELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKPLPV 10
             +  +++   E  P  +  G D     +   SL+ERE DRLLD LGPRFVDWWM+KPLPV
Sbjct: 298  EKIGSKDGEEEYIPTSIFIGDDANTPTIN-GSLFERETDRLLDGLGPRFVDWWMRKPLPV 356

Query: 9    DGD 1
            D D
Sbjct: 357  DAD 359


>ref|XP_004233710.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Solanum lycopersicum]
          Length = 766

 Score =  270 bits (690), Expect = 8e-70
 Identities = 161/377 (42%), Positives = 214/377 (56%), Gaps = 5/377 (1%)
 Frame = -2

Query: 1116 RRVEDRRIERGSEDSSAQSSKSIPHSRSKLKAPTAPWMAEPLFVKPNE-MESMKRRNNKG 940
            ++VE   +E  ++D  + SS         +K PTAPWM  PL ++PN+ ++  K R  K 
Sbjct: 53   KKVEQCNLEFENQDYGSSSSG--------IKGPTAPWMRGPLLLEPNQVLDLSKSRKKKD 104

Query: 939  LELDRIGEHPDEDLTGKVGGARGRLAMKKIFKSFEKLQETHDLEAFSKNHESR-KFKFAP 763
                +  ++P++ L+GKV G RG+ AMK I++  +KLQET   E      + + +F+F P
Sbjct: 105  TNFAKT-QNPNDALSGKVSGGRGKKAMKMIYQGIDKLQETQIGECTQVETDVKVEFQFPP 163

Query: 762  GALCGNGDYGGDXXXXXXECSKAAEENLNGNEFDIPLFDA---GKEVKSKKLPWEKEEKM 592
            G+L G GD   +         +   E+L G EF +   +    G      ++PWE EE++
Sbjct: 164  GSLSGWGDVSYEIEEKNPYGEEDNVESLEGVEFGVLSREGEGRGSRKSGARMPWESEERI 223

Query: 591  VIRRAKKEKVVTAAESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDE 412
            V RR KKEKVV  AES+LD MLLERLR EAA ++KWVKV KAGVT+ VVDQ+ F W+N+E
Sbjct: 224  VYRRMKKEKVVRTAESNLDAMLLERLRGEAARIQKWVKVKKAGVTRTVVDQIQFIWKNNE 283

Query: 411  LALLKFDLPLSRNMDRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSA 232
            LA+LKFDLPL RNMDRAR+IVE+KTGG VVW  ++ L VYRG              Y   
Sbjct: 284  LAMLKFDLPLCRNMDRARDIVEMKTGGFVVWMKQNALVVYRG--------------YEMI 329

Query: 231  GDQGNSSSNIIYQNTRTVARETSGESNPHELIHGRDGKLENLEMASLYEREADRLLDELG 52
               GNS  + +  N                               SLYEREA+RLLD+LG
Sbjct: 330  -SVGNSEEDSLVMN------------------------------ESLYEREANRLLDDLG 358

Query: 51   PRFVDWWMQKPLPVDGD 1
            PR+VDWW  KPLPVD D
Sbjct: 359  PRYVDWWWPKPLPVDAD 375


>ref|XP_007217313.1| hypothetical protein PRUPE_ppa016241mg [Prunus persica]
            gi|462413463|gb|EMJ18512.1| hypothetical protein
            PRUPE_ppa016241mg [Prunus persica]
          Length = 809

 Score =  256 bits (655), Expect = 9e-66
 Identities = 159/365 (43%), Positives = 212/365 (58%), Gaps = 22/365 (6%)
 Frame = -2

Query: 1029 LKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVGGARGRLAMKKI 850
            +KAPTAPWM  PL ++P+E+    +  NK    +   E PD  L GK+ G RG  A+K+I
Sbjct: 85   IKAPTAPWMKGPLLLQPHEVIDFSKPRNKKTHNNAKAEKPDTVLAGKLVGIRGDKAIKQI 144

Query: 849  FKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYG-GDXXXXXXECSKAAEENLNG 673
             +S E+L          K  E++K         G G++   D      +  K  E + + 
Sbjct: 145  VQSIERLGPNQ------KTDETQK---------GFGEFRIWDSLEGLGQNEKWDETHKDF 189

Query: 672  NEFDIP--LFDAGKEVKSK---KLPWEKEEKMVIRRAKKEKVVTAAESSLDEMLLERLRN 508
             EF I   L   GK   S+   K+PWE++E++V +R KK++V +AAE SL++ LLERLR 
Sbjct: 190  VEFGIGGCLEGLGKAADSRFGGKMPWERDERIVFQRIKKKRVASAAELSLEKELLERLRA 249

Query: 507  EAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAREIVELKTGGL 328
            EAA M+KWVKV KAGVTQA+VD + F W+ +ELA++KFD+PL RNM RA+EIVE KTGG+
Sbjct: 250  EAAKMRKWVKVKKAGVTQAIVDDIKFIWKTNELAMVKFDVPLCRNMHRAQEIVETKTGGM 309

Query: 327  VVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGDQGNSSSNIIYQN--TRTVARETSGES 154
            VVWG KD L +YRGCNY         M   SA  Q   SS+ +  +    +  +  S ES
Sbjct: 310  VVWGKKDTLVIYRGCNYQSSSKFFPKMRPCSADRQETLSSDHMQPDLEENSSYQYKSFES 369

Query: 153  NPHELIHGRD--------GKLENLEMA------SLYEREADRLLDELGPRFVDWWMQKPL 16
               E +  +D        G  +   M+      SLYE+EADRLLD LGPRF+DWWM KPL
Sbjct: 370  PVDEKMSRKDAEEDCIQSGTFQETSMSCQPTSRSLYEKEADRLLDGLGPRFIDWWMHKPL 429

Query: 15   PVDGD 1
            PVD D
Sbjct: 430  PVDAD 434


>gb|EXC20503.1| Chloroplastic group IIA intron splicing facilitator CRS1 [Morus
            notabilis]
          Length = 828

 Score =  248 bits (634), Expect = 3e-63
 Identities = 152/377 (40%), Positives = 204/377 (54%), Gaps = 23/377 (6%)
 Frame = -2

Query: 1062 SSKSIPHSRSKL---KAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTG 892
            S+K  P S+  L   K PT PWM  PL ++P+E+  + +  N     +R  E     LT 
Sbjct: 48   STKENPDSKPPLEPIKMPTPPWMKGPLVLQPHEVTDLSKPENDNKFSNRKAEKSVNGLTD 107

Query: 891  KVGGARGRLAMKKIFKSFEKL--QETHDLEAFSKNHESRKFKFAPGALCGNGDYGGDXXX 718
            K+ G RG+  +KKI +  E+L  +   D E   K+   +          G GD       
Sbjct: 108  KLVGRRGKNVIKKIARRIEELGRKSKVDSEETQKDFVGKN---------GIGD------- 151

Query: 717  XXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPWEKEEKMVIRRAKKEKVVTAAESSL 538
                C +   E+ +G E               ++PWEK+E  V RR KKEK+V++AE  L
Sbjct: 152  ----CLEGLGESRSGGE---------------RMPWEKDEGFVFRRMKKEKIVSSAELRL 192

Query: 537  DEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAR 358
            +  LLERLR+EA  M+KWVKV KAGVT+ VV+ V F W+++ELA++KFD+PL RNMDRA+
Sbjct: 193  ERELLERLRSEARKMRKWVKVKKAGVTKEVVEDVKFVWKSNELAMVKFDVPLCRNMDRAQ 252

Query: 357  EIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGDQGNSSSNII------- 199
            EI+E+KTGGLVVW  KD   +YRGCNY              +G Q    SN++       
Sbjct: 253  EILEMKTGGLVVWRRKDAQVIYRGCNYQPTSKTFPRTYAGFSGHQETPFSNLVQLDSRKG 312

Query: 198  --------YQNT---RTVARETSGESNPHELIHGRDGKLENLEMASLYEREADRLLDELG 52
                    Y+NT   +   + T GE+ P  +I   D   +    +SLY READRLLD LG
Sbjct: 313  NSVSEVKSYENTIERKISKKNTEGETIPTAIILKNDANFQ--PSSSLYVREADRLLDGLG 370

Query: 51   PRFVDWWMQKPLPVDGD 1
            PRF+DWWM KPLPVD D
Sbjct: 371  PRFIDWWMNKPLPVDAD 387


>gb|EPS58217.1| hypothetical protein M569_16596, partial [Genlisea aurea]
          Length = 668

 Score =  245 bits (626), Expect = 2e-62
 Identities = 152/355 (42%), Positives = 207/355 (58%), Gaps = 8/355 (2%)
 Frame = -2

Query: 1041 SRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVGGARGRLA 862
            S   + APTAPWM +PLFV P+++  +++   K    ++  +  D+DL+ KVG  R +LA
Sbjct: 1    SSESVSAPTAPWMRKPLFVNPSQLLDLRKSPIKKNSFNK--QRLDKDLSEKVGNGRNKLA 58

Query: 861  MKKIFKSFEKLQETHDLEAFSKNHESRK---FKFAPGALCGNGDYGGDXXXXXXECSKAA 691
            M++IF+  +KLQE+      +    S K   FKF PG L GN     +       C + +
Sbjct: 59   MRQIFRGIKKLQESRPSSEAAATEGSPKNFEFKFRPGELSGNPQDSKNDG-----CERNS 113

Query: 690  EENLNGNEFDIPLFDA--GKEVKSKKLPWEKEEKMVIRRAKKEKVVTAAE-SSLDEMLLE 520
            E     + F IPL +A  G+EV+ K +PW++E   V R A    ++ AA+ +++DE+LLE
Sbjct: 114  ETT---DGFCIPLREAAEGEEVRLKAMPWQREA--VGRMATNRPLMKAAKLNAIDELLLE 168

Query: 519  RLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRND--ELALLKFDLPLSRNMDRAREIVE 346
            RL+NEAA M+KW+KV K GVT  VVDQVH TWR+   +LALLKFD+PL+R M RAREIVE
Sbjct: 169  RLQNEAAKMRKWIKVKKLGVTPTVVDQVHSTWRSSRSQLALLKFDVPLNRCMSRAREIVE 228

Query: 345  LKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGDQGNSSSNIIYQNTRTVARET 166
            +KTGG+ +W +KDL+AVYR                      G+ SSN         A+++
Sbjct: 229  MKTGGIAIWKSKDLIAVYR----------------------GSESSN---------AQQS 257

Query: 165  SGESNPHELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKPLPVDGD 1
            S                     +SLYERE DRLLDELGPRFVDWW+ KPLPVD D
Sbjct: 258  SA------------------SFSSLYERETDRLLDELGPRFVDWWLHKPLPVDAD 294


>ref|XP_007033221.1| maize chloroplast splicing factor CRS1, putative isoform 5 [Theobroma
            cacao] gi|508712250|gb|EOY04147.1| maize chloroplast
            splicing factor CRS1, putative isoform 5 [Theobroma
            cacao]
          Length = 788

 Score =  245 bits (625), Expect = 3e-62
 Identities = 149/383 (38%), Positives = 212/383 (55%), Gaps = 19/383 (4%)
 Frame = -2

Query: 1092 ERGSEDSSAQSSKSIPHSRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEH 913
            E  S +++++ S S   +   +K PTAPWM  PL ++P+E+ +  +  +K     +  + 
Sbjct: 67   ENRSLNNNSKFSVSKDPNNGPIKMPTAPWMKGPLLLQPHEVLNPSKSTSKKSSNSK-AKA 125

Query: 912  PDEDLTGKVGGARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYG 733
            PD+ L GK  G RG+  MKKI ++ E LQ    LE         + +F  G      ++G
Sbjct: 126  PDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLE---DTQIGIREEFEVGNWLE--EFG 180

Query: 732  GDXXXXXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPW-EKEEKMVIRRAKKEKVVT 556
             D                     ++  FD        K+PW  +EEK+V RR KKEK++T
Sbjct: 181  SDG--------------------EVKRFDG-------KMPWLREEEKVVFRRMKKEKLLT 213

Query: 555  AAESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSR 376
             AE SLD+ LLERLR +A  M+KW+KVMK GVT+AVVD++   WR +EL ++KF +PL R
Sbjct: 214  QAEISLDKDLLERLRRKAMRMRKWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCR 273

Query: 375  NMDRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGD----QGNSSS 208
            NMDRAREI+E+KT GLVVWG KD L VYRGC++G     + +M Y    D      ++ S
Sbjct: 274  NMDRAREIIEMKTRGLVVWGKKDALVVYRGCSHG-LTSKISSMKYPRCADGQEISSSTFS 332

Query: 207  NIIYQNTRTVARETSGESNPHELIHGRDGKLENLE--------------MASLYEREADR 70
            ++   N   ++ E    S     ++  D + E++               + SLYERE DR
Sbjct: 333  HLTSSNNINMSLEKFNGSTLQSGLYREDREKESMPINIFMKEDENNQPVIGSLYERETDR 392

Query: 69   LLDELGPRFVDWWMQKPLPVDGD 1
            LLD LGPRF+DWWM+KPLP+D D
Sbjct: 393  LLDGLGPRFIDWWMRKPLPIDAD 415


>ref|XP_007033220.1| maize chloroplast splicing factor CRS1, putative isoform 4 [Theobroma
            cacao] gi|508712249|gb|EOY04146.1| maize chloroplast
            splicing factor CRS1, putative isoform 4 [Theobroma
            cacao]
          Length = 767

 Score =  245 bits (625), Expect = 3e-62
 Identities = 149/383 (38%), Positives = 212/383 (55%), Gaps = 19/383 (4%)
 Frame = -2

Query: 1092 ERGSEDSSAQSSKSIPHSRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEH 913
            E  S +++++ S S   +   +K PTAPWM  PL ++P+E+ +  +  +K     +  + 
Sbjct: 41   ENRSLNNNSKFSVSKDPNNGPIKMPTAPWMKGPLLLQPHEVLNPSKSTSKKSSNSK-AKA 99

Query: 912  PDEDLTGKVGGARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYG 733
            PD+ L GK  G RG+  MKKI ++ E LQ    LE         + +F  G      ++G
Sbjct: 100  PDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLE---DTQIGIREEFEVGNWLE--EFG 154

Query: 732  GDXXXXXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPW-EKEEKMVIRRAKKEKVVT 556
             D                     ++  FD        K+PW  +EEK+V RR KKEK++T
Sbjct: 155  SDG--------------------EVKRFDG-------KMPWLREEEKVVFRRMKKEKLLT 187

Query: 555  AAESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSR 376
             AE SLD+ LLERLR +A  M+KW+KVMK GVT+AVVD++   WR +EL ++KF +PL R
Sbjct: 188  QAEISLDKDLLERLRRKAMRMRKWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCR 247

Query: 375  NMDRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGD----QGNSSS 208
            NMDRAREI+E+KT GLVVWG KD L VYRGC++G     + +M Y    D      ++ S
Sbjct: 248  NMDRAREIIEMKTRGLVVWGKKDALVVYRGCSHG-LTSKISSMKYPRCADGQEISSSTFS 306

Query: 207  NIIYQNTRTVARETSGESNPHELIHGRDGKLENLE--------------MASLYEREADR 70
            ++   N   ++ E    S     ++  D + E++               + SLYERE DR
Sbjct: 307  HLTSSNNINMSLEKFNGSTLQSGLYREDREKESMPINIFMKEDENNQPVIGSLYERETDR 366

Query: 69   LLDELGPRFVDWWMQKPLPVDGD 1
            LLD LGPRF+DWWM+KPLP+D D
Sbjct: 367  LLDGLGPRFIDWWMRKPLPIDAD 389


>ref|XP_007033219.1| maize chloroplast splicing factor CRS1, putative isoform 3 [Theobroma
            cacao] gi|508712248|gb|EOY04145.1| maize chloroplast
            splicing factor CRS1, putative isoform 3 [Theobroma
            cacao]
          Length = 788

 Score =  245 bits (625), Expect = 3e-62
 Identities = 149/383 (38%), Positives = 212/383 (55%), Gaps = 19/383 (4%)
 Frame = -2

Query: 1092 ERGSEDSSAQSSKSIPHSRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEH 913
            E  S +++++ S S   +   +K PTAPWM  PL ++P+E+ +  +  +K     +  + 
Sbjct: 41   ENRSLNNNSKFSVSKDPNNGPIKMPTAPWMKGPLLLQPHEVLNPSKSTSKKSSNSK-AKA 99

Query: 912  PDEDLTGKVGGARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYG 733
            PD+ L GK  G RG+  MKKI ++ E LQ    LE         + +F  G      ++G
Sbjct: 100  PDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLE---DTQIGIREEFEVGNWLE--EFG 154

Query: 732  GDXXXXXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPW-EKEEKMVIRRAKKEKVVT 556
             D                     ++  FD        K+PW  +EEK+V RR KKEK++T
Sbjct: 155  SDG--------------------EVKRFDG-------KMPWLREEEKVVFRRMKKEKLLT 187

Query: 555  AAESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSR 376
             AE SLD+ LLERLR +A  M+KW+KVMK GVT+AVVD++   WR +EL ++KF +PL R
Sbjct: 188  QAEISLDKDLLERLRRKAMRMRKWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCR 247

Query: 375  NMDRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGD----QGNSSS 208
            NMDRAREI+E+KT GLVVWG KD L VYRGC++G     + +M Y    D      ++ S
Sbjct: 248  NMDRAREIIEMKTRGLVVWGKKDALVVYRGCSHG-LTSKISSMKYPRCADGQEISSSTFS 306

Query: 207  NIIYQNTRTVARETSGESNPHELIHGRDGKLENLE--------------MASLYEREADR 70
            ++   N   ++ E    S     ++  D + E++               + SLYERE DR
Sbjct: 307  HLTSSNNINMSLEKFNGSTLQSGLYREDREKESMPINIFMKEDENNQPVIGSLYERETDR 366

Query: 69   LLDELGPRFVDWWMQKPLPVDGD 1
            LLD LGPRF+DWWM+KPLP+D D
Sbjct: 367  LLDGLGPRFIDWWMRKPLPIDAD 389


>ref|XP_007033218.1| maize chloroplast splicing factor CRS1, putative isoform 2 [Theobroma
            cacao] gi|508712247|gb|EOY04144.1| maize chloroplast
            splicing factor CRS1, putative isoform 2 [Theobroma
            cacao]
          Length = 804

 Score =  245 bits (625), Expect = 3e-62
 Identities = 149/383 (38%), Positives = 212/383 (55%), Gaps = 19/383 (4%)
 Frame = -2

Query: 1092 ERGSEDSSAQSSKSIPHSRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEH 913
            E  S +++++ S S   +   +K PTAPWM  PL ++P+E+ +  +  +K     +  + 
Sbjct: 67   ENRSLNNNSKFSVSKDPNNGPIKMPTAPWMKGPLLLQPHEVLNPSKSTSKKSSNSK-AKA 125

Query: 912  PDEDLTGKVGGARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYG 733
            PD+ L GK  G RG+  MKKI ++ E LQ    LE         + +F  G      ++G
Sbjct: 126  PDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLE---DTQIGIREEFEVGNWLE--EFG 180

Query: 732  GDXXXXXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPW-EKEEKMVIRRAKKEKVVT 556
             D                     ++  FD        K+PW  +EEK+V RR KKEK++T
Sbjct: 181  SDG--------------------EVKRFDG-------KMPWLREEEKVVFRRMKKEKLLT 213

Query: 555  AAESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSR 376
             AE SLD+ LLERLR +A  M+KW+KVMK GVT+AVVD++   WR +EL ++KF +PL R
Sbjct: 214  QAEISLDKDLLERLRRKAMRMRKWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCR 273

Query: 375  NMDRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGD----QGNSSS 208
            NMDRAREI+E+KT GLVVWG KD L VYRGC++G     + +M Y    D      ++ S
Sbjct: 274  NMDRAREIIEMKTRGLVVWGKKDALVVYRGCSHG-LTSKISSMKYPRCADGQEISSSTFS 332

Query: 207  NIIYQNTRTVARETSGESNPHELIHGRDGKLENLE--------------MASLYEREADR 70
            ++   N   ++ E    S     ++  D + E++               + SLYERE DR
Sbjct: 333  HLTSSNNINMSLEKFNGSTLQSGLYREDREKESMPINIFMKEDENNQPVIGSLYERETDR 392

Query: 69   LLDELGPRFVDWWMQKPLPVDGD 1
            LLD LGPRF+DWWM+KPLP+D D
Sbjct: 393  LLDGLGPRFIDWWMRKPLPIDAD 415


>ref|XP_007033217.1| maize chloroplast splicing factor CRS1, putative isoform 1 [Theobroma
            cacao] gi|508712246|gb|EOY04143.1| maize chloroplast
            splicing factor CRS1, putative isoform 1 [Theobroma
            cacao]
          Length = 818

 Score =  245 bits (625), Expect = 3e-62
 Identities = 149/383 (38%), Positives = 212/383 (55%), Gaps = 19/383 (4%)
 Frame = -2

Query: 1092 ERGSEDSSAQSSKSIPHSRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEH 913
            E  S +++++ S S   +   +K PTAPWM  PL ++P+E+ +  +  +K     +  + 
Sbjct: 67   ENRSLNNNSKFSVSKDPNNGPIKMPTAPWMKGPLLLQPHEVLNPSKSTSKKSSNSK-AKA 125

Query: 912  PDEDLTGKVGGARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYG 733
            PD+ L GK  G RG+  MKKI ++ E LQ    LE         + +F  G      ++G
Sbjct: 126  PDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLE---DTQIGIREEFEVGNWLE--EFG 180

Query: 732  GDXXXXXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPW-EKEEKMVIRRAKKEKVVT 556
             D                     ++  FD        K+PW  +EEK+V RR KKEK++T
Sbjct: 181  SDG--------------------EVKRFDG-------KMPWLREEEKVVFRRMKKEKLLT 213

Query: 555  AAESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSR 376
             AE SLD+ LLERLR +A  M+KW+KVMK GVT+AVVD++   WR +EL ++KF +PL R
Sbjct: 214  QAEISLDKDLLERLRRKAMRMRKWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCR 273

Query: 375  NMDRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGD----QGNSSS 208
            NMDRAREI+E+KT GLVVWG KD L VYRGC++G     + +M Y    D      ++ S
Sbjct: 274  NMDRAREIIEMKTRGLVVWGKKDALVVYRGCSHG-LTSKISSMKYPRCADGQEISSSTFS 332

Query: 207  NIIYQNTRTVARETSGESNPHELIHGRDGKLENLE--------------MASLYEREADR 70
            ++   N   ++ E    S     ++  D + E++               + SLYERE DR
Sbjct: 333  HLTSSNNINMSLEKFNGSTLQSGLYREDREKESMPINIFMKEDENNQPVIGSLYERETDR 392

Query: 69   LLDELGPRFVDWWMQKPLPVDGD 1
            LLD LGPRF+DWWM+KPLP+D D
Sbjct: 393  LLDGLGPRFIDWWMRKPLPIDAD 415


>ref|XP_007139175.1| hypothetical protein PHAVU_008G007700g [Phaseolus vulgaris]
            gi|561012308|gb|ESW11169.1| hypothetical protein
            PHAVU_008G007700g [Phaseolus vulgaris]
          Length = 744

 Score =  239 bits (611), Expect = 1e-60
 Identities = 153/371 (41%), Positives = 200/371 (53%), Gaps = 17/371 (4%)
 Frame = -2

Query: 1062 SSKSIPHSRSK-----LKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDL 898
            SS  +P+S +      +K PT PWM  PL ++PNE+  +    +K  +L+R  E  D+DL
Sbjct: 24   SSSMLPNSNNTPSQLPIKGPTPPWMKGPLLLQPNELLDLSNPKSKKFKLER-QELSDKDL 82

Query: 897  TGKVGGARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYGGDXXX 718
             GK   ARG+  MKKI +  EKL  TH+               + GAL G+ +       
Sbjct: 83   MGKE--ARGKKTMKKIVEKVEKLHGTHN---------------SAGALIGSPNV------ 119

Query: 717  XXXECSKAAEENLNGNEFDIPLFDAGKEVKSKK--LPWEKEEKMVIRRAKKEKVVTAAES 544
                      EN+ G    +      +EV+  K  +PWE + K V  + K+++ VTAAE 
Sbjct: 120  ----------ENIGGV---LDSLKENEEVRRTKGRMPWENDWKFVYEKIKRKRTVTAAEL 166

Query: 543  SLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDR 364
            +LD++L  RLRNEAA M+ W+KV KAGVTQ VVDQ+ +TWR +ELA++KFD+PL RNM R
Sbjct: 167  TLDKVLFRRLRNEAATMRTWIKVKKAGVTQDVVDQIKWTWRRNELAMVKFDIPLCRNMSR 226

Query: 363  AREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGDQGNSSSNIIYQNTR 184
            AREIVE KTGGLVV   KD L VY G N+      L    Y S     +  S      T 
Sbjct: 227  AREIVETKTGGLVVLSKKDFLVVYHGGNH-----QLTTTGYPSLRTNHSEMSGAELATTG 281

Query: 183  TVARETSGESNPHELIHGRDGK------LENLEM----ASLYEREADRLLDELGPRFVDW 34
             +    S  S    L    + K       +N+       SLYERE DRLLD+LGPRF+DW
Sbjct: 282  DICSVDSNHSLSEMLNFIAEDKDSIATSEQNMNFQTANGSLYERETDRLLDDLGPRFIDW 341

Query: 33   WMQKPLPVDGD 1
            WM KPLPVD D
Sbjct: 342  WMAKPLPVDAD 352


>ref|XP_006430740.1| hypothetical protein CICLE_v10013368mg [Citrus clementina]
            gi|557532797|gb|ESR43980.1| hypothetical protein
            CICLE_v10013368mg [Citrus clementina]
          Length = 770

 Score =  238 bits (608), Expect = 3e-60
 Identities = 153/360 (42%), Positives = 204/360 (56%), Gaps = 18/360 (5%)
 Frame = -2

Query: 1026 KAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVGGARGRLAMKKIF 847
            K PTAPWM  P+ ++P+E+    +   K     +  +  D+ LT K  G RG+ AMKKI 
Sbjct: 60   KMPTAPWMRSPIVLQPDEIIKPSKPKTK-----KSFKKTDKGLTAKESGVRGKQAMKKII 114

Query: 846  KSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYGGDXXXXXXECSKAAEENLNGNE 667
            ++ EKLQ+   L+   K  +  KF+F  G    NG               + EE+L G  
Sbjct: 115  ENIEKLQKDQILDETQKK-DMEKFEFR-GCFEENG---------------SDEEDLRGGF 157

Query: 666  FDIPLFDAGKEVKSKKLPWEKEEKMVIRRAKKEKVVTAAESSLDEMLLERLRNEAALMKK 487
                           K+PW +EE+ V RR KKE++VT AE+ LD  L+ERL++EA  M+K
Sbjct: 158  -------------GGKVPWLREERFVFRRMKKERMVTKAETMLDGELIERLKDEARKMRK 204

Query: 486  WVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAREIVELKTGGLVVWGNKD 307
            WVKV KAGVT++VV ++   WR +ELA++KFD+PL RNMDRAREI+ELKTGGLV+W  KD
Sbjct: 205  WVKVKKAGVTESVVFEIRLAWRRNELAMVKFDVPLCRNMDRAREILELKTGGLVIWTKKD 264

Query: 306  LLAVYRGCNYGRRLGNLRNMNYSSAGDQG---NSSSNI----------IYQNTRTVARET 166
               VYRG      +     M   SA DQ    + S+++          I  NT T+ +  
Sbjct: 265  AHVVYRGDGSKSSV----KMCPRSADDQEAPLSKSTHLHLEKKVNVSWIKSNTATLDQNR 320

Query: 165  S---GESN--PHELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKPLPVDGD 1
            S   GE N  P  +   ++ +++     SLYERE DRLLD LGPRFVDWWM KPLPVDGD
Sbjct: 321  SLKDGEENSLPTSIFMDKNLRIDK----SLYEREGDRLLDGLGPRFVDWWMWKPLPVDGD 376


>ref|XP_006603058.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X5 [Glycine max]
          Length = 744

 Score =  238 bits (607), Expect = 3e-60
 Identities = 152/366 (41%), Positives = 209/366 (57%), Gaps = 19/366 (5%)
 Frame = -2

Query: 1041 SRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVGGARGRLA 862
            S+  +K+PT PWM  PL ++P+E+  +    +K  + ++  E  D+ L GK    RG+ A
Sbjct: 40   SQVPIKSPTPPWMKVPLLLQPHELVDLSNPKSKKFKPEK-HELSDKALMGKE--VRGKRA 96

Query: 861  MKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYGGDXXXXXXECSKAAEEN 682
            MKKI    EKL +T +      ++E+R                               ++
Sbjct: 97   MKKIVDRVEKLHKTQN------SNETRV------------------------------DS 120

Query: 681  LNGNEFD--IPLFDAGKEVKSK-KLPWEKEEKMVIRRAKKEKVVTAAESSLDEMLLERLR 511
            LN   F   + +    +EV+SK ++PWEK+EK    + K+EK VTAAE +LD+ LL RLR
Sbjct: 121  LNVENFGGYLEILKENEEVRSKGRMPWEKDEKFGFVKVKREKAVTAAELTLDKALLRRLR 180

Query: 510  NEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAREIVELKTGG 331
            NEAA M+ W+KV KAGVTQ VVDQ+  TWR +ELA++KFD+PL RNMDRAREIVE KTGG
Sbjct: 181  NEAARMRTWIKVKKAGVTQDVVDQIKRTWRRNELAMIKFDIPLCRNMDRAREIVETKTGG 240

Query: 330  LVVWGNKDLLAVYRGCNY---GRRLGNLRNMNY-------SSAGD-----QGNSSSNIIY 196
            LVV   KD L VYRGCN+    +   +LR  +Y       ++ GD       +SSS ++ 
Sbjct: 241  LVVLSKKDFLVVYRGCNHQLTTKGSPSLRTNHYEMNRVELATKGDIFRVESNHSSSEMLN 300

Query: 195  QNTRTVARETSGESNPH-ELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKP 19
             N       ++G  + + +L++G           SLYERE +RLLD LGPRF+DWWM KP
Sbjct: 301  WNADHKDSISTGIQDVNCQLVNG-----------SLYERETERLLDGLGPRFIDWWMHKP 349

Query: 18   LPVDGD 1
            LPVD D
Sbjct: 350  LPVDAD 355


>ref|XP_006603055.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X2 [Glycine max]
            gi|571550194|ref|XP_006603056.1| PREDICTED: chloroplastic
            group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X3 [Glycine max]
            gi|571550197|ref|XP_006603057.1| PREDICTED: chloroplastic
            group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X4 [Glycine max]
          Length = 747

 Score =  238 bits (607), Expect = 3e-60
 Identities = 152/366 (41%), Positives = 209/366 (57%), Gaps = 19/366 (5%)
 Frame = -2

Query: 1041 SRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVGGARGRLA 862
            S+  +K+PT PWM  PL ++P+E+  +    +K  + ++  E  D+ L GK    RG+ A
Sbjct: 40   SQVPIKSPTPPWMKVPLLLQPHELVDLSNPKSKKFKPEK-HELSDKALMGKE--VRGKRA 96

Query: 861  MKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYGGDXXXXXXECSKAAEEN 682
            MKKI    EKL +T +      ++E+R                               ++
Sbjct: 97   MKKIVDRVEKLHKTQN------SNETRV------------------------------DS 120

Query: 681  LNGNEFD--IPLFDAGKEVKSK-KLPWEKEEKMVIRRAKKEKVVTAAESSLDEMLLERLR 511
            LN   F   + +    +EV+SK ++PWEK+EK    + K+EK VTAAE +LD+ LL RLR
Sbjct: 121  LNVENFGGYLEILKENEEVRSKGRMPWEKDEKFGFVKVKREKAVTAAELTLDKALLRRLR 180

Query: 510  NEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAREIVELKTGG 331
            NEAA M+ W+KV KAGVTQ VVDQ+  TWR +ELA++KFD+PL RNMDRAREIVE KTGG
Sbjct: 181  NEAARMRTWIKVKKAGVTQDVVDQIKRTWRRNELAMIKFDIPLCRNMDRAREIVETKTGG 240

Query: 330  LVVWGNKDLLAVYRGCNY---GRRLGNLRNMNY-------SSAGD-----QGNSSSNIIY 196
            LVV   KD L VYRGCN+    +   +LR  +Y       ++ GD       +SSS ++ 
Sbjct: 241  LVVLSKKDFLVVYRGCNHQLTTKGSPSLRTNHYEMNRVELATKGDIFRVESNHSSSEMLN 300

Query: 195  QNTRTVARETSGESNPH-ELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKP 19
             N       ++G  + + +L++G           SLYERE +RLLD LGPRF+DWWM KP
Sbjct: 301  WNADHKDSISTGIQDVNCQLVNG-----------SLYERETERLLDGLGPRFIDWWMHKP 349

Query: 18   LPVDGD 1
            LPVD D
Sbjct: 350  LPVDAD 355


>ref|XP_006603054.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X1 [Glycine max]
          Length = 750

 Score =  238 bits (607), Expect = 3e-60
 Identities = 152/366 (41%), Positives = 209/366 (57%), Gaps = 19/366 (5%)
 Frame = -2

Query: 1041 SRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVGGARGRLA 862
            S+  +K+PT PWM  PL ++P+E+  +    +K  + ++  E  D+ L GK    RG+ A
Sbjct: 40   SQVPIKSPTPPWMKVPLLLQPHELVDLSNPKSKKFKPEK-HELSDKALMGKE--VRGKRA 96

Query: 861  MKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYGGDXXXXXXECSKAAEEN 682
            MKKI    EKL +T +      ++E+R                               ++
Sbjct: 97   MKKIVDRVEKLHKTQN------SNETRV------------------------------DS 120

Query: 681  LNGNEFD--IPLFDAGKEVKSK-KLPWEKEEKMVIRRAKKEKVVTAAESSLDEMLLERLR 511
            LN   F   + +    +EV+SK ++PWEK+EK    + K+EK VTAAE +LD+ LL RLR
Sbjct: 121  LNVENFGGYLEILKENEEVRSKGRMPWEKDEKFGFVKVKREKAVTAAELTLDKALLRRLR 180

Query: 510  NEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAREIVELKTGG 331
            NEAA M+ W+KV KAGVTQ VVDQ+  TWR +ELA++KFD+PL RNMDRAREIVE KTGG
Sbjct: 181  NEAARMRTWIKVKKAGVTQDVVDQIKRTWRRNELAMIKFDIPLCRNMDRAREIVETKTGG 240

Query: 330  LVVWGNKDLLAVYRGCNY---GRRLGNLRNMNY-------SSAGD-----QGNSSSNIIY 196
            LVV   KD L VYRGCN+    +   +LR  +Y       ++ GD       +SSS ++ 
Sbjct: 241  LVVLSKKDFLVVYRGCNHQLTTKGSPSLRTNHYEMNRVELATKGDIFRVESNHSSSEMLN 300

Query: 195  QNTRTVARETSGESNPH-ELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKP 19
             N       ++G  + + +L++G           SLYERE +RLLD LGPRF+DWWM KP
Sbjct: 301  WNADHKDSISTGIQDVNCQLVNG-----------SLYERETERLLDGLGPRFIDWWMHKP 349

Query: 18   LPVDGD 1
            LPVD D
Sbjct: 350  LPVDAD 355


>ref|XP_006482225.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X1 [Citrus sinensis]
            gi|568857343|ref|XP_006482226.1| PREDICTED: chloroplastic
            group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X2 [Citrus sinensis]
          Length = 771

 Score =  234 bits (598), Expect = 4e-59
 Identities = 152/360 (42%), Positives = 203/360 (56%), Gaps = 18/360 (5%)
 Frame = -2

Query: 1026 KAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVGGARGRLAMKKIF 847
            K PTAPWM  P+ ++P+E+    +   K     +  +  D+ LT K  G RG+ AMKKI 
Sbjct: 54   KMPTAPWMRSPIVLQPDEIIKPSKPKTK-----KSFKKTDKGLTAKESGVRGKQAMKKII 108

Query: 846  KSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYGGDXXXXXXECSKAAEENLNGNE 667
            ++ EKLQ+   L+   K     KF+F  G    N  +               EE+L G  
Sbjct: 109  ENIEKLQKDQILDETQKK-VMEKFEF-KGCFEENVSH---------------EEDLRGG- 150

Query: 666  FDIPLFDAGKEVKSKKLPWEKEEKMVIRRAKKEKVVTAAESSLDEMLLERLRNEAALMKK 487
                           K+PW +E++ V RR KKE++VT AE+ LD  LLERL++EA  M+K
Sbjct: 151  ------------FGGKVPWLREDRFVFRRMKKERMVTKAETMLDGELLERLKDEARKMRK 198

Query: 486  WVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAREIVELKTGGLVVWGNKD 307
            WVKV KAGVT++VV ++   WR +ELA++KFD+PL RNMDRAREI+ELKTGGLV+W  KD
Sbjct: 199  WVKVKKAGVTESVVFEIRLAWRRNELAMVKFDVPLCRNMDRAREILELKTGGLVIWTKKD 258

Query: 306  LLAVYRGCNYGRRLGNLRNMNYSSAGDQG---NSSSNI----------IYQNTRTVARET 166
               VYRG +    +     M   SA DQ    + S+++          I  NT T+ +  
Sbjct: 259  AHVVYRGDSSKSSV----KMCPRSADDQEAPLSKSTHLHLEKKVNVSWIKSNTATLDQNR 314

Query: 165  S---GESN--PHELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKPLPVDGD 1
            S   GE N  P  +   ++ +++     SLYERE DRLLD LGPRFVDWWM KPLPVDGD
Sbjct: 315  SLKDGEENSLPTSIFMDKNLRIDK----SLYEREGDRLLDGLGPRFVDWWMWKPLPVDGD 370


>ref|XP_004158502.1| PREDICTED: LOW QUALITY PROTEIN: chloroplastic group IIA intron
            splicing facilitator CRS1, chloroplastic-like [Cucumis
            sativus]
          Length = 760

 Score =  232 bits (591), Expect = 2e-58
 Identities = 151/387 (39%), Positives = 204/387 (52%), Gaps = 26/387 (6%)
 Frame = -2

Query: 1083 SEDSSAQSSKSIPH----SRSKLKAPTAPWMAEPLFVKPNEME-------SMKRRNNKGL 937
            S  S+   S  +P     S + +   TAPWM  PL ++P + E       + KRRN    
Sbjct: 36   SATSTPSQSSVLPEPPSISNAAVNLRTAPWMKAPLHLQPQQQEEEGVDPANPKRRNGS-- 93

Query: 936  ELDRIGEHPDEDLTGKVG-GARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPG 760
              D  G        G  G    G+ AM++I KS  KL+                      
Sbjct: 94   --DGSGRDKCSRALGDSGIDKTGKYAMRRIAKSIGKLRR--------------------- 130

Query: 759  ALCGNGDYGGDXXXXXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPWEKEEKMVIRR 580
                NGD G          ++   E +   +FD+  F+  +    +++PWEK++  ++ R
Sbjct: 131  ----NGDLGE---------TRMKLEEVEFGDFDLEGFE--ESGTRRRMPWEKDDDGIVLR 175

Query: 579  AKKEKVVTAAESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALL 400
              K+K VT+AE +LD +LLERL+ EA+ M+KWVKV K GVTQ VV+Q+ F W  +ELA+L
Sbjct: 176  RMKKKTVTSAELNLDRVLLERLKGEASKMEKWVKVNKVGVTQDVVNQIQFMWERNELAML 235

Query: 399  KFDLPLSRNMDRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGDQG 220
            KFD+PLSRNMDRAREIVE+KTGG+VVW  K+ L +YRGCNY        N+ +S+     
Sbjct: 236  KFDVPLSRNMDRAREIVEMKTGGMVVWSKKNALVIYRGCNYP------LNLKHSTKKQVH 289

Query: 219  NSSSNIIYQNTRT-VARETSGESNPHELIHGRDG-----------KLENLE--MASLYER 82
             S  N +   T T  +     ES  +  I+  DG           + ENL+    SLYER
Sbjct: 290  ISPQNPVKVETDTHFSLSGHYESGLNRSINDNDGEWEEASSFFLIRHENLQPLSGSLYER 349

Query: 81   EADRLLDELGPRFVDWWMQKPLPVDGD 1
            E DRLLD+LGPRF+DWWM KPLPVD D
Sbjct: 350  ETDRLLDDLGPRFIDWWMHKPLPVDAD 376

