BLASTX nr result

ID: Rehmannia24_contig00026179 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia24_contig00026179
         (1302 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006357840.1| PREDICTED: chloroplastic group IIA intron sp...   284   6e-74
ref|XP_004233710.1| PREDICTED: chloroplastic group IIA intron sp...   273   1e-70
ref|XP_002280704.1| PREDICTED: chloroplastic group IIA intron sp...   250   1e-63
gb|EMJ18512.1| hypothetical protein PRUPE_ppa016241mg [Prunus pe...   236   2e-59
ref|XP_002516757.1| conserved hypothetical protein [Ricinus comm...   234   5e-59
ref|XP_006430740.1| hypothetical protein CICLE_v10013368mg [Citr...   215   4e-53
ref|XP_006482225.1| PREDICTED: chloroplastic group IIA intron sp...   211   7e-52
gb|EOY04147.1| maize chloroplast splicing factor CRS1, putative ...   206   1e-50
gb|EOY04146.1| maize chloroplast splicing factor CRS1, putative ...   206   1e-50
gb|EOY04145.1| maize chloroplast splicing factor CRS1, putative ...   206   1e-50
gb|EOY04144.1| maize chloroplast splicing factor CRS1, putative ...   206   1e-50
gb|EOY04143.1| maize chloroplast splicing factor CRS1, putative ...   206   1e-50
gb|EXC20503.1| Chloroplastic group IIA intron splicing facilitat...   205   3e-50
gb|EPS58217.1| hypothetical protein M569_16596, partial [Genlise...   202   2e-49
ref|XP_006287146.1| hypothetical protein CARUB_v10000322mg, part...   201   6e-49
ref|XP_006603058.1| PREDICTED: chloroplastic group IIA intron sp...   198   4e-48
ref|XP_006603055.1| PREDICTED: chloroplastic group IIA intron sp...   198   4e-48
ref|XP_006603054.1| PREDICTED: chloroplastic group IIA intron sp...   198   4e-48
ref|XP_006400150.1| hypothetical protein EUTSA_v10012836mg [Eutr...   197   1e-47
ref|NP_197122.2| chloroplast splicing factor CRS1-like protein [...   194   9e-47

>ref|XP_006357840.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Solanum tuberosum]
          Length = 802

 Score =  284 bits (726), Expect = 6e-74
 Identities = 176/385 (45%), Positives = 233/385 (60%), Gaps = 7/385 (1%)
 Frame = +3

Query: 114  MSASPFLTHFSNAITPFLYNPSKIPTPFLSSNPSKYNFITFSSPPSNNRSNGTKIERKSA 293
            MSA   L   SN +     N        L S      F +FSS  ++N +     E+ + 
Sbjct: 1    MSAPLVLAPNSNTLCYHHSNSFINQKTLLFSKSFNSKFTSFSSQYNDNNNPIKNEEQYNL 60

Query: 294  KYETHDSYTKSSESTPHSGSTIKAPTAPWMSKPLLVKPNEIMEFTKRRTKKDFTIGENGE 473
            ++E  D Y  SS       S IK PTAPWM  PLL++PN+ ++ +K R KKD    +  +
Sbjct: 61   EFENQD-YGSSS-------SGIKGPTAPWMRGPLLLEPNQFLDLSKSRKKKDANFAKT-Q 111

Query: 474  HPDMALTGKVGGARGKVAMKKIFKGIEKLQETHDLEEIS-QTPENLKFKFAPGDL--WGN 644
            +P+ AL+GKV G RGK AMK I++GI+KLQET   E    +T   ++F+F PG L  WG+
Sbjct: 112  NPNDALSGKVSGGRGKKAMKMIYQGIDKLQETQIGEGTQVETDAKVEFQFPPGSLSEWGD 171

Query: 645  GVYESDAEVQEYSEEVQL-SLESTEFDIPLAXXXXXXXXXXX---PWERDERMVIRRVKK 812
              YE + E   Y EE  + SLE  EF +                 PWE + R+V RR+KK
Sbjct: 172  VSYEIE-EKNPYGEEDNVESLEGVEFGVLSREGEGRGSRKIGVKMPWESEVRIVYRRMKK 230

Query: 813  EKVVTAAESILDGILLERLRNEAAMIRKWVKVKKAGVTQAVVDQVHFIWRNNELALLKFD 992
            EKVV  AES LD +LLERLR EAA I+KWVKVKKAGVT+ VVDQ+HFIW+NNELA+LKFD
Sbjct: 231  EKVVMTAESNLDAMLLERLRGEAARIQKWVKVKKAGVTRTVVDQIHFIWKNNELAMLKFD 290

Query: 993  IPLCQNMDRAQEIVEMKTGGVVVWRNKDFLAVYRGRIYESGSKHIRNIRRNSVGDRENSS 1172
            +PLC+NMDRA+EIVEMKTGG VVW  ++ L VYRG  Y    K +++   + +   +NSS
Sbjct: 291  LPLCRNMDRAREIVEMKTGGFVVWMKQNALVVYRGCSYTLQQKELQH---DFLCSHQNSS 347

Query: 1173 STMNYQNTTTFARVSSDGSSLDDTI 1247
             T N + T+ F+ ++S GSS D+ I
Sbjct: 348  FTENIKQTSIFSPLNSSGSSEDEMI 372


>ref|XP_004233710.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Solanum lycopersicum]
          Length = 766

 Score =  273 bits (697), Expect = 1e-70
 Identities = 171/365 (46%), Positives = 222/365 (60%), Gaps = 8/365 (2%)
 Frame = +3

Query: 114  MSASPFLTHFSNAITPFLYNPSKIPTPFLSSNPSKYNFITFSSPPSNNRSNGTKIERKSA 293
            MSA+  +   SN +     N        L S      F TFSS  ++N +   K+E+ + 
Sbjct: 1    MSATLLVAPNSNTLCCHHANSFINQKTLLFSKSFNSKFTTFSSQSNDNNNPIKKVEQCNL 60

Query: 294  KYETHDSYTKSSESTPHSGSTIKAPTAPWMSKPLLVKPNEIMEFTKRRTKKDFTIGENGE 473
            ++E  D Y  SS       S IK PTAPWM  PLL++PN++++ +K R KKD    +  +
Sbjct: 61   EFENQD-YGSSS-------SGIKGPTAPWMRGPLLLEPNQVLDLSKSRKKKDTNFAKT-Q 111

Query: 474  HPDMALTGKVGGARGKVAMKKIFKGIEKLQETHDLEEISQTPENLK--FKFAPGDL--WG 641
            +P+ AL+GKV G RGK AMK I++GI+KLQET  + E +Q   ++K  F+F PG L  WG
Sbjct: 112  NPNDALSGKVSGGRGKKAMKMIYQGIDKLQETQ-IGECTQVETDVKVEFQFPPGSLSGWG 170

Query: 642  NGVYESDAEVQEYSEEVQL-SLESTEFDIPLAXXXXXXXXXXX---PWERDERMVIRRVK 809
            +  YE + E   Y EE  + SLE  EF +                 PWE +ER+V RR+K
Sbjct: 171  DVSYEIE-EKNPYGEEDNVESLEGVEFGVLSREGEGRGSRKSGARMPWESEERIVYRRMK 229

Query: 810  KEKVVTAAESILDGILLERLRNEAAMIRKWVKVKKAGVTQAVVDQVHFIWRNNELALLKF 989
            KEKVV  AES LD +LLERLR EAA I+KWVKVKKAGVT+ VVDQ+ FIW+NNELA+LKF
Sbjct: 230  KEKVVRTAESNLDAMLLERLRGEAARIQKWVKVKKAGVTRTVVDQIQFIWKNNELAMLKF 289

Query: 990  DIPLCQNMDRAQEIVEMKTGGVVVWRNKDFLAVYRGRIYESGSKHIRNIRRNSVGDRENS 1169
            D+PLC+NMDRA++IVEMKTGG VVW  ++ L VYRG  YE            SVG+ E  
Sbjct: 290  DLPLCRNMDRARDIVEMKTGGFVVWMKQNALVVYRG--YE----------MISVGNSEED 337

Query: 1170 SSTMN 1184
            S  MN
Sbjct: 338  SLVMN 342


>ref|XP_002280704.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Vitis vinifera]
          Length = 1184

 Score =  250 bits (638), Expect = 1e-63
 Identities = 149/367 (40%), Positives = 205/367 (55%), Gaps = 9/367 (2%)
 Frame = +3

Query: 186  PTPFLSSNPSKYNFITFSS-----PPSNNRSNGTKIERKSAKYETHDSYTKSSESTPHSG 350
            P P  S  PS  N ++ SS     P   +      I   +     H  ++ SS+    + 
Sbjct: 11   PIPNHSQFPSNSNSLSNSSIRILNPQRIHSFKPPPISATTTATTNHPDHSISSQPVSGTD 70

Query: 351  STIKAPTAPWMSKPLLVKPNEIMEFTKRRTKKDFTIGENG-EHPDMALTGKVGGARGKVA 527
            + IK PTAPWM  PLL++PNE+++ +K R KK    G  G E PD +LT KV G RG  A
Sbjct: 71   AAIKMPTAPWMKGPLLLQPNEVLDLSKARPKK--VAGSAGAEKPDRSLTEKVSGGRGAKA 128

Query: 528  MKKIFKGIEKLQETHDLEEISQTPENLKFKFAPGDLWGNGVYESDAEVQEYSEEVQLSLE 707
            MKKI + I KLQETH  +E  +  E  +F                           +SLE
Sbjct: 129  MKKIMQSIVKLQETHTSDETQENTEEFEFG--------------------------VSLE 162

Query: 708  STEFDIPLAXXXXXXXXXXXPWERDERMVIRRVKKEKVVTAAESILDGILLERLRNEAAM 887
                D               PW + E++V RR KKEKVVTAAE  LD +LLERLR EA  
Sbjct: 163  GIGGD------ENSRIGGKMPWLKTEKVVFRRTKKEKVVTAAELTLDPMLLERLRGEAVK 216

Query: 888  IRKWVKVKKAGVTQAVVDQVHFIWRNNELALLKFDIPLCQNMDRAQEIVEMKTGGVVVWR 1067
            +RKWVKVKKAGVT++VVDQ+H +W+++ELA++KFD+PLC+NMDRA+EI+E+KT G+V+W 
Sbjct: 217  MRKWVKVKKAGVTESVVDQIHMVWKSDELAMVKFDMPLCRNMDRAREILEIKTRGLVIWS 276

Query: 1068 NKDFLAVYRGRIYESGSKHIRNIRRNSVGDRENSSSTM---NYQNTTTFARVSSDGSSLD 1238
             KD L VYRG  Y+S SKH + +R   V   + S+S +   N+++  T + +    S+  
Sbjct: 277  KKDTLVVYRGSNYQSTSKHFQKMRPGLVAGADASNSKLNQSNFEDDLTISEIKFHESTTG 336

Query: 1239 DTIHGKD 1259
            + +  KD
Sbjct: 337  EKMGRKD 343


>gb|EMJ18512.1| hypothetical protein PRUPE_ppa016241mg [Prunus persica]
          Length = 809

 Score =  236 bits (601), Expect = 2e-59
 Identities = 148/362 (40%), Positives = 201/362 (55%), Gaps = 8/362 (2%)
 Frame = +3

Query: 114  MSASPFLTHFSNAITPFLYNPSKIPTPFLSSNPSKYNFITFSSPPSNNRSNGTKIERKSA 293
            M A+ FLT  S       + PS    PF S NP        +S P  N      I  KS 
Sbjct: 1    MPATLFLTPLSTLPNITHHLPSHSNPPFHSYNPISS---ALNSKPPQNPKPTNPIPSKSP 57

Query: 294  KYETHDSYTKS------SESTPHSGSTIKAPTAPWMSKPLLVKPNEIMEFTKRRTKKDFT 455
               +  S T +      SE    + + IKAPTAPWM  PLL++P+E+++F+K R KK   
Sbjct: 58   NSLSLSSTTTTPNSKAPSEPNSSTDACIKAPTAPWMKGPLLLQPHEVIDFSKPRNKKTHN 117

Query: 456  IGENGEHPDMALTGKVGGARGKVAMKKIFKGIEKLQETHDLEEISQTPENLKFKFAPGDL 635
                 E PD  L GK+ G RG  A+K+I + IE+L      +E  +        F    +
Sbjct: 118  -NAKAEKPDTVLAGKLVGIRGDKAIKQIVQSIERLGPNQKTDETQKG-------FGEFRI 169

Query: 636  WGN--GVYESDAEVQEYSEEVQLSLESTEFDIPLAXXXXXXXXXXXPWERDERMVIRRVK 809
            W +  G+ +++   + + + V+  +      +  A           PWERDER+V +R+K
Sbjct: 170  WDSLEGLGQNEKWDETHKDFVEFGIGGCLEGLGKAADSRFGGKM--PWERDERIVFQRIK 227

Query: 810  KEKVVTAAESILDGILLERLRNEAAMIRKWVKVKKAGVTQAVVDQVHFIWRNNELALLKF 989
            K++V +AAE  L+  LLERLR EAA +RKWVKVKKAGVTQA+VD + FIW+ NELA++KF
Sbjct: 228  KKRVASAAELSLEKELLERLRAEAAKMRKWVKVKKAGVTQAIVDDIKFIWKTNELAMVKF 287

Query: 990  DIPLCQNMDRAQEIVEMKTGGVVVWRNKDFLAVYRGRIYESGSKHIRNIRRNSVGDRENS 1169
            D+PLC+NM RAQEIVE KTGG+VVW  KD L +YRG  Y+S SK    +R  S   +E  
Sbjct: 288  DVPLCRNMHRAQEIVETKTGGMVVWGKKDTLVIYRGCNYQSSSKFFPKMRPCSADRQETL 347

Query: 1170 SS 1175
            SS
Sbjct: 348  SS 349


>ref|XP_002516757.1| conserved hypothetical protein [Ricinus communis]
            gi|223544130|gb|EEF45655.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 742

 Score =  234 bits (598), Expect = 5e-59
 Identities = 141/322 (43%), Positives = 183/322 (56%), Gaps = 4/322 (1%)
 Frame = +3

Query: 174  PSKIPTPFLSSNPSKYNFITFSSPPSNNRSNGTKIERKSAKYETHDSYTKSSESTPHSGS 353
            PS +   F S NP     I  S  P+ N+S+    +        +  +T  S     S +
Sbjct: 2    PSALFLQFFSYNP-----IASSLNPATNKSSLNNAQNPKFATNKNTEFTLLSVPNSQSNA 56

Query: 354  TIKAPTAPWMSKPLLVKPNEIMEFTKRRTKKDFTIGENGEHPDMALTGKVGGARGKVAMK 533
             IK PTAPWM  PLL++P+E++  +K R K       N E  D  LTGK  G RGK AM+
Sbjct: 57   PIKVPTAPWMKGPLLLQPHELINLSKPRNKNSSN-NANIEKSDKVLTGKESGVRGKKAME 115

Query: 534  KIFKGIEKLQETHDLEEI---SQTPENLKFKFAPGDLWGNGVYESDAEVQEYSEEVQLSL 704
            KI K IE+LQE   LE+    SQ  E  +                D+E  E  E++ L  
Sbjct: 116  KIVKSIEQLQENQALEKTQCDSQAYEKTQL---------------DSEAFEIGEKLGLIR 160

Query: 705  ESTEFDIPLAXXXXXXXXXXXPWERDERMVIRRVKKEKVVTAAESILDGILLERLRNEAA 884
            E  +F +              PWER+E+ V  R+KKEK VT AE IL+  LLE LR EA+
Sbjct: 161  EHGDFGV---------NKKLKPWEREEKFVYWRIKKEKAVTKAELILEKELLEILRTEAS 211

Query: 885  MIRKWVKVKKAGVTQAVVDQVHFIWRNNELALLKFDIPLCQNMDRAQEIVEMKTGGVVVW 1064
             +RKWVKV KAGVTQ+VVDQ+ + WRNNELA++KFD+PLC+NMDRA+EIVE+KTGG+VVW
Sbjct: 212  KMRKWVKVMKAGVTQSVVDQIRYAWRNNELAMVKFDLPLCRNMDRAREIVELKTGGLVVW 271

Query: 1065 RNKDFLAVYRGRIYE-SGSKHI 1127
              KD L +YRG  Y  + S H+
Sbjct: 272  TRKDSLVIYRGCNYHLTKSSHV 293


>ref|XP_006430740.1| hypothetical protein CICLE_v10013368mg [Citrus clementina]
            gi|557532797|gb|ESR43980.1| hypothetical protein
            CICLE_v10013368mg [Citrus clementina]
          Length = 770

 Score =  215 bits (547), Expect = 4e-53
 Identities = 126/307 (41%), Positives = 181/307 (58%), Gaps = 3/307 (0%)
 Frame = +3

Query: 360  KAPTAPWMSKPLLVKPNEIMEFTKRRTKKDFTIGENGEHPDMALTGKVGGARGKVAMKKI 539
            K PTAPWM  P++++P+EI++ +K +TKK F      +  D  LT K  G RGK AMKKI
Sbjct: 60   KMPTAPWMRSPIVLQPDEIIKPSKPKTKKSF------KKTDKGLTAKESGVRGKQAMKKI 113

Query: 540  FKGIEKLQETHDLEEISQTPENLKFKFAPGDLWGNGVYESDAEVQEYSEEVQLSLESTEF 719
             + IEKLQ+   L+E +Q  +  KF+F        G +E +   +E              
Sbjct: 114  IENIEKLQKDQILDE-TQKKDMEKFEF-------RGCFEENGSDEE-------------- 151

Query: 720  DIPLAXXXXXXXXXXXPWERDERMVIRRVKKEKVVTAAESILDGILLERLRNEAAMIRKW 899
                            PW R+ER V RR+KKE++VT AE++LDG L+ERL++EA  +RKW
Sbjct: 152  ------DLRGGFGGKVPWLREERFVFRRMKKERMVTKAETMLDGELIERLKDEARKMRKW 205

Query: 900  VKVKKAGVTQAVVDQVHFIWRNNELALLKFDIPLCQNMDRAQEIVEMKTGGVVVWRNKDF 1079
            VKVKKAGVT++VV ++   WR NELA++KFD+PLC+NMDRA+EI+E+KTGG+V+W  KD 
Sbjct: 206  VKVKKAGVTESVVFEIRLAWRRNELAMVKFDVPLCRNMDRAREILELKTGGLVIWTKKDA 265

Query: 1080 LAVYRGRIYESGSKHIRNIRRNSVGDRE---NSSSTMNYQNTTTFARVSSDGSSLDDTIH 1250
              VYRG     GSK    +   S  D+E   + S+ ++ +     + + S+ ++LD    
Sbjct: 266  HVVYRG----DGSKSSVKMCPRSADDQEAPLSKSTHLHLEKKVNVSWIKSNTATLDQNRS 321

Query: 1251 GKDKWES 1271
             KD  E+
Sbjct: 322  LKDGEEN 328


>ref|XP_006482225.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X1 [Citrus sinensis]
            gi|568857343|ref|XP_006482226.1| PREDICTED: chloroplastic
            group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X2 [Citrus sinensis]
          Length = 771

 Score =  211 bits (536), Expect = 7e-52
 Identities = 125/307 (40%), Positives = 179/307 (58%), Gaps = 3/307 (0%)
 Frame = +3

Query: 360  KAPTAPWMSKPLLVKPNEIMEFTKRRTKKDFTIGENGEHPDMALTGKVGGARGKVAMKKI 539
            K PTAPWM  P++++P+EI++ +K +TKK F      +  D  LT K  G RGK AMKKI
Sbjct: 54   KMPTAPWMRSPIVLQPDEIIKPSKPKTKKSF------KKTDKGLTAKESGVRGKQAMKKI 107

Query: 540  FKGIEKLQETHDLEEISQTPENLKFKFAPGDLWGNGVYESDAEVQEYSEEVQLSLESTEF 719
             + IEKLQ+   L+E +Q     KF+F        G +E +   +E              
Sbjct: 108  IENIEKLQKDQILDE-TQKKVMEKFEF-------KGCFEENVSHEE-------------- 145

Query: 720  DIPLAXXXXXXXXXXXPWERDERMVIRRVKKEKVVTAAESILDGILLERLRNEAAMIRKW 899
                            PW R++R V RR+KKE++VT AE++LDG LLERL++EA  +RKW
Sbjct: 146  ------DLRGGFGGKVPWLREDRFVFRRMKKERMVTKAETMLDGELLERLKDEARKMRKW 199

Query: 900  VKVKKAGVTQAVVDQVHFIWRNNELALLKFDIPLCQNMDRAQEIVEMKTGGVVVWRNKDF 1079
            VKVKKAGVT++VV ++   WR NELA++KFD+PLC+NMDRA+EI+E+KTGG+V+W  KD 
Sbjct: 200  VKVKKAGVTESVVFEIRLAWRRNELAMVKFDVPLCRNMDRAREILELKTGGLVIWTKKDA 259

Query: 1080 LAVYRGRIYESGSKHIRNIRRNSVGDRE---NSSSTMNYQNTTTFARVSSDGSSLDDTIH 1250
              VYRG      SK    +   S  D+E   + S+ ++ +     + + S+ ++LD    
Sbjct: 260  HVVYRG----DSSKSSVKMCPRSADDQEAPLSKSTHLHLEKKVNVSWIKSNTATLDQNRS 315

Query: 1251 GKDKWES 1271
             KD  E+
Sbjct: 316  LKDGEEN 322


>gb|EOY04147.1| maize chloroplast splicing factor CRS1, putative isoform 5 [Theobroma
            cacao]
          Length = 788

 Score =  206 bits (525), Expect = 1e-50
 Identities = 134/350 (38%), Positives = 199/350 (56%), Gaps = 5/350 (1%)
 Frame = +3

Query: 228  ITFSSPPSNNRSNGTKIERKSAKYETHDSYTKSSESTPHSGSTIKAPTAPWMSKPLLVKP 407
            I FSS   N+  N +K  +++     +  ++ S +  P++G  IK PTAPWM  PLL++P
Sbjct: 49   IPFSSS-LNSSQNPSKTHKENRSLNNNSKFSVSKD--PNNGP-IKMPTAPWMKGPLLLQP 104

Query: 408  NEIMEFTKRRTKKDFTIGENGEHPDMALTGKVGGARGKVAMKKIFKGIEKLQETHDLEEI 587
            +E++  +K  +KK  +     + PD AL GK  G RGK  MKKI + +E LQ    LE+ 
Sbjct: 105  HEVLNPSKSTSKK--SSNSKAKAPDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLED- 161

Query: 588  SQTPENLKFKFAPGDLWGNGVYESDAEVQEYSEEVQLSLESTEFDIPLAXXXXXXXXXXX 767
              T   ++ +F  G+      + SD EV+ +  ++                         
Sbjct: 162  --TQIGIREEFEVGNWLEE--FGSDGEVKRFDGKM------------------------- 192

Query: 768  PWERDE-RMVIRRVKKEKVVTAAESILDGILLERLRNEAAMIRKWVKVKKAGVTQAVVDQ 944
            PW R+E ++V RR+KKEK++T AE  LD  LLERLR +A  +RKW+KV K GVT+AVVD+
Sbjct: 193  PWLREEEKVVFRRMKKEKLLTQAEISLDKDLLERLRRKAMRMRKWIKVMKLGVTKAVVDE 252

Query: 945  VHFIWRNNELALLKFDIPLCQNMDRAQEIVEMKTGGVVVWRNKDFLAVYRGRIYESGSKH 1124
            +   WR NEL ++KF +PLC+NMDRA+EI+EMKT G+VVW  KD L VYRG  +   SK 
Sbjct: 253  IKLAWRKNELVMVKFGVPLCRNMDRAREIIEMKTRGLVVWGKKDALVVYRGCSHGLTSK- 311

Query: 1125 IRNIRRNSVGD-RENSSSTMNY---QNTTTFARVSSDGSSLDDTIHGKDK 1262
            I +++     D +E SSST ++    N    +    +GS+L   ++ +D+
Sbjct: 312  ISSMKYPRCADGQEISSSTFSHLTSSNNINMSLEKFNGSTLQSGLYREDR 361


>gb|EOY04146.1| maize chloroplast splicing factor CRS1, putative isoform 4 [Theobroma
            cacao]
          Length = 767

 Score =  206 bits (525), Expect = 1e-50
 Identities = 134/350 (38%), Positives = 199/350 (56%), Gaps = 5/350 (1%)
 Frame = +3

Query: 228  ITFSSPPSNNRSNGTKIERKSAKYETHDSYTKSSESTPHSGSTIKAPTAPWMSKPLLVKP 407
            I FSS   N+  N +K  +++     +  ++ S +  P++G  IK PTAPWM  PLL++P
Sbjct: 23   IPFSSS-LNSSQNPSKTHKENRSLNNNSKFSVSKD--PNNGP-IKMPTAPWMKGPLLLQP 78

Query: 408  NEIMEFTKRRTKKDFTIGENGEHPDMALTGKVGGARGKVAMKKIFKGIEKLQETHDLEEI 587
            +E++  +K  +KK  +     + PD AL GK  G RGK  MKKI + +E LQ    LE+ 
Sbjct: 79   HEVLNPSKSTSKK--SSNSKAKAPDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLED- 135

Query: 588  SQTPENLKFKFAPGDLWGNGVYESDAEVQEYSEEVQLSLESTEFDIPLAXXXXXXXXXXX 767
              T   ++ +F  G+      + SD EV+ +  ++                         
Sbjct: 136  --TQIGIREEFEVGNWLEE--FGSDGEVKRFDGKM------------------------- 166

Query: 768  PWERDE-RMVIRRVKKEKVVTAAESILDGILLERLRNEAAMIRKWVKVKKAGVTQAVVDQ 944
            PW R+E ++V RR+KKEK++T AE  LD  LLERLR +A  +RKW+KV K GVT+AVVD+
Sbjct: 167  PWLREEEKVVFRRMKKEKLLTQAEISLDKDLLERLRRKAMRMRKWIKVMKLGVTKAVVDE 226

Query: 945  VHFIWRNNELALLKFDIPLCQNMDRAQEIVEMKTGGVVVWRNKDFLAVYRGRIYESGSKH 1124
            +   WR NEL ++KF +PLC+NMDRA+EI+EMKT G+VVW  KD L VYRG  +   SK 
Sbjct: 227  IKLAWRKNELVMVKFGVPLCRNMDRAREIIEMKTRGLVVWGKKDALVVYRGCSHGLTSK- 285

Query: 1125 IRNIRRNSVGD-RENSSSTMNY---QNTTTFARVSSDGSSLDDTIHGKDK 1262
            I +++     D +E SSST ++    N    +    +GS+L   ++ +D+
Sbjct: 286  ISSMKYPRCADGQEISSSTFSHLTSSNNINMSLEKFNGSTLQSGLYREDR 335


>gb|EOY04145.1| maize chloroplast splicing factor CRS1, putative isoform 3 [Theobroma
            cacao]
          Length = 788

 Score =  206 bits (525), Expect = 1e-50
 Identities = 134/350 (38%), Positives = 199/350 (56%), Gaps = 5/350 (1%)
 Frame = +3

Query: 228  ITFSSPPSNNRSNGTKIERKSAKYETHDSYTKSSESTPHSGSTIKAPTAPWMSKPLLVKP 407
            I FSS   N+  N +K  +++     +  ++ S +  P++G  IK PTAPWM  PLL++P
Sbjct: 23   IPFSSS-LNSSQNPSKTHKENRSLNNNSKFSVSKD--PNNGP-IKMPTAPWMKGPLLLQP 78

Query: 408  NEIMEFTKRRTKKDFTIGENGEHPDMALTGKVGGARGKVAMKKIFKGIEKLQETHDLEEI 587
            +E++  +K  +KK  +     + PD AL GK  G RGK  MKKI + +E LQ    LE+ 
Sbjct: 79   HEVLNPSKSTSKK--SSNSKAKAPDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLED- 135

Query: 588  SQTPENLKFKFAPGDLWGNGVYESDAEVQEYSEEVQLSLESTEFDIPLAXXXXXXXXXXX 767
              T   ++ +F  G+      + SD EV+ +  ++                         
Sbjct: 136  --TQIGIREEFEVGNWLEE--FGSDGEVKRFDGKM------------------------- 166

Query: 768  PWERDE-RMVIRRVKKEKVVTAAESILDGILLERLRNEAAMIRKWVKVKKAGVTQAVVDQ 944
            PW R+E ++V RR+KKEK++T AE  LD  LLERLR +A  +RKW+KV K GVT+AVVD+
Sbjct: 167  PWLREEEKVVFRRMKKEKLLTQAEISLDKDLLERLRRKAMRMRKWIKVMKLGVTKAVVDE 226

Query: 945  VHFIWRNNELALLKFDIPLCQNMDRAQEIVEMKTGGVVVWRNKDFLAVYRGRIYESGSKH 1124
            +   WR NEL ++KF +PLC+NMDRA+EI+EMKT G+VVW  KD L VYRG  +   SK 
Sbjct: 227  IKLAWRKNELVMVKFGVPLCRNMDRAREIIEMKTRGLVVWGKKDALVVYRGCSHGLTSK- 285

Query: 1125 IRNIRRNSVGD-RENSSSTMNY---QNTTTFARVSSDGSSLDDTIHGKDK 1262
            I +++     D +E SSST ++    N    +    +GS+L   ++ +D+
Sbjct: 286  ISSMKYPRCADGQEISSSTFSHLTSSNNINMSLEKFNGSTLQSGLYREDR 335


>gb|EOY04144.1| maize chloroplast splicing factor CRS1, putative isoform 2 [Theobroma
            cacao]
          Length = 804

 Score =  206 bits (525), Expect = 1e-50
 Identities = 134/350 (38%), Positives = 199/350 (56%), Gaps = 5/350 (1%)
 Frame = +3

Query: 228  ITFSSPPSNNRSNGTKIERKSAKYETHDSYTKSSESTPHSGSTIKAPTAPWMSKPLLVKP 407
            I FSS   N+  N +K  +++     +  ++ S +  P++G  IK PTAPWM  PLL++P
Sbjct: 49   IPFSSS-LNSSQNPSKTHKENRSLNNNSKFSVSKD--PNNGP-IKMPTAPWMKGPLLLQP 104

Query: 408  NEIMEFTKRRTKKDFTIGENGEHPDMALTGKVGGARGKVAMKKIFKGIEKLQETHDLEEI 587
            +E++  +K  +KK  +     + PD AL GK  G RGK  MKKI + +E LQ    LE+ 
Sbjct: 105  HEVLNPSKSTSKK--SSNSKAKAPDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLED- 161

Query: 588  SQTPENLKFKFAPGDLWGNGVYESDAEVQEYSEEVQLSLESTEFDIPLAXXXXXXXXXXX 767
              T   ++ +F  G+      + SD EV+ +  ++                         
Sbjct: 162  --TQIGIREEFEVGNWLEE--FGSDGEVKRFDGKM------------------------- 192

Query: 768  PWERDE-RMVIRRVKKEKVVTAAESILDGILLERLRNEAAMIRKWVKVKKAGVTQAVVDQ 944
            PW R+E ++V RR+KKEK++T AE  LD  LLERLR +A  +RKW+KV K GVT+AVVD+
Sbjct: 193  PWLREEEKVVFRRMKKEKLLTQAEISLDKDLLERLRRKAMRMRKWIKVMKLGVTKAVVDE 252

Query: 945  VHFIWRNNELALLKFDIPLCQNMDRAQEIVEMKTGGVVVWRNKDFLAVYRGRIYESGSKH 1124
            +   WR NEL ++KF +PLC+NMDRA+EI+EMKT G+VVW  KD L VYRG  +   SK 
Sbjct: 253  IKLAWRKNELVMVKFGVPLCRNMDRAREIIEMKTRGLVVWGKKDALVVYRGCSHGLTSK- 311

Query: 1125 IRNIRRNSVGD-RENSSSTMNY---QNTTTFARVSSDGSSLDDTIHGKDK 1262
            I +++     D +E SSST ++    N    +    +GS+L   ++ +D+
Sbjct: 312  ISSMKYPRCADGQEISSSTFSHLTSSNNINMSLEKFNGSTLQSGLYREDR 361


>gb|EOY04143.1| maize chloroplast splicing factor CRS1, putative isoform 1 [Theobroma
            cacao]
          Length = 818

 Score =  206 bits (525), Expect = 1e-50
 Identities = 134/350 (38%), Positives = 199/350 (56%), Gaps = 5/350 (1%)
 Frame = +3

Query: 228  ITFSSPPSNNRSNGTKIERKSAKYETHDSYTKSSESTPHSGSTIKAPTAPWMSKPLLVKP 407
            I FSS   N+  N +K  +++     +  ++ S +  P++G  IK PTAPWM  PLL++P
Sbjct: 49   IPFSSS-LNSSQNPSKTHKENRSLNNNSKFSVSKD--PNNGP-IKMPTAPWMKGPLLLQP 104

Query: 408  NEIMEFTKRRTKKDFTIGENGEHPDMALTGKVGGARGKVAMKKIFKGIEKLQETHDLEEI 587
            +E++  +K  +KK  +     + PD AL GK  G RGK  MKKI + +E LQ    LE+ 
Sbjct: 105  HEVLNPSKSTSKK--SSNSKAKAPDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLED- 161

Query: 588  SQTPENLKFKFAPGDLWGNGVYESDAEVQEYSEEVQLSLESTEFDIPLAXXXXXXXXXXX 767
              T   ++ +F  G+      + SD EV+ +  ++                         
Sbjct: 162  --TQIGIREEFEVGNWLEE--FGSDGEVKRFDGKM------------------------- 192

Query: 768  PWERDE-RMVIRRVKKEKVVTAAESILDGILLERLRNEAAMIRKWVKVKKAGVTQAVVDQ 944
            PW R+E ++V RR+KKEK++T AE  LD  LLERLR +A  +RKW+KV K GVT+AVVD+
Sbjct: 193  PWLREEEKVVFRRMKKEKLLTQAEISLDKDLLERLRRKAMRMRKWIKVMKLGVTKAVVDE 252

Query: 945  VHFIWRNNELALLKFDIPLCQNMDRAQEIVEMKTGGVVVWRNKDFLAVYRGRIYESGSKH 1124
            +   WR NEL ++KF +PLC+NMDRA+EI+EMKT G+VVW  KD L VYRG  +   SK 
Sbjct: 253  IKLAWRKNELVMVKFGVPLCRNMDRAREIIEMKTRGLVVWGKKDALVVYRGCSHGLTSK- 311

Query: 1125 IRNIRRNSVGD-RENSSSTMNY---QNTTTFARVSSDGSSLDDTIHGKDK 1262
            I +++     D +E SSST ++    N    +    +GS+L   ++ +D+
Sbjct: 312  ISSMKYPRCADGQEISSSTFSHLTSSNNINMSLEKFNGSTLQSGLYREDR 361


>gb|EXC20503.1| Chloroplastic group IIA intron splicing facilitator CRS1 [Morus
            notabilis]
          Length = 828

 Score =  205 bits (522), Expect = 3e-50
 Identities = 138/364 (37%), Positives = 189/364 (51%), Gaps = 2/364 (0%)
 Frame = +3

Query: 171  NPSKIP-TPFLSSNPSKYNFITFSSPPSNNRSNGTKIERKSAKYETHDSYTKSSESTPHS 347
            +PS  P T  LSSN        F  P  +     + +  K   Y  H S  ++ +S P  
Sbjct: 8    SPSTFPNTHHLSSN--------FKRPSDSYILISSSLNPKPTNYHHHASTKENPDSKPPL 59

Query: 348  GSTIKAPTAPWMSKPLLVKPNEIMEFTKRRTKKDFTIGENGEHPDMALTGKVGGARGKVA 527
               IK PT PWM  PL+++P+E+ + +K      F+     E     LT K+ G RGK  
Sbjct: 60   -EPIKMPTPPWMKGPLVLQPHEVTDLSKPENDNKFS-NRKAEKSVNGLTDKLVGRRGKNV 117

Query: 528  MKKIFKGIEKLQETHDLEEISQTPENLKFKFAPGD-LWGNGVYESDAEVQEYSEEVQLSL 704
            +KKI + IE+L     ++   +T ++   K   GD L G G   S  E            
Sbjct: 118  IKKIARRIEELGRKSKVDS-EETQKDFVGKNGIGDCLEGLGESRSGGE------------ 164

Query: 705  ESTEFDIPLAXXXXXXXXXXXPWERDERMVIRRVKKEKVVTAAESILDGILLERLRNEAA 884
                                 PWE+DE  V RR+KKEK+V++AE  L+  LLERLR+EA 
Sbjct: 165  -------------------RMPWEKDEGFVFRRMKKEKIVSSAELRLERELLERLRSEAR 205

Query: 885  MIRKWVKVKKAGVTQAVVDQVHFIWRNNELALLKFDIPLCQNMDRAQEIVEMKTGGVVVW 1064
             +RKWVKVKKAGVT+ VV+ V F+W++NELA++KFD+PLC+NMDRAQEI+EMKTGG+VVW
Sbjct: 206  KMRKWVKVKKAGVTKEVVEDVKFVWKSNELAMVKFDVPLCRNMDRAQEILEMKTGGLVVW 265

Query: 1065 RNKDFLAVYRGRIYESGSKHIRNIRRNSVGDRENSSSTMNYQNTTTFARVSSDGSSLDDT 1244
            R KD   +YRG  Y+  SK          G +E   S +  Q  +      S+  S ++T
Sbjct: 266  RRKDAQVIYRGCNYQPTSKTFPRTYAGFSGHQETPFSNL-VQLDSRKGNSVSEVKSYENT 324

Query: 1245 IHGK 1256
            I  K
Sbjct: 325  IERK 328


>gb|EPS58217.1| hypothetical protein M569_16596, partial [Genlisea aurea]
          Length = 668

 Score =  202 bits (514), Expect = 2e-49
 Identities = 122/281 (43%), Positives = 171/281 (60%), Gaps = 8/281 (2%)
 Frame = +3

Query: 345  SGSTIKAPTAPWMSKPLLVKPNEIMEFTKRRTKKDFTIGENGEHPDMALTGKVGGARGKV 524
            S  ++ APTAPWM KPL V P+++++  K   KK+     N +  D  L+ KVG  R K+
Sbjct: 1    SSESVSAPTAPWMRKPLFVNPSQLLDLRKSPIKKN---SFNKQRLDKDLSEKVGNGRNKL 57

Query: 525  AMKKIFKGIEKLQETHDLEEISQT---PENLKFKFAPGDLWGNGVYESDAEVQEYSEEVQ 695
            AM++IF+GI+KLQE+    E + T   P+N +FKF PG+L GN     +   +  SE   
Sbjct: 58   AMRQIFRGIKKLQESRPSSEAAATEGSPKNFEFKFRPGELSGNPQDSKNDGCERNSETTD 117

Query: 696  LSLESTEFDIPL--AXXXXXXXXXXXPWERDERMVIRRVKKEKVVTAAE-SILDGILLER 866
                   F IPL  A           PW+R+   V R      ++ AA+ + +D +LLER
Sbjct: 118  ------GFCIPLREAAEGEEVRLKAMPWQREA--VGRMATNRPLMKAAKLNAIDELLLER 169

Query: 867  LRNEAAMIRKWVKVKKAGVTQAVVDQVHFIWRNN--ELALLKFDIPLCQNMDRAQEIVEM 1040
            L+NEAA +RKW+KVKK GVT  VVDQVH  WR++  +LALLKFD+PL + M RA+EIVEM
Sbjct: 170  LQNEAAKMRKWIKVKKLGVTPTVVDQVHSTWRSSRSQLALLKFDVPLNRCMSRAREIVEM 229

Query: 1041 KTGGVVVWRNKDFLAVYRGRIYESGSKHIRNIRRNSVGDRE 1163
            KTGG+ +W++KD +AVYRG   ES +    +   +S+ +RE
Sbjct: 230  KTGGIAIWKSKDLIAVYRGS--ESSNAQQSSASFSSLYERE 268


>ref|XP_006287146.1| hypothetical protein CARUB_v10000322mg, partial [Capsella rubella]
            gi|482555852|gb|EOA20044.1| hypothetical protein
            CARUB_v10000322mg, partial [Capsella rubella]
          Length = 726

 Score =  201 bits (511), Expect = 6e-49
 Identities = 122/302 (40%), Positives = 169/302 (55%), Gaps = 1/302 (0%)
 Frame = +3

Query: 195  FLSSNPSKYNFITFSSPPSNNRSNGTKIERKSAKYETHDSYTKSSESTPHSGSTIKAPTA 374
            FLS+          S  P+ N SN  K  +          + K  E+  HS + IK PTA
Sbjct: 34   FLSARAFPSLITNSSLNPNQNPSNAAKTPQ----------FDKFRENRGHSDAAIKVPTA 83

Query: 375  PWMSKPLLVKPNEIMEFTKRRTKKDFTIGENGEHPDMALTGKVGGARGKVAMKKIFKGIE 554
            PWM  PLL++P+EI++  KR   +     ++ E    AL  +  G RG+ AMKKI + +E
Sbjct: 84   PWMKGPLLLRPDEILDAEKRNKPR-----KSEEKTFKALNRRESGVRGRKAMKKIVRNVE 138

Query: 555  KLQETHDLEEISQTPENLKFKFAPGDLWGNGVYESDAEVQEYSEEVQLSLESTEFDIPLA 734
            KL E  D ++              GDL             E+    +++ E+TE      
Sbjct: 139  KLDEDSDSQDTQM-----------GDL------------SEFDCLARIAEETTESS---- 171

Query: 735  XXXXXXXXXXXPWERDE-RMVIRRVKKEKVVTAAESILDGILLERLRNEAAMIRKWVKVK 911
                       PWER+E R ++RR KKE+V T AE ILD  LL +LR EA+ +RKWV V+
Sbjct: 172  ---EKRFGGRMPWEREEERFILRRTKKERVPTTAELILDEGLLNKLRREASKMRKWVNVR 228

Query: 912  KAGVTQAVVDQVHFIWRNNELALLKFDIPLCQNMDRAQEIVEMKTGGVVVWRNKDFLAVY 1091
            KAGVT+ VV+++  IWR+NELA+++FDIPLC+NM+RAQEI+EMKTGG+VV   K+F+ VY
Sbjct: 229  KAGVTEPVVNEIRSIWRSNELAMVRFDIPLCRNMERAQEIIEMKTGGLVVLSKKEFVVVY 288

Query: 1092 RG 1097
            RG
Sbjct: 289  RG 290


>ref|XP_006603058.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X5 [Glycine max]
          Length = 744

 Score =  198 bits (504), Expect = 4e-48
 Identities = 135/323 (41%), Positives = 177/323 (54%), Gaps = 6/323 (1%)
 Frame = +3

Query: 195  FLSSNPSKYN------FITFSSPPSNNRSNGTKIERKSAKYETHDSYTKSSESTPHSGST 356
            FLS +PS +        I+ S PP++N  NG            H+    S    P     
Sbjct: 3    FLSFHPSLFPNSYSRFHISSSLPPNSN--NG------------HNHQHTSPSQVP----- 43

Query: 357  IKAPTAPWMSKPLLVKPNEIMEFTKRRTKKDFTIGENGEHPDMALTGKVGGARGKVAMKK 536
            IK+PT PWM  PLL++P+E+++ +  ++KK     E  E  D AL GK    RGK AMKK
Sbjct: 44   IKSPTPPWMKVPLLLQPHELVDLSNPKSKKFKP--EKHELSDKALMGKE--VRGKRAMKK 99

Query: 537  IFKGIEKLQETHDLEEISQTPENLKFKFAPGDLWGNGVYESDAEVQEYSEEVQLSLESTE 716
            I   +EKL +T +  E      N++           G Y    E+ + +EEV+       
Sbjct: 100  IVDRVEKLHKTQNSNETRVDSLNVE---------NFGGY---LEILKENEEVRSK----- 142

Query: 717  FDIPLAXXXXXXXXXXXPWERDERMVIRRVKKEKVVTAAESILDGILLERLRNEAAMIRK 896
                             PWE+DE+    +VK+EK VTAAE  LD  LL RLRNEAA +R 
Sbjct: 143  --------------GRMPWEKDEKFGFVKVKREKAVTAAELTLDKALLRRLRNEAARMRT 188

Query: 897  WVKVKKAGVTQAVVDQVHFIWRNNELALLKFDIPLCQNMDRAQEIVEMKTGGVVVWRNKD 1076
            W+KVKKAGVTQ VVDQ+   WR NELA++KFDIPLC+NMDRA+EIVE KTGG+VV   KD
Sbjct: 189  WIKVKKAGVTQDVVDQIKRTWRRNELAMIKFDIPLCRNMDRAREIVETKTGGLVVLSKKD 248

Query: 1077 FLAVYRGRIYESGSKHIRNIRRN 1145
            FL VYRG  ++  +K   ++R N
Sbjct: 249  FLVVYRGCNHQLTTKGSPSLRTN 271


>ref|XP_006603055.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X2 [Glycine max]
            gi|571550194|ref|XP_006603056.1| PREDICTED: chloroplastic
            group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X3 [Glycine max]
            gi|571550197|ref|XP_006603057.1| PREDICTED: chloroplastic
            group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X4 [Glycine max]
          Length = 747

 Score =  198 bits (504), Expect = 4e-48
 Identities = 135/323 (41%), Positives = 177/323 (54%), Gaps = 6/323 (1%)
 Frame = +3

Query: 195  FLSSNPSKYN------FITFSSPPSNNRSNGTKIERKSAKYETHDSYTKSSESTPHSGST 356
            FLS +PS +        I+ S PP++N  NG            H+    S    P     
Sbjct: 3    FLSFHPSLFPNSYSRFHISSSLPPNSN--NG------------HNHQHTSPSQVP----- 43

Query: 357  IKAPTAPWMSKPLLVKPNEIMEFTKRRTKKDFTIGENGEHPDMALTGKVGGARGKVAMKK 536
            IK+PT PWM  PLL++P+E+++ +  ++KK     E  E  D AL GK    RGK AMKK
Sbjct: 44   IKSPTPPWMKVPLLLQPHELVDLSNPKSKKFKP--EKHELSDKALMGKE--VRGKRAMKK 99

Query: 537  IFKGIEKLQETHDLEEISQTPENLKFKFAPGDLWGNGVYESDAEVQEYSEEVQLSLESTE 716
            I   +EKL +T +  E      N++           G Y    E+ + +EEV+       
Sbjct: 100  IVDRVEKLHKTQNSNETRVDSLNVE---------NFGGY---LEILKENEEVRSK----- 142

Query: 717  FDIPLAXXXXXXXXXXXPWERDERMVIRRVKKEKVVTAAESILDGILLERLRNEAAMIRK 896
                             PWE+DE+    +VK+EK VTAAE  LD  LL RLRNEAA +R 
Sbjct: 143  --------------GRMPWEKDEKFGFVKVKREKAVTAAELTLDKALLRRLRNEAARMRT 188

Query: 897  WVKVKKAGVTQAVVDQVHFIWRNNELALLKFDIPLCQNMDRAQEIVEMKTGGVVVWRNKD 1076
            W+KVKKAGVTQ VVDQ+   WR NELA++KFDIPLC+NMDRA+EIVE KTGG+VV   KD
Sbjct: 189  WIKVKKAGVTQDVVDQIKRTWRRNELAMIKFDIPLCRNMDRAREIVETKTGGLVVLSKKD 248

Query: 1077 FLAVYRGRIYESGSKHIRNIRRN 1145
            FL VYRG  ++  +K   ++R N
Sbjct: 249  FLVVYRGCNHQLTTKGSPSLRTN 271


>ref|XP_006603054.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X1 [Glycine max]
          Length = 750

 Score =  198 bits (504), Expect = 4e-48
 Identities = 135/323 (41%), Positives = 177/323 (54%), Gaps = 6/323 (1%)
 Frame = +3

Query: 195  FLSSNPSKYN------FITFSSPPSNNRSNGTKIERKSAKYETHDSYTKSSESTPHSGST 356
            FLS +PS +        I+ S PP++N  NG            H+    S    P     
Sbjct: 3    FLSFHPSLFPNSYSRFHISSSLPPNSN--NG------------HNHQHTSPSQVP----- 43

Query: 357  IKAPTAPWMSKPLLVKPNEIMEFTKRRTKKDFTIGENGEHPDMALTGKVGGARGKVAMKK 536
            IK+PT PWM  PLL++P+E+++ +  ++KK     E  E  D AL GK    RGK AMKK
Sbjct: 44   IKSPTPPWMKVPLLLQPHELVDLSNPKSKKFKP--EKHELSDKALMGKE--VRGKRAMKK 99

Query: 537  IFKGIEKLQETHDLEEISQTPENLKFKFAPGDLWGNGVYESDAEVQEYSEEVQLSLESTE 716
            I   +EKL +T +  E      N++           G Y    E+ + +EEV+       
Sbjct: 100  IVDRVEKLHKTQNSNETRVDSLNVE---------NFGGY---LEILKENEEVRSK----- 142

Query: 717  FDIPLAXXXXXXXXXXXPWERDERMVIRRVKKEKVVTAAESILDGILLERLRNEAAMIRK 896
                             PWE+DE+    +VK+EK VTAAE  LD  LL RLRNEAA +R 
Sbjct: 143  --------------GRMPWEKDEKFGFVKVKREKAVTAAELTLDKALLRRLRNEAARMRT 188

Query: 897  WVKVKKAGVTQAVVDQVHFIWRNNELALLKFDIPLCQNMDRAQEIVEMKTGGVVVWRNKD 1076
            W+KVKKAGVTQ VVDQ+   WR NELA++KFDIPLC+NMDRA+EIVE KTGG+VV   KD
Sbjct: 189  WIKVKKAGVTQDVVDQIKRTWRRNELAMIKFDIPLCRNMDRAREIVETKTGGLVVLSKKD 248

Query: 1077 FLAVYRGRIYESGSKHIRNIRRN 1145
            FL VYRG  ++  +K   ++R N
Sbjct: 249  FLVVYRGCNHQLTTKGSPSLRTN 271


>ref|XP_006400150.1| hypothetical protein EUTSA_v10012836mg [Eutrema salsugineum]
            gi|557101240|gb|ESQ41603.1| hypothetical protein
            EUTSA_v10012836mg [Eutrema salsugineum]
          Length = 698

 Score =  197 bits (500), Expect = 1e-47
 Identities = 128/326 (39%), Positives = 183/326 (56%), Gaps = 3/326 (0%)
 Frame = +3

Query: 195  FLSSNPSKYNFITFSSPPSNNRSNGTKIERKSAKYETHDSYTKSSESTPHSGSTIKAPT- 371
            FLS+          S  PS N SN +KI +          + + SE++  S + IK PT 
Sbjct: 6    FLSARAFPSLITNSSLNPSRNPSNASKIPQ----------FDQFSENSRFSDAPIKVPTT 55

Query: 372  APWMSKPLLVKPNEIMEFTKRRTKKDFTIGENGEHPDM-ALTGKVGGARGKVAMKKIFKG 548
            APWM  PLL++P+E+++ + R  K+    G+N E     AL  +  G RG  AMKKI + 
Sbjct: 56   APWMKGPLLIRPDEVLDTSYRHEKR--IRGQNAEEKTFKALNRRESGVRGSKAMKKIVRK 113

Query: 549  IEKLQETHDLEEISQTPENLKFKFAPGDLWGNGVYESDAEVQEYSEEVQLSLESTEFDIP 728
            +EKL+E  D EE +Q    ++FK          + E     + +S ++            
Sbjct: 114  VEKLEENSDSEE-TQMDNPVEFKSL------GRIAEETESGKRFSGKM------------ 154

Query: 729  LAXXXXXXXXXXXPWERDE-RMVIRRVKKEKVVTAAESILDGILLERLRNEAAMIRKWVK 905
                         PW+R+E + ++RR KKE+  TAA+ ILD  LL+RLR EA+ +RKWV 
Sbjct: 155  -------------PWDREEEKFILRRTKKERAPTAADLILDEGLLKRLRREASKMRKWVN 201

Query: 906  VKKAGVTQAVVDQVHFIWRNNELALLKFDIPLCQNMDRAQEIVEMKTGGVVVWRNKDFLA 1085
            V+KAGVT+ VV+ +  IWR NELA+++FD+PLC+NM+RAQEI+EMKTGG+VV   K+FL 
Sbjct: 202  VRKAGVTETVVNDIRSIWRLNELAMVRFDVPLCRNMERAQEIIEMKTGGLVVLSKKEFLV 261

Query: 1086 VYRGRIYESGSKHIRNIRRNSVGDRE 1163
            VYRG    S  K    I  +S+ +RE
Sbjct: 262  VYRGPPSYSSEKTNSEI-NSSLYERE 286


>ref|NP_197122.2| chloroplast splicing factor CRS1-like protein [Arabidopsis thaliana]
            gi|374095377|sp|Q9LF10.2|CRS1_ARATH RecName:
            Full=Chloroplastic group IIA intron splicing facilitator
            CRS1, chloroplastic; AltName: Full=Chloroplastic RNA
            splicing factor 1; AltName: Full=Protein CHLOROPLAST RNA
            SPLICING 1; Flags: Precursor gi|332004875|gb|AED92258.1|
            chloroplast splicing factor CRS1-like protein
            [Arabidopsis thaliana]
          Length = 720

 Score =  194 bits (492), Expect = 9e-47
 Identities = 119/301 (39%), Positives = 164/301 (54%), Gaps = 2/301 (0%)
 Frame = +3

Query: 201  SSNPSKYNFITF-SSPPSNNRSNGTKIERKSAKYETHDSYTKSSESTPHSGSTIKAPTAP 377
            S NP    F    +S   + R+  + I   S++      + +  E+   S + IK PTAP
Sbjct: 28   SDNPKLQTFTQMLNSLFLSARAFPSLITNSSSRRAKSSQFDQFRENRGVSDAAIKVPTAP 87

Query: 378  WMSKPLLVKPNEIMEFTKRRTKKDFTIGENGEHPDMALTGKVGGARGKVAMKKIFKGIEK 557
            WM  PLL++P+EI++  KR   +        E    AL  +  G RGK AMKKI + +EK
Sbjct: 88   WMKGPLLLRPDEILDTKKRNKPRKVE-----EKTFKALNRRESGVRGKKAMKKIVRNVEK 142

Query: 558  LQETHDLEEISQTPENLKFKFAPGDLWGNGVYESDAEVQEYSEEVQLSLESTEFDIPLAX 737
            L E  D EE                         D    EY   ++  +ES +       
Sbjct: 143  LDEDSDSEETQM---------------------DDLSEFEYLGRIEEKVESKD------- 174

Query: 738  XXXXXXXXXXPWERDE-RMVIRRVKKEKVVTAAESILDGILLERLRNEAAMIRKWVKVKK 914
                      PWER+E R ++RR+KKE V T AE ILD  LL RLR EA+ +RKWV V+K
Sbjct: 175  ----RFGGKMPWEREEERFILRRMKKESVPTTAELILDEGLLNRLRREASKMRKWVNVRK 230

Query: 915  AGVTQAVVDQVHFIWRNNELALLKFDIPLCQNMDRAQEIVEMKTGGVVVWRNKDFLAVYR 1094
            AGVT+ VV+++  +W+ NELA+++FD+PLC+NM+RAQEI+EMKTGG+VV   K+FL VYR
Sbjct: 231  AGVTELVVNKIKSMWKLNELAMVRFDVPLCRNMERAQEIIEMKTGGLVVLSKKEFLVVYR 290

Query: 1095 G 1097
            G
Sbjct: 291  G 291

