BLASTX nr result

ID: Sinomenium22_contig00020798 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00020798
         (1590 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002280704.1| PREDICTED: chloroplastic group IIA intron sp...   514   e-143
ref|XP_006482225.1| PREDICTED: chloroplastic group IIA intron sp...   450   e-124
ref|XP_007217313.1| hypothetical protein PRUPE_ppa016241mg [Prun...   444   e-122
ref|XP_006430740.1| hypothetical protein CICLE_v10013368mg [Citr...   442   e-121
ref|XP_007033221.1| maize chloroplast splicing factor CRS1, puta...   434   e-119
ref|XP_007033220.1| maize chloroplast splicing factor CRS1, puta...   434   e-119
ref|XP_007033219.1| maize chloroplast splicing factor CRS1, puta...   434   e-119
ref|XP_007033218.1| maize chloroplast splicing factor CRS1, puta...   434   e-119
ref|XP_007033217.1| maize chloroplast splicing factor CRS1, puta...   434   e-119
ref|XP_006357840.1| PREDICTED: chloroplastic group IIA intron sp...   421   e-115
ref|XP_002516757.1| conserved hypothetical protein [Ricinus comm...   421   e-115
ref|XP_006603058.1| PREDICTED: chloroplastic group IIA intron sp...   421   e-115
ref|XP_006603055.1| PREDICTED: chloroplastic group IIA intron sp...   421   e-115
ref|XP_006603054.1| PREDICTED: chloroplastic group IIA intron sp...   421   e-115
ref|XP_006842364.1| hypothetical protein AMTR_s00079p00185530 [A...   419   e-114
gb|EXC20503.1| Chloroplastic group IIA intron splicing facilitat...   408   e-111
ref|XP_004507538.1| PREDICTED: chloroplastic group IIA intron sp...   398   e-108
ref|XP_007139175.1| hypothetical protein PHAVU_008G007700g [Phas...   397   e-107
ref|XP_004138635.1| PREDICTED: chloroplastic group IIA intron sp...   383   e-103
gb|EYU44617.1| hypothetical protein MIMGU_mgv1a026522mg, partial...   380   e-102

>ref|XP_002280704.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Vitis vinifera]
          Length = 1184

 Score =  514 bits (1325), Expect = e-143
 Identities = 284/510 (55%), Positives = 353/510 (69%), Gaps = 8/510 (1%)
 Frame = -2

Query: 1508 FLSPQSLPSNANISTLSSSTHFQNPLKTPPKQLHNFNKPLKILASHTDITPSPNSIEKPQ 1329
            FLS   +P+++   + S+S    +     P+++H+F KP  I A+ T  T          
Sbjct: 6    FLSLSPIPNHSQFPSNSNSLSNSSIRILNPQRIHSF-KPPPISATTTATT---------- 54

Query: 1328 PSNPNFSFLSEAFQSTNDTIKVPSAPWMTGPLLLPSNEVLNLSTLRTKRGKNRVQGAERT 1149
             ++P+ S  S+    T+  IK+P+APWM GPLLL  NEVL+LS  R K+      GAE+ 
Sbjct: 55   -NHPDHSISSQPVSGTDAAIKMPTAPWMKGPLLLQPNEVLDLSKARPKKVAGSA-GAEKP 112

Query: 1148 DLSLTDKISGGRGRRAMRRIVESITKLQELANSEEAQKGVEKFELRVPLKPIFEDGNSNS 969
            D SLT+K+SGGRG +AM++I++SI KLQE   S+E Q+  E+FE  V L+ I  D NS  
Sbjct: 113  DRSLTEKVSGGRGAKAMKKIMQSIVKLQETHTSDETQENTEEFEFGVSLEGIGGDENSRI 172

Query: 968  EARMPWMAEEKIVFRRMKKERVLTKAELSLSETVLKRLRNNAVKMTKWVKVKKAGVTDAV 789
              +MPW+  EK+VFRR KKE+V+T AEL+L   +L+RLR  AVKM KWVKVKKAGVT++V
Sbjct: 173  GGKMPWLKTEKVVFRRTKKEKVVTAAELTLDPMLLERLRGEAVKMRKWVKVKKAGVTESV 232

Query: 788  VDEIKRTWSGNELVMVKFYIPLCRNMDRAREIIELKTGGLVIWSKKDTHVVYRGCNYELS 609
            VD+I   W  +EL MVKF +PLCRNMDRAREI+E+KT GLVIWSKKDT VVYRG NY+ +
Sbjct: 233  VDQIHMVWKSDELAMVKFDMPLCRNMDRAREILEIKTRGLVIWSKKDTLVVYRGSNYQST 292

Query: 608  SKACQKLHSEHGHVVEASPSDDIFMES--EDEANISYIQPNEVT----QDGKDSISNSCP 447
            SK  QK+    G V  A  S+    +S  ED+  IS I+ +E T       KD   +S P
Sbjct: 293  SKHFQKMRP--GLVAGADASNSKLNQSNFEDDLTISEIKFHESTTGEKMGRKDGEEDSSP 350

Query: 446  TS--MEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWWPKPLPIDADLLPEVVPGYMTP 273
            T   MEE V  Q V+G+LYEREADRLL+GLGPRF+DWW PKPLP+DADLLPEV+PG+  P
Sbjct: 351  TGIFMEEMVDSQPVNGSLYEREADRLLDGLGPRFIDWWRPKPLPVDADLLPEVLPGFRPP 410

Query: 272  FRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKLQGLATAMIKLWEKSLIAKIAV 93
            FR   PQ R KLTDDELT LRK A  LPTHF LGRN KLQGLA A++KLWEKSLI KIA+
Sbjct: 411  FRLSPPQTRSKLTDDELTYLRKLAYALPTHFVLGRNRKLQGLAAAILKLWEKSLIVKIAI 470

Query: 92   KWGIPNTNNELMAQEIKHLTGGVLILRNKF 3
            KWGIPNT NE MA E+K LTGGVL+LRNKF
Sbjct: 471  KWGIPNTKNEQMANELKCLTGGVLLLRNKF 500


>ref|XP_006482225.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X1 [Citrus sinensis]
            gi|568857343|ref|XP_006482226.1| PREDICTED: chloroplastic
            group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X2 [Citrus sinensis]
          Length = 771

 Score =  450 bits (1157), Expect = e-124
 Identities = 249/461 (54%), Positives = 313/461 (67%), Gaps = 13/461 (2%)
 Frame = -2

Query: 1346 SIEKPQPSNPNFSFLSEAFQSTNDTIK-VPSAPWMTGPLLLPSNEVLNLSTLRTKRGKNR 1170
            S   P  ++P    L +   S N  IK +P+APWM  P++L  +E++  S  +TK+    
Sbjct: 27   SSSNPNQNSPKTLKLPDIKLSPNAPIKKMPTAPWMRSPIVLQPDEIIKPSKPKTKKS--- 83

Query: 1169 VQGAERTDLSLTDKISGGRGRRAMRRIVESITKLQELANSEEAQKGV-EKFELRVPLKPI 993
                ++TD  LT K SG RG++AM++I+E+I KLQ+    +E QK V EKFE     K  
Sbjct: 84   ---FKKTDKGLTAKESGVRGKQAMKKIIENIEKLQKDQILDETQKKVMEKFEF----KGC 136

Query: 992  FEDGNSNSE-------ARMPWMAEEKIVFRRMKKERVLTKAELSLSETVLKRLRNNAVKM 834
            FE+  S+ E        ++PW+ E++ VFRRMKKER++TKAE  L   +L+RL++ A KM
Sbjct: 137  FEENVSHEEDLRGGFGGKVPWLREDRFVFRRMKKERMVTKAETMLDGELLERLKDEARKM 196

Query: 833  TKWVKVKKAGVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDRAREIIELKTGGLVIWSK 654
             KWVKVKKAGVT++VV EI+  W  NEL MVKF +PLCRNMDRAREI+ELKTGGLVIW+K
Sbjct: 197  RKWVKVKKAGVTESVVFEIRLAWRRNELAMVKFDVPLCRNMDRAREILELKTGGLVIWTK 256

Query: 653  KDTHVVYRGCNYELSSKACQKLHSEHGHVVEASPSDDIFMESEDEANISYIQPNEVTQDG 474
            KD HVVYRG + + S K C +   +     EA  S    +  E + N+S+I+ N  T D 
Sbjct: 257  KDAHVVYRGDSSKSSVKMCPRSADDQ----EAPLSKSTHLHLEKKVNVSWIKSNTATLDQ 312

Query: 473  ----KDSISNSCPTSMEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWWPKPLPIDADL 306
                KD   NS PTS+  +  L+ +  +LYERE DRLL+GLGPRFVDWW  KPLP+D DL
Sbjct: 313  NRSLKDGEENSLPTSIFMDKNLR-IDKSLYEREGDRLLDGLGPRFVDWWMWKPLPVDGDL 371

Query: 305  LPEVVPGYMTPFRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKLQGLATAMIKL 126
            LPEVVPG+  PFR   P  R KLTDDELT LRK A  LPTHF LGRN  LQGLATA++KL
Sbjct: 372  LPEVVPGFKPPFRLSPPDARSKLTDDELTYLRKLAHPLPTHFVLGRNRGLQGLATAILKL 431

Query: 125  WEKSLIAKIAVKWGIPNTNNELMAQEIKHLTGGVLILRNKF 3
            WEKSL+AKI VKWGIPNT+NE MA E+KHLTGGVL+LRNKF
Sbjct: 432  WEKSLVAKITVKWGIPNTDNEQMANELKHLTGGVLLLRNKF 472


>ref|XP_007217313.1| hypothetical protein PRUPE_ppa016241mg [Prunus persica]
            gi|462413463|gb|EMJ18512.1| hypothetical protein
            PRUPE_ppa016241mg [Prunus persica]
          Length = 809

 Score =  444 bits (1141), Expect = e-122
 Identities = 254/530 (47%), Positives = 326/530 (61%), Gaps = 40/530 (7%)
 Frame = -2

Query: 1472 ISTLSSSTHFQNPLKTPPKQLHNFNKPLKILASHTDITPSP-NSIEKPQPSN-------- 1320
            +STL + TH       PP   H++N     L S     P P N I    P++        
Sbjct: 10   LSTLPNITHHLPSHSNPP--FHSYNPISSALNSKPPQNPKPTNPIPSKSPNSLSLSSTTT 67

Query: 1319 -PNFSFLSEAFQSTNDTIKVPSAPWMTGPLLLPSNEVLNLSTLRTKRGKNRVQGAERTDL 1143
             PN    SE   ST+  IK P+APWM GPLLL  +EV++ S  R K+  N  + AE+ D 
Sbjct: 68   TPNSKAPSEPNSSTDACIKAPTAPWMKGPLLLQPHEVIDFSKPRNKKTHNNAK-AEKPDT 126

Query: 1142 SLTDKISGGRGRRAMRRIVESITKLQELANSEEAQKGVEKFELRVPLKPIFED------- 984
             L  K+ G RG +A+++IV+SI +L     ++E QKG  +F +   L+ + ++       
Sbjct: 127  VLAGKLVGIRGDKAIKQIVQSIERLGPNQKTDETQKGFGEFRIWDSLEGLGQNEKWDETH 186

Query: 983  ------------------GNSNSEARMPWMAEEKIVFRRMKKERVLTKAELSLSETVLKR 858
                               +S    +MPW  +E+IVF+R+KK+RV + AELSL + +L+R
Sbjct: 187  KDFVEFGIGGCLEGLGKAADSRFGGKMPWERDERIVFQRIKKKRVASAAELSLEKELLER 246

Query: 857  LRNNAVKMTKWVKVKKAGVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDRAREIIELKT 678
            LR  A KM KWVKVKKAGVT A+VD+IK  W  NEL MVKF +PLCRNM RA+EI+E KT
Sbjct: 247  LRAEAAKMRKWVKVKKAGVTQAIVDDIKFIWKTNELAMVKFDVPLCRNMHRAQEIVETKT 306

Query: 677  GGLVIWSKKDTHVVYRGCNYELSSKACQKLHSEHGHVVEASPSDDIFMESEDEANISYIQ 498
            GG+V+W KKDT V+YRGCNY+ SSK   K+        E   SD +  + E+ ++  Y  
Sbjct: 307  GGMVVWGKKDTLVIYRGCNYQSSSKFFPKMRPCSADRQETLSSDHMQPDLEENSSYQYKS 366

Query: 497  ---PNEVTQDGKDSISNSCP--TSMEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWWP 333
               P +     KD+  +     T  E ++  Q  S +LYE+EADRLL+GLGPRF+DWW  
Sbjct: 367  FESPVDEKMSRKDAEEDCIQSGTFQETSMSCQPTSRSLYEKEADRLLDGLGPRFIDWWMH 426

Query: 332  KPLPIDADLLPEVVPGYMTPFRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKLQ 153
            KPLP+DADLLPEVVPG+  P RRC P  R KLTDDELT LRKFAR LPTHF LGRN KLQ
Sbjct: 427  KPLPVDADLLPEVVPGFKAPIRRCPPHTRSKLTDDELTFLRKFARSLPTHFVLGRNRKLQ 486

Query: 152  GLATAMIKLWEKSLIAKIAVKWGIPNTNNELMAQEIKHLTGGVLILRNKF 3
            GLA A++KLWEKSLIAKIAVK+G+PNTNNE MA E++     VLILRNKF
Sbjct: 487  GLAAAILKLWEKSLIAKIAVKFGVPNTNNEQMAYELR---ARVLILRNKF 533


>ref|XP_006430740.1| hypothetical protein CICLE_v10013368mg [Citrus clementina]
            gi|557532797|gb|ESR43980.1| hypothetical protein
            CICLE_v10013368mg [Citrus clementina]
          Length = 770

 Score =  442 bits (1138), Expect = e-121
 Identities = 249/474 (52%), Positives = 313/474 (66%), Gaps = 26/474 (5%)
 Frame = -2

Query: 1346 SIEKPQPSNPNFSFLSEAFQSTNDTIK-VPSAPWMTGPLLLPSNEVLNLSTLRTKRGKNR 1170
            S   P  ++P    L +   S N  IK +P+APWM  P++L  +E++  S  +TK+    
Sbjct: 33   SSSNPNQNSPKTLKLPDIKLSPNAPIKKMPTAPWMRSPIVLQPDEIIKPSKPKTKKS--- 89

Query: 1169 VQGAERTDLSLTDKISGGRGRRAMRRIVESITKLQELANSEEAQK-GVEKFELRVPLKPI 993
                ++TD  LT K SG RG++AM++I+E+I KLQ+    +E QK  +EKFE R      
Sbjct: 90   ---FKKTDKGLTAKESGVRGKQAMKKIIENIEKLQKDQILDETQKKDMEKFEFR----GC 142

Query: 992  FEDGNSNSE-------ARMPWMAEEKIVFRRMKKERVLTKAELSLSETVLKRLRNNAVKM 834
            FE+  S+ E        ++PW+ EE+ VFRRMKKER++TKAE  L   +++RL++ A KM
Sbjct: 143  FEENGSDEEDLRGGFGGKVPWLREERFVFRRMKKERMVTKAETMLDGELIERLKDEARKM 202

Query: 833  TKWVKVKKAGVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDRAREIIELKTGGLVIWSK 654
             KWVKVKKAGVT++VV EI+  W  NEL MVKF +PLCRNMDRAREI+ELKTGGLVIW+K
Sbjct: 203  RKWVKVKKAGVTESVVFEIRLAWRRNELAMVKFDVPLCRNMDRAREILELKTGGLVIWTK 262

Query: 653  KDTHVVYRGCNYELSSKACQKLHSEHGHVVEASPSDDIFMESEDEANISYIQPNEVTQDG 474
            KD HVVYRG   + S K C +   +     EA  S    +  E + N+S+I+ N  T D 
Sbjct: 263  KDAHVVYRGDGSKSSVKMCPRSADDQ----EAPLSKSTHLHLEKKVNVSWIKSNTATLDQ 318

Query: 473  ----KDSISNSCPTSMEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWWPKPLPIDADL 306
                KD   NS PTS+  +  L+ +  +LYERE DRLL+GLGPRFVDWW  KPLP+D DL
Sbjct: 319  NRSLKDGEENSLPTSIFMDKNLR-IDKSLYEREGDRLLDGLGPRFVDWWMWKPLPVDGDL 377

Query: 305  LPEVVPGYMTPFRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKLQGLATAMIKL 126
            LPEVVPG+  PFR   P  R KLTDDELT LRK A  LPTHF LGRN  LQGLATA++KL
Sbjct: 378  LPEVVPGFKPPFRLSPPDARSKLTDDELTYLRKLAHPLPTHFVLGRNRGLQGLATAILKL 437

Query: 125  WEKSLIAKIAVKWGIPNTNNELMAQEIK-------------HLTGGVLILRNKF 3
            WEKSL+AKIAVKWGIPNT+NE MA E+K             HLTGGVL+LRNKF
Sbjct: 438  WEKSLVAKIAVKWGIPNTDNEQMANELKNFKFSDDGVLLMQHLTGGVLLLRNKF 491


>ref|XP_007033221.1| maize chloroplast splicing factor CRS1, putative isoform 5 [Theobroma
            cacao] gi|508712250|gb|EOY04147.1| maize chloroplast
            splicing factor CRS1, putative isoform 5 [Theobroma
            cacao]
          Length = 788

 Score =  434 bits (1117), Expect = e-119
 Identities = 243/468 (51%), Positives = 310/468 (66%), Gaps = 9/468 (1%)
 Frame = -2

Query: 1379 ASHTDITPSPNSIEKPQPSNPNFSFLSEAFQSTNDTIKVPSAPWMTGPLLLPSNEVLNLS 1200
            +S  + + +P+   K   S  N S  S +    N  IK+P+APWM GPLLL  +EVLN S
Sbjct: 52   SSSLNSSQNPSKTHKENRSLNNNSKFSVSKDPNNGPIKMPTAPWMKGPLLLQPHEVLNPS 111

Query: 1199 TLRTKRGKNRVQGAERTDLSLTDKISGGRGRRAMRRIVESITKLQELANSEEAQKGV-EK 1023
               +K+  N    A+  D +L  K SG RG++ M++I+ ++  LQ     E+ Q G+ E+
Sbjct: 112  KSTSKKSSN--SKAKAPDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLEDTQIGIREE 169

Query: 1022 FELRVPLKPIFEDGN-SNSEARMPWMAEE-KIVFRRMKKERVLTKAELSLSETVLKRLRN 849
            FE+   L+    DG     + +MPW+ EE K+VFRRMKKE++LT+AE+SL + +L+RLR 
Sbjct: 170  FEVGNWLEEFGSDGEVKRFDGKMPWLREEEKVVFRRMKKEKLLTQAEISLDKDLLERLRR 229

Query: 848  NAVKMTKWVKVKKAGVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDRAREIIELKTGGL 669
             A++M KW+KV K GVT AVVDEIK  W  NELVMVKF +PLCRNMDRAREIIE+KT GL
Sbjct: 230  KAMRMRKWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCRNMDRAREIIEMKTRGL 289

Query: 668  VIWSKKDTHVVYRGCNYELSSKACQKLHSEHGHVVEASPSDDIFMESEDEANISYIQPNE 489
            V+W KKD  VVYRGC++ L+SK     +       E S S    + S +  N+S  + N 
Sbjct: 290  VVWGKKDALVVYRGCSHGLTSKISSMKYPRCADGQEISSSTFSHLTSSNNINMSLEKFNG 349

Query: 488  VT-QDG---KDSISNSCPTS--MEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWWPKP 327
             T Q G   +D    S P +  M+E+   Q V G+LYERE DRLL+GLGPRF+DWW  KP
Sbjct: 350  STLQSGLYREDREKESMPINIFMKEDENNQPVIGSLYERETDRLLDGLGPRFIDWWMRKP 409

Query: 326  LPIDADLLPEVVPGYMTPFRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKLQGL 147
            LPIDADLLPE VPG+  P R   P  RP LTDDEL  LRK    LP HFALG+N  LQGL
Sbjct: 410  LPIDADLLPEEVPGFRPPLRLSPPNTRPNLTDDELKYLRKLTHPLPFHFALGKNRNLQGL 469

Query: 146  ATAMIKLWEKSLIAKIAVKWGIPNTNNELMAQEIKHLTGGVLILRNKF 3
            A A++KLWEKSLIAKIA+KWGI NT+NE MA E+K+LTGGVL++RNKF
Sbjct: 470  AAAILKLWEKSLIAKIAIKWGIQNTDNEQMAYELKNLTGGVLLVRNKF 517


>ref|XP_007033220.1| maize chloroplast splicing factor CRS1, putative isoform 4 [Theobroma
            cacao] gi|508712249|gb|EOY04146.1| maize chloroplast
            splicing factor CRS1, putative isoform 4 [Theobroma
            cacao]
          Length = 767

 Score =  434 bits (1117), Expect = e-119
 Identities = 243/468 (51%), Positives = 310/468 (66%), Gaps = 9/468 (1%)
 Frame = -2

Query: 1379 ASHTDITPSPNSIEKPQPSNPNFSFLSEAFQSTNDTIKVPSAPWMTGPLLLPSNEVLNLS 1200
            +S  + + +P+   K   S  N S  S +    N  IK+P+APWM GPLLL  +EVLN S
Sbjct: 26   SSSLNSSQNPSKTHKENRSLNNNSKFSVSKDPNNGPIKMPTAPWMKGPLLLQPHEVLNPS 85

Query: 1199 TLRTKRGKNRVQGAERTDLSLTDKISGGRGRRAMRRIVESITKLQELANSEEAQKGV-EK 1023
               +K+  N    A+  D +L  K SG RG++ M++I+ ++  LQ     E+ Q G+ E+
Sbjct: 86   KSTSKKSSN--SKAKAPDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLEDTQIGIREE 143

Query: 1022 FELRVPLKPIFEDGN-SNSEARMPWMAEE-KIVFRRMKKERVLTKAELSLSETVLKRLRN 849
            FE+   L+    DG     + +MPW+ EE K+VFRRMKKE++LT+AE+SL + +L+RLR 
Sbjct: 144  FEVGNWLEEFGSDGEVKRFDGKMPWLREEEKVVFRRMKKEKLLTQAEISLDKDLLERLRR 203

Query: 848  NAVKMTKWVKVKKAGVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDRAREIIELKTGGL 669
             A++M KW+KV K GVT AVVDEIK  W  NELVMVKF +PLCRNMDRAREIIE+KT GL
Sbjct: 204  KAMRMRKWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCRNMDRAREIIEMKTRGL 263

Query: 668  VIWSKKDTHVVYRGCNYELSSKACQKLHSEHGHVVEASPSDDIFMESEDEANISYIQPNE 489
            V+W KKD  VVYRGC++ L+SK     +       E S S    + S +  N+S  + N 
Sbjct: 264  VVWGKKDALVVYRGCSHGLTSKISSMKYPRCADGQEISSSTFSHLTSSNNINMSLEKFNG 323

Query: 488  VT-QDG---KDSISNSCPTS--MEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWWPKP 327
             T Q G   +D    S P +  M+E+   Q V G+LYERE DRLL+GLGPRF+DWW  KP
Sbjct: 324  STLQSGLYREDREKESMPINIFMKEDENNQPVIGSLYERETDRLLDGLGPRFIDWWMRKP 383

Query: 326  LPIDADLLPEVVPGYMTPFRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKLQGL 147
            LPIDADLLPE VPG+  P R   P  RP LTDDEL  LRK    LP HFALG+N  LQGL
Sbjct: 384  LPIDADLLPEEVPGFRPPLRLSPPNTRPNLTDDELKYLRKLTHPLPFHFALGKNRNLQGL 443

Query: 146  ATAMIKLWEKSLIAKIAVKWGIPNTNNELMAQEIKHLTGGVLILRNKF 3
            A A++KLWEKSLIAKIA+KWGI NT+NE MA E+K+LTGGVL++RNKF
Sbjct: 444  AAAILKLWEKSLIAKIAIKWGIQNTDNEQMAYELKNLTGGVLLVRNKF 491


>ref|XP_007033219.1| maize chloroplast splicing factor CRS1, putative isoform 3 [Theobroma
            cacao] gi|508712248|gb|EOY04145.1| maize chloroplast
            splicing factor CRS1, putative isoform 3 [Theobroma
            cacao]
          Length = 788

 Score =  434 bits (1117), Expect = e-119
 Identities = 243/468 (51%), Positives = 310/468 (66%), Gaps = 9/468 (1%)
 Frame = -2

Query: 1379 ASHTDITPSPNSIEKPQPSNPNFSFLSEAFQSTNDTIKVPSAPWMTGPLLLPSNEVLNLS 1200
            +S  + + +P+   K   S  N S  S +    N  IK+P+APWM GPLLL  +EVLN S
Sbjct: 26   SSSLNSSQNPSKTHKENRSLNNNSKFSVSKDPNNGPIKMPTAPWMKGPLLLQPHEVLNPS 85

Query: 1199 TLRTKRGKNRVQGAERTDLSLTDKISGGRGRRAMRRIVESITKLQELANSEEAQKGV-EK 1023
               +K+  N    A+  D +L  K SG RG++ M++I+ ++  LQ     E+ Q G+ E+
Sbjct: 86   KSTSKKSSN--SKAKAPDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLEDTQIGIREE 143

Query: 1022 FELRVPLKPIFEDGN-SNSEARMPWMAEE-KIVFRRMKKERVLTKAELSLSETVLKRLRN 849
            FE+   L+    DG     + +MPW+ EE K+VFRRMKKE++LT+AE+SL + +L+RLR 
Sbjct: 144  FEVGNWLEEFGSDGEVKRFDGKMPWLREEEKVVFRRMKKEKLLTQAEISLDKDLLERLRR 203

Query: 848  NAVKMTKWVKVKKAGVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDRAREIIELKTGGL 669
             A++M KW+KV K GVT AVVDEIK  W  NELVMVKF +PLCRNMDRAREIIE+KT GL
Sbjct: 204  KAMRMRKWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCRNMDRAREIIEMKTRGL 263

Query: 668  VIWSKKDTHVVYRGCNYELSSKACQKLHSEHGHVVEASPSDDIFMESEDEANISYIQPNE 489
            V+W KKD  VVYRGC++ L+SK     +       E S S    + S +  N+S  + N 
Sbjct: 264  VVWGKKDALVVYRGCSHGLTSKISSMKYPRCADGQEISSSTFSHLTSSNNINMSLEKFNG 323

Query: 488  VT-QDG---KDSISNSCPTS--MEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWWPKP 327
             T Q G   +D    S P +  M+E+   Q V G+LYERE DRLL+GLGPRF+DWW  KP
Sbjct: 324  STLQSGLYREDREKESMPINIFMKEDENNQPVIGSLYERETDRLLDGLGPRFIDWWMRKP 383

Query: 326  LPIDADLLPEVVPGYMTPFRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKLQGL 147
            LPIDADLLPE VPG+  P R   P  RP LTDDEL  LRK    LP HFALG+N  LQGL
Sbjct: 384  LPIDADLLPEEVPGFRPPLRLSPPNTRPNLTDDELKYLRKLTHPLPFHFALGKNRNLQGL 443

Query: 146  ATAMIKLWEKSLIAKIAVKWGIPNTNNELMAQEIKHLTGGVLILRNKF 3
            A A++KLWEKSLIAKIA+KWGI NT+NE MA E+K+LTGGVL++RNKF
Sbjct: 444  AAAILKLWEKSLIAKIAIKWGIQNTDNEQMAYELKNLTGGVLLVRNKF 491


>ref|XP_007033218.1| maize chloroplast splicing factor CRS1, putative isoform 2 [Theobroma
            cacao] gi|508712247|gb|EOY04144.1| maize chloroplast
            splicing factor CRS1, putative isoform 2 [Theobroma
            cacao]
          Length = 804

 Score =  434 bits (1117), Expect = e-119
 Identities = 243/468 (51%), Positives = 310/468 (66%), Gaps = 9/468 (1%)
 Frame = -2

Query: 1379 ASHTDITPSPNSIEKPQPSNPNFSFLSEAFQSTNDTIKVPSAPWMTGPLLLPSNEVLNLS 1200
            +S  + + +P+   K   S  N S  S +    N  IK+P+APWM GPLLL  +EVLN S
Sbjct: 52   SSSLNSSQNPSKTHKENRSLNNNSKFSVSKDPNNGPIKMPTAPWMKGPLLLQPHEVLNPS 111

Query: 1199 TLRTKRGKNRVQGAERTDLSLTDKISGGRGRRAMRRIVESITKLQELANSEEAQKGV-EK 1023
               +K+  N    A+  D +L  K SG RG++ M++I+ ++  LQ     E+ Q G+ E+
Sbjct: 112  KSTSKKSSN--SKAKAPDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLEDTQIGIREE 169

Query: 1022 FELRVPLKPIFEDGN-SNSEARMPWMAEE-KIVFRRMKKERVLTKAELSLSETVLKRLRN 849
            FE+   L+    DG     + +MPW+ EE K+VFRRMKKE++LT+AE+SL + +L+RLR 
Sbjct: 170  FEVGNWLEEFGSDGEVKRFDGKMPWLREEEKVVFRRMKKEKLLTQAEISLDKDLLERLRR 229

Query: 848  NAVKMTKWVKVKKAGVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDRAREIIELKTGGL 669
             A++M KW+KV K GVT AVVDEIK  W  NELVMVKF +PLCRNMDRAREIIE+KT GL
Sbjct: 230  KAMRMRKWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCRNMDRAREIIEMKTRGL 289

Query: 668  VIWSKKDTHVVYRGCNYELSSKACQKLHSEHGHVVEASPSDDIFMESEDEANISYIQPNE 489
            V+W KKD  VVYRGC++ L+SK     +       E S S    + S +  N+S  + N 
Sbjct: 290  VVWGKKDALVVYRGCSHGLTSKISSMKYPRCADGQEISSSTFSHLTSSNNINMSLEKFNG 349

Query: 488  VT-QDG---KDSISNSCPTS--MEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWWPKP 327
             T Q G   +D    S P +  M+E+   Q V G+LYERE DRLL+GLGPRF+DWW  KP
Sbjct: 350  STLQSGLYREDREKESMPINIFMKEDENNQPVIGSLYERETDRLLDGLGPRFIDWWMRKP 409

Query: 326  LPIDADLLPEVVPGYMTPFRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKLQGL 147
            LPIDADLLPE VPG+  P R   P  RP LTDDEL  LRK    LP HFALG+N  LQGL
Sbjct: 410  LPIDADLLPEEVPGFRPPLRLSPPNTRPNLTDDELKYLRKLTHPLPFHFALGKNRNLQGL 469

Query: 146  ATAMIKLWEKSLIAKIAVKWGIPNTNNELMAQEIKHLTGGVLILRNKF 3
            A A++KLWEKSLIAKIA+KWGI NT+NE MA E+K+LTGGVL++RNKF
Sbjct: 470  AAAILKLWEKSLIAKIAIKWGIQNTDNEQMAYELKNLTGGVLLVRNKF 517


>ref|XP_007033217.1| maize chloroplast splicing factor CRS1, putative isoform 1 [Theobroma
            cacao] gi|508712246|gb|EOY04143.1| maize chloroplast
            splicing factor CRS1, putative isoform 1 [Theobroma
            cacao]
          Length = 818

 Score =  434 bits (1117), Expect = e-119
 Identities = 243/468 (51%), Positives = 310/468 (66%), Gaps = 9/468 (1%)
 Frame = -2

Query: 1379 ASHTDITPSPNSIEKPQPSNPNFSFLSEAFQSTNDTIKVPSAPWMTGPLLLPSNEVLNLS 1200
            +S  + + +P+   K   S  N S  S +    N  IK+P+APWM GPLLL  +EVLN S
Sbjct: 52   SSSLNSSQNPSKTHKENRSLNNNSKFSVSKDPNNGPIKMPTAPWMKGPLLLQPHEVLNPS 111

Query: 1199 TLRTKRGKNRVQGAERTDLSLTDKISGGRGRRAMRRIVESITKLQELANSEEAQKGV-EK 1023
               +K+  N    A+  D +L  K SG RG++ M++I+ ++  LQ     E+ Q G+ E+
Sbjct: 112  KSTSKKSSN--SKAKAPDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLEDTQIGIREE 169

Query: 1022 FELRVPLKPIFEDGN-SNSEARMPWMAEE-KIVFRRMKKERVLTKAELSLSETVLKRLRN 849
            FE+   L+    DG     + +MPW+ EE K+VFRRMKKE++LT+AE+SL + +L+RLR 
Sbjct: 170  FEVGNWLEEFGSDGEVKRFDGKMPWLREEEKVVFRRMKKEKLLTQAEISLDKDLLERLRR 229

Query: 848  NAVKMTKWVKVKKAGVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDRAREIIELKTGGL 669
             A++M KW+KV K GVT AVVDEIK  W  NELVMVKF +PLCRNMDRAREIIE+KT GL
Sbjct: 230  KAMRMRKWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCRNMDRAREIIEMKTRGL 289

Query: 668  VIWSKKDTHVVYRGCNYELSSKACQKLHSEHGHVVEASPSDDIFMESEDEANISYIQPNE 489
            V+W KKD  VVYRGC++ L+SK     +       E S S    + S +  N+S  + N 
Sbjct: 290  VVWGKKDALVVYRGCSHGLTSKISSMKYPRCADGQEISSSTFSHLTSSNNINMSLEKFNG 349

Query: 488  VT-QDG---KDSISNSCPTS--MEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWWPKP 327
             T Q G   +D    S P +  M+E+   Q V G+LYERE DRLL+GLGPRF+DWW  KP
Sbjct: 350  STLQSGLYREDREKESMPINIFMKEDENNQPVIGSLYERETDRLLDGLGPRFIDWWMRKP 409

Query: 326  LPIDADLLPEVVPGYMTPFRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKLQGL 147
            LPIDADLLPE VPG+  P R   P  RP LTDDEL  LRK    LP HFALG+N  LQGL
Sbjct: 410  LPIDADLLPEEVPGFRPPLRLSPPNTRPNLTDDELKYLRKLTHPLPFHFALGKNRNLQGL 469

Query: 146  ATAMIKLWEKSLIAKIAVKWGIPNTNNELMAQEIKHLTGGVLILRNKF 3
            A A++KLWEKSLIAKIA+KWGI NT+NE MA E+K+LTGGVL++RNKF
Sbjct: 470  AAAILKLWEKSLIAKIAIKWGIQNTDNEQMAYELKNLTGGVLLVRNKF 517


>ref|XP_006357840.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Solanum tuberosum]
          Length = 802

 Score =  421 bits (1083), Expect = e-115
 Identities = 241/530 (45%), Positives = 316/530 (59%), Gaps = 39/530 (7%)
 Frame = -2

Query: 1475 NISTLSSSTHFQNPLKTPPKQLHNFNKPLKILASHTDITPSPNSIEKPQPSNPNFSFLSE 1296
            N  TL  S  F +   +   Q ++ N P+K                       N  F ++
Sbjct: 24   NQKTLLFSKSFNSKFTSFSSQYNDNNNPIK------------------NEEQYNLEFENQ 65

Query: 1295 AFQSTNDTIKVPSAPWMTGPLLLPSNEVLNLSTLRTKRGKNRVQGAERTDLSLTDKISGG 1116
             + S++  IK P+APWM GPLLL  N+ L+LS  R K+  N  +     D +L+ K+SGG
Sbjct: 66   DYGSSSSGIKGPTAPWMRGPLLLEPNQFLDLSKSRKKKDANFAKTQNPND-ALSGKVSGG 124

Query: 1115 RGRRAMRRIVESITKLQELANSEEAQKGVE-KFELRVPLKPIFEDGNSNSE--------- 966
            RG++AM+ I + I KLQE    E  Q   + K E + P   + E G+ + E         
Sbjct: 125  RGKKAMKMIYQGIDKLQETQIGEGTQVETDAKVEFQFPPGSLSEWGDVSYEIEEKNPYGE 184

Query: 965  ----------------------------ARMPWMAEEKIVFRRMKKERVLTKAELSLSET 870
                                         +MPW +E +IV+RRMKKE+V+  AE +L   
Sbjct: 185  EDNVESLEGVEFGVLSREGEGRGSRKIGVKMPWESEVRIVYRRMKKEKVVMTAESNLDAM 244

Query: 869  VLKRLRNNAVKMTKWVKVKKAGVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDRAREII 690
            +L+RLR  A ++ KWVKVKKAGVT  VVD+I   W  NEL M+KF +PLCRNMDRAREI+
Sbjct: 245  LLERLRGEAARIQKWVKVKKAGVTRTVVDQIHFIWKNNELAMLKFDLPLCRNMDRAREIV 304

Query: 689  ELKTGGLVIWSKKDTHVVYRGCNYELSSKACQKLHSEHGHVVEASPSDDIFMESEDEANI 510
            E+KTGG V+W K++  VVYRGC+Y L  K  Q       H    S  +  F E+  + +I
Sbjct: 305  EMKTGGFVVWMKQNALVVYRGCSYTLQQKELQ-------HDFLCSHQNSSFTENIKQTSI 357

Query: 509  -SYIQPNEVTQDGKDSISNSCPTSMEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWWP 333
             S +  +  ++D   S+ NS   S+       +++ +LY REA+RLL+ LGPR+VDWWWP
Sbjct: 358  FSPLNSSGSSEDEMISVGNSEEDSL-------AMNESLYVREANRLLDDLGPRYVDWWWP 410

Query: 332  KPLPIDADLLPEVVPGYMTPFRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKLQ 153
            KPLP++ADLLPEVVPG+  PFR C P+ R KLTDDELT LRK AR LPTHF LGRN KLQ
Sbjct: 411  KPLPVNADLLPEVVPGFKPPFRLCPPRSRSKLTDDELTQLRKLARSLPTHFVLGRNRKLQ 470

Query: 152  GLATAMIKLWEKSLIAKIAVKWGIPNTNNELMAQEIKHLTGGVLILRNKF 3
            GLA A++KLWEK  IAKIA+KWGIPNT+NELMA E+K+LTGGVL+LRNKF
Sbjct: 471  GLAAAVVKLWEKCHIAKIALKWGIPNTSNELMANELKYLTGGVLLLRNKF 520


>ref|XP_002516757.1| conserved hypothetical protein [Ricinus communis]
            gi|223544130|gb|EEF45655.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 742

 Score =  421 bits (1083), Expect = e-115
 Identities = 236/467 (50%), Positives = 296/467 (63%), Gaps = 16/467 (3%)
 Frame = -2

Query: 1355 SPNSIEKPQ---PSNPNFSFLSEAFQSTNDTIKVPSAPWMTGPLLLPSNEVLNLSTLRTK 1185
            S N+ + P+     N  F+ LS     +N  IKVP+APWM GPLLL  +E++NLS  R K
Sbjct: 27   SLNNAQNPKFATNKNTEFTLLSVPNSQSNAPIKVPTAPWMKGPLLLQPHELINLSKPRNK 86

Query: 1184 RGKNRVQGAERTDLSLTDKISGGRGRRAMRRIVESITKLQE----------LANSEEAQK 1035
               N     E++D  LT K SG RG++AM +IV+SI +LQE              E+ Q 
Sbjct: 87   NSSNNAN-IEKSDKVLTGKESGVRGKKAMEKIVKSIEQLQENQALEKTQCDSQAYEKTQL 145

Query: 1034 GVEKFELRVPLKPIFEDGNSNSEARM-PWMAEEKIVFRRMKKERVLTKAELSLSETVLKR 858
              E FE+   L  I E G+     ++ PW  EEK V+ R+KKE+ +TKAEL L + +L+ 
Sbjct: 146  DSEAFEIGEKLGLIREHGDFGVNKKLKPWEREEKFVYWRIKKEKAVTKAELILEKELLEI 205

Query: 857  LRNNAVKMTKWVKVKKAGVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDRAREIIELKT 678
            LR  A KM KWVKV KAGVT +VVD+I+  W  NEL MVKF +PLCRNMDRAREI+ELKT
Sbjct: 206  LRTEASKMRKWVKVMKAGVTQSVVDQIRYAWRNNELAMVKFDLPLCRNMDRAREIVELKT 265

Query: 677  GGLVIWSKKDTHVVYRGCNYELSSKACQKLHSEHGHVVEASPSDDIFMESEDEANISYIQ 498
            GGLV+W++KD+ V+YRGCNY L+                                 S++ 
Sbjct: 266  GGLVVWTRKDSLVIYRGCNYHLTKS-------------------------------SHVS 294

Query: 497  PNEVTQDGKDSISNSCPTS--MEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWWPKPL 324
              +     KD      PTS  + ++    +++G+L+ERE DRLL+GLGPRFVDWW  KPL
Sbjct: 295  TMDEKIGSKDGEEEYIPTSIFIGDDANTPTINGSLFERETDRLLDGLGPRFVDWWMRKPL 354

Query: 323  PIDADLLPEVVPGYMTPFRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKLQGLA 144
            P+DADLLPEVV G+M P R      R KL DDELT LRK A  LPTHF LGRN +LQGLA
Sbjct: 355  PVDADLLPEVVAGFMPPSR--FHYARAKLKDDELTYLRKLAYALPTHFVLGRNRRLQGLA 412

Query: 143  TAMIKLWEKSLIAKIAVKWGIPNTNNELMAQEIKHLTGGVLILRNKF 3
             A++KLWE+SLIAKIAVKWGIPNT+NE MA E+KHLTGGVL+LRNKF
Sbjct: 413  AAILKLWERSLIAKIAVKWGIPNTDNEQMANELKHLTGGVLLLRNKF 459


>ref|XP_006603058.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X5 [Glycine max]
          Length = 744

 Score =  421 bits (1082), Expect = e-115
 Identities = 236/471 (50%), Positives = 310/471 (65%), Gaps = 11/471 (2%)
 Frame = -2

Query: 1382 LASHTDITPSPNS---IEKPQPSNPNFSFLSEAFQSTNDTIKVPSAPWMTGPLLLPSNEV 1212
            L+ H  + P+  S   I    P N N     +    +   IK P+ PWM  PLLL  +E+
Sbjct: 4    LSFHPSLFPNSYSRFHISSSLPPNSNNGHNHQHTSPSQVPIKSPTPPWMKVPLLLQPHEL 63

Query: 1211 LNLSTLRTKRGKNRVQGAERTDLSLTDKISGG---RGRRAMRRIVESITKLQELANSEEA 1041
            ++LS  ++K+ K      E+ +LS  DK   G   RG+RAM++IV+ + KL +  NS E 
Sbjct: 64   VDLSNPKSKKFK-----PEKHELS--DKALMGKEVRGKRAMKKIVDRVEKLHKTQNSNET 116

Query: 1040 QK---GVEKFELRVPLKPIFEDGNSNSEARMPWMAEEKIVFRRMKKERVLTKAELSLSET 870
            +     VE F   + +  + E+    S+ RMPW  +EK  F ++K+E+ +T AEL+L + 
Sbjct: 117  RVDSLNVENFGGYLEI--LKENEEVRSKGRMPWEKDEKFGFVKVKREKAVTAAELTLDKA 174

Query: 869  VLKRLRNNAVKMTKWVKVKKAGVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDRAREII 690
            +L+RLRN A +M  W+KVKKAGVT  VVD+IKRTW  NEL M+KF IPLCRNMDRAREI+
Sbjct: 175  LLRRLRNEAARMRTWIKVKKAGVTQDVVDQIKRTWRRNELAMIKFDIPLCRNMDRAREIV 234

Query: 689  ELKTGGLVIWSKKDTHVVYRGCNYELSSKACQKLHSEHGHV--VEASPSDDIFMESEDEA 516
            E KTGGLV+ SKKD  VVYRGCN++L++K    L + H  +  VE +   DIF    + +
Sbjct: 235  ETKTGGLVVLSKKDFLVVYRGCNHQLTTKGSPSLRTNHYEMNRVELATKGDIFRVESNHS 294

Query: 515  NISYIQPNEVTQDGKDSISNSCPTSMEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWW 336
            +   +  N    D KDSIS        ++V  Q V+G+LYERE +RLL+GLGPRF+DWW 
Sbjct: 295  SSEMLNWNA---DHKDSISTGI-----QDVNCQLVNGSLYERETERLLDGLGPRFIDWWM 346

Query: 335  PKPLPIDADLLPEVVPGYMTPFRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKL 156
             KPLP+DADLLPE VPG+  PFR C P    KLTD ELT  RK A+ LPTHF LGRN  L
Sbjct: 347  HKPLPVDADLLPEEVPGFQPPFRLCPPHSSAKLTDYELTYFRKLAQSLPTHFVLGRNKGL 406

Query: 155  QGLATAMIKLWEKSLIAKIAVKWGIPNTNNELMAQEIKHLTGGVLILRNKF 3
            +GLA+A++KLWEKSLIAKIA+K+GIPNT+NE+MA E+K LTGGVL+LRNKF
Sbjct: 407  KGLASAILKLWEKSLIAKIAIKYGIPNTDNEMMANELKCLTGGVLLLRNKF 457


>ref|XP_006603055.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X2 [Glycine max]
            gi|571550194|ref|XP_006603056.1| PREDICTED: chloroplastic
            group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X3 [Glycine max]
            gi|571550197|ref|XP_006603057.1| PREDICTED: chloroplastic
            group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X4 [Glycine max]
          Length = 747

 Score =  421 bits (1082), Expect = e-115
 Identities = 236/471 (50%), Positives = 310/471 (65%), Gaps = 11/471 (2%)
 Frame = -2

Query: 1382 LASHTDITPSPNS---IEKPQPSNPNFSFLSEAFQSTNDTIKVPSAPWMTGPLLLPSNEV 1212
            L+ H  + P+  S   I    P N N     +    +   IK P+ PWM  PLLL  +E+
Sbjct: 4    LSFHPSLFPNSYSRFHISSSLPPNSNNGHNHQHTSPSQVPIKSPTPPWMKVPLLLQPHEL 63

Query: 1211 LNLSTLRTKRGKNRVQGAERTDLSLTDKISGG---RGRRAMRRIVESITKLQELANSEEA 1041
            ++LS  ++K+ K      E+ +LS  DK   G   RG+RAM++IV+ + KL +  NS E 
Sbjct: 64   VDLSNPKSKKFK-----PEKHELS--DKALMGKEVRGKRAMKKIVDRVEKLHKTQNSNET 116

Query: 1040 QK---GVEKFELRVPLKPIFEDGNSNSEARMPWMAEEKIVFRRMKKERVLTKAELSLSET 870
            +     VE F   + +  + E+    S+ RMPW  +EK  F ++K+E+ +T AEL+L + 
Sbjct: 117  RVDSLNVENFGGYLEI--LKENEEVRSKGRMPWEKDEKFGFVKVKREKAVTAAELTLDKA 174

Query: 869  VLKRLRNNAVKMTKWVKVKKAGVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDRAREII 690
            +L+RLRN A +M  W+KVKKAGVT  VVD+IKRTW  NEL M+KF IPLCRNMDRAREI+
Sbjct: 175  LLRRLRNEAARMRTWIKVKKAGVTQDVVDQIKRTWRRNELAMIKFDIPLCRNMDRAREIV 234

Query: 689  ELKTGGLVIWSKKDTHVVYRGCNYELSSKACQKLHSEHGHV--VEASPSDDIFMESEDEA 516
            E KTGGLV+ SKKD  VVYRGCN++L++K    L + H  +  VE +   DIF    + +
Sbjct: 235  ETKTGGLVVLSKKDFLVVYRGCNHQLTTKGSPSLRTNHYEMNRVELATKGDIFRVESNHS 294

Query: 515  NISYIQPNEVTQDGKDSISNSCPTSMEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWW 336
            +   +  N    D KDSIS        ++V  Q V+G+LYERE +RLL+GLGPRF+DWW 
Sbjct: 295  SSEMLNWNA---DHKDSISTGI-----QDVNCQLVNGSLYERETERLLDGLGPRFIDWWM 346

Query: 335  PKPLPIDADLLPEVVPGYMTPFRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKL 156
             KPLP+DADLLPE VPG+  PFR C P    KLTD ELT  RK A+ LPTHF LGRN  L
Sbjct: 347  HKPLPVDADLLPEEVPGFQPPFRLCPPHSSAKLTDYELTYFRKLAQSLPTHFVLGRNKGL 406

Query: 155  QGLATAMIKLWEKSLIAKIAVKWGIPNTNNELMAQEIKHLTGGVLILRNKF 3
            +GLA+A++KLWEKSLIAKIA+K+GIPNT+NE+MA E+K LTGGVL+LRNKF
Sbjct: 407  KGLASAILKLWEKSLIAKIAIKYGIPNTDNEMMANELKCLTGGVLLLRNKF 457


>ref|XP_006603054.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X1 [Glycine max]
          Length = 750

 Score =  421 bits (1082), Expect = e-115
 Identities = 236/471 (50%), Positives = 310/471 (65%), Gaps = 11/471 (2%)
 Frame = -2

Query: 1382 LASHTDITPSPNS---IEKPQPSNPNFSFLSEAFQSTNDTIKVPSAPWMTGPLLLPSNEV 1212
            L+ H  + P+  S   I    P N N     +    +   IK P+ PWM  PLLL  +E+
Sbjct: 4    LSFHPSLFPNSYSRFHISSSLPPNSNNGHNHQHTSPSQVPIKSPTPPWMKVPLLLQPHEL 63

Query: 1211 LNLSTLRTKRGKNRVQGAERTDLSLTDKISGG---RGRRAMRRIVESITKLQELANSEEA 1041
            ++LS  ++K+ K      E+ +LS  DK   G   RG+RAM++IV+ + KL +  NS E 
Sbjct: 64   VDLSNPKSKKFK-----PEKHELS--DKALMGKEVRGKRAMKKIVDRVEKLHKTQNSNET 116

Query: 1040 QK---GVEKFELRVPLKPIFEDGNSNSEARMPWMAEEKIVFRRMKKERVLTKAELSLSET 870
            +     VE F   + +  + E+    S+ RMPW  +EK  F ++K+E+ +T AEL+L + 
Sbjct: 117  RVDSLNVENFGGYLEI--LKENEEVRSKGRMPWEKDEKFGFVKVKREKAVTAAELTLDKA 174

Query: 869  VLKRLRNNAVKMTKWVKVKKAGVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDRAREII 690
            +L+RLRN A +M  W+KVKKAGVT  VVD+IKRTW  NEL M+KF IPLCRNMDRAREI+
Sbjct: 175  LLRRLRNEAARMRTWIKVKKAGVTQDVVDQIKRTWRRNELAMIKFDIPLCRNMDRAREIV 234

Query: 689  ELKTGGLVIWSKKDTHVVYRGCNYELSSKACQKLHSEHGHV--VEASPSDDIFMESEDEA 516
            E KTGGLV+ SKKD  VVYRGCN++L++K    L + H  +  VE +   DIF    + +
Sbjct: 235  ETKTGGLVVLSKKDFLVVYRGCNHQLTTKGSPSLRTNHYEMNRVELATKGDIFRVESNHS 294

Query: 515  NISYIQPNEVTQDGKDSISNSCPTSMEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWW 336
            +   +  N    D KDSIS        ++V  Q V+G+LYERE +RLL+GLGPRF+DWW 
Sbjct: 295  SSEMLNWNA---DHKDSISTGI-----QDVNCQLVNGSLYERETERLLDGLGPRFIDWWM 346

Query: 335  PKPLPIDADLLPEVVPGYMTPFRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKL 156
             KPLP+DADLLPE VPG+  PFR C P    KLTD ELT  RK A+ LPTHF LGRN  L
Sbjct: 347  HKPLPVDADLLPEEVPGFQPPFRLCPPHSSAKLTDYELTYFRKLAQSLPTHFVLGRNKGL 406

Query: 155  QGLATAMIKLWEKSLIAKIAVKWGIPNTNNELMAQEIKHLTGGVLILRNKF 3
            +GLA+A++KLWEKSLIAKIA+K+GIPNT+NE+MA E+K LTGGVL+LRNKF
Sbjct: 407  KGLASAILKLWEKSLIAKIAIKYGIPNTDNEMMANELKCLTGGVLLLRNKF 457


>ref|XP_006842364.1| hypothetical protein AMTR_s00079p00185530 [Amborella trichopoda]
            gi|548844430|gb|ERN04039.1| hypothetical protein
            AMTR_s00079p00185530 [Amborella trichopoda]
          Length = 886

 Score =  419 bits (1077), Expect = e-114
 Identities = 247/551 (44%), Positives = 329/551 (59%), Gaps = 52/551 (9%)
 Frame = -2

Query: 1499 PQSLPSNANISTLSSSTHFQNPLKTPPKQLHNFNKPLKILASHTDIT---------PSPN 1347
            P    S  N +T +S  +   PLKT PK+    N P K   + +++          P  N
Sbjct: 84   PPCKDSYFNGATPTSPVNVPLPLKTIPKKQFEMNLPFKENDTISELPWQKMHNLSDPIGN 143

Query: 1346 SIEKPQPSN--------------PNFSFLSEAFQSTNDTIKVPSAPWMTGPLLLPSNEVL 1209
            S     P+N              P  SFL +     ++ +K+P+APWM GPLLLP+++VL
Sbjct: 144  SPPSKGPANGLVRPEKTKGQQGLPTLSFLRKI--GHHNEVKMPTAPWMRGPLLLPADDVL 201

Query: 1208 NLSTLRTKRGKNRVQGAERTDLSLTDKISGGRGRRAMRRIVESITKLQELANSEEAQKGV 1029
            +LS  R K+  N +      D +LT  + GGR + AMR I+E+ITKL+E+    E +K  
Sbjct: 202  DLSKSR-KKSSNEMNS---DDKALTGGVRGGRSKHAMRLIMENITKLKEIHEENEQKKET 257

Query: 1028 -----EKFELRVPLKPIFEDGNSNS--------------------EARMPWMAEEKIVFR 924
                 ++ ++R  +   F +G + S                    E ++PW   EK VFR
Sbjct: 258  HIVLSDEVDIRSKINSSFSEGATKSIEAGFNLPLKEVSVSEDQAMETKLPWTMAEKNVFR 317

Query: 923  RMKKERVLTKAELSLSETVLKRLRNNAVKMTKWVKVKKAGVTDAVVDEIKRTWSGNELVM 744
            R+KKE+  TKAELSL + +L RLR+    +TKWVKVKKAGVT  V++EI   W   EL M
Sbjct: 318  RVKKEKTPTKAELSLPKPLLTRLRDRGRTLTKWVKVKKAGVTQEVMNEIYAVWKKRELAM 377

Query: 743  VKFYIPLCRNMDRAREIIELKTGGLVIWSKKDTHVVYRGCNYELSSKACQKLHSEHGHVV 564
            +KF +PLCRNMDRA EI+E KTGGLV+W KK T VVYRG NY   SK             
Sbjct: 378  LKFDVPLCRNMDRATEIVETKTGGLVVWRKKGTLVVYRGTNYHSLSKTS----------- 426

Query: 563  EASP-SDDIFMESEDEANISYIQPNEVT---QDGKDSISNSCPTSMEENVGLQSVSGTLY 396
            E +P S ++F +++  A   ++   + T   Q G D +       M+E         TL+
Sbjct: 427  ETNPWSLELFDDNKISAPNGFLNFKDDTMIYQAGSDGL-------MKE---------TLF 470

Query: 395  EREADRLLNGLGPRFVDWWWPKPLPIDADLLPEVVPGYMTPFRRCLPQVRPKLTDDELTN 216
            EREA+RLL+ LGPRF+DWWW  PLP+DADLLPEV+P +  P R C P ++ KLTD+ELT 
Sbjct: 471  EREANRLLDELGPRFIDWWWSTPLPVDADLLPEVIPNFRPPLRLCPPHMQSKLTDEELTY 530

Query: 215  LRKFARYLPTHFALGRNTKLQGLATAMIKLWEKSLIAKIAVKWGIPNTNNELMAQEIKHL 36
            LRKFA++LPTHFALG+NTKLQGLA A++KLWEKSLIAKIA+KWGIPN N++ MA E+KHL
Sbjct: 531  LRKFAKHLPTHFALGKNTKLQGLAAAILKLWEKSLIAKIAIKWGIPNVNHQQMAYELKHL 590

Query: 35   TGGVLILRNKF 3
            TGGVL+L+NKF
Sbjct: 591  TGGVLLLQNKF 601


>gb|EXC20503.1| Chloroplastic group IIA intron splicing facilitator CRS1 [Morus
            notabilis]
          Length = 828

 Score =  408 bits (1048), Expect = e-111
 Identities = 241/517 (46%), Positives = 324/517 (62%), Gaps = 15/517 (2%)
 Frame = -2

Query: 1508 FLSPQSLPSNANISTLSSSTHFQNPLKTPPKQLHNFNKPLKILASHTDITPSPNSIEKPQ 1329
            FLSP + P+  ++S+                   NF +P     S+  I+ S N    P+
Sbjct: 6    FLSPSTFPNTHHLSS-------------------NFKRPSD---SYILISSSLN----PK 39

Query: 1328 PSNPNFSFLSEAFQSTN---DTIKVPSAPWMTGPLLLPSNEVLNLSTLRTKRGKNRVQGA 1158
            P+N +    ++    +    + IK+P+ PWM GPL+L  +EV +LS       K   + A
Sbjct: 40   PTNYHHHASTKENPDSKPPLEPIKMPTPPWMKGPLVLQPHEVTDLSKPENDN-KFSNRKA 98

Query: 1157 ERTDLSLTDKISGGRGRRAMRRIVESITKL--QELANSEEAQKG-VEKFELRVPLKPIFE 987
            E++   LTDK+ G RG+  +++I   I +L  +   +SEE QK  V K  +   L+ + E
Sbjct: 99   EKSVNGLTDKLVGRRGKNVIKKIARRIEELGRKSKVDSEETQKDFVGKNGIGDCLEGLGE 158

Query: 986  DGNSNSEARMPWMAEEKIVFRRMKKERVLTKAELSLSETVLKRLRNNAVKMTKWVKVKKA 807
              +     RMPW  +E  VFRRMKKE++++ AEL L   +L+RLR+ A KM KWVKVKKA
Sbjct: 159  SRSGGE--RMPWEKDEGFVFRRMKKEKIVSSAELRLERELLERLRSEARKMRKWVKVKKA 216

Query: 806  GVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDRAREIIELKTGGLVIWSKKDTHVVYRG 627
            GVT  VV+++K  W  NEL MVKF +PLCRNMDRA+EI+E+KTGGLV+W +KD  V+YRG
Sbjct: 217  GVTKEVVEDVKFVWKSNELAMVKFDVPLCRNMDRAQEILEMKTGGLVVWRRKDAQVIYRG 276

Query: 626  CNYELSSKACQKLHSEHGHVVEASPSDDIFMESEDEANISYIQPNEVTQDGKDSISNS-- 453
            CNY+ +SK   + ++      E   S+ + ++S    ++S ++  E T + K S  N+  
Sbjct: 277  CNYQPTSKTFPRTYAGFSGHQETPFSNLVQLDSRKGNSVSEVKSYENTIERKISKKNTEG 336

Query: 452  --CPTS--MEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWWPKPLPIDADLLPEVVPG 285
               PT+  ++ +   Q  S +LY READRLL+GLGPRF+DWW  KPLP+DADLLPEVVPG
Sbjct: 337  ETIPTAIILKNDANFQP-SSSLYVREADRLLDGLGPRFIDWWMNKPLPVDADLLPEVVPG 395

Query: 284  YMTPFRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKLQGLATAMIKLWEKSLIA 105
            +  PFRRC P  R KLTD+ELT LRK A  LPTHF LGRN KLQGLA A++KLWEK  IA
Sbjct: 396  FRPPFRRCPPHTRSKLTDEELTYLRKLAHSLPTHFVLGRNRKLQGLAAAILKLWEKCHIA 455

Query: 104  KIAVKWGIPNTNNELMAQEIKH---LTGGVLILRNKF 3
            KIAVK G+PNTNNE MA E+K    LTGG L+LRNKF
Sbjct: 456  KIAVKLGVPNTNNEQMAYELKARICLTGGDLLLRNKF 492


>ref|XP_004507538.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Cicer arietinum]
          Length = 764

 Score =  398 bits (1023), Expect = e-108
 Identities = 224/481 (46%), Positives = 301/481 (62%), Gaps = 23/481 (4%)
 Frame = -2

Query: 1376 SHTDITPSPNSIEKPQPSNPNFSFLSEAFQSTNDT----------IKVPSAPWMTGPL-L 1230
            S++ I  S +S   P P N N     +     N+           IK P+ PW+  PL L
Sbjct: 9    SYSYIHISSSSSFSPNPKNNNNLNHHKPLSIPNNNNSHSHDHISIIKSPTPPWIKSPLHL 68

Query: 1229 LPSNEVLNLSTLRTKRGKNRVQGAERTDLSLTDKISGGRGRRAMRRIVESITKLQELANS 1050
             P   +LN          + V+ ++ +D +L  K   G+  + +R+I   + KL +  +S
Sbjct: 69   QPQQHLLN----------SNVEKSDLSDKALNSKEISGK--KVLRKIAHKVEKLHKALDS 116

Query: 1049 EE----AQKGVEKFE-LRVPLKPIFEDGNSNSEARMPWMAEEKIVFRRMKKERVLTKAEL 885
            E+     Q G EK E     L  + E+    ++ RMPW  +EKI F ++K+E+  + A+L
Sbjct: 117  EKNETLTQMGSEKVENFGDCLDILMENEEVVNKGRMPWEKDEKIGFFKVKREKTFSAADL 176

Query: 884  SLSETVLKRLRNNAVKMTKWVKVKKAGVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDR 705
            ++ + VL RLR  A +M KWVKVKK GVT  VVDEIKR+W  NEL MVKF IPLC+NM R
Sbjct: 177  NVDKVVLHRLRGEAARMRKWVKVKKIGVTQDVVDEIKRSWRMNELAMVKFDIPLCQNMGR 236

Query: 704  AREIIELKTGGLVIWSKKDTHVVYRGCNYELSSKACQKLHSEHGHVVEASPSDDIFMESE 525
            AREI+E KTGGLVIW KKDT VVYRGCNY+L+SK+  K+H+ +    + +  +   ++S 
Sbjct: 237  AREIVETKTGGLVIWCKKDTLVVYRGCNYQLTSKSSPKIHTGYIRSQKTNSYETNEVKSA 296

Query: 524  DEANISYIQPNEVTQD-------GKDSISNSCPTSMEENVGLQSVSGTLYEREADRLLNG 366
             + ++S ++  + + +        KDS+S         N+  Q  SG+LYE+E DRLL+G
Sbjct: 297  TKGDLSRVESTQSSSEILSSNAEHKDSLSTD-----NYNMNYQPRSGSLYEKECDRLLDG 351

Query: 365  LGPRFVDWWWPKPLPIDADLLPEVVPGYMTPFRRCLPQVRPKLTDDELTNLRKFARYLPT 186
            LGPRFVDWW  KPLP+DADLLPEVVPG+  PFR C P  R KLTDDELT  RK +  LPT
Sbjct: 352  LGPRFVDWWMDKPLPVDADLLPEVVPGFEPPFRLCPPHARSKLTDDELTYFRKISHPLPT 411

Query: 185  HFALGRNTKLQGLATAMIKLWEKSLIAKIAVKWGIPNTNNELMAQEIKHLTGGVLILRNK 6
            HF LGRN  LQGLA A++KLW+KS  AKIA+K+G+PNT+NE+MA E+K LTGGVL+LRNK
Sbjct: 412  HFVLGRNRGLQGLAAAILKLWQKSHTAKIAIKYGVPNTDNEVMANELKRLTGGVLLLRNK 471

Query: 5    F 3
            F
Sbjct: 472  F 472


>ref|XP_007139175.1| hypothetical protein PHAVU_008G007700g [Phaseolus vulgaris]
            gi|561012308|gb|ESW11169.1| hypothetical protein
            PHAVU_008G007700g [Phaseolus vulgaris]
          Length = 744

 Score =  397 bits (1019), Expect = e-107
 Identities = 222/458 (48%), Positives = 297/458 (64%), Gaps = 16/458 (3%)
 Frame = -2

Query: 1328 PSNPNFSFL---SEAFQSTNDT-----IKVPSAPWMTGPLLLPSNEVLNLSTLRTKRGKN 1173
            P+  ++S++   S    ++N+T     IK P+ PWM GPLLL  NE+L+LS  ++K+ K 
Sbjct: 13   PNAYSYSYIHISSSMLPNSNNTPSQLPIKGPTPPWMKGPLLLQPNELLDLSNPKSKKFK- 71

Query: 1172 RVQGAERTDLSLTDKISG-GRGRRAMRRIVESITKLQELANSEEAQKGVEKFELRVPLKP 996
                 ER +LS  D +    RG++ M++IVE + KL    NS  A  G    E    +  
Sbjct: 72   ----LERQELSDKDLMGKEARGKKTMKKIVEKVEKLHGTHNSAGALIGSPNVE---NIGG 124

Query: 995  IFEDGNSNSEAR-----MPWMAEEKIVFRRMKKERVLTKAELSLSETVLKRLRNNAVKMT 831
            + +    N E R     MPW  + K V+ ++K++R +T AEL+L + + +RLRN A  M 
Sbjct: 125  VLDSLKENEEVRRTKGRMPWENDWKFVYEKIKRKRTVTAAELTLDKVLFRRLRNEAATMR 184

Query: 830  KWVKVKKAGVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDRAREIIELKTGGLVIWSKK 651
             W+KVKKAGVT  VVD+IK TW  NEL MVKF IPLCRNM RAREI+E KTGGLV+ SKK
Sbjct: 185  TWIKVKKAGVTQDVVDQIKWTWRRNELAMVKFDIPLCRNMSRAREIVETKTGGLVVLSKK 244

Query: 650  DTHVVYRGCNYELSSKACQKLHSEHGHV--VEASPSDDIFMESEDEANISYIQPNEVTQD 477
            D  VVY G N++L++     L + H  +   E + + DI      ++N S  +      +
Sbjct: 245  DFLVVYHGGNHQLTTTGYPSLRTNHSEMSGAELATTGDI---CSVDSNHSLSEMLNFIAE 301

Query: 476  GKDSISNSCPTSMEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWWPKPLPIDADLLPE 297
             KDSI+ S     E+N+  Q+ +G+LYERE DRLL+ LGPRF+DWW  KPLP+DADLLPE
Sbjct: 302  DKDSIATS-----EQNMNFQTANGSLYERETDRLLDDLGPRFIDWWMAKPLPVDADLLPE 356

Query: 296  VVPGYMTPFRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKLQGLATAMIKLWEK 117
             VPG+  P R C P    KL+D ELT  RK A+ LPTHF LGRN +L+GLA A++KLWEK
Sbjct: 357  DVPGFQPPLRICPPHSCAKLSDYELTYFRKLAQLLPTHFVLGRNKRLKGLAAAILKLWEK 416

Query: 116  SLIAKIAVKWGIPNTNNELMAQEIKHLTGGVLILRNKF 3
            SLIAKI++K+GIPNT+NE+MA E+K+LTGGVL+LRNKF
Sbjct: 417  SLIAKISIKYGIPNTDNEMMANELKYLTGGVLLLRNKF 454


>ref|XP_004138635.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Cucumis sativus]
          Length = 760

 Score =  383 bits (983), Expect = e-103
 Identities = 229/504 (45%), Positives = 308/504 (61%), Gaps = 14/504 (2%)
 Frame = -2

Query: 1472 ISTLSSSTHFQNPLKTPPKQLHNFNKPLKILASHTDI----TPSPNSIEKPQPSNPNFSF 1305
            + T S S+     L  P  + H+   PL  L++H  I    TPS +S+    PS      
Sbjct: 2    LPTTSFSSSLPRSLIPPSFRSHS---PLLHLSTHNPISATSTPSQSSVLPEPPS------ 52

Query: 1304 LSEAFQSTNDTIKVPSAPWMTGPLLLP----SNEVLNLSTLRTKRGKNRVQGAERTDLSL 1137
                   +N  + + +APWM  PL L       E ++ +  + + G +   G ++   +L
Sbjct: 53   ------ISNAAVNLRTAPWMKAPLHLQPQQQEEEGVDPANPKRRNGSDG-SGRDKCSRAL 105

Query: 1136 TDKISGGRGRRAMRRIVESITKLQELANSEEAQKGVEKFELRVPLKPIFEDGNSNSEARM 957
             D      G+ AMRRI +SI KL+   +  E +  +E+ E        FE+  S +  RM
Sbjct: 106  GDSGIDKTGKYAMRRIAKSIGKLRRNGDLGETRMKLEEVEFGGFDLEGFEE--SGTRRRM 163

Query: 956  PWMAEEK-IVFRRMKKERVLTKAELSLSETVLKRLRNNAVKMTKWVKVKKAGVTDAVVDE 780
            PW  ++  IV RRMKK+ V T AEL+L   +L+RL+  A KM KWVKV K GVT  VV++
Sbjct: 164  PWEKDDDGIVLRRMKKKTV-TSAELNLDRVLLERLKGEASKMEKWVKVNKVGVTQDVVNQ 222

Query: 779  IKRTWSGNELVMVKFYIPLCRNMDRAREIIELKTGGLVIWSKKDTHVVYRGCNYELSSKA 600
            I+  W  NEL M+KF +PL RNMDRAREI+E+KTGG+V+WSKK+  VVYRGCNY L+ K 
Sbjct: 223  IQFMWERNELAMLKFDVPLSRNMDRAREIVEMKTGGMVVWSKKNALVVYRGCNYPLNLKH 282

Query: 599  CQKLHSEHGHVVEASPSDDIFMESEDEANISYIQPNEVTQ-----DGKDSISNSCPTSME 435
              K        V  SP + + +E++   ++S    + + +     DG+   ++S      
Sbjct: 283  STKKQ------VHISPQNPVKVETDTHFSLSGHYESGLNRSINDNDGEWEEASSFFLIRH 336

Query: 434  ENVGLQSVSGTLYEREADRLLNGLGPRFVDWWWPKPLPIDADLLPEVVPGYMTPFRRCLP 255
            EN  LQ +SG+LYERE DRLL+ LGPRF+DWW  KPLP+DAD+LPEVVPGYM PFRRC P
Sbjct: 337  EN--LQPLSGSLYERETDRLLDDLGPRFIDWWMHKPLPVDADMLPEVVPGYMPPFRRCPP 394

Query: 254  QVRPKLTDDELTNLRKFARYLPTHFALGRNTKLQGLATAMIKLWEKSLIAKIAVKWGIPN 75
              +  LTD  L +LRK A  LPTHF LGRN KLQGLA +++KLWEKS+IAKIA+KWG+PN
Sbjct: 395  YTKQNLTDAGLQHLRKLAHSLPTHFVLGRNRKLQGLAASILKLWEKSMIAKIALKWGVPN 454

Query: 74   TNNELMAQEIKHLTGGVLILRNKF 3
            T+NE MA E+K+LTGG L+LRNKF
Sbjct: 455  TDNEQMALELKNLTGGTLLLRNKF 478


>gb|EYU44617.1| hypothetical protein MIMGU_mgv1a026522mg, partial [Mimulus guttatus]
          Length = 702

 Score =  380 bits (975), Expect = e-102
 Identities = 211/452 (46%), Positives = 269/452 (59%), Gaps = 33/452 (7%)
 Frame = -2

Query: 1298 EAFQSTNDTIKVPSAPWMTGPLLLPSNEVLNLSTLRTKR----GKNRVQ--GAERTDLSL 1137
            E    +  TIK P+APWM GPLL+  +E+L     RT++    G+N  +  G    D+ L
Sbjct: 18   EHIPHSRSTIKAPTAPWMNGPLLVKPSEILESRRTRTRKHFAAGRNDGEHTGGGHPDVDL 77

Query: 1136 TDKISGGRGRRAMRRIVESITKLQELANSEEAQKGVEKF--------------------- 1020
            T K+ G RG+ AM++I + I KLQ+  N EE  K +E                       
Sbjct: 78   TGKVGGARGKVAMKKIYKGIEKLQDTQNVEEPGKNLENLKFKFAPGALWGDKGEVEENTK 137

Query: 1019 ELRVPLK------PIFEDGNSNSEARMPWMAEEKIVFRRMKKERVLTKAELSLSETVLKR 858
            E R  LK      P  E  N     +MPW ++E +V RR++KE+V+T AE SL   +L+R
Sbjct: 138  EARWNLKIDDFDLPFGEAENEAKSKKMPWESDETVVIRRVQKEKVVTSAESSLDPVLLER 197

Query: 857  LRNNAVKMTKWVKVKKAGVTDAVVDEIKRTWSGNELVMVKFYIPLCRNMDRAREIIELKT 678
            L+  A  + KWVKVKKAGVT +VVD++   W  NEL +V F +PLCRNMDRAREIIE+KT
Sbjct: 198  LKEEAALIRKWVKVKKAGVTQSVVDQVSLFWRNNELALVNFDLPLCRNMDRAREIIEMKT 257

Query: 677  GGLVIWSKKDTHVVYRGCNYELSSKACQKLHSEHGHVVEASPSDDIFMESEDEANISYIQ 498
            GGLV+WS K+   VYRGCNY+   K  + ++     + + S                   
Sbjct: 258  GGLVVWSNKEFLAVYRGCNYKSGPKQFRNIYRNTTAIAQESC------------------ 299

Query: 497  PNEVTQDGKDSISNSCPTSMEENVGLQSVSGTLYEREADRLLNGLGPRFVDWWWPKPLPI 318
                  DG+DS         E ++ + S    LYEREADRLL+GLGPRFVDWW  KPLP+
Sbjct: 300  ------DGRDS-------EWESSIHMTS----LYEREADRLLDGLGPRFVDWWMQKPLPV 342

Query: 317  DADLLPEVVPGYMTPFRRCLPQVRPKLTDDELTNLRKFARYLPTHFALGRNTKLQGLATA 138
            D DLLPEV+PG+ TPFR   P  R K+TD+ELT LRK AR LPTHF LGRN KLQGLA A
Sbjct: 343  DGDLLPEVIPGFKTPFRLSPPSTRAKITDNELTYLRKLARPLPTHFVLGRNRKLQGLAVA 402

Query: 137  MIKLWEKSLIAKIAVKWGIPNTNNELMAQEIK 42
            ++KLWEK  IAKIAVKWG+ NT+NE MA E+K
Sbjct: 403  ILKLWEKCHIAKIAVKWGVQNTDNEQMANELK 434

