BLASTX nr result
ID: Mentha22_contig00034907
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00034907 (1120 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EYU44617.1| hypothetical protein MIMGU_mgv1a026522mg, partial...   389   e-106
ref|XP_006357840.1| PREDICTED: chloroplastic group IIA intron sp...   287   5e-75
ref|XP_002280704.1| PREDICTED: chloroplastic group IIA intron sp...   281   4e-73
ref|XP_002516757.1| conserved hypothetical protein [Ricinus comm...   276   1e-71
ref|XP_004233710.1| PREDICTED: chloroplastic group IIA intron sp...   270   8e-70
ref|XP_007217313.1| hypothetical protein PRUPE_ppa016241mg [Prun...   256   9e-66
gb|EXC20503.1| Chloroplastic group IIA intron splicing facilitat...   248   3e-63
gb|EPS58217.1| hypothetical protein M569_16596, partial [Genlise...   245   2e-62
ref|XP_007033221.1| maize chloroplast splicing factor CRS1, puta...   245   3e-62
ref|XP_007033220.1| maize chloroplast splicing factor CRS1, puta...   245   3e-62
ref|XP_007033219.1| maize chloroplast splicing factor CRS1, puta...   245   3e-62
ref|XP_007033218.1| maize chloroplast splicing factor CRS1, puta...   245   3e-62
ref|XP_007033217.1| maize chloroplast splicing factor CRS1, puta...   245   3e-62
ref|XP_007139175.1| hypothetical protein PHAVU_008G007700g [Phas...   239   1e-60
ref|XP_006430740.1| hypothetical protein CICLE_v10013368mg [Citr...   238   3e-60
ref|XP_006603058.1| PREDICTED: chloroplastic group IIA intron sp...   238   3e-60
ref|XP_006603055.1| PREDICTED: chloroplastic group IIA intron sp...   238   3e-60
ref|XP_006603054.1| PREDICTED: chloroplastic group IIA intron sp...
238 3e-60 ref|XP_006482225.1| PREDICTED: chloroplastic group IIA intron sp... 234 4e-59 ref|XP_004158502.1| PREDICTED: LOW QUALITY PROTEIN: chloroplasti... 232 2e-58 >gb|EYU44617.1| hypothetical protein MIMGU_mgv1a026522mg, partial [Mimulus guttatus] Length = 702 Score = 389 bits (1000), Expect = e-106 Identities = 213/374 (56%), Positives = 257/374 (68%), Gaps = 8/374 (2%) Frame = -2 Query: 1098 RIERGSEDSSAQSSKSIPHSRSKLKAPTAPWMAEPLFVKPNE-MESMKRRNNKGLELDRI 922 +IE + DS +S + IPHSRS +KAPTAPWM PL VKP+E +ES + R K R Sbjct: 4 KIEHENRDSRKESPEHIPHSRSTIKAPTAPWMNGPLLVKPSEILESRRTRTRKHFAAGRN 63 Query: 921 ------GEHPDEDLTGKVGGARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPG 760 G HPD DLTGKVGGARG++AMKKI+K EKLQ+T ++E KN E+ KFKFAPG Sbjct: 64 DGEHTGGGHPDVDLTGKVGGARGKVAMKKIYKGIEKLQDTQNVEEPGKNLENLKFKFAPG 123 Query: 759 ALCGNGDYGGDXXXXXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPWEKEEKMVIRR 580 AL G+ + +K A NL ++FD+P +A E KSKK+PWE +E +VIRR Sbjct: 124 ALWGDKGEVEEN-------TKEARWNLKIDDFDLPFGEAENEAKSKKMPWESDETVVIRR 176 Query: 579 AKKEKVVTAAESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALL 400 +KEKVVT+AESSLD +LLERL+ EAAL++KWVKV KAGVTQ+VVDQV WRN+ELAL+ Sbjct: 177 VQKEKVVTSAESSLDPVLLERLKEEAALIRKWVKVKKAGVTQSVVDQVSLFWRNNELALV 236 Query: 399 KFDLPLSRNMDRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGDQG 220 FDLPL RNMDRAREI+E+KTGGLVVW NK+ LAVYRGCNY RN Sbjct: 237 NFDLPLCRNMDRAREIIEMKTGGLVVWSNKEFLAVYRGCNYKSGPKQFRN---------- 286 Query: 219 NSSSNIIYQNTRTVARETSGESNPHELIHGRDGKLE-NLEMASLYEREADRLLDELGPRF 43 IY+NT +A+E+ GRD + E ++ M SLYEREADRLLD LGPRF Sbjct: 287 ------IYRNTTAIAQES---------CDGRDSEWESSIHMTSLYEREADRLLDGLGPRF 331 Query: 42 VDWWMQKPLPVDGD 1 VDWWMQKPLPVDGD Sbjct: 332 VDWWMQKPLPVDGD 345 >ref|XP_006357840.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like [Solanum tuberosum] Length = 802 Score = 287 bits (735), Expect = 5e-75 Identities = 163/353 (46%), Positives = 221/353 (62%), Gaps = 6/353 (1%) Frame = -2 Query: 1041 
SRSKLKAPTAPWMAEPLFVKPNE-MESMKRRNNKGLELDRIGEHPDEDLTGKVGGARGRL 865 S S +K PTAPWM PL ++PN+ ++ K R K + ++P++ L+GKV G RG+ Sbjct: 70 SSSGIKGPTAPWMRGPLLLEPNQFLDLSKSRKKKDANFAKT-QNPNDALSGKVSGGRGKK 128 Query: 864 AMKKIFKSFEKLQETHDLEAFSKNHESR-KFKFAPGALCGNGDYGGDXXXXXXECSKAAE 688 AMK I++ +KLQET E +++ +F+F PG+L GD + + Sbjct: 129 AMKMIYQGIDKLQETQIGEGTQVETDAKVEFQFPPGSLSEWGDVSYEIEEKNPYGEEDNV 188 Query: 687 ENLNGNEFDIPLFDA---GKEVKSKKLPWEKEEKMVIRRAKKEKVVTAAESSLDEMLLER 517 E+L G EF + + G K+PWE E ++V RR KKEKVV AES+LD MLLER Sbjct: 189 ESLEGVEFGVLSREGEGRGSRKIGVKMPWESEVRIVYRRMKKEKVVMTAESNLDAMLLER 248 Query: 516 LRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAREIVELKT 337 LR EAA ++KWVKV KAGVT+ VVDQ+HF W+N+ELA+LKFDLPL RNMDRAREIVE+KT Sbjct: 249 LRGEAARIQKWVKVKKAGVTRTVVDQIHFIWKNNELAMLKFDLPLCRNMDRAREIVEMKT 308 Query: 336 GGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGDQGNSSSNIIYQNTRTVARETSGE 157 GG VVW ++ L VYRGC+Y + + + + NSS + T + S Sbjct: 309 GGFVVWMKQNALVVYRGCSYTLQ---QKELQHDFLCSHQNSSFTENIKQTSIFSPLNSSG 365 Query: 156 SNPHELIHGRDGKLENLEM-ASLYEREADRLLDELGPRFVDWWMQKPLPVDGD 1 S+ E+I + + ++L M SLY REA+RLLD+LGPR+VDWW KPLPV+ D Sbjct: 366 SSEDEMISVGNSEEDSLAMNESLYVREANRLLDDLGPRYVDWWWPKPLPVNAD 418 >ref|XP_002280704.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like [Vitis vinifera] Length = 1184 Score = 281 bits (719), Expect = 4e-73 Identities = 166/374 (44%), Positives = 211/374 (56%), Gaps = 20/374 (5%) Frame = -2 Query: 1062 SSKSIPHSRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVG 883 SS+ + + + +K PTAPWM PL ++PNE+ + + K + E PD LT KV Sbjct: 62 SSQPVSGTDAAIKMPTAPWMKGPLLLQPNEVLDLSKARPKKVAGSAGAEKPDRSLTEKVS 121 Query: 882 GARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYGGDXXXXXXEC 703 G RG AMKKI +S KLQETH Sbjct: 122 GGRGAKAMKKIMQSIVKLQETHT------------------------------------- 144 Query: 702 SKAAEENLNGNEFDIPLFDAGKEVKSK---KLPWEKEEKMVIRRAKKEKVVTAAESSLDE 532 S +EN EF + L G + S+ K+PW K EK+V RR KKEKVVTAAE +LD Sbjct: 145 
SDETQENTEEFEFGVSLEGIGGDENSRIGGKMPWLKTEKVVFRRTKKEKVVTAAELTLDP 204 Query: 531 MLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAREI 352 MLLERLR EA M+KWVKV KAGVT++VVDQ+H W++DELA++KFD+PL RNMDRAREI Sbjct: 205 MLLERLRGEAVKMRKWVKVKKAGVTESVVDQIHMVWKSDELAMVKFDMPLCRNMDRAREI 264 Query: 351 VELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNM--NYSSAGDQGNSSSN-IIYQNTRT 181 +E+KT GLV+W KD L VYRG NY + + M + D NS N +++ T Sbjct: 265 LEIKTRGLVIWSKKDTLVVYRGSNYQSTSKHFQKMRPGLVAGADASNSKLNQSNFEDDLT 324 Query: 180 VARETSGESNPHELIHGRDGKLENL-------EM-------ASLYEREADRLLDELGPRF 43 ++ ES E + +DG+ ++ EM SLYEREADRLLD LGPRF Sbjct: 325 ISEIKFHESTTGEKMGRKDGEEDSSPTGIFMEEMVDSQPVNGSLYEREADRLLDGLGPRF 384 Query: 42 VDWWMQKPLPVDGD 1 +DWW KPLPVD D Sbjct: 385 IDWWRPKPLPVDAD 398 >ref|XP_002516757.1| conserved hypothetical protein [Ricinus communis] gi|223544130|gb|EEF45655.1| conserved hypothetical protein [Ricinus communis] Length = 742 Score = 276 bits (705), Expect = 1e-71 Identities = 163/363 (44%), Positives = 208/363 (57%), Gaps = 12/363 (3%) Frame = -2 Query: 1053 SIPHSRSK--LKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVGG 880 S+P+S+S +K PTAPWM PL ++P+E+ ++ + NK + E D+ LTGK G Sbjct: 48 SVPNSQSNAPIKVPTAPWMKGPLLLQPHELINLSKPRNKNSSNNANIEKSDKVLTGKESG 107 Query: 879 ARGRLAMKKIFKSFEKLQETHDLE-------AFSKNH-ESRKFKFAP--GALCGNGDYGG 730 RG+ AM+KI KS E+LQE LE A+ K +S F+ G + +GD+G Sbjct: 108 VRGKKAMEKIVKSIEQLQENQALEKTQCDSQAYEKTQLDSEAFEIGEKLGLIREHGDFG- 166 Query: 729 DXXXXXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPWEKEEKMVIRRAKKEKVVTAA 550 V K PWE+EEK V R KKEK VT A Sbjct: 167 --------------------------------VNKKLKPWEREEKFVYWRIKKEKAVTKA 194 Query: 549 ESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNM 370 E L++ LLE LR EA+ M+KWVKVMKAGVTQ+VVDQ+ + WRN+ELA++KFDLPL RNM Sbjct: 195 ELILEKELLEILRTEASKMRKWVKVMKAGVTQSVVDQIRYAWRNNELAMVKFDLPLCRNM 254 Query: 369 DRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGDQGNSSSNIIYQN 190 DRAREIVELKTGGLVVW KD L +YRGCNY SS++ + Sbjct: 255 DRAREIVELKTGGLVVWTRKDSLVIYRGCNY-----------------HLTKSSHVSTMD 297 
Query: 189 TRTVARETSGESNPHELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKPLPV 10 + +++ E P + G D + SL+ERE DRLLD LGPRFVDWWM+KPLPV Sbjct: 298 EKIGSKDGEEEYIPTSIFIGDDANTPTIN-GSLFERETDRLLDGLGPRFVDWWMRKPLPV 356 Query: 9 DGD 1 D D Sbjct: 357 DAD 359 >ref|XP_004233710.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like [Solanum lycopersicum] Length = 766 Score = 270 bits (690), Expect = 8e-70 Identities = 161/377 (42%), Positives = 214/377 (56%), Gaps = 5/377 (1%) Frame = -2 Query: 1116 RRVEDRRIERGSEDSSAQSSKSIPHSRSKLKAPTAPWMAEPLFVKPNE-MESMKRRNNKG 940 ++VE +E ++D + SS +K PTAPWM PL ++PN+ ++ K R K Sbjct: 53 KKVEQCNLEFENQDYGSSSSG--------IKGPTAPWMRGPLLLEPNQVLDLSKSRKKKD 104 Query: 939 LELDRIGEHPDEDLTGKVGGARGRLAMKKIFKSFEKLQETHDLEAFSKNHESR-KFKFAP 763 + ++P++ L+GKV G RG+ AMK I++ +KLQET E + + +F+F P Sbjct: 105 TNFAKT-QNPNDALSGKVSGGRGKKAMKMIYQGIDKLQETQIGECTQVETDVKVEFQFPP 163 Query: 762 GALCGNGDYGGDXXXXXXECSKAAEENLNGNEFDIPLFDA---GKEVKSKKLPWEKEEKM 592 G+L G GD + + E+L G EF + + G ++PWE EE++ Sbjct: 164 GSLSGWGDVSYEIEEKNPYGEEDNVESLEGVEFGVLSREGEGRGSRKSGARMPWESEERI 223 Query: 591 VIRRAKKEKVVTAAESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDE 412 V RR KKEKVV AES+LD MLLERLR EAA ++KWVKV KAGVT+ VVDQ+ F W+N+E Sbjct: 224 VYRRMKKEKVVRTAESNLDAMLLERLRGEAARIQKWVKVKKAGVTRTVVDQIQFIWKNNE 283 Query: 411 LALLKFDLPLSRNMDRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSA 232 LA+LKFDLPL RNMDRAR+IVE+KTGG VVW ++ L VYRG Y Sbjct: 284 LAMLKFDLPLCRNMDRARDIVEMKTGGFVVWMKQNALVVYRG--------------YEMI 329 Query: 231 GDQGNSSSNIIYQNTRTVARETSGESNPHELIHGRDGKLENLEMASLYEREADRLLDELG 52 GNS + + N SLYEREA+RLLD+LG Sbjct: 330 -SVGNSEEDSLVMN------------------------------ESLYEREANRLLDDLG 358 Query: 51 PRFVDWWMQKPLPVDGD 1 PR+VDWW KPLPVD D Sbjct: 359 PRYVDWWWPKPLPVDAD 375 >ref|XP_007217313.1| hypothetical protein PRUPE_ppa016241mg [Prunus persica] gi|462413463|gb|EMJ18512.1| hypothetical protein PRUPE_ppa016241mg [Prunus persica] Length = 809 Score = 256 bits (655), Expect = 9e-66 Identities = 159/365 (43%), 
Positives = 212/365 (58%), Gaps = 22/365 (6%) Frame = -2 Query: 1029 LKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVGGARGRLAMKKI 850 +KAPTAPWM PL ++P+E+ + NK + E PD L GK+ G RG A+K+I Sbjct: 85 IKAPTAPWMKGPLLLQPHEVIDFSKPRNKKTHNNAKAEKPDTVLAGKLVGIRGDKAIKQI 144 Query: 849 FKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYG-GDXXXXXXECSKAAEENLNG 673 +S E+L K E++K G G++ D + K E + + Sbjct: 145 VQSIERLGPNQ------KTDETQK---------GFGEFRIWDSLEGLGQNEKWDETHKDF 189 Query: 672 NEFDIP--LFDAGKEVKSK---KLPWEKEEKMVIRRAKKEKVVTAAESSLDEMLLERLRN 508 EF I L GK S+ K+PWE++E++V +R KK++V +AAE SL++ LLERLR Sbjct: 190 VEFGIGGCLEGLGKAADSRFGGKMPWERDERIVFQRIKKKRVASAAELSLEKELLERLRA 249 Query: 507 EAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAREIVELKTGGL 328 EAA M+KWVKV KAGVTQA+VD + F W+ +ELA++KFD+PL RNM RA+EIVE KTGG+ Sbjct: 250 EAAKMRKWVKVKKAGVTQAIVDDIKFIWKTNELAMVKFDVPLCRNMHRAQEIVETKTGGM 309 Query: 327 VVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGDQGNSSSNIIYQN--TRTVARETSGES 154 VVWG KD L +YRGCNY M SA Q SS+ + + + + S ES Sbjct: 310 VVWGKKDTLVIYRGCNYQSSSKFFPKMRPCSADRQETLSSDHMQPDLEENSSYQYKSFES 369 Query: 153 NPHELIHGRD--------GKLENLEMA------SLYEREADRLLDELGPRFVDWWMQKPL 16 E + +D G + M+ SLYE+EADRLLD LGPRF+DWWM KPL Sbjct: 370 PVDEKMSRKDAEEDCIQSGTFQETSMSCQPTSRSLYEKEADRLLDGLGPRFIDWWMHKPL 429 Query: 15 PVDGD 1 PVD D Sbjct: 430 PVDAD 434 >gb|EXC20503.1| Chloroplastic group IIA intron splicing facilitator CRS1 [Morus notabilis] Length = 828 Score = 248 bits (634), Expect = 3e-63 Identities = 152/377 (40%), Positives = 204/377 (54%), Gaps = 23/377 (6%) Frame = -2 Query: 1062 SSKSIPHSRSKL---KAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTG 892 S+K P S+ L K PT PWM PL ++P+E+ + + N +R E LT Sbjct: 48 STKENPDSKPPLEPIKMPTPPWMKGPLVLQPHEVTDLSKPENDNKFSNRKAEKSVNGLTD 107 Query: 891 KVGGARGRLAMKKIFKSFEKL--QETHDLEAFSKNHESRKFKFAPGALCGNGDYGGDXXX 718 K+ G RG+ +KKI + E+L + D E K+ + G GD Sbjct: 108 KLVGRRGKNVIKKIARRIEELGRKSKVDSEETQKDFVGKN---------GIGD------- 151 Query: 717 XXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPWEKEEKMVIRRAKKEKVVTAAESSL 538 C + E+ +G E ++PWEK+E 
V RR KKEK+V++AE L Sbjct: 152 ----CLEGLGESRSGGE---------------RMPWEKDEGFVFRRMKKEKIVSSAELRL 192 Query: 537 DEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAR 358 + LLERLR+EA M+KWVKV KAGVT+ VV+ V F W+++ELA++KFD+PL RNMDRA+ Sbjct: 193 ERELLERLRSEARKMRKWVKVKKAGVTKEVVEDVKFVWKSNELAMVKFDVPLCRNMDRAQ 252 Query: 357 EIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGDQGNSSSNII------- 199 EI+E+KTGGLVVW KD +YRGCNY +G Q SN++ Sbjct: 253 EILEMKTGGLVVWRRKDAQVIYRGCNYQPTSKTFPRTYAGFSGHQETPFSNLVQLDSRKG 312 Query: 198 --------YQNT---RTVARETSGESNPHELIHGRDGKLENLEMASLYEREADRLLDELG 52 Y+NT + + T GE+ P +I D + +SLY READRLLD LG Sbjct: 313 NSVSEVKSYENTIERKISKKNTEGETIPTAIILKNDANFQ--PSSSLYVREADRLLDGLG 370 Query: 51 PRFVDWWMQKPLPVDGD 1 PRF+DWWM KPLPVD D Sbjct: 371 PRFIDWWMNKPLPVDAD 387 >gb|EPS58217.1| hypothetical protein M569_16596, partial [Genlisea aurea] Length = 668 Score = 245 bits (626), Expect = 2e-62 Identities = 152/355 (42%), Positives = 207/355 (58%), Gaps = 8/355 (2%) Frame = -2 Query: 1041 SRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVGGARGRLA 862 S + APTAPWM +PLFV P+++ +++ K ++ + D+DL+ KVG R +LA Sbjct: 1 SSESVSAPTAPWMRKPLFVNPSQLLDLRKSPIKKNSFNK--QRLDKDLSEKVGNGRNKLA 58 Query: 861 MKKIFKSFEKLQETHDLEAFSKNHESRK---FKFAPGALCGNGDYGGDXXXXXXECSKAA 691 M++IF+ +KLQE+ + S K FKF PG L GN + C + + Sbjct: 59 MRQIFRGIKKLQESRPSSEAAATEGSPKNFEFKFRPGELSGNPQDSKNDG-----CERNS 113 Query: 690 EENLNGNEFDIPLFDA--GKEVKSKKLPWEKEEKMVIRRAKKEKVVTAAE-SSLDEMLLE 520 E + F IPL +A G+EV+ K +PW++E V R A ++ AA+ +++DE+LLE Sbjct: 114 ETT---DGFCIPLREAAEGEEVRLKAMPWQREA--VGRMATNRPLMKAAKLNAIDELLLE 168 Query: 519 RLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRND--ELALLKFDLPLSRNMDRAREIVE 346 RL+NEAA M+KW+KV K GVT VVDQVH TWR+ +LALLKFD+PL+R M RAREIVE Sbjct: 169 RLQNEAAKMRKWIKVKKLGVTPTVVDQVHSTWRSSRSQLALLKFDVPLNRCMSRAREIVE 228 Query: 345 LKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGDQGNSSSNIIYQNTRTVARET 166 +KTGG+ +W +KDL+AVYR G+ SSN A+++ Sbjct: 229 MKTGGIAIWKSKDLIAVYR----------------------GSESSN---------AQQS 257 Query: 165 
SGESNPHELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKPLPVDGD 1 S +SLYERE DRLLDELGPRFVDWW+ KPLPVD D Sbjct: 258 SA------------------SFSSLYERETDRLLDELGPRFVDWWLHKPLPVDAD 294 >ref|XP_007033221.1| maize chloroplast splicing factor CRS1, putative isoform 5 [Theobroma cacao] gi|508712250|gb|EOY04147.1| maize chloroplast splicing factor CRS1, putative isoform 5 [Theobroma cacao] Length = 788 Score = 245 bits (625), Expect = 3e-62 Identities = 149/383 (38%), Positives = 212/383 (55%), Gaps = 19/383 (4%) Frame = -2 Query: 1092 ERGSEDSSAQSSKSIPHSRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEH 913 E S +++++ S S + +K PTAPWM PL ++P+E+ + + +K + + Sbjct: 67 ENRSLNNNSKFSVSKDPNNGPIKMPTAPWMKGPLLLQPHEVLNPSKSTSKKSSNSK-AKA 125 Query: 912 PDEDLTGKVGGARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYG 733 PD+ L GK G RG+ MKKI ++ E LQ LE + +F G ++G Sbjct: 126 PDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLE---DTQIGIREEFEVGNWLE--EFG 180 Query: 732 GDXXXXXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPW-EKEEKMVIRRAKKEKVVT 556 D ++ FD K+PW +EEK+V RR KKEK++T Sbjct: 181 SDG--------------------EVKRFDG-------KMPWLREEEKVVFRRMKKEKLLT 213 Query: 555 AAESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSR 376 AE SLD+ LLERLR +A M+KW+KVMK GVT+AVVD++ WR +EL ++KF +PL R Sbjct: 214 QAEISLDKDLLERLRRKAMRMRKWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCR 273 Query: 375 NMDRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGD----QGNSSS 208 NMDRAREI+E+KT GLVVWG KD L VYRGC++G + +M Y D ++ S Sbjct: 274 NMDRAREIIEMKTRGLVVWGKKDALVVYRGCSHG-LTSKISSMKYPRCADGQEISSSTFS 332 Query: 207 NIIYQNTRTVARETSGESNPHELIHGRDGKLENLE--------------MASLYEREADR 70 ++ N ++ E S ++ D + E++ + SLYERE DR Sbjct: 333 HLTSSNNINMSLEKFNGSTLQSGLYREDREKESMPINIFMKEDENNQPVIGSLYERETDR 392 Query: 69 LLDELGPRFVDWWMQKPLPVDGD 1 LLD LGPRF+DWWM+KPLP+D D Sbjct: 393 LLDGLGPRFIDWWMRKPLPIDAD 415 >ref|XP_007033220.1| maize chloroplast splicing factor CRS1, putative isoform 4 [Theobroma cacao] gi|508712249|gb|EOY04146.1| maize chloroplast splicing factor CRS1, putative isoform 4 [Theobroma cacao] Length = 767 Score = 
245 bits (625), Expect = 3e-62 Identities = 149/383 (38%), Positives = 212/383 (55%), Gaps = 19/383 (4%) Frame = -2 Query: 1092 ERGSEDSSAQSSKSIPHSRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEH 913 E S +++++ S S + +K PTAPWM PL ++P+E+ + + +K + + Sbjct: 41 ENRSLNNNSKFSVSKDPNNGPIKMPTAPWMKGPLLLQPHEVLNPSKSTSKKSSNSK-AKA 99 Query: 912 PDEDLTGKVGGARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYG 733 PD+ L GK G RG+ MKKI ++ E LQ LE + +F G ++G Sbjct: 100 PDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLE---DTQIGIREEFEVGNWLE--EFG 154 Query: 732 GDXXXXXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPW-EKEEKMVIRRAKKEKVVT 556 D ++ FD K+PW +EEK+V RR KKEK++T Sbjct: 155 SDG--------------------EVKRFDG-------KMPWLREEEKVVFRRMKKEKLLT 187 Query: 555 AAESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSR 376 AE SLD+ LLERLR +A M+KW+KVMK GVT+AVVD++ WR +EL ++KF +PL R Sbjct: 188 QAEISLDKDLLERLRRKAMRMRKWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCR 247 Query: 375 NMDRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGD----QGNSSS 208 NMDRAREI+E+KT GLVVWG KD L VYRGC++G + +M Y D ++ S Sbjct: 248 NMDRAREIIEMKTRGLVVWGKKDALVVYRGCSHG-LTSKISSMKYPRCADGQEISSSTFS 306 Query: 207 NIIYQNTRTVARETSGESNPHELIHGRDGKLENLE--------------MASLYEREADR 70 ++ N ++ E S ++ D + E++ + SLYERE DR Sbjct: 307 HLTSSNNINMSLEKFNGSTLQSGLYREDREKESMPINIFMKEDENNQPVIGSLYERETDR 366 Query: 69 LLDELGPRFVDWWMQKPLPVDGD 1 LLD LGPRF+DWWM+KPLP+D D Sbjct: 367 LLDGLGPRFIDWWMRKPLPIDAD 389 >ref|XP_007033219.1| maize chloroplast splicing factor CRS1, putative isoform 3 [Theobroma cacao] gi|508712248|gb|EOY04145.1| maize chloroplast splicing factor CRS1, putative isoform 3 [Theobroma cacao] Length = 788 Score = 245 bits (625), Expect = 3e-62 Identities = 149/383 (38%), Positives = 212/383 (55%), Gaps = 19/383 (4%) Frame = -2 Query: 1092 ERGSEDSSAQSSKSIPHSRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEH 913 E S +++++ S S + +K PTAPWM PL ++P+E+ + + +K + + Sbjct: 41 ENRSLNNNSKFSVSKDPNNGPIKMPTAPWMKGPLLLQPHEVLNPSKSTSKKSSNSK-AKA 99 Query: 912 PDEDLTGKVGGARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYG 733 
PD+ L GK G RG+ MKKI ++ E LQ LE + +F G ++G Sbjct: 100 PDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLE---DTQIGIREEFEVGNWLE--EFG 154 Query: 732 GDXXXXXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPW-EKEEKMVIRRAKKEKVVT 556 D ++ FD K+PW +EEK+V RR KKEK++T Sbjct: 155 SDG--------------------EVKRFDG-------KMPWLREEEKVVFRRMKKEKLLT 187 Query: 555 AAESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSR 376 AE SLD+ LLERLR +A M+KW+KVMK GVT+AVVD++ WR +EL ++KF +PL R Sbjct: 188 QAEISLDKDLLERLRRKAMRMRKWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCR 247 Query: 375 NMDRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGD----QGNSSS 208 NMDRAREI+E+KT GLVVWG KD L VYRGC++G + +M Y D ++ S Sbjct: 248 NMDRAREIIEMKTRGLVVWGKKDALVVYRGCSHG-LTSKISSMKYPRCADGQEISSSTFS 306 Query: 207 NIIYQNTRTVARETSGESNPHELIHGRDGKLENLE--------------MASLYEREADR 70 ++ N ++ E S ++ D + E++ + SLYERE DR Sbjct: 307 HLTSSNNINMSLEKFNGSTLQSGLYREDREKESMPINIFMKEDENNQPVIGSLYERETDR 366 Query: 69 LLDELGPRFVDWWMQKPLPVDGD 1 LLD LGPRF+DWWM+KPLP+D D Sbjct: 367 LLDGLGPRFIDWWMRKPLPIDAD 389 >ref|XP_007033218.1| maize chloroplast splicing factor CRS1, putative isoform 2 [Theobroma cacao] gi|508712247|gb|EOY04144.1| maize chloroplast splicing factor CRS1, putative isoform 2 [Theobroma cacao] Length = 804 Score = 245 bits (625), Expect = 3e-62 Identities = 149/383 (38%), Positives = 212/383 (55%), Gaps = 19/383 (4%) Frame = -2 Query: 1092 ERGSEDSSAQSSKSIPHSRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEH 913 E S +++++ S S + +K PTAPWM PL ++P+E+ + + +K + + Sbjct: 67 ENRSLNNNSKFSVSKDPNNGPIKMPTAPWMKGPLLLQPHEVLNPSKSTSKKSSNSK-AKA 125 Query: 912 PDEDLTGKVGGARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYG 733 PD+ L GK G RG+ MKKI ++ E LQ LE + +F G ++G Sbjct: 126 PDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLE---DTQIGIREEFEVGNWLE--EFG 180 Query: 732 GDXXXXXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPW-EKEEKMVIRRAKKEKVVT 556 D ++ FD K+PW +EEK+V RR KKEK++T Sbjct: 181 SDG--------------------EVKRFDG-------KMPWLREEEKVVFRRMKKEKLLT 213 Query: 555 AAESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSR 376 AE SLD+ 
LLERLR +A M+KW+KVMK GVT+AVVD++ WR +EL ++KF +PL R Sbjct: 214 QAEISLDKDLLERLRRKAMRMRKWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCR 273 Query: 375 NMDRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGD----QGNSSS 208 NMDRAREI+E+KT GLVVWG KD L VYRGC++G + +M Y D ++ S Sbjct: 274 NMDRAREIIEMKTRGLVVWGKKDALVVYRGCSHG-LTSKISSMKYPRCADGQEISSSTFS 332 Query: 207 NIIYQNTRTVARETSGESNPHELIHGRDGKLENLE--------------MASLYEREADR 70 ++ N ++ E S ++ D + E++ + SLYERE DR Sbjct: 333 HLTSSNNINMSLEKFNGSTLQSGLYREDREKESMPINIFMKEDENNQPVIGSLYERETDR 392 Query: 69 LLDELGPRFVDWWMQKPLPVDGD 1 LLD LGPRF+DWWM+KPLP+D D Sbjct: 393 LLDGLGPRFIDWWMRKPLPIDAD 415 >ref|XP_007033217.1| maize chloroplast splicing factor CRS1, putative isoform 1 [Theobroma cacao] gi|508712246|gb|EOY04143.1| maize chloroplast splicing factor CRS1, putative isoform 1 [Theobroma cacao] Length = 818 Score = 245 bits (625), Expect = 3e-62 Identities = 149/383 (38%), Positives = 212/383 (55%), Gaps = 19/383 (4%) Frame = -2 Query: 1092 ERGSEDSSAQSSKSIPHSRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEH 913 E S +++++ S S + +K PTAPWM PL ++P+E+ + + +K + + Sbjct: 67 ENRSLNNNSKFSVSKDPNNGPIKMPTAPWMKGPLLLQPHEVLNPSKSTSKKSSNSK-AKA 125 Query: 912 PDEDLTGKVGGARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYG 733 PD+ L GK G RG+ MKKI ++ E LQ LE + +F G ++G Sbjct: 126 PDKALFGKESGVRGKKVMKKIIRNVEMLQGNEVLE---DTQIGIREEFEVGNWLE--EFG 180 Query: 732 GDXXXXXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPW-EKEEKMVIRRAKKEKVVT 556 D ++ FD K+PW +EEK+V RR KKEK++T Sbjct: 181 SDG--------------------EVKRFDG-------KMPWLREEEKVVFRRMKKEKLLT 213 Query: 555 AAESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSR 376 AE SLD+ LLERLR +A M+KW+KVMK GVT+AVVD++ WR +EL ++KF +PL R Sbjct: 214 QAEISLDKDLLERLRRKAMRMRKWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCR 273 Query: 375 NMDRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGD----QGNSSS 208 NMDRAREI+E+KT GLVVWG KD L VYRGC++G + +M Y D ++ S Sbjct: 274 NMDRAREIIEMKTRGLVVWGKKDALVVYRGCSHG-LTSKISSMKYPRCADGQEISSSTFS 332 Query: 207 
NIIYQNTRTVARETSGESNPHELIHGRDGKLENLE--------------MASLYEREADR 70 ++ N ++ E S ++ D + E++ + SLYERE DR Sbjct: 333 HLTSSNNINMSLEKFNGSTLQSGLYREDREKESMPINIFMKEDENNQPVIGSLYERETDR 392 Query: 69 LLDELGPRFVDWWMQKPLPVDGD 1 LLD LGPRF+DWWM+KPLP+D D Sbjct: 393 LLDGLGPRFIDWWMRKPLPIDAD 415 >ref|XP_007139175.1| hypothetical protein PHAVU_008G007700g [Phaseolus vulgaris] gi|561012308|gb|ESW11169.1| hypothetical protein PHAVU_008G007700g [Phaseolus vulgaris] Length = 744 Score = 239 bits (611), Expect = 1e-60 Identities = 153/371 (41%), Positives = 200/371 (53%), Gaps = 17/371 (4%) Frame = -2 Query: 1062 SSKSIPHSRSK-----LKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDL 898 SS +P+S + +K PT PWM PL ++PNE+ + +K +L+R E D+DL Sbjct: 24 SSSMLPNSNNTPSQLPIKGPTPPWMKGPLLLQPNELLDLSNPKSKKFKLER-QELSDKDL 82 Query: 897 TGKVGGARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYGGDXXX 718 GK ARG+ MKKI + EKL TH+ + GAL G+ + Sbjct: 83 MGKE--ARGKKTMKKIVEKVEKLHGTHN---------------SAGALIGSPNV------ 119 Query: 717 XXXECSKAAEENLNGNEFDIPLFDAGKEVKSKK--LPWEKEEKMVIRRAKKEKVVTAAES 544 EN+ G + +EV+ K +PWE + K V + K+++ VTAAE Sbjct: 120 ----------ENIGGV---LDSLKENEEVRRTKGRMPWENDWKFVYEKIKRKRTVTAAEL 166 Query: 543 SLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDR 364 +LD++L RLRNEAA M+ W+KV KAGVTQ VVDQ+ +TWR +ELA++KFD+PL RNM R Sbjct: 167 TLDKVLFRRLRNEAATMRTWIKVKKAGVTQDVVDQIKWTWRRNELAMVKFDIPLCRNMSR 226 Query: 363 AREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGDQGNSSSNIIYQNTR 184 AREIVE KTGGLVV KD L VY G N+ L Y S + S T Sbjct: 227 AREIVETKTGGLVVLSKKDFLVVYHGGNH-----QLTTTGYPSLRTNHSEMSGAELATTG 281 Query: 183 TVARETSGESNPHELIHGRDGK------LENLEM----ASLYEREADRLLDELGPRFVDW 34 + S S L + K +N+ SLYERE DRLLD+LGPRF+DW Sbjct: 282 DICSVDSNHSLSEMLNFIAEDKDSIATSEQNMNFQTANGSLYERETDRLLDDLGPRFIDW 341 Query: 33 WMQKPLPVDGD 1 WM KPLPVD D Sbjct: 342 WMAKPLPVDAD 352 >ref|XP_006430740.1| hypothetical protein CICLE_v10013368mg [Citrus clementina] gi|557532797|gb|ESR43980.1| hypothetical protein CICLE_v10013368mg [Citrus clementina] Length = 770 Score = 
238 bits (608), Expect = 3e-60 Identities = 153/360 (42%), Positives = 204/360 (56%), Gaps = 18/360 (5%) Frame = -2 Query: 1026 KAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVGGARGRLAMKKIF 847 K PTAPWM P+ ++P+E+ + K + + D+ LT K G RG+ AMKKI Sbjct: 60 KMPTAPWMRSPIVLQPDEIIKPSKPKTK-----KSFKKTDKGLTAKESGVRGKQAMKKII 114 Query: 846 KSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYGGDXXXXXXECSKAAEENLNGNE 667 ++ EKLQ+ L+ K + KF+F G NG + EE+L G Sbjct: 115 ENIEKLQKDQILDETQKK-DMEKFEFR-GCFEENG---------------SDEEDLRGGF 157 Query: 666 FDIPLFDAGKEVKSKKLPWEKEEKMVIRRAKKEKVVTAAESSLDEMLLERLRNEAALMKK 487 K+PW +EE+ V RR KKE++VT AE+ LD L+ERL++EA M+K Sbjct: 158 -------------GGKVPWLREERFVFRRMKKERMVTKAETMLDGELIERLKDEARKMRK 204 Query: 486 WVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAREIVELKTGGLVVWGNKD 307 WVKV KAGVT++VV ++ WR +ELA++KFD+PL RNMDRAREI+ELKTGGLV+W KD Sbjct: 205 WVKVKKAGVTESVVFEIRLAWRRNELAMVKFDVPLCRNMDRAREILELKTGGLVIWTKKD 264 Query: 306 LLAVYRGCNYGRRLGNLRNMNYSSAGDQG---NSSSNI----------IYQNTRTVARET 166 VYRG + M SA DQ + S+++ I NT T+ + Sbjct: 265 AHVVYRGDGSKSSV----KMCPRSADDQEAPLSKSTHLHLEKKVNVSWIKSNTATLDQNR 320 Query: 165 S---GESN--PHELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKPLPVDGD 1 S GE N P + ++ +++ SLYERE DRLLD LGPRFVDWWM KPLPVDGD Sbjct: 321 SLKDGEENSLPTSIFMDKNLRIDK----SLYEREGDRLLDGLGPRFVDWWMWKPLPVDGD 376 >ref|XP_006603058.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like isoform X5 [Glycine max] Length = 744 Score = 238 bits (607), Expect = 3e-60 Identities = 152/366 (41%), Positives = 209/366 (57%), Gaps = 19/366 (5%) Frame = -2 Query: 1041 SRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVGGARGRLA 862 S+ +K+PT PWM PL ++P+E+ + +K + ++ E D+ L GK RG+ A Sbjct: 40 SQVPIKSPTPPWMKVPLLLQPHELVDLSNPKSKKFKPEK-HELSDKALMGKE--VRGKRA 96 Query: 861 MKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYGGDXXXXXXECSKAAEEN 682 MKKI EKL +T + ++E+R ++ Sbjct: 97 MKKIVDRVEKLHKTQN------SNETRV------------------------------DS 120 Query: 681 
LNGNEFD--IPLFDAGKEVKSK-KLPWEKEEKMVIRRAKKEKVVTAAESSLDEMLLERLR 511 LN F + + +EV+SK ++PWEK+EK + K+EK VTAAE +LD+ LL RLR Sbjct: 121 LNVENFGGYLEILKENEEVRSKGRMPWEKDEKFGFVKVKREKAVTAAELTLDKALLRRLR 180 Query: 510 NEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAREIVELKTGG 331 NEAA M+ W+KV KAGVTQ VVDQ+ TWR +ELA++KFD+PL RNMDRAREIVE KTGG Sbjct: 181 NEAARMRTWIKVKKAGVTQDVVDQIKRTWRRNELAMIKFDIPLCRNMDRAREIVETKTGG 240 Query: 330 LVVWGNKDLLAVYRGCNY---GRRLGNLRNMNY-------SSAGD-----QGNSSSNIIY 196 LVV KD L VYRGCN+ + +LR +Y ++ GD +SSS ++ Sbjct: 241 LVVLSKKDFLVVYRGCNHQLTTKGSPSLRTNHYEMNRVELATKGDIFRVESNHSSSEMLN 300 Query: 195 QNTRTVARETSGESNPH-ELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKP 19 N ++G + + +L++G SLYERE +RLLD LGPRF+DWWM KP Sbjct: 301 WNADHKDSISTGIQDVNCQLVNG-----------SLYERETERLLDGLGPRFIDWWMHKP 349 Query: 18 LPVDGD 1 LPVD D Sbjct: 350 LPVDAD 355 >ref|XP_006603055.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like isoform X2 [Glycine max] gi|571550194|ref|XP_006603056.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like isoform X3 [Glycine max] gi|571550197|ref|XP_006603057.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like isoform X4 [Glycine max] Length = 747 Score = 238 bits (607), Expect = 3e-60 Identities = 152/366 (41%), Positives = 209/366 (57%), Gaps = 19/366 (5%) Frame = -2 Query: 1041 SRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVGGARGRLA 862 S+ +K+PT PWM PL ++P+E+ + +K + ++ E D+ L GK RG+ A Sbjct: 40 SQVPIKSPTPPWMKVPLLLQPHELVDLSNPKSKKFKPEK-HELSDKALMGKE--VRGKRA 96 Query: 861 MKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYGGDXXXXXXECSKAAEEN 682 MKKI EKL +T + ++E+R ++ Sbjct: 97 MKKIVDRVEKLHKTQN------SNETRV------------------------------DS 120 Query: 681 LNGNEFD--IPLFDAGKEVKSK-KLPWEKEEKMVIRRAKKEKVVTAAESSLDEMLLERLR 511 LN F + + +EV+SK ++PWEK+EK + K+EK VTAAE +LD+ LL RLR Sbjct: 121 LNVENFGGYLEILKENEEVRSKGRMPWEKDEKFGFVKVKREKAVTAAELTLDKALLRRLR 180 Query: 510 
NEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAREIVELKTGG 331 NEAA M+ W+KV KAGVTQ VVDQ+ TWR +ELA++KFD+PL RNMDRAREIVE KTGG Sbjct: 181 NEAARMRTWIKVKKAGVTQDVVDQIKRTWRRNELAMIKFDIPLCRNMDRAREIVETKTGG 240 Query: 330 LVVWGNKDLLAVYRGCNY---GRRLGNLRNMNY-------SSAGD-----QGNSSSNIIY 196 LVV KD L VYRGCN+ + +LR +Y ++ GD +SSS ++ Sbjct: 241 LVVLSKKDFLVVYRGCNHQLTTKGSPSLRTNHYEMNRVELATKGDIFRVESNHSSSEMLN 300 Query: 195 QNTRTVARETSGESNPH-ELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKP 19 N ++G + + +L++G SLYERE +RLLD LGPRF+DWWM KP Sbjct: 301 WNADHKDSISTGIQDVNCQLVNG-----------SLYERETERLLDGLGPRFIDWWMHKP 349 Query: 18 LPVDGD 1 LPVD D Sbjct: 350 LPVDAD 355 >ref|XP_006603054.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like isoform X1 [Glycine max] Length = 750 Score = 238 bits (607), Expect = 3e-60 Identities = 152/366 (41%), Positives = 209/366 (57%), Gaps = 19/366 (5%) Frame = -2 Query: 1041 SRSKLKAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVGGARGRLA 862 S+ +K+PT PWM PL ++P+E+ + +K + ++ E D+ L GK RG+ A Sbjct: 40 SQVPIKSPTPPWMKVPLLLQPHELVDLSNPKSKKFKPEK-HELSDKALMGKE--VRGKRA 96 Query: 861 MKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYGGDXXXXXXECSKAAEEN 682 MKKI EKL +T + ++E+R ++ Sbjct: 97 MKKIVDRVEKLHKTQN------SNETRV------------------------------DS 120 Query: 681 LNGNEFD--IPLFDAGKEVKSK-KLPWEKEEKMVIRRAKKEKVVTAAESSLDEMLLERLR 511 LN F + + +EV+SK ++PWEK+EK + K+EK VTAAE +LD+ LL RLR Sbjct: 121 LNVENFGGYLEILKENEEVRSKGRMPWEKDEKFGFVKVKREKAVTAAELTLDKALLRRLR 180 Query: 510 NEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAREIVELKTGG 331 NEAA M+ W+KV KAGVTQ VVDQ+ TWR +ELA++KFD+PL RNMDRAREIVE KTGG Sbjct: 181 NEAARMRTWIKVKKAGVTQDVVDQIKRTWRRNELAMIKFDIPLCRNMDRAREIVETKTGG 240 Query: 330 LVVWGNKDLLAVYRGCNY---GRRLGNLRNMNY-------SSAGD-----QGNSSSNIIY 196 LVV KD L VYRGCN+ + +LR +Y ++ GD +SSS ++ Sbjct: 241 LVVLSKKDFLVVYRGCNHQLTTKGSPSLRTNHYEMNRVELATKGDIFRVESNHSSSEMLN 300 Query: 195 QNTRTVARETSGESNPH-ELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKP 19 N ++G + + +L++G SLYERE +RLLD 
LGPRF+DWWM KP Sbjct: 301 WNADHKDSISTGIQDVNCQLVNG-----------SLYERETERLLDGLGPRFIDWWMHKP 349 Query: 18 LPVDGD 1 LPVD D Sbjct: 350 LPVDAD 355 >ref|XP_006482225.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like isoform X1 [Citrus sinensis] gi|568857343|ref|XP_006482226.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like isoform X2 [Citrus sinensis] Length = 771 Score = 234 bits (598), Expect = 4e-59 Identities = 152/360 (42%), Positives = 203/360 (56%), Gaps = 18/360 (5%) Frame = -2 Query: 1026 KAPTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEHPDEDLTGKVGGARGRLAMKKIF 847 K PTAPWM P+ ++P+E+ + K + + D+ LT K G RG+ AMKKI Sbjct: 54 KMPTAPWMRSPIVLQPDEIIKPSKPKTK-----KSFKKTDKGLTAKESGVRGKQAMKKII 108 Query: 846 KSFEKLQETHDLEAFSKNHESRKFKFAPGALCGNGDYGGDXXXXXXECSKAAEENLNGNE 667 ++ EKLQ+ L+ K KF+F G N + EE+L G Sbjct: 109 ENIEKLQKDQILDETQKK-VMEKFEF-KGCFEENVSH---------------EEDLRGG- 150 Query: 666 FDIPLFDAGKEVKSKKLPWEKEEKMVIRRAKKEKVVTAAESSLDEMLLERLRNEAALMKK 487 K+PW +E++ V RR KKE++VT AE+ LD LLERL++EA M+K Sbjct: 151 ------------FGGKVPWLREDRFVFRRMKKERMVTKAETMLDGELLERLKDEARKMRK 198 Query: 486 WVKVMKAGVTQAVVDQVHFTWRNDELALLKFDLPLSRNMDRAREIVELKTGGLVVWGNKD 307 WVKV KAGVT++VV ++ WR +ELA++KFD+PL RNMDRAREI+ELKTGGLV+W KD Sbjct: 199 WVKVKKAGVTESVVFEIRLAWRRNELAMVKFDVPLCRNMDRAREILELKTGGLVIWTKKD 258 Query: 306 LLAVYRGCNYGRRLGNLRNMNYSSAGDQG---NSSSNI----------IYQNTRTVARET 166 VYRG + + M SA DQ + S+++ I NT T+ + Sbjct: 259 AHVVYRGDSSKSSV----KMCPRSADDQEAPLSKSTHLHLEKKVNVSWIKSNTATLDQNR 314 Query: 165 S---GESN--PHELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKPLPVDGD 1 S GE N P + ++ +++ SLYERE DRLLD LGPRFVDWWM KPLPVDGD Sbjct: 315 SLKDGEENSLPTSIFMDKNLRIDK----SLYEREGDRLLDGLGPRFVDWWMWKPLPVDGD 370 >ref|XP_004158502.1| PREDICTED: LOW QUALITY PROTEIN: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like [Cucumis sativus] Length = 760 Score = 232 bits (591), Expect = 2e-58 Identities = 151/387 (39%), Positives = 204/387 (52%), Gaps 
= 26/387 (6%) Frame = -2 Query: 1083 SEDSSAQSSKSIPH----SRSKLKAPTAPWMAEPLFVKPNEME-------SMKRRNNKGL 937 S S+ S +P S + + TAPWM PL ++P + E + KRRN Sbjct: 36 SATSTPSQSSVLPEPPSISNAAVNLRTAPWMKAPLHLQPQQQEEEGVDPANPKRRNGS-- 93 Query: 936 ELDRIGEHPDEDLTGKVG-GARGRLAMKKIFKSFEKLQETHDLEAFSKNHESRKFKFAPG 760 D G G G G+ AM++I KS KL+ Sbjct: 94 --DGSGRDKCSRALGDSGIDKTGKYAMRRIAKSIGKLRR--------------------- 130 Query: 759 ALCGNGDYGGDXXXXXXECSKAAEENLNGNEFDIPLFDAGKEVKSKKLPWEKEEKMVIRR 580 NGD G ++ E + +FD+ F+ + +++PWEK++ ++ R Sbjct: 131 ----NGDLGE---------TRMKLEEVEFGDFDLEGFE--ESGTRRRMPWEKDDDGIVLR 175 Query: 579 AKKEKVVTAAESSLDEMLLERLRNEAALMKKWVKVMKAGVTQAVVDQVHFTWRNDELALL 400 K+K VT+AE +LD +LLERL+ EA+ M+KWVKV K GVTQ VV+Q+ F W +ELA+L Sbjct: 176 RMKKKTVTSAELNLDRVLLERLKGEASKMEKWVKVNKVGVTQDVVNQIQFMWERNELAML 235 Query: 399 KFDLPLSRNMDRAREIVELKTGGLVVWGNKDLLAVYRGCNYGRRLGNLRNMNYSSAGDQG 220 KFD+PLSRNMDRAREIVE+KTGG+VVW K+ L +YRGCNY N+ +S+ Sbjct: 236 KFDVPLSRNMDRAREIVEMKTGGMVVWSKKNALVIYRGCNYP------LNLKHSTKKQVH 289 Query: 219 NSSSNIIYQNTRT-VARETSGESNPHELIHGRDG-----------KLENLE--MASLYER 82 S N + T T + ES + I+ DG + ENL+ SLYER Sbjct: 290 ISPQNPVKVETDTHFSLSGHYESGLNRSINDNDGEWEEASSFFLIRHENLQPLSGSLYER 349 Query: 81 EADRLLDELGPRFVDWWMQKPLPVDGD 1 E DRLLD+LGPRF+DWWM KPLPVD D Sbjct: 350 ETDRLLDDLGPRFIDWWMHKPLPVDAD 376
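
Note on reading the hit summary programmatically: in this legacy BLAST text format, very small E-values are printed without a leading digit (for example "e-106" for the top hit), which most numeric parsers reject. The sketch below is only an illustration, not part of the BLAST output: the names ROW, parse_evalue and parse_hit_row are hypothetical helpers, and the regular expression assumes each summary row has the shape "identifier, truncated description, bit score, E-value" as seen in this report.

```python
import re

# Assumed row shape: "<db>|<accession>| <description...>  <bits>  <evalue>"
ROW = re.compile(
    r"^(?P<id>\S+\|)\s+(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)

def parse_evalue(text):
    # Legacy BLAST prints "e-106" for 1e-106; float() needs the leading "1".
    return float("1" + text if text.startswith("e") else text)

def parse_hit_row(line):
    # Returns a dict for a summary row, or None if the line is not one.
    m = ROW.match(line.strip())
    if m is None:
        return None
    return {
        "id": m.group("id"),
        "description": m.group("desc"),
        "bits": float(m.group("bits")),
        "evalue": parse_evalue(m.group("evalue")),
    }

# Two rows copied from the hit summary of this report.
rows = [
    "gb|EYU44617.1| hypothetical protein MIMGU_mgv1a026522mg, partial...   389   e-106",
    "ref|XP_006357840.1| PREDICTED: chloroplastic group IIA intron sp...   287   5e-75",
]
hits = [parse_hit_row(r) for r in rows]
```

Each parsed row keeps the truncated description as printed; the full subject titles appear only in the alignment headers that begin with ">".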