BLASTX nr result
ID: Mentha28_contig00016243
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00016243 (1544 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU44617.1| hypothetical protein MIMGU_mgv1a026522mg, partial...   478   e-132
ref|XP_006357840.1| PREDICTED: chloroplastic group IIA intron sp...   417   e-114
ref|XP_002280704.1| PREDICTED: chloroplastic group IIA intron sp...   411   e-112
ref|XP_002516757.1| conserved hypothetical protein [Ricinus comm...   390   e-106
ref|XP_006482225.1| PREDICTED: chloroplastic group IIA intron sp...   387   e-105
gb|EXC20503.1| Chloroplastic group IIA intron splicing facilitat...   385   e-104
ref|XP_006430740.1| hypothetical protein CICLE_v10013368mg [Citr...   383   e-103
ref|XP_006603058.1| PREDICTED: chloroplastic group IIA intron sp...   377   e-102
ref|XP_006603055.1| PREDICTED: chloroplastic group IIA intron sp...   377   e-102
ref|XP_006603054.1| PREDICTED: chloroplastic group IIA intron sp...   377   e-102
ref|XP_007217313.1| hypothetical protein PRUPE_ppa016241mg [Prun...   374   e-101
ref|XP_007139175.1| hypothetical protein PHAVU_008G007700g [Phas...   370   e-100
ref|XP_004138635.1| PREDICTED: chloroplastic group IIA intron sp...   370   e-100
ref|XP_004507538.1| PREDICTED: chloroplastic group IIA intron sp...   370   1e-99
ref|XP_004158502.1| PREDICTED: LOW QUALITY PROTEIN: chloroplasti...   369   3e-99
ref|XP_007033221.1| maize chloroplast splicing factor CRS1, puta...   367   6e-99
ref|XP_007033220.1| maize chloroplast splicing factor CRS1, puta...   367   6e-99
ref|XP_007033219.1| maize chloroplast splicing factor CRS1, puta...
367 6e-99 ref|XP_007033218.1| maize chloroplast splicing factor CRS1, puta... 367 6e-99 ref|XP_007033217.1| maize chloroplast splicing factor CRS1, puta... 367 6e-99 >gb|EYU44617.1| hypothetical protein MIMGU_mgv1a026522mg, partial [Mimulus guttatus] Length = 702 Score = 478 bits (1231), Expect = e-132 Identities = 260/438 (59%), Positives = 303/438 (69%), Gaps = 19/438 (4%) Frame = -3 Query: 1296 APTAPWMAEPLFVKPNE-MESMKRRNNKGLELDRI------GEYPDEDLT---------- 1168 APTAPWM PL VKP+E +ES + R K R G +PD DLT Sbjct: 29 APTAPWMNGPLLVKPSEILESRRTRTRKHFAAGRNDGEHTGGGHPDVDLTGKVGGARGKV 88 Query: 1167 -MKKIFKSFEKLQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEE 991 MKKI+K EKLQ+T ++E KN E+ KFKFAPGAL +K A Sbjct: 89 AMKKIYKGIEKLQDTQNVEEPGKNLENLKFKFAPGALWGDKGEVEEN-------TKEARW 141 Query: 990 NLNGNEFDIPLFSAGKEVKSKKLPWEKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNE 811 NL ++FD+P A E KSKK+PWE +E +VIRR +KEKVVT+AE SLD +LLERL+ E Sbjct: 142 NLKIDDFDLPFGEAENEAKSKKMPWESDETVVIRRVQKEKVVTSAESSLDPVLLERLKEE 201 Query: 810 AALMKKWVKVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLV 631 AAL++KWVKV KAG+TQ+VVDQV WRNNE+AL+ FD L RNMDRAREI+E+KTGGLV Sbjct: 202 AALIRKWVKVKKAGVTQSVVDQVSLFWRNNELALVNFDLPLCRNMDRAREIIEMKTGGLV 261 Query: 630 VWRKKDLLAVYRGCNYGRGLENLRNMNYSSAGDQGNPSSNITYQNTGTVARETSGESNPH 451 VW K+ LAVYRGCNY G + RN+ Y+NT +A+E+ Sbjct: 262 VWSNKEFLAVYRGCNYKSGPKQFRNI----------------YRNTTAIAQESCD----- 300 Query: 450 ELIHGRDGKLEN-LEMASLYEREADRLLDELGPRFVDWWMQKPLQVDGDLLPAVVPGYKT 274 GRD + E+ + M SLYEREADRLLD LGPRFVDWWMQKPL VDGDLLP V+PG+KT Sbjct: 301 ----GRDSEWESSIHMTSLYEREADRLLDGLGPRFVDWWMQKPLPVDGDLLPEVIPGFKT 356 Query: 273 PFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAILKLWEKCHIAKIA 94 PFRL PP RAK+ D+ELTYLR+LARPLPTHFVLGRNR LQGLA AILKLWEKCHIAKIA Sbjct: 357 PFRLSPPSTRAKITDNELTYLRKLARPLPTHFVLGRNRKLQGLAVAILKLWEKCHIAKIA 416 Query: 93 VKWGVPNTDNEQMANELK 40 VKWGV NTDNEQMANELK Sbjct: 417 VKWGVQNTDNEQMANELK 434 >ref|XP_006357840.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like [Solanum 
tuberosum] Length = 802 Score = 417 bits (1073), Expect = e-114 Identities = 234/449 (52%), Positives = 296/449 (65%), Gaps = 18/449 (4%) Frame = -3 Query: 1293 PTAPWMAEPLFVKPNE-MESMKRRNNKGLELDRIGEYPDEDLT-----------MKKIFK 1150 PTAPWM PL ++PN+ ++ K R K + + P++ L+ MK I++ Sbjct: 77 PTAPWMRGPLLLEPNQFLDLSKSRKKKDANFAKT-QNPNDALSGKVSGGRGKKAMKMIYQ 135 Query: 1149 SFEKLQETHDLEAFNKNHESG-KFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNE 973 +KLQET E ++ +F+F PG+L + E+L G E Sbjct: 136 GIDKLQETQIGEGTQVETDAKVEFQFPPGSLSEWGDVSYEIEEKNPYGEEDNVESLEGVE 195 Query: 972 FDIPLFSAGKEVKSKKL----PWEKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNEAA 805 F + L G+ S+K+ PWE E ++V RR KKEKVV AE +LD MLLERLR EAA Sbjct: 196 FGV-LSREGEGRGSRKIGVKMPWESEVRIVYRRMKKEKVVMTAESNLDAMLLERLRGEAA 254 Query: 804 LMKKWVKVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLVVW 625 ++KWVKV KAG+T+ VVDQ+HF W+NNE+A+LKFD L RNMDRAREIVE+KTGG VVW Sbjct: 255 RIQKWVKVKKAGVTRTVVDQIHFIWKNNELAMLKFDLPLCRNMDRAREIVEMKTGGFVVW 314 Query: 624 RKKDLLAVYRGCNYGRGLENLRNMNYSSAGDQGNPSSNITYQNTGTVARETSGESNPHEL 445 K++ L VYRGC+Y + L++ S N S + T + S S+ E+ Sbjct: 315 MKQNALVVYRGCSYTLQQKELQHDFLCS---HQNSSFTENIKQTSIFSPLNSSGSSEDEM 371 Query: 444 IHGRDGKLENLEM-ASLYEREADRLLDELGPRFVDWWMQKPLQVDGDLLPAVVPGYKTPF 268 I + + ++L M SLY REA+RLLD+LGPR+VDWW KPL V+ DLLP VVPG+K PF Sbjct: 372 ISVGNSEEDSLAMNESLYVREANRLLDDLGPRYVDWWWPKPLPVNADLLPEVVPGFKPPF 431 Query: 267 RLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAILKLWEKCHIAKIAVK 88 RLCPP +R+KL DDELT LR+LAR LPTHFVLGRNR LQGLAAA++KLWEKCHIAKIA+K Sbjct: 432 RLCPPRSRSKLTDDELTQLRKLARSLPTHFVLGRNRKLQGLAAAVVKLWEKCHIAKIALK 491 Query: 87 WGVPNTDNEQMANELKKLTGGVLLLRNKF 1 WG+PNT NE MANELK LTGGVLLLRNKF Sbjct: 492 WGIPNTSNELMANELKYLTGGVLLLRNKF 520 >ref|XP_002280704.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like [Vitis vinifera] Length = 1184 Score = 411 bits (1057), Expect = e-112 Identities = 234/462 (50%), Positives = 285/462 (61%), Gaps = 31/462 (6%) Frame = -3 Query: 1293 
PTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEYPDEDLT-----------MKKIFKS 1147 PTAPWM PL ++PNE+ + + K + E PD LT MKKI +S Sbjct: 76 PTAPWMKGPLLLQPNEVLDLSKARPKKVAGSAGAEKPDRSLTEKVSGGRGAKAMKKIMQS 135 Query: 1146 FEKLQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNEFD 967 KLQETH S +EN EF Sbjct: 136 IVKLQETHT-------------------------------------SDETQENTEEFEFG 158 Query: 966 IPLFSAGKEVKSK---KLPWEKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNEAALMK 796 + L G + S+ K+PW K EK+V RR KKEKVVTAAEL+LD MLLERLR EA M+ Sbjct: 159 VSLEGIGGDENSRIGGKMPWLKTEKVVFRRTKKEKVVTAAELTLDPMLLERLRGEAVKMR 218 Query: 795 KWVKVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLVVWRKK 616 KWVKV KAG+T++VVDQ+H W+++E+A++KFD L RNMDRAREI+E+KT GLV+W KK Sbjct: 219 KWVKVKKAGVTESVVDQIHMVWKSDELAMVKFDMPLCRNMDRAREILEIKTRGLVIWSKK 278 Query: 615 DLLAVYRGCNYGRGLENLRNMNYS--SAGDQGNPSSNIT-YQNTGTVARETSGESNPHEL 445 D L VYRG NY ++ + M + D N N + +++ T++ ES E Sbjct: 279 DTLVVYRGSNYQSTSKHFQKMRPGLVAGADASNSKLNQSNFEDDLTISEIKFHESTTGEK 338 Query: 444 IHGRDGKLENL-------EMA-------SLYEREADRLLDELGPRFVDWWMQKPLQVDGD 307 + +DG+ ++ EM SLYEREADRLLD LGPRF+DWW KPL VD D Sbjct: 339 MGRKDGEEDSSPTGIFMEEMVDSQPVNGSLYEREADRLLDGLGPRFIDWWRPKPLPVDAD 398 Query: 306 LLPAVVPGYKTPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAILK 127 LLP V+PG++ PFRL PP R+KL DDELTYLR+LA LPTHFVLGRNR LQGLAAAILK Sbjct: 399 LLPEVLPGFRPPFRLSPPQTRSKLTDDELTYLRKLAYALPTHFVLGRNRKLQGLAAAILK 458 Query: 126 LWEKCHIAKIAVKWGVPNTDNEQMANELKKLTGGVLLLRNKF 1 LWEK I KIA+KWG+PNT NEQMANELK LTGGVLLLRNKF Sbjct: 459 LWEKSLIVKIAIKWGIPNTKNEQMANELKCLTGGVLLLRNKF 500 >ref|XP_002516757.1| conserved hypothetical protein [Ricinus communis] gi|223544130|gb|EEF45655.1| conserved hypothetical protein [Ricinus communis] Length = 742 Score = 390 bits (1002), Expect = e-106 Identities = 228/453 (50%), Positives = 272/453 (60%), Gaps = 22/453 (4%) Frame = -3 Query: 1293 PTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEYPDEDLT-----------MKKIFKS 1147 PTAPWM PL ++P+E+ ++ + NK + E D+ LT M+KI KS Sbjct: 61 
PTAPWMKGPLLLQPHELINLSKPRNKNSSNNANIEKSDKVLTGKESGVRGKKAMEKIVKS 120 Query: 1146 FEKLQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNEFD 967 E+LQE LE C A E + D Sbjct: 121 IEQLQENQALEKTQ-------------------------------CDSQAYEK---TQLD 146 Query: 966 IPLFSAGKE-----------VKSKKLPWEKEEKMVIRRAKKEKVVTAAELSLDEMLLERL 820 F G++ V K PWE+EEK V R KKEK VT AEL L++ LLE L Sbjct: 147 SEAFEIGEKLGLIREHGDFGVNKKLKPWEREEKFVYWRIKKEKAVTKAELILEKELLEIL 206 Query: 819 RNEAALMKKWVKVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTG 640 R EA+ M+KWVKVMKAG+TQ+VVDQ+ +AWRNNE+A++KFD L RNMDRAREIVELKTG Sbjct: 207 RTEASKMRKWVKVMKAGVTQSVVDQIRYAWRNNELAMVKFDLPLCRNMDRAREIVELKTG 266 Query: 639 GLVVWRKKDLLAVYRGCNYGRGLENLRNMNYSSAGDQGNPSSNITYQNTGTVARETSGES 460 GLVVW +KD L +YRGCNY SS+++ + +++ E Sbjct: 267 GLVVWTRKDSLVIYRGCNY-----------------HLTKSSHVSTMDEKIGSKDGEEEY 309 Query: 459 NPHELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKPLQVDGDLLPAVVPGY 280 P + G D + SL+ERE DRLLD LGPRFVDWWM+KPL VD DLLP VV G+ Sbjct: 310 IPTSIFIGDDANTPTIN-GSLFERETDRLLDGLGPRFVDWWMRKPLPVDADLLPEVVAGF 368 Query: 279 KTPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAILKLWEKCHIAK 100 P R + RAKL DDELTYLR+LA LPTHFVLGRNR LQGLAAAILKLWE+ IAK Sbjct: 369 MPPSRF--HYARAKLKDDELTYLRKLAYALPTHFVLGRNRRLQGLAAAILKLWERSLIAK 426 Query: 99 IAVKWGVPNTDNEQMANELKKLTGGVLLLRNKF 1 IAVKWG+PNTDNEQMANELK LTGGVLLLRNKF Sbjct: 427 IAVKWGIPNTDNEQMANELKHLTGGVLLLRNKF 459 >ref|XP_006482225.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like isoform X1 [Citrus sinensis] gi|568857343|ref|XP_006482226.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like isoform X2 [Citrus sinensis] Length = 771 Score = 387 bits (995), Expect = e-105 Identities = 230/457 (50%), Positives = 285/457 (62%), Gaps = 26/457 (5%) Frame = -3 Query: 1293 PTAPWMAEPLFVKPNEM--------ESMKRRNNKGLELDRIGEYPDEDLTMKKIFKSFEK 1138 PTAPWM P+ ++P+E+ + ++ +KGL G + MKKI ++ EK Sbjct: 56 PTAPWMRSPIVLQPDEIIKPSKPKTKKSFKKTDKGLTAKESGVRGKQ--AMKKIIENIEK 113 Query: 
1137 LQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNEFDIPL 958 LQ+ L+ K KF+F EEN++ +E D+ Sbjct: 114 LQKDQILDETQKK-VMEKFEF----------------------KGCFEENVS-HEEDLRG 149 Query: 957 FSAGKEVKSKKLPWEKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNEAALMKKWVKVM 778 G K+PW +E++ V RR KKE++VT AE LD LLERL++EA M+KWVKV Sbjct: 150 GFGG------KVPWLREDRFVFRRMKKERMVTKAETMLDGELLERLKDEARKMRKWVKVK 203 Query: 777 KAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLVVWRKKDLLAVY 598 KAG+T++VV ++ AWR NE+A++KFD L RNMDRAREI+ELKTGGLV+W KKD VY Sbjct: 204 KAGVTESVVFEIRLAWRRNELAMVKFDVPLCRNMDRAREILELKTGGLVIWTKKDAHVVY 263 Query: 597 RGCNYGRGLENLRNMNYSSAGDQGNPSSNITY-------------QNTGTVARETS---G 466 RG + ++ M SA DQ P S T+ NT T+ + S G Sbjct: 264 RGDSSKSSVK----MCPRSADDQEAPLSKSTHLHLEKKVNVSWIKSNTATLDQNRSLKDG 319 Query: 465 ESN--PHELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKPLQVDGDLLPAV 292 E N P + ++ +++ SLYERE DRLLD LGPRFVDWWM KPL VDGDLLP V Sbjct: 320 EENSLPTSIFMDKNLRIDK----SLYEREGDRLLDGLGPRFVDWWMWKPLPVDGDLLPEV 375 Query: 291 VPGYKTPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAILKLWEKC 112 VPG+K PFRL PP R+KL DDELTYLR+LA PLPTHFVLGRNR LQGLA AILKLWEK Sbjct: 376 VPGFKPPFRLSPPDARSKLTDDELTYLRKLAHPLPTHFVLGRNRGLQGLATAILKLWEKS 435 Query: 111 HIAKIAVKWGVPNTDNEQMANELKKLTGGVLLLRNKF 1 +AKI VKWG+PNTDNEQMANELK LTGGVLLLRNKF Sbjct: 436 LVAKITVKWGIPNTDNEQMANELKHLTGGVLLLRNKF 472 >gb|EXC20503.1| Chloroplastic group IIA intron splicing facilitator CRS1 [Morus notabilis] Length = 828 Score = 385 bits (989), Expect = e-104 Identities = 221/463 (47%), Positives = 279/463 (60%), Gaps = 32/463 (6%) Frame = -3 Query: 1293 PTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEYPDEDLT-----------MKKIFKS 1147 PT PWM PL ++P+E+ + + N +R E LT +KKI + Sbjct: 65 PTPPWMKGPLVLQPHEVTDLSKPENDNKFSNRKAEKSVNGLTDKLVGRRGKNVIKKIARR 124 Query: 1146 FEKLQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNEFD 967 E+L +++ E+ K + +C + E+ +G E Sbjct: 125 IEELGRKSKVDS----EETQKDFVGKNGI--------------GDCLEGLGESRSGGE-- 164 Query: 966 IPLFSAGKEVKSKKLPWEKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNEAALMKKWV 
787 ++PWEK+E V RR KKEK+V++AEL L+ LLERLR+EA M+KWV Sbjct: 165 -------------RMPWEKDEGFVFRRMKKEKIVSSAELRLERELLERLRSEARKMRKWV 211 Query: 786 KVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLVVWRKKDLL 607 KV KAG+T+ VV+ V F W++NE+A++KFD L RNMDRA+EI+E+KTGGLVVWR+KD Sbjct: 212 KVKKAGVTKEVVEDVKFVWKSNELAMVKFDVPLCRNMDRAQEILEMKTGGLVVWRRKDAQ 271 Query: 606 AVYRGCNYGRGLENLRNMNYSSAGDQGNPSSNI---------------TYQNT---GTVA 481 +YRGCNY + +G Q P SN+ +Y+NT Sbjct: 272 VIYRGCNYQPTSKTFPRTYAGFSGHQETPFSNLVQLDSRKGNSVSEVKSYENTIERKISK 331 Query: 480 RETSGESNPHELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKPLQVDGDLL 301 + T GE+ P +I D + +SLY READRLLD LGPRF+DWWM KPL VD DLL Sbjct: 332 KNTEGETIPTAIILKNDANFQ--PSSSLYVREADRLLDGLGPRFIDWWMNKPLPVDADLL 389 Query: 300 PAVVPGYKTPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAILKLW 121 P VVPG++ PFR CPP R+KL D+ELTYLR+LA LPTHFVLGRNR LQGLAAAILKLW Sbjct: 390 PEVVPGFRPPFRRCPPHTRSKLTDEELTYLRKLAHSLPTHFVLGRNRKLQGLAAAILKLW 449 Query: 120 EKCHIAKIAVKWGVPNTDNEQMANELKK---LTGGVLLLRNKF 1 EKCHIAKIAVK GVPNT+NEQMA ELK LTGG LLLRNKF Sbjct: 450 EKCHIAKIAVKLGVPNTNNEQMAYELKARICLTGGDLLLRNKF 492 >ref|XP_006430740.1| hypothetical protein CICLE_v10013368mg [Citrus clementina] gi|557532797|gb|ESR43980.1| hypothetical protein CICLE_v10013368mg [Citrus clementina] Length = 770 Score = 383 bits (983), Expect = e-103 Identities = 230/470 (48%), Positives = 284/470 (60%), Gaps = 39/470 (8%) Frame = -3 Query: 1293 PTAPWMAEPLFVKPNEM--------ESMKRRNNKGLELDRIGEYPDEDLTMKKIFKSFEK 1138 PTAPWM P+ ++P+E+ + ++ +KGL G + MKKI ++ EK Sbjct: 62 PTAPWMRSPIVLQPDEIIKPSKPKTKKSFKKTDKGLTAKESGVRGKQ--AMKKIIENIEK 119 Query: 1137 LQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNEFDIPL 958 LQ+ L+ K + KF+F + E +E D+ Sbjct: 120 LQKDQILDETQKK-DMEKFEF-----------------------RGCFEENGSDEEDLRG 155 Query: 957 FSAGKEVKSKKLPWEKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNEAALMKKWVKVM 778 GK +PW +EE+ V RR KKE++VT AE LD L+ERL++EA M+KWVKV Sbjct: 156 GFGGK------VPWLREERFVFRRMKKERMVTKAETMLDGELIERLKDEARKMRKWVKVK 209 Query: 777 
KAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLVVWRKKDLLAVY 598 KAG+T++VV ++ AWR NE+A++KFD L RNMDRAREI+ELKTGGLV+W KKD VY Sbjct: 210 KAGVTESVVFEIRLAWRRNELAMVKFDVPLCRNMDRAREILELKTGGLVIWTKKDAHVVY 269 Query: 597 RGCNYGRGLENLRNMNYSSAGDQGNPSSNITY-------------QNTGTVARETS---G 466 RG G ++ M SA DQ P S T+ NT T+ + S G Sbjct: 270 RG----DGSKSSVKMCPRSADDQEAPLSKSTHLHLEKKVNVSWIKSNTATLDQNRSLKDG 325 Query: 465 ESN--PHELIHGRDGKLENLEMASLYEREADRLLDELGPRFVDWWMQKPLQVDGDLLPAV 292 E N P + ++ +++ SLYERE DRLLD LGPRFVDWWM KPL VDGDLLP V Sbjct: 326 EENSLPTSIFMDKNLRIDK----SLYEREGDRLLDGLGPRFVDWWMWKPLPVDGDLLPEV 381 Query: 291 VPGYKTPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAILKLWEKC 112 VPG+K PFRL PP R+KL DDELTYLR+LA PLPTHFVLGRNR LQGLA AILKLWEK Sbjct: 382 VPGFKPPFRLSPPDARSKLTDDELTYLRKLAHPLPTHFVLGRNRGLQGLATAILKLWEKS 441 Query: 111 HIAKIAVKWGVPNTDNEQMANELKK-------------LTGGVLLLRNKF 1 +AKIAVKWG+PNTDNEQMANELK LTGGVLLLRNKF Sbjct: 442 LVAKIAVKWGIPNTDNEQMANELKNFKFSDDGVLLMQHLTGGVLLLRNKF 491 >ref|XP_006603058.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like isoform X5 [Glycine max] Length = 744 Score = 377 bits (968), Expect = e-102 Identities = 218/457 (47%), Positives = 274/457 (59%), Gaps = 25/457 (5%) Frame = -3 Query: 1296 APTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEYPDEDL---------TMKKIFKSF 1144 +PT PWM PL ++P+E+ + +K + ++ E D+ L MKKI Sbjct: 46 SPTPPWMKVPLLLQPHELVDLSNPKSKKFKPEK-HELSDKALMGKEVRGKRAMKKIVDRV 104 Query: 1143 EKLQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNEFD- 967 EKL +T + ++ ++LN F Sbjct: 105 EKLHKTQN------------------------------------SNETRVDSLNVENFGG 128 Query: 966 -IPLFSAGKEVKSK-KLPWEKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNEAALMKK 793 + + +EV+SK ++PWEK+EK + K+EK VTAAEL+LD+ LL RLRNEAA M+ Sbjct: 129 YLEILKENEEVRSKGRMPWEKDEKFGFVKVKREKAVTAAELTLDKALLRRLRNEAARMRT 188 Query: 792 WVKVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLVVWRKKD 613 W+KV KAG+TQ VVDQ+ WR NE+A++KFD L RNMDRAREIVE KTGGLVV KKD Sbjct: 189 
WIKVKKAGVTQDVVDQIKRTWRRNELAMIKFDIPLCRNMDRAREIVETKTGGLVVLSKKD 248 Query: 612 LLAVYRGCNYG---RGLENLRNMNYSSAGDQGNPSSNITYQNTGTVARETSGESNPHELI 442 L VYRGCN+ +G +LR +Y + + G + R S S+ L Sbjct: 249 FLVVYRGCNHQLTTKGSPSLRTNHYEM--------NRVELATKGDIFRVESNHSSSEMLN 300 Query: 441 HGRDGKLE----------NLEMASLYEREADRLLDELGPRFVDWWMQKPLQVDGDLLPAV 292 D K L SLYERE +RLLD LGPRF+DWWM KPL VD DLLP Sbjct: 301 WNADHKDSISTGIQDVNCQLVNGSLYERETERLLDGLGPRFIDWWMHKPLPVDADLLPEE 360 Query: 291 VPGYKTPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAILKLWEKC 112 VPG++ PFRLCPP + AKL D ELTY R+LA+ LPTHFVLGRN+ L+GLA+AILKLWEK Sbjct: 361 VPGFQPPFRLCPPHSSAKLTDYELTYFRKLAQSLPTHFVLGRNKGLKGLASAILKLWEKS 420 Query: 111 HIAKIAVKWGVPNTDNEQMANELKKLTGGVLLLRNKF 1 IAKIA+K+G+PNTDNE MANELK LTGGVLLLRNKF Sbjct: 421 LIAKIAIKYGIPNTDNEMMANELKCLTGGVLLLRNKF 457 >ref|XP_006603055.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like isoform X2 [Glycine max] gi|571550194|ref|XP_006603056.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like isoform X3 [Glycine max] gi|571550197|ref|XP_006603057.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like isoform X4 [Glycine max] Length = 747 Score = 377 bits (968), Expect = e-102 Identities = 218/457 (47%), Positives = 274/457 (59%), Gaps = 25/457 (5%) Frame = -3 Query: 1296 APTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEYPDEDL---------TMKKIFKSF 1144 +PT PWM PL ++P+E+ + +K + ++ E D+ L MKKI Sbjct: 46 SPTPPWMKVPLLLQPHELVDLSNPKSKKFKPEK-HELSDKALMGKEVRGKRAMKKIVDRV 104 Query: 1143 EKLQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNEFD- 967 EKL +T + ++ ++LN F Sbjct: 105 EKLHKTQN------------------------------------SNETRVDSLNVENFGG 128 Query: 966 -IPLFSAGKEVKSK-KLPWEKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNEAALMKK 793 + + +EV+SK ++PWEK+EK + K+EK VTAAEL+LD+ LL RLRNEAA M+ Sbjct: 129 YLEILKENEEVRSKGRMPWEKDEKFGFVKVKREKAVTAAELTLDKALLRRLRNEAARMRT 188 Query: 792 
WVKVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLVVWRKKD 613 W+KV KAG+TQ VVDQ+ WR NE+A++KFD L RNMDRAREIVE KTGGLVV KKD Sbjct: 189 WIKVKKAGVTQDVVDQIKRTWRRNELAMIKFDIPLCRNMDRAREIVETKTGGLVVLSKKD 248 Query: 612 LLAVYRGCNYG---RGLENLRNMNYSSAGDQGNPSSNITYQNTGTVARETSGESNPHELI 442 L VYRGCN+ +G +LR +Y + + G + R S S+ L Sbjct: 249 FLVVYRGCNHQLTTKGSPSLRTNHYEM--------NRVELATKGDIFRVESNHSSSEMLN 300 Query: 441 HGRDGKLE----------NLEMASLYEREADRLLDELGPRFVDWWMQKPLQVDGDLLPAV 292 D K L SLYERE +RLLD LGPRF+DWWM KPL VD DLLP Sbjct: 301 WNADHKDSISTGIQDVNCQLVNGSLYERETERLLDGLGPRFIDWWMHKPLPVDADLLPEE 360 Query: 291 VPGYKTPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAILKLWEKC 112 VPG++ PFRLCPP + AKL D ELTY R+LA+ LPTHFVLGRN+ L+GLA+AILKLWEK Sbjct: 361 VPGFQPPFRLCPPHSSAKLTDYELTYFRKLAQSLPTHFVLGRNKGLKGLASAILKLWEKS 420 Query: 111 HIAKIAVKWGVPNTDNEQMANELKKLTGGVLLLRNKF 1 IAKIA+K+G+PNTDNE MANELK LTGGVLLLRNKF Sbjct: 421 LIAKIAIKYGIPNTDNEMMANELKCLTGGVLLLRNKF 457 >ref|XP_006603054.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like isoform X1 [Glycine max] Length = 750 Score = 377 bits (968), Expect = e-102 Identities = 218/457 (47%), Positives = 274/457 (59%), Gaps = 25/457 (5%) Frame = -3 Query: 1296 APTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEYPDEDL---------TMKKIFKSF 1144 +PT PWM PL ++P+E+ + +K + ++ E D+ L MKKI Sbjct: 46 SPTPPWMKVPLLLQPHELVDLSNPKSKKFKPEK-HELSDKALMGKEVRGKRAMKKIVDRV 104 Query: 1143 EKLQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNEFD- 967 EKL +T + ++ ++LN F Sbjct: 105 EKLHKTQN------------------------------------SNETRVDSLNVENFGG 128 Query: 966 -IPLFSAGKEVKSK-KLPWEKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNEAALMKK 793 + + +EV+SK ++PWEK+EK + K+EK VTAAEL+LD+ LL RLRNEAA M+ Sbjct: 129 YLEILKENEEVRSKGRMPWEKDEKFGFVKVKREKAVTAAELTLDKALLRRLRNEAARMRT 188 Query: 792 WVKVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLVVWRKKD 613 W+KV KAG+TQ VVDQ+ WR NE+A++KFD L RNMDRAREIVE KTGGLVV KKD Sbjct: 189 WIKVKKAGVTQDVVDQIKRTWRRNELAMIKFDIPLCRNMDRAREIVETKTGGLVVLSKKD 248 Query: 612 
LLAVYRGCNYG---RGLENLRNMNYSSAGDQGNPSSNITYQNTGTVARETSGESNPHELI 442 L VYRGCN+ +G +LR +Y + + G + R S S+ L Sbjct: 249 FLVVYRGCNHQLTTKGSPSLRTNHYEM--------NRVELATKGDIFRVESNHSSSEMLN 300 Query: 441 HGRDGKLE----------NLEMASLYEREADRLLDELGPRFVDWWMQKPLQVDGDLLPAV 292 D K L SLYERE +RLLD LGPRF+DWWM KPL VD DLLP Sbjct: 301 WNADHKDSISTGIQDVNCQLVNGSLYERETERLLDGLGPRFIDWWMHKPLPVDADLLPEE 360 Query: 291 VPGYKTPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAILKLWEKC 112 VPG++ PFRLCPP + AKL D ELTY R+LA+ LPTHFVLGRN+ L+GLA+AILKLWEK Sbjct: 361 VPGFQPPFRLCPPHSSAKLTDYELTYFRKLAQSLPTHFVLGRNKGLKGLASAILKLWEKS 420 Query: 111 HIAKIAVKWGVPNTDNEQMANELKKLTGGVLLLRNKF 1 IAKIA+K+G+PNTDNE MANELK LTGGVLLLRNKF Sbjct: 421 LIAKIAIKYGIPNTDNEMMANELKCLTGGVLLLRNKF 457 >ref|XP_007217313.1| hypothetical protein PRUPE_ppa016241mg [Prunus persica] gi|462413463|gb|EMJ18512.1| hypothetical protein PRUPE_ppa016241mg [Prunus persica] Length = 809 Score = 374 bits (961), Expect = e-101 Identities = 224/466 (48%), Positives = 281/466 (60%), Gaps = 34/466 (7%) Frame = -3 Query: 1296 APTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEYPDE-----------DLTMKKIFK 1150 APTAPWM PL ++P+E+ + NK + E PD D +K+I + Sbjct: 87 APTAPWMKGPLLLQPHEVIDFSKPRNKKTHNNAKAEKPDTVLAGKLVGIRGDKAIKQIVQ 146 Query: 1149 SFEKLQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNEF 970 S E+L + K G+F+ + K E + + EF Sbjct: 147 SIERLGPNQKTDETQKGF--GEFRI------------WDSLEGLGQNEKWDETHKDFVEF 192 Query: 969 DIP--LFSAGKEVKSK---KLPWEKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNEAA 805 I L GK S+ K+PWE++E++V +R KK++V +AAELSL++ LLERLR EAA Sbjct: 193 GIGGCLEGLGKAADSRFGGKMPWERDERIVFQRIKKKRVASAAELSLEKELLERLRAEAA 252 Query: 804 LMKKWVKVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLVVW 625 M+KWVKV KAG+TQA+VD + F W+ NE+A++KFD L RNM RA+EIVE KTGG+VVW Sbjct: 253 KMRKWVKVKKAGVTQAIVDDIKFIWKTNELAMVKFDVPLCRNMHRAQEIVETKTGGMVVW 312 Query: 624 RKKDLLAVYRGCNYGRGLENLRNMNYSSAGDQGNPSS---------NITYQNTG---TVA 481 KKD L +YRGCNY + M SA Q SS N +YQ V Sbjct: 313 GKKDTLVIYRGCNYQSSSKFFPKMRPCSADRQETLSSDHMQPDLEENSSYQYKSFESPVD 372 
Query: 480 RETSGESNPHELIHGRDGKLENLEMA------SLYEREADRLLDELGPRFVDWWMQKPLQ 319 + S + + I + G + M+ SLYE+EADRLLD LGPRF+DWWM KPL Sbjct: 373 EKMSRKDAEEDCI--QSGTFQETSMSCQPTSRSLYEKEADRLLDGLGPRFIDWWMHKPLP 430 Query: 318 VDGDLLPAVVPGYKTPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAA 139 VD DLLP VVPG+K P R CPP R+KL DDELT+LR+ AR LPTHFVLGRNR LQGLAA Sbjct: 431 VDADLLPEVVPGFKAPIRRCPPHTRSKLTDDELTFLRKFARSLPTHFVLGRNRKLQGLAA 490 Query: 138 AILKLWEKCHIAKIAVKWGVPNTDNEQMANELKKLTGGVLLLRNKF 1 AILKLWEK IAKIAVK+GVPNT+NEQMA EL+ VL+LRNKF Sbjct: 491 AILKLWEKSLIAKIAVKFGVPNTNNEQMAYELR---ARVLILRNKF 533 >ref|XP_007139175.1| hypothetical protein PHAVU_008G007700g [Phaseolus vulgaris] gi|561012308|gb|ESW11169.1| hypothetical protein PHAVU_008G007700g [Phaseolus vulgaris] Length = 744 Score = 370 bits (951), Expect = e-100 Identities = 218/452 (48%), Positives = 270/452 (59%), Gaps = 21/452 (4%) Frame = -3 Query: 1293 PTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEYPDEDL---------TMKKIFKSFE 1141 PT PWM PL ++PNE+ + +K +L+R E D+DL TMKKI + E Sbjct: 43 PTPPWMKGPLLLQPNELLDLSNPKSKKFKLER-QELSDKDLMGKEARGKKTMKKIVEKVE 101 Query: 1140 KLQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNEFDIP 961 KL TH+ + GAL EN+ G + Sbjct: 102 KLHGTHN---------------SAGALIGSPNV----------------ENIGGV---LD 127 Query: 960 LFSAGKEVKSKK--LPWEKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNEAALMKKWV 787 +EV+ K +PWE + K V + K+++ VTAAEL+LD++L RLRNEAA M+ W+ Sbjct: 128 SLKENEEVRRTKGRMPWENDWKFVYEKIKRKRTVTAAELTLDKVLFRRLRNEAATMRTWI 187 Query: 786 KVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLVVWRKKDLL 607 KV KAG+TQ VVDQ+ + WR NE+A++KFD L RNM RAREIVE KTGGLVV KKD L Sbjct: 188 KVKKAGVTQDVVDQIKWTWRRNELAMVKFDIPLCRNMSRAREIVETKTGGLVVLSKKDFL 247 Query: 606 AVYRGCNYGRGLENLRNMNYSSAGDQGNPSSNITYQNTGTVARETSGESNPHELIHGRDG 427 VY G N+ L Y S + S TG + S S L + Sbjct: 248 VVYHGGNH-----QLTTTGYPSLRTNHSEMSGAELATTGDICSVDSNHSLSEMLNFIAED 302 Query: 426 KLE--------NLEMA--SLYEREADRLLDELGPRFVDWWMQKPLQVDGDLLPAVVPGYK 277 K N + A SLYERE DRLLD+LGPRF+DWWM KPL VD DLLP VPG++ Sbjct: 303 
KDSIATSEQNMNFQTANGSLYERETDRLLDDLGPRFIDWWMAKPLPVDADLLPEDVPGFQ 362 Query: 276 TPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAILKLWEKCHIAKI 97 P R+CPP + AKL+D ELTY R+LA+ LPTHFVLGRN+ L+GLAAAILKLWEK IAKI Sbjct: 363 PPLRICPPHSCAKLSDYELTYFRKLAQLLPTHFVLGRNKRLKGLAAAILKLWEKSLIAKI 422 Query: 96 AVKWGVPNTDNEQMANELKKLTGGVLLLRNKF 1 ++K+G+PNTDNE MANELK LTGGVLLLRNKF Sbjct: 423 SIKYGIPNTDNEMMANELKYLTGGVLLLRNKF 454 >ref|XP_004138635.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like [Cucumis sativus] Length = 760 Score = 370 bits (951), Expect = e-100 Identities = 214/463 (46%), Positives = 269/463 (58%), Gaps = 33/463 (7%) Frame = -3 Query: 1290 TAPWMAEPLFVKPNEME-------SMKRRNNKGLE-------------LDRIGEYPDEDL 1171 TAPWM PL ++P + E + KRRN +D+ G+Y Sbjct: 62 TAPWMKAPLHLQPQQQEEEGVDPANPKRRNGSDGSGRDKCSRALGDSGIDKTGKY----- 116 Query: 1170 TMKKIFKSFEKLQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEE 991 M++I KS KL+ DL E +F Sbjct: 117 AMRRIAKSIGKLRRNGDLGETRMKLEEVEF------------------------------ 146 Query: 990 NLNGNEFDIPLFSAGKEVKSKKLPWEKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNE 811 FD+ F + +++PWEK++ ++ R K+K VT+AEL+LD +LLERL+ E Sbjct: 147 ----GGFDLEGFE--ESGTRRRMPWEKDDDGIVLRRMKKKTVTSAELNLDRVLLERLKGE 200 Query: 810 AALMKKWVKVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLV 631 A+ M+KWVKV K G+TQ VV+Q+ F W NE+A+LKFD LSRNMDRAREIVE+KTGG+V Sbjct: 201 ASKMEKWVKVNKVGVTQDVVNQIQFMWERNELAMLKFDVPLSRNMDRAREIVEMKTGGMV 260 Query: 630 VWRKKDLLAVYRGCNYGRGLENLRNMNYSSAGDQGNPSSNITYQNTGTVARETSGESNPH 451 VW KK+ L VYRGCNY L++ + P + + + + ES + Sbjct: 261 VWSKKNALVVYRGCNYPLNLKHSTKKQVHIS-----PQNPVKVETDTHFSLSGHYESGLN 315 Query: 450 ELIHGRDGKLE-----------NLE--MASLYEREADRLLDELGPRFVDWWMQKPLQVDG 310 I+ DG+ E NL+ SLYERE DRLLD+LGPRF+DWWM KPL VD Sbjct: 316 RSINDNDGEWEEASSFFLIRHENLQPLSGSLYERETDRLLDDLGPRFIDWWMHKPLPVDA 375 Query: 309 DLLPAVVPGYKTPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAIL 130 D+LP VVPGY PFR CPP+ + L D L +LR+LA LPTHFVLGRNR LQGLAA+IL Sbjct: 376 
DMLPEVVPGYMPPFRRCPPYTKQNLTDAGLQHLRKLAHSLPTHFVLGRNRKLQGLAASIL 435 Query: 129 KLWEKCHIAKIAVKWGVPNTDNEQMANELKKLTGGVLLLRNKF 1 KLWEK IAKIA+KWGVPNTDNEQMA ELK LTGG LLLRNKF Sbjct: 436 KLWEKSMIAKIALKWGVPNTDNEQMALELKNLTGGTLLLRNKF 478 >ref|XP_004507538.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like [Cicer arietinum] Length = 764 Score = 370 bits (949), Expect = 1e-99 Identities = 207/446 (46%), Positives = 273/446 (61%), Gaps = 14/446 (3%) Frame = -3 Query: 1296 APTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEYPDEDLTMKKIFKSFEKLQETHDL 1117 +PT PW+ PL ++P + N +E + D+ L K+I + H + Sbjct: 56 SPTPPWIKSPLHLQPQQ-----HLLNSNVEKSDLS---DKALNSKEISGKKVLRKIAHKV 107 Query: 1116 EAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNEFDIPLFSAGKEV 937 E +K +S K + ++ E + + + +EV Sbjct: 108 EKLHKALDSEKNE---------------------TLTQMGSEKVENFGDCLDILMENEEV 146 Query: 936 KSK-KLPWEKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNEAALMKKWVKVMKAGITQ 760 +K ++PWEK+EK+ + K+EK +AA+L++D+++L RLR EAA M+KWVKV K G+TQ Sbjct: 147 VNKGRMPWEKDEKIGFFKVKREKTFSAADLNVDKVVLHRLRGEAARMRKWVKVKKIGVTQ 206 Query: 759 AVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLVVWRKKDLLAVYRGCNY- 583 VVD++ +WR NE+A++KFD L +NM RAREIVE KTGGLV+W KKD L VYRGCNY Sbjct: 207 DVVDEIKRSWRMNELAMVKFDIPLCQNMGRAREIVETKTGGLVIWCKKDTLVVYRGCNYQ 266 Query: 582 --GRGLENLRNMNYSSAGDQGNPSSNITYQNTGTVARETSGESNPHELIHGRDGK----L 421 + + S ++ + G ++R S +S+ L + K Sbjct: 267 LTSKSSPKIHTGYIRSQKTNSYETNEVKSATKGDLSRVESTQSSSEILSSNAEHKDSLST 326 Query: 420 ENLEM------ASLYEREADRLLDELGPRFVDWWMQKPLQVDGDLLPAVVPGYKTPFRLC 259 +N M SLYE+E DRLLD LGPRFVDWWM KPL VD DLLP VVPG++ PFRLC Sbjct: 327 DNYNMNYQPRSGSLYEKECDRLLDGLGPRFVDWWMDKPLPVDADLLPEVVPGFEPPFRLC 386 Query: 258 PPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAILKLWEKCHIAKIAVKWGV 79 PP R+KL DDELTY R+++ PLPTHFVLGRNR LQGLAAAILKLW+K H AKIA+K+GV Sbjct: 387 PPHARSKLTDDELTYFRKISHPLPTHFVLGRNRGLQGLAAAILKLWQKSHTAKIAIKYGV 446 Query: 78 PNTDNEQMANELKKLTGGVLLLRNKF 1 PNTDNE MANELK+LTGGVLLLRNKF Sbjct: 447 PNTDNEVMANELKRLTGGVLLLRNKF 472 
>ref|XP_004158502.1| PREDICTED: LOW QUALITY PROTEIN: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like [Cucumis sativus] Length = 760 Score = 369 bits (946), Expect = 3e-99 Identities = 212/463 (45%), Positives = 269/463 (58%), Gaps = 33/463 (7%) Frame = -3 Query: 1290 TAPWMAEPLFVKPNEME-------SMKRRNNKGLE-------------LDRIGEYPDEDL 1171 TAPWM PL ++P + E + KRRN +D+ G+Y Sbjct: 62 TAPWMKAPLHLQPQQQEEEGVDPANPKRRNGSDGSGRDKCSRALGDSGIDKTGKY----- 116 Query: 1170 TMKKIFKSFEKLQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEE 991 M++I KS KL+ DL E +F Sbjct: 117 AMRRIAKSIGKLRRNGDLGETRMKLEEVEF------------------------------ 146 Query: 990 NLNGNEFDIPLFSAGKEVKSKKLPWEKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNE 811 +FD+ F + +++PWEK++ ++ R K+K VT+AEL+LD +LLERL+ E Sbjct: 147 ----GDFDLEGFE--ESGTRRRMPWEKDDDGIVLRRMKKKTVTSAELNLDRVLLERLKGE 200 Query: 810 AALMKKWVKVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLV 631 A+ M+KWVKV K G+TQ VV+Q+ F W NE+A+LKFD LSRNMDRAREIVE+KTGG+V Sbjct: 201 ASKMEKWVKVNKVGVTQDVVNQIQFMWERNELAMLKFDVPLSRNMDRAREIVEMKTGGMV 260 Query: 630 VWRKKDLLAVYRGCNYGRGLENLRNMNYSSAGDQGNPSSNITYQNTGTVARETSGESNPH 451 VW KK+ L +YRGCNY L++ + P + + + + ES + Sbjct: 261 VWSKKNALVIYRGCNYPLNLKHSTKKQVHIS-----PQNPVKVETDTHFSLSGHYESGLN 315 Query: 450 ELIHGRDGKLE-----------NLE--MASLYEREADRLLDELGPRFVDWWMQKPLQVDG 310 I+ DG+ E NL+ SLYERE DRLLD+LGPRF+DWWM KPL VD Sbjct: 316 RSINDNDGEWEEASSFFLIRHENLQPLSGSLYERETDRLLDDLGPRFIDWWMHKPLPVDA 375 Query: 309 DLLPAVVPGYKTPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAIL 130 D+L VVPGY PFR CPP+ + L D L +LR+LA LPTHFVLGRNR LQGLAA+IL Sbjct: 376 DMLQEVVPGYMPPFRRCPPYTKQNLTDAGLQHLRKLAHSLPTHFVLGRNRKLQGLAASIL 435 Query: 129 KLWEKCHIAKIAVKWGVPNTDNEQMANELKKLTGGVLLLRNKF 1 KLWEK IAKIA+KWGVPNTDNEQMA ELK LTGG LLLRNKF Sbjct: 436 KLWEKSMIAKIALKWGVPNTDNEQMALELKNLTGGTLLLRNKF 478 >ref|XP_007033221.1| maize chloroplast splicing factor CRS1, putative isoform 5 [Theobroma cacao] gi|508712250|gb|EOY04147.1| maize chloroplast splicing factor CRS1, putative isoform 5 
[Theobroma cacao] Length = 788 Score = 367 bits (943), Expect = 6e-99 Identities = 216/463 (46%), Positives = 276/463 (59%), Gaps = 32/463 (6%) Frame = -3 Query: 1293 PTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEYPDEDL-----------TMKKIFKS 1147 PTAPWM PL ++P+E+ + + +K + + PD+ L MKKI ++ Sbjct: 91 PTAPWMKGPLLLQPHEVLNPSKSTSKKSSNSK-AKAPDKALFGKESGVRGKKVMKKIIRN 149 Query: 1146 FEKLQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNEFD 967 E LQ LE + + G EE GN + Sbjct: 150 VEMLQGNEVLE----DTQIG----------------------------IREEFEVGNWLE 177 Query: 966 IPLFSAGKEVK--SKKLPW-EKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNEAALMK 796 F + EVK K+PW +EEK+V RR KKEK++T AE+SLD+ LLERLR +A M+ Sbjct: 178 E--FGSDGEVKRFDGKMPWLREEEKVVFRRMKKEKLLTQAEISLDKDLLERLRRKAMRMR 235 Query: 795 KWVKVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLVVWRKK 616 KW+KVMK G+T+AVVD++ AWR NE+ ++KF L RNMDRAREI+E+KT GLVVW KK Sbjct: 236 KWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCRNMDRAREIIEMKTRGLVVWGKK 295 Query: 615 DLLAVYRGCNYGRGLENLRNMNYSSAGD----QGNPSSNITYQNTGTVARETSGESNPHE 448 D L VYRGC++G + +M Y D + S++T N ++ E S Sbjct: 296 DALVVYRGCSHGL-TSKISSMKYPRCADGQEISSSTFSHLTSSNNINMSLEKFNGSTLQS 354 Query: 447 LIHGRDGKLENLEM--------------ASLYEREADRLLDELGPRFVDWWMQKPLQVDG 310 ++ D + E++ + SLYERE DRLLD LGPRF+DWWM+KPL +D Sbjct: 355 GLYREDREKESMPINIFMKEDENNQPVIGSLYERETDRLLDGLGPRFIDWWMRKPLPIDA 414 Query: 309 DLLPAVVPGYKTPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAIL 130 DLLP VPG++ P RL PP R L DDEL YLR+L PLP HF LG+NRNLQGLAAAIL Sbjct: 415 DLLPEEVPGFRPPLRLSPPNTRPNLTDDELKYLRKLTHPLPFHFALGKNRNLQGLAAAIL 474 Query: 129 KLWEKCHIAKIAVKWGVPNTDNEQMANELKKLTGGVLLLRNKF 1 KLWEK IAKIA+KWG+ NTDNEQMA ELK LTGGVLL+RNKF Sbjct: 475 KLWEKSLIAKIAIKWGIQNTDNEQMAYELKNLTGGVLLVRNKF 517 >ref|XP_007033220.1| maize chloroplast splicing factor CRS1, putative isoform 4 [Theobroma cacao] gi|508712249|gb|EOY04146.1| maize chloroplast splicing factor CRS1, putative isoform 4 [Theobroma cacao] Length = 767 Score = 367 bits (943), Expect = 6e-99 Identities = 216/463 (46%), Positives = 276/463 
(59%), Gaps = 32/463 (6%) Frame = -3 Query: 1293 PTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEYPDEDL-----------TMKKIFKS 1147 PTAPWM PL ++P+E+ + + +K + + PD+ L MKKI ++ Sbjct: 65 PTAPWMKGPLLLQPHEVLNPSKSTSKKSSNSK-AKAPDKALFGKESGVRGKKVMKKIIRN 123 Query: 1146 FEKLQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNEFD 967 E LQ LE + + G EE GN + Sbjct: 124 VEMLQGNEVLE----DTQIG----------------------------IREEFEVGNWLE 151 Query: 966 IPLFSAGKEVK--SKKLPW-EKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNEAALMK 796 F + EVK K+PW +EEK+V RR KKEK++T AE+SLD+ LLERLR +A M+ Sbjct: 152 E--FGSDGEVKRFDGKMPWLREEEKVVFRRMKKEKLLTQAEISLDKDLLERLRRKAMRMR 209 Query: 795 KWVKVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLVVWRKK 616 KW+KVMK G+T+AVVD++ AWR NE+ ++KF L RNMDRAREI+E+KT GLVVW KK Sbjct: 210 KWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCRNMDRAREIIEMKTRGLVVWGKK 269 Query: 615 DLLAVYRGCNYGRGLENLRNMNYSSAGD----QGNPSSNITYQNTGTVARETSGESNPHE 448 D L VYRGC++G + +M Y D + S++T N ++ E S Sbjct: 270 DALVVYRGCSHGL-TSKISSMKYPRCADGQEISSSTFSHLTSSNNINMSLEKFNGSTLQS 328 Query: 447 LIHGRDGKLENLEM--------------ASLYEREADRLLDELGPRFVDWWMQKPLQVDG 310 ++ D + E++ + SLYERE DRLLD LGPRF+DWWM+KPL +D Sbjct: 329 GLYREDREKESMPINIFMKEDENNQPVIGSLYERETDRLLDGLGPRFIDWWMRKPLPIDA 388 Query: 309 DLLPAVVPGYKTPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAIL 130 DLLP VPG++ P RL PP R L DDEL YLR+L PLP HF LG+NRNLQGLAAAIL Sbjct: 389 DLLPEEVPGFRPPLRLSPPNTRPNLTDDELKYLRKLTHPLPFHFALGKNRNLQGLAAAIL 448 Query: 129 KLWEKCHIAKIAVKWGVPNTDNEQMANELKKLTGGVLLLRNKF 1 KLWEK IAKIA+KWG+ NTDNEQMA ELK LTGGVLL+RNKF Sbjct: 449 KLWEKSLIAKIAIKWGIQNTDNEQMAYELKNLTGGVLLVRNKF 491 >ref|XP_007033219.1| maize chloroplast splicing factor CRS1, putative isoform 3 [Theobroma cacao] gi|508712248|gb|EOY04145.1| maize chloroplast splicing factor CRS1, putative isoform 3 [Theobroma cacao] Length = 788 Score = 367 bits (943), Expect = 6e-99 Identities = 216/463 (46%), Positives = 276/463 (59%), Gaps = 32/463 (6%) Frame = -3 Query: 1293 PTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEYPDEDL-----------TMKKIFKS 1147 PTAPWM 
PL ++P+E+ + + +K + + PD+ L MKKI ++ Sbjct: 65 PTAPWMKGPLLLQPHEVLNPSKSTSKKSSNSK-AKAPDKALFGKESGVRGKKVMKKIIRN 123 Query: 1146 FEKLQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNEFD 967 E LQ LE + + G EE GN + Sbjct: 124 VEMLQGNEVLE----DTQIG----------------------------IREEFEVGNWLE 151 Query: 966 IPLFSAGKEVK--SKKLPW-EKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNEAALMK 796 F + EVK K+PW +EEK+V RR KKEK++T AE+SLD+ LLERLR +A M+ Sbjct: 152 E--FGSDGEVKRFDGKMPWLREEEKVVFRRMKKEKLLTQAEISLDKDLLERLRRKAMRMR 209 Query: 795 KWVKVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLVVWRKK 616 KW+KVMK G+T+AVVD++ AWR NE+ ++KF L RNMDRAREI+E+KT GLVVW KK Sbjct: 210 KWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCRNMDRAREIIEMKTRGLVVWGKK 269 Query: 615 DLLAVYRGCNYGRGLENLRNMNYSSAGD----QGNPSSNITYQNTGTVARETSGESNPHE 448 D L VYRGC++G + +M Y D + S++T N ++ E S Sbjct: 270 DALVVYRGCSHGL-TSKISSMKYPRCADGQEISSSTFSHLTSSNNINMSLEKFNGSTLQS 328 Query: 447 LIHGRDGKLENLEM--------------ASLYEREADRLLDELGPRFVDWWMQKPLQVDG 310 ++ D + E++ + SLYERE DRLLD LGPRF+DWWM+KPL +D Sbjct: 329 GLYREDREKESMPINIFMKEDENNQPVIGSLYERETDRLLDGLGPRFIDWWMRKPLPIDA 388 Query: 309 DLLPAVVPGYKTPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAIL 130 DLLP VPG++ P RL PP R L DDEL YLR+L PLP HF LG+NRNLQGLAAAIL Sbjct: 389 DLLPEEVPGFRPPLRLSPPNTRPNLTDDELKYLRKLTHPLPFHFALGKNRNLQGLAAAIL 448 Query: 129 KLWEKCHIAKIAVKWGVPNTDNEQMANELKKLTGGVLLLRNKF 1 KLWEK IAKIA+KWG+ NTDNEQMA ELK LTGGVLL+RNKF Sbjct: 449 KLWEKSLIAKIAIKWGIQNTDNEQMAYELKNLTGGVLLVRNKF 491 >ref|XP_007033218.1| maize chloroplast splicing factor CRS1, putative isoform 2 [Theobroma cacao] gi|508712247|gb|EOY04144.1| maize chloroplast splicing factor CRS1, putative isoform 2 [Theobroma cacao] Length = 804 Score = 367 bits (943), Expect = 6e-99 Identities = 216/463 (46%), Positives = 276/463 (59%), Gaps = 32/463 (6%) Frame = -3 Query: 1293 PTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEYPDEDL-----------TMKKIFKS 1147 PTAPWM PL ++P+E+ + + +K + + PD+ L MKKI ++ Sbjct: 91 PTAPWMKGPLLLQPHEVLNPSKSTSKKSSNSK-AKAPDKALFGKESGVRGKKVMKKIIRN 149 Query: 1146 
FEKLQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNEFD 967 E LQ LE + + G EE GN + Sbjct: 150 VEMLQGNEVLE----DTQIG----------------------------IREEFEVGNWLE 177 Query: 966 IPLFSAGKEVK--SKKLPW-EKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNEAALMK 796 F + EVK K+PW +EEK+V RR KKEK++T AE+SLD+ LLERLR +A M+ Sbjct: 178 E--FGSDGEVKRFDGKMPWLREEEKVVFRRMKKEKLLTQAEISLDKDLLERLRRKAMRMR 235 Query: 795 KWVKVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLVVWRKK 616 KW+KVMK G+T+AVVD++ AWR NE+ ++KF L RNMDRAREI+E+KT GLVVW KK Sbjct: 236 KWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCRNMDRAREIIEMKTRGLVVWGKK 295 Query: 615 DLLAVYRGCNYGRGLENLRNMNYSSAGD----QGNPSSNITYQNTGTVARETSGESNPHE 448 D L VYRGC++G + +M Y D + S++T N ++ E S Sbjct: 296 DALVVYRGCSHGL-TSKISSMKYPRCADGQEISSSTFSHLTSSNNINMSLEKFNGSTLQS 354 Query: 447 LIHGRDGKLENLEM--------------ASLYEREADRLLDELGPRFVDWWMQKPLQVDG 310 ++ D + E++ + SLYERE DRLLD LGPRF+DWWM+KPL +D Sbjct: 355 GLYREDREKESMPINIFMKEDENNQPVIGSLYERETDRLLDGLGPRFIDWWMRKPLPIDA 414 Query: 309 DLLPAVVPGYKTPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAIL 130 DLLP VPG++ P RL PP R L DDEL YLR+L PLP HF LG+NRNLQGLAAAIL Sbjct: 415 DLLPEEVPGFRPPLRLSPPNTRPNLTDDELKYLRKLTHPLPFHFALGKNRNLQGLAAAIL 474 Query: 129 KLWEKCHIAKIAVKWGVPNTDNEQMANELKKLTGGVLLLRNKF 1 KLWEK IAKIA+KWG+ NTDNEQMA ELK LTGGVLL+RNKF Sbjct: 475 KLWEKSLIAKIAIKWGIQNTDNEQMAYELKNLTGGVLLVRNKF 517 >ref|XP_007033217.1| maize chloroplast splicing factor CRS1, putative isoform 1 [Theobroma cacao] gi|508712246|gb|EOY04143.1| maize chloroplast splicing factor CRS1, putative isoform 1 [Theobroma cacao] Length = 818 Score = 367 bits (943), Expect = 6e-99 Identities = 216/463 (46%), Positives = 276/463 (59%), Gaps = 32/463 (6%) Frame = -3 Query: 1293 PTAPWMAEPLFVKPNEMESMKRRNNKGLELDRIGEYPDEDL-----------TMKKIFKS 1147 PTAPWM PL ++P+E+ + + +K + + PD+ L MKKI ++ Sbjct: 91 PTAPWMKGPLLLQPHEVLNPSKSTSKKSSNSK-AKAPDKALFGKESGVRGKKVMKKIIRN 149 Query: 1146 FEKLQETHDLEAFNKNHESGKFKFAPGALCXXXXXXXXXXXXXXECSKAAEENLNGNEFD 967 E LQ LE + + G EE GN + Sbjct: 150 
VEMLQGNEVLE----DTQIG----------------------------IREEFEVGNWLE 177 Query: 966 IPLFSAGKEVK--SKKLPW-EKEEKMVIRRAKKEKVVTAAELSLDEMLLERLRNEAALMK 796 F + EVK K+PW +EEK+V RR KKEK++T AE+SLD+ LLERLR +A M+ Sbjct: 178 E--FGSDGEVKRFDGKMPWLREEEKVVFRRMKKEKLLTQAEISLDKDLLERLRRKAMRMR 235 Query: 795 KWVKVMKAGITQAVVDQVHFAWRNNEVALLKFDRLLSRNMDRAREIVELKTGGLVVWRKK 616 KW+KVMK G+T+AVVD++ AWR NE+ ++KF L RNMDRAREI+E+KT GLVVW KK Sbjct: 236 KWIKVMKLGVTKAVVDEIKLAWRKNELVMVKFGVPLCRNMDRAREIIEMKTRGLVVWGKK 295 Query: 615 DLLAVYRGCNYGRGLENLRNMNYSSAGD----QGNPSSNITYQNTGTVARETSGESNPHE 448 D L VYRGC++G + +M Y D + S++T N ++ E S Sbjct: 296 DALVVYRGCSHGL-TSKISSMKYPRCADGQEISSSTFSHLTSSNNINMSLEKFNGSTLQS 354 Query: 447 LIHGRDGKLENLEM--------------ASLYEREADRLLDELGPRFVDWWMQKPLQVDG 310 ++ D + E++ + SLYERE DRLLD LGPRF+DWWM+KPL +D Sbjct: 355 GLYREDREKESMPINIFMKEDENNQPVIGSLYERETDRLLDGLGPRFIDWWMRKPLPIDA 414 Query: 309 DLLPAVVPGYKTPFRLCPPFNRAKLADDELTYLRRLARPLPTHFVLGRNRNLQGLAAAIL 130 DLLP VPG++ P RL PP R L DDEL YLR+L PLP HF LG+NRNLQGLAAAIL Sbjct: 415 DLLPEEVPGFRPPLRLSPPNTRPNLTDDELKYLRKLTHPLPFHFALGKNRNLQGLAAAIL 474 Query: 129 KLWEKCHIAKIAVKWGVPNTDNEQMANELKKLTGGVLLLRNKF 1 KLWEK IAKIA+KWG+ NTDNEQMA ELK LTGGVLL+RNKF Sbjct: 475 KLWEKSLIAKIAIKWGIQNTDNEQMAYELKNLTGGVLLVRNKF 517