BLASTX nr result
ID: Catharanthus23_contig00000844
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00000844 (3245 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006341605.1| PREDICTED: chloroplastic group IIA intron sp...   954   0.0
ref|XP_004235759.1| PREDICTED: chloroplastic group IIA intron sp...   952   0.0
gb|EOY21034.1| CRS1 / YhbY domain-containing protein [Theobroma ...   914   0.0
emb|CBI15459.3| unnamed protein product [Vitis vinifera]              900   0.0
emb|CAN72582.1| hypothetical protein VITISV_035294 [Vitis vinifera]   894   0.0
gb|EMJ12507.1| hypothetical protein PRUPE_ppa001468mg [Prunus pe...   889   0.0
ref|XP_004300521.1| PREDICTED: chloroplastic group IIA intron sp...   871   0.0
ref|XP_002317913.2| hypothetical protein POPTR_0012s05260g [Popu...   868   0.0
gb|EXB56911.1| Chloroplastic group IIA intron splicing facilitat...   868   0.0
ref|XP_006440978.1| hypothetical protein CICLE_v10018859mg [Citr...   865   0.0
ref|XP_006485796.1| PREDICTED: chloroplastic group IIA intron sp...   861   0.0
ref|XP_006440981.1| hypothetical protein CICLE_v10018859mg [Citr...   857   0.0
ref|XP_002532154.1| conserved hypothetical protein [Ricinus comm...   845   0.0
ref|XP_003530015.2| PREDICTED: chloroplastic group IIA intron sp...   835   0.0
gb|EPS74467.1| hypothetical protein M569_00278, partial [Genlise...   830   0.0
ref|XP_004512920.1| PREDICTED: chloroplastic group IIA intron sp...   830   0.0
ref|XP_003533366.1| PREDICTED: chloroplastic group IIA intron sp...   825   0.0
ref|XP_006406610.1| hypothetical protein EUTSA_v10020034mg [Eutr...   821   0.0
ref|XP_002883129.1| EMB1865 [Arabidopsis lyrata subsp. lyrata] g...
820 0.0 ref|NP_188468.1| CRS1 / YhbY domain-containing protein EMB1865 [... 816 0.0 >ref|XP_006341605.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like [Solanum tuberosum] Length = 824 Score = 954 bits (2465), Expect = 0.0 Identities = 524/821 (63%), Positives = 598/821 (72%), Gaps = 23/821 (2%) Frame = -1 Query: 3110 ISLKASRMVMATAKLYSEMXXXXXXXXLSIPSRKSSFSVFFLKPFSSSLHPTTKLPRKAT 2931 ++L ++ T +L+S S PS + F +F + ++ + T +PRK Sbjct: 1 MALSTAKFTQLTPQLFSSF---------STPSDRPPFFLFLRRTITAG-NTRTNIPRKDN 50 Query: 2930 QVP---NFSSGIP-----DHSSSWLKKWPSTSPLPPIHYKKPRTLQQESTSEAQFLDEAV 2775 + P + SS P SS+WL KWP+TSP P H RT+ ES +E ++ DE Sbjct: 51 RKPYRDSNSSSTPVKSNNSRSSTWLNKWPNTSP-PVKHSSNSRTV--ESKTETRYFDENT 107 Query: 2774 KPPTSAIDRIVLRLRNXXXXXXXXXXXXXXXXXXXXXXXXXXE-------KLGDLLKRDW 2616 + T+AIDRIVLRLRN KLGDLLKRDW Sbjct: 108 RVGTTAIDRIVLRLRNLGLGSDDEGEGEDEEEGNLKLDSSSTMQVNGEEEKLGDLLKRDW 167 Query: 2615 VRPDKXXXXXXXXXXXXXLPWERSAEEGAIKDDKEEGSRTKRRSVKAPTLAELTIEDXXX 2436 VRPD PWERS EE A+ E R +R+VKAP+LAELTIED Sbjct: 168 VRPDMILEESDDEGDTYL-PWERSVEEEAV-----EVQRGGKRTVKAPSLAELTIEDEEL 221 Query: 2435 XXXXXXXXXXXXRINVPKAGVTKEVLEKIHDKWRKNELVRLKFHEDLAHDMRTGHEIVER 2256 RINVPKAGVT VLEKIH WRKNELVRLKFHE LAHDMRTGHEIVER Sbjct: 222 RRLRRMGMTLRERINVPKAGVTGAVLEKIHHSWRKNELVRLKFHEVLAHDMRTGHEIVER 281 Query: 2255 RTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVNREGDALFVPIVSSSGNLLEKNSSSSD 2076 RT GLVIWR+GSVMVV+RG+NYEGP SS++QSVN E +ALFVP VSS ++ + N S + Sbjct: 282 RTRGLVIWRAGSVMVVYRGSNYEGP-SSRSQSVNEEDNALFVPDVSSDKSITKDNKSFNP 340 Query: 2075 NIDETRAPVVPNRTESMTAEEAEFNSLLDGLGPRFEEWWGTGILPVDADLLPQKVPGYKT 1896 I E R V PN +SMT EE+EFN +LDGLGPRFE+WWGTG+LPVDADLLPQ +PGYKT Sbjct: 341 VI-ENRNQVHPNSVQSMTVEESEFNRVLDGLGPRFEDWWGTGVLPVDADLLPQTIPGYKT 399 Query: 1895 PFRLLPTGMRPRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIIKLWEKSLVVKIA 1716 PFRLLPTGMR RLTNAEMT+LRK+AKSLPCHFALGRNRNHQGLAAAI+KLWEKSLVVKIA Sbjct: 400 PFRLLPTGMRSRLTNAEMTNLRKIAKSLPCHFALGRNRNHQGLAAAIVKLWEKSLVVKIA 459 Query: 1715 
VKRGIQNTNNKLMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVAAALTERQEMTKQT 1536 VKRGIQNTNNKLM+EELK LTGGVLLLRNKYYI+ YRGKDF+PPTVAA L ERQE+TKQ Sbjct: 460 VKRGIQNTNNKLMSEELKMLTGGVLLLRNKYYIIFYRGKDFVPPTVAAVLAERQELTKQI 519 Query: 1535 QDAEEKVRGVPIEPVATIGEGEALAGTLAEFYEAQARWGREISVEEREKMKEEASRAKTA 1356 QD EE+ R P + +G+A+AG+LAEFYEAQARWGREIS EERE+M +EA+ AKTA Sbjct: 520 QDVEEQTRSGPAKVAPLTTDGQAVAGSLAEFYEAQARWGREISAEERERMLKEAAMAKTA 579 Query: 1355 RVVKRLEHKLAISQTKKLKAEKLLDKIISSWVPVGPDDDQETITDEERAMFRRVGLRMKA 1176 RVVKRLEHK ISQTKKLKAEK+L KI+ SW+P GP DD ETIT+EER M RRVGLRMK+ Sbjct: 580 RVVKRLEHKFEISQTKKLKAEKILAKIVESWIPAGPSDDLETITEEERVMLRRVGLRMKS 639 Query: 1175 YLPLGIRGVFDGVIENMHLHWKHRELVKLLSKDKEIAFVEETARLLEYESGGILVAIERV 996 YLPLGIRGVFDGVIENMHLHWKHRELVKL+SK+K +AFVEETARLLEYESGGILVAIERV Sbjct: 640 YLPLGIRGVFDGVIENMHLHWKHRELVKLISKEKVLAFVEETARLLEYESGGILVAIERV 699 Query: 995 PKGFVLIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALSQHIDELEKTIEQTRK 816 PKG+ LIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALSQHI ELE TIEQT+ Sbjct: 700 PKGYALIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALSQHIAELETTIEQTKS 759 Query: 815 DIDD-------PKVVESGAQFNNVSE-FSESEDENSQMGSD 717 I D +E+ QFN+VSE SE ED + + G D Sbjct: 760 KIVDFGKADINTSNLEALDQFNHVSESLSEDEDSSLESGDD 800 >ref|XP_004235759.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like [Solanum lycopersicum] Length = 820 Score = 952 bits (2462), Expect = 0.0 Identities = 522/817 (63%), Positives = 597/817 (73%), Gaps = 19/817 (2%) Frame = -1 Query: 3110 ISLKASRMVMATAKLYSEMXXXXXXXXLSIPSRKSSFSVFFLKPFSSSLHPTTKLPRKAT 2931 ++L ++ T +L+S S P+ + F +F + ++ + T +PRK Sbjct: 1 MALSTAKFTQLTPQLFSSF---------STPTDRPPFFLFLRRTITAG-NTRTNIPRKDN 50 Query: 2930 QVP---NFSSGIP-----DHSSSWLKKWPSTSPLPPIHYKKPRTLQQESTSEAQFLDEAV 2775 + P + SS P SS+WL KWP+TS P H RT+ ES +E ++ DE Sbjct: 51 RKPYRDSNSSSTPVKSNNSRSSTWLNKWPNTSS-PVKHSSNSRTV--ESKTETRYFDENT 107 Query: 2774 KPPTSAIDRIVLRLRNXXXXXXXXXXXXXXXXXXXXXXXXXXE-------KLGDLLKRDW 2616 + T+AIDRIVLRLRN KLGDLLKRDW Sbjct: 108 
RVGTTAIDRIVLRLRNLGLGSDDEGEGEDEEEGNLKLDSSSTMQVNGEEEKLGDLLKRDW 167 Query: 2615 VRPDKXXXXXXXXXXXXXLPWERSAEEGAIKDDKEEGSRTKRRSVKAPTLAELTIEDXXX 2436 VRPD PWERS EE A+ E R +R+V+AP+LAELTIED Sbjct: 168 VRPDMILEESDDEGDTYL-PWERSVEEEAV-----EVQRGGKRTVRAPSLAELTIEDEEL 221 Query: 2435 XXXXXXXXXXXXRINVPKAGVTKEVLEKIHDKWRKNELVRLKFHEDLAHDMRTGHEIVER 2256 RINVPKAGVT VLEKIH WRKNELVRLKFHE LAHDMRTGHEIVER Sbjct: 222 RRLRRIGMTLRERINVPKAGVTGAVLEKIHHSWRKNELVRLKFHEVLAHDMRTGHEIVER 281 Query: 2255 RTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVNREGDALFVPIVSSSGNLLEKNSSSSD 2076 RT GLVIWR+GSVMVV+RG+NYEGP SS++QSVN E +ALFVP VSS ++ + N S + Sbjct: 282 RTKGLVIWRAGSVMVVYRGSNYEGP-SSRSQSVNEEDNALFVPDVSSDKSITKDNKSFNP 340 Query: 2075 NIDETRAPVVPNRTESMTAEEAEFNSLLDGLGPRFEEWWGTGILPVDADLLPQKVPGYKT 1896 I E R V PNR +SMT EE+EFN +LDGLGPRFE+WWGTG+LPVDADLLPQ +PGYKT Sbjct: 341 VI-ENRNQVHPNRVQSMTEEESEFNRVLDGLGPRFEDWWGTGVLPVDADLLPQTIPGYKT 399 Query: 1895 PFRLLPTGMRPRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIIKLWEKSLVVKIA 1716 PFRLLPTGMR RLTNAEMT+LRK+AKSLPCHFALGRNRNHQGLAAAI+KLWEKSLVVKIA Sbjct: 400 PFRLLPTGMRSRLTNAEMTNLRKIAKSLPCHFALGRNRNHQGLAAAIVKLWEKSLVVKIA 459 Query: 1715 VKRGIQNTNNKLMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVAAALTERQEMTKQT 1536 VKRGIQNTNNKLM+EELK LTGGVLLLRNKYYI+ YRGKDF+PPTVAA L ERQE+TKQ Sbjct: 460 VKRGIQNTNNKLMSEELKMLTGGVLLLRNKYYIIFYRGKDFVPPTVAAVLAERQELTKQI 519 Query: 1535 QDAEEKVRGVPIEPVATIGEGEALAGTLAEFYEAQARWGREISVEEREKMKEEASRAKTA 1356 QD EE+ R P + I +G+A+AG+LAEFYEAQARWGREIS EERE+M +EA+ AK A Sbjct: 520 QDVEEQTRSGPAKVAPLITDGQAVAGSLAEFYEAQARWGREISAEERERMLKEAAMAKMA 579 Query: 1355 RVVKRLEHKLAISQTKKLKAEKLLDKIISSWVPVGPDDDQETITDEERAMFRRVGLRMKA 1176 RVVKRLEHK ISQTKKLKAEK+L KI+ SW+P GP DD ETIT+EER M RRVGLRMK+ Sbjct: 580 RVVKRLEHKFEISQTKKLKAEKILAKIVESWIPAGPSDDLETITEEERVMLRRVGLRMKS 639 Query: 1175 YLPLGIRGVFDGVIENMHLHWKHRELVKLLSKDKEIAFVEETARLLEYESGGILVAIERV 996 YLPLGIRGVFDGVIENMHLHWKHRELVKL+SK+K +AFVEETARLLEYESGGILVAIERV Sbjct: 640 YLPLGIRGVFDGVIENMHLHWKHRELVKLISKEKVLAFVEETARLLEYESGGILVAIERV 699 Query: 995 
PKGFVLIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALSQHIDELEKTIEQTRK 816 PKG+ LIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALSQHI ELE TIEQT+ Sbjct: 700 PKGYALIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALSQHIGELETTIEQTKS 759 Query: 815 ---DIDDPKVVESGAQFNNVSE-FSESEDENSQMGSD 717 D D +E QFN+VSE SE ED + + G D Sbjct: 760 KIVDFGDTSNLEVLDQFNHVSESLSEDEDSSLESGDD 796 >gb|EOY21034.1| CRS1 / YhbY domain-containing protein [Theobroma cacao] Length = 919 Score = 914 bits (2361), Expect = 0.0 Identities = 518/835 (62%), Positives = 597/835 (71%), Gaps = 37/835 (4%) Frame = -1 Query: 3110 ISLKASRMVMATAKLYSEMXXXXXXXXLSIPSRKSSFSVF-----------FLKPFSS-- 2970 + ++ RM AT K ++EM S SS S+ F +PFSS Sbjct: 59 LKIQIKRMAFATTK-FTEMPLRTSLPFASYSYSYSSSSLNLFFSAPKPSFRFFRPFSSLR 117 Query: 2969 -SLHPTTKLPR-------KATQVPNFSSGIPDHSSSWLKKWPSTSPLPPIHYKKPRTLQQ 2814 P++K R +A+ PN S+ SSS L+ W S S + +Q Sbjct: 118 TGNSPSSKFNRYSYPWDQEASVPPNSSA-----SSSSLQAWSSPSQ---------KVIQS 163 Query: 2813 ESTS----EAQFLDEAVKPPTSAIDRIVLRLRNXXXXXXXXXXXXXXXXXXXXXXXXXXE 2646 + E ++ D SAI+RIVLRLRN E Sbjct: 164 DGDDKTDVETRYFDR--DKSQSAIERIVLRLRNLGLGSDDEDEGEDETDQYNSTPVTGEE 221 Query: 2645 KLGDLLKRDWVRPDKXXXXXXXXXXXXXLPWERSAEEGAIKDDKEEGSRTKRRSVKAPTL 2466 +LGDLLKR+WVRPD PWER +E ++ KE K+R V+APTL Sbjct: 222 RLGDLLKREWVRPDTMLIEREKEEAVL--PWER--DEAEVEVVKEGVLGVKKRRVRAPTL 277 Query: 2465 AELTIEDXXXXXXXXXXXXXXXRINVPKAGVTKEVLEKIHDKWRKNELVRLKFHEDLAHD 2286 AELTIED RINVPKAG+T+ VLEKIHDKWRK ELVRLKFHE LA D Sbjct: 278 AELTIEDEELRRLRRMGMYLRERINVPKAGITQAVLEKIHDKWRKEELVRLKFHEVLATD 337 Query: 2285 MRTGHEIVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVNREGDALFVPIVSSSGN 2106 M+T HEIVERRTGGLV+WRSGSVMVV+RG+NYEGP S++QS++REG+ALF+P VSS+ N Sbjct: 338 MKTAHEIVERRTGGLVLWRSGSVMVVYRGSNYEGP--SRSQSIDREGEALFIPDVSSASN 395 Query: 2105 LLEKNSSSSDNIDETRAPVV--PNRTESMTAEEAEFNSLLDGLGPRFEEWWGTGILPVDA 1932 + + + + E PVV P R+ESMT EEAE+NSLLDG+GPRF EWWGTG+LPVDA Sbjct: 396 AVRGSETGKTSTPEKCEPVVVKPERSESMTEEEAEYNSLLDGVGPRFVEWWGTGVLPVDA 455 Query: 1931 DLLPQKVPGYKTPFRLLPTGMRPRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAII 1752 
DLLPQK+PGYKTPFRLLP GMRPRLTNAEMT+LRKLAKSLPCHFALGRNRNHQGLAAAII Sbjct: 456 DLLPQKIPGYKTPFRLLPAGMRPRLTNAEMTNLRKLAKSLPCHFALGRNRNHQGLAAAII 515 Query: 1751 KLWEKSLVVKIAVKRGIQNTNNKLMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVAA 1572 KLWEKSLVVKIAVKRGIQNTNNKLMAEELK LTGGVLLLRNKY+IV+YRGKDFLP +VAA Sbjct: 516 KLWEKSLVVKIAVKRGIQNTNNKLMAEELKNLTGGVLLLRNKYFIVIYRGKDFLPTSVAA 575 Query: 1571 ALTERQEMTKQTQDAEEKVRGVPIEPVATIGE--GEALAGTLAEFYEAQARWGREISVEE 1398 AL ERQE+TKQ QD EEKVR +EP A GE GEA AGTLAEFYEAQA WGREIS EE Sbjct: 576 ALAERQELTKQIQDVEEKVRIRAVEP-AQSGEDKGEAPAGTLAEFYEAQACWGREISAEE 634 Query: 1397 REKMKEEASRAKTARVVKRLEHKLAISQTKKLKAEKLLDKIISSWVPVGPDDDQETITDE 1218 REKM EEAS+AK AR+VKR+EHKLA++Q KKL+AE+LL KI SS +P PD DQETITDE Sbjct: 635 REKMIEEASKAKHARLVKRVEHKLAVAQAKKLRAERLLAKIESSMIPAAPDYDQETITDE 694 Query: 1217 ERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKDKEIAFVEETARLL 1038 ER MFRRVGLRMK YLPLGIRGVFDGVIENMHLHWKHRELVKL+SK K +AFVE+TARLL Sbjct: 695 ERVMFRRVGLRMKPYLPLGIRGVFDGVIENMHLHWKHRELVKLISKQKTLAFVEDTARLL 754 Query: 1037 EYESGGILVAIERVPKGFVLIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALSQ 858 E+ESGGILVAIERVPKG+ LI+YRGKNY RPISLRPRNLLTKAKALKR VA+QR+EALSQ Sbjct: 755 EFESGGILVAIERVPKGYALIYYRGKNYHRPISLRPRNLLTKAKALKRSVAMQRHEALSQ 814 Query: 857 HIDELEKTIEQTRKDI--------DDPKVVESGAQFNNVSEFSESEDENSQMGSD 717 HI ELE+TIE+ +K+I +D +V QF+ VSE ++SEDE S M SD Sbjct: 815 HISELERTIEEMKKEIGASQDVEDEDSQVSGEHGQFDPVSELTQSEDEASYMASD 869 >emb|CBI15459.3| unnamed protein product [Vitis vinifera] Length = 830 Score = 900 bits (2326), Expect = 0.0 Identities = 506/816 (62%), Positives = 581/816 (71%), Gaps = 25/816 (3%) Frame = -1 Query: 3089 MVMATAKLYSEMXXXXXXXXLSIPSRKSSFSVFFLKPFSS----------------SLHP 2958 M ATAKL +E L K+ S+ LKPFSS SL+P Sbjct: 1 MAFATAKL-TEFPFTSHSSSLHFLFPKTPLSL--LKPFSSLRTTDSNNLRNRKTKRSLYP 57 Query: 2957 TTKLPRKATQVPNFSSGIPDHSSSWLKKWPSTSPLPPIHYKKPRTLQQESTSEAQFLDEA 2778 + + N +S + SW+ KWPS +P +K + ++ T E+++ D Sbjct: 58 WDHQNSRKSSNTNPNSS----TKSWINKWPSPNPSIESEHKGIDSKGRDGT-ESRYFDG- 111 Query: 2777 
VKPPTSAIDRIVLRLRNXXXXXXXXXXXXXXXXXXXXXXXXXXEKLGDLLKRDWVRPDKX 2598 + TSAI+RIVLRLRN EKLGDLL+RDWVRPD Sbjct: 112 -RSGTSAIERIVLRLRNLGLGSDDEDKNEGEVESGDTMPVTGDEKLGDLLQRDWVRPDSM 170 Query: 2597 XXXXXXXXXXXXLPWERSAEEGAIKDDKEEGSRTKRRSVKAPTLAELTIEDXXXXXXXXX 2418 PWER E + ++E R KRR+V+APTLAELTIED Sbjct: 171 LIEDEDEDDMIL-PWERGEE----RQEEEGDGRLKRRAVRAPTLAELTIEDEELRRLRRL 225 Query: 2417 XXXXXXRINVPKAGVTKEVLEKIHDKWRKNELVRLKFHEDLAHDMRTGHEIVERRTGGLV 2238 RINVPKAG+T+ VL KIH+KWRK ELVRLKFHE LAHDM+T HEIVERRTGGLV Sbjct: 226 GMTIRERINVPKAGITQAVLGKIHEKWRKEELVRLKFHEALAHDMKTAHEIVERRTGGLV 285 Query: 2237 IWRSGSVMVVFRGTNYEGPLSSQAQSVNREGDALFVPIVSSSGNLLEKNSSSSD-NIDET 2061 WRSGSVMVVFRGTNYEGP + Q V+ EGD+LFVP VSS N +N ++ +++ Sbjct: 286 TWRSGSVMVVFRGTNYEGP--PKPQPVDGEGDSLFVPDVSSVDNPAMRNDNNGGPTLEKG 343 Query: 2060 RAPVV-PNRTESMTAEEAEFNSLLDGLGPRFEEWWGTGILPVDADLLPQKVPGYKTPFRL 1884 PV P E+MT EEAE+NSLLDGLGPRF +WWGTG+LPVD DLLPQ +PGYKTP R+ Sbjct: 344 SLPVRNPVHAENMTEEEAEYNSLLDGLGPRFVDWWGTGVLPVDGDLLPQSIPGYKTPLRI 403 Query: 1883 LPTGMRPRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIIKLWEKSLVVKIAVKRG 1704 LPTGMRPRLTNAEMT+LRKLAKSLPCHFALGRNRNHQGLAAAIIKLWEKS+VVKIAVK G Sbjct: 404 LPTGMRPRLTNAEMTNLRKLAKSLPCHFALGRNRNHQGLAAAIIKLWEKSIVVKIAVKPG 463 Query: 1703 IQNTNNKLMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVAAALTERQEMTKQTQDAE 1524 IQNTNNKLMAEE+K LTGGVLLLRNKYYIV+YRGKDFLP +VAAAL+ER+E+TK Q E Sbjct: 464 IQNTNNKLMAEEIKNLTGGVLLLRNKYYIVIYRGKDFLPTSVAAALSEREELTKHIQVVE 523 Query: 1523 EKVR--GVPIEPVATIGEGEALAGTLAEFYEAQARWGREISVEEREKMKEEASRAKTARV 1350 EKVR G P G G+ LAGTLAEFYEAQARWGREIS EE EKM EEASRAK+ARV Sbjct: 524 EKVRTGGAEAIPSGEDGVGQPLAGTLAEFYEAQARWGREISAEEHEKMIEEASRAKSARV 583 Query: 1349 VKRLEHKLAISQTKKLKAEKLLDKIISSWVPVGPDDDQETITDEERAMFRRVGLRMKAYL 1170 VKR+EHKLA++Q KKL+AE+LL KI +S +P GP DDQETITDEER MFRR+GLRMKAYL Sbjct: 584 VKRIEHKLALAQAKKLRAERLLAKIEASMIPAGPSDDQETITDEERFMFRRLGLRMKAYL 643 Query: 1169 PLGIRGVFDGVIENMHLHWKHRELVKLLSKDKEIAFVEETARLLEYESGGILVAIERVPK 990 LG+RGVFDGVIENMHLHWKHRELVKL+SK K +AFVE+TARLLEYESGGILVAIERVPK Sbjct: 644 
LLGVRGVFDGVIENMHLHWKHRELVKLISKQKTLAFVEDTARLLEYESGGILVAIERVPK 703 Query: 989 GFVLIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALSQHIDELEKTIEQTRKDI 810 G+ LI+YRGKNYRRP+SLRPRNLLTKAKALKR VA+QR+EALSQHI ELE+TIEQ + +I Sbjct: 704 GYALIYYRGKNYRRPVSLRPRNLLTKAKALKRSVAMQRHEALSQHISELERTIEQMKMEI 763 Query: 809 DDPKVVE-----SGAQFNNVSEFSESEDENSQMGSD 717 D K E S + SESEDE S M SD Sbjct: 764 GDSKDAEDKDSWSTEGHGQFDQVSESEDEASGMDSD 799 >emb|CAN72582.1| hypothetical protein VITISV_035294 [Vitis vinifera] Length = 850 Score = 894 bits (2311), Expect = 0.0 Identities = 508/836 (60%), Positives = 584/836 (69%), Gaps = 45/836 (5%) Frame = -1 Query: 3089 MVMATAKLYSEMXXXXXXXXLSIPSRKSSFSVFFLKPFSS----------------SLHP 2958 M ATAKL +E L K+ S+ LKPFSS SL+P Sbjct: 1 MAFATAKL-TEFPFTSHSSSLHFLFPKTPLSL--LKPFSSLRTTDSNNLRNRKTKRSLYP 57 Query: 2957 TTKLPRKATQVPNFSSGIPDHSSSWLKKWPSTSPLPPIHYKKPRTLQQESTSEAQFLDEA 2778 + + N +S + SW+ KWPS +P +K + ++ T E+++ D Sbjct: 58 WDHQNSRKSSNTNPNSS----TKSWINKWPSPNPSIESEHKGIDSKGRDGT-ESRYFDG- 111 Query: 2777 VKPPTSAIDRIVLRLRNXXXXXXXXXXXXXXXXXXXXXXXXXXEKLGDLLKRDWVRPDKX 2598 + TSAI+RIVLRLRN EKLGDLL+RDWVRPD Sbjct: 112 -RSGTSAIERIVLRLRNLGLGSDDEDKNEGEVESGDTMPVTGDEKLGDLLQRDWVRPDSM 170 Query: 2597 XXXXXXXXXXXXLPWERSAEEGAIKDDKEEGSRTKRRSVKAPTLAELTIEDXXXXXXXXX 2418 PWER E + ++E R KRR+V+APTLAELTIED Sbjct: 171 LIEDEDEDDMIL-PWERGEE----RQEEEGDGRLKRRAVRAPTLAELTIEDEELRRLRRL 225 Query: 2417 XXXXXXRINVPKAGVTKEVLEKIHDKWRKNELVRLKFHEDLAHDMRTGHEIVERRTGGLV 2238 RINVPKAG+T+ VL KIH+KWRK ELVRLKFHE LAHDM+T HEIVERRTGGLV Sbjct: 226 GMTIRERINVPKAGITQAVLGKIHEKWRKEELVRLKFHEALAHDMKTAHEIVERRTGGLV 285 Query: 2237 IWRSGSVMVVFRGTNYEGPLSSQAQSVNREGDALFVPIVSSSGNLLEKNSSSSD-NIDET 2061 WRSGSVMVVFRGTNYEGP + Q V+ EGD+LFVP VSS N +N ++ +++ Sbjct: 286 TWRSGSVMVVFRGTNYEGP--PKPQPVDGEGDSLFVPDVSSVDNPAMRNDNNGGPTLEKG 343 Query: 2060 RAPVV-PNRTESMTAEEAEFNSLLDGLGPRFEEWWGTGILPVDADLLPQKVPGYKTPFRL 1884 PV P E+MT EEAE+NSLLDGLGPRF +WWGTG+LPVD DLLPQ +PGYKTP R+ Sbjct: 344 SLPVRNPVHAENMTEEEAEYNSLLDGLGPRFVDWWGTGVLPVDGDLLPQSIPGYKTPLRI 403 Query: 1883 
LPTGMRPRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIIKLWEKSLVVKIAVKRG 1704 LPTGMRPRLTNAEMT+LRKLAKSLPCHFALGRNRNHQGLAAAIIKLWEKS+VVKIAVK G Sbjct: 404 LPTGMRPRLTNAEMTNLRKLAKSLPCHFALGRNRNHQGLAAAIIKLWEKSIVVKIAVKPG 463 Query: 1703 IQNTNNKLMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVAAALTERQEMTKQTQDAE 1524 IQNTNNKLMAEE+K LTGGVLLLRNKYYIV+YRGKDFLP +VAAAL+ER+E+TK Q E Sbjct: 464 IQNTNNKLMAEEIKNLTGGVLLLRNKYYIVIYRGKDFLPTSVAAALSEREELTKHIQVVE 523 Query: 1523 EKVR--GVPIEPVATIGEGEALAGTLAEFYEAQARWGREISVEEREKMKEEASRAKTARV 1350 EKVR G P G G+ LAGTLAEFYEAQARWGREIS EE EKM EEASRAK+ARV Sbjct: 524 EKVRTGGAEAIPSGEDGVGQPLAGTLAEFYEAQARWGREISAEEHEKMIEEASRAKSARV 583 Query: 1349 VKRLEHKLAISQTKKLKAEKLLDKIISSWVPVGPDDDQETITDEERAMFRRVGLRMKAYL 1170 VKR+EHKLA++Q KKL+ E+LL KI +S +P GP DDQETITDEER MFRR+GLRMKAYL Sbjct: 584 VKRIEHKLALAQAKKLRPERLLAKIEASMIPAGPSDDQETITDEERFMFRRLGLRMKAYL 643 Query: 1169 PLGIRGVFDGVIENMHLHWKHRELVKLLSKDKEIAFVEETARLLEYESGGILVAIERVPK 990 LG+RGVFDGVIENMHLHWKHRELVKL+SK K +AFVE+TARLLEYESGGILVAIERVPK Sbjct: 644 LLGVRGVFDGVIENMHLHWKHRELVKLISKQKTLAFVEDTARLLEYESGGILVAIERVPK 703 Query: 989 GFVLIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALSQHIDELEKTIEQTRKDI 810 G+ LI+YRGKNYRRP+SLRPRNLLTKAKALKR VA+QR+EALSQHI ELE+TIEQ + +I Sbjct: 704 GYALIYYRGKNYRRPVSLRPRNLLTKAKALKRSVAMQRHEALSQHISELERTIEQMKMEI 763 Query: 809 DDPK--------VVESGAQFNNVSEFSE-----------------SEDENSQMGSD 717 D K E QF+ VSE S+ SEDE S M SD Sbjct: 764 GDSKDAEDKDSWSTEGHGQFDQVSEVSKVRYSVFCCQIFLVASILSEDEASGMDSD 819 >gb|EMJ12507.1| hypothetical protein PRUPE_ppa001468mg [Prunus persica] Length = 820 Score = 889 bits (2298), Expect = 0.0 Identities = 503/820 (61%), Positives = 579/820 (70%), Gaps = 29/820 (3%) Frame = -1 Query: 3089 MVMATAKLYSEMXXXXXXXXLSIPSRKSSFSVFFLKPFSSSLHPTTKLPRKATQVPNFSS 2910 M TAK+ SEM S S +F KP L P + L KAT+ + Sbjct: 1 MAFTTAKI-SEMPLRSSLPLTSHSSSSLNFLFSASKPSFRLLKPFSSL--KATEHSGNPN 57 Query: 2909 GIPDHSSS-----WLKKWPSTSPLPPIHYKKPRTLQQESTSEAQFLDEAVKPPT------ 2763 P H S WL WP + + +K E +E+ D+AVK T Sbjct: 58 
AKPSHKSKPPSAPWLNTWPPRNSPAELPCQKVN----EKVNESHGRDQAVKANTTRYFDK 113 Query: 2762 ----SAIDRIVLRLRNXXXXXXXXXXXXXXXXXXXXXXXXXXE---KLGDLLKRDWVRPD 2604 SAI+RIVLRLRN KLGDLL+R+WVRPD Sbjct: 114 NKGQSAIERIVLRLRNLGLGSDDEEEDDGLGLDGQDSMQPAESGEEKLGDLLQREWVRPD 173 Query: 2603 KXXXXXXXXXXXXXLPWERSAEEGAIKDDKEEGSRTKRRSVKAPTLAELTIEDXXXXXXX 2424 LPWE+ E ++EE ++R VKAP+LAELTIED Sbjct: 174 -YVLAEQKSNDEVALPWEKEDEIS----EEEEVKGLRKRRVKAPSLAELTIEDEELKRLR 228 Query: 2423 XXXXXXXXRINVPKAGVTKEVLEKIHDKWRKNELVRLKFHEDLAHDMRTGHEIVERRTGG 2244 RI+VPKAG+T+ VLEKIHD WRK ELVRLKFHE LA DM+T HEIVERRTGG Sbjct: 229 RMGMVLRERISVPKAGITQAVLEKIHDTWRKEELVRLKFHEVLALDMKTAHEIVERRTGG 288 Query: 2243 LVIWRSGSVMVVFRGTNYEGPLSSQAQSVNREGDALFVPIVSSSGNLLEK--NSSSSDNI 2070 LV+WRSGSVMVV+RG+NY+GP S++Q+V+REG ALF+P VSS+ + N ++S Sbjct: 289 LVLWRSGSVMVVYRGSNYKGP--SKSQTVDREGGALFIPDVSSAETSATRSGNDATSGPD 346 Query: 2069 DETRAPVVPNRTESMTAEEAEFNSLLDGLGPRFEEWWGTGILPVDADLLPQKVPGYKTPF 1890 + +A +P +MT EEAEFNSLLD LGPRF EWWGTG+LPVDADLLP+ +PGYKTPF Sbjct: 347 NNEKAVKIPAHLPNMTEEEAEFNSLLDDLGPRFVEWWGTGVLPVDADLLPKTIPGYKTPF 406 Query: 1889 RLLPTGMRPRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIIKLWEKSLVVKIAVK 1710 RLLPTGMR RLTNAEMT+LRKLAKSLPCHFALGRNRNHQGLA+AIIKLWEKS V KIAVK Sbjct: 407 RLLPTGMRSRLTNAEMTNLRKLAKSLPCHFALGRNRNHQGLASAIIKLWEKSSVAKIAVK 466 Query: 1709 RGIQNTNNKLMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVAAALTERQEMTKQTQD 1530 RGIQNTNNKLMAEELKTLTGGVLLLRNKYYIV YRGKDFLP +VAAAL ERQE+TKQ QD Sbjct: 467 RGIQNTNNKLMAEELKTLTGGVLLLRNKYYIVFYRGKDFLPTSVAAALAERQELTKQVQD 526 Query: 1529 AEEKVRGVPIEPVAT-IGEGEALAGTLAEFYEAQARWGREISVEEREKMKEEASRAKTAR 1353 EEK+R I+ ++ EG+ALAGTLAEFYEAQARWGREIS EEREKM EE S+AK AR Sbjct: 527 VEEKMRIKAIDAASSGAEEGQALAGTLAEFYEAQARWGREISAEEREKMIEEDSKAKNAR 586 Query: 1352 VVKRLEHKLAISQTKKLKAEKLLDKIISSWVPVGPDDDQETITDEERAMFRRVGLRMKAY 1173 +VKR+EHKL ++Q KKL+AEKLL KI SS +P GPD DQET+TDEER MFRRVGLRMKAY Sbjct: 587 LVKRIEHKLGVAQAKKLRAEKLLSKIESSMLPAGPDYDQETVTDEERVMFRRVGLRMKAY 646 Query: 1172 LPLGIRGVFDGVIENMHLHWKHRELVKLLSKDKEIAFVEETARLLEYESGGILVAIERVP 993 
LPLGIRGVFDGV+ENMHLHWKHRELVKL+SK K +AFVE+TARLLE+ESGGILVAIERVP Sbjct: 647 LPLGIRGVFDGVVENMHLHWKHRELVKLISKQKTLAFVEDTARLLEFESGGILVAIERVP 706 Query: 992 KGFVLIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALSQHIDELEKTIEQ---- 825 KG+ LI+YRGKNY+RPI+LRPRNLLTKAKALKR VA+QR+EALSQHI ELEKTIEQ Sbjct: 707 KGYALIYYRGKNYQRPITLRPRNLLTKAKALKRSVAIQRHEALSQHISELEKTIEQMSSE 766 Query: 824 --TRKDIDDPKVVES--GAQFNNVSEFSESEDENSQMGSD 717 +DI D S Q + SEF +SEDE S+MGSD Sbjct: 767 IGVSEDIADESTWSSRDPDQIHGASEFVQSEDEASRMGSD 806 >ref|XP_004300521.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 820 Score = 871 bits (2251), Expect = 0.0 Identities = 497/821 (60%), Positives = 588/821 (71%), Gaps = 30/821 (3%) Frame = -1 Query: 3089 MVMATAKLYSEMXXXXXXXXLS-IPSR-----KSSFSVFFLKPFSS---SLHPTTKLPRK 2937 M ATAK+ SEM S PS K SF + LKPFS+ + H R Sbjct: 1 MAFATAKI-SEMPLRNSLPLTSHSPSSLHLLLKPSFRI--LKPFSALRTTEHGGNPNARH 57 Query: 2936 ATQVPNFSSGIPDHSSSWLKKWPSTSPLP--------PIHYKKPRTLQQESTSEAQFLDE 2781 ++ + SS P WL KWPS P K+ ++ S++ A+++D+ Sbjct: 58 KSKPSSSSSTAP-----WLNKWPSRGQAPAEPPRQKFSDRVKESDGREKPSSNAARYVDK 112 Query: 2780 AVKPPTSAIDRIVLRLRN-XXXXXXXXXXXXXXXXXXXXXXXXXXEKLGDLLKRDWVRPD 2604 SAI+RIV RLRN EKLGDLL+R+WVRPD Sbjct: 113 --DKGQSAIERIVFRLRNLGLGDDEEEEESGDGVELDSMPAASGAEKLGDLLQREWVRPD 170 Query: 2603 KXXXXXXXXXXXXXLPWERSAEEGAIKDDKEEGSRTKRRSVKAPTLAELTIEDXXXXXXX 2424 LPWE+ EE +D++ +G R RRS KAP+LAELTIED Sbjct: 171 -YILAEEKGDDDVALPWEKEEEE-LSEDEEVKGMRKARRS-KAPSLAELTIEDEELRRLR 227 Query: 2423 XXXXXXXXRINVPKAGVTKEVLEKIHDKWRKNELVRLKFHEDLAHDMRTGHEIVERRTGG 2244 RI+VPKAG+T+ VLEKIHDKWRK ELVRLKFHE LAHDM+T HEIVERRTGG Sbjct: 228 RLGMVLRERISVPKAGITQAVLEKIHDKWRKEELVRLKFHEVLAHDMKTAHEIVERRTGG 287 Query: 2243 LVIWRSGSVMVVFRGTNYEGPLSSQAQSVNREGDALFVPIVSSSGNLLEK---NSSSSDN 2073 LV+WRSGSVMVV+RG+NY+GP S+++ R GDALF+P VSS+ + + +++S+ + Sbjct: 288 LVLWRSGSVMVVYRGSNYKGP--SKSEPAGRGGDALFIPDVSSAETSVTRGGNDATSAPD 345 Query: 2072 
IDETRAPVVPNRTESMTAEEAEFNSLLDGLGPRFEEWWGTGILPVDADLLPQKVPGYKTP 1893 E + + MT EEAEFNSLLD LGPRF E+WGTGILPVDADLLP+ +PGYKTP Sbjct: 346 KTEQAVKIPEPLPKKMTDEEAEFNSLLDELGPRFVEYWGTGILPVDADLLPKTIPGYKTP 405 Query: 1892 FRLLPTGMRPRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIIKLWEKSLVVKIAV 1713 FRLLPTGMR RLTNAEMT+LRKLAKS+PCHFALGRNRNHQGLA+AI+K+WEKS V KIAV Sbjct: 406 FRLLPTGMRSRLTNAEMTNLRKLAKSIPCHFALGRNRNHQGLASAILKVWEKSSVAKIAV 465 Query: 1712 KRGIQNTNNKLMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVAAALTERQEMTKQTQ 1533 KRGIQNTNNK+MAEELK LTGGVLLLRNKYYIV+YRGKDF+P TVA AL ERQE+TKQ Q Sbjct: 466 KRGIQNTNNKIMAEELKALTGGVLLLRNKYYIVIYRGKDFVPTTVATALAERQELTKQVQ 525 Query: 1532 DAEEKVRGVPIEPVA-TIGEGEALAGTLAEFYEAQARWGREISVEEREKMKEEASRAKTA 1356 D EE VR PI+ A + EG+ALAGTLAEFYEAQARWGREIS EER+KM EE S+AK A Sbjct: 526 DVEEIVRIKPIDAAASSTEEGQALAGTLAEFYEAQARWGREISAEERKKMIEEDSKAKMA 585 Query: 1355 RVVKRLEHKLAISQTKKLKAEKLLDKIISSWVPVGPDDDQETITDEERAMFRRVGLRMKA 1176 R KR+EHKL ++Q KKL+AE LL+KI S+ +P GPD DQETITDEER MFRRVGLRMKA Sbjct: 586 RRAKRIEHKLGVAQAKKLRAESLLNKIESAMLPAGPDYDQETITDEERVMFRRVGLRMKA 645 Query: 1175 YLPLGIRGVFDGVIENMHLHWKHRELVKLLSKDKEIAFVEETARLLEYESGGILVAIERV 996 YLPLGIRGVFDGVIENMHLHWKHRELVKL+SK K +AFVE++ARLLEYESGGILVAIERV Sbjct: 646 YLPLGIRGVFDGVIENMHLHWKHRELVKLISKQKTLAFVEDSARLLEYESGGILVAIERV 705 Query: 995 PKGFVLIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALSQHIDELEKTIEQTRK 816 PKG+ LI+YRGKNY+RPI+LRPRNLLTKAKALKR VA+QR+EALSQHI+ELE+TIEQ R Sbjct: 706 PKGYALIYYRGKNYQRPITLRPRNLLTKAKALKRSVAMQRHEALSQHIEELERTIEQMRS 765 Query: 815 DIDDPKVVESGA--------QFNNVSEFSESEDENSQMGSD 717 +I + V++ Q + SEF++SEDE+S M SD Sbjct: 766 EIGISEDVDNERTWGSRDPHQSGHDSEFNQSEDEDSDMESD 806 >ref|XP_002317913.2| hypothetical protein POPTR_0012s05260g [Populus trichocarpa] gi|550326426|gb|EEE96133.2| hypothetical protein POPTR_0012s05260g [Populus trichocarpa] Length = 807 Score = 868 bits (2243), Expect = 0.0 Identities = 493/809 (60%), Positives = 569/809 (70%), Gaps = 25/809 (3%) Frame = -1 Query: 3089 MVMATAKLYSEMXXXXXXXXLSIPSRKSSFSVF--FLKPFS---SSLHPTTKLPRKATQV 
2925 M TAKL LS S S + F KPFS SS T K P+ + Sbjct: 2 MTFTTAKLTELPLRTTSTLPLSSHSLLSKIATFQSLKKPFSTATSSSLRTNKTPKTQQKN 61 Query: 2924 PNFSSGIPDHSSSWLKKWPST------SPLPPIHYKKPRTLQQESTSEAQFLDEAVKPPT 2763 PN W+ KW + +P + +KP + Sbjct: 62 PN-----------WISKWKPSQNHSIKNPPSEVSQEKPHYFSNDKGQ------------- 97 Query: 2762 SAIDRIVLRLRN-XXXXXXXXXXXXXXXXXXXXXXXXXXEKLGDLLKRDWVRPDK--XXX 2592 +AI+RIVLRLRN E+LGDLLKR+WVRPD Sbjct: 98 NAIERIVLRLRNLGLGSDDEDELEGLEGSEINGGGLTGEERLGDLLKREWVRPDTVVFSN 157 Query: 2591 XXXXXXXXXXLPWERSAEEGAIKDDKEEGSRTKRRSVKAPTLAELTIEDXXXXXXXXXXX 2412 LPWER E GA++ + S KRR KAPTLAELTIED Sbjct: 158 DEGSDSDESVLPWERE-ERGAVEMEGGIESGRKRRG-KAPTLAELTIEDEELRRLRRMGM 215 Query: 2411 XXXXRINVPKAGVTKEVLEKIHDKWRKNELVRLKFHEDLAHDMRTGHEIVERRTGGLVIW 2232 RI++PKAG+T VLE IHD+WRK ELVRLKFHE LAHDM+T HEIVERRTGGLVIW Sbjct: 216 FIRERISIPKAGITNAVLENIHDRWRKEELVRLKFHEVLAHDMKTAHEIVERRTGGLVIW 275 Query: 2231 RSGSVMVVFRGTNYEGPLSSQAQSVNREGDALFVPIVSSSGNLLEKNSSSSDNIDETRAP 2052 R+GSVMVVFRGTNY+GP S+ Q +REGDALFVP VSS+ +++ ++S+ + + E Sbjct: 276 RAGSVMVVFRGTNYQGP-PSKLQPADREGDALFVPDVSSTDSVMTRSSNIATSSSEKSKL 334 Query: 2051 V--VPNRTESMTAEEAEFNSLLDGLGPRFEEWWGTGILPVDADLLPQKVPGYKTPFRLLP 1878 V + TE+MT EEAE NSLLD LGPRFEEWWGTG+LPVDADLLP KVP YKTPFRLLP Sbjct: 335 VMRITEPTENMTEEEAELNSLLDDLGPRFEEWWGTGLLPVDADLLPPKVPCYKTPFRLLP 394 Query: 1877 TGMRPRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIIKLWEKSLVVKIAVKRGIQ 1698 GMR RLTNAEMT++RKLAK+LPCHFALGRNRNHQGLA AI+KLWEKSLV KIAVKRGIQ Sbjct: 395 VGMRARLTNAEMTNMRKLAKALPCHFALGRNRNHQGLAVAILKLWEKSLVAKIAVKRGIQ 454 Query: 1697 NTNNKLMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVAAALTERQEMTKQTQDAEEK 1518 NTNNKLMA+ELK LTGGVLLLRNKYYIV++RGKDFLP +VAAAL ERQE+TKQ QD EE+ Sbjct: 455 NTNNKLMADELKMLTGGVLLLRNKYYIVIFRGKDFLPQSVAAALAERQEVTKQIQDVEER 514 Query: 1517 VRGVPIEPVAT-IGEGEALAGTLAEFYEAQARWGREISVEEREKMKEEASRAKTARVVKR 1341 VR +E + EG+ALAGTLAEFYEAQARWGR+IS EEREKM EEAS+AKTAR+VKR Sbjct: 515 VRSNSVEAAPSGEDEGKALAGTLAEFYEAQARWGRDISTEEREKMIEEASKAKTARLVKR 574 Query: 1340 LEHKLAISQTKKLKAEKLLDKIISSWVPVGPDDDQETITDEERAMFRRVGLRMKAYLPLG 
1161 EHKLAI+Q KKL+AE LL KI ++ VP GPD DQETI++EER MFRRVGLRMKAYLPLG Sbjct: 575 TEHKLAIAQAKKLRAESLLSKIETTMVPSGPDFDQETISEEERVMFRRVGLRMKAYLPLG 634 Query: 1160 IRGVFDGVIENMHLHWKHRELVKLLSKDKEIAFVEETARLLEYESGGILVAIERVPKGFV 981 IRGVFDGVIENMHLHWKHRELVKL+SK K +AFVE+TA+LLEYESGG+LVAIERVPKGF Sbjct: 635 IRGVFDGVIENMHLHWKHRELVKLISKQKTLAFVEDTAKLLEYESGGVLVAIERVPKGFA 694 Query: 980 LIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALSQHIDELEKTIEQTRKDIDDP 801 LI+YRGKNYRRPIS+RPRNLLTKAKALKR VA+QR+EALSQHI ELEK IE+ K++ Sbjct: 695 LIYYRGKNYRRPISIRPRNLLTKAKALKRSVAMQRHEALSQHIFELEKNIEEMVKEMGLS 754 Query: 800 KVVES--------GAQFNNVSEFSESEDE 738 K E+ A NNVS+ ++SED+ Sbjct: 755 KEEENENNWSSEEHAPLNNVSKLTQSEDK 783 >gb|EXB56911.1| Chloroplastic group IIA intron splicing facilitator CRS1 [Morus notabilis] Length = 838 Score = 868 bits (2242), Expect = 0.0 Identities = 485/775 (62%), Positives = 556/775 (71%), Gaps = 39/775 (5%) Frame = -1 Query: 2924 PNFSSGIPDH-----SSSWLKKWPSTSPLPPIHYKKPRTLQQESTSEAQFLDEAVKPPT- 2763 P+ SS H S+ WL KWP P +E+ D +P T Sbjct: 66 PSSSSSSSSHRHKPPSAPWLNKWP------------PVESSDRKVAESTDRDRTDRPDTV 113 Query: 2762 ---------SAIDRIVLRLRNXXXXXXXXXXXXXXXXXXXXXXXXXXE----KLGDLLKR 2622 +AI+RIVLRLRN KLGDLL+R Sbjct: 114 GYVDRDRGRNAIERIVLRLRNLGLGSDDEDEDDKEGDIGLDGQDAMPVTGEEKLGDLLRR 173 Query: 2621 DWVRPDKXXXXXXXXXXXXXLPWERSAEEGAIKDDKEEGSRT-KRRSVKAPTLAELTIED 2445 +W+RPD LPWER EE + +EG+R ++R V APTLAELTIED Sbjct: 174 EWIRPD-FVLEEEESKDDLTLPWEREEEEKGV----DEGTRELRKRRVNAPTLAELTIED 228 Query: 2444 XXXXXXXXXXXXXXXRINVPKAGVTKEVLEKIHDKWRKNELVRLKFHEDLAHDMRTGHEI 2265 RI+VPKAG+T+ VLEKIHDKWRK ELVRLKFHE LAHDM+T HEI Sbjct: 229 EELRRLRRMGMFLRDRISVPKAGLTQAVLEKIHDKWRKEELVRLKFHEVLAHDMKTAHEI 288 Query: 2264 VERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVNREGDALFVPIVSSSGNLLEKNSS 2085 VERRTGGLV WRSGSVMVV+RG+NYEGP + Q VN+E DALF+P VSS+ N L ++ Sbjct: 289 VERRTGGLVTWRSGSVMVVYRGSNYEGP--PKTQPVNKERDALFIPDVSSAENFLTRSGD 346 Query: 2084 S-SDNIDETRAPVV-PNRTESMTAEEAEFNSLLDGLGPRFEEWWGTGILPVDADLLPQKV 1911 S + N +++ PV P ++MT EEAEFNSLLD LGPRF+EWWGTG++PVDADLLP 
K+ Sbjct: 347 SLTSNAEKSETPVRNPVSVQNMTEEEAEFNSLLDDLGPRFDEWWGTGVIPVDADLLPPKI 406 Query: 1910 PGYKTPFRLLPTGMRPRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIIKLWEKSL 1731 PGYKTPFRLLPTGMR RLTN EMT+LRK+AKSLP HFALGRNRNHQGLAAAIIKLWEKSL Sbjct: 407 PGYKTPFRLLPTGMRSRLTNGEMTNLRKVAKSLPSHFALGRNRNHQGLAAAIIKLWEKSL 466 Query: 1730 VVKIAVKRGIQNTNNKLMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVAAALTERQE 1551 V KIAVKRGIQNTNNKLMAEELK LTGGVLLLRNKYYIV+YRGKDFLP TVAA L ERQ+ Sbjct: 467 VAKIAVKRGIQNTNNKLMAEELKNLTGGVLLLRNKYYIVIYRGKDFLPTTVAATLAERQK 526 Query: 1550 MTKQTQDAEEKVRGVPIEP------VATIG----EGEALAGTLAEFYEAQARWGREISVE 1401 + KQ QD EE+VR IE V ++ EG+ALAGTLAEFYEAQARWGREI+ E Sbjct: 527 LAKQVQDLEEQVRVQDIEQKMQKKAVDSVPSGEEEGQALAGTLAEFYEAQARWGREITSE 586 Query: 1400 EREKMKEEASRAKTARVVKRLEHKLAISQTKKLKAEKLLDKIISSWVPVGPDDDQETITD 1221 EREKM EEA+ AK AR+VKR+EHK A++Q KKL+AEKLL KI +S VP GPD DQETIT+ Sbjct: 587 EREKMIEEAAVAKHARLVKRIEHKAAVAQAKKLRAEKLLAKIEASMVPAGPDYDQETITE 646 Query: 1220 EERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKDKEIAFVEETARL 1041 EER MFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKL++K K +AFVE+TARL Sbjct: 647 EERVMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLITKQKTLAFVEDTARL 706 Query: 1040 LEYESGGILVAIERVPKGFVLIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALS 861 LEYESGGILVAIERVPKGF LI+YRGKNYRRPISLRPRNLLTKAKALKR VA+QR+EALS Sbjct: 707 LEYESGGILVAIERVPKGFALIYYRGKNYRRPISLRPRNLLTKAKALKRSVAMQRHEALS 766 Query: 860 QHIDELEKTIEQTRKDIDDPKVVESGAQF-------NNVSEFSESEDENSQMGSD 717 QHI ELE TIEQ + I K + + +NVSEF +SE++++ SD Sbjct: 767 QHISELETTIEQMQDKIVASKSGQDEGSWSTDENLNDNVSEFIQSENDDAFEDSD 821 >ref|XP_006440978.1| hypothetical protein CICLE_v10018859mg [Citrus clementina] gi|567896982|ref|XP_006440979.1| hypothetical protein CICLE_v10018859mg [Citrus clementina] gi|567896984|ref|XP_006440980.1| hypothetical protein CICLE_v10018859mg [Citrus clementina] gi|557543240|gb|ESR54218.1| hypothetical protein CICLE_v10018859mg [Citrus clementina] gi|557543241|gb|ESR54219.1| hypothetical protein CICLE_v10018859mg [Citrus clementina] 
gi|557543242|gb|ESR54220.1| hypothetical protein CICLE_v10018859mg [Citrus clementina] Length = 833 Score = 865 bits (2234), Expect = 0.0 Identities = 479/781 (61%), Positives = 562/781 (71%), Gaps = 17/781 (2%) Frame = -1 Query: 3026 SIPSRKSSFSVFFLKPFSSSLHPTTKLPRKATQVPNFSSG-IPDHSSSWLKKW-----PS 2865 S SRK+ S LKPFSS T + PR +Q F P S+ WL W PS Sbjct: 33 SSSSRKTP-SFQLLKPFSSLR--TNQNPRTDSQNQQFPKPRSPSTSAPWLNNWSRPKPPS 89 Query: 2864 TSPLPPIHYKKPRTLQQESTSEAQFLDEAVKPPTSAIDRIVLRLRNXXXXXXXXXXXXXX 2685 T + + +Q S ++ +AI+RIVLRLRN Sbjct: 90 TENANKLGGRNQIDEKQTSPDSYPRYSDSDNKGRNAIERIVLRLRNLGLGSDDEEEGEEE 149 Query: 2684 XXXXXXXXXXXXEKLGDLLKRDWVRPDKXXXXXXXXXXXXXLPWERSAEEGAIKDDKEEG 2505 +L DLL+R+WVRP+ LPWER EE ++ Sbjct: 150 EDDINDAATGEE-RLEDLLRREWVRPNTVLREVEGEEDDSLLPWEREEEENLRAGGEKPA 208 Query: 2504 SRTKRRSVKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTKEVLEKIHDKWRKNE 2325 T+RR +KAPTLAELTIED RINVPKAG+T++V+ KIHDKWRK+E Sbjct: 209 GETRRRRMKAPTLAELTIEDEELRRLRRNGMYLRERINVPKAGLTQDVMRKIHDKWRKDE 268 Query: 2324 LVRLKFHEDLAHDMRTGHEIVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVNREG 2145 LVRLKFHE LA DM+T HEIVERRTGGLVIWR+GSVMVV+RG+NY GP SS+ Q ++ +G Sbjct: 269 LVRLKFHEVLATDMKTAHEIVERRTGGLVIWRAGSVMVVYRGSNYAGP-SSKPQPIDGDG 327 Query: 2144 DALFVPIVSSSGNLLEKNSSSSDNIDE-TRAPV-VPNRTESMTAEEAEFNSLLDGLGPRF 1971 D LFVP VSS+ + S++ ++DE + PV + + ++ MT EEAE NSLLD LGPRF Sbjct: 328 DTLFVPHVSST------DGSTARSVDEKSEVPVRILDHSKPMTEEEAECNSLLDSLGPRF 381 Query: 1970 EEWWGTGILPVDADLLPQKVPGYKTPFRLLPTGMRPRLTNAEMTDLRKLAKSLPCHFALG 1791 +EWWGTGILPVDADLLP KV GYKTPFRLLPTGMR RLTNAEMTDLR+LA+SLPCHFALG Sbjct: 382 QEWWGTGILPVDADLLPPKVDGYKTPFRLLPTGMRSRLTNAEMTDLRRLARSLPCHFALG 441 Query: 1790 RNRNHQGLAAAIIKLWEKSLVVKIAVKRGIQNTNNKLMAEELKTLTGGVLLLRNKYYIVM 1611 RNRNHQGLA AI+KLWEKSLV KIAVKRGIQNTNNKLMAEELK+LTGG LL RNK+YIV+ Sbjct: 442 RNRNHQGLAVAILKLWEKSLVAKIAVKRGIQNTNNKLMAEELKSLTGGTLLQRNKFYIVL 501 Query: 1610 YRGKDFLPPTVAAALTERQEMTKQTQDAEEKVRGVPIEPVAT-IGEGEALAGTLAEFYEA 1434 YRGKDFLPP VA+AL ER++ KQ QD EEKVR +E + EG+A AGTLAEFYEA Sbjct: 502 
YRGKDFLPPNVASALAEREQCAKQIQDVEEKVRSKTLEATPSGETEGQAPAGTLAEFYEA 561 Query: 1433 QARWGREISVEEREKMKEEASRAKTARVVKRLEHKLAISQTKKLKAEKLLDKIISSWVPV 1254 Q RWGRE+S EEREKM EEAS+AK R+VKR+EHKLA+SQ KKL+AE+LL KI +S VP Sbjct: 562 QKRWGREVSAEEREKMVEEASKAKHGRLVKRIEHKLAVSQAKKLRAERLLAKIEASMVPS 621 Query: 1253 GPDDDQETITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKDK 1074 GPD DQETITDEERAMFRRVGLRMKA+LPLGIRGVFDGV+ENMHLHWK+RELVKL++K K Sbjct: 622 GPDYDQETITDEERAMFRRVGLRMKAFLPLGIRGVFDGVVENMHLHWKYRELVKLITKQK 681 Query: 1073 EIAFVEETARLLEYESGGILVAIERVPKGFVLIFYRGKNYRRPISLRPRNLLTKAKALKR 894 +A+VE+TARLLEYES GIL+AIERVPKGF LIFYRGKNYRRPISLRPRNLLTKAKALKR Sbjct: 682 TLAYVEDTARLLEYESVGILIAIERVPKGFALIFYRGKNYRRPISLRPRNLLTKAKALKR 741 Query: 893 RVALQRYEALSQHIDELEKTIEQTRKDIDDPKVVESG--------AQFNNVSEFSESEDE 738 VA+QR+EALSQHI +LE TIEQ +K+I K E G QF++VS ++ED Sbjct: 742 SVAMQRHEALSQHISDLENTIEQMKKEIGVSKDEEDGNIRCSGDLKQFDHVSVLPQNEDN 801 Query: 737 N 735 + Sbjct: 802 D 802 >ref|XP_006485796.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like [Citrus sinensis] Length = 837 Score = 861 bits (2224), Expect = 0.0 Identities = 482/793 (60%), Positives = 563/793 (70%), Gaps = 29/793 (3%) Frame = -1 Query: 3026 SIPSRKSSFSVFFLKPFSSSLHPTTKLPRKATQVPNFSSG-IPDHSSSWLKKWPSTSPLP 2850 S SRK+ S LKPFSS T + PR +Q F P S+ WL W P Sbjct: 33 SSSSRKTP-SFQLLKPFSSLR--TNQNPRTDSQNQKFPKPRFPSTSAPWLNNWSRPKP-- 87 Query: 2849 PIHYKKPRTLQQESTSEAQFLDEAVKPPTS-------------AIDRIVLRLRNXXXXXX 2709 P T + +DE P S AI+RIVLRLRN Sbjct: 88 ------PSTENVNKSDGRNQIDEKQTAPDSYPRYSDSDNKGRNAIERIVLRLRNLGLGSD 141 Query: 2708 XXXXXXXXXXXXXXXXXXXXEKLGDLLKRDWVRPDKXXXXXXXXXXXXXLPWERSAEEGA 2529 +L DLL+R+WVRP+ LPWER EE Sbjct: 142 DEEEGEEEEDDINGAATGEE-RLEDLLRREWVRPNTVLREVEGEEDDSLLPWEREEEENL 200 Query: 2528 IKDDKEEGSRTKRRSVKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTKEVLEKI 2349 ++ T+RR +KAPTLAELTIED RINVPKAG+T++V+ KI Sbjct: 201 RAGGEKPAGETRRRRMKAPTLAELTIEDEELRRLRRNGMYLRERINVPKAGLTQDVMRKI 260 Query: 2348 
HDKWRKNELVRLKFHEDLAHDMRTGHEIVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQ 2169 HDKWRK+ELVRLKFHE LA DM+T HEIVERRTGGLVIWR+GSVMVV++G+NY GP SS+ Sbjct: 261 HDKWRKDELVRLKFHEVLATDMKTAHEIVERRTGGLVIWRAGSVMVVYQGSNYAGP-SSK 319 Query: 2168 AQSVNREGDA----LFVPIVSSSGNLLEKNSSSSDNIDE-TRAPV-VPNRTESMTAEEAE 2007 Q ++ +GD LFVP VSS+ + S++ ++DE + PV + + ++ MT EEAE Sbjct: 320 PQPLDGDGDGDGDTLFVPHVSST------DGSTARSVDEKSEVPVRILDHSKPMTEEEAE 373 Query: 2006 FNSLLDGLGPRFEEWWGTGILPVDADLLPQKVPGYKTPFRLLPTGMRPRLTNAEMTDLRK 1827 NSLLD LGPRF+EWWGTGILPVDADLLP KV GYKTPFRLLPTGMR RLTNAEMTDLR+ Sbjct: 374 CNSLLDSLGPRFQEWWGTGILPVDADLLPPKVDGYKTPFRLLPTGMRSRLTNAEMTDLRR 433 Query: 1826 LAKSLPCHFALGRNRNHQGLAAAIIKLWEKSLVVKIAVKRGIQNTNNKLMAEELKTLTGG 1647 LA+SLPCHFALGRNRNHQGLA AI+KLWEKSLV KIAVKRGIQNTNNKLMAEELK+LTGG Sbjct: 434 LARSLPCHFALGRNRNHQGLAVAILKLWEKSLVAKIAVKRGIQNTNNKLMAEELKSLTGG 493 Query: 1646 VLLLRNKYYIVMYRGKDFLPPTVAAALTERQEMTKQTQDAEEKVRGVPIEPVAT-IGEGE 1470 LL RNK+YIV+YRGKDFLPP VA+AL ER++ KQ QD EEKVR +E + EG+ Sbjct: 494 TLLQRNKFYIVLYRGKDFLPPNVASALAEREQCAKQIQDVEEKVRSKTLEATPSGETEGQ 553 Query: 1469 ALAGTLAEFYEAQARWGREISVEEREKMKEEASRAKTARVVKRLEHKLAISQTKKLKAEK 1290 A AGTLAEFYEAQ RWGRE+S EEREKM EEAS+AK AR+VKR+EHKLA+SQ KKL+AE+ Sbjct: 554 APAGTLAEFYEAQKRWGREVSAEEREKMVEEASKAKHARLVKRIEHKLAVSQAKKLRAER 613 Query: 1289 LLDKIISSWVPVGPDDDQETITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWK 1110 LL KI +S VP GPD DQETITDEERAMFRRVGLRMKA+LPLGIRGVFDGV+ENMHLHWK Sbjct: 614 LLAKIEASMVPSGPDYDQETITDEERAMFRRVGLRMKAFLPLGIRGVFDGVVENMHLHWK 673 Query: 1109 HRELVKLLSKDKEIAFVEETARLLEYESGGILVAIERVPKGFVLIFYRGKNYRRPISLRP 930 +RELVKL++K K +A+VE+TARLLEYESGGIL+AIERVPKGF LIFYRGKNYRRPISLRP Sbjct: 674 YRELVKLITKQKTLAYVEDTARLLEYESGGILIAIERVPKGFALIFYRGKNYRRPISLRP 733 Query: 929 RNLLTKAKALKRRVALQRYEALSQHIDELEKTIEQTRKDIDDPKVVESG--------AQF 774 RNLLTKAKALKR VA+QR+EALSQHI +LE TIEQ +K+I K E G QF Sbjct: 734 RNLLTKAKALKRSVAMQRHEALSQHISDLENTIEQMKKEIGVFKDEEDGNIRCSGDLKQF 793 Query: 773 NNVSEFSESEDEN 735 ++VS ++ED++ Sbjct: 794 DHVSVLPQNEDDD 806 >ref|XP_006440981.1| hypothetical 
protein CICLE_v10018859mg [Citrus clementina] gi|557543243|gb|ESR54221.1| hypothetical protein CICLE_v10018859mg [Citrus clementina] Length = 806 Score = 857 bits (2215), Expect = 0.0 Identities = 470/748 (62%), Positives = 548/748 (73%), Gaps = 9/748 (1%) Frame = -1 Query: 3026 SIPSRKSSFSVFFLKPFSSSLHPTTKLPRKATQVPNFSSG-IPDHSSSWLKKW-----PS 2865 S SRK+ S LKPFSS T + PR +Q F P S+ WL W PS Sbjct: 33 SSSSRKTP-SFQLLKPFSSLR--TNQNPRTDSQNQQFPKPRSPSTSAPWLNNWSRPKPPS 89 Query: 2864 TSPLPPIHYKKPRTLQQESTSEAQFLDEAVKPPTSAIDRIVLRLRNXXXXXXXXXXXXXX 2685 T + + +Q S ++ +AI+RIVLRLRN Sbjct: 90 TENANKLGGRNQIDEKQTSPDSYPRYSDSDNKGRNAIERIVLRLRNLGLGSDDEEEGEEE 149 Query: 2684 XXXXXXXXXXXXEKLGDLLKRDWVRPDKXXXXXXXXXXXXXLPWERSAEEGAIKDDKEEG 2505 +L DLL+R+WVRP+ LPWER EE ++ Sbjct: 150 EDDINDAATGEE-RLEDLLRREWVRPNTVLREVEGEEDDSLLPWEREEEENLRAGGEKPA 208 Query: 2504 SRTKRRSVKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTKEVLEKIHDKWRKNE 2325 T+RR +KAPTLAELTIED RINVPKAG+T++V+ KIHDKWRK+E Sbjct: 209 GETRRRRMKAPTLAELTIEDEELRRLRRNGMYLRERINVPKAGLTQDVMRKIHDKWRKDE 268 Query: 2324 LVRLKFHEDLAHDMRTGHEIVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVNREG 2145 LVRLKFHE LA DM+T HEIVERRTGGLVIWR+GSVMVV+RG+NY GP SS+ Q ++ +G Sbjct: 269 LVRLKFHEVLATDMKTAHEIVERRTGGLVIWRAGSVMVVYRGSNYAGP-SSKPQPIDGDG 327 Query: 2144 DALFVPIVSSSGNLLEKNSSSSDNIDE-TRAPV-VPNRTESMTAEEAEFNSLLDGLGPRF 1971 D LFVP VSS+ + S++ ++DE + PV + + ++ MT EEAE NSLLD LGPRF Sbjct: 328 DTLFVPHVSST------DGSTARSVDEKSEVPVRILDHSKPMTEEEAECNSLLDSLGPRF 381 Query: 1970 EEWWGTGILPVDADLLPQKVPGYKTPFRLLPTGMRPRLTNAEMTDLRKLAKSLPCHFALG 1791 +EWWGTGILPVDADLLP KV GYKTPFRLLPTGMR RLTNAEMTDLR+LA+SLPCHFALG Sbjct: 382 QEWWGTGILPVDADLLPPKVDGYKTPFRLLPTGMRSRLTNAEMTDLRRLARSLPCHFALG 441 Query: 1790 RNRNHQGLAAAIIKLWEKSLVVKIAVKRGIQNTNNKLMAEELKTLTGGVLLLRNKYYIVM 1611 RNRNHQGLA AI+KLWEKSLV KIAVKRGIQNTNNKLMAEELK+LTGG LL RNK+YIV+ Sbjct: 442 RNRNHQGLAVAILKLWEKSLVAKIAVKRGIQNTNNKLMAEELKSLTGGTLLQRNKFYIVL 501 Query: 1610 YRGKDFLPPTVAAALTERQEMTKQTQDAEEKVRGVPIEPVAT-IGEGEALAGTLAEFYEA 1434 YRGKDFLPP VA+AL ER++ KQ QD EEKVR +E + EG+A 
AGTLAEFYEA Sbjct: 502 YRGKDFLPPNVASALAEREQCAKQIQDVEEKVRSKTLEATPSGETEGQAPAGTLAEFYEA 561 Query: 1433 QARWGREISVEEREKMKEEASRAKTARVVKRLEHKLAISQTKKLKAEKLLDKIISSWVPV 1254 Q RWGRE+S EEREKM EEAS+AK R+VKR+EHKLA+SQ KKL+AE+LL KI +S VP Sbjct: 562 QKRWGREVSAEEREKMVEEASKAKHGRLVKRIEHKLAVSQAKKLRAERLLAKIEASMVPS 621 Query: 1253 GPDDDQETITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKDK 1074 GPD DQETITDEERAMFRRVGLRMKA+LPLGIRGVFDGV+ENMHLHWK+RELVKL++K K Sbjct: 622 GPDYDQETITDEERAMFRRVGLRMKAFLPLGIRGVFDGVVENMHLHWKYRELVKLITKQK 681 Query: 1073 EIAFVEETARLLEYESGGILVAIERVPKGFVLIFYRGKNYRRPISLRPRNLLTKAKALKR 894 +A+VE+TARLLEYES GIL+AIERVPKGF LIFYRGKNYRRPISLRPRNLLTKAKALKR Sbjct: 682 TLAYVEDTARLLEYESVGILIAIERVPKGFALIFYRGKNYRRPISLRPRNLLTKAKALKR 741 Query: 893 RVALQRYEALSQHIDELEKTIEQTRKDI 810 VA+QR+EALSQHI +LE TIEQ +K+I Sbjct: 742 SVAMQRHEALSQHISDLENTIEQMKKEI 769 >ref|XP_002532154.1| conserved hypothetical protein [Ricinus communis] gi|223528164|gb|EEF30228.1| conserved hypothetical protein [Ricinus communis] Length = 745 Score = 845 bits (2182), Expect = 0.0 Identities = 458/716 (63%), Positives = 532/716 (74%), Gaps = 8/716 (1%) Frame = -1 Query: 2984 KPFSSSLHPTTKLPRKATQVPNFSSGIPDHSSSWLKKWPSTSPLPPIHYKKPRTLQQEST 2805 +PFSSS ++ T N + + S WL KW S PP P+ L Q+ Sbjct: 37 RPFSSSSSSSSSSSSLGT---NQNPKPNNPKSPWLSKWAPHSSPPPTVKTSPK-LAQDKK 92 Query: 2804 SEAQFLDEAVKPPTSAIDRIVLRLRNXXXXXXXXXXXXXXXXXXXXXXXXXXE---KLGD 2634 ++ D+ +AI+RIVLRLRN +L D Sbjct: 93 IQSLTKDKG----QNAIERIVLRLRNLGLGSDDEEEEGDMEYKPNGGDSIAVTGEERLAD 148 Query: 2633 LLKRDWVRPDKXXXXXXXXXXXXXL--PWERSAEEGAIKDDKEEGSRTKRRSVKAPTLAE 2460 LL+R+WVRPD L PWER E+ + +KEEG R +RR VKAPTLAE Sbjct: 149 LLQREWVRPDTIFIKDDEEDDNDDLVLPWERK-EKVRREGEKEEGERERRRVVKAPTLAE 207 Query: 2459 LTIEDXXXXXXXXXXXXXXXRINVPKAGVTKEVLEKIHDKWRKNELVRLKFHEDLAHDMR 2280 LTIED R+NVPKAG+TKEV+EKIHDKWRKNELVRLKFHE LAHDM+ Sbjct: 208 LTIEDEELRRLRRMGMFLRERVNVPKAGLTKEVVEKIHDKWRKNELVRLKFHEVLAHDMK 267 Query: 2279 TGHEIVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVNREGDALFVPIVSSSGNLL 2100 T HEI 
ERRTGGLVIWR+GSVMVV+RG++YEGP S+ Q VNREGDALF+P VSS+G+ Sbjct: 268 TAHEITERRTGGLVIWRAGSVMVVYRGSSYEGP-PSKTQPVNREGDALFIPDVSSAGSET 326 Query: 2099 EKNSSSSDNIDETRAPVVP--NRTESMTAEEAEFNSLLDGLGPRFEEWWGTGILPVDADL 1926 K + + + E R + + ++ MT EE E++S LD LGPRFEEWWGTGILPVDADL Sbjct: 327 MKGDNVAPSAAEKRELAMRRLDHSKDMTEEEIEYDSFLDSLGPRFEEWWGTGILPVDADL 386 Query: 1925 LPQKVPGYKTPFRLLPTGMRPRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIIKL 1746 LP K+P YKTPFRLLPTGMR RLTNAEMT+LRKLAK LPCHFALGRNRNHQGLA+ I+K+ Sbjct: 387 LPPKIPDYKTPFRLLPTGMRSRLTNAEMTNLRKLAKKLPCHFALGRNRNHQGLASTILKV 446 Query: 1745 WEKSLVVKIAVKRGIQNTNNKLMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVAAAL 1566 WEKSLV KIAVKRGIQNTNNKLMA+ELK LTGGVLLLRNKYYIV+YRGKDFLP +VAAAL Sbjct: 447 WEKSLVAKIAVKRGIQNTNNKLMADELKMLTGGVLLLRNKYYIVIYRGKDFLPTSVAAAL 506 Query: 1565 TERQEMTKQTQDAEEKVRGVPIEPVATIGE-GEALAGTLAEFYEAQARWGREISVEEREK 1389 TERQE+TK+ QD EEKVR IE V + E G+ LAGTLAEFYEAQ+RWG++ S E+REK Sbjct: 507 TERQELTKKIQDVEEKVRSREIEAVPSKEEEGKPLAGTLAEFYEAQSRWGKDTSAEDREK 566 Query: 1388 MKEEASRAKTARVVKRLEHKLAISQTKKLKAEKLLDKIISSWVPVGPDDDQETITDEERA 1209 M E+ +RAK AR+VKR+EHKLA++Q KKL+AE+LL KI S +P GPD DQETITDEERA Sbjct: 567 MIEDDTRAKRARIVKRIEHKLAVAQAKKLRAERLLAKIEVSMLPSGPDYDQETITDEERA 626 Query: 1208 MFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKDKEIAFVEETARLLEYE 1029 +FRR+GLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKL+SK K +AF E+TARLLEYE Sbjct: 627 VFRRIGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLISKQKTLAFAEDTARLLEYE 686 Query: 1028 SGGILVAIERVPKGFVLIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALS 861 SGGILVAIERVPKGF LI+YRGKNYRRPI+LRPRNLLTKAKALKR VA+QR+E S Sbjct: 687 SGGILVAIERVPKGFALIYYRGKNYRRPINLRPRNLLTKAKALKRSVAMQRHEVSS 742 >ref|XP_003530015.2| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like isoform X1 [Glycine max] Length = 791 Score = 835 bits (2156), Expect = 0.0 Identities = 459/772 (59%), Positives = 550/772 (71%), Gaps = 19/772 (2%) Frame = -1 Query: 2990 FLKPFSSSLHPTT-----KLPRKATQVPNFSSGIPDHSSSWLKKWPSTSPLPPIHYKKPR 2826 F FSS HP K P + + + P+ S+ WL K PS P+ Sbjct: 
14 FKSSFSSLNHPHPPRSFRKFPLRTLTFASLPTPKPNPSAPWLTKSPS-----------PK 62 Query: 2825 TLQQESTSEAQFLDEAVKPPTSAIDRIVLRLRN-XXXXXXXXXXXXXXXXXXXXXXXXXX 2649 + T+ D K P + ++RIVLRLRN Sbjct: 63 RATEPLTAGDPIPD---KKPHNPVERIVLRLRNLGLPSEEEEQEEEEEIPANNPAPVTGE 119 Query: 2648 EKLGDLLKRDWVRPDKXXXXXXXXXXXXXLPWERSAEEGAIKDDKEEGSRTKRRSVKAPT 2469 E+LG+LL+R+WVRPD LPWER E+ + EEG KRR V+AP+ Sbjct: 120 ERLGELLRREWVRPDAVLVGEDDGEEEMILPWEREEEKEVVVVVSEEGLLKKRR-VRAPS 178 Query: 2468 LAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTKEVLEKIHDKWRKNELVRLKFHEDLAH 2289 LA+LT+ED R++VPKAG+T+EV+EKIH +WRK ELVRLKFHE+LA Sbjct: 179 LADLTLEDELLRRLRREGMRVRERVSVPKAGLTQEVMEKIHKRWRKEELVRLKFHEELAK 238 Query: 2288 DMRTGHEIVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVNREGDALFVPIVSSSG 2109 DMR HEIVERRTGGLV WRSGSVM+V+RG +Y+GP SQ + ++GD FVP VS Sbjct: 239 DMRKAHEIVERRTGGLVTWRSGSVMMVYRGIDYQGP-DSQKEVNEKKGDGFFVPDVSK-- 295 Query: 2108 NLLEKNSSSSDNIDETRAPVVPNR--TESMTAEEAEFNSLLDGLGPRFEEWWGTGILPVD 1935 ++SS++ + E VV R E+M+ EAE+N+LLDGLGPRF WWGTGILPVD Sbjct: 296 ---REDSSTATSTSEKSEVVVREREHPENMSEAEAEYNALLDGLGPRFVGWWGTGILPVD 352 Query: 1934 ADLLPQKVPGYKTPFRLLPTGMRPRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAI 1755 ADLLP+ VPGYKTPFRLLPTGMR RLTNAEMT+LRKLAKSLPCHFALGRNRNHQGLA AI Sbjct: 353 ADLLPRTVPGYKTPFRLLPTGMRSRLTNAEMTNLRKLAKSLPCHFALGRNRNHQGLACAI 412 Query: 1754 IKLWEKSLVVKIAVKRGIQNTNNKLMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVA 1575 +KLWEKSLV KIAVKRGIQNTNN+LMAEELK LTGG LLLRNKY+IV+YRGKDF+P +VA Sbjct: 413 LKLWEKSLVAKIAVKRGIQNTNNELMAEELKMLTGGTLLLRNKYFIVIYRGKDFVPTSVA 472 Query: 1574 AALTERQEMTKQTQDAEEKVRGVPIEPVATIGEGEAL--AGTLAEFYEAQARWGREISVE 1401 A L ER+E+TKQ QD E+KVR ++ + +G+GEA AGTLAEFYEAQARWGREIS E Sbjct: 473 AVLAEREELTKQVQDVEDKVRCRAVDAI-PLGQGEATAQAGTLAEFYEAQARWGREISPE 531 Query: 1400 EREKMKEEASRAKTARVVKRLEHKLAISQTKKLKAEKLLDKIISSWVPVGPDDDQETITD 1221 EREKM EEA++ KTA++V+++EHK+ I+QTKKL+AEKLL KI +S VP GPD DQETITD Sbjct: 532 EREKMVEEAAKTKTAKLVRQIEHKIFIAQTKKLRAEKLLAKIEASMVPAGPDYDQETITD 591 Query: 1220 EERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKDKEIAFVEETARL 1041 EER MFR+VGLRMK 
YLPLGIRGVFDGV+ENMHLHWKHRELVKL++K K +AFVE+TARL Sbjct: 592 EERVMFRKVGLRMKPYLPLGIRGVFDGVVENMHLHWKHRELVKLMTKQKTVAFVEDTARL 651 Query: 1040 LEYESGGILVAIERVPKGFVLIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALS 861 LEYESGGILVAIE+V K F LI+YRGKNY+RPI+LRPRNLLTK KALKR VA+QR+EALS Sbjct: 652 LEYESGGILVAIEKVSKEFALIYYRGKNYKRPITLRPRNLLTKGKALKRHVAMQRHEALS 711 Query: 860 QHIDELEKTIEQTRKDI---------DDPKVVESGAQFNNVSEFSESEDENS 732 QHI ELEKTIEQ +K++ D + E ++SE + SEDE+S Sbjct: 712 QHITELEKTIEQMKKELGMTQDSDVEDGGSIEEDDHNQIDISELALSEDEDS 763 >gb|EPS74467.1| hypothetical protein M569_00278, partial [Genlisea aurea] Length = 693 Score = 830 bits (2144), Expect = 0.0 Identities = 438/688 (63%), Positives = 504/688 (73%), Gaps = 11/688 (1%) Frame = -1 Query: 2759 AIDRIVLRLRNXXXXXXXXXXXXXXXXXXXXXXXXXXE-----KLGDLLKRDWVRPDKXX 2595 AIDRIVLRLRN E KLGDLLKRDWVRPD Sbjct: 1 AIDRIVLRLRNLGLGSDEEGDDGRGLSREDSIDSKLEELGEEEKLGDLLKRDWVRPDTIL 60 Query: 2594 XXXXXXXXXXXL--PWERSAEEGAIKDDKEEGSRTKRRSVKAPTLAELTIEDXXXXXXXX 2421 L PWER +D+ E +++ ++APT+AELTIED Sbjct: 61 VQDSDSDSDSELLLPWERRGN-ATEQDEMEAKGASRKGEMRAPTMAELTIEDEELRRLRR 119 Query: 2420 XXXXXXXRINVPKAGVTKEVLEKIHDKWRKNELVRLKFHEDLAHDMRTGHEIVERRTGGL 2241 RINVPKAG+T +LEKIH+KWRK+ELVRLKFHE+LAHDM+T H+IVERRTGGL Sbjct: 120 MGMTLRERINVPKAGITGVILEKIHEKWRKSELVRLKFHEELAHDMKTAHQIVERRTGGL 179 Query: 2240 VIWRSGSVMVVFRGTNYEGPLSS-QAQSVNREGDALFVPIVSSSGNLLEKNSSSSDNIDE 2064 V WRSGSVMVVFRGTNYEGP+S Q +++ E D FVP V S + + S+ E Sbjct: 180 VTWRSGSVMVVFRGTNYEGPVSKPQRPNIDEEDDGPFVPTVPSGEVVTSETGDSTSKTLE 239 Query: 2063 TRAPVVPNRTESMTAEEAEFNSLLDGLGPRFEEWWGTGILPVDADLLPQKVPGYKTPFRL 1884 + ++ + ES+T +EAE+N LLDGLGPRFE+WWGTG+LPVDADLLP VPGYKTPFRL Sbjct: 240 KPSRIIASAAESVTEQEAEYNMLLDGLGPRFEDWWGTGVLPVDADLLPPAVPGYKTPFRL 299 Query: 1883 LPTGMRPRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIIKLWEKSLVVKIAVKRG 1704 LP GMR RLTNAEMT LRKLAK LP HFALG+NR HQGLA+AI+KLWEKSL+VKIAVKRG Sbjct: 300 LPVGMRSRLTNAEMTHLRKLAKRLPSHFALGKNRKHQGLASAIVKLWEKSLLVKIAVKRG 359 Query: 1703 IQNTNNKLMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVAAALTERQEMTKQTQDAE 1524 
IQNTNNKLMAEELK LTGGVLLLRNKYYI+MYRGKDFLPP+VA+AL ER EMTKQ QD E Sbjct: 360 IQNTNNKLMAEELKALTGGVLLLRNKYYIIMYRGKDFLPPSVASALAERNEMTKQIQDVE 419 Query: 1523 EKVRGVPIEPVATIGEG---EALAGTLAEFYEAQARWGREISVEEREKMKEEASRAKTAR 1353 E+VR P + + EA AGTL+EFYEAQ RWG EIS ++R KM EEASR+ + Sbjct: 420 ERVRRGPAAAITNGDDDDGKEASAGTLSEFYEAQVRWGMEISPDQRNKMLEEASRSIKMK 479 Query: 1352 VVKRLEHKLAISQTKKLKAEKLLDKIISSWVPVGPDDDQETITDEERAMFRRVGLRMKAY 1173 +KRLE K+A +Q KKL+AEKLL KI+ SWVPV P DDQETITDEER M+RR+GLRM Y Sbjct: 480 ALKRLERKVAAAQAKKLRAEKLLSKIVDSWVPVDPSDDQETITDEERVMYRRLGLRMTPY 539 Query: 1172 LPLGIRGVFDGVIENMHLHWKHRELVKLLSKDKEIAFVEETARLLEYESGGILVAIERVP 993 LPLGIRGVFDGVIENMHLHWKHRELVKL+SK+KE +FVEETARLLEYESGGILVAIERVP Sbjct: 540 LPLGIRGVFDGVIENMHLHWKHRELVKLISKEKETSFVEETARLLEYESGGILVAIERVP 599 Query: 992 KGFVLIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALSQHIDELEKTIEQTRKD 813 KG LI+YRGKNY+RP+SLRPRNLL K+ ALKRRVALQRYEALSQHI ELEKTI Q ++ Sbjct: 600 KGHALIYYRGKNYQRPLSLRPRNLLNKSNALKRRVALQRYEALSQHISELEKTISQAKQQ 659 Query: 812 IDDPKVVESGAQFNNVSEFSESEDENSQ 729 + E + + E ED N + Sbjct: 660 MAATDPPEEDEEEEEEKKEEEEEDPNKE 687 >ref|XP_004512920.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like [Cicer arietinum] Length = 809 Score = 830 bits (2144), Expect = 0.0 Identities = 463/776 (59%), Positives = 561/776 (72%), Gaps = 14/776 (1%) Frame = -1 Query: 3017 SRKSSFSVFFLKPFSSSLHPTTKLPRKATQVPNFSSGIPDHSSSWLKKWPSTSPLPPIHY 2838 + SSFS+ L ++H L P FSS H SS +++P PP Sbjct: 18 NNNSSFSLINLSRSYFTIH---FLSHPKPSFPIFSSLKTTHHSS---PKSNSNPTPPWLS 71 Query: 2837 KKPRTLQQESTSEAQFLDEAVKPPTSAIDRIVLRLRN-XXXXXXXXXXXXXXXXXXXXXX 2661 R + +E+ L P + ++RIV RLRN Sbjct: 72 SPKRVTESPIKNESLNLQHDNNKPKNPVERIVFRLRNLGLAEEEGEKEQQEEEVEVSELP 131 Query: 2660 XXXXEKLGDLLKRDWVRPDKXXXXXXXXXXXXXLPWERSAEE---GAIKDDKEEGSRTKR 2490 EKL +LLKR WVRPD LPW+R E G EEG K+ Sbjct: 132 VSGDEKLSELLKRKWVRPDALLDDEDKEEDEMVLPWKREEEREMGGGDVGIDEEG--LKK 189 Query: 2489 RSVKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTKEVLEKIHDKWRKNELVRLK 2310 R++KAP+LAELT+ED 
R++VPKAG+T+EV+EKIH++WRK ELVRLK Sbjct: 190 RTIKAPSLAELTLEDELLRRLRREGMRVRERVSVPKAGLTQEVMEKIHERWRKEELVRLK 249 Query: 2309 FHEDLAHDMRTGHEIVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVNREGDALFV 2130 FHE+LA +MR HEIVERRTGGLV WR+GSVM+V+RG NY+GP SS+ +EGD FV Sbjct: 250 FHEELAKNMRVAHEIVERRTGGLVTWRAGSVMMVYRGKNYQGPNSSKELDA-KEGDGFFV 308 Query: 2129 PIVSSSGNLLEKNSSSSDNIDETRAPVVPN--RTESMTAEEAEFNSLLDGLGPRFEEWWG 1956 P VSS + K+SS++ ++ + A V N + E+MT EEAE+N+LLDGLGPRF EWWG Sbjct: 309 PDVSSKSSSRTKDSSTTASL-KNSAQVRRNDEQPENMTKEEAEYNALLDGLGPRFFEWWG 367 Query: 1955 TGILPVDADLLPQKVPGYKTPFRLLPTGMRPRLTNAEMTDLRKLAKSLPCHFALGRNRNH 1776 TGILPVDADLLP+ +PGYKTP+RLLPTGMR RLT+AE+TDLRK+AKSLPCHFALGRNR H Sbjct: 368 TGILPVDADLLPRDIPGYKTPYRLLPTGMRSRLTSAEITDLRKIAKSLPCHFALGRNRYH 427 Query: 1775 QGLAAAIIKLWEKSLVVKIAVKRGIQNTNNKLMAEELKTLTGGVLLLRNKYYIVMYRGKD 1596 QGLA AI+KLWEKSL+ KIAVK GIQNTNNKLMA+EL TLTGG LLLR+KYYIV+YRGKD Sbjct: 428 QGLACAILKLWEKSLIAKIAVKPGIQNTNNKLMADELVTLTGGTLLLRDKYYIVIYRGKD 487 Query: 1595 FLPPTVAAALTERQEMTKQTQDAEEKVRGVPIEPVAT-IGEGEA--LAGTLAEFYEAQAR 1425 F+P VAA L ERQE+TK+ QD EEKVR + VAT G+GEA LAGTLAEFYEAQAR Sbjct: 488 FVPTGVAAVLAERQELTKEVQDVEEKVRCKAV--VATPSGQGEATVLAGTLAEFYEAQAR 545 Query: 1424 WGREISVEEREKMKEEASRAKTARVVKRLEHKLAISQTKKLKAEKLLDKIISSWVPVGPD 1245 WGR+IS EERE+M EEA++AK+ ++VK++EH+L+++QTKK++AEKLL KI S VPVGPD Sbjct: 546 WGRDISTEERERMIEEAAKAKSVKLVKQIEHRLSLAQTKKIRAEKLLAKIEVSMVPVGPD 605 Query: 1244 DDQETITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKDKEIA 1065 DQETITDEERA+FRR+GLRMK YLPLGIRGVFDGVIENMHLHWKHRELVKL++K K +A Sbjct: 606 YDQETITDEERAVFRRIGLRMKPYLPLGIRGVFDGVIENMHLHWKHRELVKLITKQKNLA 665 Query: 1064 FVEETARLLEYESGGILVAIERVPKGFVLIFYRGKNYRRPISLRPRNLLTKAKALKRRVA 885 FVE+TARLLEYESGGILVAIE+V K F LI+YRGKNY+RPISLRPRNLLTKAKALKR VA Sbjct: 666 FVEDTARLLEYESGGILVAIEKVSKEFALIYYRGKNYKRPISLRPRNLLTKAKALKRSVA 725 Query: 884 LQRYEALSQHIDELEKTIEQTRKDI---DDPKVVESG--AQFNNVSEFSESEDENS 732 +QR+EALS HI ELE TIEQ +++I DD ++ G Q ++ SEF++SEDE+S Sbjct: 726 MQRHEALSNHITELETTIEQMKQEIGLSDDEWSMKEGHENQLDHNSEFTQSEDEDS 
781 >ref|XP_003533366.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like [Glycine max] Length = 791 Score = 825 bits (2130), Expect = 0.0 Identities = 456/777 (58%), Positives = 551/777 (70%), Gaps = 24/777 (3%) Frame = -1 Query: 2990 FLKPFSSSLHPTT---KLPRKATQVPNFSSGIPDHSSSWLKKWPS----TSPLPPIHYKK 2832 F F+S HP + K P + + + P+ S+ WL K PS PLP Sbjct: 14 FNSSFASLNHPHSSFRKFPFRTLTFASLPTPKPNPSAPWLTKSPSPKRAVEPLPA----- 68 Query: 2831 PRTLQQESTSEAQFLDEAVKPPTSAIDRIVLRLRNXXXXXXXXXXXXXXXXXXXXXXXXX 2652 + T + + P +A+DRIVLRLRN Sbjct: 69 -----GDPTPD--------RKPQNAVDRIVLRLRNLGLPSEEEEQEQEHEEEIPATNPAP 115 Query: 2651 XE---KLGDLLKRDWVRPDKXXXXXXXXXXXXXL-PWERSAEEGAIKDDKEEGSRTKRRS 2484 +LG+LL+R+WVRPD + PWER EE + EEG KRR Sbjct: 116 VTGEERLGELLQREWVRPDAVLVGEDDDEEEEMMLPWERDEEEKEVVVVSEEGLLKKRR- 174 Query: 2483 VKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTKEVLEKIHDKWRKNELVRLKFH 2304 V+AP+LA+LT+ED R++VPKAG+T+EV+EKIH +WRK ELVRLKFH Sbjct: 175 VRAPSLADLTLEDELLRRLRREGMRVRERVSVPKAGLTEEVMEKIHKRWRKEELVRLKFH 234 Query: 2303 EDLAHDMRTGHEIVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVNREGDALFVPI 2124 E+LA DMR HEIVERRTGGLV WRSGSVM+V+RG +Y+GP S+ + ++GD FVP Sbjct: 235 EELAKDMRKAHEIVERRTGGLVTWRSGSVMMVYRGIDYQGP-DSRKELNEKKGDGFFVPD 293 Query: 2123 VSSSGNLLEKNSSSSDNIDETRAPVVPNRT--ESMTAEEAEFNSLLDGLGPRFEEWWGTG 1950 VS ++ S++ + E VV R E+M+ EAE+N+LLDGLGPRF WWGTG Sbjct: 294 VS------KREDSTATSTSEKSEVVVREREHPENMSEAEAEYNALLDGLGPRFFGWWGTG 347 Query: 1949 ILPVDADLLPQKVPGYKTPFRLLPTGMRPRLTNAEMTDLRKLAKSLPCHFALGRNRNHQG 1770 ILPVDADLLP+ VPGYKTPFRLLPTGMR RLTNAEMT+LRKLAKSLPCHFA+GRNRNHQG Sbjct: 348 ILPVDADLLPRTVPGYKTPFRLLPTGMRSRLTNAEMTNLRKLAKSLPCHFAVGRNRNHQG 407 Query: 1769 LAAAIIKLWEKSLVVKIAVKRGIQNTNNKLMAEELKTLTGGVLLLRNKYYIVMYRGKDFL 1590 LA AI+KLWEKSLV KIAVKRGIQNTNN+LMAEELK LTGG LLLRNKY+IV+YRGKDF+ Sbjct: 408 LACAILKLWEKSLVSKIAVKRGIQNTNNELMAEELKMLTGGTLLLRNKYFIVIYRGKDFV 467 Query: 1589 PPTVAAALTERQEMTKQTQDAEEKVRGVPIEPVATIGEGEALA--GTLAEFYEAQARWGR 1416 P +VAA L ER+E+TKQ QD E+KVR ++ + + G+GEA A GTLAEFYEAQARWGR 
Sbjct: 468 PTSVAAVLAEREELTKQVQDVEDKVRCRAVDAIPS-GQGEATAQAGTLAEFYEAQARWGR 526 Query: 1415 EISVEEREKMKEEASRAKTARVVKRLEHKLAISQTKKLKAEKLLDKIISSWVPVGPDDDQ 1236 EIS +EREKM EEA++AKTA++V+++EHK+ I+QTKKL+AEKLL KI +S VP GPD DQ Sbjct: 527 EISPDEREKMMEEAAKAKTAKLVRQIEHKIFIAQTKKLRAEKLLAKIEASMVPAGPDYDQ 586 Query: 1235 ETITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKDKEIAFVE 1056 ETITDEER MFR+VGLRMK YLPLGIRGVFDGV+ENMHLHWKHRELVKL++K K +AFVE Sbjct: 587 ETITDEERVMFRKVGLRMKPYLPLGIRGVFDGVVENMHLHWKHRELVKLMTKQKTLAFVE 646 Query: 1055 ETARLLEYESGGILVAIERVPKGFVLIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQR 876 +TARLLEYESGGILVAIE+V K F LI+YRGKNY+RPI+LRPRNLLTK KALKR VA+QR Sbjct: 647 DTARLLEYESGGILVAIEKVSKEFALIYYRGKNYKRPITLRPRNLLTKGKALKRHVAMQR 706 Query: 875 YEALSQHIDELEKTIEQTRKDI---------DDPKVVESGAQFNNVSEFSESEDENS 732 +EALSQHI ELEKTIEQ +K++ D + E ++SE + SEDE+S Sbjct: 707 HEALSQHITELEKTIEQMKKELGMTQDSDVEDGGSIEEDDHNQIDISELALSEDEDS 763 >ref|XP_006406610.1| hypothetical protein EUTSA_v10020034mg [Eutrema salsugineum] gi|557107756|gb|ESQ48063.1| hypothetical protein EUTSA_v10020034mg [Eutrema salsugineum] Length = 874 Score = 821 bits (2121), Expect = 0.0 Identities = 440/765 (57%), Positives = 538/765 (70%), Gaps = 49/765 (6%) Frame = -1 Query: 2885 WLKKWPSTSPLPPIHYKKP----------RTLQQESTSEAQFLDEAVKPPTSAIDRIVLR 2736 W+ KWP +S H K R+ ++E+ ++ ++L++ SAI+RIVLR Sbjct: 82 WIDKWPPSSAGAGDHSGKKVAEQNGGGKIRSAEEEAEAKRRYLEK--DKGHSAIERIVLR 139 Query: 2735 LRNXXXXXXXXXXXXXXXXXXXXXXXXXXE----KLGDLLKRDWVRPDKXXXXXXXXXXX 2568 LRN +LGDLLKR+WVRPD Sbjct: 140 LRNLGLASDDEDDVEDNEGDGINGGDVKPVTGEERLGDLLKREWVRPDMMLAEGEEESDE 199 Query: 2567 XXL---PWERSAEEGAIKDDKEEGSRTKRRSVKAPTLAELTIEDXXXXXXXXXXXXXXXR 2397 PWE++ EE A + + +G+ K+R +AP+LAELT+ED R Sbjct: 200 DDDVLLPWEKNEEEQAAERMEGDGAAVKKRRARAPSLAELTVEDSELRRLRRDGMYLRVR 259 Query: 2396 INVPKAGVTKEVLEKIHDKWRKNELVRLKFHEDLAHDMRTGHEIVERRTGGLVIWRSGSV 2217 I++PKAG+T+ V+EKIHD WRK ELVRLKFHE LA DMRT HEIVERRTGG+VIWR+GSV Sbjct: 260 ISIPKAGLTQAVMEKIHDTWRKEELVRLKFHEVLARDMRTAHEIVERRTGGMVIWRAGSV 319 Query: 
2216 MVVFRGTNYEGPLSSQAQSVNREGDALFVPIVSSSGNLLEKNSSSSDNIDETRAPVVPN- 2040 MVV+RG +Y+GP S + + R + LFVP VSS+G+ + + E + P+V N Sbjct: 320 MVVYRGRDYQGP-SMISNQMARPEETLFVPDVSSAGDEATGSKDNQSAPPEIKDPIVRNP 378 Query: 2039 -RTESMTAEEAEFNSLLDGLGPRFEEWWGTGILPVDADLLPQKVPGYKTPFRLLPTGMRP 1863 R E+MT EEAEFNSLLD LGPRF EWWGTG+LPV+ADLLP +PGYKTPFRLLPTGMR Sbjct: 379 IRKETMTEEEAEFNSLLDSLGPRFHEWWGTGVLPVNADLLPPTIPGYKTPFRLLPTGMRS 438 Query: 1862 RLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIIKLWEKSLVVKIAVKRGIQNTNNK 1683 LTNAEMT+LRK+ K+LPCHFALGRNRNHQGLAAAI+KLWEKSL+ KIAVKRGIQNTNNK Sbjct: 439 NLTNAEMTNLRKIGKTLPCHFALGRNRNHQGLAAAILKLWEKSLIAKIAVKRGIQNTNNK 498 Query: 1682 LMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVAAALTERQEMTKQTQDAEEKVRGVP 1503 LMA+E+KTLTGGVLLLRNKYYIV+YRGKDFLP +VAA L ERQE+TK+ QD EE+VR Sbjct: 499 LMADEIKTLTGGVLLLRNKYYIVIYRGKDFLPSSVAATLAERQELTKEIQDVEERVRTRD 558 Query: 1502 IEPVATIGEG------------------------------EALAGTLAEFYEAQARWGRE 1413 IE +G+ A AGTLAEFYEAQARWG+E Sbjct: 559 IETSQPVGDTVPAEAGTLADIEERVNNRDIEASQPVGDKVPAEAGTLAEFYEAQARWGKE 618 Query: 1412 ISVEEREKMKEEASRAKTARVVKRLEHKLAISQTKKLKAEKLLDKIISSWVPVGPDDDQE 1233 I+ + REKM EEASR +ARVVKR++HKL ++Q+K +AEKLL KI +S +P GPD DQE Sbjct: 619 ITPDHREKMIEEASRVASARVVKRIQHKLNLAQSKFHRAEKLLSKIEASMIPNGPDYDQE 678 Query: 1232 TITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKDKEIAFVEE 1053 I++EER MFR+VGL+MK+YLPLGIRGVFDGVIENMHLHWKHRELVKL+SK K +AFVE+ Sbjct: 679 VISEEERIMFRKVGLKMKSYLPLGIRGVFDGVIENMHLHWKHRELVKLISKQKSLAFVED 738 Query: 1052 TARLLEYESGGILVAIERVPKGFVLIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRY 873 TARLLEYESGG+LVAIE+VPKGF LI+YRGKNY+RPISLRPRNLLTKAKALKR +A+QR+ Sbjct: 739 TARLLEYESGGVLVAIEKVPKGFALIYYRGKNYQRPISLRPRNLLTKAKALKRSIAMQRH 798 Query: 872 EALSQHIDELEKTIEQTRKDIDDPKVVESGAQFNNVSEFSESEDE 738 EALSQHI ELEKTIEQ + ++ S +++ N + + E+E Sbjct: 799 EALSQHISELEKTIEQMQNELTAKNPSYSESEWENEDDDDDEEEE 843 >ref|XP_002883129.1| EMB1865 [Arabidopsis lyrata subsp. lyrata] gi|297328969|gb|EFH59388.1| EMB1865 [Arabidopsis lyrata subsp. 
lyrata] Length = 846 Score = 820 bits (2119), Expect = 0.0 Identities = 437/739 (59%), Positives = 537/739 (72%), Gaps = 22/739 (2%) Frame = -1 Query: 2885 WLKKWP-STSPLPPIHYKKP----------RTLQQESTSEAQFLDEAVKPPTSAIDRIVL 2739 W+ KWP S++ + H K R+ ++E+ ++ ++L+ +AI+RIVL Sbjct: 82 WIDKWPPSSAGVGGDHAGKRGGENNGGDKIRSAEEEAEAKLRYLER--DKGQNAIERIVL 139 Query: 2738 RLRNXXXXXXXXXXXXXXXXXXXXXXXXXXE----KLGDLLKRDWVRPDKXXXXXXXXXX 2571 RLRN +LGDLLKR+WVRPD Sbjct: 140 RLRNLGLGSDDEEDVEDEEGGGINGGDVKPVTGEERLGDLLKREWVRPDMMLAEGEESEE 199 Query: 2570 XXXL--PWERSAEEGAIKDDKEEGSRT--KRRSVKAPTLAELTIEDXXXXXXXXXXXXXX 2403 + PWE++ EE A + + EG K+ +AP+LAELT+ED Sbjct: 200 EDEVLLPWEKNEEEQAAERVEGEGGVAVMKKGRARAPSLAELTVEDSELRRLRRDGMYLR 259 Query: 2402 XRINVPKAGVTKEVLEKIHDKWRKNELVRLKFHEDLAHDMRTGHEIVERRTGGLVIWRSG 2223 RIN+PKAG+T+ V+EKI+D WRK ELVRLKFHE LA DM+T HEIVERRTGG+VIWR+G Sbjct: 260 VRINIPKAGLTQAVMEKIYDTWRKEELVRLKFHEVLARDMKTAHEIVERRTGGMVIWRAG 319 Query: 2222 SVMVVFRGTNYEGPLSSQAQSVNREGDALFVPIVSSSGNLLEKNSSSSDNIDETRAPVVP 2043 SVMVV+RG +Y+GP Q + + LFVP VSS+G+ + E + P++ Sbjct: 320 SVMVVYRGLDYKGPPVISNQMAGPK-ETLFVPDVSSAGDEATNAKDNQSPPSEIKDPIIK 378 Query: 2042 N--RTESMTAEEAEFNSLLDGLGPRFEEWWGTGILPVDADLLPQKVPGYKTPFRLLPTGM 1869 N R E+MT EEAEFNSLLD LGPRF+EWWGTG+LPVDADLLP +PGYKTPFRLLPTGM Sbjct: 379 NPIRKENMTEEEAEFNSLLDSLGPRFQEWWGTGVLPVDADLLPPTIPGYKTPFRLLPTGM 438 Query: 1868 RPRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIIKLWEKSLVVKIAVKRGIQNTN 1689 R LTNAEMT+LRK+ K+LPCHFALGRNRNHQGLAAAI+++WEKSL+ KIAVKRGIQNTN Sbjct: 439 RSNLTNAEMTNLRKIGKTLPCHFALGRNRNHQGLAAAILQIWEKSLIAKIAVKRGIQNTN 498 Query: 1688 NKLMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVAAALTERQEMTKQTQDAEEKVRG 1509 NKLMA+E+K LTGGVLLLRNKYYIV+YRGKDFLP +VAA L ERQE+TK+ QD EE+VR Sbjct: 499 NKLMADEVKALTGGVLLLRNKYYIVIYRGKDFLPSSVAATLAERQELTKEIQDVEERVRN 558 Query: 1508 VPIEPVATIGEG-EALAGTLAEFYEAQARWGREISVEEREKMKEEASRAKTARVVKRLEH 1332 IE V +G+ A AGTLAEFYEAQARWG+EI+ + REKM EEASR ARVVKR++H Sbjct: 559 REIEAVQPVGDKVPAEAGTLAEFYEAQARWGKEITPDHREKMIEEASRVANARVVKRIQH 618 Query: 1331 
KLAISQTKKLKAEKLLDKIISSWVPVGPDDDQETITDEERAMFRRVGLRMKAYLPLGIRG 1152 KL ++Q+K +AEKLL KI +S +P GPD DQE I++EERAMFR+VGL+MKAYLPLGIRG Sbjct: 619 KLNLAQSKFQRAEKLLSKIEASMIPNGPDYDQEVISEEERAMFRKVGLKMKAYLPLGIRG 678 Query: 1151 VFDGVIENMHLHWKHRELVKLLSKDKEIAFVEETARLLEYESGGILVAIERVPKGFVLIF 972 VFDGVIENMHLHWKHRELVKL+SK K +AFVE+TARLLEYESGG+LVAIE+VPKGF LI+ Sbjct: 679 VFDGVIENMHLHWKHRELVKLISKQKNLAFVEDTARLLEYESGGVLVAIEKVPKGFALIY 738 Query: 971 YRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALSQHIDELEKTIEQTRKDIDDPKVV 792 YRGKNYRRPISLRPRNLLTKAKALKR +A+QR+EALSQHI ELE+TIEQ + ++ Sbjct: 739 YRGKNYRRPISLRPRNLLTKAKALKRSIAMQRHEALSQHISELERTIEQMQSELTSKTPS 798 Query: 791 ESGAQFNNVSEFSESEDEN 735 S +++ N + E E+++ Sbjct: 799 YSESEWENDEDDDEEEEKD 817 >ref|NP_188468.1| CRS1 / YhbY domain-containing protein EMB1865 [Arabidopsis thaliana] gi|11994102|dbj|BAB01105.1| unnamed protein product [Arabidopsis thaliana] gi|17380904|gb|AAL36264.1| unknown protein [Arabidopsis thaliana] gi|332642570|gb|AEE76091.1| CRS1 / YhbY (CRM) domain-containing protein [Arabidopsis thaliana] Length = 848 Score = 816 bits (2108), Expect = 0.0 Identities = 445/783 (56%), Positives = 549/783 (70%), Gaps = 22/783 (2%) Frame = -1 Query: 3020 PSRKSSFSVFFLKPFSSSLHPTTKLPRKATQVPNFSSGIPDHSSSWLKKWP-STSPLPPI 2844 PSR+ ++PFSS L + + ++ + W+ KWP S+S Sbjct: 43 PSRQQ-----IVRPFSS-LRTSERSNNRSNNNRRLDQRNHKPTPPWIDKWPPSSSGAGGD 96 Query: 2843 HYKKP----------RTLQQESTSEAQFLDEAVKPPTSAIDRIVLRLRNXXXXXXXXXXX 2694 H K R+ ++E+ ++ ++L++ +AI+RIVLRLRN Sbjct: 97 HAGKKGGENNGGDRIRSAEEEAEAKLRYLEK--DKGQNAIERIVLRLRNLGLGSDDEDDV 154 Query: 2693 XXXXXXXXXXXXXXXE----KLGDLLKRDWVRPDKXXXXXXXXXXXXXL--PWERSAEEG 2532 +LGDLLKR+WVRPD + PWE++ EE Sbjct: 155 EDDEGGGINGGDVKPVTGEERLGDLLKREWVRPDMMLAEGEESEEEDEVLLPWEKNEEEQ 214 Query: 2531 AIKDDKEEGSRT--KRRSVKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTKEVL 2358 A + EG ++R +AP+LAELT+ED RIN+PKAG+T+ V+ Sbjct: 215 AAERVVGEGGVAVMQKRRARAPSLAELTVEDSELRRLRRDGMYLRVRINIPKAGLTQAVM 274 Query: 2357 EKIHDKWRKNELVRLKFHEDLAHDMRTGHEIVERRTGGLVIWRSGSVMVVFRGTNYEGPL 2178 EKI+D 
WRK ELVRLKFHE LA DM+T HEIVERRTGG+VIWR+GSVMVV+RG +Y+GP Sbjct: 275 EKIYDTWRKEELVRLKFHEVLARDMKTAHEIVERRTGGMVIWRAGSVMVVYRGLDYKGPP 334 Query: 2177 SSQAQSVNREGDALFVPIVSSSGNLLEKNSSSSDNIDETRAPVVPN--RTESMTAEEAEF 2004 Q + + LFVP VSS+G+ + + P++ N R E+MT EE EF Sbjct: 335 VISNQMAGPK-ETLFVPDVSSAGDEATNAKDNQSAPLVIKDPIIKNPIRKENMTEEEVEF 393 Query: 2003 NSLLDGLGPRFEEWWGTGILPVDADLLPQKVPGYKTPFRLLPTGMRPRLTNAEMTDLRKL 1824 NSLLD LGPRF+EWWGTG+LPVDADLLP +PGYKTPFRLLPTGMR LTNAEMT+LRK+ Sbjct: 394 NSLLDSLGPRFQEWWGTGVLPVDADLLPPTIPGYKTPFRLLPTGMRSNLTNAEMTNLRKI 453 Query: 1823 AKSLPCHFALGRNRNHQGLAAAIIKLWEKSLVVKIAVKRGIQNTNNKLMAEELKTLTGGV 1644 K+LPCHFALGRNRNHQGLAAAI+++WEKSL+ KIAVKRGIQNTNNKLMA+E+KTLTGGV Sbjct: 454 GKTLPCHFALGRNRNHQGLAAAILQIWEKSLIAKIAVKRGIQNTNNKLMADEVKTLTGGV 513 Query: 1643 LLLRNKYYIVMYRGKDFLPPTVAAALTERQEMTKQTQDAEEKVRGVPIEPVATIGEG-EA 1467 LLLRNKYYIV+YRGKDFLP +VAA L ERQE+TK+ QD EE+VR IE V +G+ A Sbjct: 514 LLLRNKYYIVIYRGKDFLPSSVAATLAERQELTKEIQDVEERVRNREIEAVQPVGDKVPA 573 Query: 1466 LAGTLAEFYEAQARWGREISVEEREKMKEEASRAKTARVVKRLEHKLAISQTKKLKAEKL 1287 AGTLAEFYEAQARWG+EI+ + REKM EEASR ARVVKR++HKL ++Q+K +AEKL Sbjct: 574 EAGTLAEFYEAQARWGKEITPDHREKMIEEASRVANARVVKRIQHKLNLAQSKFQRAEKL 633 Query: 1286 LDKIISSWVPVGPDDDQETITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKH 1107 L KI +S +P GPD DQE I++EERAMFR+VGL+MKAYLP+GIRGVFDGVIENMHLHWKH Sbjct: 634 LSKIEASMIPNGPDYDQEVISEEERAMFRKVGLKMKAYLPIGIRGVFDGVIENMHLHWKH 693 Query: 1106 RELVKLLSKDKEIAFVEETARLLEYESGGILVAIERVPKGFVLIFYRGKNYRRPISLRPR 927 RELVKL+SK K AFVEETARLLEYESGG+LVAIE+VPKGF LI+YRGKNYRRPISLRPR Sbjct: 694 RELVKLISKQKNQAFVEETARLLEYESGGVLVAIEKVPKGFALIYYRGKNYRRPISLRPR 753 Query: 926 NLLTKAKALKRRVALQRYEALSQHIDELEKTIEQTRKDIDDPKVVESGAQFNNVSEFSES 747 NLLTKAKALKR +A+QR+EALSQHI ELE+TIEQ + + S +++ N + + Sbjct: 754 NLLTKAKALKRSIAMQRHEALSQHISELERTIEQMQSQLTSKNPSYSESEWENDEDDDDD 813 Query: 746 EDE 738 E+E Sbjct: 814 EEE 816