BLASTX nr result

ID: Rauwolfia21_contig00001100 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00001100
         (2745 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006341605.1| PREDICTED: chloroplastic group IIA intron sp...   886   0.0  
ref|XP_004235759.1| PREDICTED: chloroplastic group IIA intron sp...   881   0.0  
emb|CBI15459.3| unnamed protein product [Vitis vinifera]              874   0.0  
gb|EOY21034.1| CRS1 / YhbY domain-containing protein [Theobroma ...   872   0.0  
emb|CAN72582.1| hypothetical protein VITISV_035294 [Vitis vinifera]   868   0.0  
gb|EMJ12507.1| hypothetical protein PRUPE_ppa001468mg [Prunus pe...   864   0.0  
gb|EXB56911.1| Chloroplastic group IIA intron splicing facilitat...   858   0.0  
ref|XP_002317913.2| hypothetical protein POPTR_0012s05260g [Popu...   853   0.0  
ref|XP_004300521.1| PREDICTED: chloroplastic group IIA intron sp...   852   0.0  
ref|XP_006440978.1| hypothetical protein CICLE_v10018859mg [Citr...   850   0.0  
ref|XP_006440981.1| hypothetical protein CICLE_v10018859mg [Citr...   843   0.0  
ref|XP_006485796.1| PREDICTED: chloroplastic group IIA intron sp...   840   0.0  
gb|EPS74467.1| hypothetical protein M569_00278, partial [Genlise...   820   0.0  
ref|XP_006406610.1| hypothetical protein EUTSA_v10020034mg [Eutr...   813   0.0  
ref|XP_002883129.1| EMB1865 [Arabidopsis lyrata subsp. lyrata] g...   808   0.0  
ref|NP_188468.1| CRS1 / YhbY domain-containing protein EMB1865 [...   807   0.0  
ref|XP_002532154.1| conserved hypothetical protein [Ricinus comm...   805   0.0  
ref|XP_006296939.1| hypothetical protein CARUB_v10012930mg, part...   800   0.0  
ref|XP_003533366.1| PREDICTED: chloroplastic group IIA intron sp...   800   0.0  
ref|XP_003530015.2| PREDICTED: chloroplastic group IIA intron sp...   798   0.0  

>ref|XP_006341605.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Solanum tuberosum]
          Length = 824

 Score =  886 bits (2290), Expect = 0.0
 Identities = 486/762 (63%), Positives = 560/762 (73%), Gaps = 21/762 (2%)
 Frame = -1

Query: 2457 KTPRNPIQTPNINGATP-----RHSSSWLNKWPCATPLPPLHYKNPRTLQEESTSENQFL 2293
            K  R P +  N + +TP       SS+WLNKWP  +P P  H  N RT+  ES +E ++ 
Sbjct: 48   KDNRKPYRDSN-SSSTPVKSNNSRSSTWLNKWPNTSP-PVKHSSNSRTV--ESKTETRYF 103

Query: 2292 DEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEA------VQDAKKVTGEE-KLGDLL 2134
            DE  R GT+A+DRIVLRLRN              E            +V GEE KLGDLL
Sbjct: 104  DENTRVGTTAIDRIVLRLRNLGLGSDDEGEGEDEEEGNLKLDSSSTMQVNGEEEKLGDLL 163

Query: 2133 KRDWVRPDRILVEDEEDSASMSLPWXXXXXXXXXXXXXXXXXXXXXXSIKAPTLAELTIE 1954
            KRDWVRPD IL E+ +D     LPW                       +KAP+LAELTIE
Sbjct: 164  KRDWVRPDMIL-EESDDEGDTYLPWERSVEEEAVEVQRGGKRT-----VKAPSLAELTIE 217

Query: 1953 DXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSELVRLKFHEDLAHDMKTAHE 1774
            D               RINVPKAGVT  V +KIH  WRK+ELVRLKFHE LAHDM+T HE
Sbjct: 218  DEELRRLRRMGMTLRERINVPKAGVTGAVLEKIHHSWRKNELVRLKFHEVLAHDMRTGHE 277

Query: 1773 VVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVKREGDTLFVPVVSSPGNLLAKSS 1594
            +VERRT GLVIWR+GSVMVV+RG+NYEGP SS++QSV  E + LFVP VSS  ++   + 
Sbjct: 278  IVERRTRGLVIWRAGSVMVVYRGSNYEGP-SSRSQSVNEEDNALFVPDVSSDKSITKDNK 336

Query: 1593 SDDSCIDEKSRPVVVPTCAESMTQEEAVFNSLLDGLGPRFEGWWGTGILPVDADLLPQKV 1414
            S +  I+ +++  V P   +SMT EE+ FN +LDGLGPRFE WWGTG+LPVDADLLPQ +
Sbjct: 337  SFNPVIENRNQ--VHPNSVQSMTVEESEFNRVLDGLGPRFEDWWGTGVLPVDADLLPQTI 394

Query: 1413 PGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIFKLWEKSL 1234
            PGYKTPFRLLP GMRSRLTNAEMT+LRK+AKSLPCHFALGRNRNHQGLAAAI KLWEKSL
Sbjct: 395  PGYKTPFRLLPTGMRSRLTNAEMTNLRKIAKSLPCHFALGRNRNHQGLAAAIVKLWEKSL 454

Query: 1233 VVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVATALAERQE 1054
            VVKIAVKRGIQNTNNK+M+EELK LTGGVLLLRNKYYI+ YRGKDF+PPTVA  LAERQE
Sbjct: 455  VVKIAVKRGIQNTNNKLMSEELKMLTGGVLLLRNKYYIIFYRGKDFVPPTVAAVLAERQE 514

Query: 1053 MTKQTQDEEEKMRGGPIEPVSTTEEDQALAGTLGEFYEAQARWGREISLDEREKMIEEAS 874
            +TKQ QD EE+ R GP +    T + QA+AG+L EFYEAQARWGREIS +ERE+M++EA+
Sbjct: 515  LTKQIQDVEEQTRSGPAKVAPLTTDGQAVAGSLAEFYEAQARWGREISAEERERMLKEAA 574

Query: 873  RGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWVPVGPDDDQETITDEERAMFRRVG 694
              KT RVVKRLEHK  ISQ           KI+ SW+P GP DD ETIT+EER M RRVG
Sbjct: 575  MAKTARVVKRLEHKFEISQTKKLKAEKILAKIVESWIPAGPSDDLETITEEERVMLRRVG 634

Query: 693  LRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKEKEIAFVEETARLLEYESGGILV 514
            LRMK+YLPLGIRGVFDGVIENMHLHWKHRELVKL+SKEK +AFVEETARLLEYESGGILV
Sbjct: 635  LRMKSYLPLGIRGVFDGVIENMHLHWKHRELVKLISKEKVLAFVEETARLLEYESGGILV 694

Query: 513  AIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRHEALSQHISELEKTI 334
            AIE VPKG+ LIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQR+EALSQHI+ELE TI
Sbjct: 695  AIERVPKGYALIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALSQHIAELETTI 754

Query: 333  QQTRKEI------DVKNGD-PAL--FNNVSEFTESEDENSPM 235
            +QT+ +I      D+   +  AL  FN+VSE + SEDE+S +
Sbjct: 755  EQTKSKIVDFGKADINTSNLEALDQFNHVSE-SLSEDEDSSL 795


>ref|XP_004235759.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Solanum lycopersicum]
          Length = 820

 Score =  881 bits (2277), Expect = 0.0
 Identities = 483/759 (63%), Positives = 556/759 (73%), Gaps = 18/759 (2%)
 Frame = -1

Query: 2457 KTPRNPIQTPNINGATP-----RHSSSWLNKWPCATPLPPLHYKNPRTLQEESTSENQFL 2293
            K  R P +  N + +TP       SS+WLNKWP  T  P  H  N RT+  ES +E ++ 
Sbjct: 48   KDNRKPYRDSN-SSSTPVKSNNSRSSTWLNKWP-NTSSPVKHSSNSRTV--ESKTETRYF 103

Query: 2292 DEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEA------VQDAKKVTGEE-KLGDLL 2134
            DE  R GT+A+DRIVLRLRN              E            +V GEE KLGDLL
Sbjct: 104  DENTRVGTTAIDRIVLRLRNLGLGSDDEGEGEDEEEGNLKLDSSSTMQVNGEEEKLGDLL 163

Query: 2133 KRDWVRPDRILVEDEEDSASMSLPWXXXXXXXXXXXXXXXXXXXXXXSIKAPTLAELTIE 1954
            KRDWVRPD IL E+ +D     LPW                       ++AP+LAELTIE
Sbjct: 164  KRDWVRPDMIL-EESDDEGDTYLPWERSVEEEAVEVQRGGKRT-----VRAPSLAELTIE 217

Query: 1953 DXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSELVRLKFHEDLAHDMKTAHE 1774
            D               RINVPKAGVT  V +KIH  WRK+ELVRLKFHE LAHDM+T HE
Sbjct: 218  DEELRRLRRIGMTLRERINVPKAGVTGAVLEKIHHSWRKNELVRLKFHEVLAHDMRTGHE 277

Query: 1773 VVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVKREGDTLFVPVVSSPGNLLAKSS 1594
            +VERRT GLVIWR+GSVMVV+RG+NYEGP SS++QSV  E + LFVP VSS  ++   + 
Sbjct: 278  IVERRTKGLVIWRAGSVMVVYRGSNYEGP-SSRSQSVNEEDNALFVPDVSSDKSITKDNK 336

Query: 1593 SDDSCIDEKSRPVVVPTCAESMTQEEAVFNSLLDGLGPRFEGWWGTGILPVDADLLPQKV 1414
            S +  I+ +++  V P   +SMT+EE+ FN +LDGLGPRFE WWGTG+LPVDADLLPQ +
Sbjct: 337  SFNPVIENRNQ--VHPNRVQSMTEEESEFNRVLDGLGPRFEDWWGTGVLPVDADLLPQTI 394

Query: 1413 PGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIFKLWEKSL 1234
            PGYKTPFRLLP GMRSRLTNAEMT+LRK+AKSLPCHFALGRNRNHQGLAAAI KLWEKSL
Sbjct: 395  PGYKTPFRLLPTGMRSRLTNAEMTNLRKIAKSLPCHFALGRNRNHQGLAAAIVKLWEKSL 454

Query: 1233 VVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVATALAERQE 1054
            VVKIAVKRGIQNTNNK+M+EELK LTGGVLLLRNKYYI+ YRGKDF+PPTVA  LAERQE
Sbjct: 455  VVKIAVKRGIQNTNNKLMSEELKMLTGGVLLLRNKYYIIFYRGKDFVPPTVAAVLAERQE 514

Query: 1053 MTKQTQDEEEKMRGGPIEPVSTTEEDQALAGTLGEFYEAQARWGREISLDEREKMIEEAS 874
            +TKQ QD EE+ R GP +      + QA+AG+L EFYEAQARWGREIS +ERE+M++EA+
Sbjct: 515  LTKQIQDVEEQTRSGPAKVAPLITDGQAVAGSLAEFYEAQARWGREISAEERERMLKEAA 574

Query: 873  RGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWVPVGPDDDQETITDEERAMFRRVG 694
              K  RVVKRLEHK  ISQ           KI+ SW+P GP DD ETIT+EER M RRVG
Sbjct: 575  MAKMARVVKRLEHKFEISQTKKLKAEKILAKIVESWIPAGPSDDLETITEEERVMLRRVG 634

Query: 693  LRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKEKEIAFVEETARLLEYESGGILV 514
            LRMK+YLPLGIRGVFDGVIENMHLHWKHRELVKL+SKEK +AFVEETARLLEYESGGILV
Sbjct: 635  LRMKSYLPLGIRGVFDGVIENMHLHWKHRELVKLISKEKVLAFVEETARLLEYESGGILV 694

Query: 513  AIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRHEALSQHISELEKTI 334
            AIE VPKG+ LIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQR+EALSQHI ELE TI
Sbjct: 695  AIERVPKGYALIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRYEALSQHIGELETTI 754

Query: 333  QQTRKEIDVKNGDPA------LFNNVSEFTESEDENSPM 235
            +QT+ +I V  GD +       FN+VSE + SEDE+S +
Sbjct: 755  EQTKSKI-VDFGDTSNLEVLDQFNHVSE-SLSEDEDSSL 791


>emb|CBI15459.3| unnamed protein product [Vitis vinifera]
          Length = 830

 Score =  874 bits (2257), Expect = 0.0
 Identities = 486/781 (62%), Positives = 558/781 (71%), Gaps = 23/781 (2%)
 Frame = -1

Query: 2493 LKPFSSSLRPTT-------KTPRNPIQTPNING-----ATPRHSS-SWLNKWPCATPLPP 2353
            LKPFSS LR T        KT R+     + N        P  S+ SW+NKWP   P   
Sbjct: 32   LKPFSS-LRTTDSNNLRNRKTKRSLYPWDHQNSRKSSNTNPNSSTKSWINKWPSPNPSIE 90

Query: 2352 LHYKNPRTLQEESTSENQFLDEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEAVQDA 2173
              +K   +   + T E+++ D   R GTSA++RIVLRLRN                  D 
Sbjct: 91   SEHKGIDSKGRDGT-ESRYFDG--RSGTSAIERIVLRLRNLGLGSDDEDKNEGEVESGDT 147

Query: 2172 KKVTGEEKLGDLLKRDWVRPDRILVEDEEDSASMSLPWXXXXXXXXXXXXXXXXXXXXXX 1993
              VTG+EKLGDLL+RDWVRPD +L+EDE++   M LPW                      
Sbjct: 148  MPVTGDEKLGDLLQRDWVRPDSMLIEDEDED-DMILPWERGEERQEEEGDGRLKRRA--- 203

Query: 1992 SIKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSELVRLKF 1813
             ++APTLAELTIED               RINVPKAG+T  V  KIH+KWRK ELVRLKF
Sbjct: 204  -VRAPTLAELTIEDEELRRLRRLGMTIRERINVPKAGITQAVLGKIHEKWRKEELVRLKF 262

Query: 1812 HEDLAHDMKTAHEVVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVKREGDTLFVP 1633
            HE LAHDMKTAHE+VERRTGGLV WRSGSVMVVFRGTNYEGP   + Q V  EGD+LFVP
Sbjct: 263  HEALAHDMKTAHEIVERRTGGLVTWRSGSVMVVFRGTNYEGP--PKPQPVDGEGDSLFVP 320

Query: 1632 VVSSPGNLLAKSSSDDSCIDEK-SRPVVVPTCAESMTQEEAVFNSLLDGLGPRFEGWWGT 1456
             VSS  N   ++ ++     EK S PV  P  AE+MT+EEA +NSLLDGLGPRF  WWGT
Sbjct: 321  DVSSVDNPAMRNDNNGGPTLEKGSLPVRNPVHAENMTEEEAEYNSLLDGLGPRFVDWWGT 380

Query: 1455 GILPVDADLLPQKVPGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFALGRNRNHQ 1276
            G+LPVD DLLPQ +PGYKTP R+LP GMR RLTNAEMT+LRKLAKSLPCHFALGRNRNHQ
Sbjct: 381  GVLPVDGDLLPQSIPGYKTPLRILPTGMRPRLTNAEMTNLRKLAKSLPCHFALGRNRNHQ 440

Query: 1275 GLAAAIFKLWEKSLVVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYIVMYRGKDF 1096
            GLAAAI KLWEKS+VVKIAVK GIQNTNNK+MAEE+K LTGGVLLLRNKYYIV+YRGKDF
Sbjct: 441  GLAAAIIKLWEKSIVVKIAVKPGIQNTNNKLMAEEIKNLTGGVLLLRNKYYIVIYRGKDF 500

Query: 1095 LPPTVATALAERQEMTKQTQDEEEKMRGGPIEPVSTTEED--QALAGTLGEFYEAQARWG 922
            LP +VA AL+ER+E+TK  Q  EEK+R G  E + + E+   Q LAGTL EFYEAQARWG
Sbjct: 501  LPTSVAAALSEREELTKHIQVVEEKVRTGGAEAIPSGEDGVGQPLAGTLAEFYEAQARWG 560

Query: 921  REISLDEREKMIEEASRGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWVPVGPDDD 742
            REIS +E EKMIEEASR K+ RVVKR+EHKLA++Q           KI +S +P GP DD
Sbjct: 561  REISAEEHEKMIEEASRAKSARVVKRIEHKLALAQAKKLRAERLLAKIEASMIPAGPSDD 620

Query: 741  QETITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKEKEIAFV 562
            QETITDEER MFRR+GLRMKAYL LG+RGVFDGVIENMHLHWKHRELVKL+SK+K +AFV
Sbjct: 621  QETITDEERFMFRRLGLRMKAYLLLGVRGVFDGVIENMHLHWKHRELVKLISKQKTLAFV 680

Query: 561  EETARLLEYESGGILVAIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQ 382
            E+TARLLEYESGGILVAIE VPKG+ LI+YRGKNYRRP+SLRPRNLLTKAKALKR VA+Q
Sbjct: 681  EDTARLLEYESGGILVAIERVPKGYALIYYRGKNYRRPVSLRPRNLLTKAKALKRSVAMQ 740

Query: 381  RHEALSQHISELEKTIQQTRKEI----DVKNGDPALFNNVSEF---TESEDENSPMSWSI 223
            RHEALSQHISELE+TI+Q + EI    D ++ D        +F   +ESEDE S M    
Sbjct: 741  RHEALSQHISELERTIEQMKMEIGDSKDAEDKDSWSTEGHGQFDQVSESEDEASGMDSDA 800

Query: 222  D 220
            D
Sbjct: 801  D 801


>gb|EOY21034.1| CRS1 / YhbY domain-containing protein [Theobroma cacao]
          Length = 919

 Score =  872 bits (2254), Expect = 0.0
 Identities = 481/771 (62%), Positives = 563/771 (73%), Gaps = 16/771 (2%)
 Frame = -1

Query: 2496 FLKPFSSSLR----PTTKTPRNPIQTPNINGATPRHSSSWLNKWPCATPLPPLHYKNPRT 2329
            F +PFSS LR    P++K  R            P  S+S  +    ++P   +   +   
Sbjct: 109  FFRPFSS-LRTGNSPSSKFNRYSYPWDQEASVPPNSSASSSSLQAWSSPSQKVIQSDG-- 165

Query: 2328 LQEESTSENQFLDEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEAVQDAKKVTGEEK 2149
              +++  E ++ D       SA++RIVLRLRN                  ++  VTGEE+
Sbjct: 166  -DDKTDVETRYFDRD--KSQSAIERIVLRLRNLGLGSDDEDEGEDETDQYNSTPVTGEER 222

Query: 2148 LGDLLKRDWVRPDRILVEDEEDSASMSLPWXXXXXXXXXXXXXXXXXXXXXXSIKAPTLA 1969
            LGDLLKR+WVRPD +L+E E++ A   LPW                       ++APTLA
Sbjct: 223  LGDLLKREWVRPDTMLIEREKEEAV--LPWERDEAEVEVVKEGVLGVKKRR--VRAPTLA 278

Query: 1968 ELTIEDXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSELVRLKFHEDLAHDM 1789
            ELTIED               RINVPKAG+T  V +KIHDKWRK ELVRLKFHE LA DM
Sbjct: 279  ELTIEDEELRRLRRMGMYLRERINVPKAGITQAVLEKIHDKWRKEELVRLKFHEVLATDM 338

Query: 1788 KTAHEVVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVKREGDTLFVPVVSSPGNL 1609
            KTAHE+VERRTGGLV+WRSGSVMVV+RG+NYEGP  S++QS+ REG+ LF+P VSS  N 
Sbjct: 339  KTAHEIVERRTGGLVLWRSGSVMVVYRGSNYEGP--SRSQSIDREGEALFIPDVSSASNA 396

Query: 1608 LAKSSSDDSCIDEKSRPVVV-PTCAESMTQEEAVFNSLLDGLGPRFEGWWGTGILPVDAD 1432
            +  S +  +   EK  PVVV P  +ESMT+EEA +NSLLDG+GPRF  WWGTG+LPVDAD
Sbjct: 397  VRGSETGKTSTPEKCEPVVVKPERSESMTEEEAEYNSLLDGVGPRFVEWWGTGVLPVDAD 456

Query: 1431 LLPQKVPGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIFK 1252
            LLPQK+PGYKTPFRLLP GMR RLTNAEMT+LRKLAKSLPCHFALGRNRNHQGLAAAI K
Sbjct: 457  LLPQKIPGYKTPFRLLPAGMRPRLTNAEMTNLRKLAKSLPCHFALGRNRNHQGLAAAIIK 516

Query: 1251 LWEKSLVVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVATA 1072
            LWEKSLVVKIAVKRGIQNTNNK+MAEELK LTGGVLLLRNKY+IV+YRGKDFLP +VA A
Sbjct: 517  LWEKSLVVKIAVKRGIQNTNNKLMAEELKNLTGGVLLLRNKYFIVIYRGKDFLPTSVAAA 576

Query: 1071 LAERQEMTKQTQDEEEKMRGGPIEPVSTTEED-QALAGTLGEFYEAQARWGREISLDERE 895
            LAERQE+TKQ QD EEK+R   +EP  + E+  +A AGTL EFYEAQA WGREIS +ERE
Sbjct: 577  LAERQELTKQIQDVEEKVRIRAVEPAQSGEDKGEAPAGTLAEFYEAQACWGREISAEERE 636

Query: 894  KMIEEASRGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWVPVGPDDDQETITDEER 715
            KMIEEAS+ K  R+VKR+EHKLA++Q           KI SS +P  PD DQETITDEER
Sbjct: 637  KMIEEASKAKHARLVKRVEHKLAVAQAKKLRAERLLAKIESSMIPAAPDYDQETITDEER 696

Query: 714  AMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKEKEIAFVEETARLLEY 535
             MFRRVGLRMK YLPLGIRGVFDGVIENMHLHWKHRELVKL+SK+K +AFVE+TARLLE+
Sbjct: 697  VMFRRVGLRMKPYLPLGIRGVFDGVIENMHLHWKHRELVKLISKQKTLAFVEDTARLLEF 756

Query: 534  ESGGILVAIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRHEALSQHI 355
            ESGGILVAIE VPKG+ LI+YRGKNY RPISLRPRNLLTKAKALKR VA+QRHEALSQHI
Sbjct: 757  ESGGILVAIERVPKGYALIYYRGKNYHRPISLRPRNLLTKAKALKRSVAMQRHEALSQHI 816

Query: 354  SELEKTIQQTRKEI----DVK------NGDPALFNNVSEFTESEDENSPMS 232
            SELE+TI++ +KEI    DV+      +G+   F+ VSE T+SEDE S M+
Sbjct: 817  SELERTIEEMKKEIGASQDVEDEDSQVSGEHGQFDPVSELTQSEDEASYMA 867


>emb|CAN72582.1| hypothetical protein VITISV_035294 [Vitis vinifera]
          Length = 850

 Score =  868 bits (2244), Expect = 0.0
 Identities = 475/743 (63%), Positives = 543/743 (73%), Gaps = 16/743 (2%)
 Frame = -1

Query: 2493 LKPFSSSLRPTT-------KTPRNPIQTPNING-----ATPRHSS-SWLNKWPCATPLPP 2353
            LKPFSS LR T        KT R+     + N        P  S+ SW+NKWP   P   
Sbjct: 32   LKPFSS-LRTTDSNNLRNRKTKRSLYPWDHQNSRKSSNTNPNSSTKSWINKWPSPNPSIE 90

Query: 2352 LHYKNPRTLQEESTSENQFLDEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEAVQDA 2173
              +K   +   + T E+++ D   R GTSA++RIVLRLRN                  D 
Sbjct: 91   SEHKGIDSKGRDGT-ESRYFDG--RSGTSAIERIVLRLRNLGLGSDDEDKNEGEVESGDT 147

Query: 2172 KKVTGEEKLGDLLKRDWVRPDRILVEDEEDSASMSLPWXXXXXXXXXXXXXXXXXXXXXX 1993
              VTG+EKLGDLL+RDWVRPD +L+EDE++   M LPW                      
Sbjct: 148  MPVTGDEKLGDLLQRDWVRPDSMLIEDEDED-DMILPWERGEERQEEEGDGRLKRRA--- 203

Query: 1992 SIKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSELVRLKF 1813
             ++APTLAELTIED               RINVPKAG+T  V  KIH+KWRK ELVRLKF
Sbjct: 204  -VRAPTLAELTIEDEELRRLRRLGMTIRERINVPKAGITQAVLGKIHEKWRKEELVRLKF 262

Query: 1812 HEDLAHDMKTAHEVVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVKREGDTLFVP 1633
            HE LAHDMKTAHE+VERRTGGLV WRSGSVMVVFRGTNYEGP   + Q V  EGD+LFVP
Sbjct: 263  HEALAHDMKTAHEIVERRTGGLVTWRSGSVMVVFRGTNYEGP--PKPQPVDGEGDSLFVP 320

Query: 1632 VVSSPGNLLAKSSSDDSCIDEK-SRPVVVPTCAESMTQEEAVFNSLLDGLGPRFEGWWGT 1456
             VSS  N   ++ ++     EK S PV  P  AE+MT+EEA +NSLLDGLGPRF  WWGT
Sbjct: 321  DVSSVDNPAMRNDNNGGPTLEKGSLPVRNPVHAENMTEEEAEYNSLLDGLGPRFVDWWGT 380

Query: 1455 GILPVDADLLPQKVPGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFALGRNRNHQ 1276
            G+LPVD DLLPQ +PGYKTP R+LP GMR RLTNAEMT+LRKLAKSLPCHFALGRNRNHQ
Sbjct: 381  GVLPVDGDLLPQSIPGYKTPLRILPTGMRPRLTNAEMTNLRKLAKSLPCHFALGRNRNHQ 440

Query: 1275 GLAAAIFKLWEKSLVVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYIVMYRGKDF 1096
            GLAAAI KLWEKS+VVKIAVK GIQNTNNK+MAEE+K LTGGVLLLRNKYYIV+YRGKDF
Sbjct: 441  GLAAAIIKLWEKSIVVKIAVKPGIQNTNNKLMAEEIKNLTGGVLLLRNKYYIVIYRGKDF 500

Query: 1095 LPPTVATALAERQEMTKQTQDEEEKMRGGPIEPVSTTEED--QALAGTLGEFYEAQARWG 922
            LP +VA AL+ER+E+TK  Q  EEK+R G  E + + E+   Q LAGTL EFYEAQARWG
Sbjct: 501  LPTSVAAALSEREELTKHIQVVEEKVRTGGAEAIPSGEDGVGQPLAGTLAEFYEAQARWG 560

Query: 921  REISLDEREKMIEEASRGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWVPVGPDDD 742
            REIS +E EKMIEEASR K+ RVVKR+EHKLA++Q           KI +S +P GP DD
Sbjct: 561  REISAEEHEKMIEEASRAKSARVVKRIEHKLALAQAKKLRPERLLAKIEASMIPAGPSDD 620

Query: 741  QETITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKEKEIAFV 562
            QETITDEER MFRR+GLRMKAYL LG+RGVFDGVIENMHLHWKHRELVKL+SK+K +AFV
Sbjct: 621  QETITDEERFMFRRLGLRMKAYLLLGVRGVFDGVIENMHLHWKHRELVKLISKQKTLAFV 680

Query: 561  EETARLLEYESGGILVAIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQ 382
            E+TARLLEYESGGILVAIE VPKG+ LI+YRGKNYRRP+SLRPRNLLTKAKALKR VA+Q
Sbjct: 681  EDTARLLEYESGGILVAIERVPKGYALIYYRGKNYRRPVSLRPRNLLTKAKALKRSVAMQ 740

Query: 381  RHEALSQHISELEKTIQQTRKEI 313
            RHEALSQHISELE+TI+Q + EI
Sbjct: 741  RHEALSQHISELERTIEQMKMEI 763


>gb|EMJ12507.1| hypothetical protein PRUPE_ppa001468mg [Prunus persica]
          Length = 820

 Score =  864 bits (2232), Expect = 0.0
 Identities = 480/780 (61%), Positives = 553/780 (70%), Gaps = 27/780 (3%)
 Frame = -1

Query: 2493 LKPFSSSLRPTTKTPRNPIQTPNINGATPRHSSSWLNKWPCATPLPPLHYKNPRTLQEES 2314
            LKPFSS     T+   NP   P+     P  S+ WLN WP       L    P     E 
Sbjct: 41   LKPFSSL--KATEHSGNPNAKPSHKSKPP--SAPWLNTWPPRNSPAEL----PCQKVNEK 92

Query: 2313 TSENQFLDEAVRPGT----------SAMDRIVLRLRNXXXXXXXXXXXXXXE-----AVQ 2179
             +E+   D+AV+  T          SA++RIVLRLRN                    ++Q
Sbjct: 93   VNESHGRDQAVKANTTRYFDKNKGQSAIERIVLRLRNLGLGSDDEEEDDGLGLDGQDSMQ 152

Query: 2178 DAKKVTGEEKLGDLLKRDWVRPDRILVEDEEDSASMSLPWXXXXXXXXXXXXXXXXXXXX 1999
             A+  +GEEKLGDLL+R+WVRPD +L E + +   ++LPW                    
Sbjct: 153  PAE--SGEEKLGDLLQREWVRPDYVLAEQKSND-EVALPWEKEDEISEEEEVKGLRKRR- 208

Query: 1998 XXSIKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSELVRL 1819
               +KAP+LAELTIED               RI+VPKAG+T  V +KIHD WRK ELVRL
Sbjct: 209  ---VKAPSLAELTIEDEELKRLRRMGMVLRERISVPKAGITQAVLEKIHDTWRKEELVRL 265

Query: 1818 KFHEDLAHDMKTAHEVVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVKREGDTLF 1639
            KFHE LA DMKTAHE+VERRTGGLV+WRSGSVMVV+RG+NY+GP  S++Q+V REG  LF
Sbjct: 266  KFHEVLALDMKTAHEIVERRTGGLVLWRSGSVMVVYRGSNYKGP--SKSQTVDREGGALF 323

Query: 1638 VPVVSSPGNLLAKSSSD-DSCIDEKSRPVVVPTCAESMTQEEAVFNSLLDGLGPRFEGWW 1462
            +P VSS      +S +D  S  D   + V +P    +MT+EEA FNSLLD LGPRF  WW
Sbjct: 324  IPDVSSAETSATRSGNDATSGPDNNEKAVKIPAHLPNMTEEEAEFNSLLDDLGPRFVEWW 383

Query: 1461 GTGILPVDADLLPQKVPGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFALGRNRN 1282
            GTG+LPVDADLLP+ +PGYKTPFRLLP GMRSRLTNAEMT+LRKLAKSLPCHFALGRNRN
Sbjct: 384  GTGVLPVDADLLPKTIPGYKTPFRLLPTGMRSRLTNAEMTNLRKLAKSLPCHFALGRNRN 443

Query: 1281 HQGLAAAIFKLWEKSLVVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYIVMYRGK 1102
            HQGLA+AI KLWEKS V KIAVKRGIQNTNNK+MAEELKTLTGGVLLLRNKYYIV YRGK
Sbjct: 444  HQGLASAIIKLWEKSSVAKIAVKRGIQNTNNKLMAEELKTLTGGVLLLRNKYYIVFYRGK 503

Query: 1101 DFLPPTVATALAERQEMTKQTQDEEEKMRGGPIEPVST-TEEDQALAGTLGEFYEAQARW 925
            DFLP +VA ALAERQE+TKQ QD EEKMR   I+  S+  EE QALAGTL EFYEAQARW
Sbjct: 504  DFLPTSVAAALAERQELTKQVQDVEEKMRIKAIDAASSGAEEGQALAGTLAEFYEAQARW 563

Query: 924  GREISLDEREKMIEEASRGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWVPVGPDD 745
            GREIS +EREKMIEE S+ K  R+VKR+EHKL ++Q           KI SS +P GPD 
Sbjct: 564  GREISAEEREKMIEEDSKAKNARLVKRIEHKLGVAQAKKLRAEKLLSKIESSMLPAGPDY 623

Query: 744  DQETITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKEKEIAF 565
            DQET+TDEER MFRRVGLRMKAYLPLGIRGVFDGV+ENMHLHWKHRELVKL+SK+K +AF
Sbjct: 624  DQETVTDEERVMFRRVGLRMKAYLPLGIRGVFDGVVENMHLHWKHRELVKLISKQKTLAF 683

Query: 564  VEETARLLEYESGGILVAIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKALKRRVAL 385
            VE+TARLLE+ESGGILVAIE VPKG+ LI+YRGKNY+RPI+LRPRNLLTKAKALKR VA+
Sbjct: 684  VEDTARLLEFESGGILVAIERVPKGYALIYYRGKNYQRPITLRPRNLLTKAKALKRSVAI 743

Query: 384  QRHEALSQHISELEKTIQQTRKEIDV----------KNGDPALFNNVSEFTESEDENSPM 235
            QRHEALSQHISELEKTI+Q   EI V           + DP   +  SEF +SEDE S M
Sbjct: 744  QRHEALSQHISELEKTIEQMSSEIGVSEDIADESTWSSRDPDQIHGASEFVQSEDEASRM 803


>gb|EXB56911.1| Chloroplastic group IIA intron splicing facilitator CRS1 [Morus
            notabilis]
          Length = 838

 Score =  858 bits (2216), Expect = 0.0
 Identities = 473/764 (61%), Positives = 548/764 (71%), Gaps = 28/764 (3%)
 Frame = -1

Query: 2448 RNPIQTPNINGATPRH---SSSWLNKWPCATPLPPLHYKNPRTLQEESTSENQFLDEAVR 2278
            +NP  + + + ++ RH   S+ WLNKWP   P+     K   +   + T     +    R
Sbjct: 62   QNPKPSSSSSSSSHRHKPPSAPWLNKWP---PVESSDRKVAESTDRDRTDRPDTVGYVDR 118

Query: 2277 P-GTSAMDRIVLRLRNXXXXXXXXXXXXXXEAV----QDAKKVTGEEKLGDLLKRDWVRP 2113
              G +A++RIVLRLRN                +    QDA  VTGEEKLGDLL+R+W+RP
Sbjct: 119  DRGRNAIERIVLRLRNLGLGSDDEDEDDKEGDIGLDGQDAMPVTGEEKLGDLLRREWIRP 178

Query: 2112 DRILVEDEEDSASMSLPWXXXXXXXXXXXXXXXXXXXXXXSIKAPTLAELTIEDXXXXXX 1933
            D +L E+EE    ++LPW                       + APTLAELTIED      
Sbjct: 179  DFVL-EEEESKDDLTLPWEREEEEKGVDEGTRELRKRR---VNAPTLAELTIEDEELRRL 234

Query: 1932 XXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSELVRLKFHEDLAHDMKTAHEVVERRTG 1753
                     RI+VPKAG+T  V +KIHDKWRK ELVRLKFHE LAHDMKTAHE+VERRTG
Sbjct: 235  RRMGMFLRDRISVPKAGLTQAVLEKIHDKWRKEELVRLKFHEVLAHDMKTAHEIVERRTG 294

Query: 1752 GLVIWRSGSVMVVFRGTNYEGPLSSQAQSVKREGDTLFVPVVSSPGNLLAKSSSDDSCID 1573
            GLV WRSGSVMVV+RG+NYEGP   + Q V +E D LF+P VSS  N L +S    +   
Sbjct: 295  GLVTWRSGSVMVVYRGSNYEGP--PKTQPVNKERDALFIPDVSSAENFLTRSGDSLTSNA 352

Query: 1572 EKSR-PVVVPTCAESMTQEEAVFNSLLDGLGPRFEGWWGTGILPVDADLLPQKVPGYKTP 1396
            EKS  PV  P   ++MT+EEA FNSLLD LGPRF+ WWGTG++PVDADLLP K+PGYKTP
Sbjct: 353  EKSETPVRNPVSVQNMTEEEAEFNSLLDDLGPRFDEWWGTGVIPVDADLLPPKIPGYKTP 412

Query: 1395 FRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIFKLWEKSLVVKIAV 1216
            FRLLP GMRSRLTN EMT+LRK+AKSLP HFALGRNRNHQGLAAAI KLWEKSLV KIAV
Sbjct: 413  FRLLPTGMRSRLTNGEMTNLRKVAKSLPSHFALGRNRNHQGLAAAIIKLWEKSLVAKIAV 472

Query: 1215 KRGIQNTNNKIMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVATALAERQEMTKQTQ 1036
            KRGIQNTNNK+MAEELK LTGGVLLLRNKYYIV+YRGKDFLP TVA  LAERQ++ KQ Q
Sbjct: 473  KRGIQNTNNKLMAEELKNLTGGVLLLRNKYYIVIYRGKDFLPTTVAATLAERQKLAKQVQ 532

Query: 1035 DEEEKMRGGPIEPV----------STTEEDQALAGTLGEFYEAQARWGREISLDEREKMI 886
            D EE++R   IE            S  EE QALAGTL EFYEAQARWGREI+ +EREKMI
Sbjct: 533  DLEEQVRVQDIEQKMQKKAVDSVPSGEEEGQALAGTLAEFYEAQARWGREITSEEREKMI 592

Query: 885  EEASRGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWVPVGPDDDQETITDEERAMF 706
            EEA+  K  R+VKR+EHK A++Q           KI +S VP GPD DQETIT+EER MF
Sbjct: 593  EEAAVAKHARLVKRIEHKAAVAQAKKLRAEKLLAKIEASMVPAGPDYDQETITEEERVMF 652

Query: 705  RRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKEKEIAFVEETARLLEYESG 526
            RRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKL++K+K +AFVE+TARLLEYESG
Sbjct: 653  RRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLITKQKTLAFVEDTARLLEYESG 712

Query: 525  GILVAIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRHEALSQHISEL 346
            GILVAIE VPKGF LI+YRGKNYRRPISLRPRNLLTKAKALKR VA+QRHEALSQHISEL
Sbjct: 713  GILVAIERVPKGFALIYYRGKNYRRPISLRPRNLLTKAKALKRSVAMQRHEALSQHISEL 772

Query: 345  EKTIQQTR-KEIDVKNG--------DPALFNNVSEFTESEDENS 241
            E TI+Q + K +  K+G        D  L +NVSEF +SE++++
Sbjct: 773  ETTIEQMQDKIVASKSGQDEGSWSTDENLNDNVSEFIQSENDDA 816


>ref|XP_002317913.2| hypothetical protein POPTR_0012s05260g [Populus trichocarpa]
            gi|550326426|gb|EEE96133.2| hypothetical protein
            POPTR_0012s05260g [Populus trichocarpa]
          Length = 807

 Score =  853 bits (2205), Expect = 0.0
 Identities = 472/767 (61%), Positives = 555/767 (72%), Gaps = 19/767 (2%)
 Frame = -1

Query: 2490 KPFS----SSLRPTTKTPRNPIQTPNINGATPRHSSSWLNKWPCATPLPPLHYKNPRTLQ 2323
            KPFS    SSLR T KTP+   + PN           W++KW    P      KNP +  
Sbjct: 39   KPFSTATSSSLR-TNKTPKTQQKNPN-----------WISKWK---PSQNHSIKNPPS-- 81

Query: 2322 EESTSENQFLDEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEAVQ-DAKKVTGEEKL 2146
            E S  +  +       G +A++RIVLRLRN              E  + +   +TGEE+L
Sbjct: 82   EVSQEKPHYFSND--KGQNAIERIVLRLRNLGLGSDDEDELEGLEGSEINGGGLTGEERL 139

Query: 2145 GDLLKRDWVRPDRILVEDEE--DSASMSLPWXXXXXXXXXXXXXXXXXXXXXXSIKAPTL 1972
            GDLLKR+WVRPD ++  ++E  DS    LPW                        KAPTL
Sbjct: 140  GDLLKREWVRPDTVVFSNDEGSDSDESVLPWEREERGAVEMEGGIESGRKRRG--KAPTL 197

Query: 1971 AELTIEDXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSELVRLKFHEDLAHD 1792
            AELTIED               RI++PKAG+T  V + IHD+WRK ELVRLKFHE LAHD
Sbjct: 198  AELTIEDEELRRLRRMGMFIRERISIPKAGITNAVLENIHDRWRKEELVRLKFHEVLAHD 257

Query: 1791 MKTAHEVVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVKREGDTLFVPVVSSPGN 1612
            MKTAHE+VERRTGGLVIWR+GSVMVVFRGTNY+GP  S+ Q   REGD LFVP VSS  +
Sbjct: 258  MKTAHEIVERRTGGLVIWRAGSVMVVFRGTNYQGP-PSKLQPADREGDALFVPDVSSTDS 316

Query: 1611 LLAKSSSDDSCIDEKSRPVV-VPTCAESMTQEEAVFNSLLDGLGPRFEGWWGTGILPVDA 1435
            ++ +SS+  +   EKS+ V+ +    E+MT+EEA  NSLLD LGPRFE WWGTG+LPVDA
Sbjct: 317  VMTRSSNIATSSSEKSKLVMRITEPTENMTEEEAELNSLLDDLGPRFEEWWGTGLLPVDA 376

Query: 1434 DLLPQKVPGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIF 1255
            DLLP KVP YKTPFRLLP+GMR+RLTNAEMT++RKLAK+LPCHFALGRNRNHQGLA AI 
Sbjct: 377  DLLPPKVPCYKTPFRLLPVGMRARLTNAEMTNMRKLAKALPCHFALGRNRNHQGLAVAIL 436

Query: 1254 KLWEKSLVVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVAT 1075
            KLWEKSLV KIAVKRGIQNTNNK+MA+ELK LTGGVLLLRNKYYIV++RGKDFLP +VA 
Sbjct: 437  KLWEKSLVAKIAVKRGIQNTNNKLMADELKMLTGGVLLLRNKYYIVIFRGKDFLPQSVAA 496

Query: 1074 ALAERQEMTKQTQDEEEKMRGGPIEPV-STTEEDQALAGTLGEFYEAQARWGREISLDER 898
            ALAERQE+TKQ QD EE++R   +E   S  +E +ALAGTL EFYEAQARWGR+IS +ER
Sbjct: 497  ALAERQEVTKQIQDVEERVRSNSVEAAPSGEDEGKALAGTLAEFYEAQARWGRDISTEER 556

Query: 897  EKMIEEASRGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWVPVGPDDDQETITDEE 718
            EKMIEEAS+ KT R+VKR EHKLAI+Q           KI ++ VP GPD DQETI++EE
Sbjct: 557  EKMIEEASKAKTARLVKRTEHKLAIAQAKKLRAESLLSKIETTMVPSGPDFDQETISEEE 616

Query: 717  RAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKEKEIAFVEETARLLE 538
            R MFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKL+SK+K +AFVE+TA+LLE
Sbjct: 617  RVMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLISKQKTLAFVEDTAKLLE 676

Query: 537  YESGGILVAIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRHEALSQH 358
            YESGG+LVAIE VPKGF LI+YRGKNYRRPIS+RPRNLLTKAKALKR VA+QRHEALSQH
Sbjct: 677  YESGGVLVAIERVPKGFALIYYRGKNYRRPISIRPRNLLTKAKALKRSVAMQRHEALSQH 736

Query: 357  ISELEKTIQQTRKEIDV----------KNGDPALFNNVSEFTESEDE 247
            I ELEK I++  KE+ +           + + A  NNVS+ T+SED+
Sbjct: 737  IFELEKNIEEMVKEMGLSKEEENENNWSSEEHAPLNNVSKLTQSEDK 783


>ref|XP_004300521.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 820

 Score =  852 bits (2202), Expect = 0.0
 Identities = 476/775 (61%), Positives = 557/775 (71%), Gaps = 22/775 (2%)
 Frame = -1

Query: 2493 LKPFSSSLRPTTKTPRNPIQTPNINGATPRHSSSWLNKWPC---ATPLPPLHYKNPRTLQ 2323
            LKPFS+ LR TT+   NP        ++   ++ WLNKWP    A   PP    + R  +
Sbjct: 38   LKPFSA-LR-TTEHGGNPNARHKSKPSSSSSTAPWLNKWPSRGQAPAEPPRQKFSDRVKE 95

Query: 2322 EE-----STSENQFLDEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEAVQDAKKV-T 2161
             +     S++  +++D+    G SA++RIV RLRN                  D+    +
Sbjct: 96   SDGREKPSSNAARYVDKD--KGQSAIERIVFRLRNLGLGDDEEEEESGDGVELDSMPAAS 153

Query: 2160 GEEKLGDLLKRDWVRPDRILVEDEEDSASMSLPWXXXXXXXXXXXXXXXXXXXXXXSIKA 1981
            G EKLGDLL+R+WVRPD IL E++ D   ++LPW                        KA
Sbjct: 154  GAEKLGDLLQREWVRPDYILAEEKGDD-DVALPWEKEEEELSEDEEVKGMRKARRS--KA 210

Query: 1980 PTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSELVRLKFHEDL 1801
            P+LAELTIED               RI+VPKAG+T  V +KIHDKWRK ELVRLKFHE L
Sbjct: 211  PSLAELTIEDEELRRLRRLGMVLRERISVPKAGITQAVLEKIHDKWRKEELVRLKFHEVL 270

Query: 1800 AHDMKTAHEVVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVKREGDTLFVPVVSS 1621
            AHDMKTAHE+VERRTGGLV+WRSGSVMVV+RG+NY+GP  S+++   R GD LF+P VSS
Sbjct: 271  AHDMKTAHEIVERRTGGLVLWRSGSVMVVYRGSNYKGP--SKSEPAGRGGDALFIPDVSS 328

Query: 1620 PGNLLAKSSSD-DSCIDEKSRPVVVPT-CAESMTQEEAVFNSLLDGLGPRFEGWWGTGIL 1447
                + +  +D  S  D+  + V +P    + MT EEA FNSLLD LGPRF  +WGTGIL
Sbjct: 329  AETSVTRGGNDATSAPDKTEQAVKIPEPLPKKMTDEEAEFNSLLDELGPRFVEYWGTGIL 388

Query: 1446 PVDADLLPQKVPGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLA 1267
            PVDADLLP+ +PGYKTPFRLLP GMRSRLTNAEMT+LRKLAKS+PCHFALGRNRNHQGLA
Sbjct: 389  PVDADLLPKTIPGYKTPFRLLPTGMRSRLTNAEMTNLRKLAKSIPCHFALGRNRNHQGLA 448

Query: 1266 AAIFKLWEKSLVVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPP 1087
            +AI K+WEKS V KIAVKRGIQNTNNKIMAEELK LTGGVLLLRNKYYIV+YRGKDF+P 
Sbjct: 449  SAILKVWEKSSVAKIAVKRGIQNTNNKIMAEELKALTGGVLLLRNKYYIVIYRGKDFVPT 508

Query: 1086 TVATALAERQEMTKQTQDEEEKMRGGPIEPV-STTEEDQALAGTLGEFYEAQARWGREIS 910
            TVATALAERQE+TKQ QD EE +R  PI+   S+TEE QALAGTL EFYEAQARWGREIS
Sbjct: 509  TVATALAERQELTKQVQDVEEIVRIKPIDAAASSTEEGQALAGTLAEFYEAQARWGREIS 568

Query: 909  LDEREKMIEEASRGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWVPVGPDDDQETI 730
             +ER+KMIEE S+ K  R  KR+EHKL ++Q           KI S+ +P GPD DQETI
Sbjct: 569  AEERKKMIEEDSKAKMARRAKRIEHKLGVAQAKKLRAESLLNKIESAMLPAGPDYDQETI 628

Query: 729  TDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKEKEIAFVEETA 550
            TDEER MFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKL+SK+K +AFVE++A
Sbjct: 629  TDEERVMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLISKQKTLAFVEDSA 688

Query: 549  RLLEYESGGILVAIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRHEA 370
            RLLEYESGGILVAIE VPKG+ LI+YRGKNY+RPI+LRPRNLLTKAKALKR VA+QRHEA
Sbjct: 689  RLLEYESGGILVAIERVPKGYALIYYRGKNYQRPITLRPRNLLTKAKALKRSVAMQRHEA 748

Query: 369  LSQHISELEKTIQQTRKEI----DVKN------GDPALFNNVSEFTESEDENSPM 235
            LSQHI ELE+TI+Q R EI    DV N       DP    + SEF +SEDE+S M
Sbjct: 749  LSQHIEELERTIEQMRSEIGISEDVDNERTWGSRDPHQSGHDSEFNQSEDEDSDM 803


>ref|XP_006440978.1| hypothetical protein CICLE_v10018859mg [Citrus clementina]
            gi|567896982|ref|XP_006440979.1| hypothetical protein
            CICLE_v10018859mg [Citrus clementina]
            gi|567896984|ref|XP_006440980.1| hypothetical protein
            CICLE_v10018859mg [Citrus clementina]
            gi|557543240|gb|ESR54218.1| hypothetical protein
            CICLE_v10018859mg [Citrus clementina]
            gi|557543241|gb|ESR54219.1| hypothetical protein
            CICLE_v10018859mg [Citrus clementina]
            gi|557543242|gb|ESR54220.1| hypothetical protein
            CICLE_v10018859mg [Citrus clementina]
          Length = 833

 Score =  850 bits (2196), Expect = 0.0
 Identities = 477/780 (61%), Positives = 555/780 (71%), Gaps = 21/780 (2%)
 Frame = -1

Query: 2520 RKTSSSVFFLKPFSSSLRPTTKTPRNPIQTPNI-NGATPRHSSSWLNKWPCATPLPPLHY 2344
            RKT S    LKPFSS LR T + PR   Q        +P  S+ WLN W  + P PP   
Sbjct: 37   RKTPSFQL-LKPFSS-LR-TNQNPRTDSQNQQFPKPRSPSTSAPWLNNW--SRPKPPSTE 91

Query: 2343 KNPRT-----LQEESTSENQF--LDEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEA 2185
               +      + E+ TS + +    ++   G +A++RIVLRLRN              E 
Sbjct: 92   NANKLGGRNQIDEKQTSPDSYPRYSDSDNKGRNAIERIVLRLRNLGLGSDDEEEGEEEED 151

Query: 2184 -VQDAKKVTGEEKLGDLLKRDWVRPDRILVEDEEDSASMSLPWXXXXXXXXXXXXXXXXX 2008
             + DA   TGEE+L DLL+R+WVRP+ +L E E +     LPW                 
Sbjct: 152  DINDA--ATGEERLEDLLRREWVRPNTVLREVEGEEDDSLLPWEREEEENLRAGGEKPAG 209

Query: 2007 XXXXXSIKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSEL 1828
                  +KAPTLAELTIED               RINVPKAG+T +V  KIHDKWRK EL
Sbjct: 210  ETRRRRMKAPTLAELTIEDEELRRLRRNGMYLRERINVPKAGLTQDVMRKIHDKWRKDEL 269

Query: 1827 VRLKFHEDLAHDMKTAHEVVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVKREGD 1648
            VRLKFHE LA DMKTAHE+VERRTGGLVIWR+GSVMVV+RG+NY GP SS+ Q +  +GD
Sbjct: 270  VRLKFHEVLATDMKTAHEIVERRTGGLVIWRAGSVMVVYRGSNYAGP-SSKPQPIDGDGD 328

Query: 1647 TLFVPVVSSPGNLLAKSSSDDSCIDEKSR-PVVVPTCAESMTQEEAVFNSLLDGLGPRFE 1471
            TLFVP VSS     A+S      +DEKS  PV +   ++ MT+EEA  NSLLD LGPRF+
Sbjct: 329  TLFVPHVSSTDGSTARS------VDEKSEVPVRILDHSKPMTEEEAECNSLLDSLGPRFQ 382

Query: 1470 GWWGTGILPVDADLLPQKVPGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFALGR 1291
             WWGTGILPVDADLLP KV GYKTPFRLLP GMRSRLTNAEMTDLR+LA+SLPCHFALGR
Sbjct: 383  EWWGTGILPVDADLLPPKVDGYKTPFRLLPTGMRSRLTNAEMTDLRRLARSLPCHFALGR 442

Query: 1290 NRNHQGLAAAIFKLWEKSLVVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYIVMY 1111
            NRNHQGLA AI KLWEKSLV KIAVKRGIQNTNNK+MAEELK+LTGG LL RNK+YIV+Y
Sbjct: 443  NRNHQGLAVAILKLWEKSLVAKIAVKRGIQNTNNKLMAEELKSLTGGTLLQRNKFYIVLY 502

Query: 1110 RGKDFLPPTVATALAERQEMTKQTQDEEEKMRGGPIEPVSTTE-EDQALAGTLGEFYEAQ 934
            RGKDFLPP VA+ALAER++  KQ QD EEK+R   +E   + E E QA AGTL EFYEAQ
Sbjct: 503  RGKDFLPPNVASALAEREQCAKQIQDVEEKVRSKTLEATPSGETEGQAPAGTLAEFYEAQ 562

Query: 933  ARWGREISLDEREKMIEEASRGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWVPVG 754
             RWGRE+S +EREKM+EEAS+ K  R+VKR+EHKLA+SQ           KI +S VP G
Sbjct: 563  KRWGREVSAEEREKMVEEASKAKHGRLVKRIEHKLAVSQAKKLRAERLLAKIEASMVPSG 622

Query: 753  PDDDQETITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKEKE 574
            PD DQETITDEERAMFRRVGLRMKA+LPLGIRGVFDGV+ENMHLHWK+RELVKL++K+K 
Sbjct: 623  PDYDQETITDEERAMFRRVGLRMKAFLPLGIRGVFDGVVENMHLHWKYRELVKLITKQKT 682

Query: 573  IAFVEETARLLEYESGGILVAIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKALKRR 394
            +A+VE+TARLLEYES GIL+AIE VPKGF LIFYRGKNYRRPISLRPRNLLTKAKALKR 
Sbjct: 683  LAYVEDTARLLEYESVGILIAIERVPKGFALIFYRGKNYRRPISLRPRNLLTKAKALKRS 742

Query: 393  VALQRHEALSQHISELEKTIQQTRKEIDVK----------NGDPALFNNVSEFTESEDEN 244
            VA+QRHEALSQHIS+LE TI+Q +KEI V           +GD   F++VS   ++ED +
Sbjct: 743  VAMQRHEALSQHISDLENTIEQMKKEIGVSKDEEDGNIRCSGDLKQFDHVSVLPQNEDND 802


>ref|XP_006440981.1| hypothetical protein CICLE_v10018859mg [Citrus clementina]
            gi|557543243|gb|ESR54221.1| hypothetical protein
            CICLE_v10018859mg [Citrus clementina]
          Length = 806

 Score =  843 bits (2179), Expect = 0.0
 Identities = 469/747 (62%), Positives = 541/747 (72%), Gaps = 11/747 (1%)
 Frame = -1

Query: 2520 RKTSSSVFFLKPFSSSLRPTTKTPRNPIQTPNI-NGATPRHSSSWLNKWPCATPLPPLHY 2344
            RKT S    LKPFSS LR T + PR   Q        +P  S+ WLN W  + P PP   
Sbjct: 37   RKTPSFQL-LKPFSS-LR-TNQNPRTDSQNQQFPKPRSPSTSAPWLNNW--SRPKPPSTE 91

Query: 2343 KNPRT-----LQEESTSENQF--LDEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEA 2185
               +      + E+ TS + +    ++   G +A++RIVLRLRN              E 
Sbjct: 92   NANKLGGRNQIDEKQTSPDSYPRYSDSDNKGRNAIERIVLRLRNLGLGSDDEEEGEEEED 151

Query: 2184 -VQDAKKVTGEEKLGDLLKRDWVRPDRILVEDEEDSASMSLPWXXXXXXXXXXXXXXXXX 2008
             + DA   TGEE+L DLL+R+WVRP+ +L E E +     LPW                 
Sbjct: 152  DINDA--ATGEERLEDLLRREWVRPNTVLREVEGEEDDSLLPWEREEEENLRAGGEKPAG 209

Query: 2007 XXXXXSIKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSEL 1828
                  +KAPTLAELTIED               RINVPKAG+T +V  KIHDKWRK EL
Sbjct: 210  ETRRRRMKAPTLAELTIEDEELRRLRRNGMYLRERINVPKAGLTQDVMRKIHDKWRKDEL 269

Query: 1827 VRLKFHEDLAHDMKTAHEVVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVKREGD 1648
            VRLKFHE LA DMKTAHE+VERRTGGLVIWR+GSVMVV+RG+NY GP SS+ Q +  +GD
Sbjct: 270  VRLKFHEVLATDMKTAHEIVERRTGGLVIWRAGSVMVVYRGSNYAGP-SSKPQPIDGDGD 328

Query: 1647 TLFVPVVSSPGNLLAKSSSDDSCIDEKSR-PVVVPTCAESMTQEEAVFNSLLDGLGPRFE 1471
            TLFVP VSS     A+S      +DEKS  PV +   ++ MT+EEA  NSLLD LGPRF+
Sbjct: 329  TLFVPHVSSTDGSTARS------VDEKSEVPVRILDHSKPMTEEEAECNSLLDSLGPRFQ 382

Query: 1470 GWWGTGILPVDADLLPQKVPGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFALGR 1291
             WWGTGILPVDADLLP KV GYKTPFRLLP GMRSRLTNAEMTDLR+LA+SLPCHFALGR
Sbjct: 383  EWWGTGILPVDADLLPPKVDGYKTPFRLLPTGMRSRLTNAEMTDLRRLARSLPCHFALGR 442

Query: 1290 NRNHQGLAAAIFKLWEKSLVVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYIVMY 1111
            NRNHQGLA AI KLWEKSLV KIAVKRGIQNTNNK+MAEELK+LTGG LL RNK+YIV+Y
Sbjct: 443  NRNHQGLAVAILKLWEKSLVAKIAVKRGIQNTNNKLMAEELKSLTGGTLLQRNKFYIVLY 502

Query: 1110 RGKDFLPPTVATALAERQEMTKQTQDEEEKMRGGPIEPVSTTE-EDQALAGTLGEFYEAQ 934
            RGKDFLPP VA+ALAER++  KQ QD EEK+R   +E   + E E QA AGTL EFYEAQ
Sbjct: 503  RGKDFLPPNVASALAEREQCAKQIQDVEEKVRSKTLEATPSGETEGQAPAGTLAEFYEAQ 562

Query: 933  ARWGREISLDEREKMIEEASRGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWVPVG 754
             RWGRE+S +EREKM+EEAS+ K  R+VKR+EHKLA+SQ           KI +S VP G
Sbjct: 563  KRWGREVSAEEREKMVEEASKAKHGRLVKRIEHKLAVSQAKKLRAERLLAKIEASMVPSG 622

Query: 753  PDDDQETITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKEKE 574
            PD DQETITDEERAMFRRVGLRMKA+LPLGIRGVFDGV+ENMHLHWK+RELVKL++K+K 
Sbjct: 623  PDYDQETITDEERAMFRRVGLRMKAFLPLGIRGVFDGVVENMHLHWKYRELVKLITKQKT 682

Query: 573  IAFVEETARLLEYESGGILVAIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKALKRR 394
            +A+VE+TARLLEYES GIL+AIE VPKGF LIFYRGKNYRRPISLRPRNLLTKAKALKR 
Sbjct: 683  LAYVEDTARLLEYESVGILIAIERVPKGFALIFYRGKNYRRPISLRPRNLLTKAKALKRS 742

Query: 393  VALQRHEALSQHISELEKTIQQTRKEI 313
            VA+QRHEALSQHIS+LE TI+Q +KEI
Sbjct: 743  VAMQRHEALSQHISDLENTIEQMKKEI 769


>ref|XP_006485796.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Citrus sinensis]
          Length = 837

 Score =  840 bits (2171), Expect = 0.0
 Identities = 475/783 (60%), Positives = 557/783 (71%), Gaps = 24/783 (3%)
 Frame = -1

Query: 2520 RKTSSSVFFLKPFSSSLRPTTKTPRNPIQTPNI-NGATPRHSSSWLNKWPCATPLPP--- 2353
            RKT S    LKPFSS LR T + PR   Q         P  S+ WLN W  + P PP   
Sbjct: 37   RKTPSFQL-LKPFSS-LR-TNQNPRTDSQNQKFPKPRFPSTSAPWLNNW--SRPKPPSTE 91

Query: 2352 -LHYKNPRT-LQEESTSENQF--LDEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEA 2185
             ++  + R  + E+ T+ + +    ++   G +A++RIVLRLRN              E 
Sbjct: 92   NVNKSDGRNQIDEKQTAPDSYPRYSDSDNKGRNAIERIVLRLRNLGLGSDDEEEGEEEED 151

Query: 2184 VQDAKKVTGEEKLGDLLKRDWVRPDRILVEDEEDSASMSLPWXXXXXXXXXXXXXXXXXX 2005
              +    TGEE+L DLL+R+WVRP+ +L E E +     LPW                  
Sbjct: 152  DINGA-ATGEERLEDLLRREWVRPNTVLREVEGEEDDSLLPWEREEEENLRAGGEKPAGE 210

Query: 2004 XXXXSIKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSELV 1825
                 +KAPTLAELTIED               RINVPKAG+T +V  KIHDKWRK ELV
Sbjct: 211  TRRRRMKAPTLAELTIEDEELRRLRRNGMYLRERINVPKAGLTQDVMRKIHDKWRKDELV 270

Query: 1824 RLKFHEDLAHDMKTAHEVVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVKREGD- 1648
            RLKFHE LA DMKTAHE+VERRTGGLVIWR+GSVMVV++G+NY GP SS+ Q +  +GD 
Sbjct: 271  RLKFHEVLATDMKTAHEIVERRTGGLVIWRAGSVMVVYQGSNYAGP-SSKPQPLDGDGDG 329

Query: 1647 ---TLFVPVVSSPGNLLAKSSSDDSCIDEKSR-PVVVPTCAESMTQEEAVFNSLLDGLGP 1480
               TLFVP VSS     A+S      +DEKS  PV +   ++ MT+EEA  NSLLD LGP
Sbjct: 330  DGDTLFVPHVSSTDGSTARS------VDEKSEVPVRILDHSKPMTEEEAECNSLLDSLGP 383

Query: 1479 RFEGWWGTGILPVDADLLPQKVPGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFA 1300
            RF+ WWGTGILPVDADLLP KV GYKTPFRLLP GMRSRLTNAEMTDLR+LA+SLPCHFA
Sbjct: 384  RFQEWWGTGILPVDADLLPPKVDGYKTPFRLLPTGMRSRLTNAEMTDLRRLARSLPCHFA 443

Query: 1299 LGRNRNHQGLAAAIFKLWEKSLVVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYI 1120
            LGRNRNHQGLA AI KLWEKSLV KIAVKRGIQNTNNK+MAEELK+LTGG LL RNK+YI
Sbjct: 444  LGRNRNHQGLAVAILKLWEKSLVAKIAVKRGIQNTNNKLMAEELKSLTGGTLLQRNKFYI 503

Query: 1119 VMYRGKDFLPPTVATALAERQEMTKQTQDEEEKMRGGPIEPVSTTE-EDQALAGTLGEFY 943
            V+YRGKDFLPP VA+ALAER++  KQ QD EEK+R   +E   + E E QA AGTL EFY
Sbjct: 504  VLYRGKDFLPPNVASALAEREQCAKQIQDVEEKVRSKTLEATPSGETEGQAPAGTLAEFY 563

Query: 942  EAQARWGREISLDEREKMIEEASRGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWV 763
            EAQ RWGRE+S +EREKM+EEAS+ K  R+VKR+EHKLA+SQ           KI +S V
Sbjct: 564  EAQKRWGREVSAEEREKMVEEASKAKHARLVKRIEHKLAVSQAKKLRAERLLAKIEASMV 623

Query: 762  PVGPDDDQETITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSK 583
            P GPD DQETITDEERAMFRRVGLRMKA+LPLGIRGVFDGV+ENMHLHWK+RELVKL++K
Sbjct: 624  PSGPDYDQETITDEERAMFRRVGLRMKAFLPLGIRGVFDGVVENMHLHWKYRELVKLITK 683

Query: 582  EKEIAFVEETARLLEYESGGILVAIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKAL 403
            +K +A+VE+TARLLEYESGGIL+AIE VPKGF LIFYRGKNYRRPISLRPRNLLTKAKAL
Sbjct: 684  QKTLAYVEDTARLLEYESGGILIAIERVPKGFALIFYRGKNYRRPISLRPRNLLTKAKAL 743

Query: 402  KRRVALQRHEALSQHISELEKTIQQTRKEIDV----------KNGDPALFNNVSEFTESE 253
            KR VA+QRHEALSQHIS+LE TI+Q +KEI V           +GD   F++VS   ++E
Sbjct: 744  KRSVAMQRHEALSQHISDLENTIEQMKKEIGVFKDEEDGNIRCSGDLKQFDHVSVLPQNE 803

Query: 252  DEN 244
            D++
Sbjct: 804  DDD 806


>gb|EPS74467.1| hypothetical protein M569_00278, partial [Genlisea aurea]
          Length = 693

 Score =  820 bits (2119), Expect = 0.0
 Identities = 433/687 (63%), Positives = 507/687 (73%), Gaps = 11/687 (1%)
 Frame = -1

Query: 2265 AMDRIVLRLRNXXXXXXXXXXXXXXEAVQDA-----KKVTGEEKLGDLLKRDWVRPDRIL 2101
            A+DRIVLRLRN               + +D+     +++  EEKLGDLLKRDWVRPD IL
Sbjct: 1    AIDRIVLRLRNLGLGSDEEGDDGRGLSREDSIDSKLEELGEEEKLGDLLKRDWVRPDTIL 60

Query: 2100 VEDEEDSAS--MSLPWXXXXXXXXXXXXXXXXXXXXXXSIKAPTLAELTIEDXXXXXXXX 1927
            V+D +  +   + LPW                       ++APT+AELTIED        
Sbjct: 61   VQDSDSDSDSELLLPWERRGNATEQDEMEAKGASRKGE-MRAPTMAELTIEDEELRRLRR 119

Query: 1926 XXXXXXXRINVPKAGVTMEVFDKIHDKWRKSELVRLKFHEDLAHDMKTAHEVVERRTGGL 1747
                   RINVPKAG+T  + +KIH+KWRKSELVRLKFHE+LAHDMKTAH++VERRTGGL
Sbjct: 120  MGMTLRERINVPKAGITGVILEKIHEKWRKSELVRLKFHEELAHDMKTAHQIVERRTGGL 179

Query: 1746 VIWRSGSVMVVFRGTNYEGPLSS-QAQSVKREGDTLFVPVVSSPGNLLAKSSSDDSCIDE 1570
            V WRSGSVMVVFRGTNYEGP+S  Q  ++  E D  FVP V S G ++   + D +    
Sbjct: 180  VTWRSGSVMVVFRGTNYEGPVSKPQRPNIDEEDDGPFVPTVPS-GEVVTSETGDSTSKTL 238

Query: 1569 KSRPVVVPTCAESMTQEEAVFNSLLDGLGPRFEGWWGTGILPVDADLLPQKVPGYKTPFR 1390
            +    ++ + AES+T++EA +N LLDGLGPRFE WWGTG+LPVDADLLP  VPGYKTPFR
Sbjct: 239  EKPSRIIASAAESVTEQEAEYNMLLDGLGPRFEDWWGTGVLPVDADLLPPAVPGYKTPFR 298

Query: 1389 LLPIGMRSRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIFKLWEKSLVVKIAVKR 1210
            LLP+GMRSRLTNAEMT LRKLAK LP HFALG+NR HQGLA+AI KLWEKSL+VKIAVKR
Sbjct: 299  LLPVGMRSRLTNAEMTHLRKLAKRLPSHFALGKNRKHQGLASAIVKLWEKSLLVKIAVKR 358

Query: 1209 GIQNTNNKIMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVATALAERQEMTKQTQDE 1030
            GIQNTNNK+MAEELK LTGGVLLLRNKYYI+MYRGKDFLPP+VA+ALAER EMTKQ QD 
Sbjct: 359  GIQNTNNKLMAEELKALTGGVLLLRNKYYIIMYRGKDFLPPSVASALAERNEMTKQIQDV 418

Query: 1029 EEKMRGGPIEPVSTTEED---QALAGTLGEFYEAQARWGREISLDEREKMIEEASRGKTN 859
            EE++R GP   ++  ++D   +A AGTL EFYEAQ RWG EIS D+R KM+EEASR    
Sbjct: 419  EERVRRGPAAAITNGDDDDGKEASAGTLSEFYEAQVRWGMEISPDQRNKMLEEASRSIKM 478

Query: 858  RVVKRLEHKLAISQXXXXXXXXXXXKIISSWVPVGPDDDQETITDEERAMFRRVGLRMKA 679
            + +KRLE K+A +Q           KI+ SWVPV P DDQETITDEER M+RR+GLRM  
Sbjct: 479  KALKRLERKVAAAQAKKLRAEKLLSKIVDSWVPVDPSDDQETITDEERVMYRRLGLRMTP 538

Query: 678  YLPLGIRGVFDGVIENMHLHWKHRELVKLLSKEKEIAFVEETARLLEYESGGILVAIESV 499
            YLPLGIRGVFDGVIENMHLHWKHRELVKL+SKEKE +FVEETARLLEYESGGILVAIE V
Sbjct: 539  YLPLGIRGVFDGVIENMHLHWKHRELVKLISKEKETSFVEETARLLEYESGGILVAIERV 598

Query: 498  PKGFILIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRHEALSQHISELEKTIQQTRK 319
            PKG  LI+YRGKNY+RP+SLRPRNLL K+ ALKRRVALQR+EALSQHISELEKTI Q ++
Sbjct: 599  PKGHALIYYRGKNYQRPLSLRPRNLLNKSNALKRRVALQRYEALSQHISELEKTISQAKQ 658

Query: 318  EIDVKNGDPALFNNVSEFTESEDENSP 238
            ++      P       E  + E+E  P
Sbjct: 659  QM-AATDPPEEDEEEEEEKKEEEEEDP 684


>ref|XP_006406610.1| hypothetical protein EUTSA_v10020034mg [Eutrema salsugineum]
            gi|557107756|gb|ESQ48063.1| hypothetical protein
            EUTSA_v10020034mg [Eutrema salsugineum]
          Length = 874

 Score =  813 bits (2099), Expect = 0.0
 Identities = 452/796 (56%), Positives = 545/796 (68%), Gaps = 51/796 (6%)
 Frame = -1

Query: 2478 SSLRPTTKTPRNPIQTPNINGATPRHSSS---WLNKWPCATPLPPLHYKNP--------- 2335
            SSLR + ++  N  ++ N      RHS     W++KWP ++     H             
Sbjct: 52   SSLRTSERSSNN--RSHNNRRLDQRHSKPTPPWIDKWPPSSAGAGDHSGKKVAEQNGGGK 109

Query: 2334 -RTLQEESTSENQFLDEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEAVQ----DAK 2170
             R+ +EE+ ++ ++L++    G SA++RIVLRLRN              E       D K
Sbjct: 110  IRSAEEEAEAKRRYLEKD--KGHSAIERIVLRLRNLGLASDDEDDVEDNEGDGINGGDVK 167

Query: 2169 KVTGEEKLGDLLKRDWVRPDRILVEDEEDSAS---MSLPWXXXXXXXXXXXXXXXXXXXX 1999
             VTGEE+LGDLLKR+WVRPD +L E EE+S     + LPW                    
Sbjct: 168  PVTGEERLGDLLKREWVRPDMMLAEGEEESDEDDDVLLPWEKNEEEQAAERMEGDGAAVK 227

Query: 1998 XXSIKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSELVRL 1819
                +AP+LAELT+ED               RI++PKAG+T  V +KIHD WRK ELVRL
Sbjct: 228  KRRARAPSLAELTVEDSELRRLRRDGMYLRVRISIPKAGLTQAVMEKIHDTWRKEELVRL 287

Query: 1818 KFHEDLAHDMKTAHEVVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVKREGDTLF 1639
            KFHE LA DM+TAHE+VERRTGG+VIWR+GSVMVV+RG +Y+GP S  +  + R  +TLF
Sbjct: 288  KFHEVLARDMRTAHEIVERRTGGMVIWRAGSVMVVYRGRDYQGP-SMISNQMARPEETLF 346

Query: 1638 VPVVSSPGNLLAKSSSDDSCIDEKSRPVVV-PTCAESMTQEEAVFNSLLDGLGPRFEGWW 1462
            VP VSS G+    S  + S   E   P+V  P   E+MT+EEA FNSLLD LGPRF  WW
Sbjct: 347  VPDVSSAGDEATGSKDNQSAPPEIKDPIVRNPIRKETMTEEEAEFNSLLDSLGPRFHEWW 406

Query: 1461 GTGILPVDADLLPQKVPGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFALGRNRN 1282
            GTG+LPV+ADLLP  +PGYKTPFRLLP GMRS LTNAEMT+LRK+ K+LPCHFALGRNRN
Sbjct: 407  GTGVLPVNADLLPPTIPGYKTPFRLLPTGMRSNLTNAEMTNLRKIGKTLPCHFALGRNRN 466

Query: 1281 HQGLAAAIFKLWEKSLVVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYIVMYRGK 1102
            HQGLAAAI KLWEKSL+ KIAVKRGIQNTNNK+MA+E+KTLTGGVLLLRNKYYIV+YRGK
Sbjct: 467  HQGLAAAILKLWEKSLIAKIAVKRGIQNTNNKLMADEIKTLTGGVLLLRNKYYIVIYRGK 526

Query: 1101 DFLPPTVATALAERQEMTKQTQDEEEKMRGGPIE---PVSTT------------------ 985
            DFLP +VA  LAERQE+TK+ QD EE++R   IE   PV  T                  
Sbjct: 527  DFLPSSVAATLAERQELTKEIQDVEERVRTRDIETSQPVGDTVPAEAGTLADIEERVNNR 586

Query: 984  ---------EEDQALAGTLGEFYEAQARWGREISLDEREKMIEEASRGKTNRVVKRLEHK 832
                     ++  A AGTL EFYEAQARWG+EI+ D REKMIEEASR  + RVVKR++HK
Sbjct: 587  DIEASQPVGDKVPAEAGTLAEFYEAQARWGKEITPDHREKMIEEASRVASARVVKRIQHK 646

Query: 831  LAISQXXXXXXXXXXXKIISSWVPVGPDDDQETITDEERAMFRRVGLRMKAYLPLGIRGV 652
            L ++Q           KI +S +P GPD DQE I++EER MFR+VGL+MK+YLPLGIRGV
Sbjct: 647  LNLAQSKFHRAEKLLSKIEASMIPNGPDYDQEVISEEERIMFRKVGLKMKSYLPLGIRGV 706

Query: 651  FDGVIENMHLHWKHRELVKLLSKEKEIAFVEETARLLEYESGGILVAIESVPKGFILIFY 472
            FDGVIENMHLHWKHRELVKL+SK+K +AFVE+TARLLEYESGG+LVAIE VPKGF LI+Y
Sbjct: 707  FDGVIENMHLHWKHRELVKLISKQKSLAFVEDTARLLEYESGGVLVAIEKVPKGFALIYY 766

Query: 471  RGKNYRRPISLRPRNLLTKAKALKRRVALQRHEALSQHISELEKTIQQTRKEIDVKNGDP 292
            RGKNY+RPISLRPRNLLTKAKALKR +A+QRHEALSQHISELEKTI+Q + E+  KN   
Sbjct: 767  RGKNYQRPISLRPRNLLTKAKALKRSIAMQRHEALSQHISELEKTIEQMQNELTAKN--- 823

Query: 291  ALFNNVSEFTESEDEN 244
                    ++ESE EN
Sbjct: 824  ------PSYSESEWEN 833


>ref|XP_002883129.1| EMB1865 [Arabidopsis lyrata subsp. lyrata]
            gi|297328969|gb|EFH59388.1| EMB1865 [Arabidopsis lyrata
            subsp. lyrata]
          Length = 846

 Score =  808 bits (2088), Expect = 0.0
 Identities = 442/772 (57%), Positives = 542/772 (70%), Gaps = 23/772 (2%)
 Frame = -1

Query: 2493 LKPFSSSLRPTTKTPRNPIQTPNINGATPRHSSSWLNKWPCATPLPPLHYKNPR------ 2332
            ++PF S LR + ++         ++    + +  W++KWP ++      +   R      
Sbjct: 49   IRPFFS-LRTSERSNNRSNNNRRVDQRNHKPTPPWIDKWPPSSAGVGGDHAGKRGGENNG 107

Query: 2331 -----TLQEESTSENQFLDEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEAVQ---- 2179
                 + +EE+ ++ ++L+     G +A++RIVLRLRN              E       
Sbjct: 108  GDKIRSAEEEAEAKLRYLERD--KGQNAIERIVLRLRNLGLGSDDEEDVEDEEGGGINGG 165

Query: 2178 DAKKVTGEEKLGDLLKRDWVRPDRILVEDEE--DSASMSLPWXXXXXXXXXXXXXXXXXX 2005
            D K VTGEE+LGDLLKR+WVRPD +L E EE  +   + LPW                  
Sbjct: 166  DVKPVTGEERLGDLLKREWVRPDMMLAEGEESEEEDEVLLPWEKNEEEQAAERVEGEGGV 225

Query: 2004 XXXXS--IKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSE 1831
                    +AP+LAELT+ED               RIN+PKAG+T  V +KI+D WRK E
Sbjct: 226  AVMKKGRARAPSLAELTVEDSELRRLRRDGMYLRVRINIPKAGLTQAVMEKIYDTWRKEE 285

Query: 1830 LVRLKFHEDLAHDMKTAHEVVERRTGGLVIWRSGSVMVVFRGTNYEGP--LSSQAQSVKR 1657
            LVRLKFHE LA DMKTAHE+VERRTGG+VIWR+GSVMVV+RG +Y+GP  +S+Q    K 
Sbjct: 286  LVRLKFHEVLARDMKTAHEIVERRTGGMVIWRAGSVMVVYRGLDYKGPPVISNQMAGPK- 344

Query: 1656 EGDTLFVPVVSSPGNLLAKSSSDDSCIDEKSRPVVV-PTCAESMTQEEAVFNSLLDGLGP 1480
              +TLFVP VSS G+    +  + S   E   P++  P   E+MT+EEA FNSLLD LGP
Sbjct: 345  --ETLFVPDVSSAGDEATNAKDNQSPPSEIKDPIIKNPIRKENMTEEEAEFNSLLDSLGP 402

Query: 1479 RFEGWWGTGILPVDADLLPQKVPGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFA 1300
            RF+ WWGTG+LPVDADLLP  +PGYKTPFRLLP GMRS LTNAEMT+LRK+ K+LPCHFA
Sbjct: 403  RFQEWWGTGVLPVDADLLPPTIPGYKTPFRLLPTGMRSNLTNAEMTNLRKIGKTLPCHFA 462

Query: 1299 LGRNRNHQGLAAAIFKLWEKSLVVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYI 1120
            LGRNRNHQGLAAAI ++WEKSL+ KIAVKRGIQNTNNK+MA+E+K LTGGVLLLRNKYYI
Sbjct: 463  LGRNRNHQGLAAAILQIWEKSLIAKIAVKRGIQNTNNKLMADEVKALTGGVLLLRNKYYI 522

Query: 1119 VMYRGKDFLPPTVATALAERQEMTKQTQDEEEKMRGGPIEPVSTT-EEDQALAGTLGEFY 943
            V+YRGKDFLP +VA  LAERQE+TK+ QD EE++R   IE V    ++  A AGTL EFY
Sbjct: 523  VIYRGKDFLPSSVAATLAERQELTKEIQDVEERVRNREIEAVQPVGDKVPAEAGTLAEFY 582

Query: 942  EAQARWGREISLDEREKMIEEASRGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWV 763
            EAQARWG+EI+ D REKMIEEASR    RVVKR++HKL ++Q           KI +S +
Sbjct: 583  EAQARWGKEITPDHREKMIEEASRVANARVVKRIQHKLNLAQSKFQRAEKLLSKIEASMI 642

Query: 762  PVGPDDDQETITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSK 583
            P GPD DQE I++EERAMFR+VGL+MKAYLPLGIRGVFDGVIENMHLHWKHRELVKL+SK
Sbjct: 643  PNGPDYDQEVISEEERAMFRKVGLKMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLISK 702

Query: 582  EKEIAFVEETARLLEYESGGILVAIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKAL 403
            +K +AFVE+TARLLEYESGG+LVAIE VPKGF LI+YRGKNYRRPISLRPRNLLTKAKAL
Sbjct: 703  QKNLAFVEDTARLLEYESGGVLVAIEKVPKGFALIYYRGKNYRRPISLRPRNLLTKAKAL 762

Query: 402  KRRVALQRHEALSQHISELEKTIQQTRKEIDVKNGDPALFNNVSEFTESEDE 247
            KR +A+QRHEALSQHISELE+TI+Q + E+  K   P+   +  E  E +DE
Sbjct: 763  KRSIAMQRHEALSQHISELERTIEQMQSELTSKT--PSYSESEWENDEDDDE 812


>ref|NP_188468.1| CRS1 / YhbY domain-containing protein EMB1865 [Arabidopsis thaliana]
            gi|11994102|dbj|BAB01105.1| unnamed protein product
            [Arabidopsis thaliana] gi|17380904|gb|AAL36264.1| unknown
            protein [Arabidopsis thaliana]
            gi|332642570|gb|AEE76091.1| CRS1 / YhbY (CRM)
            domain-containing protein [Arabidopsis thaliana]
          Length = 848

 Score =  807 bits (2084), Expect = 0.0
 Identities = 441/773 (57%), Positives = 544/773 (70%), Gaps = 23/773 (2%)
 Frame = -1

Query: 2493 LKPFSSSLRPTTKTPRNPIQTPNINGATPRHSSSWLNKWPCATPLPPLHYKNP------- 2335
            ++PFSS LR + ++         ++    + +  W++KWP ++      +          
Sbjct: 49   VRPFSS-LRTSERSNNRSNNNRRLDQRNHKPTPPWIDKWPPSSSGAGGDHAGKKGGENNG 107

Query: 2334 ----RTLQEESTSENQFLDEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEAVQ---- 2179
                R+ +EE+ ++ ++L++    G +A++RIVLRLRN              E       
Sbjct: 108  GDRIRSAEEEAEAKLRYLEKD--KGQNAIERIVLRLRNLGLGSDDEDDVEDDEGGGINGG 165

Query: 2178 DAKKVTGEEKLGDLLKRDWVRPDRILVEDEE--DSASMSLPWXXXXXXXXXXXXXXXXXX 2005
            D K VTGEE+LGDLLKR+WVRPD +L E EE  +   + LPW                  
Sbjct: 166  DVKPVTGEERLGDLLKREWVRPDMMLAEGEESEEEDEVLLPWEKNEEEQAAERVVGEGGV 225

Query: 2004 XXXXS--IKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSE 1831
                    +AP+LAELT+ED               RIN+PKAG+T  V +KI+D WRK E
Sbjct: 226  AVMQKRRARAPSLAELTVEDSELRRLRRDGMYLRVRINIPKAGLTQAVMEKIYDTWRKEE 285

Query: 1830 LVRLKFHEDLAHDMKTAHEVVERRTGGLVIWRSGSVMVVFRGTNYEGP--LSSQAQSVKR 1657
            LVRLKFHE LA DMKTAHE+VERRTGG+VIWR+GSVMVV+RG +Y+GP  +S+Q    K 
Sbjct: 286  LVRLKFHEVLARDMKTAHEIVERRTGGMVIWRAGSVMVVYRGLDYKGPPVISNQMAGPK- 344

Query: 1656 EGDTLFVPVVSSPGNLLAKSSSDDSCIDEKSRPVVV-PTCAESMTQEEAVFNSLLDGLGP 1480
              +TLFVP VSS G+    +  + S       P++  P   E+MT+EE  FNSLLD LGP
Sbjct: 345  --ETLFVPDVSSAGDEATNAKDNQSAPLVIKDPIIKNPIRKENMTEEEVEFNSLLDSLGP 402

Query: 1479 RFEGWWGTGILPVDADLLPQKVPGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFA 1300
            RF+ WWGTG+LPVDADLLP  +PGYKTPFRLLP GMRS LTNAEMT+LRK+ K+LPCHFA
Sbjct: 403  RFQEWWGTGVLPVDADLLPPTIPGYKTPFRLLPTGMRSNLTNAEMTNLRKIGKTLPCHFA 462

Query: 1299 LGRNRNHQGLAAAIFKLWEKSLVVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYI 1120
            LGRNRNHQGLAAAI ++WEKSL+ KIAVKRGIQNTNNK+MA+E+KTLTGGVLLLRNKYYI
Sbjct: 463  LGRNRNHQGLAAAILQIWEKSLIAKIAVKRGIQNTNNKLMADEVKTLTGGVLLLRNKYYI 522

Query: 1119 VMYRGKDFLPPTVATALAERQEMTKQTQDEEEKMRGGPIEPVSTT-EEDQALAGTLGEFY 943
            V+YRGKDFLP +VA  LAERQE+TK+ QD EE++R   IE V    ++  A AGTL EFY
Sbjct: 523  VIYRGKDFLPSSVAATLAERQELTKEIQDVEERVRNREIEAVQPVGDKVPAEAGTLAEFY 582

Query: 942  EAQARWGREISLDEREKMIEEASRGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWV 763
            EAQARWG+EI+ D REKMIEEASR    RVVKR++HKL ++Q           KI +S +
Sbjct: 583  EAQARWGKEITPDHREKMIEEASRVANARVVKRIQHKLNLAQSKFQRAEKLLSKIEASMI 642

Query: 762  PVGPDDDQETITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSK 583
            P GPD DQE I++EERAMFR+VGL+MKAYLP+GIRGVFDGVIENMHLHWKHRELVKL+SK
Sbjct: 643  PNGPDYDQEVISEEERAMFRKVGLKMKAYLPIGIRGVFDGVIENMHLHWKHRELVKLISK 702

Query: 582  EKEIAFVEETARLLEYESGGILVAIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKAL 403
            +K  AFVEETARLLEYESGG+LVAIE VPKGF LI+YRGKNYRRPISLRPRNLLTKAKAL
Sbjct: 703  QKNQAFVEETARLLEYESGGVLVAIEKVPKGFALIYYRGKNYRRPISLRPRNLLTKAKAL 762

Query: 402  KRRVALQRHEALSQHISELEKTIQQTRKEIDVKNGDPALFNNVSEFTESEDEN 244
            KR +A+QRHEALSQHISELE+TI+Q + ++  KN  P+   +  E  E +D++
Sbjct: 763  KRSIAMQRHEALSQHISELERTIEQMQSQLTSKN--PSYSESEWENDEDDDDD 813


>ref|XP_002532154.1| conserved hypothetical protein [Ricinus communis]
            gi|223528164|gb|EEF30228.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 745

 Score =  805 bits (2078), Expect = 0.0
 Identities = 436/716 (60%), Positives = 518/716 (72%), Gaps = 7/716 (0%)
 Frame = -1

Query: 2490 KPFSSSLRPTTKTPRNPIQTPNINGATPRHSSSWLNKWPCATPLPPLHYKNPRTLQEEST 2311
            +PFSSS   ++ +  +     N N       S WL+KW   +  PP    +P+  Q++  
Sbjct: 37   RPFSSS---SSSSSSSSSLGTNQNPKPNNPKSPWLSKWAPHSSPPPTVKTSPKLAQDKKI 93

Query: 2310 SENQFLDEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEAVQ---DAKKVTGEEKLGD 2140
               Q L +    G +A++RIVLRLRN              E      D+  VTGEE+L D
Sbjct: 94   ---QSLTKD--KGQNAIERIVLRLRNLGLGSDDEEEEGDMEYKPNGGDSIAVTGEERLAD 148

Query: 2139 LLKRDWVRPDRILVEDEE--DSASMSLPWXXXXXXXXXXXXXXXXXXXXXXSIKAPTLAE 1966
            LL+R+WVRPD I ++D+E  D+  + LPW                       +KAPTLAE
Sbjct: 149  LLQREWVRPDTIFIKDDEEDDNDDLVLPWERKEKVRREGEKEEGERERRRV-VKAPTLAE 207

Query: 1965 LTIEDXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSELVRLKFHEDLAHDMK 1786
            LTIED               R+NVPKAG+T EV +KIHDKWRK+ELVRLKFHE LAHDMK
Sbjct: 208  LTIEDEELRRLRRMGMFLRERVNVPKAGLTKEVVEKIHDKWRKNELVRLKFHEVLAHDMK 267

Query: 1785 TAHEVVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVKREGDTLFVPVVSSPGNLL 1606
            TAHE+ ERRTGGLVIWR+GSVMVV+RG++YEGP  S+ Q V REGD LF+P VSS G+  
Sbjct: 268  TAHEITERRTGGLVIWRAGSVMVVYRGSSYEGP-PSKTQPVNREGDALFIPDVSSAGSET 326

Query: 1605 AKSSS-DDSCIDEKSRPVVVPTCAESMTQEEAVFNSLLDGLGPRFEGWWGTGILPVDADL 1429
             K  +   S  +++   +     ++ MT+EE  ++S LD LGPRFE WWGTGILPVDADL
Sbjct: 327  MKGDNVAPSAAEKRELAMRRLDHSKDMTEEEIEYDSFLDSLGPRFEEWWGTGILPVDADL 386

Query: 1428 LPQKVPGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGLAAAIFKL 1249
            LP K+P YKTPFRLLP GMRSRLTNAEMT+LRKLAK LPCHFALGRNRNHQGLA+ I K+
Sbjct: 387  LPPKIPDYKTPFRLLPTGMRSRLTNAEMTNLRKLAKKLPCHFALGRNRNHQGLASTILKV 446

Query: 1248 WEKSLVVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYIVMYRGKDFLPPTVATAL 1069
            WEKSLV KIAVKRGIQNTNNK+MA+ELK LTGGVLLLRNKYYIV+YRGKDFLP +VA AL
Sbjct: 447  WEKSLVAKIAVKRGIQNTNNKLMADELKMLTGGVLLLRNKYYIVIYRGKDFLPTSVAAAL 506

Query: 1068 AERQEMTKQTQDEEEKMRGGPIEPVSTTEED-QALAGTLGEFYEAQARWGREISLDEREK 892
             ERQE+TK+ QD EEK+R   IE V + EE+ + LAGTL EFYEAQ+RWG++ S ++REK
Sbjct: 507  TERQELTKKIQDVEEKVRSREIEAVPSKEEEGKPLAGTLAEFYEAQSRWGKDTSAEDREK 566

Query: 891  MIEEASRGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWVPVGPDDDQETITDEERA 712
            MIE+ +R K  R+VKR+EHKLA++Q           KI  S +P GPD DQETITDEERA
Sbjct: 567  MIEDDTRAKRARIVKRIEHKLAVAQAKKLRAERLLAKIEVSMLPSGPDYDQETITDEERA 626

Query: 711  MFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKEKEIAFVEETARLLEYE 532
            +FRR+GLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKL+SK+K +AF E+TARLLEYE
Sbjct: 627  VFRRIGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLISKQKTLAFAEDTARLLEYE 686

Query: 531  SGGILVAIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRHEALS 364
            SGGILVAIE VPKGF LI+YRGKNYRRPI+LRPRNLLTKAKALKR VA+QRHE  S
Sbjct: 687  SGGILVAIERVPKGFALIYYRGKNYRRPINLRPRNLLTKAKALKRSVAMQRHEVSS 742


>ref|XP_006296939.1| hypothetical protein CARUB_v10012930mg, partial [Capsella rubella]
            gi|482565648|gb|EOA29837.1| hypothetical protein
            CARUB_v10012930mg, partial [Capsella rubella]
          Length = 910

 Score =  800 bits (2067), Expect = 0.0
 Identities = 446/809 (55%), Positives = 543/809 (67%), Gaps = 54/809 (6%)
 Frame = -1

Query: 2508 SSVFFLKPFSSSLRPTTKTPRNPIQTPNINGATPRHSSSWLNKWPCATPLPPLHYKNP-- 2335
            S    ++PFSS LR + ++         ++    + S  W++KWP ++      +     
Sbjct: 75   SRQLIIRPFSS-LRTSERSNNRSHNNRRLDNRNHKPSPPWIDKWPPSSSGAGSDHSGKKG 133

Query: 2334 ---------RTLQEESTSENQFLDEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEAV 2182
                     R+ +EE+ ++ ++L+     G +A++RIVLRLRN              E  
Sbjct: 134  GEHNGGAKIRSAEEEAEAKLRYLERD--KGQNAIERIVLRLRNLGLGSDDEEDVEDDEES 191

Query: 2181 Q----DAKKVTGEEKLGDLLKRDWVRPDRILVEDEE--DSASMSLPWXXXXXXXXXXXXX 2020
                 D K VTGEE+LGDLLKR+WVRPD +L E EE  +   + LPW             
Sbjct: 192  GMNGGDVKLVTGEERLGDLLKREWVRPDMMLAEGEESEEEDDVLLPWEKNEQEQAAERVE 251

Query: 2019 XXXXXXXXXS--IKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDK 1846
                         +AP+LAELT+ED               RIN+PKAG+T  V +KIHD 
Sbjct: 252  GEGGVAVMTKRRARAPSLAELTVEDSELRRLRRDGMYLRVRINIPKAGLTQAVMEKIHDT 311

Query: 1845 WRKSELVRLKFHEDLAHDMKTAHEVVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQS 1666
            WRK ELVRLKFHE LA DMKTAHE+VERRTGG+VIWR+GSVMVV+RG +Y+GP S  +  
Sbjct: 312  WRKEELVRLKFHEVLARDMKTAHEIVERRTGGMVIWRAGSVMVVYRGLDYQGP-SVISNR 370

Query: 1665 VKREGDTLFVPVVSSPGNLLAKSSSDDSCIDEKSRPVVV-PTCAESMTQEEAVFNSLLDG 1489
            +    +TLFVP VSS G+    +  + +   E   P+V  P   ++MT+EE  FN+LLD 
Sbjct: 371  MAGPKETLFVPDVSSAGDEATNAKDNQNPPLEIRDPIVKNPIRKQNMTEEEIEFNNLLDS 430

Query: 1488 LGPRFEGWWGTGILPVDADLLPQKVPGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPC 1309
            LGPRF+ WWGTG+LPVDADLLP  VPGYKTPFRLLP GMRS LTNAEMT+LRK+ K+LPC
Sbjct: 431  LGPRFQEWWGTGVLPVDADLLPPTVPGYKTPFRLLPTGMRSNLTNAEMTNLRKIGKTLPC 490

Query: 1308 HFALGRNRNHQGLAAAIFKLWEKSLVVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNK 1129
            HFALGRNRNHQGLAAAI ++WEKSL+ KIAVKRGIQNTNNK+MA+ELK LTGGVLLLRNK
Sbjct: 491  HFALGRNRNHQGLAAAILQIWEKSLIAKIAVKRGIQNTNNKLMADELKALTGGVLLLRNK 550

Query: 1128 YYIVMYRGKDFLPPTVATALAERQEMTKQTQDEEEKMRGG------------PIEPVSTT 985
            YYIV+YRGKDFLP +VA  LAERQE+TK+ QD EE++R              P+E    T
Sbjct: 551  YYIVIYRGKDFLPSSVAATLAERQELTKEIQDVEERVRTRDIEAIQPVGDKVPVERQELT 610

Query: 984  EEDQ----------------------ALAGTLGEFYEAQARWGREISLDEREKMIEEASR 871
            EE Q                      A AGTL EFYEAQARWG+EI+ D REKMIEEASR
Sbjct: 611  EEIQHVEESVRTRDIKAIQPVGDKVPAEAGTLAEFYEAQARWGKEITPDHREKMIEEASR 670

Query: 870  GKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWVPVGPDDDQETITDEERAMFRRVGL 691
                RVVKR++HKL I Q           KI +S +P GPD DQE I++EERAMFR+VGL
Sbjct: 671  VANARVVKRIQHKLNIGQSKFQRAEKLLSKIEASMIPNGPDYDQEVISEEERAMFRKVGL 730

Query: 690  RMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKEKEIAFVEETARLLEYESGGILVA 511
            +MKAYLPLGIRGVFDGVIENMHLHWKHRELVKL+SK+K +AFVE+TARLLEYESGG+LVA
Sbjct: 731  KMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLISKQKNLAFVEDTARLLEYESGGVLVA 790

Query: 510  IESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRHEALSQHISELEKTIQ 331
            IE VPKGF LI+YRGKNYRRPISLRPRNLLTKAKALKR +A+QRHEALSQHISELE+TI+
Sbjct: 791  IEKVPKGFALIYYRGKNYRRPISLRPRNLLTKAKALKRSIAMQRHEALSQHISELERTIE 850

Query: 330  QTRKEIDVKNGDPALFNNVSEFTESEDEN 244
            Q + ++  KN       N SE+   +D++
Sbjct: 851  QMQSQLTAKNPS----YNESEWENDDDDD 875


>ref|XP_003533366.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like [Glycine max]
          Length = 791

 Score =  800 bits (2065), Expect = 0.0
 Identities = 443/777 (57%), Positives = 546/777 (70%), Gaps = 21/777 (2%)
 Frame = -1

Query: 2508 SSVFFLKPFSSSLRPTTKTPRNPIQT---PNINGATPRHSSSWLNKWPCATPLPPLHYKN 2338
            S + F   F+S   P +   + P +T    ++    P  S+ WL K P           +
Sbjct: 10   SELSFNSSFASLNHPHSSFRKFPFRTLTFASLPTPKPNPSAPWLTKSP-----------S 58

Query: 2337 PRTLQEESTSENQFLDEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEAVQDAKK--- 2167
            P+   E   + +   D   R   +A+DRIVLRLRN              E    A     
Sbjct: 59   PKRAVEPLPAGDPTPD---RKPQNAVDRIVLRLRNLGLPSEEEEQEQEHEEEIPATNPAP 115

Query: 2166 VTGEEKLGDLLKRDWVRPDRILV-EDEEDSASMSLPWXXXXXXXXXXXXXXXXXXXXXXS 1990
            VTGEE+LG+LL+R+WVRPD +LV ED+++   M LPW                       
Sbjct: 116  VTGEERLGELLQREWVRPDAVLVGEDDDEEEEMMLPWERDEEEKEVVVVSEEGLLKKRR- 174

Query: 1989 IKAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSELVRLKFH 1810
            ++AP+LA+LT+ED               R++VPKAG+T EV +KIH +WRK ELVRLKFH
Sbjct: 175  VRAPSLADLTLEDELLRRLRREGMRVRERVSVPKAGLTEEVMEKIHKRWRKEELVRLKFH 234

Query: 1809 EDLAHDMKTAHEVVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVKREGDTLFVPV 1630
            E+LA DM+ AHE+VERRTGGLV WRSGSVM+V+RG +Y+GP  S+ +  +++GD  FVP 
Sbjct: 235  EELAKDMRKAHEIVERRTGGLVTWRSGSVMMVYRGIDYQGP-DSRKELNEKKGDGFFVPD 293

Query: 1629 VSSPGNLLAKSSSDDS--CIDEKSRPVVVPTCAESMTQEEAVFNSLLDGLGPRFEGWWGT 1456
            VS   +  A S+S+ S   + E+  P       E+M++ EA +N+LLDGLGPRF GWWGT
Sbjct: 294  VSKREDSTATSTSEKSEVVVREREHP-------ENMSEAEAEYNALLDGLGPRFFGWWGT 346

Query: 1455 GILPVDADLLPQKVPGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFALGRNRNHQ 1276
            GILPVDADLLP+ VPGYKTPFRLLP GMRSRLTNAEMT+LRKLAKSLPCHFA+GRNRNHQ
Sbjct: 347  GILPVDADLLPRTVPGYKTPFRLLPTGMRSRLTNAEMTNLRKLAKSLPCHFAVGRNRNHQ 406

Query: 1275 GLAAAIFKLWEKSLVVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYIVMYRGKDF 1096
            GLA AI KLWEKSLV KIAVKRGIQNTNN++MAEELK LTGG LLLRNKY+IV+YRGKDF
Sbjct: 407  GLACAILKLWEKSLVSKIAVKRGIQNTNNELMAEELKMLTGGTLLLRNKYFIVIYRGKDF 466

Query: 1095 LPPTVATALAERQEMTKQTQDEEEKMRGGPIEPVSTTE-EDQALAGTLGEFYEAQARWGR 919
            +P +VA  LAER+E+TKQ QD E+K+R   ++ + + + E  A AGTL EFYEAQARWGR
Sbjct: 467  VPTSVAAVLAEREELTKQVQDVEDKVRCRAVDAIPSGQGEATAQAGTLAEFYEAQARWGR 526

Query: 918  EISLDEREKMIEEASRGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWVPVGPDDDQ 739
            EIS DEREKM+EEA++ KT ++V+++EHK+ I+Q           KI +S VP GPD DQ
Sbjct: 527  EISPDEREKMMEEAAKAKTAKLVRQIEHKIFIAQTKKLRAEKLLAKIEASMVPAGPDYDQ 586

Query: 738  ETITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKEKEIAFVE 559
            ETITDEER MFR+VGLRMK YLPLGIRGVFDGV+ENMHLHWKHRELVKL++K+K +AFVE
Sbjct: 587  ETITDEERVMFRKVGLRMKPYLPLGIRGVFDGVVENMHLHWKHRELVKLMTKQKTLAFVE 646

Query: 558  ETARLLEYESGGILVAIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQR 379
            +TARLLEYESGGILVAIE V K F LI+YRGKNY+RPI+LRPRNLLTK KALKR VA+QR
Sbjct: 647  DTARLLEYESGGILVAIEKVSKEFALIYYRGKNYKRPITLRPRNLLTKGKALKRHVAMQR 706

Query: 378  HEALSQHISELEKTIQQTRKEI------DVKNG-----DPALFNNVSEFTESEDENS 241
            HEALSQHI+ELEKTI+Q +KE+      DV++G     D     ++SE   SEDE+S
Sbjct: 707  HEALSQHITELEKTIEQMKKELGMTQDSDVEDGGSIEEDDHNQIDISELALSEDEDS 763


>ref|XP_003530015.2| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1,
            chloroplastic-like isoform X1 [Glycine max]
          Length = 791

 Score =  798 bits (2061), Expect = 0.0
 Identities = 441/775 (56%), Positives = 539/775 (69%), Gaps = 19/775 (2%)
 Frame = -1

Query: 2508 SSVFFLKPFSSSLRPTT-----KTPRNPIQTPNINGATPRHSSSWLNKWPCATPLPPLHY 2344
            S + F   FSS   P       K P   +   ++    P  S+ WL K P          
Sbjct: 10   SELSFKSSFSSLNHPHPPRSFRKFPLRTLTFASLPTPKPNPSAPWLTKSP---------- 59

Query: 2343 KNPRTLQEESTSENQFLDEAVRPGTSAMDRIVLRLRNXXXXXXXXXXXXXXEA-VQDAKK 2167
             +P+   E  T+ +   D+      + ++RIVLRLRN              E    +   
Sbjct: 60   -SPKRATEPLTAGDPIPDKKPH---NPVERIVLRLRNLGLPSEEEEQEEEEEIPANNPAP 115

Query: 2166 VTGEEKLGDLLKRDWVRPDRILVEDEEDSASMSLPWXXXXXXXXXXXXXXXXXXXXXXSI 1987
            VTGEE+LG+LL+R+WVRPD +LV +++    M LPW                       +
Sbjct: 116  VTGEERLGELLRREWVRPDAVLVGEDDGEEEMILPWEREEEKEVVVVVSEEGLLKKRR-V 174

Query: 1986 KAPTLAELTIEDXXXXXXXXXXXXXXXRINVPKAGVTMEVFDKIHDKWRKSELVRLKFHE 1807
            +AP+LA+LT+ED               R++VPKAG+T EV +KIH +WRK ELVRLKFHE
Sbjct: 175  RAPSLADLTLEDELLRRLRREGMRVRERVSVPKAGLTQEVMEKIHKRWRKEELVRLKFHE 234

Query: 1806 DLAHDMKTAHEVVERRTGGLVIWRSGSVMVVFRGTNYEGPLSSQAQSVKREGDTLFVPVV 1627
            +LA DM+ AHE+VERRTGGLV WRSGSVM+V+RG +Y+GP  SQ +  +++GD  FVP V
Sbjct: 235  ELAKDMRKAHEIVERRTGGLVTWRSGSVMMVYRGIDYQGP-DSQKEVNEKKGDGFFVPDV 293

Query: 1626 SSPGNLLAKSSSDDSCIDEKSRPVVVPT-CAESMTQEEAVFNSLLDGLGPRFEGWWGTGI 1450
            S       + SS  +   EKS  VV      E+M++ EA +N+LLDGLGPRF GWWGTGI
Sbjct: 294  SK-----REDSSTATSTSEKSEVVVREREHPENMSEAEAEYNALLDGLGPRFVGWWGTGI 348

Query: 1449 LPVDADLLPQKVPGYKTPFRLLPIGMRSRLTNAEMTDLRKLAKSLPCHFALGRNRNHQGL 1270
            LPVDADLLP+ VPGYKTPFRLLP GMRSRLTNAEMT+LRKLAKSLPCHFALGRNRNHQGL
Sbjct: 349  LPVDADLLPRTVPGYKTPFRLLPTGMRSRLTNAEMTNLRKLAKSLPCHFALGRNRNHQGL 408

Query: 1269 AAAIFKLWEKSLVVKIAVKRGIQNTNNKIMAEELKTLTGGVLLLRNKYYIVMYRGKDFLP 1090
            A AI KLWEKSLV KIAVKRGIQNTNN++MAEELK LTGG LLLRNKY+IV+YRGKDF+P
Sbjct: 409  ACAILKLWEKSLVAKIAVKRGIQNTNNELMAEELKMLTGGTLLLRNKYFIVIYRGKDFVP 468

Query: 1089 PTVATALAERQEMTKQTQDEEEKMRGGPIEPVSTTE-EDQALAGTLGEFYEAQARWGREI 913
             +VA  LAER+E+TKQ QD E+K+R   ++ +   + E  A AGTL EFYEAQARWGREI
Sbjct: 469  TSVAAVLAEREELTKQVQDVEDKVRCRAVDAIPLGQGEATAQAGTLAEFYEAQARWGREI 528

Query: 912  SLDEREKMIEEASRGKTNRVVKRLEHKLAISQXXXXXXXXXXXKIISSWVPVGPDDDQET 733
            S +EREKM+EEA++ KT ++V+++EHK+ I+Q           KI +S VP GPD DQET
Sbjct: 529  SPEEREKMVEEAAKTKTAKLVRQIEHKIFIAQTKKLRAEKLLAKIEASMVPAGPDYDQET 588

Query: 732  ITDEERAMFRRVGLRMKAYLPLGIRGVFDGVIENMHLHWKHRELVKLLSKEKEIAFVEET 553
            ITDEER MFR+VGLRMK YLPLGIRGVFDGV+ENMHLHWKHRELVKL++K+K +AFVE+T
Sbjct: 589  ITDEERVMFRKVGLRMKPYLPLGIRGVFDGVVENMHLHWKHRELVKLMTKQKTVAFVEDT 648

Query: 552  ARLLEYESGGILVAIESVPKGFILIFYRGKNYRRPISLRPRNLLTKAKALKRRVALQRHE 373
            ARLLEYESGGILVAIE V K F LI+YRGKNY+RPI+LRPRNLLTK KALKR VA+QRHE
Sbjct: 649  ARLLEYESGGILVAIEKVSKEFALIYYRGKNYKRPITLRPRNLLTKGKALKRHVAMQRHE 708

Query: 372  ALSQHISELEKTIQQTRKEI------DVKNG-----DPALFNNVSEFTESEDENS 241
            ALSQHI+ELEKTI+Q +KE+      DV++G     D     ++SE   SEDE+S
Sbjct: 709  ALSQHITELEKTIEQMKKELGMTQDSDVEDGGSIEEDDHNQIDISELALSEDEDS 763


Top