BLASTX nr result

ID: Rauwolfia21_contig00006821 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00006821
         (2602 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006346695.1| PREDICTED: pentatricopeptide repeat-containi...  1059   0.0  
ref|XP_004244963.1| PREDICTED: pentatricopeptide repeat-containi...  1048   0.0  
ref|XP_002272226.1| PREDICTED: pentatricopeptide repeat-containi...  1014   0.0  
gb|EXB56341.1| hypothetical protein L484_024883 [Morus notabilis]     984   0.0  
gb|EOY15235.1| Pentatricopeptide repeat (PPR-like) superfamily p...   977   0.0  
gb|EOY15236.1| Pentatricopeptide repeat (PPR-like) superfamily p...   973   0.0  
ref|XP_002510663.1| pentatricopeptide repeat-containing protein,...   970   0.0  
ref|XP_002306972.1| hypothetical protein POPTR_0005s27160g [Popu...   969   0.0  
ref|XP_006279580.1| hypothetical protein CARUB_v10025981mg [Caps...   961   0.0  
gb|EMJ26384.1| hypothetical protein PRUPE_ppa002191mg [Prunus pe...   959   0.0  
ref|XP_004292910.1| PREDICTED: pentatricopeptide repeat-containi...   958   0.0  
ref|XP_006405260.1| hypothetical protein EUTSA_v10027665mg [Eutr...   957   0.0  
ref|XP_004144287.1| PREDICTED: pentatricopeptide repeat-containi...   956   0.0  
ref|NP_199046.1| pentatricopeptide repeat-containing protein [Ar...   955   0.0  
ref|XP_006473770.1| PREDICTED: pentatricopeptide repeat-containi...   953   0.0  
ref|XP_006435342.1| hypothetical protein CICLE_v10000451mg [Citr...   949   0.0  
ref|XP_002865541.1| pentatricopeptide repeat-containing protein ...   947   0.0  
ref|XP_002301924.1| hypothetical protein POPTR_0002s01200g [Popu...   946   0.0  
dbj|BAB10204.1| maize crp1 protein-like [Arabidopsis thaliana]        944   0.0  
ref|XP_004500883.1| PREDICTED: pentatricopeptide repeat-containi...   941   0.0  

>ref|XP_006346695.1| PREDICTED: pentatricopeptide repeat-containing protein At5g42310,
            mitochondrial-like [Solanum tuberosum]
          Length = 697

 Score = 1059 bits (2739), Expect = 0.0
 Identities = 525/634 (82%), Positives = 580/634 (91%)
 Frame = -3

Query: 2417 DKDESYDVASLNNRRYDFTPLLEFLSAYFPSPVDRQSNSPTQLEPTELRLAESYRAVPAP 2238
            ++DE Y + S +N+RYDFTPLL+FLS   P+     ++SPTQL PTELRLAESYRAVPAP
Sbjct: 64   EEDEEYHIGSYSNQRYDFTPLLQFLSTTEPNS---DNSSPTQLHPTELRLAESYRAVPAP 120

Query: 2237 LWHSLLKSLTATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRSEKLYEAFLLSQ 2058
            LWHSLLK L++TPSSIS AYALV WLQKHNLCFSYELLYSILIHALGRSEKLYEAFLLSQ
Sbjct: 121  LWHSLLKGLSSTPSSISIAYALVIWLQKHNLCFSYELLYSILIHALGRSEKLYEAFLLSQ 180

Query: 2057 RQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQSLMRNNSIDST 1878
            RQTLTPLTYNALIGACARN DLEKALNLM +MRRDGYQSD+VNYSLIIQSL+R+NSID T
Sbjct: 181  RQTLTPLTYNALIGACARNGDLEKALNLMCRMRRDGYQSDYVNYSLIIQSLIRSNSIDLT 240

Query: 1877 LLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLSPKTATLVAII 1698
            +L KF  EIEADMIELD QLLND+ VGFAK+GD +RAL F+S++QGNGLSPKTAT+V +I
Sbjct: 241  MLHKFCYEIEADMIELDGQLLNDMIVGFAKAGDVDRALGFMSIVQGNGLSPKTATVVNLI 300

Query: 1697 SELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDVVSEMERSGVA 1518
            SELG SGRT           EGGLKPRTRA+NALLKGYV+TG+LKDAE +VSEME SGVA
Sbjct: 301  SELGNSGRTEEAEAIFEELKEGGLKPRTRAFNALLKGYVKTGSLKDAEYIVSEMESSGVA 360

Query: 1517 PDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRDRGEWQRSFQV 1338
            PDE+TYSLLIDAYGNAGRWESARIVLKEMEAN+V+P+S+VFSRILASYRDRGEWQRSFQV
Sbjct: 361  PDEHTYSLLIDAYGNAGRWESARIVLKEMEANNVQPNSFVFSRILASYRDRGEWQRSFQV 420

Query: 1337 LKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVTWNTLIDCHCK 1158
            LKEM+N+GV PDRQFYN+MIDTFGKYNCLDHAM TFERMKLE IEPDTVTWNTLIDCH K
Sbjct: 421  LKEMKNSGVNPDRQFYNIMIDTFGKYNCLDHAMSTFERMKLEEIEPDTVTWNTLIDCHSK 480

Query: 1157 HGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQSQGLLPNVVT 978
            HGHHNKAE+LFE MQESGC PCTTTYNIM+NSFGE E+WE+V+ LL KMQSQGLLPNVVT
Sbjct: 481  HGHHNKAEELFETMQESGCSPCTTTYNIMINSFGELEKWEEVKCLLSKMQSQGLLPNVVT 540

Query: 977  YTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLADQAVNAFWVMR 798
            YTTL++IYGQSGRFN+AIECLEVMKSAG+KPSSTMYNALINAYAQRGL++QAVNAF +M+
Sbjct: 541  YTTLINIYGQSGRFNDAIECLEVMKSAGLKPSSTMYNALINAYAQRGLSEQAVNAFRIMK 600

Query: 797  GDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLMKALVRVEKFE 618
            GDGLKPSLLALNSLINAF EDRRDAEAFAVL+Y+KEND+KPDVVTYTTLMK L+RVEKFE
Sbjct: 601  GDGLKPSLLALNSLINAFGEDRRDAEAFAVLKYLKENDMKPDVVTYTTLMKTLIRVEKFE 660

Query: 617  KVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            +VPAVYEEML CGC+PDRKARAMLRSALRYMKST
Sbjct: 661  RVPAVYEEMLLCGCIPDRKARAMLRSALRYMKST 694


>ref|XP_004244963.1| PREDICTED: pentatricopeptide repeat-containing protein At5g42310,
            mitochondrial-like [Solanum lycopersicum]
          Length = 699

 Score = 1048 bits (2710), Expect = 0.0
 Identities = 522/634 (82%), Positives = 578/634 (91%)
 Frame = -3

Query: 2417 DKDESYDVASLNNRRYDFTPLLEFLSAYFPSPVDRQSNSPTQLEPTELRLAESYRAVPAP 2238
            +++E  DV S +N+RYDFT LL+FLS   P+     ++SPTQL+PTELRLAESYRAVPAP
Sbjct: 66   EEEEDDDVGSYSNQRYDFTRLLQFLSTTEPNS---DNSSPTQLDPTELRLAESYRAVPAP 122

Query: 2237 LWHSLLKSLTATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRSEKLYEAFLLSQ 2058
            LWHSLLK L++TPSSIS AYALV WLQKHNLC+SYELLYSILIHALGRSEKLYEAFLLSQ
Sbjct: 123  LWHSLLKDLSSTPSSISIAYALVIWLQKHNLCYSYELLYSILIHALGRSEKLYEAFLLSQ 182

Query: 2057 RQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQSLMRNNSIDST 1878
            RQTLTPLTYNALIGACARN DLEKALNLM +MRRDGYQSD+VNYSLIIQSL+R+NSID T
Sbjct: 183  RQTLTPLTYNALIGACARNGDLEKALNLMCRMRRDGYQSDYVNYSLIIQSLIRSNSIDLT 242

Query: 1877 LLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLSPKTATLVAII 1698
            +L KF  EIEADMIELD QLLND+ VGFAK+GD + AL F+SV+QGNGLSPK AT+V +I
Sbjct: 243  MLHKFCYEIEADMIELDGQLLNDMIVGFAKAGDVDTALGFMSVVQGNGLSPKIATVVNLI 302

Query: 1697 SELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDVVSEMERSGVA 1518
            SELG SGRT           EGGLKPRTRA+N+LLKGYV+TG+LKDAE +VSEMERSGVA
Sbjct: 303  SELGNSGRTDEAEAIFEELKEGGLKPRTRAFNSLLKGYVKTGSLKDAEYIVSEMERSGVA 362

Query: 1517 PDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRDRGEWQRSFQV 1338
            PDE+TYSLLIDAYGNAGRWESARIVLKEMEAN+V+P+S+VFSRILASYRDRGEWQRSFQV
Sbjct: 363  PDEHTYSLLIDAYGNAGRWESARIVLKEMEANNVQPNSFVFSRILASYRDRGEWQRSFQV 422

Query: 1337 LKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVTWNTLIDCHCK 1158
            LKEM+N+GV PDRQFYN+MIDTFGKYNCLDHAM TFERMKLE IEPDTVTWNTLIDCH K
Sbjct: 423  LKEMKNSGVNPDRQFYNIMIDTFGKYNCLDHAMSTFERMKLEEIEPDTVTWNTLIDCHSK 482

Query: 1157 HGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQSQGLLPNVVT 978
            HGHHNKAE+LFE MQESGC PCTTTYNIM+NSFGE E+WE+V+ LL KMQSQGLLPNVVT
Sbjct: 483  HGHHNKAEELFEVMQESGCSPCTTTYNIMINSFGELEKWEEVKGLLSKMQSQGLLPNVVT 542

Query: 977  YTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLADQAVNAFWVMR 798
            YTTL++IYGQSGRFN+AIECLEVMKSAG+KPSSTMYNALINAYAQRGL++QAVNAF +M+
Sbjct: 543  YTTLINIYGQSGRFNDAIECLEVMKSAGLKPSSTMYNALINAYAQRGLSEQAVNAFRIMK 602

Query: 797  GDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLMKALVRVEKFE 618
            GDGLKPSLLALNSLINAF EDRRDAEAFAVL+YMKEND+KPDVVTYTTLMK L+RVEKFE
Sbjct: 603  GDGLKPSLLALNSLINAFGEDRRDAEAFAVLKYMKENDMKPDVVTYTTLMKTLIRVEKFE 662

Query: 617  KVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            +VPAVYEEML  GC+PDRKARAMLRSALRYMKST
Sbjct: 663  RVPAVYEEMLLSGCIPDRKARAMLRSALRYMKST 696


>ref|XP_002272226.1| PREDICTED: pentatricopeptide repeat-containing protein At5g42310,
            mitochondrial [Vitis vinifera]
            gi|297745544|emb|CBI40709.3| unnamed protein product
            [Vitis vinifera]
          Length = 695

 Score = 1014 bits (2621), Expect = 0.0
 Identities = 518/697 (74%), Positives = 579/697 (83%), Gaps = 3/697 (0%)
 Frame = -3

Query: 2597 VLQPPPLQFRFPSVPFTPPTFLLHHHQNFQKPLFXXXXXXXXXXXXXXXXPLSSQQLCHG 2418
            +L P PL  RFPS  F  P  +L  H+  Q PL                    S+ L   
Sbjct: 2    LLLPAPLPTRFPSYHFLSP--VLRDHRILQPPLLATTSAAVTTASGEASQF--SKPL--N 55

Query: 2417 DKDESYDVASLNNRRYDFTPLLEFLS---AYFPSPVDRQSNSPTQLEPTELRLAESYRAV 2247
            +   S D+ S+ NRRYDFTPLL FLS   +   S  + +S  PT L+ TE +L ESYRAV
Sbjct: 56   EYGSSGDLNSVPNRRYDFTPLLRFLSNSESDSDSGAEVESPPPTSLDFTEFQLVESYRAV 115

Query: 2246 PAPLWHSLLKSLTATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRSEKLYEAFL 2067
            PAPLWHSLLKSL +  SSI  AY+LVTWL++HNLCFSYELLYSILIHALGRSEKLYEAFL
Sbjct: 116  PAPLWHSLLKSLCSDSSSIGTAYSLVTWLERHNLCFSYELLYSILIHALGRSEKLYEAFL 175

Query: 2066 LSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQSLMRNNSI 1887
            LSQRQTLTPLTYNALIGACARNDDLEKALNLM++MRRDG+ SDFVNYS IIQSL R N  
Sbjct: 176  LSQRQTLTPLTYNALIGACARNDDLEKALNLMSRMRRDGFPSDFVNYSFIIQSLTRTNKS 235

Query: 1886 DSTLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLSPKTATLV 1707
            DS++L+K Y EIE+D IELD QLLNDI VGFAKSGD NRA+ FL+++QGNGLSPKTATLV
Sbjct: 236  DSSMLQKIYAEIESDKIELDGQLLNDIIVGFAKSGDVNRAMSFLAMVQGNGLSPKTATLV 295

Query: 1706 AIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDVVSEMERS 1527
            A+I+ LG +GRT           EGGL PRTRAYNALLKGYV+TG+LKDAE +VSEMERS
Sbjct: 296  AVITALGNAGRTEEAEAIFEELKEGGLMPRTRAYNALLKGYVKTGSLKDAESIVSEMERS 355

Query: 1526 GVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRDRGEWQRS 1347
            G +PDE+TYSLLIDAY NAGRWESARIVLKEMEA+ V P+SYVFSRILASYRDRG+WQ+S
Sbjct: 356  GFSPDEHTYSLLIDAYANAGRWESARIVLKEMEASGVRPNSYVFSRILASYRDRGKWQKS 415

Query: 1346 FQVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVTWNTLIDC 1167
            FQVL+EMRN+GV+PDR FYNVMIDTFGK NCLDHA+ TF+RM++EG++PD VTWNTLIDC
Sbjct: 416  FQVLREMRNSGVSPDRHFYNVMIDTFGKCNCLDHALATFDRMRMEGVQPDAVTWNTLIDC 475

Query: 1166 HCKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQSQGLLPN 987
            HCK GHHNKAE+LFEAMQESGC PCTTTYNIM+NSFGEQERWEDV+ LL KMQSQGLL N
Sbjct: 476  HCKSGHHNKAEELFEAMQESGCSPCTTTYNIMINSFGEQERWEDVKTLLGKMQSQGLLAN 535

Query: 986  VVTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLADQAVNAFW 807
            VVTYTTLVDIYGQSGRF +AIECLEVMKS G+KPSSTMYNALINAYAQRGL++QA+NAF 
Sbjct: 536  VVTYTTLVDIYGQSGRFKDAIECLEVMKSVGLKPSSTMYNALINAYAQRGLSEQAINAFR 595

Query: 806  VMRGDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLMKALVRVE 627
            VMR DGLKPS+L LNSLINAF EDRRDAEAF+VLQYMKENDLKPDVVTYTTLMKAL+RVE
Sbjct: 596  VMRADGLKPSVLVLNSLINAFGEDRRDAEAFSVLQYMKENDLKPDVVTYTTLMKALIRVE 655

Query: 626  KFEKVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            KF+KVPAVYEEM   GC PDRKARAMLRSALRYM+ T
Sbjct: 656  KFDKVPAVYEEMTLSGCTPDRKARAMLRSALRYMERT 692


>gb|EXB56341.1| hypothetical protein L484_024883 [Morus notabilis]
          Length = 734

 Score =  984 bits (2543), Expect = 0.0
 Identities = 500/703 (71%), Positives = 570/703 (81%), Gaps = 8/703 (1%)
 Frame = -3

Query: 2600 RVLQPPPLQFRFPSVPFTPPTFLLHHHQNF------QKPLFXXXXXXXXXXXXXXXXPLS 2439
            ++L PPP   +FPS+  T      HHHQ+       Q  LF                  S
Sbjct: 2    QLLLPPPASAKFPSIQTTTTFHTRHHHQHHYHYTSSQLLLFSAAASAVSTSGSGEAPLSS 61

Query: 2438 SQQLCHGDKDESYDVASLNNRRYDFTPLLEFLS--AYFPSPVDRQSNSPTQLEPTELRLA 2265
            S        D+  D+ SL NRRYDF PLL FLS      +  +  S+ PT L+  E  LA
Sbjct: 62   SSSSMRRRFDDENDLVSLRNRRYDFNPLLNFLSNRTNISAATESGSDPPTSLDREEFELA 121

Query: 2264 ESYRAVPAPLWHSLLKSLTATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRSEK 2085
            ESYRAVPA LWHSLLKSL +  SSI  AYA+V+WLQKHNLCFSYELLYSILIHALGRSEK
Sbjct: 122  ESYRAVPALLWHSLLKSLCSKSSSIGLAYAVVSWLQKHNLCFSYELLYSILIHALGRSEK 181

Query: 2084 LYEAFLLSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQSL 1905
            LYEAFLLSQRQTLTPLTYNALIGACARNDDLEKALNLMA+MR+DG+ SDFVNYSLIIQSL
Sbjct: 182  LYEAFLLSQRQTLTPLTYNALIGACARNDDLEKALNLMARMRQDGFPSDFVNYSLIIQSL 241

Query: 1904 MRNNSIDSTLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLSP 1725
             R N IDS +L+K Y EIE D IELD QLLNDI VGFAK+GD ++A++FL+V+Q  GLSP
Sbjct: 242  TRKNKIDSPILQKLYKEIECDKIELDGQLLNDIIVGFAKAGDPSQAMHFLAVVQAMGLSP 301

Query: 1724 KTATLVAIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDVV 1545
            KTATL A+IS LG SGR            +GGL+PRTRAYNALLKGYV+  +LKDAE VV
Sbjct: 302  KTATLTAVISALGNSGRIVEAEALFEEIKDGGLQPRTRAYNALLKGYVKASSLKDAESVV 361

Query: 1544 SEMERSGVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRDR 1365
            SEME +GV+PDE+TYSLLIDAY NAGRWESARIVLKEMEA++V+P+SYVFSRILASYRDR
Sbjct: 362  SEMEMNGVSPDEHTYSLLIDAYANAGRWESARIVLKEMEASNVQPNSYVFSRILASYRDR 421

Query: 1364 GEWQRSFQVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVTW 1185
            GEWQ++FQVL+EM+++GV PDR FYNVMIDTFGK+NCLDHAM TFERM L+GI+PDTVTW
Sbjct: 422  GEWQKTFQVLREMKSSGVRPDRHFYNVMIDTFGKFNCLDHAMATFERMILDGIQPDTVTW 481

Query: 1184 NTLIDCHCKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQS 1005
            NTLI+CHCK G H +AE+LFE MQE G  PC TTYNI++NSFGEQERW+DV+ LL KMQS
Sbjct: 482  NTLINCHCKAGRHERAEELFEEMQERGYPPCATTYNILINSFGEQERWDDVKVLLGKMQS 541

Query: 1004 QGLLPNVVTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLADQ 825
            QGLLPNVVTYTTL+DIYGQSGRFN+A++CL+ MK++G+KPSSTMYNALINAYAQRGL++Q
Sbjct: 542  QGLLPNVVTYTTLIDIYGQSGRFNDAMDCLQDMKTSGLKPSSTMYNALINAYAQRGLSEQ 601

Query: 824  AVNAFWVMRGDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLMK 645
            A+NAF +MRGDGLKPS+LALNSLINAF EDRRDAEAFAVLQYMKEN LKPDVVTYTTLMK
Sbjct: 602  ALNAFRLMRGDGLKPSILALNSLINAFGEDRRDAEAFAVLQYMKENGLKPDVVTYTTLMK 661

Query: 644  ALVRVEKFEKVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            AL RV+KF+KVP VYEEM+S GC PDRKAR MLRSALRYMK T
Sbjct: 662  ALNRVDKFDKVPVVYEEMISSGCTPDRKAREMLRSALRYMKQT 704


>gb|EOY15235.1| Pentatricopeptide repeat (PPR-like) superfamily protein isoform 1
            [Theobroma cacao]
          Length = 703

 Score =  977 bits (2525), Expect = 0.0
 Identities = 496/699 (70%), Positives = 575/699 (82%), Gaps = 7/699 (1%)
 Frame = -3

Query: 2597 VLQPPPLQFRFPSVPFTPPTFLLHHHQNFQKPLFXXXXXXXXXXXXXXXXPLSSQQLCHG 2418
            +L PPPL  RFPS+  + P   LH   + Q  ++                     +  + 
Sbjct: 2    LLLPPPLPARFPSIQLSSPITRLH--VSLQTSIYTAAAATAAEASISLSIDKDKDRDRYD 59

Query: 2417 DKDESY-DVASLNNRRYDFTPLLEFLSAYFPSP-VDRQSNSPTQLEPTELRLAESYRAVP 2244
            D+D+   DV S++ RRYDFTPLL +LS+    P  D  S SPT L+P E +LAESYRAVP
Sbjct: 60   DEDDDQSDVLSIHKRRYDFTPLLNYLSSSNSEPDSDSDSASPTSLDPIEFQLAESYRAVP 119

Query: 2243 APLWHSLLKSL-----TATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRSEKLY 2079
            APLWHSLLKS+     +++ SSI+ AYA+V+WLQ+HNLCFSYELLYSILIHALGRSEKLY
Sbjct: 120  APLWHSLLKSMCSSSSSSSSSSINLAYAVVSWLQRHNLCFSYELLYSILIHALGRSEKLY 179

Query: 2078 EAFLLSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQSLMR 1899
            EAFLLSQRQTLTPLTYNALI ACARN+DLEKALNLM++MR+DGYQSDFVNYSLIIQSL R
Sbjct: 180  EAFLLSQRQTLTPLTYNALINACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTR 239

Query: 1898 NNSIDSTLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLSPKT 1719
            +N IDS+LL+K Y EIE D IE+D QLLNDI VGFAK+ D + AL FL++ Q  GL+PKT
Sbjct: 240  SNKIDSSLLQKLYGEIECDKIEVDGQLLNDIIVGFAKANDPSHALKFLAMAQAIGLNPKT 299

Query: 1718 ATLVAIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDVVSE 1539
            ATLVA+I  LG  GR              GLKPRTRAYNALLKGYV+ G+LKDAE VVSE
Sbjct: 300  ATLVAVIYSLGCCGRIAEAEAVFEEMKGTGLKPRTRAYNALLKGYVKAGSLKDAELVVSE 359

Query: 1538 MERSGVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRDRGE 1359
            MERSGV+PDE+TYSLLIDAY NAGRWESARIVLKEMEAN+V+P+S+V+SRILASYR++GE
Sbjct: 360  MERSGVSPDEHTYSLLIDAYANAGRWESARIVLKEMEANNVQPNSFVYSRILASYRNKGE 419

Query: 1358 WQRSFQVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVTWNT 1179
            WQRSFQVL+EM++NG+ PDR FYNVMIDTFGKYNCLDHAMDTF+RM  EGI+PDTVTWNT
Sbjct: 420  WQRSFQVLREMKSNGIQPDRHFYNVMIDTFGKYNCLDHAMDTFDRMLSEGIKPDTVTWNT 479

Query: 1178 LIDCHCKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQSQG 999
            LIDCHCK G H +AE+LFE M+ESG  PCTTTYNIM+NSFG QERW++V+ LL KMQSQG
Sbjct: 480  LIDCHCKAGRHGRAEELFEEMKESGYSPCTTTYNIMINSFGGQERWDNVKSLLGKMQSQG 539

Query: 998  LLPNVVTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLADQAV 819
            LLPN+VTYTTLVDIYG+SGRF++A+ECLE+MKSAG+KPS TMYNALINAYAQRGL++QA+
Sbjct: 540  LLPNIVTYTTLVDIYGKSGRFSDAMECLELMKSAGLKPSLTMYNALINAYAQRGLSEQAI 599

Query: 818  NAFWVMRGDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLMKAL 639
            NA  +MR DGLKP+LLALNSLINAF EDRRD EAFAVLQYMKEND+KPDVVTYTTLMK+L
Sbjct: 600  NALRIMRADGLKPNLLALNSLINAFGEDRRDVEAFAVLQYMKENDVKPDVVTYTTLMKSL 659

Query: 638  VRVEKFEKVPAVYEEMLSCGCMPDRKARAMLRSALRYMK 522
            +RV+KF KVPAVYEEM+  GC PDRKARAMLRSALRYMK
Sbjct: 660  IRVDKFHKVPAVYEEMILSGCTPDRKARAMLRSALRYMK 698


>gb|EOY15236.1| Pentatricopeptide repeat (PPR-like) superfamily protein isoform 2,
            partial [Theobroma cacao]
          Length = 698

 Score =  973 bits (2515), Expect = 0.0
 Identities = 494/695 (71%), Positives = 572/695 (82%), Gaps = 7/695 (1%)
 Frame = -3

Query: 2585 PPLQFRFPSVPFTPPTFLLHHHQNFQKPLFXXXXXXXXXXXXXXXXPLSSQQLCHGDKDE 2406
            PPL  RFPS+  + P   LH   + Q  ++                     +  + D+D+
Sbjct: 1    PPLPARFPSIQLSSPITRLH--VSLQTSIYTAAAATAAEASISLSIDKDKDRDRYDDEDD 58

Query: 2405 SY-DVASLNNRRYDFTPLLEFLSAYFPSP-VDRQSNSPTQLEPTELRLAESYRAVPAPLW 2232
               DV S++ RRYDFTPLL +LS+    P  D  S SPT L+P E +LAESYRAVPAPLW
Sbjct: 59   DQSDVLSIHKRRYDFTPLLNYLSSSNSEPDSDSDSASPTSLDPIEFQLAESYRAVPAPLW 118

Query: 2231 HSLLKSL-----TATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRSEKLYEAFL 2067
            HSLLKS+     +++ SSI+ AYA+V+WLQ+HNLCFSYELLYSILIHALGRSEKLYEAFL
Sbjct: 119  HSLLKSMCSSSSSSSSSSINLAYAVVSWLQRHNLCFSYELLYSILIHALGRSEKLYEAFL 178

Query: 2066 LSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQSLMRNNSI 1887
            LSQRQTLTPLTYNALI ACARN+DLEKALNLM++MR+DGYQSDFVNYSLIIQSL R+N I
Sbjct: 179  LSQRQTLTPLTYNALINACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRSNKI 238

Query: 1886 DSTLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLSPKTATLV 1707
            DS+LL+K Y EIE D IE+D QLLNDI VGFAK+ D + AL FL++ Q  GL+PKTATLV
Sbjct: 239  DSSLLQKLYGEIECDKIEVDGQLLNDIIVGFAKANDPSHALKFLAMAQAIGLNPKTATLV 298

Query: 1706 AIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDVVSEMERS 1527
            A+I  LG  GR              GLKPRTRAYNALLKGYV+ G+LKDAE VVSEMERS
Sbjct: 299  AVIYSLGCCGRIAEAEAVFEEMKGTGLKPRTRAYNALLKGYVKAGSLKDAELVVSEMERS 358

Query: 1526 GVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRDRGEWQRS 1347
            GV+PDE+TYSLLIDAY NAGRWESARIVLKEMEAN+V+P+S+V+SRILASYR++GEWQRS
Sbjct: 359  GVSPDEHTYSLLIDAYANAGRWESARIVLKEMEANNVQPNSFVYSRILASYRNKGEWQRS 418

Query: 1346 FQVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVTWNTLIDC 1167
            FQVL+EM++NG+ PDR FYNVMIDTFGKYNCLDHAMDTF+RM  EGI+PDTVTWNTLIDC
Sbjct: 419  FQVLREMKSNGIQPDRHFYNVMIDTFGKYNCLDHAMDTFDRMLSEGIKPDTVTWNTLIDC 478

Query: 1166 HCKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQSQGLLPN 987
            HCK G H +AE+LFE M+ESG  PCTTTYNIM+NSFG QERW++V+ LL KMQSQGLLPN
Sbjct: 479  HCKAGRHGRAEELFEEMKESGYSPCTTTYNIMINSFGGQERWDNVKSLLGKMQSQGLLPN 538

Query: 986  VVTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLADQAVNAFW 807
            +VTYTTLVDIYG+SGRF++A+ECLE+MKSAG+KPS TMYNALINAYAQRGL++QA+NA  
Sbjct: 539  IVTYTTLVDIYGKSGRFSDAMECLELMKSAGLKPSLTMYNALINAYAQRGLSEQAINALR 598

Query: 806  VMRGDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLMKALVRVE 627
            +MR DGLKP+LLALNSLINAF EDRRD EAFAVLQYMKEND+KPDVVTYTTLMK+L+RV+
Sbjct: 599  IMRADGLKPNLLALNSLINAFGEDRRDVEAFAVLQYMKENDVKPDVVTYTTLMKSLIRVD 658

Query: 626  KFEKVPAVYEEMLSCGCMPDRKARAMLRSALRYMK 522
            KF KVPAVYEEM+  GC PDRKARAMLRSALRYMK
Sbjct: 659  KFHKVPAVYEEMILSGCTPDRKARAMLRSALRYMK 693


>ref|XP_002510663.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223551364|gb|EEF52850.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 695

 Score =  970 bits (2508), Expect = 0.0
 Identities = 486/694 (70%), Positives = 568/694 (81%)
 Frame = -3

Query: 2597 VLQPPPLQFRFPSVPFTPPTFLLHHHQNFQKPLFXXXXXXXXXXXXXXXXPLSSQQLCHG 2418
            +L PPP   RFPS+  T P  + H+H + Q P                     S  L + 
Sbjct: 2    LLFPPPPATRFPSITVTCPIPVRHYHHHSQLPPLSATNTSATAIASSFSNL--SLSLDNN 59

Query: 2417 DKDESYDVASLNNRRYDFTPLLEFLSAYFPSPVDRQSNSPTQLEPTELRLAESYRAVPAP 2238
             KD   D+ +L +RRYDFTPLL FLS    +  +  S+SPT L+ TE +LAESYRAVP P
Sbjct: 60   QKDTEQDILALQSRRYDFTPLLNFLSNQIKASPNT-SSSPTSLDTTEFQLAESYRAVPGP 118

Query: 2237 LWHSLLKSLTATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRSEKLYEAFLLSQ 2058
            LWHSLLKSL+++ SSI  AYA+V+WLQKHNLCFSYELLYSILIHALGRSEKLYEAFLLSQ
Sbjct: 119  LWHSLLKSLSSSSSSIGLAYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFLLSQ 178

Query: 2057 RQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQSLMRNNSIDST 1878
            +Q L+PLTYNALI ACARN+DLEKA+NL+++MR+DGY SDFVNYSLIIQSL+R+N IDS 
Sbjct: 179  QQALSPLTYNALINACARNNDLEKAINLISRMRQDGYPSDFVNYSLIIQSLVRSNRIDSP 238

Query: 1877 LLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLSPKTATLVAII 1698
            +L+K Y EI+ D +ELD QL NDI VGFAK+GD N+A+ FL ++Q +GLSP+TATL+A+I
Sbjct: 239  ILQKLYSEIQCDKLELDVQLSNDIIVGFAKAGDPNKAMEFLGMVQASGLSPRTATLIAVI 298

Query: 1697 SELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDVVSEMERSGVA 1518
            S LG SGR            + GLKP+TRAYN LLKGYV+ G LKDAE +VSEMERSGV+
Sbjct: 299  SALGDSGRIIEAEAIFEEMKDNGLKPKTRAYNGLLKGYVKAGMLKDAEFIVSEMERSGVS 358

Query: 1517 PDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRDRGEWQRSFQV 1338
            PDE TYSLLIDAY NAGRWESARIVLKEMEAN++ P+SYVFSRILASYRDRGEWQ+SFQV
Sbjct: 359  PDECTYSLLIDAYSNAGRWESARIVLKEMEANNIMPNSYVFSRILASYRDRGEWQKSFQV 418

Query: 1337 LKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVTWNTLIDCHCK 1158
            LKEM+N+GV PDR FYNVMIDTFGK++CLDHAMDTF++M  EGI+PDTVTWNTLIDCHCK
Sbjct: 419  LKEMKNSGVRPDRHFYNVMIDTFGKFSCLDHAMDTFDKMLSEGIQPDTVTWNTLIDCHCK 478

Query: 1157 HGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQSQGLLPNVVT 978
               H +AE+LFE M E G  PC TT+NIM+NSFGEQERW+DV+ L+  M+S GLLPNVVT
Sbjct: 479  AELHERAEELFEEMMEKGFSPCVTTFNIMINSFGEQERWDDVKTLMGNMRSLGLLPNVVT 538

Query: 977  YTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLADQAVNAFWVMR 798
            YTTL+DIYG+SGRF++AIECLE MKSAG+KPSSTMYNALINAYAQ+GL++QAVNAF +MR
Sbjct: 539  YTTLIDIYGKSGRFSDAIECLEDMKSAGLKPSSTMYNALINAYAQKGLSEQAVNAFRLMR 598

Query: 797  GDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLMKALVRVEKFE 618
             D LKPSLLALNSLINAF EDRRDAEAF+VL+YMKENDLKPDVVTYTTLMKAL+RV+KF 
Sbjct: 599  ADSLKPSLLALNSLINAFGEDRRDAEAFSVLKYMKENDLKPDVVTYTTLMKALIRVDKFN 658

Query: 617  KVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            KVP+VYEEM+  GC PDRKARAMLRSAL+YMK T
Sbjct: 659  KVPSVYEEMILAGCTPDRKARAMLRSALKYMKQT 692


>ref|XP_002306972.1| hypothetical protein POPTR_0005s27160g [Populus trichocarpa]
            gi|222856421|gb|EEE93968.1| hypothetical protein
            POPTR_0005s27160g [Populus trichocarpa]
          Length = 709

 Score =  969 bits (2504), Expect = 0.0
 Identities = 495/707 (70%), Positives = 567/707 (80%), Gaps = 13/707 (1%)
 Frame = -3

Query: 2597 VLQPPPLQFRFPSVPFTPPTFLLHHHQN----FQKPLFXXXXXXXXXXXXXXXXP---LS 2439
            +L PPPL  RFPSV  T P    HHH +    FQ P                       +
Sbjct: 2    LLFPPPLPNRFPSVYTTSPIVHHHHHHHHHLIFQPPFSATSTTTNFADSSLSYSKRLHYA 61

Query: 2438 SQQLCHGDKDESYDVASLNNRRYDFTPLLEFLSAYFPSPVDRQSNS------PTQLEPTE 2277
            SQ    G  D+  DV  L +RRYDFTPLL++LS    +  D  S+S      PT L+PTE
Sbjct: 62   SQNDIEGFSDD--DVLPLQSRRYDFTPLLDYLSKKITTSTDTDSDSDSASSSPTSLDPTE 119

Query: 2276 LRLAESYRAVPAPLWHSLLKSLTATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALG 2097
             +LAESYR VP PLWHSLLKSL  + SSI  AYA+V WLQKHNLCFSYELLYSILIHALG
Sbjct: 120  FQLAESYRVVPGPLWHSLLKSLCTSSSSIGLAYAVVLWLQKHNLCFSYELLYSILIHALG 179

Query: 2096 RSEKLYEAFLLSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLI 1917
            +SEKLYEAFLLSQRQ LTPLTYNALI ACARN+DLEKALNL+ +MR+DGY SDFVNYSLI
Sbjct: 180  QSEKLYEAFLLSQRQNLTPLTYNALISACARNNDLEKALNLITRMRQDGYPSDFVNYSLI 239

Query: 1916 IQSLMRNNSIDSTLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGN 1737
            I+SLMR N +DS +L+K Y EIE D +ELD QL NDI VGFAK+GD ++AL FL V+QG+
Sbjct: 240  IRSLMRKNRVDSAILQKLYREIECDKLELDVQLSNDIIVGFAKAGDLSKALEFLGVVQGS 299

Query: 1736 GLSPKTATLVAIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDA 1557
            GLS KTATLVA+I  LG  GRT           + GLKPRTRAYNALL+GYV+ G LKDA
Sbjct: 300  GLSVKTATLVAVIWALGNCGRTVEAEAIFEEMRDNGLKPRTRAYNALLRGYVKAGLLKDA 359

Query: 1556 EDVVSEMERSGVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILAS 1377
            E VVSEMERSGV+P+E TYS LIDAYGNAGRWESARIVLKEMEA++V+P++YVFSRIL+S
Sbjct: 360  EFVVSEMERSGVSPNEQTYSFLIDAYGNAGRWESARIVLKEMEASNVQPNAYVFSRILSS 419

Query: 1376 YRDRGEWQRSFQVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPD 1197
            YRD+GEWQ+SFQVL+EM N+GV PDR FYNVMIDTFGK+NCLDHAM TF+RM  EGIEPD
Sbjct: 420  YRDKGEWQKSFQVLREMENSGVRPDRVFYNVMIDTFGKFNCLDHAMATFDRMLSEGIEPD 479

Query: 1196 TVTWNTLIDCHCKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLR 1017
            TVTWNTLIDCHC+ G H++AE+LFE M E G  PC TT+NIM+NSFG+QERW+DV++LL 
Sbjct: 480  TVTWNTLIDCHCRAGKHDRAEELFEEMMEGGYSPCNTTFNIMINSFGDQERWDDVKNLLA 539

Query: 1016 KMQSQGLLPNVVTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRG 837
             M+SQGL+PN VTYTTL+DIYG+SGRFN+AIECL+ MK+AG+KPSSTMYNALINAYAQRG
Sbjct: 540  HMRSQGLVPNSVTYTTLIDIYGKSGRFNDAIECLDDMKAAGLKPSSTMYNALINAYAQRG 599

Query: 836  LADQAVNAFWVMRGDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYT 657
            L++QAV+AF  MR DGLKPSLLALNSLINAF EDRRDAEAF VLQYMKENDLKPDVVTYT
Sbjct: 600  LSEQAVSAFRAMRVDGLKPSLLALNSLINAFGEDRRDAEAFTVLQYMKENDLKPDVVTYT 659

Query: 656  TLMKALVRVEKFEKVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            TLMKAL+RVEKF+KVP+VYEEM+  GC PDRKARAMLRSAL+YMK T
Sbjct: 660  TLMKALIRVEKFDKVPSVYEEMILSGCTPDRKARAMLRSALKYMKQT 706



 Score =  182 bits (462), Expect = 6e-43
 Identities = 127/522 (24%), Positives = 240/522 (45%), Gaps = 8/522 (1%)
 Frame = -3

Query: 2261 SYRAVPAPLWHSLLKSLTATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRS--- 2091
            S R    PL ++ L S  A  + + KA  L+T +++      + + YS++I +L R    
Sbjct: 191  SQRQNLTPLTYNALISACARNNDLEKALNLITRMRQDGYPSDF-VNYSLIIRSLMRKNRV 249

Query: 2090 -----EKLYEAFLLSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNY 1926
                 +KLY      + +    L+ + ++G  A+  DL KAL  +  ++  G        
Sbjct: 250  DSAILQKLYREIECDKLELDVQLSNDIIVGF-AKAGDLSKALEFLGVVQGSGLSVKTATL 308

Query: 1925 SLIIQSLMRNNSIDSTLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVI 1746
              +I +L   N   +   E  + E+  + ++   +  N +  G+ K+G    A + +S +
Sbjct: 309  VAVIWAL--GNCGRTVEAEAIFEEMRDNGLKPRTRAYNALLRGYVKAGLLKDAEFVVSEM 366

Query: 1745 QGNGLSPKTATLVAIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGAL 1566
            + +G+SP   T   +I   G +GR               ++P    ++ +L  Y   G  
Sbjct: 367  ERSGVSPNEQTYSFLIDAYGNAGRWESARIVLKEMEASNVQPNAYVFSRILSSYRDKGEW 426

Query: 1565 KDAEDVVSEMERSGVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRI 1386
            + +  V+ EME SGV PD   Y+++ID +G     + A      M +  +EP +  ++ +
Sbjct: 427  QKSFQVLREMENSGVRPDRVFYNVMIDTFGKFNCLDHAMATFDRMLSEGIEPDTVTWNTL 486

Query: 1385 LASYRDRGEWQRSFQVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGI 1206
            +  +   G+  R+ ++ +EM   G +P    +N+MI++FG     D   +    M+ +G+
Sbjct: 487  IDCHCRAGKHDRAEELFEEMMEGGYSPCNTTFNIMINSFGDQERWDDVKNLLAHMRSQGL 546

Query: 1205 EPDTVTWNTLIDCHCKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQD 1026
             P++VT+ TLID + K G  N A +  + M+ +G  P +T YN ++N++ ++   E    
Sbjct: 547  VPNSVTYTTLIDIYGKSGRFNDAIECLDDMKAAGLKPSSTMYNALINAYAQRGLSEQAVS 606

Query: 1025 LLRKMQSQGLLPNVVTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYA 846
              R M+  GL P+++   +L++ +G+  R  EA   L+ MK   +KP    Y  L+ A  
Sbjct: 607  AFRAMRVDGLKPSLLALNSLINAFGEDRRDAEAFTVLQYMKENDLKPDVVTYTTLMKALI 666

Query: 845  QRGLADQAVNAFWVMRGDGLKPSLLALNSLINAFSEDRRDAE 720
            +    D+  + +  M   G  P   A   L +A    ++  E
Sbjct: 667  RVEKFDKVPSVYEEMILSGCTPDRKARAMLRSALKYMKQTLE 708


>ref|XP_006279580.1| hypothetical protein CARUB_v10025981mg [Capsella rubella]
            gi|482548284|gb|EOA12478.1| hypothetical protein
            CARUB_v10025981mg [Capsella rubella]
          Length = 708

 Score =  961 bits (2483), Expect = 0.0
 Identities = 486/702 (69%), Positives = 565/702 (80%), Gaps = 8/702 (1%)
 Frame = -3

Query: 2597 VLQPPPLQFRFPSVPF-TPPTFLLHHHQNFQKPLFXXXXXXXXXXXXXXXXPLSSQQLCH 2421
            +LQPP +  RF S+ F T      HHH+ FQ P+                   SS     
Sbjct: 3    LLQPPLVSSRFHSLYFLTHHHHHNHHHRFFQPPISAFSATASASSPSSSSSYFSSWNGLD 62

Query: 2420 GDKDESYDVASLN-NRRYDFTPLLEFLSAYFP------SPVDRQSNSPTQLEPTELRLAE 2262
              K+E  D  +   +RRYDF+PLL+ LS + P      S     +N+ T LEP E  LAE
Sbjct: 63   KPKEEEDDEFTTEVHRRYDFSPLLKHLSRFGPVELVLDSESKSDTNTDTSLEPVEFELAE 122

Query: 2261 SYRAVPAPLWHSLLKSLTATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRSEKL 2082
            SYRAVPAP WHSL+KSL+++ SS+  AYA+V+WLQKHNLCFSYELLYSILIHALGRSEKL
Sbjct: 123  SYRAVPAPYWHSLIKSLSSSTSSLGLAYAVVSWLQKHNLCFSYELLYSILIHALGRSEKL 182

Query: 2081 YEAFLLSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQSLM 1902
            YEAFLLSQ+QTLTPLTYNALIGACARN+D+EKALNL++KMR+DGYQSDFVNYSL+IQSL 
Sbjct: 183  YEAFLLSQKQTLTPLTYNALIGACARNNDIEKALNLISKMRQDGYQSDFVNYSLVIQSLT 242

Query: 1901 RNNSIDSTLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLSPK 1722
            R+N IDS +L++ Y EIE D +E D QL+NDI +GFAKSGD +RAL  L + Q  GLS K
Sbjct: 243  RSNKIDSVMLQRLYKEIERDKLEFDVQLVNDIIMGFAKSGDPSRALQLLGMAQATGLSAK 302

Query: 1721 TATLVAIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDVVS 1542
            TATLV+IIS L  SGRT           + G+KPRT+AYNALLKGYV+TG LKDAE +VS
Sbjct: 303  TATLVSIISALASSGRTDEAEALFEELRQSGIKPRTKAYNALLKGYVKTGPLKDAESMVS 362

Query: 1541 EMERSGVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRDRG 1362
            EME+ GV+PDE+TYSLLIDAY NAGRWESARIVLKEMEA  V+P+S+VFSR+LA YRDRG
Sbjct: 363  EMEKRGVSPDEHTYSLLIDAYVNAGRWESARIVLKEMEAGDVQPNSFVFSRLLAGYRDRG 422

Query: 1361 EWQRSFQVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVTWN 1182
            EWQ++FQVLKEM++ GV PDRQFYNV+IDTFGK+NCLDHAM TF+RM  EGIEPD VTWN
Sbjct: 423  EWQKTFQVLKEMKSIGVKPDRQFYNVVIDTFGKFNCLDHAMTTFDRMLSEGIEPDRVTWN 482

Query: 1181 TLIDCHCKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQSQ 1002
            TLIDCHCKHG H  AE +FEAM+  GCLPC TTYNIM+NS+G+QERW+D++ LL KM+SQ
Sbjct: 483  TLIDCHCKHGRHIVAEDMFEAMERRGCLPCATTYNIMINSYGDQERWDDMKRLLGKMKSQ 542

Query: 1001 GLLPNVVTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLADQA 822
            G+LPNVVT+TTLVD+YG+SGRFN+AIECLE MKS G+KPSSTMYNALINAYAQRGL++QA
Sbjct: 543  GILPNVVTHTTLVDVYGKSGRFNDAIECLEEMKSVGLKPSSTMYNALINAYAQRGLSEQA 602

Query: 821  VNAFWVMRGDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLMKA 642
            VNAF VM  DGLKPSLLALNSLINAF EDRRDAEAFAVLQYMKEN +KPDVVTYTTLMKA
Sbjct: 603  VNAFRVMTSDGLKPSLLALNSLINAFGEDRRDAEAFAVLQYMKENGVKPDVVTYTTLMKA 662

Query: 641  LVRVEKFEKVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            L+RV+KF+KVP VYEEM+  GC PDRKAR+MLRSALRYMK T
Sbjct: 663  LIRVDKFQKVPGVYEEMIMSGCKPDRKARSMLRSALRYMKQT 704


>gb|EMJ26384.1| hypothetical protein PRUPE_ppa002191mg [Prunus persica]
          Length = 703

 Score =  959 bits (2479), Expect = 0.0
 Identities = 486/696 (69%), Positives = 561/696 (80%), Gaps = 5/696 (0%)
 Frame = -3

Query: 2588 PPPLQFRFPSVPFTPPTFLLHHHQNFQK----PLFXXXXXXXXXXXXXXXXPLSSQQLCH 2421
            P P   RFPS     P  + HHH +        L                 P+ S     
Sbjct: 7    PVPGSTRFPSFQLASPILIRHHHHHHHMFHLWSLSATTAAAVTNSSQAHLPPIPSSTSTS 66

Query: 2420 GDKDESYDVASLNNRRYDFTPLLEFLSAYFPSPVDRQSNSPTQLEPTELRLAESYRAVPA 2241
              +  S+D       RYDF+PLL FL+A   S +   ++SPT L+P E +LAESYRAVPA
Sbjct: 67   NSRRRSFDDDQAAVSRYDFSPLLTFLAAKSMS-MSSSASSPTSLDPAEFQLAESYRAVPA 125

Query: 2240 PLWHSLLKSLTATPSS-ISKAYALVTWLQKHNLCFSYELLYSILIHALGRSEKLYEAFLL 2064
            PLWHSLLKSL ++ SS I  AYA+V+WLQKHNLCFSYELLYSILIHALGRSEKLYEAFLL
Sbjct: 126  PLWHSLLKSLCSSSSSDIQLAYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFLL 185

Query: 2063 SQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQSLMRNNSID 1884
            SQRQ+LTPLTYNALIGACARN DLEKAL+LM++MR+DGY+SDFVNYSLIIQSL R+N ID
Sbjct: 186  SQRQSLTPLTYNALIGACARNGDLEKALHLMSRMRQDGYRSDFVNYSLIIQSLSRSNKID 245

Query: 1883 STLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLSPKTATLVA 1704
            S ++ K Y EIE++ IE+D QL NDI  GFAK+G+  +A++ L+++Q  GLSPKTATLVA
Sbjct: 246  SPIMLKLYREIESESIEIDGQLYNDIIAGFAKAGEPTQAMHLLAMVQATGLSPKTATLVA 305

Query: 1703 IISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDVVSEMERSG 1524
            +IS LG  GR            EGGL+PRTRAYNALLKGYV+   LKDAE +VS+ME+SG
Sbjct: 306  LISALGNCGRVVEAEAIFEEMKEGGLQPRTRAYNALLKGYVKAAQLKDAESIVSQMEKSG 365

Query: 1523 VAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRDRGEWQRSF 1344
            ++PDE+TYSLLIDAY NAGRWESARIVLKEMEA++V+P+SYVFSRILASYRDRGEWQ+SF
Sbjct: 366  ISPDEHTYSLLIDAYANAGRWESARIVLKEMEASNVQPNSYVFSRILASYRDRGEWQKSF 425

Query: 1343 QVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVTWNTLIDCH 1164
            QVL+EM+++GV PDR FYNVMIDTFGK NCLDH M TFERM  EGI+PDTVTWNTLIDCH
Sbjct: 426  QVLREMKSSGVRPDRHFYNVMIDTFGKSNCLDHVMATFERMLSEGIQPDTVTWNTLIDCH 485

Query: 1163 CKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQSQGLLPNV 984
            CK GHH +AE+LFE M +SGC PC TTYNIM+NSFGEQ+RW +V+ LL KMQ+QGLLPN+
Sbjct: 486  CKSGHHKRAEELFEEMHQSGCAPCATTYNIMINSFGEQQRWVEVKGLLGKMQAQGLLPNI 545

Query: 983  VTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLADQAVNAFWV 804
            VTYTTLVDIYG+SGRFN+AIECLEVMKSAG+KPS TMYNALINAYAQRGL++QA+NAF V
Sbjct: 546  VTYTTLVDIYGKSGRFNDAIECLEVMKSAGLKPSPTMYNALINAYAQRGLSEQALNAFRV 605

Query: 803  MRGDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLMKALVRVEK 624
            MR DGLKPSLLALNSLINAF EDRRDAEAF+VLQYMKENDLKPDVVTYTTLMK L+RV+K
Sbjct: 606  MRADGLKPSLLALNSLINAFGEDRRDAEAFSVLQYMKENDLKPDVVTYTTLMKTLIRVDK 665

Query: 623  FEKVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            F KVPAVYEEM+   C PDRKARAMLRSAL+YMK T
Sbjct: 666  FYKVPAVYEEMILSRCTPDRKARAMLRSALKYMKQT 701


>ref|XP_004292910.1| PREDICTED: pentatricopeptide repeat-containing protein At5g42310,
            mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 813

 Score =  958 bits (2477), Expect = 0.0
 Identities = 484/697 (69%), Positives = 567/697 (81%), Gaps = 3/697 (0%)
 Frame = -3

Query: 2597 VLQPPPLQFRFPSVPFTPPT-FLLHHHQNFQKPLFXXXXXXXXXXXXXXXXPLSSQQLCH 2421
            +L P P   RFPSV     +  L+ HH  F  PL                  L   +   
Sbjct: 117  LLLPLPGSTRFPSVQLASTSPILIRHHHIFHAPLSATTPCAVSTSAESHLPSLPPSRR-- 174

Query: 2420 GDKDESYDVASLNNRRYDFTPLLEFLSAYFPSPVDRQSNSPTQLEPTELRLAESYRAVPA 2241
              + ++YD       RYDF PLL FLS    +     S SPT L+P E +LAE YRAVPA
Sbjct: 175  --RFDNYDTDQATVSRYDFAPLLAFLSTSSSAHDVTDSASPTSLDPAEFQLAELYRAVPA 232

Query: 2240 PLWHSLLKSL--TATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRSEKLYEAFL 2067
            PLWHSLLKSL  +++ SS+ +AYALV WLQKHNLCFSYELLYSILIHALGRSEKLYEAFL
Sbjct: 233  PLWHSLLKSLCSSSSSSSLKQAYALVAWLQKHNLCFSYELLYSILIHALGRSEKLYEAFL 292

Query: 2066 LSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQSLMRNNSI 1887
            LSQR+TLTPLTYNALIGACARN DLEKALNLM++MR+DGY+SDFVNYSL+IQSL R+N +
Sbjct: 293  LSQRRTLTPLTYNALIGACARNGDLEKALNLMSRMRQDGYRSDFVNYSLVIQSLNRSNKV 352

Query: 1886 DSTLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLSPKTATLV 1707
            DS ++ K Y EIE++ +E+D QLLND+ VGFAK+G+ ++A++FL+++Q +GLSPKTATLV
Sbjct: 353  DSPIMLKLYKEIESENVEIDGQLLNDLIVGFAKAGEPSQAMHFLAMVQASGLSPKTATLV 412

Query: 1706 AIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDVVSEMERS 1527
            ++IS LG +GR            EGGL+PRTRAYNALLKGYV+  +L+DAE +VS+MERS
Sbjct: 413  SVISALGNAGRVVEAEAIFEEMKEGGLQPRTRAYNALLKGYVKAASLEDAESIVSQMERS 472

Query: 1526 GVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRDRGEWQRS 1347
            G++PDE+TYSLLIDAY NAGRWESARIVLKEMEA++V+P+SYVFSRILASYRDRGEWQ+S
Sbjct: 473  GISPDEHTYSLLIDAYANAGRWESARIVLKEMEASNVQPNSYVFSRILASYRDRGEWQKS 532

Query: 1346 FQVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVTWNTLIDC 1167
            FQVL+EMR++GV PDR FYNVMIDTFGK NCLDHAM TFERM  EGI+PDTVTWNTLID 
Sbjct: 533  FQVLREMRSSGVMPDRHFYNVMIDTFGKSNCLDHAMATFERMLSEGIQPDTVTWNTLIDI 592

Query: 1166 HCKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQSQGLLPN 987
            HCK GHH +AE+LFE MQESGC PC TT+NIM+NS GEQERW++V+ L+ KMQSQGLLPN
Sbjct: 593  HCKSGHHARAEELFEEMQESGCAPCATTFNIMINSLGEQERWDEVKGLMGKMQSQGLLPN 652

Query: 986  VVTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLADQAVNAFW 807
            +VTYTTLVDIYG+SGRFN+AIECLE+MKSAG+KPS TMYNALINAYAQRGL++ A+NAF 
Sbjct: 653  IVTYTTLVDIYGKSGRFNDAIECLEIMKSAGLKPSPTMYNALINAYAQRGLSELALNAFR 712

Query: 806  VMRGDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLMKALVRVE 627
            VMR DGLKPSLLALNSLINAF EDRRDAEAF+VLQYMKEND+KPDVVTYTTLMKAL+RV+
Sbjct: 713  VMRADGLKPSLLALNSLINAFGEDRRDAEAFSVLQYMKENDVKPDVVTYTTLMKALIRVD 772

Query: 626  KFEKVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            KF KVP VYEEM+     PDRKARAMLRSAL+YMK T
Sbjct: 773  KFYKVPDVYEEMIHSRVTPDRKARAMLRSALKYMKQT 809


>ref|XP_006405260.1| hypothetical protein EUTSA_v10027665mg [Eutrema salsugineum]
            gi|557106398|gb|ESQ46713.1| hypothetical protein
            EUTSA_v10027665mg [Eutrema salsugineum]
          Length = 704

 Score =  957 bits (2475), Expect = 0.0
 Identities = 486/704 (69%), Positives = 568/704 (80%), Gaps = 10/704 (1%)
 Frame = -3

Query: 2597 VLQPPPLQFRFPSVPF-TPPTFLLHHHQNFQKPLFXXXXXXXXXXXXXXXXPLSSQ---- 2433
            +LQPP +  RF S+ F T P     HH+ FQ P+                   SS     
Sbjct: 3    LLQPPLVSSRFHSLYFITRP-----HHRFFQPPISAFSATAPCSPSSSSSSSYSSSWNVL 57

Query: 2432 QLCHGDKDESYDVASLNNRRYDFTPLLEFLSAYFPSPV-----DRQSNSPTQLEPTELRL 2268
              C  ++DE  + +    RRYDF+PLL+FLS + P  +     D +S SP  L+P E  L
Sbjct: 58   DACE-EEDEDDEFSVEVRRRYDFSPLLKFLSRFGPVELVLDSEDSESFSPDSLDPVEFEL 116

Query: 2267 AESYRAVPAPLWHSLLKSLTATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRSE 2088
            AESY+AVPAP WHSLLKSL ++ SS+  AYA+V+WL+KHNLCFSYELLYSILIHALGRSE
Sbjct: 117  AESYKAVPAPYWHSLLKSLCSSTSSLGLAYAVVSWLRKHNLCFSYELLYSILIHALGRSE 176

Query: 2087 KLYEAFLLSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQS 1908
            KLYEAFLLSQ+QTLTPLTYNALIGACARN+D+EKALNL+++MRRDGYQSDFVNYSL+IQ+
Sbjct: 177  KLYEAFLLSQKQTLTPLTYNALIGACARNNDIEKALNLISRMRRDGYQSDFVNYSLVIQA 236

Query: 1907 LMRNNSIDSTLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLS 1728
            L R+N IDS LL++ Y EIE D +ELD QL+NDI +GFAKSGD +RAL  L + Q  GLS
Sbjct: 237  LTRSNKIDSALLQRLYREIERDKLELDVQLVNDIIMGFAKSGDPSRALQILGMAQATGLS 296

Query: 1727 PKTATLVAIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDV 1548
             KTATLV+IIS L  SGRT           + G+KPRT+AYNALLKGYV+TG LKDAE +
Sbjct: 297  AKTATLVSIISALANSGRTLEAEALFEELRQSGIKPRTKAYNALLKGYVKTGPLKDAESM 356

Query: 1547 VSEMERSGVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRD 1368
            VSEME+ GV+PDE+TYSLLIDAY NAGRWESARIVLKEMEA  V+P+S+VFSR+LA YRD
Sbjct: 357  VSEMEKRGVSPDEHTYSLLIDAYVNAGRWESARIVLKEMEAGDVQPNSFVFSRLLAGYRD 416

Query: 1367 RGEWQRSFQVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVT 1188
            RGEWQ++FQVLKEM++ GV PDRQFYNV+IDTFGK+NCLDHAM TF+RM  EGIEPD VT
Sbjct: 417  RGEWQKTFQVLKEMKSLGVKPDRQFYNVVIDTFGKFNCLDHAMTTFDRMLSEGIEPDRVT 476

Query: 1187 WNTLIDCHCKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQ 1008
            WNTLIDCHCKHG H  AE++FEAM++ GCLPC TTYNIM+NS+G+QERW D++ LL KM+
Sbjct: 477  WNTLIDCHCKHGRHIVAEEMFEAMEKRGCLPCATTYNIMINSYGDQERWNDMKRLLGKMK 536

Query: 1007 SQGLLPNVVTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLAD 828
            SQG+LPNVVT+TTLVD+YG+SGRFN+AIECLE MKS G+KPSSTMYNALINAYAQRGL++
Sbjct: 537  SQGVLPNVVTHTTLVDVYGKSGRFNDAIECLEEMKSVGLKPSSTMYNALINAYAQRGLSE 596

Query: 827  QAVNAFWVMRGDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLM 648
            QAVNAF VM  DGLKPSLLALNSLINAF EDRRDAEAFAVLQYMKEN + PDVVTYTTLM
Sbjct: 597  QAVNAFRVMTSDGLKPSLLALNSLINAFGEDRRDAEAFAVLQYMKENGVNPDVVTYTTLM 656

Query: 647  KALVRVEKFEKVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            KAL+RV+KF+KVP VYEEM+  GC PDRKAR+MLRSALRYMK T
Sbjct: 657  KALIRVDKFQKVPGVYEEMIMSGCKPDRKARSMLRSALRYMKQT 700


>ref|XP_004144287.1| PREDICTED: pentatricopeptide repeat-containing protein At5g42310,
            mitochondrial-like [Cucumis sativus]
            gi|449489420|ref|XP_004158306.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g42310,
            mitochondrial-like [Cucumis sativus]
          Length = 720

 Score =  956 bits (2472), Expect = 0.0
 Identities = 482/719 (67%), Positives = 569/719 (79%), Gaps = 25/719 (3%)
 Frame = -3

Query: 2597 VLQPPPLQFRFPSVPFTPPTFLLHHHQNFQKPLFXXXXXXXXXXXXXXXXPLSSQQLCHG 2418
            +L P PL  RFP+   + P   LHHH N                        SS   C+ 
Sbjct: 3    LLSPLPLSTRFPATHLSSPPVFLHHHHNPH----IATTHLSFSFFSAPATSSSSLVTCYT 58

Query: 2417 DKDE------SYDVASLNNRRYDFTPLLEFLS---AY----------------FPSPVDR 2313
              D         D  SL +RRYDFTPLL+FLS   AY                F S  D 
Sbjct: 59   SSDNLEFDVFENDPVSLQSRRYDFTPLLDFLSRSSAYPKFDSDSDSEVEFDSTFNSGSDS 118

Query: 2312 QSNSPTQLEPTELRLAESYRAVPAPLWHSLLKSLTATPSSISKAYALVTWLQKHNLCFSY 2133
             + SPT L+PTE +LAE+YRAVPAPLWHSLLKSL ++ SSI   YA+V+WLQ+HNLCFSY
Sbjct: 119  DTASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQRHNLCFSY 178

Query: 2132 ELLYSILIHALGRSEKLYEAFLLSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRD 1953
            ELLYSILIHALGRSEKLYEAF+LSQ+QTLTPLTYNALIGACARN+DLEKALNLM++MR+D
Sbjct: 179  ELLYSILIHALGRSEKLYEAFILSQKQTLTPLTYNALIGACARNNDLEKALNLMSRMRQD 238

Query: 1952 GYQSDFVNYSLIIQSLMRNNSIDSTLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDAN 1773
            G+QSDF+NYSLIIQSL R N ID  LL+K Y EIE+D IELD  LLNDI +GFAK+GD N
Sbjct: 239  GFQSDFINYSLIIQSLTRTNKIDIPLLQKLYEEIESDKIELDGLLLNDIILGFAKAGDPN 298

Query: 1772 RALYFLSVIQGNGLSPKTATLVAIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALL 1593
            RALYFLS++Q +GL+PKT+T VA+IS LG  GRT           EGGLKPR +A+NALL
Sbjct: 299  RALYFLSMVQASGLNPKTSTFVAVISALGNHGRTEEAEAIFEEMKEGGLKPRIKAFNALL 358

Query: 1592 KGYVRTGALKDAEDVVSEMERSGVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVE 1413
            KGY R G+LK+AE ++SEME+SG++PDE+TY LL+DAY N GRWESAR +LK+MEA +V+
Sbjct: 359  KGYARKGSLKEAESIISEMEKSGLSPDEHTYGLLVDAYANVGRWESARHLLKQMEARNVQ 418

Query: 1412 PSSYVFSRILASYRDRGEWQRSFQVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDT 1233
            P++++FSRILASYRDRGEWQ++F+VL+EM+N+ V PDR FYNVMIDTFGK+NCLDHAM+T
Sbjct: 419  PNTFIFSRILASYRDRGEWQKTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMET 478

Query: 1232 FERMKLEGIEPDTVTWNTLIDCHCKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGE 1053
            ++RM  EGIEPD VTWNTLIDCH KHG+H++A +LFE MQE G LPC TTYNIM+NS GE
Sbjct: 479  YDRMLSEGIEPDVVTWNTLIDCHRKHGYHDRAAELFEEMQERGYLPCPTTYNIMINSLGE 538

Query: 1052 QERWEDVQDLLRKMQSQGLLPNVVTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTM 873
            QE+W++V+ LL KMQSQGLLPNVVTYTTLVDIYG SGRFN+AI+CLE MKSAG+KPS+TM
Sbjct: 539  QEKWDEVKILLGKMQSQGLLPNVVTYTTLVDIYGHSGRFNDAIDCLEAMKSAGLKPSATM 598

Query: 872  YNALINAYAQRGLADQAVNAFWVMRGDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMK 693
            YNALINA+AQRGL++QAVNA+ VM  DGL+PSLLALNSLINAF EDRRD EAF++LQYMK
Sbjct: 599  YNALINAFAQRGLSEQAVNAYRVMISDGLRPSLLALNSLINAFGEDRRDIEAFSILQYMK 658

Query: 692  ENDLKPDVVTYTTLMKALVRVEKFEKVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            END+KPDVVTYTTLMKAL+RV+KF+KVPAVYEEM+  GC PD KARAMLRSALRYMK T
Sbjct: 659  ENDVKPDVVTYTTLMKALIRVDKFDKVPAVYEEMILSGCTPDGKARAMLRSALRYMKRT 717


>ref|NP_199046.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75154282|sp|Q8L844.1|PP413_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g42310, mitochondrial; Flags: Precursor
            gi|21539517|gb|AAM53311.1| maize crp1 protein-like
            [Arabidopsis thaliana] gi|332007411|gb|AED94794.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 709

 Score =  955 bits (2468), Expect = 0.0
 Identities = 484/702 (68%), Positives = 562/702 (80%), Gaps = 10/702 (1%)
 Frame = -3

Query: 2591 QPPPLQFRFPSVPFTPPTFLLHHHQNFQKPLFXXXXXXXXXXXXXXXXPLSSQQLCHGD- 2415
            QPP +  RF S+ F   T   HHH  F +P                    SS      + 
Sbjct: 6    QPPLVSTRFHSLYFL--THHHHHHHRFFQPPISAFSATTSASLPSPSPSSSSSYFSSWNG 63

Query: 2414 ----KDESYDVASLNNRRYDFTPLLEFLSAYFPSPVDRQSNS-----PTQLEPTELRLAE 2262
                ++E  + +S  +RRYDF+PLL+FLS + P  +   S S     P  L P E  L E
Sbjct: 64   LDTNEEEDNEFSSEVHRRYDFSPLLKFLSRFGPVELALDSESESEASPESLNPVEFDLVE 123

Query: 2261 SYRAVPAPLWHSLLKSLTATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRSEKL 2082
            SYRAVPAP WHSL+KSLT++ SS+  AYA+V+WLQKHNLCFSYELLYSILIHALGRSEKL
Sbjct: 124  SYRAVPAPYWHSLIKSLTSSTSSLGLAYAVVSWLQKHNLCFSYELLYSILIHALGRSEKL 183

Query: 2081 YEAFLLSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQSLM 1902
            YEAFLLSQ+QTLTPLTYNALIGACARN+D+EKALNL+AKMR+DGYQSDFVNYSL+IQSL 
Sbjct: 184  YEAFLLSQKQTLTPLTYNALIGACARNNDIEKALNLIAKMRQDGYQSDFVNYSLVIQSLT 243

Query: 1901 RNNSIDSTLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLSPK 1722
            R+N IDS +L + Y EIE D +ELD QL+NDI +GFAKSGD ++AL  L + Q  GLS K
Sbjct: 244  RSNKIDSVMLLRLYKEIERDKLELDVQLVNDIIMGFAKSGDPSKALQLLGMAQATGLSAK 303

Query: 1721 TATLVAIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDVVS 1542
            TATLV+IIS L  SGRT           + G+KPRTRAYNALLKGYV+TG LKDAE +VS
Sbjct: 304  TATLVSIISALADSGRTLEAEALFEELRQSGIKPRTRAYNALLKGYVKTGPLKDAESMVS 363

Query: 1541 EMERSGVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRDRG 1362
            EME+ GV+PDE+TYSLLIDAY NAGRWESARIVLKEMEA  V+P+S+VFSR+LA +RDRG
Sbjct: 364  EMEKRGVSPDEHTYSLLIDAYVNAGRWESARIVLKEMEAGDVQPNSFVFSRLLAGFRDRG 423

Query: 1361 EWQRSFQVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVTWN 1182
            EWQ++FQVLKEM++ GV PDRQFYNV+IDTFGK+NCLDHAM TF+RM  EGIEPD VTWN
Sbjct: 424  EWQKTFQVLKEMKSIGVKPDRQFYNVVIDTFGKFNCLDHAMTTFDRMLSEGIEPDRVTWN 483

Query: 1181 TLIDCHCKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQSQ 1002
            TLIDCHCKHG H  AE++FEAM+  GCLPC TTYNIM+NS+G+QERW+D++ LL KM+SQ
Sbjct: 484  TLIDCHCKHGRHIVAEEMFEAMERRGCLPCATTYNIMINSYGDQERWDDMKRLLGKMKSQ 543

Query: 1001 GLLPNVVTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLADQA 822
            G+LPNVVT+TTLVD+YG+SGRFN+AIECLE MKS G+KPSSTMYNALINAYAQRGL++QA
Sbjct: 544  GILPNVVTHTTLVDVYGKSGRFNDAIECLEEMKSVGLKPSSTMYNALINAYAQRGLSEQA 603

Query: 821  VNAFWVMRGDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLMKA 642
            VNAF VM  DGLKPSLLALNSLINAF EDRRDAEAFAVLQYMKEN +KPDVVTYTTLMKA
Sbjct: 604  VNAFRVMTSDGLKPSLLALNSLINAFGEDRRDAEAFAVLQYMKENGVKPDVVTYTTLMKA 663

Query: 641  LVRVEKFEKVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            L+RV+KF+KVP VYEEM+  GC PDRKAR+MLRSALRYMK T
Sbjct: 664  LIRVDKFQKVPVVYEEMIMSGCKPDRKARSMLRSALRYMKQT 705


>ref|XP_006473770.1| PREDICTED: pentatricopeptide repeat-containing protein At5g42310,
            mitochondrial-like [Citrus sinensis]
          Length = 704

 Score =  953 bits (2464), Expect = 0.0
 Identities = 491/701 (70%), Positives = 557/701 (79%), Gaps = 7/701 (0%)
 Frame = -3

Query: 2597 VLQPPPLQFRFPSVPFTPPTFLLHHHQNFQKPLFXXXXXXXXXXXXXXXXPLSS------ 2436
            +L P PL  RFP++    P  +  HH  FQ                      SS      
Sbjct: 3    LLPPQPLSVRFPAIHSACP--ITRHHVIFQPAFSVSTITGITTATASGESSFSSSSFTSK 60

Query: 2435 QQLCHGDKDESYDVASLNNRRYDFTPLLEFLSAYFPSP-VDRQSNSPTQLEPTELRLAES 2259
            Q     ++++  DV SL  +RYDFTPLL FLS    S      ++SP+ L   E +LAES
Sbjct: 61   QNDTEEEEEDDDDVLSLQKQRYDFTPLLNFLSENSNSESASALASSPSSLNRVEFKLAES 120

Query: 2258 YRAVPAPLWHSLLKSLTATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRSEKLY 2079
            YRAVPAPLWHSLLK+L ++ SSI  AYA+V+WLQKHNLC+SYELLYSILIHALGRSEKLY
Sbjct: 121  YRAVPAPLWHSLLKNLCSSNSSIDLAYAVVSWLQKHNLCYSYELLYSILIHALGRSEKLY 180

Query: 2078 EAFLLSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQSLMR 1899
            EAFLLSQRQ LTPLTYNALI ACARNDDLEKALNLM+KMR+DGY  DF+NYSL+IQSL R
Sbjct: 181  EAFLLSQRQRLTPLTYNALISACARNDDLEKALNLMSKMRQDGYHCDFINYSLVIQSLTR 240

Query: 1898 NNSIDSTLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLSPKT 1719
             N IDS+LL+K Y EIE D IELD QLLND+ VGFAK+GDA++A+ FL + QG GLSPKT
Sbjct: 241  TNKIDSSLLQKLYKEIECDKIELDGQLLNDVIVGFAKAGDASKAMRFLGMAQGVGLSPKT 300

Query: 1718 ATLVAIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDVVSE 1539
            AT  A+I+ L  SGRT           E GLKPRT+AYNALLKGYV+ G LKDAE VVSE
Sbjct: 301  ATYAAVITALSNSGRTIEAEAVFEELKESGLKPRTKAYNALLKGYVKMGYLKDAEFVVSE 360

Query: 1538 MERSGVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRDRGE 1359
            MERSGV PDE+TYSLLIDAY NAGRWESARIVLKEME +  +P+S+++SRILA YRDRGE
Sbjct: 361  MERSGVLPDEHTYSLLIDAYANAGRWESARIVLKEMEVSHAKPNSFIYSRILAGYRDRGE 420

Query: 1358 WQRSFQVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVTWNT 1179
            WQR+FQVLKEM+++GV PD  FYNVMIDTFGKYNCL HAM  F+RM  EGIEPDT+TWNT
Sbjct: 421  WQRTFQVLKEMKSSGVEPDTHFYNVMIDTFGKYNCLHHAMAAFDRMLSEGIEPDTITWNT 480

Query: 1178 LIDCHCKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQSQG 999
            LIDCH K G +++AE+LFE MQE G  PCTTTYNIM+N  GEQERWEDV+ LL  M++QG
Sbjct: 481  LIDCHFKCGRYDRAEELFEEMQERGYFPCTTTYNIMINLLGEQERWEDVKRLLGNMRAQG 540

Query: 998  LLPNVVTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLADQAV 819
            LLPNVVTYTTLVDIYGQSGRF++AIECLEVMK+AG+KPSSTMYNALINAYA+RGL+DQAV
Sbjct: 541  LLPNVVTYTTLVDIYGQSGRFDDAIECLEVMKAAGLKPSSTMYNALINAYARRGLSDQAV 600

Query: 818  NAFWVMRGDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLMKAL 639
            NAF VMR DGLKPS LALNSLINAF ED+RDAEAFAVLQYMKEN LKPDVVTYTTLMKAL
Sbjct: 601  NAFRVMRTDGLKPSNLALNSLINAFGEDQRDAEAFAVLQYMKENGLKPDVVTYTTLMKAL 660

Query: 638  VRVEKFEKVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            +RV+KF KVPAVYEEM+S GC PDRKARAMLRSALRYMK T
Sbjct: 661  IRVDKFHKVPAVYEEMISSGCTPDRKARAMLRSALRYMKQT 701


>ref|XP_006435342.1| hypothetical protein CICLE_v10000451mg [Citrus clementina]
            gi|567885569|ref|XP_006435343.1| hypothetical protein
            CICLE_v10000451mg [Citrus clementina]
            gi|557537464|gb|ESR48582.1| hypothetical protein
            CICLE_v10000451mg [Citrus clementina]
            gi|557537465|gb|ESR48583.1| hypothetical protein
            CICLE_v10000451mg [Citrus clementina]
          Length = 704

 Score =  949 bits (2452), Expect = 0.0
 Identities = 489/701 (69%), Positives = 555/701 (79%), Gaps = 7/701 (0%)
 Frame = -3

Query: 2597 VLQPPPLQFRFPSVPFTPPTFLLHHHQNFQKPLFXXXXXXXXXXXXXXXXPLSS------ 2436
            +L P PL  RFP++    P  +  HH  FQ                      SS      
Sbjct: 3    LLPPQPLSVRFPAIHSACP--ITRHHVIFQPAFSVSTITGITTATASGESSFSSSSFTSK 60

Query: 2435 QQLCHGDKDESYDVASLNNRRYDFTPLLEFLSAYFPSP-VDRQSNSPTQLEPTELRLAES 2259
            Q     ++++  DV SL  +RYDFTPLL FLS    S      ++SP+ L   E +LAES
Sbjct: 61   QNDTEEEEEDDDDVLSLQKQRYDFTPLLNFLSENSNSESASALASSPSSLNRVEFKLAES 120

Query: 2258 YRAVPAPLWHSLLKSLTATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRSEKLY 2079
            YRAVPAPLWHSLLK+L ++ SSI  AYA+V+WLQKHNLC+SYELLYSILIHALGRSEKLY
Sbjct: 121  YRAVPAPLWHSLLKNLCSSNSSIDLAYAVVSWLQKHNLCYSYELLYSILIHALGRSEKLY 180

Query: 2078 EAFLLSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQSLMR 1899
            EAFLLSQRQ LTPLTYNALI ACARNDDLEKALNLM+KMR+DGY  DF+NYSL+IQSL R
Sbjct: 181  EAFLLSQRQRLTPLTYNALISACARNDDLEKALNLMSKMRQDGYHCDFINYSLVIQSLTR 240

Query: 1898 NNSIDSTLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLSPKT 1719
             N IDS+LL K Y EIE D IELD QLLND+ VGFAK+GDA++A+ FL + QG GLSPKT
Sbjct: 241  TNKIDSSLLHKLYKEIECDKIELDGQLLNDVIVGFAKAGDASKAMRFLGMAQGVGLSPKT 300

Query: 1718 ATLVAIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDVVSE 1539
            AT  A+I+ L  SGRT           E GLKPRT+A+NALLKGYV+ G LKDAE VVSE
Sbjct: 301  ATYAAVITALSNSGRTIEAEAVFEELKESGLKPRTKAFNALLKGYVKMGYLKDAEFVVSE 360

Query: 1538 MERSGVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRDRGE 1359
            MERSGV PDE+TYSLLIDAY NAGRWESARIVLKEME +  +P+S+++SRILA YRDRGE
Sbjct: 361  MERSGVLPDEHTYSLLIDAYANAGRWESARIVLKEMEVSHAKPNSFIYSRILAGYRDRGE 420

Query: 1358 WQRSFQVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVTWNT 1179
            WQR+FQVLKEM+++GV PD  FYNVMIDTFGKYNCL HAM  F+RM  EGIEPDT+TWNT
Sbjct: 421  WQRTFQVLKEMKSSGVEPDTHFYNVMIDTFGKYNCLHHAMAAFDRMLSEGIEPDTITWNT 480

Query: 1178 LIDCHCKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQSQG 999
            LIDCH K G +++AE+LFE MQE G  PCTTTYNIM+N  GEQERWEDV+ LL  M++QG
Sbjct: 481  LIDCHFKCGRYDRAEELFEEMQERGYFPCTTTYNIMINLLGEQERWEDVKRLLGNMRAQG 540

Query: 998  LLPNVVTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLADQAV 819
            LLPNVVTYTTLVDIYGQSGRF++AIECLEVMK+AG+KPSSTMYNALINAYA+RGL+DQAV
Sbjct: 541  LLPNVVTYTTLVDIYGQSGRFDDAIECLEVMKAAGLKPSSTMYNALINAYARRGLSDQAV 600

Query: 818  NAFWVMRGDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLMKAL 639
            NAF VMR DGLKPS LALNSLINAF ED+RDAEAFAVLQYMKEN LKPDVVTYTTLMKAL
Sbjct: 601  NAFRVMRTDGLKPSNLALNSLINAFGEDQRDAEAFAVLQYMKENGLKPDVVTYTTLMKAL 660

Query: 638  VRVEKFEKVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            +RV+KF KVPAVYEEM+  GC PDRKARAMLRSALRYMK T
Sbjct: 661  IRVDKFHKVPAVYEEMILSGCTPDRKARAMLRSALRYMKQT 701


>ref|XP_002865541.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297311376|gb|EFH41800.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 711

 Score =  947 bits (2449), Expect = 0.0
 Identities = 482/702 (68%), Positives = 562/702 (80%), Gaps = 10/702 (1%)
 Frame = -3

Query: 2591 QPPPLQFRFPSVPF-TPPTFLLHHHQNFQKPLFXXXXXXXXXXXXXXXXPLSSQQLCHG- 2418
            QPP +  RF S+ F T      HHH+ FQ P+                   S     +G 
Sbjct: 6    QPPLVSSRFHSLYFLTHHHHHHHHHRFFQPPISAFSATTSASLPSSSSPSSSYFSSWNGL 65

Query: 2417 -DKDESYDVASLNNRRYDFTPLLEFLSAYFPSPV--DRQSNS-----PTQLEPTELRLAE 2262
               +E  + +S  +RRYDF+PLL+FLS + P  +  D +S S     P  L P E  L E
Sbjct: 66   DTNEEDDEFSSEVHRRYDFSPLLKFLSRFGPVELVLDSESESESEASPESLNPVEFELVE 125

Query: 2261 SYRAVPAPLWHSLLKSLTATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRSEKL 2082
            SY AVPAP WHSL+KSL ++ SS+  AYA+V+WLQKHNLCFSYELLYSILIHALGRSEKL
Sbjct: 126  SYSAVPAPYWHSLIKSLCSSTSSLGLAYAVVSWLQKHNLCFSYELLYSILIHALGRSEKL 185

Query: 2081 YEAFLLSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQSLM 1902
            YEAFLLSQ+QTLTPLTYNALIGACARN+D+EKALNL+++MR+DGYQSDFVNYSL+IQSL 
Sbjct: 186  YEAFLLSQKQTLTPLTYNALIGACARNNDIEKALNLISRMRQDGYQSDFVNYSLVIQSLT 245

Query: 1901 RNNSIDSTLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLSPK 1722
            R N IDS +L++ Y EIE D +ELD QL+NDI +GFAKSGD +RAL  L + Q  GLS K
Sbjct: 246  RCNKIDSVMLQRLYKEIERDKLELDVQLVNDIIMGFAKSGDPSRALQLLGMAQATGLSAK 305

Query: 1721 TATLVAIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDVVS 1542
            TATLV+IIS L  SGRT           + G+KPRT+AYNALLKGYV+TG LKDAE +VS
Sbjct: 306  TATLVSIISALANSGRTLEAEALFEELRQSGIKPRTKAYNALLKGYVKTGPLKDAELMVS 365

Query: 1541 EMERSGVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRDRG 1362
            EME+ GV+PDE+TYSLLIDAY NAGRWESARIVLKEME   V+P+S+VFSR+LA YRDRG
Sbjct: 366  EMEKRGVSPDEHTYSLLIDAYVNAGRWESARIVLKEMETGDVQPNSFVFSRLLAGYRDRG 425

Query: 1361 EWQRSFQVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVTWN 1182
            EWQ++FQVLKEM++ GV PDRQFYNV+IDTFGK+NCLDHAM TF+RM  EGIEPD VTWN
Sbjct: 426  EWQKTFQVLKEMKSIGVKPDRQFYNVVIDTFGKFNCLDHAMTTFDRMLSEGIEPDRVTWN 485

Query: 1181 TLIDCHCKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQSQ 1002
            TLIDCHCKHG H  AE++FEAM+  GCLPC TTYNIM+NS+G+QERW+D++ LL KM+SQ
Sbjct: 486  TLIDCHCKHGRHIVAEEMFEAMERRGCLPCATTYNIMINSYGDQERWDDMKRLLGKMKSQ 545

Query: 1001 GLLPNVVTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLADQA 822
            G+LPNVVT+TTLVD+YG+SGRFN+AIECLE MKS G+KPSSTMYNALINAYAQRGL++QA
Sbjct: 546  GILPNVVTHTTLVDVYGKSGRFNDAIECLEEMKSVGLKPSSTMYNALINAYAQRGLSEQA 605

Query: 821  VNAFWVMRGDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLMKA 642
            VNAF VM  DGLKPSLLALNSLINAF EDRRDAEAFAVLQYMKEN +KPDVVTYTTLMKA
Sbjct: 606  VNAFRVMTSDGLKPSLLALNSLINAFGEDRRDAEAFAVLQYMKENGVKPDVVTYTTLMKA 665

Query: 641  LVRVEKFEKVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            L+RV+KF+KVP VYEEM+  GC PDRKAR+MLRSALRYMK T
Sbjct: 666  LIRVDKFQKVPGVYEEMIMSGCKPDRKARSMLRSALRYMKQT 707


>ref|XP_002301924.1| hypothetical protein POPTR_0002s01200g [Populus trichocarpa]
            gi|222843650|gb|EEE81197.1| hypothetical protein
            POPTR_0002s01200g [Populus trichocarpa]
          Length = 709

 Score =  946 bits (2445), Expect = 0.0
 Identities = 475/707 (67%), Positives = 560/707 (79%), Gaps = 13/707 (1%)
 Frame = -3

Query: 2597 VLQPPPLQFRFPSVPFTPPTFLLHHHQNFQKPLFXXXXXXXXXXXXXXXXPLSSQQLCH- 2421
            +L P PL  RFPS+  T P F  HHH+  Q  L                      +  H 
Sbjct: 2    LLFPQPLPNRFPSISITSPIF-QHHHRILQPQLAATPTTTALTATNLTDSSFPYSKRHHE 60

Query: 2420 ----------GDKDESYDVASLNNRRYDFTPLLEFLSAYFPSPVDRQSN--SPTQLEPTE 2277
                       D D+  D+  L + RYDFTPL+ +LS    +  D  S+  SPT L+ TE
Sbjct: 61   TSQNDIEDYYNDDDDDDDILPLQSLRYDFTPLINYLSNKISTSTDSDSDSASPTSLDSTE 120

Query: 2276 LRLAESYRAVPAPLWHSLLKSLTATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALG 2097
             +LAESYR VP PLWHSLLKSL  + SSI  AYA+V+WLQKHNLCFSYELLYSILIHALG
Sbjct: 121  FQLAESYRVVPGPLWHSLLKSLCTSSSSIPLAYAVVSWLQKHNLCFSYELLYSILIHALG 180

Query: 2096 RSEKLYEAFLLSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLI 1917
            +SEKLYEAFLLSQ+Q LTPLTYNALI ACARN+D+EKALNL+ +MR DGY SD VNYSLI
Sbjct: 181  QSEKLYEAFLLSQKQNLTPLTYNALISACARNNDIEKALNLICRMREDGYPSDLVNYSLI 240

Query: 1916 IQSLMRNNSIDSTLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGN 1737
            I+SLM+NN  +S++L+K Y EI+ D +E+D QL NDI VGFAK+GD ++AL FL V+QG+
Sbjct: 241  IRSLMKNNRANSSILQKIYREIDRDKLEVDVQLWNDIIVGFAKAGDLDKALEFLGVVQGS 300

Query: 1736 GLSPKTATLVAIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDA 1557
            GLS KTATLV +I  LG  GRT           + GL+PRTRAYNALL+GYV+ G L+DA
Sbjct: 301  GLSVKTATLVTVIWGLGNCGRTEEAEAIFEEMRDNGLQPRTRAYNALLRGYVKAGLLRDA 360

Query: 1556 EDVVSEMERSGVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILAS 1377
            E VVSEMERSGV P+E TYSLLIDAYGNA RWESARIVLKEMEA++V+P++YVFSRILAS
Sbjct: 361  EFVVSEMERSGVLPNEQTYSLLIDAYGNAERWESARIVLKEMEASNVQPNAYVFSRILAS 420

Query: 1376 YRDRGEWQRSFQVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPD 1197
            YRD+GEWQ++FQVL+EM ++GV PDR FYNV+IDTFGK+NCLDHAM TF+RM  EGIEPD
Sbjct: 421  YRDKGEWQKTFQVLREMEDSGVRPDRIFYNVLIDTFGKFNCLDHAMATFDRMLSEGIEPD 480

Query: 1196 TVTWNTLIDCHCKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLR 1017
            T+TWNTL+DCHCK G H++AE+LFE M E G LPC TT+NIM+NSFG+QERW+DV++LL 
Sbjct: 481  TITWNTLVDCHCKAGKHDRAEELFEEMMEKGYLPCNTTFNIMINSFGDQERWDDVKNLLT 540

Query: 1016 KMQSQGLLPNVVTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRG 837
             M+SQGLLPN VTYTTL+DIYG+SGRF++AIECL+ MK+AG+KPSSTMYNAL+NAYAQRG
Sbjct: 541  NMRSQGLLPNAVTYTTLIDIYGKSGRFDDAIECLDDMKAAGLKPSSTMYNALLNAYAQRG 600

Query: 836  LADQAVNAFWVMRGDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYT 657
            L+DQAV+AFW MR DGLKPSLLALNSLINAF +DRRD EAF VLQYMKENDLKPDVVTYT
Sbjct: 601  LSDQAVSAFWAMRDDGLKPSLLALNSLINAFGKDRRDVEAFVVLQYMKENDLKPDVVTYT 660

Query: 656  TLMKALVRVEKFEKVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            TLMKAL+ VEKF+KVP+VYEEM+  GC PDRKARAMLRSAL+YMK T
Sbjct: 661  TLMKALILVEKFDKVPSVYEEMILSGCTPDRKARAMLRSALKYMKQT 707


>dbj|BAB10204.1| maize crp1 protein-like [Arabidopsis thaliana]
          Length = 680

 Score =  944 bits (2440), Expect = 0.0
 Identities = 482/699 (68%), Positives = 557/699 (79%), Gaps = 7/699 (1%)
 Frame = -3

Query: 2591 QPPPLQFRFPSVPFTPPTFLLHHHQNFQKPLFXXXXXXXXXXXXXXXXPLSSQQLCHGDK 2412
            QPP +  RF S+ F       HHH+ FQ P+                             
Sbjct: 6    QPPLVSTRFHSLYFLTHHHH-HHHRFFQPPISAF-------------------------- 38

Query: 2411 DESYDVASLNNR--RYDFTPLLEFLSAYFPSPVDRQSNS-----PTQLEPTELRLAESYR 2253
              +   ASL +   RYDF+PLL+FLS + P  +   S S     P  L P E  L ESYR
Sbjct: 39   -SATTSASLPSPSPRYDFSPLLKFLSRFGPVELALDSESESEASPESLNPVEFDLVESYR 97

Query: 2252 AVPAPLWHSLLKSLTATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRSEKLYEA 2073
            AVPAP WHSL+KSLT++ SS+  AYA+V+WLQKHNLCFSYELLYSILIHALGRSEKLYEA
Sbjct: 98   AVPAPYWHSLIKSLTSSTSSLGLAYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEA 157

Query: 2072 FLLSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQSLMRNN 1893
            FLLSQ+QTLTPLTYNALIGACARN+D+EKALNL+AKMR+DGYQSDFVNYSL+IQSL R+N
Sbjct: 158  FLLSQKQTLTPLTYNALIGACARNNDIEKALNLIAKMRQDGYQSDFVNYSLVIQSLTRSN 217

Query: 1892 SIDSTLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLSPKTAT 1713
             IDS +L + Y EIE D +ELD QL+NDI +GFAKSGD ++AL  L + Q  GLS KTAT
Sbjct: 218  KIDSVMLLRLYKEIERDKLELDVQLVNDIIMGFAKSGDPSKALQLLGMAQATGLSAKTAT 277

Query: 1712 LVAIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDVVSEME 1533
            LV+IIS L  SGRT           + G+KPRTRAYNALLKGYV+TG LKDAE +VSEME
Sbjct: 278  LVSIISALADSGRTLEAEALFEELRQSGIKPRTRAYNALLKGYVKTGPLKDAESMVSEME 337

Query: 1532 RSGVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRDRGEWQ 1353
            + GV+PDE+TYSLLIDAY NAGRWESARIVLKEMEA  V+P+S+VFSR+LA +RDRGEWQ
Sbjct: 338  KRGVSPDEHTYSLLIDAYVNAGRWESARIVLKEMEAGDVQPNSFVFSRLLAGFRDRGEWQ 397

Query: 1352 RSFQVLKEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVTWNTLI 1173
            ++FQVLKEM++ GV PDRQFYNV+IDTFGK+NCLDHAM TF+RM  EGIEPD VTWNTLI
Sbjct: 398  KTFQVLKEMKSIGVKPDRQFYNVVIDTFGKFNCLDHAMTTFDRMLSEGIEPDRVTWNTLI 457

Query: 1172 DCHCKHGHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQSQGLL 993
            DCHCKHG H  AE++FEAM+  GCLPC TTYNIM+NS+G+QERW+D++ LL KM+SQG+L
Sbjct: 458  DCHCKHGRHIVAEEMFEAMERRGCLPCATTYNIMINSYGDQERWDDMKRLLGKMKSQGIL 517

Query: 992  PNVVTYTTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLADQAVNA 813
            PNVVT+TTLVD+YG+SGRFN+AIECLE MKS G+KPSSTMYNALINAYAQRGL++QAVNA
Sbjct: 518  PNVVTHTTLVDVYGKSGRFNDAIECLEEMKSVGLKPSSTMYNALINAYAQRGLSEQAVNA 577

Query: 812  FWVMRGDGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLMKALVR 633
            F VM  DGLKPSLLALNSLINAF EDRRDAEAFAVLQYMKEN +KPDVVTYTTLMKAL+R
Sbjct: 578  FRVMTSDGLKPSLLALNSLINAFGEDRRDAEAFAVLQYMKENGVKPDVVTYTTLMKALIR 637

Query: 632  VEKFEKVPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            V+KF+KVP VYEEM+  GC PDRKAR+MLRSALRYMK T
Sbjct: 638  VDKFQKVPVVYEEMIMSGCKPDRKARSMLRSALRYMKQT 676


>ref|XP_004500883.1| PREDICTED: pentatricopeptide repeat-containing protein At5g42310,
            mitochondrial-like isoform X2 [Cicer arietinum]
          Length = 691

 Score =  941 bits (2432), Expect = 0.0
 Identities = 468/633 (73%), Positives = 538/633 (84%), Gaps = 1/633 (0%)
 Frame = -3

Query: 2411 DESYDVASLNNRRYDFTPLLEFLSAYFPSPVDR-QSNSPTQLEPTELRLAESYRAVPAPL 2235
            D   D+ SL NRRYDFTPLL FLS    +  +   S+SPT L+ TE +LAESYRAVP+PL
Sbjct: 56   DGPNDILSLQNRRYDFTPLLNFLSNDSNTNTNTTNSSSPTSLDSTEFQLAESYRAVPSPL 115

Query: 2234 WHSLLKSLTATPSSISKAYALVTWLQKHNLCFSYELLYSILIHALGRSEKLYEAFLLSQR 2055
            WH+LLKSL ++ SSI+ AYA+V+WL+KHNLCFSYELLYSILIHALGR+EKLYEAFLLSQR
Sbjct: 116  WHALLKSLCSSSSSITLAYAVVSWLEKHNLCFSYELLYSILIHALGRNEKLYEAFLLSQR 175

Query: 2054 QTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQSDFVNYSLIIQSLMRNNSIDSTL 1875
            Q LTPLTYNALIGACARN DLEKALNLM++MRRDG+Q DFVNYS II+SL R+N IDS +
Sbjct: 176  QVLTPLTYNALIGACARNGDLEKALNLMSRMRRDGFQPDFVNYSSIIKSLTRSNRIDSPI 235

Query: 1874 LEKFYCEIEADMIELDCQLLNDITVGFAKSGDANRALYFLSVIQGNGLSPKTATLVAIIS 1695
            L+K Y EIE D IE D  LLNDI +GF+K+GDA RA++FL+V QG GL PKT T VA+I 
Sbjct: 236  LQKLYAEIETDKIEADGHLLNDIILGFSKAGDATRAMHFLAVAQGKGLCPKTGTFVAVIL 295

Query: 1694 ELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLKGYVRTGALKDAEDVVSEMERSGVAP 1515
             LG SGRT           E GL+PRTRAYNALLKGYV+TG+LKDAE VVSEME+SGV P
Sbjct: 296  ALGNSGRTVEAEALFEEIKESGLEPRTRAYNALLKGYVKTGSLKDAEFVVSEMEKSGVLP 355

Query: 1514 DEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEPSSYVFSRILASYRDRGEWQRSFQVL 1335
            DE+TYSLL+DAY +AGRWESARIVLKEMEA++++P+S+++SRILASYRD+GEWQ+SFQVL
Sbjct: 356  DEHTYSLLVDAYAHAGRWESARIVLKEMEASNLQPNSFIYSRILASYRDKGEWQKSFQVL 415

Query: 1334 KEMRNNGVTPDRQFYNVMIDTFGKYNCLDHAMDTFERMKLEGIEPDTVTWNTLIDCHCKH 1155
            KEM++ GV PDR FYNVMIDTFGKYNCLDHAM TFERM  EGI PDTVTWNTLIDCHCK 
Sbjct: 416  KEMKSCGVQPDRHFYNVMIDTFGKYNCLDHAMATFERMLSEGIRPDTVTWNTLIDCHCKS 475

Query: 1154 GHHNKAEKLFEAMQESGCLPCTTTYNIMVNSFGEQERWEDVQDLLRKMQSQGLLPNVVTY 975
            G H +AE+LFE MQ+SG  PC  TYNIM+NS G QERW+ V DLL +MQSQGLLPN VTY
Sbjct: 476  GRHYRAEELFEEMQQSGYSPCVMTYNIMINSMGTQERWDRVSDLLSRMQSQGLLPNAVTY 535

Query: 974  TTLVDIYGQSGRFNEAIECLEVMKSAGMKPSSTMYNALINAYAQRGLADQAVNAFWVMRG 795
            TTLVDIYG+SGRFN+AIEC++V+KS G KP+STMYNALINAYAQRGL+D AVNAF +M  
Sbjct: 536  TTLVDIYGKSGRFNDAIECIDVLKSLGFKPTSTMYNALINAYAQRGLSDLAVNAFRMMAA 595

Query: 794  DGLKPSLLALNSLINAFSEDRRDAEAFAVLQYMKENDLKPDVVTYTTLMKALVRVEKFEK 615
            +GL PSLLALNSLINAF EDRRDAEAFAVLQYMKEN ++PDVVTYTTLMK+L+RV+K+ K
Sbjct: 596  EGLTPSLLALNSLINAFGEDRRDAEAFAVLQYMKENGVEPDVVTYTTLMKSLIRVDKYPK 655

Query: 614  VPAVYEEMLSCGCMPDRKARAMLRSALRYMKST 516
            VPAVYEEM+  GC PDRKARAMLRSALRYMK T
Sbjct: 656  VPAVYEEMVMSGCAPDRKARAMLRSALRYMKQT 688



 Score = 62.8 bits (151), Expect = 7e-07
 Identities = 57/261 (21%), Positives = 111/261 (42%), Gaps = 2/261 (0%)
 Frame = -3

Query: 2123 YSILIHALGRSEKLYEAFLLSQRQTLTPLTYNALIGACARNDDLEKALNLMAKMRRDGYQ 1944
            Y+ L HA+   E++     LS+      +T+N LI    ++    +A  L  +M++ GY 
Sbjct: 440  YNCLDHAMATFERM-----LSEGIRPDTVTWNTLIDCHCKSGRHYRAEELFEEMQQSGYS 494

Query: 1943 SDFVNYSLIIQSLMRNNSID--STLLEKFYCEIEADMIELDCQLLNDITVGFAKSGDANR 1770
               + Y+++I S+      D  S LL +    +++  +  +      +   + KSG  N 
Sbjct: 495  PCVMTYNIMINSMGTQERWDRVSDLLSR----MQSQGLLPNAVTYTTLVDIYGKSGRFND 550

Query: 1769 ALYFLSVIQGNGLSPKTATLVAIISELGKSGRTXXXXXXXXXXXEGGLKPRTRAYNALLK 1590
            A+  + V++  G  P +    A+I+   + G +             GL P   A N+L+ 
Sbjct: 551  AIECIDVLKSLGFKPTSTMYNALINAYAQRGLSDLAVNAFRMMAAEGLTPSLLALNSLIN 610

Query: 1589 GYVRTGALKDAEDVVSEMERSGVAPDEYTYSLLIDAYGNAGRWESARIVLKEMEANSVEP 1410
             +       +A  V+  M+ +GV PD  TY+ L+ +     ++     V +EM  +   P
Sbjct: 611  AFGEDRRDAEAFAVLQYMKENGVEPDVVTYTTLMKSLIRVDKYPKVPAVYEEMVMSGCAP 670

Query: 1409 SSYVFSRILASYRDRGEWQRS 1347
                 + + ++ R   +  RS
Sbjct: 671  DRKARAMLRSALRYMKQTLRS 691

