BLASTX nr result

ID: Mentha27_contig00005856 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00005856
         (2460 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU44833.1| hypothetical protein MIMGU_mgv1a017808mg, partial...   899   0.0  
ref|XP_006447217.1| hypothetical protein CICLE_v10014357mg [Citr...   852   0.0  
ref|XP_006338641.1| PREDICTED: pentatricopeptide repeat-containi...   848   0.0  
ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containi...   848   0.0  
ref|XP_007031692.1| Pentatricopeptide repeat (PPR-like) superfam...   845   0.0  
ref|XP_004231824.1| PREDICTED: pentatricopeptide repeat-containi...   842   0.0  
ref|XP_002526948.1| pentatricopeptide repeat-containing protein,...   795   0.0  
gb|EPS70491.1| hypothetical protein M569_04265, partial [Genlise...   786   0.0  
ref|XP_002324000.1| pentatricopeptide repeat-containing family p...   784   0.0  
ref|XP_007140836.1| hypothetical protein PHAVU_008G145600g [Phas...   774   0.0  
gb|EXC31403.1| hypothetical protein L484_017686 [Morus notabilis]     773   0.0  
ref|XP_003548551.1| PREDICTED: pentatricopeptide repeat-containi...   764   0.0  
ref|XP_006399946.1| hypothetical protein EUTSA_v10015672mg [Eutr...   756   0.0  
ref|XP_007220233.1| hypothetical protein PRUPE_ppa001979mg [Prun...   752   0.0  
ref|XP_006292935.1| hypothetical protein CARUB_v10019206mg [Caps...   738   0.0  
ref|XP_002873660.1| pentatricopeptide repeat-containing protein ...   733   0.0  
ref|NP_190245.1| pentatricopeptide repeat-containing protein [Ar...   732   0.0  
ref|XP_004308618.1| PREDICTED: pentatricopeptide repeat-containi...   725   0.0  
ref|XP_006852234.1| hypothetical protein AMTR_s00049p00149530 [A...   706   0.0  
gb|EAY82798.1| hypothetical protein OsI_38004 [Oryza sativa Indi...   654   0.0  

>gb|EYU44833.1| hypothetical protein MIMGU_mgv1a017808mg, partial [Mimulus guttatus]
          Length = 659

 Score =  899 bits (2323), Expect = 0.0
 Identities = 452/660 (68%), Positives = 536/660 (81%), Gaps = 5/660 (0%)
 Frame = +2

Query: 491  RKGSLGSAFAVSWALDEPTVGKDDS-VAELEQLDEVERDDDGAKNRXXXXXXXXXXXXXX 667
            +K SLG+AFA++WALDEPT G DDS + E +QL+    D+DGA N+              
Sbjct: 2    KKPSLGAAFALTWALDEPTTGNDDSPIQESDQLN----DNDGANNKDGGDVQKRGIYRRQ 57

Query: 668  XXXXXXXXXXXXXXXXXXXXXALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGK 847
                                 AL  RL SA  ADDVE +LK    LPLQVYSTIIRGFGK
Sbjct: 58   KLQNGRIDVR-----------ALALRLHSATNADDVETILKDMGNLPLQVYSTIIRGFGK 106

Query: 848  EKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGV 1027
            +KK++SAMALFEWLKRKS E    IQPNL+IYNSLLGA+K+A  FDF++ +M+DMA  G+
Sbjct: 107  DKKVDSAMALFEWLKRKSNEADSPIQPNLYIYNSLLGALKQAESFDFVDDVMSDMAAKGL 166

Query: 1028 HPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALA 1207
             PNVVTYNTLMGIYIE  KE+K  +LFEEMP+KGI PSPAS+SIVL  YRRLEDGFGAL 
Sbjct: 167  LPNVVTYNTLMGIYIEHRKEAKVFELFEEMPTKGIFPSPASYSIVLLAYRRLEDGFGALT 226

Query: 1208 FYVQTRNRYEQGEIGRDDD---REDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVL 1378
            F+V+ R+++++GEIG+D+D    EDW  EF+KLENF I +CYQVMRRWLV S+N S +VL
Sbjct: 227  FFVEIRDKFQKGEIGKDNDGEEEEDWVDEFAKLENFTIRICYQVMRRWLVNSKNLSTEVL 286

Query: 1379 RLLQEMDNACLKHGREEHERLIWACTREEHCVVAKELYTRIREVDD-EISVSVCNHLIWL 1555
            RLL+EMD A L+ G EEHERLIWACTREEH +V KELY RIRE+   EIS+SVCNH+IWL
Sbjct: 287  RLLKEMDKAGLQPGHEEHERLIWACTREEHYIVVKELYARIREMTSTEISLSVCNHVIWL 346

Query: 1556 LGKAKKWWAALEIYEDMLDKGPKPNNMSHELIISHFNILLSAARKKGIWRWGVRLLNKME 1735
            +GKAKKWWAALEIYED+LDKGPKPNNMS+ELI+SHF+ILL+AARKKGIW+WGVRLLNKME
Sbjct: 347  MGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFSILLTAARKKGIWKWGVRLLNKME 406

Query: 1736 EKGLKPGSREWNSVLVACSKASETSAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYE 1915
            EKGLKPGSREWN+VLVACSKASETSAAIEIFKRMV+QGEKPTIISYGALLS+LEKGKLY+
Sbjct: 407  EKGLKPGSREWNAVLVACSKASETSAAIEIFKRMVDQGEKPTIISYGALLSALEKGKLYD 466

Query: 1916 QAFQVWQHMVRVGVEPNLHAYTIMASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAII 2095
            +A QVW+HM+++G+EPNL+AYTIMASIYAGQ +F+ ++ I++EM ++ I PTV+TFNAII
Sbjct: 467  EALQVWKHMLKMGLEPNLYAYTIMASIYAGQQKFDIVDSIIQEMVTVNIEPTVVTFNAII 526

Query: 2096 SSCGRNGHGGTAYEWFERMKVDDVVPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFE 2275
            SSCGR+  G  AYE+F+RM+V ++ PNEVTY++LIEALA DGKPRLAY+LHLRA +EG  
Sbjct: 527  SSCGRSNLGSVAYEYFQRMRVLNIAPNEVTYDVLIEALASDGKPRLAYELHLRANNEGLV 586

Query: 2276 LSTKAYDAVVESVNLYGATVDIGALGSRPPERKKKVTIRKDLSEFCKLADVPRRSRPFVR 2455
            LSTKAYDAVVES   YGAT+D+ ALG RPPERKKKV  RK LSEFC LADVPRRS+PF R
Sbjct: 587  LSTKAYDAVVESSESYGATIDVSALGPRPPERKKKVQTRKKLSEFCDLADVPRRSKPFDR 646


>ref|XP_006447217.1| hypothetical protein CICLE_v10014357mg [Citrus clementina]
            gi|568831365|ref|XP_006469938.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g46610-like [Citrus sinensis]
            gi|557549828|gb|ESR60457.1| hypothetical protein
            CICLE_v10014357mg [Citrus clementina]
          Length = 768

 Score =  852 bits (2201), Expect = 0.0
 Identities = 441/762 (57%), Positives = 545/762 (71%), Gaps = 10/762 (1%)
 Frame = +2

Query: 203  MQALTIWPSKTESWNSCLVPQFDLELISFCTPVKWGGRRKRLDF----CDVHSHGLLRFS 370
            MQ L++WP K        VPQ   +++S         RRK+       C   + G L  S
Sbjct: 1    MQPLSVWPLKG---GFAAVPQLHFDVVSSSFLSTRNRRRKKWSLVESVCHSRNTGFLLVS 57

Query: 371  RYSYYKGCRNGVCLASRSYDXXXXXXXXXXESTINGFSKPRKGSLGSAFAVSWALDEPTV 550
              S +  C  GVC  S   D             +  F +P+K   G++   +W++++  +
Sbjct: 58   SNSTFSCC--GVCCRSIKLDSKCEFLSGFSSHKLVLFCEPKKSYFGASVMFAWSMEQQEI 115

Query: 551  GKDDSVAELEQLD----EVERD--DDGAKNRXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 712
            G    V E    D    E E D  D  + +R                             
Sbjct: 116  GNGLLVEEPNSADGLLVETESDIVDYRSVHRVEDTGDNGNQVESEEVEIIGERGVGKQKS 175

Query: 713  XXXXXXALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLK 892
                  AL   L   KTADDVEEVLK    LP QV+S++IRGFGKEK+ + AMAL EWLK
Sbjct: 176  GRVDVKALAQSLWHTKTADDVEEVLKDMGELPPQVHSSMIRGFGKEKRTDCAMALVEWLK 235

Query: 893  RKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYI 1072
            RK  ETGG I PNLF+YNSLLGAVK+++KF+ +++IMNDMA  GV+PNVVTYNTLM IYI
Sbjct: 236  RKKRETGGFIGPNLFVYNSLLGAVKQSQKFEEMDRIMNDMAEEGVNPNVVTYNTLMAIYI 295

Query: 1073 EEGKESKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIG 1252
            E+G+ +KAL + EE+  KG+TPS  S+S  L  YRR+EDG GAL F+V+ R +Y +GEIG
Sbjct: 296  EQGEGTKALNVLEEIKKKGLTPSAVSYSQALLAYRRMEDGNGALKFFVELREKYLKGEIG 355

Query: 1253 RDDDREDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEH 1432
            + DD E+W++EF KL++FII +CYQVMRRWLVK EN S +VL+LL EMD A L+  + E+
Sbjct: 356  KGDD-ENWENEFVKLKDFIIRICYQVMRRWLVKDENLSTNVLKLLIEMDKAGLRPVKAEY 414

Query: 1433 ERLIWACTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLD 1612
            ERL+WACTREEH VVAKE Y RIRE  DEIS+SVCNHLIWL+GKAKKWWAALE+YED+LD
Sbjct: 415  ERLVWACTREEHYVVAKEFYARIRERHDEISLSVCNHLIWLMGKAKKWWAALEVYEDLLD 474

Query: 1613 KGPKPNNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACS 1792
            KGPKPNNMS+ELI+SHFNILLSAARK+GIWRWGVRLLNKMEEKGLKPGSREWN+VLVACS
Sbjct: 475  KGPKPNNMSYELIVSHFNILLSAARKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACS 534

Query: 1793 KASETSAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLH 1972
            KASE +AA++IFKRMVE+GEKPTIISYGALLS+LEKGKLY++A +VWQHM+ VG EPNL+
Sbjct: 535  KASEYNAAVQIFKRMVEKGEKPTIISYGALLSALEKGKLYDEASRVWQHMLNVGAEPNLY 594

Query: 1973 AYTIMASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERM 2152
            AYTIMASI+  QG+F  + LI +EMAS  I PTV+T+NAIIS+CG+NG    AYEWF RM
Sbjct: 595  AYTIMASIFTAQGKFNLVELIFREMASSRIEPTVVTYNAIISACGQNGMSSAAYEWFHRM 654

Query: 2153 KVDDVVPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGAT 2332
            KV ++ PNE+TYEMLIEALA+DGKPRLAYDL+LRA++E   LS+KAYDA++E   +YGAT
Sbjct: 655  KVQNISPNEITYEMLIEALAKDGKPRLAYDLYLRARNEELNLSSKAYDAILEFSQVYGAT 714

Query: 2333 VDIGALGSRPPERKKKVTIRKDLSEFCKLADVPRRSRPFVRK 2458
            +D+  LG RPP++KKKV IRK+LS FC  ADVPRRS+PF +K
Sbjct: 715  IDLTVLGPRPPDKKKKVVIRKNLSNFCHFADVPRRSKPFDKK 756


>ref|XP_006338641.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like
            [Solanum tuberosum]
          Length = 740

 Score =  848 bits (2192), Expect = 0.0
 Identities = 404/576 (70%), Positives = 491/576 (85%)
 Frame = +2

Query: 731  ALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEET 910
            AL   L   KTAD+V+EVLK +  LPLQVYS++IRGFGK+KKL SAMAL EWL+R+S++ 
Sbjct: 156  ALAQSLHFVKTADEVDEVLKDKIELPLQVYSSMIRGFGKDKKLNSAMALVEWLRRRSKDN 215

Query: 911  GGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKES 1090
             G I  N+FIYNSLLGA+KEA K+DF++K+M+DM   GV PNVVTYNTLM IYIE+G+E 
Sbjct: 216  IGSISLNVFIYNSLLGAIKEAGKYDFVDKVMDDMVSEGVQPNVVTYNTLMRIYIEQGREL 275

Query: 1091 KALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDRE 1270
            +AL LF  MP KG++PSPAS+S  LF YRRLEDGFGA+ F+V+TR +Y+ GEIG  ++ E
Sbjct: 276  EALNLFRLMPKKGLSPSPASYSTALFAYRRLEDGFGAITFFVETREKYQNGEIGNIEE-E 334

Query: 1271 DWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWA 1450
            +W+ EF+KLENFI+ +CYQVMR+WLVK EN + +VL+LL +MD A L+  R E+ERL+WA
Sbjct: 335  NWEDEFAKLENFIVRICYQVMRQWLVKGENANTNVLKLLTDMDRARLQLSRAEYERLVWA 394

Query: 1451 CTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPN 1630
            CTREEH VVAKELY RIRE D EIS+SVCNH+IWL+GKAKKWWAALEIYED+LDKGPKPN
Sbjct: 395  CTREEHHVVAKELYNRIRERDTEISLSVCNHIIWLMGKAKKWWAALEIYEDLLDKGPKPN 454

Query: 1631 NMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETS 1810
            NMS+ELI+SHFNILLSAARK+GIWRWGVRLLNKMEEKGLKP SREWN+VLVACSKASETS
Sbjct: 455  NMSYELIVSHFNILLSAARKRGIWRWGVRLLNKMEEKGLKPSSREWNAVLVACSKASETS 514

Query: 1811 AAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMA 1990
            AA++IF+RMVE+GEKPT+ISYGALLS+LEKGKLY++A QVW+HM++VG+EPNL+AYTIMA
Sbjct: 515  AAVQIFRRMVEKGEKPTVISYGALLSALEKGKLYDEALQVWKHMIKVGIEPNLYAYTIMA 574

Query: 1991 SIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVV 2170
            SIY  QG+F  ++ I+KEM + G+ PTV+TFNAIIS C RNG    AYEWF+RMK  ++ 
Sbjct: 575  SIYTAQGKFNIVDSIIKEMVTTGVEPTVVTFNAIISGCARNGMESVAYEWFQRMKTQNIT 634

Query: 2171 PNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGAL 2350
            PNEV+YEMLIEALA DGKPRLAY+L++RA +EG  LSTKAYDAV+ S   YGA++D+  L
Sbjct: 635  PNEVSYEMLIEALANDGKPRLAYELYVRALTEGLSLSTKAYDAVISSTQAYGASIDLSIL 694

Query: 2351 GSRPPERKKKVTIRKDLSEFCKLADVPRRSRPFVRK 2458
            G RPPE+KK+V IRK LSEFC +ADVPRRSRPF R+
Sbjct: 695  GPRPPEKKKRVQIRKSLSEFCNIADVPRRSRPFDRE 730


>ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610
            [Vitis vinifera]
          Length = 763

 Score =  848 bits (2190), Expect = 0.0
 Identities = 444/766 (57%), Positives = 551/766 (71%), Gaps = 14/766 (1%)
 Frame = +2

Query: 203  MQALTIWPSKTESWNSCLVPQFDLELISFCTPVKWGGRRKRLD----FCDVHSHGLLRFS 370
            MQAL++WPSK   W    VPQ D  L S   P +  GRRK  +     C   S   L  S
Sbjct: 1    MQALSVWPSKGVFW---AVPQLDYNLGSSSIPSRRRGRRKLWNPEDPVCQYRSLAFLWVS 57

Query: 371  RYSYYKGCRNGVCLASRSYDXXXXXXXXXXESTINGFSKPRKGSLGSAFAVSWALDEPTV 550
              S  +  R GV   S  +D          +  I    + ++GS G++FA++WAL++  +
Sbjct: 58   SSS--RSDRVGVYCGSPKFDFGCGLLSGYSKLKIFLLCERKRGSFGASFALAWALEQQAI 115

Query: 551  G----KDDSVAELEQLDEVERDD------DGAKNRXXXXXXXXXXXXXXXXXXXXXXXXX 700
            G    K+DS +        E  D      DGA++                          
Sbjct: 116  GNEFVKEDSNSIHSLAGNTETVDIDCLKVDGARD-------GDENDNEEEKEAEKNGEVI 168

Query: 701  XXXXXXXXXXALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALF 880
                      AL   L  A TADDVEEVLK +  LPLQVYST+IRGFG +K+L++AMAL 
Sbjct: 169  EEKSRNVDVRALAHGLEFATTADDVEEVLKDKVELPLQVYSTMIRGFGTDKRLDAAMALV 228

Query: 881  EWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLM 1060
            EWLKRK +ET G   PNLF+YNSLLGAVK++ KF  +EK+MNDMA  G+ PNVVTYNTLM
Sbjct: 229  EWLKRK-KETNGSKGPNLFVYNSLLGAVKQSEKFALVEKVMNDMAREGILPNVVTYNTLM 287

Query: 1061 GIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQ 1240
             IY+E+G+  +AL + EE+   G+ PSP S+S  L  YRR+EDG GAL F+++ R  Y +
Sbjct: 288  SIYLEQGRSVEALNILEEIQKNGLCPSPVSYSTALLVYRRMEDGHGALKFFIELRENYLK 347

Query: 1241 GEIGRDDDREDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHG 1420
            GEIG+D D EDW++EF KL+NF I +CYQVMRRWLVK  N+S  +L+LL +MDNA L+ G
Sbjct: 348  GEIGKDAD-EDWENEFVKLKNFTIRICYQVMRRWLVKEGNQSPILLKLLADMDNAGLQPG 406

Query: 1421 REEHERLIWACTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYE 1600
            R E+ERL+WACTREEH VVAKELYTRIRE   EIS+SVCNH+IWL+GKAKKWWAALEIYE
Sbjct: 407  RAEYERLVWACTREEHYVVAKELYTRIRERHTEISLSVCNHIIWLMGKAKKWWAALEIYE 466

Query: 1601 DMLDKGPKPNNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVL 1780
            D+LDKGPKPNN+S+EL++SHFNILL+AARKKGIWRWGVRLLNKME+KGLKPGSREWN+VL
Sbjct: 467  DLLDKGPKPNNLSYELVVSHFNILLTAARKKGIWRWGVRLLNKMEDKGLKPGSREWNAVL 526

Query: 1781 VACSKASETSAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVE 1960
            VACSKA+ETSAA+EIF+RMVEQGEKPTIISYGALLS+LEKGKLY++A +VW+HMV++GVE
Sbjct: 527  VACSKAAETSAAVEIFRRMVEQGEKPTIISYGALLSALEKGKLYDEASRVWEHMVKMGVE 586

Query: 1961 PNLHAYTIMASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEW 2140
            PNL+AYTIMASI  GQG+ + ++ I++EM +LGI  TV+T+NAIIS C RNG    A+EW
Sbjct: 587  PNLYAYTIMASICVGQGKLQRVDSILREMETLGIDATVVTYNAIISGCARNGLSSAAFEW 646

Query: 2141 FERMKVDDVVPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNL 2320
            F RMKV  + PNE+TYEMLIEALA+DGKPRLA++L+ RA++EG  LSTKAYDAVV S  +
Sbjct: 647  FHRMKVGKIQPNEITYEMLIEALAKDGKPRLAFELYSRAQNEGLNLSTKAYDAVVLSSQV 706

Query: 2321 YGATVDIGALGSRPPERKKKVTIRKDLSEFCKLADVPRRSRPFVRK 2458
            + AT+D+  LG RPPE+KKK+  RK LS FC LADVPRR++PF RK
Sbjct: 707  HSATIDVSLLGPRPPEKKKKLLARKTLSAFCNLADVPRRAKPFDRK 752


>ref|XP_007031692.1| Pentatricopeptide repeat (PPR-like) superfamily protein, putative
            [Theobroma cacao] gi|508710721|gb|EOY02618.1|
            Pentatricopeptide repeat (PPR-like) superfamily protein,
            putative [Theobroma cacao]
          Length = 741

 Score =  845 bits (2182), Expect = 0.0
 Identities = 424/752 (56%), Positives = 540/752 (71%)
 Frame = +2

Query: 203  MQALTIWPSKTESWNSCLVPQFDLELISFCTPVKWGGRRKRLDFCDVHSHGLLRFSRYSY 382
            MQAL+IWP       S +VP  D EL S C        RK     +      L  S YS 
Sbjct: 1    MQALSIWPLNV---GSLVVPHLDFELGSSCFASTKPSSRKTWSLAESRGPSFLLLSSYSR 57

Query: 383  YKGCRNGVCLASRSYDXXXXXXXXXXESTINGFSKPRKGSLGSAFAVSWALDEPTVGKDD 562
            +   R+G C  + +            E  +  F +P++GS     A++WAL++  +G   
Sbjct: 58   FS--RSGTCYRNLNCSLRCGFLCWYSELKVVLFCEPKRGSSRGLVALAWALEQQEIGN-- 113

Query: 563  SVAELEQLDEVERDDDGAKNRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXALGW 742
               ELE+ +   RD D                                        AL  
Sbjct: 114  ---ELEREESHSRDGDNGNE-----------DKNEEMDASSEGEVELEESARLDVRALAS 159

Query: 743  RLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLI 922
             L  AKTADD+E+VLK    LPLQV+S++I+GFG++  +++AMAL EWLKRK  ++GG +
Sbjct: 160  SLQFAKTADDIEKVLKDMDELPLQVHSSMIKGFGRDNYMDAAMALVEWLKRKKNDSGGSV 219

Query: 923  QPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQ 1102
             PNLFIYNSLLGAVK +++F  +EKI+ DM   GV PN+VTYN LM IY+E+G+ +KAL 
Sbjct: 220  GPNLFIYNSLLGAVKHSKQFREMEKILKDMEEEGVIPNIVTYNVLMAIYLEQGEATKALN 279

Query: 1103 LFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWDH 1282
            + EE+  KG +PSP S+S  L  YRR+EDG GAL F+++ R +Y +G++G+D D E+W++
Sbjct: 280  VLEEIQEKGFSPSPVSYSTALLAYRRMEDGNGALKFFIELREKYVKGDLGKDAD-ENWEY 338

Query: 1283 EFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWACTRE 1462
            EF KLENF + +C QVMRRWLVK EN S +VL+LL++MDNA LK  +E++ER+IWACT E
Sbjct: 339  EFVKLENFTVRICQQVMRRWLVKDENLSTNVLKLLRDMDNAGLKLSKEDYERIIWACTCE 398

Query: 1463 EHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPNNMSH 1642
            EH VVAKELY+RIRE   EIS+SVCNHLIWL+GKAKKWWAALE+YE++LDKGP PNN+S+
Sbjct: 399  EHYVVAKELYSRIRERHSEISLSVCNHLIWLMGKAKKWWAALEVYEELLDKGPSPNNLSY 458

Query: 1643 ELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETSAAIE 1822
            EL++SHFNILL+AARK+GIWRWGVRLLNKME+KGLKPGSREWN+VLVACSKASET+AA++
Sbjct: 459  ELVMSHFNILLTAARKRGIWRWGVRLLNKMEDKGLKPGSREWNAVLVACSKASETTAAVQ 518

Query: 1823 IFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMASIYA 2002
            IF+RMVEQGEKPTIISYGALLS+LEKGKLY++A +VW HM++VGV+PNL+AYTIMASI  
Sbjct: 519  IFRRMVEQGEKPTIISYGALLSALEKGKLYDEALRVWDHMIKVGVKPNLYAYTIMASIVT 578

Query: 2003 GQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVVPNEV 2182
            G+G F  +N + +EMAS GI PTV+T+NAIIS C RNG    AYEWF RMKV ++ PNE+
Sbjct: 579  GKGNFRMVNAVFQEMASSGIEPTVVTYNAIISGCARNGMSSAAYEWFHRMKVQNISPNEI 638

Query: 2183 TYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGALGSRP 2362
            TY+MLIEALA+DGKPRLAY+L+LRA +EG  LS+KAYDAVV+S  +YGAT D+  LG RP
Sbjct: 639  TYQMLIEALAKDGKPRLAYELYLRAHNEGLNLSSKAYDAVVQSSQVYGATTDLSVLGPRP 698

Query: 2363 PERKKKVTIRKDLSEFCKLADVPRRSRPFVRK 2458
            P++K KV IRK L+EFC LADVPRRS+PF RK
Sbjct: 699  PDKKMKVQIRKTLTEFCNLADVPRRSKPFDRK 730


>ref|XP_004231824.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like
            [Solanum lycopersicum]
          Length = 742

 Score =  842 bits (2176), Expect = 0.0
 Identities = 402/577 (69%), Positives = 491/577 (85%), Gaps = 1/577 (0%)
 Frame = +2

Query: 731  ALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRK-SEE 907
            AL   L   KTAD+V+EVLK +  LPLQVYS++IRGFGK+KKL SAMAL EWL+R+  ++
Sbjct: 157  ALAQSLHFVKTADEVDEVLKDKVELPLQVYSSMIRGFGKDKKLNSAMALVEWLRRRRGKD 216

Query: 908  TGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKE 1087
              G I  N+FIYNSLLGA+KEA K+DF++K+M+DM   GV PNVVTYNTLM  YIE+G+E
Sbjct: 217  NIGSISLNVFIYNSLLGAIKEAGKYDFVDKVMDDMVSEGVQPNVVTYNTLMRTYIEQGRE 276

Query: 1088 SKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDR 1267
             +AL+LF EMP KG+TPSPAS+S  LF YRRLEDGFGA+ F+V+TR RY+ GEIG  ++ 
Sbjct: 277  LEALKLFREMPKKGLTPSPASYSTALFAYRRLEDGFGAITFFVETRERYQNGEIGNIEE- 335

Query: 1268 EDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIW 1447
            E+W+ EF+KLENFI+ +CYQVMR+WLVK EN + +VL+LL +MD A L+  R E+ERL+W
Sbjct: 336  ENWEDEFAKLENFIVRICYQVMRQWLVKGENANTNVLKLLTDMDRARLQLSRAEYERLVW 395

Query: 1448 ACTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKP 1627
            ACTREEH VVAKELY RIRE D +IS+SVCNH+IWL+GKAKKWWAALEIYED+LDKGP+P
Sbjct: 396  ACTREEHYVVAKELYNRIRERDTDISLSVCNHIIWLMGKAKKWWAALEIYEDLLDKGPQP 455

Query: 1628 NNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASET 1807
            NNMS+ELI+SHFNILLSAARK+GIWRWGVRLLNKMEEKGLKP SREWN+VLVACSKASET
Sbjct: 456  NNMSYELIVSHFNILLSAARKRGIWRWGVRLLNKMEEKGLKPSSREWNAVLVACSKASET 515

Query: 1808 SAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIM 1987
            SAA++IF+RMVE+GEKPT+ISYGALLS+LEKGKLY++A QVW+HM++VG+EPNL+AYTIM
Sbjct: 516  SAAVQIFRRMVEKGEKPTVISYGALLSALEKGKLYDEALQVWKHMIKVGIEPNLYAYTIM 575

Query: 1988 ASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDV 2167
            ASIY  QG+F  ++ I+KEM + G+ PTV+TFNAIIS C RNG    AYEWF+RMK  ++
Sbjct: 576  ASIYTAQGKFNIVDSIIKEMVTTGVEPTVVTFNAIISGCARNGMESVAYEWFQRMKTQNI 635

Query: 2168 VPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGA 2347
             PNEV+YE+LIEALA DGKPRLAY+L++RA +EG  LSTKAYDAV+ S   YGA++D+  
Sbjct: 636  TPNEVSYEVLIEALANDGKPRLAYELYVRALTEGLSLSTKAYDAVISSTQAYGASIDLSI 695

Query: 2348 LGSRPPERKKKVTIRKDLSEFCKLADVPRRSRPFVRK 2458
            LG RPPE+KK+V IRK LSEFC +ADVPRRSRPF R+
Sbjct: 696  LGPRPPEKKKRVQIRKSLSEFCHIADVPRRSRPFDRE 732


>ref|XP_002526948.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223533700|gb|EEF35435.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 671

 Score =  795 bits (2053), Expect = 0.0
 Identities = 381/575 (66%), Positives = 479/575 (83%)
 Frame = +2

Query: 731  ALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEET 910
            +L   L SA+TADDVEEVLK +  LPLQVYS++I+ FG + K+ESA+AL EWLKR+ +E 
Sbjct: 87   SLARSLHSAQTADDVEEVLKDKGELPLQVYSSMIKAFGWDNKMESALALVEWLKRR-KEI 145

Query: 911  GGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKES 1090
            G  I PNLFIYNSLL AVK+++ F+  EKI+NDM   G+ PNVVTYNTLMGIY+E+G+ +
Sbjct: 146  GSSIGPNLFIYNSLLSAVKKSKLFEEAEKILNDMTQEGIAPNVVTYNTLMGIYVEKGQAT 205

Query: 1091 KALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDRE 1270
            KAL + E+M  KG  P+ AS+S  L  YR +EDG GALAF+V  +++Y +G+IG++ D E
Sbjct: 206  KALNILEQMHEKGFIPTAASYSTALLAYRGMEDGHGALAFFVDIKDKYLKGKIGKNSD-E 264

Query: 1271 DWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWA 1450
            +W++EF KLE FII +CYQVMRRWLV+ +N S DVL+LL +MD A L+  + E+ERL+WA
Sbjct: 265  NWENEFVKLETFIIRICYQVMRRWLVRHDNFSTDVLKLLTDMDKAGLQPSQAEYERLVWA 324

Query: 1451 CTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPN 1630
            CTRE+H  V KELY RIRE   +IS+SVCNHLIWL+GKAKKWWAALEIYED+LDKGP PN
Sbjct: 325  CTREDHYAVGKELYIRIRERHSKISLSVCNHLIWLMGKAKKWWAALEIYEDLLDKGPNPN 384

Query: 1631 NMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETS 1810
            NMS+ELI+SHFNILL+AARK+GIWRWGVRLLNKME+KGLKPGSREWN+VLVACSKASET+
Sbjct: 385  NMSYELIVSHFNILLTAARKRGIWRWGVRLLNKMEDKGLKPGSREWNAVLVACSKASETT 444

Query: 1811 AAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMA 1990
            AA++IF+RM+EQGEKPTI+SYGALLS+LEKGKLY++A +VW+HM++V V+PNL+AYTIMA
Sbjct: 445  AAVQIFRRMIEQGEKPTIVSYGALLSALEKGKLYDEAVRVWEHMLKVDVKPNLYAYTIMA 504

Query: 1991 SIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVV 2170
            S++AGQG+F  ++ I+++M S GI PT+IT+NAIIS C  N     AYEWF RMKV ++ 
Sbjct: 505  SVFAGQGKFTYVDAIIQKMVSSGIEPTIITYNAIISGCTHNNLSSAAYEWFHRMKVQNMP 564

Query: 2171 PNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGAL 2350
            PN++TYEMLIEALA+DGKPRLAY+L+LRAK EG +LS K YDAV+ S  +YGAT+DI  L
Sbjct: 565  PNKITYEMLIEALAKDGKPRLAYELYLRAKYEGLDLSAKVYDAVLRSSQVYGATIDINVL 624

Query: 2351 GSRPPERKKKVTIRKDLSEFCKLADVPRRSRPFVR 2455
            G RPP++KK+V IRK L+EFC LADVPRRS+PF R
Sbjct: 625  GPRPPDKKKRVKIRKTLTEFCDLADVPRRSKPFER 659


>gb|EPS70491.1| hypothetical protein M569_04265, partial [Genlisea aurea]
          Length = 557

 Score =  786 bits (2031), Expect = 0.0
 Identities = 375/549 (68%), Positives = 462/549 (84%)
 Frame = +2

Query: 731  ALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEET 910
            AL  +L  A TADDVE++LK +  LPLQVYST+IRG GKEK+++SAMALFEWL+RKS+E+
Sbjct: 10   ALALKLQLATTADDVEQLLKGKENLPLQVYSTVIRGLGKEKRIQSAMALFEWLQRKSKES 69

Query: 911  GGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKES 1090
            G  ++ NLF+YNSLLGA+K+A  FD +E++M  M   GVHPNVVT+N LMGI+IE+G E 
Sbjct: 70   GSKLKLNLFVYNSLLGAMKQAEAFDLVEEVMTKMGAEGVHPNVVTFNALMGIHIEQGNEL 129

Query: 1091 KALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDRE 1270
            +AL+LF EM   GI+PSPAS+S VL  YRR+E+G GA++F+++TRN+Y  G++  DDD E
Sbjct: 130  RALELFREMLMMGISPSPASYSTVLNAYRRMENGSGAVSFFIETRNKYRNGDMANDDD-E 188

Query: 1271 DWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWA 1450
            DW+ E SKLENF + +CYQVMRRWLVK  N S +VL+LL+EMDNA L    E  E+LIWA
Sbjct: 189  DWELEISKLENFTLRICYQVMRRWLVKRGNFSTEVLKLLKEMDNAGLNCDPENLEKLIWA 248

Query: 1451 CTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPN 1630
            CTRE+HC VAKELYTR+RE+  +IS+SVCNH+IWL+GKAKKWWAALEIYE++LD GPKPN
Sbjct: 249  CTREDHCAVAKELYTRVREMGADISLSVCNHIIWLMGKAKKWWAALEIYEELLDTGPKPN 308

Query: 1631 NMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETS 1810
            NMS+ELI+SHFNILL+AARKKGIWRWGVRL+NKM+EKGLKPGSREWNSVLVACSKA ETS
Sbjct: 309  NMSYELIVSHFNILLTAARKKGIWRWGVRLINKMKEKGLKPGSREWNSVLVACSKAGETS 368

Query: 1811 AAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMA 1990
             AIEIFKRMVE G+KPTIISYGALLS+LEKGKLY++A QVW+HMV+VGVE NL+AYTIMA
Sbjct: 369  TAIEIFKRMVENGDKPTIISYGALLSALEKGKLYDEAIQVWKHMVKVGVEANLYAYTIMA 428

Query: 1991 SIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVV 2170
            SI+A QG+ + ++LI++EM   G+ PTV+TFNA+IS   +N     AYEWF RMK+ +V 
Sbjct: 429  SIHASQGKIDLVDLIIREMVGAGVEPTVVTFNAVISGFVKNNLSSAAYEWFRRMKLQNVT 488

Query: 2171 PNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGAL 2350
            PNE+TYE LIEALA+DGKPRLA +LHLRA++EG  LSTKAYDA+++S + YGAT+D GAL
Sbjct: 489  PNEITYETLIEALAKDGKPRLASELHLRAQNEGLMLSTKAYDAIIQSSDAYGATIDYGAL 548

Query: 2351 GSRPPERKK 2377
            G RPPE KK
Sbjct: 549  GPRPPEGKK 557


>ref|XP_002324000.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222867002|gb|EEF04133.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 709

 Score =  784 bits (2025), Expect = 0.0
 Identities = 406/752 (53%), Positives = 527/752 (70%)
 Frame = +2

Query: 203  MQALTIWPSKTESWNSCLVPQFDLELISFCTPVKWGGRRKRLDFCDVHSHGLLRFSRYSY 382
            MQ L++WP    S  SC VP  + E  S C      G  KR    D            + 
Sbjct: 1    MQTLSVWPL---SGGSCAVPHLEFEEDSSCFLSTRRGI-KRWGLVD------------NV 44

Query: 383  YKGCRNGVCLASRSYDXXXXXXXXXXESTINGFSKPRKGSLGSAFAVSWALDEPTVGKDD 562
            ++G  +G  + S                    F + ++GS GS+ A++ AL++  +G + 
Sbjct: 45   FQGASSGFPMVSGDLRFLSNHSKIKYVC----FRETKEGSFGSSLALASALEQQKIGNEF 100

Query: 563  SVAELEQLDEVERDDDGAKNRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXALGW 742
               E   LD+    + G +                                     AL  
Sbjct: 101  HRVE-SSLDDRSLGEAGEER-----------------------------DEKIDVPALAQ 130

Query: 743  RLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLI 922
             L  AKT DD+EEVLK +  LP+QVY ++I+GFG +KK+E A+AL +WLK K +ET G I
Sbjct: 131  SLYFAKTVDDIEEVLKDKGELPVQVYLSMIKGFGWDKKMEPAIALVDWLKIK-KETDGTI 189

Query: 923  QPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQ 1102
             PNLFIYNSLL AVK++ +++  EKI+  M   GV PNVVTYN LM IY+++G+  KAL 
Sbjct: 190  VPNLFIYNSLLSAVKQSEQYEETEKILERMTQEGVAPNVVTYNILMVIYVKQGQAKKALD 249

Query: 1103 LFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWDH 1282
            + EEM   G TPS AS+S  L  YR++EDG GAL F+V+ +++Y +GEIG+D D EDW+ 
Sbjct: 250  VLEEMRRNGFTPSAASYSSALLAYRKMEDGDGALKFFVEIKDKYMKGEIGKDAD-EDWER 308

Query: 1283 EFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWACTRE 1462
            E+ KLENF I +CYQVMRRWLV+ EN + +VL+LL +MD A L+ GR ++ERL+WACTRE
Sbjct: 309  EYVKLENFTIRVCYQVMRRWLVRLENLNTNVLKLLTDMDKAELQPGRSDYERLVWACTRE 368

Query: 1463 EHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPNNMSH 1642
            EH VVAKELY RIRE   +IS+SVCNH+IWL+GKAKKWWAALE+YED+LDKGPKPNN+S+
Sbjct: 369  EHYVVAKELYIRIRERCSDISLSVCNHVIWLMGKAKKWWAALEVYEDLLDKGPKPNNLSY 428

Query: 1643 ELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETSAAIE 1822
            ELI+S+FN+LL+AA+K+GIWRWGVRLLNKMEEKGLKPGS+EWN+VLVACSKASET+AA++
Sbjct: 429  ELIVSYFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSKEWNAVLVACSKASETAAAVQ 488

Query: 1823 IFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMASIYA 2002
            IF+RMVEQGEKPT+ISYGALLS+LEKG+LY++A +VW+HM++VGV+PN++AYTIMAS++ 
Sbjct: 489  IFRRMVEQGEKPTVISYGALLSALEKGRLYDEAVRVWEHMLKVGVKPNVYAYTIMASVFT 548

Query: 2003 GQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVVPNEV 2182
             QG F  ++ I+ EM S GI PTV+T+NAIIS C RN     AYEWF RMKV ++ PNE+
Sbjct: 549  RQGNFRLVDAIINEMVSTGIEPTVVTYNAIISGCARNNLSSAAYEWFHRMKVQNISPNEI 608

Query: 2183 TYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGALGSRP 2362
            TY+MLIEALA+ GKPRLAY+L+LRA++E  +LS KAYDAV+ S   YGAT+D   LG RP
Sbjct: 609  TYDMLIEALAKSGKPRLAYELYLRAQNEDLQLSPKAYDAVMHSSEAYGATIDTSVLGPRP 668

Query: 2363 PERKKKVTIRKDLSEFCKLADVPRRSRPFVRK 2458
            P++KKKV IRK L+EFC LADVPRRS+PF +K
Sbjct: 669  PDKKKKVQIRKTLTEFCNLADVPRRSKPFNKK 700


>ref|XP_007140836.1| hypothetical protein PHAVU_008G145600g [Phaseolus vulgaris]
            gi|561013969|gb|ESW12830.1| hypothetical protein
            PHAVU_008G145600g [Phaseolus vulgaris]
          Length = 752

 Score =  774 bits (1999), Expect = 0.0
 Identities = 375/573 (65%), Positives = 453/573 (79%)
 Frame = +2

Query: 731  ALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEET 910
            AL  RL +A T DDV E+L  +  LPLQV+STII  FGKEK+++SA+ LFEW+K++  ET
Sbjct: 166  ALALRLQTALTVDDVREILVDKRDLPLQVFSTIINSFGKEKRMDSALILFEWMKKRKIET 225

Query: 911  GGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKES 1090
             G   PNLFIYN LLG VK++ +F  +E I+N+MA +G+  NVVTYNTLM IYIE+G+  
Sbjct: 226  NGSFGPNLFIYNGLLGVVKQSGQFAQMETILNEMAKDGISYNVVTYNTLMAIYIEKGEFD 285

Query: 1091 KALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDRE 1270
            +AL + EE+   G TPSP S+S  L  YRR+ED  GAL F+V+ R  Y +GEIG DDD E
Sbjct: 286  RALNVLEEIHGNGFTPSPVSYSQALLAYRRMEDCNGALNFFVELRENYHRGEIGEDDDGE 345

Query: 1271 DWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWA 1450
            DW+ E  KLE F I +CYQVMR WLV S+N S +VL+ L +MDNA +   R + ERL+WA
Sbjct: 346  DWEEELMKLEKFTIRICYQVMRCWLVSSDNLSKNVLKFLVDMDNAGIPLTRADLERLVWA 405

Query: 1451 CTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPN 1630
            CTRE+H +V KELYTRIRE  D+IS+SVCNH IWL+GKAKKWWAALEIYED+LDKGPKPN
Sbjct: 406  CTREDHYIVVKELYTRIRERYDKISLSVCNHAIWLMGKAKKWWAALEIYEDLLDKGPKPN 465

Query: 1631 NMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETS 1810
            N+S+ELI+SHFN LL+AA++KGIWRWGVRLLNKMEEKGLKPGSREWN+VLVACSKASET+
Sbjct: 466  NLSYELIVSHFNFLLNAAKRKGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKASETT 525

Query: 1811 AAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMA 1990
            AA++IFKRMVE GEKPT+ISYGALLS+LEKGKLY+ A +VW HMV+VGVEPN +AYTIMA
Sbjct: 526  AAVQIFKRMVENGEKPTVISYGALLSALEKGKLYDDALRVWNHMVKVGVEPNAYAYTIMA 585

Query: 1991 SIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVV 2170
            SIY  QG F  ++ IV+EM ++GI  TV+T+NAIIS C RNG    AYEWF RMKV ++ 
Sbjct: 586  SIYTAQGNFNRVDAIVQEMVTIGIEVTVVTYNAIISGCARNGMSSAAYEWFHRMKVQNIT 645

Query: 2171 PNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGAL 2350
            PNE+TYEMLIEALA DGKPRLAY L+ RAK+EG  LS+KAYD VV S    GAT ++G L
Sbjct: 646  PNEITYEMLIEALANDGKPRLAYQLYTRAKNEGLTLSSKAYDVVVHSSQANGATTELGLL 705

Query: 2351 GSRPPERKKKVTIRKDLSEFCKLADVPRRSRPF 2449
            G RP ++KKKV IRK L+EF  LA VPRRS  F
Sbjct: 706  GPRPADKKKKVQIRKTLTEFYNLAGVPRRSNQF 738


>gb|EXC31403.1| hypothetical protein L484_017686 [Morus notabilis]
          Length = 737

 Score =  773 bits (1996), Expect = 0.0
 Identities = 404/749 (53%), Positives = 513/749 (68%), Gaps = 22/749 (2%)
 Frame = +2

Query: 203  MQALTIWPSKTESWNSCLVPQFDLELISFCTPVKWGGRRKRLDFCDVHSHGLLRFSRYSY 382
            MQAL+ WP K + W   +VPQ   E  S    +K   RR+R +  D   H  +   R + 
Sbjct: 1    MQALSTWPLKGDLW---IVPQLSSEKSS---SLKTSSRRRRKNVLDFGFHFPVCHGRITG 54

Query: 383  Y-------KGCRNGVCLASRSYDXXXXXXXXXXESTINGFSKPRK-GSLGSAFAVSWALD 538
            +       +G   G       +D          +  +  F KP+K  SLG++ A++ AL+
Sbjct: 55   FVLSTRNSRGVGYGGFCDRPKFDLGCGFLFGFSKLKVARFCKPKKKSSLGASVALAGALE 114

Query: 539  EPTVGKDDSVAELEQ--------------LDEVERDDDGAKNRXXXXXXXXXXXXXXXXX 676
            E  VG    + EL+               L  +E  DD   +                  
Sbjct: 115  EQAVGSAIRIEELDSECSLSGKLSDGHLLLGRIESGDDNNGDEEQENKVIEDVGSEEKSR 174

Query: 677  XXXXXXXXXXXXXXXXXXALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKK 856
                               L   L  AKTADDV+EVLK +  LP QV+ST+IRG G+EK 
Sbjct: 175  EEKGGKVDVRE--------LASSLRFAKTADDVDEVLKDKGELPPQVFSTMIRGLGREKL 226

Query: 857  LESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPN 1036
            L+ A AL EWLKRK EE  GLI  NLFIYNSLLGAVK++ +F  +EK++N MA  GV PN
Sbjct: 227  LDPAFALLEWLKRKKEENNGLISLNLFIYNSLLGAVKQSEQFGEMEKVLNYMAQEGVVPN 286

Query: 1037 VVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYV 1216
            VVTYNT+M I++E G+ +KAL + EE+  KG+TPSP S+S  L  YRR+EDG GAL F+V
Sbjct: 287  VVTYNTMMAIHLENGEGTKALSVLEEIRKKGLTPSPVSYSTALLAYRRMEDGHGALKFFV 346

Query: 1217 QTRNRYEQGEIGRDDDREDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEM 1396
            + R +Y++GE+G+DDD EDW++EF KLENF I +CYQVMR WLV  +N S +VL+LL +M
Sbjct: 347  EIREKYQKGEMGKDDD-EDWENEFVKLENFTIRVCYQVMRHWLVNEDNLSTNVLKLLTKM 405

Query: 1397 DNACLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKW 1576
            D A +   R EHERL+WACTREEH +VAKELY RIRE   +IS+SVCNH IWL+GKAK+W
Sbjct: 406  DIAGIPPSRSEHERLLWACTREEHHLVAKELYDRIREGYSDISLSVCNHTIWLMGKAKRW 465

Query: 1577 WAALEIYEDMLDKGPKPNNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPG 1756
            W ALEIYED+LDKGP+PNNMS+E+I+SHFNILL+AARK+GIW+WGVRLLNKMEEKGLKPG
Sbjct: 466  WTALEIYEDLLDKGPQPNNMSYEIIVSHFNILLTAARKRGIWKWGVRLLNKMEEKGLKPG 525

Query: 1757 SREWNSVLVACSKASETSAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQ 1936
            S+EWN+VL+ACSKASETSAA++IFKRMVEQG+KPT +SYGALLS+LEKGKLY++A QVW+
Sbjct: 526  SKEWNAVLIACSKASETSAAVKIFKRMVEQGQKPTFLSYGALLSALEKGKLYDEARQVWE 585

Query: 1937 HMVRVGVEPNLHAYTIMASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNG 2116
            HM++VG+ PN++AYTIMAS++AG G+F  ++ ++ EM S GI PTV+T+NAIIS C RN 
Sbjct: 586  HMLKVGIRPNVYAYTIMASVFAGHGKFNMVDTVIHEMVSSGIEPTVVTYNAIISGCARND 645

Query: 2117 HGGTAYEWFERMKVDDVVPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYD 2296
                A+EWF RMK   + PN VTYEMLIEALA D KPRLAY+L+LRA++EG  L+ KAYD
Sbjct: 646  MIDMAFEWFHRMKAQSITPNNVTYEMLIEALANDCKPRLAYELYLRAQNEGLRLAPKAYD 705

Query: 2297 AVVESVNLYGATVDIGALGSRPPERKKKV 2383
             VVES   +GAT+D+  LG RPPERK KV
Sbjct: 706  IVVESSQYHGATIDLRLLGPRPPERKGKV 734


>ref|XP_003548551.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like
            [Glycine max]
          Length = 808

 Score =  764 bits (1973), Expect = 0.0
 Identities = 368/575 (64%), Positives = 455/575 (79%)
 Frame = +2

Query: 731  ALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEET 910
            AL   L + KT +DV  +LK +  LPLQV+STII GFGKEK+++SA+ LF W+K++  ET
Sbjct: 222  ALALSLQTVKTVEDVGGILKDKGDLPLQVFSTIISGFGKEKRMDSALILFNWMKKRKIET 281

Query: 911  GGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKES 1090
             G   PNLFIYN LLG VK++ +F  +E I+N+MA +G+  NVVTYNTLM IYIE+G+  
Sbjct: 282  NGSFGPNLFIYNGLLGVVKQSGQFAEMEVILNEMAEDGIAYNVVTYNTLMAIYIEKGECD 341

Query: 1091 KALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDRE 1270
            KAL + EE+   G+TPSP S+S  L  YRR+EDG+GAL F+V+ R +Y QGEIG+DDD E
Sbjct: 342  KALNMLEEIRRNGLTPSPVSYSQALLAYRRMEDGYGALNFFVEFREKYRQGEIGKDDDGE 401

Query: 1271 DWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWA 1450
            DW+ E  KLE F I +CYQVMR WLV  +N S +VL+ L +MDN  +   R + ERL WA
Sbjct: 402  DWEKECLKLEKFTIRVCYQVMRCWLVSRDNLSKNVLKFLVDMDNVGIPLPRADLERLAWA 461

Query: 1451 CTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPN 1630
            CTRE+H +V KELY RIRE  D+IS+SVCNH IWL+GKAKKWWAALEIYED+LDKGPKPN
Sbjct: 462  CTREDHYIVVKELYNRIRERYDKISLSVCNHAIWLMGKAKKWWAALEIYEDLLDKGPKPN 521

Query: 1631 NMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETS 1810
            N+S+ELI+SHFN LLSAA++KGIWRWGV+LLNKME+KGLKPG REWN+VLVACSKASET+
Sbjct: 522  NLSYELIVSHFNFLLSAAKRKGIWRWGVKLLNKMEDKGLKPGCREWNAVLVACSKASETT 581

Query: 1811 AAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMA 1990
            AA++IFKRMVE GEKPTIISYGALLS+LEKGKLY+ A +VW HM++VGVEPN +AYTIMA
Sbjct: 582  AAVQIFKRMVENGEKPTIISYGALLSALEKGKLYDDALRVWNHMIKVGVEPNAYAYTIMA 641

Query: 1991 SIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVV 2170
            SI+  QG F  ++ I++EM +LGI  TV+T+NAII+ C  NG    AYEWF RMKV ++ 
Sbjct: 642  SIHTAQGNFNRVDAIIQEMVTLGIEVTVVTYNAIITGCAHNGMSSVAYEWFHRMKVQNIS 701

Query: 2171 PNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGAL 2350
            PNE+TYEMLI ALA DGKPRLAY L+ RAK+EG  LS+KAYDAVV+S     AT+++G L
Sbjct: 702  PNEITYEMLIVALANDGKPRLAYQLYTRAKNEGLTLSSKAYDAVVQSSQANNATIELGLL 761

Query: 2351 GSRPPERKKKVTIRKDLSEFCKLADVPRRSRPFVR 2455
            G RP ++KKKV IRK L+EF  LA VP+RS+PF R
Sbjct: 762  GPRPVDKKKKVQIRKTLNEFYNLAGVPKRSQPFDR 796


>ref|XP_006399946.1| hypothetical protein EUTSA_v10015672mg [Eutrema salsugineum]
            gi|557101036|gb|ESQ41399.1| hypothetical protein
            EUTSA_v10015672mg [Eutrema salsugineum]
          Length = 688

 Score =  756 bits (1952), Expect = 0.0
 Identities = 399/726 (54%), Positives = 502/726 (69%)
 Frame = +2

Query: 203  MQALTIWPSKTESWNSCLVPQFDLELISFCTPVKWGGRRKRLDFCDVHSHGLLRFSRYSY 382
            MQAL+IWP K   +   +  + + EL   C  V     RKR  F +    G +  S +  
Sbjct: 1    MQALSIWPLK---FGLLVGSRLEFELDCSCYVVS-PKTRKRQYFVEQACFGSI--SSFLL 54

Query: 383  YKGCRNGVCLASRSYDXXXXXXXXXXESTINGFSKPRKGSLGSAFAVSWALDEPTVGKDD 562
                R    LA                + +    +P+K   GS+  V WA ++  +G++ 
Sbjct: 55   VSSNRKFEGLAINP------------STKVLFLCEPKKSLSGSSVGVGWATEQRELGEE- 101

Query: 563  SVAELEQLDEVERDDDGAKNRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXALGW 742
             V+  +       D D +K++                                    L +
Sbjct: 102  -VSREDSSSVTASDSDHSKSQAVTGGEKTNARVDVRE--------------------LAY 140

Query: 743  RLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLI 922
             L +AKTADDV+ VLK +  LPLQVY  +IRGFGK+K+L+ AMA+ +WLKRK  E+GGLI
Sbjct: 141  SLRAAKTADDVDVVLKEKGELPLQVYCAMIRGFGKDKRLKPAMAVVDWLKRKKIESGGLI 200

Query: 923  QPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQ 1102
             PNLFIYNSLLGA+KE+R F   EKI++DM   G+ PN+VTYNTLM IY+EEG+  KAL 
Sbjct: 201  GPNLFIYNSLLGAMKESRGFGETEKILSDMEEEGIVPNIVTYNTLMVIYMEEGEFHKALG 260

Query: 1103 LFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWDH 1282
            + + +  KG  PSP ++S  L  YRRLEDG GAL F+ + R +Y + EIG D D  DW+ 
Sbjct: 261  ILDLVKEKGFEPSPVTYSTALLVYRRLEDGMGALEFFAELREKYSKREIGNDADY-DWEF 319

Query: 1283 EFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWACTRE 1462
            EF KLENFI  +CYQVMRRWLVK EN +  +L+LL  MDNA LK  REEHERLIWACTRE
Sbjct: 320  EFVKLENFIGRICYQVMRRWLVKDENLTTKMLKLLNAMDNAGLKPSREEHERLIWACTRE 379

Query: 1463 EHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPNNMSH 1642
            EH VV KELY RIRE   EIS+SVCNHLIWL+GKAKKWWAALEIYED+LD+GP+PNN+S+
Sbjct: 380  EHYVVGKELYKRIRERFPEISLSVCNHLIWLMGKAKKWWAALEIYEDLLDQGPEPNNLSY 439

Query: 1643 ELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETSAAIE 1822
            EL++SHFNILLSAA ++GIWRWGVRLLNKME+KGLKP SR WN+VLVACSKASET+AAI+
Sbjct: 440  ELVVSHFNILLSAASRRGIWRWGVRLLNKMEDKGLKPQSRHWNAVLVACSKASETAAAIQ 499

Query: 1823 IFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMASIYA 2002
            IFK MVE GEKPT+ISYGALLS+LEKGKLY++AF+VW HM++VG+EPN+HAYTIMAS+  
Sbjct: 500  IFKAMVENGEKPTVISYGALLSALEKGKLYDEAFRVWNHMIKVGIEPNVHAYTIMASVLT 559

Query: 2003 GQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVVPNEV 2182
            GQ +F  L+ ++KEM+S GI P+V+T+NAIIS C RN   G AYEWF RM+ ++V PNE+
Sbjct: 560  GQQKFNLLDTLLKEMSSKGIEPSVVTYNAIISGCARNELSGVAYEWFHRMRGENVEPNEI 619

Query: 2183 TYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGALGSRP 2362
            TYEMLIEALA D KPRLAY+LHL+A++EG +LS+K YDAVV+S   YGAT+D+  LG RP
Sbjct: 620  TYEMLIEALANDAKPRLAYELHLKAQNEGLKLSSKPYDAVVKSAESYGATIDLNLLGPRP 679

Query: 2363 PERKKK 2380
               KK+
Sbjct: 680  VTPKKE 685


>ref|XP_007220233.1| hypothetical protein PRUPE_ppa001979mg [Prunus persica]
            gi|462416695|gb|EMJ21432.1| hypothetical protein
            PRUPE_ppa001979mg [Prunus persica]
          Length = 734

 Score =  752 bits (1942), Expect = 0.0
 Identities = 389/740 (52%), Positives = 507/740 (68%), Gaps = 5/740 (0%)
 Frame = +2

Query: 203  MQALTIWPSKTESWNSCLVPQFDLELISFCTPVKWGGRRKRL-----DFCDVHSHGLLRF 367
            MQAL  WPS+ E+W    VPQ   EL S C       RRK++       C   S  +L  
Sbjct: 1    MQALVTWPSRAETW---AVPQLGFELGSSCK-FSTRIRRKKMWSLGFPVCYGRSGAVLLL 56

Query: 368  SRYSYYKGCRNGVCLASRSYDXXXXXXXXXXESTINGFSKPRKGSLGSAFAVSWALDEPT 547
            S  S   G        S  +D          +       + +K S G++F V+WAL+E  
Sbjct: 57   SSNSGAIGAE--AFSGSPKFDFGCGCFSGYSKLKPARICQSKKRSFGASFVVAWALEEQA 114

Query: 548  VGKDDSVAELEQLDEVERDDDGAKNRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 727
            +G D  + E      +  + +                                       
Sbjct: 115  IGNDIVIEESTSEHRLSGEGESKGVDHLIVDEAEGGEDKNEVDVRNGGANWEQKNEKIDV 174

Query: 728  XALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEE 907
             AL   L  AKTADDVE VLK +  LPLQV+S++IRGFG+++ ++SA A+ EWLKRKSEE
Sbjct: 175  RALALSLQFAKTADDVEVVLKDKGDLPLQVFSSMIRGFGRDRLMDSAFAVVEWLKRKSEE 234

Query: 908  TGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKE 1087
            T G I PNLFIYNSLLGAVK++++F  ++K+++ M   GV  NVVTYNT M IYIE+G  
Sbjct: 235  TNGSITPNLFIYNSLLGAVKQSKQFGEMDKVLSAMTEEGVELNVVTYNTKMAIYIEQGLS 294

Query: 1088 SKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDR 1267
            +KAL + E++  KG+ PS  S+S  L  Y+R+EDG GAL F+++ R +Y +G+I ++   
Sbjct: 295  TKALDVLEDIEKKGLIPSSVSYSTALLAYQRMEDGNGALQFFIEFREKYHKGDISKESV- 353

Query: 1268 EDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIW 1447
            EDW+HEF +LENF   +CYQVMRRWLVK +N S +VL+LL +MD A +   R EHERL+W
Sbjct: 354  EDWEHEFIQLENFTKRVCYQVMRRWLVKDDNLSTNVLKLLAQMDIAGVPLSRAEHERLLW 413

Query: 1448 ACTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKP 1627
            ACTREEH  VAKELY RIRE   EI +SVCNH+IWL+GKAKKWWAALEIYEDMLD+GPKP
Sbjct: 414  ACTREEHYTVAKELYNRIRERHTEIGISVCNHVIWLMGKAKKWWAALEIYEDMLDRGPKP 473

Query: 1628 NNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASET 1807
            NNMS+ELI+SHFN+LL+AARK+GIWRWG+RLLNKMEEKGLKP S+EWN+VLVACSKA+ET
Sbjct: 474  NNMSYELIVSHFNVLLTAARKRGIWRWGIRLLNKMEEKGLKPRSKEWNAVLVACSKAAET 533

Query: 1808 SAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIM 1987
            SAA++IFKRMVEQG+KPT++SYGALLS+LEKGKLY++A QVW+HM++VGV+PNL+AYTIM
Sbjct: 534  SAAVKIFKRMVEQGQKPTVLSYGALLSALEKGKLYDEARQVWEHMLKVGVKPNLYAYTIM 593

Query: 1988 ASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDV 2167
            AS+++G G+   ++ I+ EM S GI PTV+T+NAIIS   RNG    AYEWF+RMK  ++
Sbjct: 594  ASVFSGHGKLNMVDTIIHEMVSSGIEPTVVTYNAIISGFARNGSTNAAYEWFQRMKDQNI 653

Query: 2168 VPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGA 2347
             PN VTYEM+IE LA  GKPRLAYDL+L A+++G +LS K+YD VV+S    G  ++ G 
Sbjct: 654  SPNNVTYEMMIEGLANGGKPRLAYDLYLTAQNQGLDLSPKSYDIVVQSSLASGVAIE-GF 712

Query: 2348 LGSRPPERKKKVTIRKDLSE 2407
            LG+RPP++K++V  RK  ++
Sbjct: 713  LGARPPDKKEEVQGRKSSTQ 732


>ref|XP_006292935.1| hypothetical protein CARUB_v10019206mg [Capsella rubella]
            gi|482561642|gb|EOA25833.1| hypothetical protein
            CARUB_v10019206mg [Capsella rubella]
          Length = 673

 Score =  738 bits (1905), Expect = 0.0
 Identities = 368/632 (58%), Positives = 465/632 (73%), Gaps = 1/632 (0%)
 Frame = +2

Query: 485  KPRKGSLGSAFAVSWALD-EPTVGKDDSVAELEQLDEVERDDDGAKNRXXXXXXXXXXXX 661
            +P++  LGS+  V WA +    V  +DS +      E +  + G KN             
Sbjct: 71   EPKRSFLGSSVGVRWATELGEEVSTEDSSSSSVDHSEPQAVNGGEKNNSRVNVRE----- 125

Query: 662  XXXXXXXXXXXXXXXXXXXXXXXALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGF 841
                                    L + L +AKTADDV+ VLK +  LPLQV+  +I GF
Sbjct: 126  ------------------------LAFSLRAAKTADDVDAVLKEKGELPLQVFCAMISGF 161

Query: 842  GKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAIN 1021
            GK+K+LE A+A+ +WLKRK  E+G +I PNLFIYNSLLGA+K+   F   EK+++DM   
Sbjct: 162  GKDKRLEPAVAVVDWLKRKKSESGSVIGPNLFIYNSLLGAMKQLSAFGEAEKVLSDMEEE 221

Query: 1022 GVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGA 1201
            G+ PN+VTYNTLM IY+EEG+  KAL + + +  KG  P+P ++S  L  YRR+EDG GA
Sbjct: 222  GIVPNIVTYNTLMVIYMEEGEFLKALGILDLVKEKGFEPNPITYSTALLVYRRMEDGMGA 281

Query: 1202 LAFYVQTRNRYEQGEIGRDDDREDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLR 1381
            L F+V+ R +Y + EIG D D  DW  EF KLENFI  +CYQVMRRWLVK+EN +  VL+
Sbjct: 282  LEFFVELREKYSKREIGNDPDY-DWKFEFFKLENFIGRICYQVMRRWLVKNENWTTRVLK 340

Query: 1382 LLQEMDNACLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLG 1561
            LL  MD+A LK  REEHERLIWACTREEH +V KELY RIRE   EIS+SVCNHLIWL+G
Sbjct: 341  LLNAMDSAGLKPSREEHERLIWACTREEHYIVGKELYKRIRERFPEISLSVCNHLIWLMG 400

Query: 1562 KAKKWWAALEIYEDMLDKGPKPNNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEK 1741
            KAKKWWAALEIYED+LD+GP+PNN+S+EL++SHF+ILLSAA ++GIWRWGVRLLNKME+K
Sbjct: 401  KAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFSILLSAASRRGIWRWGVRLLNKMEDK 460

Query: 1742 GLKPGSREWNSVLVACSKASETSAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQA 1921
             LKP SR WN+VLVACSKASET+AAI+IFK MV+ GEKPT+ISYGALLS+LEKGKLY++A
Sbjct: 461  NLKPQSRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSALEKGKLYDEA 520

Query: 1922 FQVWQHMVRVGVEPNLHAYTIMASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISS 2101
            F+VW HMV+VG+EPNL+AYT MAS+  GQ +F  L+ ++KEMAS GI P+V+T+NA+IS 
Sbjct: 521  FRVWNHMVKVGIEPNLYAYTTMASVLTGQQKFNLLDTLLKEMASKGIEPSVVTYNAVISG 580

Query: 2102 CGRNGHGGTAYEWFERMKVDDVVPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELS 2281
            C +NG  G AYEWF RMK ++V PNE+TYEMLIEALA D KPRLAY+LHL+A++EG +LS
Sbjct: 581  CAKNGLSGVAYEWFHRMKSENVEPNEITYEMLIEALANDAKPRLAYELHLKAQNEGLKLS 640

Query: 2282 TKAYDAVVESVNLYGATVDIGALGSRPPERKK 2377
            +K YDAVV+S   YGAT+D+  LG RP  +K+
Sbjct: 641  SKPYDAVVKSAETYGATIDLNLLGPRPDTKKR 672


>ref|XP_002873660.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297319497|gb|EFH49919.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 674

 Score =  733 bits (1893), Expect = 0.0
 Identities = 383/726 (52%), Positives = 495/726 (68%), Gaps = 1/726 (0%)
 Frame = +2

Query: 203  MQALTIWPSKTESWNSCLVPQFDLELISFCTPVKWGGRRKRLDFCDVHSHGLLRFSRYSY 382
            MQAL+IWP K+      +  + + EL   C  V    R++                  S 
Sbjct: 1    MQALSIWPLKS---GLLVGSRLEFELDCSCFVVSHKSRKRHC----------------SA 41

Query: 383  YKGCRNGVC-LASRSYDXXXXXXXXXXESTINGFSKPRKGSLGSAFAVSWALDEPTVGKD 559
             +GC   +  L   S +           S +    +P++   GS+  V WA ++  +G++
Sbjct: 42   QQGCFGRISSLILVSSNRKFEGLAVNPTSKVLFLCEPKRNLSGSSVGVGWATEQRELGEE 101

Query: 560  DSVAELEQLDEVERDDDGAKNRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXALG 739
             S  +      V   + G K                                      L 
Sbjct: 102  VSTEDSSYPQTV---NGGEKTNSRVDVRE-----------------------------LA 129

Query: 740  WRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGL 919
            + L +AKTADDV+ V+K    LPLQVY  +IRGFGK+K+L+ A+A+ +WL+RK  E+GG+
Sbjct: 130  YSLRAAKTADDVDIVIKEMGELPLQVYCAMIRGFGKDKRLKPAIAVVDWLRRKKSESGGV 189

Query: 920  IQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKAL 1099
            I PNLFIYNSLLGA+K++   +  EKI++DM   G+ PN+VTYNTLM IY+E+G+  KAL
Sbjct: 190  IGPNLFIYNSLLGAMKQSSVGE-AEKILSDMEEEGIVPNIVTYNTLMVIYMEKGEFHKAL 248

Query: 1100 QLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWD 1279
             + + +  KG  P+P ++S  L  YRR+EDG GAL F+V+ R +Y + EIG D D  DW+
Sbjct: 249  GILDLVKEKGFEPNPITYSTALLVYRRMEDGMGALEFFVELREKYSKREIGNDADY-DWE 307

Query: 1280 HEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWACTR 1459
             EF KLENFI  +CYQVMRRWLVK EN +  VL+LL  MDNA  K  REEHERLIWACTR
Sbjct: 308  FEFVKLENFIGRICYQVMRRWLVKDENWTTRVLKLLNAMDNAGPKPSREEHERLIWACTR 367

Query: 1460 EEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPNNMS 1639
            EEH +V KELY RIRE   EIS+SVCNHLIWL+GKAKKWWAALEIYED+LD+GP+PNN+S
Sbjct: 368  EEHYIVGKELYKRIRERFPEISLSVCNHLIWLMGKAKKWWAALEIYEDLLDEGPEPNNLS 427

Query: 1640 HELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETSAAI 1819
            +EL++SHFNILLSAA ++GIWRWGVRLLNKME+KGLKP SR WN+VLVACSKASET+AAI
Sbjct: 428  YELVVSHFNILLSAASRRGIWRWGVRLLNKMEDKGLKPQSRHWNAVLVACSKASETTAAI 487

Query: 1820 EIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMASIY 1999
            +IFK MV+ GEKPT+ISYGALLS+LEKGKLY++AF+VW HM++VG+EPNL+AYT MAS+ 
Sbjct: 488  QIFKAMVDNGEKPTVISYGALLSALEKGKLYDEAFRVWNHMIKVGIEPNLYAYTTMASVL 547

Query: 2000 AGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVVPNE 2179
             GQ +F  L+ ++KEMAS GI P+V+T+NA+IS C RNG  G AYEWF RM+ + V PNE
Sbjct: 548  TGQQKFNLLDTLLKEMASKGIEPSVVTYNAVISGCARNGLSGVAYEWFHRMRGEKVEPNE 607

Query: 2180 VTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGALGSR 2359
            +TYEMLIEALA D KPRLAY+LHL+A+++G +LS+K YDAVV+S   YGAT+D+  LG R
Sbjct: 608  ITYEMLIEALANDAKPRLAYELHLKAQNDGLKLSSKPYDAVVKSAETYGATIDLNLLGPR 667

Query: 2360 PPERKK 2377
            P + K+
Sbjct: 668  PHKEKR 673


>ref|NP_190245.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75206903|sp|Q9SNB7.1|PP264_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At3g46610 gi|6523064|emb|CAB62331.1| hypothetical protein
            [Arabidopsis thaliana] gi|332644660|gb|AEE78181.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 665

 Score =  732 bits (1889), Expect = 0.0
 Identities = 365/631 (57%), Positives = 462/631 (73%)
 Frame = +2

Query: 485  KPRKGSLGSAFAVSWALDEPTVGKDDSVAELEQLDEVERDDDGAKNRXXXXXXXXXXXXX 664
            +P++  LGS+F V WA ++  +   +     E L      + G KN              
Sbjct: 70   EPKRSLLGSSFGVGWATEQRELELGEEEVSTEDLSSA---NGGEKNNLRVDVRE------ 120

Query: 665  XXXXXXXXXXXXXXXXXXXXXXALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFG 844
                                   L + L +AKTADDV+ VLK +  LPLQV+  +I+GFG
Sbjct: 121  -----------------------LAFSLRAAKTADDVDAVLKDKGELPLQVFCAMIKGFG 157

Query: 845  KEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAING 1024
            K+K+L+ A+A+ +WLKRK  E+GG+I PNLFIYNSLLGA+   R F   EKI+ DM   G
Sbjct: 158  KDKRLKPAVAVVDWLKRKKSESGGVIGPNLFIYNSLLGAM---RGFGEAEKILKDMEEEG 214

Query: 1025 VHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGAL 1204
            + PN+VTYNTLM IY+EEG+  KAL + +    KG  P+P ++S  L  YRR+EDG GAL
Sbjct: 215  IVPNIVTYNTLMVIYMEEGEFLKALGILDLTKEKGFEPNPITYSTALLVYRRMEDGMGAL 274

Query: 1205 AFYVQTRNRYEQGEIGRDDDREDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRL 1384
             F+V+ R +Y + EIG D    DW+ EF KLENFI  +CYQVMRRWLVK +N +  VL+L
Sbjct: 275  EFFVELREKYAKREIGNDVGY-DWEFEFVKLENFIGRICYQVMRRWLVKDDNWTTRVLKL 333

Query: 1385 LQEMDNACLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGK 1564
            L  MD+A ++  REEHERLIWACTREEH +V KELY RIRE   EIS+SVCNHLIWL+GK
Sbjct: 334  LNAMDSAGVRPSREEHERLIWACTREEHYIVGKELYKRIRERFSEISLSVCNHLIWLMGK 393

Query: 1565 AKKWWAALEIYEDMLDKGPKPNNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKG 1744
            AKKWWAALEIYED+LD+GP+PNN+S+EL++SHFNILLSAA K+GIWRWGVRLLNKME+KG
Sbjct: 394  AKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFNILLSAASKRGIWRWGVRLLNKMEDKG 453

Query: 1745 LKPGSREWNSVLVACSKASETSAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAF 1924
            LKP  R WN+VLVACSKASET+AAI+IFK MV+ GEKPT+ISYGALLS+LEKGKLY++AF
Sbjct: 454  LKPQRRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSALEKGKLYDEAF 513

Query: 1925 QVWQHMVRVGVEPNLHAYTIMASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSC 2104
            +VW HM++VG+EPNL+AYT MAS+  GQ +F  L+ ++KEMAS GI P+V+TFNA+IS C
Sbjct: 514  RVWNHMIKVGIEPNLYAYTTMASVLTGQQKFNLLDTLLKEMASKGIEPSVVTFNAVISGC 573

Query: 2105 GRNGHGGTAYEWFERMKVDDVVPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELST 2284
             RNG  G AYEWF RMK ++V PNE+TYEMLIEALA D KPRLAY+LH++A++EG +LS+
Sbjct: 574  ARNGLSGVAYEWFHRMKSENVEPNEITYEMLIEALANDAKPRLAYELHVKAQNEGLKLSS 633

Query: 2285 KAYDAVVESVNLYGATVDIGALGSRPPERKK 2377
            K YDAVV+S   YGAT+D+  LG RP ++ +
Sbjct: 634  KPYDAVVKSAETYGATIDLNLLGPRPDKKNR 664


>ref|XP_004308618.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like
            [Fragaria vesca subsp. vesca]
          Length = 657

 Score =  725 bits (1871), Expect = 0.0
 Identities = 346/551 (62%), Positives = 446/551 (80%), Gaps = 1/551 (0%)
 Frame = +2

Query: 731  ALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEET 910
            AL  RL  AKTADDVEEVLK    LPLQV+S++IRGFG++K ++SA A+ EWLKR+ EET
Sbjct: 103  ALASRLQFAKTADDVEEVLKEMGDLPLQVFSSMIRGFGRDKLMDSAFAVVEWLKRRGEET 162

Query: 911  GGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKES 1090
             G++ PNLFI+NSLLGAVK+ ++F  ++K++ DM   GV PN+VTYNT M IY+E+G  +
Sbjct: 163  NGMVAPNLFIFNSLLGAVKQCKQFGEMDKVLADMTQEGVEPNIVTYNTKMAIYVEQGLST 222

Query: 1091 KALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDRE 1270
            KAL + EE+  KG+  SP ++S  L  Y+R++DG GAL F+V+ R +Y  G+I    + E
Sbjct: 223  KALDVLEEIQKKGMIASPVTYSTALQAYQRMQDGIGALEFFVEFREKYRNGDICNVSE-E 281

Query: 1271 DWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWA 1450
            DW+ EF KLE+F   +CYQVMR WLV  ++ S +VL+LL  MDNA +  GR EHERL+WA
Sbjct: 282  DWESEFLKLESFTKRVCYQVMRWWLVMDDDLSINVLKLLVNMDNAGIPLGRAEHERLLWA 341

Query: 1451 CTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPN 1630
            CTRE+H  VAKELY RIRE   EIS+SVCNH+IW++GKAKKWWAALEIYEDMLDKGPKPN
Sbjct: 342  CTREDHYNVAKELYCRIRERHSEISLSVCNHVIWVMGKAKKWWAALEIYEDMLDKGPKPN 401

Query: 1631 NMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETS 1810
            NMS+EL++SHFN+LL+AARKKGIWRWGVRLLNKMEEKGLKP S+EWN+VLVACSKA+ETS
Sbjct: 402  NMSYELVVSHFNVLLTAARKKGIWRWGVRLLNKMEEKGLKPRSKEWNAVLVACSKAAETS 461

Query: 1811 AAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMA 1990
            AA++IF+RMVEQG+KPTI+SYGALLS+LEKGKLY++A QVW+HM++VGV+PNL+AYTIMA
Sbjct: 462  AAVKIFRRMVEQGQKPTILSYGALLSALEKGKLYDEARQVWEHMIKVGVKPNLYAYTIMA 521

Query: 1991 SIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRN-GHGGTAYEWFERMKVDDV 2167
            S+++G G+F  +  I++EM S GI PTV+T+NAIIS C RN      AY+WF+RMK +++
Sbjct: 522  SVFSGHGKFNLVETILQEMVSSGIEPTVVTYNAIISGCARNDSSSADAYDWFDRMKANNI 581

Query: 2168 VPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGA 2347
             PN VTYEM+IEALA++GKPRLAY+L+LRA+++G  LS+KAYD +V+S   +G + D+  
Sbjct: 582  PPNNVTYEMMIEALAKEGKPRLAYELYLRAQNQGIHLSSKAYDILVQSSIDFGDSFDLNL 641

Query: 2348 LGSRPPERKKK 2380
            LG RPP   K+
Sbjct: 642  LGPRPPPHAKE 652


>ref|XP_006852234.1| hypothetical protein AMTR_s00049p00149530 [Amborella trichopoda]
            gi|548855838|gb|ERN13701.1| hypothetical protein
            AMTR_s00049p00149530 [Amborella trichopoda]
          Length = 754

 Score =  706 bits (1821), Expect = 0.0
 Identities = 344/573 (60%), Positives = 447/573 (78%), Gaps = 1/573 (0%)
 Frame = +2

Query: 731  ALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEET 910
            AL   L  A+ ADDVEEVL  +  LP  VYS++IRGFG  ++L+ A+AL EWLKR  + T
Sbjct: 164  ALAMSLQFAERADDVEEVL-GDMDLPPSVYSSMIRGFGMAERLKPAIALVEWLKRGKKST 222

Query: 911  GGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKES 1090
             G    NL+IYNSLLGA K +  ++ + KI+ DM   G+ PN+VT NTLM +Y+E+GK  
Sbjct: 223  NGGAILNLYIYNSLLGAAKASHSYEKVGKIIEDMEKQGILPNIVTLNTLMSVYLEQGKTQ 282

Query: 1091 KALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDRE 1270
            +A  +F E+P  G++PSP ++S VL  YR++ED  GAL F+V++R +Y++GEI  +D  E
Sbjct: 283  EARDIFSEIPRNGLSPSPVTYSTVLQIYRKMEDAKGALEFFVESREKYKKGEI-ENDSCE 341

Query: 1271 DWDHEFSKLENFIITLCYQVMRRWLVKSENR-SNDVLRLLQEMDNACLKHGREEHERLIW 1447
            DW++EF+KLENF I +CYQVMR WLVK   R + DVL+LL E+D A LK GR  +ERLIW
Sbjct: 342  DWENEFAKLENFTIRICYQVMRGWLVKGGGREATDVLKLLIELDKAGLKPGRAIYERLIW 401

Query: 1448 ACTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKP 1627
            ACT E H +VAKELY RIRE + EIS+SVCNH+IWL+GKAKKWWA+LE+YE+MLDKGPKP
Sbjct: 402  ACTNEGHYIVAKELYQRIRENNTEISLSVCNHVIWLMGKAKKWWASLEVYEEMLDKGPKP 461

Query: 1628 NNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASET 1807
            NN+S+EL++S FNILLSAA ++GIW W +RLLNKM+EKG+KP +REWN+ LVACS+ASE 
Sbjct: 462  NNLSYELMVSQFNILLSAASRRGIWNWAIRLLNKMQEKGIKPRTREWNAALVACSRASEA 521

Query: 1808 SAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIM 1987
            +AA++IF RMVEQGEKPTI+SYGALLS+LEKGKLY++A QVW+HM++VGV+PNL+AYT M
Sbjct: 522  AAAVQIFMRMVEQGEKPTILSYGALLSALEKGKLYDKAHQVWEHMIKVGVQPNLYAYTTM 581

Query: 1988 ASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDV 2167
             SIY  QG  + ++++++EM SLGI PTV+TFNAIIS C   G GG A+EWF RMK  ++
Sbjct: 582  LSIYIKQGRLKAVDIVIREMNSLGIEPTVVTFNAIISGCAYKGMGGAAFEWFHRMKAKNI 641

Query: 2168 VPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGA 2347
             PNE+TYEMLIEALA DGKPRLAY+++LRA++E   LS KAYD+V+ S   Y A++D+  
Sbjct: 642  EPNEITYEMLIEALANDGKPRLAYEVYLRARNEDLLLSPKAYDSVLRSSYQYKASIDMSR 701

Query: 2348 LGSRPPERKKKVTIRKDLSEFCKLADVPRRSRP 2446
            LG RPPE+ KK T  K  +EFC+L D+ RR +P
Sbjct: 702  LGPRPPEKTKKRT--KVSAEFCRLPDMSRREKP 732


>gb|EAY82798.1| hypothetical protein OsI_38004 [Oryza sativa Indica Group]
          Length = 669

 Score =  654 bits (1688), Expect = 0.0
 Identities = 314/582 (53%), Positives = 432/582 (74%), Gaps = 9/582 (1%)
 Frame = +2

Query: 731  ALGWRLSSAKTADDVEEVLKA--------ESILPLQVYSTIIRGFGKEKKLESAMALFEW 886
            A+G  L  A+TAD+VE ++K         E  LPLQVY+++IRG GKE++L++A A+ E 
Sbjct: 75   AVGAALRDARTADEVETLVKGFLDDGGGGEEHLPLQVYTSVIRGLGKERRLDAAFAVVEH 134

Query: 887  LKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGI 1066
            LKR S   GG+   N F+YN LLGAVK + +F  +  ++ DM   G+ PNVVT+NTLM I
Sbjct: 135  LKRGSGSGGGV---NQFVYNCLLGAVKNSGEFGRIHDVLADMEAQGIPPNVVTFNTLMSI 191

Query: 1067 YIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGE 1246
            Y+E+GK  +  ++F+ +   G+ P+ A++S V+  Y++  D F AL F  + R  Y +GE
Sbjct: 192  YVEQGKIDEVFRVFDTIEGSGLVPTAATYSTVMSSYKKAGDAFAALKFLTKLREMYNKGE 251

Query: 1247 IGRDDDREDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGRE 1426
            +    +REDWD EF K E   + +CY  MRR LV  EN   +VL++L  MD A +K  R 
Sbjct: 252  LA--GNREDWDREFVKFEKLTVRVCYMAMRRSLVGGENPVGEVLKVLLGMDEAGVKPDRR 309

Query: 1427 EHERLIWACTREEHCVVAKELYTRIREVDDE-ISVSVCNHLIWLLGKAKKWWAALEIYED 1603
            ++ERL+WACT EEH  +AKELY RIRE  D  IS+SVCNHLIWL+GKAKKWWAALEIYED
Sbjct: 310  DYERLVWACTGEEHYTIAKELYQRIRERGDGVISLSVCNHLIWLMGKAKKWWAALEIYED 369

Query: 1604 MLDKGPKPNNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLV 1783
            +LDKGPKPNN+S+ELI+SHFNILL+AA+++GIWRWGVRLL+KM++KGLKPGSREWN+VL+
Sbjct: 370  LLDKGPKPNNLSYELIMSHFNILLNAAKRRGIWRWGVRLLDKMQQKGLKPGSREWNAVLL 429

Query: 1784 ACSKASETSAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEP 1963
            ACS+A+ETSAA++IFKRM++QG  P ++SYGALLS+LEKGKLY++A +VW+HM +VGV+P
Sbjct: 430  ACSRAAETSAAVDIFKRMIDQGLTPDVVSYGALLSALEKGKLYDEALRVWEHMCKVGVKP 489

Query: 1964 NLHAYTIMASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWF 2143
            NLHAYTI+ SIY G+G    ++ +++ M S  + PTV+TFNAIIS+C RN  GG+A+EWF
Sbjct: 490  NLHAYTILVSIYIGKGNHAMVDSVLRGMLSAKVEPTVVTFNAIISACVRNNKGGSAFEWF 549

Query: 2144 ERMKVDDVVPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLY 2323
             RMKV ++ PNE+TY+MLIEAL +DGKPRLAY++++RA ++G EL  K+YD V+E+   Y
Sbjct: 550  HRMKVQNIEPNEITYQMLIEALVQDGKPRLAYEMYMRACNQGLELPAKSYDTVMEACQDY 609

Query: 2324 GATVDIGALGSRPPERKKKVTIRKDLSEFCKLADVPRRSRPF 2449
            G+ +D+ +LG RP ++ + + I    S    + D+P  ++ F
Sbjct: 610  GSLIDLNSLGPRPVKKVEPIRIENKFSSSYYVGDLPSSTKHF 651


Top