BLASTX nr result

ID: Catharanthus22_contig00000216 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00000216
         (2857 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006338641.1| PREDICTED: pentatricopeptide repeat-containi...   932   0.0  
ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containi...   926   0.0  
ref|XP_004231824.1| PREDICTED: pentatricopeptide repeat-containi...   919   0.0  
gb|EOY02618.1| Pentatricopeptide repeat (PPR-like) superfamily p...   891   0.0  
ref|XP_006447217.1| hypothetical protein CICLE_v10014357mg [Citr...   890   0.0  
ref|XP_002324000.1| pentatricopeptide repeat-containing family p...   861   0.0  
ref|XP_002526948.1| pentatricopeptide repeat-containing protein,...   852   0.0  
gb|EMJ21432.1| hypothetical protein PRUPE_ppa001979mg [Prunus pe...   851   0.0  
gb|EXC31403.1| hypothetical protein L484_017686 [Morus notabilis]     839   0.0  
gb|ESW12830.1| hypothetical protein PHAVU_008G145600g [Phaseolus...   830   0.0  
ref|XP_003548551.1| PREDICTED: pentatricopeptide repeat-containi...   819   0.0  
ref|XP_004308618.1| PREDICTED: pentatricopeptide repeat-containi...   784   0.0  
gb|EPS70491.1| hypothetical protein M569_04265, partial [Genlise...   774   0.0  
ref|XP_006399946.1| hypothetical protein EUTSA_v10015672mg [Eutr...   766   0.0  
ref|XP_006292935.1| hypothetical protein CARUB_v10019206mg [Caps...   763   0.0  
ref|NP_190245.1| pentatricopeptide repeat-containing protein [Ar...   762   0.0  
ref|XP_002873660.1| pentatricopeptide repeat-containing protein ...   758   0.0  
ref|XP_006852234.1| hypothetical protein AMTR_s00049p00149530 [A...   752   0.0  
gb|EAZ20176.1| hypothetical protein OsJ_35776 [Oryza sativa Japo...   690   0.0  
ref|NP_001066581.1| Os12g0283900 [Oryza sativa Japonica Group] g...   690   0.0  

>ref|XP_006338641.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like
            [Solanum tuberosum]
          Length = 740

 Score =  932 bits (2410), Expect = 0.0
 Identities = 469/688 (68%), Positives = 558/688 (81%), Gaps = 1/688 (0%)
 Frame = +2

Query: 587  PCFGNRTYFRCKLFTKFRGSLGAPCALSWVLEEA-IDSHIVNEGSDSLHDVTEESANQSL 763
            P F N+ +     F  FR       AL+   EE  I   +V + S S    + E   +  
Sbjct: 57   PKFRNQDFCLRTEFVPFRPQKKDSFALTQASEEKDIHCDVVKQNSQSF--TSGEGGVEGF 114

Query: 764  DYVKLEVVRNTNLPSRSLDVDDDLKTEDGNNEEQMGIGKNARIDVRALALSLQFARTAND 943
              V+LE   N    + +++ DDD   + GN E++ G  K  ++DVRALA SL F +TA++
Sbjct: 115  TCVQLEEKGNL---TNNIEYDDD--GDVGNEEDEAGRVKGEKVDVRALAQSLHFVKTADE 169

Query: 944  VEEVLKDKGELPLQVYSSLIRGLGKEKKIDSAIALFEWLKRKSVDSNGAIRPNIFIYNSL 1123
            V+EVLKDK ELPLQVYSS+IRG GK+KK++SA+AL EWL+R+S D+ G+I  N+FIYNSL
Sbjct: 170  VDEVLKDKIELPLQVYSSMIRGFGKDKKLNSAMALVEWLRRRSKDNIGSISLNVFIYNSL 229

Query: 1124 LGAIKQSDKYDFVDSVLNEMTIEGVHPNVITYNTLMAIYIEHGRGVEALRLFEEIPKKGL 1303
            LGAIK++ KYDFVD V+++M  EGV PNV+TYNTLM IYIE GR +EAL LF  +PKKGL
Sbjct: 230  LGAIKEAGKYDFVDKVMDDMVSEGVQPNVVTYNTLMRIYIEQGRELEALNLFRLMPKKGL 289

Query: 1304 YPTPASYSTALLAYQSLEDGFGALNFFIEAKDNYLKGEMRKEVGGDLENEFVKFENLTIR 1483
             P+PASYSTAL AY+ LEDGFGA+ FF+E ++ Y  GE+      + E+EF K EN  +R
Sbjct: 290  SPSPASYSTALFAYRRLEDGFGAITFFVETREKYQNGEIGNIEEENWEDEFAKLENFIVR 349

Query: 1484 ICFWVMRQWLVKNENPCTNILKLLLEMDKVGLQAGRAEHERLVWACTHEEHHIVAKELYK 1663
            IC+ VMRQWLVK EN  TN+LKLL +MD+  LQ  RAE+ERLVWACT EEHH+VAKELY 
Sbjct: 350  ICYQVMRQWLVKGENANTNVLKLLTDMDRARLQLSRAEYERLVWACTREEHHVVAKELYN 409

Query: 1664 RIRERETNISLSVCNHVIWLMGKAKKWWAALEIYEDLLDRGPKPNNMSYELIVSHFNILL 1843
            RIRER+T ISLSVCNH+IWLMGKAKKWWAALEIYEDLLD+GPKPNNMSYELIVSHFNILL
Sbjct: 410  RIRERDTEISLSVCNHIIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILL 469

Query: 1844 TAARKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVSCSKAAETSAAVQIFKRMVERGEK 2023
            +AARKRGIWRWGVRLLNKMEEKGLKP SREWNAVLV+CSKA+ETSAAVQIF+RMVE+GEK
Sbjct: 470  SAARKRGIWRWGVRLLNKMEEKGLKPSSREWNAVLVACSKASETSAAVQIFRRMVEKGEK 529

Query: 2024 PTVISYGALLSALEKGKLYEEALQVWKHMVKVGVKPNLYAYTIMASVYTAQGKFNIVNSI 2203
            PTVISYGALLSALEKGKLY+EALQVWKHM+KVG++PNLYAYTIMAS+YTAQGKFNIV+SI
Sbjct: 530  PTVISYGALLSALEKGKLYDEALQVWKHMIKVGIEPNLYAYTIMASIYTAQGKFNIVDSI 589

Query: 2204 IKEMVTVGVEPTVVTFNAIISGCARNNLGNAAYEWFQRMKNHNISPNEVTYEMLIEALAS 2383
            IKEMVT GVEPTVVTFNAIISGCARN + + AYEWFQRMK  NI+PNEV+YEMLIEALA+
Sbjct: 590  IKEMVTTGVEPTVVTFNAIISGCARNGMESVAYEWFQRMKTQNITPNEVSYEMLIEALAN 649

Query: 2384 DGKPRLVFELYLRAHNEGLSLSSKAYDLVICCSETYGATIDVNALGPRPPERKKKVQIRK 2563
            DGKPRL +ELY+RA  EGLSLS+KAYD VI  ++ YGA+ID++ LGPRPPE+KK+VQIRK
Sbjct: 650  DGKPRLAYELYVRALTEGLSLSTKAYDAVISSTQAYGASIDLSILGPRPPEKKKRVQIRK 709

Query: 2564 NLSEFCNLADVPRRSKPFDEKEIDSVHT 2647
            +LSEFCN+ADVPRRS+PFD +EI +  T
Sbjct: 710  SLSEFCNIADVPRRSRPFDREEIFTAQT 737


>ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610
            [Vitis vinifera]
          Length = 763

 Score =  926 bits (2394), Expect = 0.0
 Identities = 472/768 (61%), Positives = 580/768 (75%), Gaps = 28/768 (3%)
 Frame = +2

Query: 431  MQALSTWPSRNESWFVPQLDIELGSVSKLRKSGPQRKR---------------LYLAGSR 565
            MQALS WPS+   W VPQLD  LGS S   +   +RK                L+++ S 
Sbjct: 1    MQALSVWPSKGVFWAVPQLDYNLGSSSIPSRRRGRRKLWNPEDPVCQYRSLAFLWVSSSS 60

Query: 566  LEIANYLPCFGNRTYFRCKLFTKF------------RGSLGAPCALSWVLE-EAIDSHIV 706
                  + C   +  F C L + +            RGS GA  AL+W LE +AI +  V
Sbjct: 61   RSDRVGVYCGSPKFDFGCGLLSGYSKLKIFLLCERKRGSFGASFALAWALEQQAIGNEFV 120

Query: 707  NEGSDSLHDVTEESANQSLDYVKLEVVRNTNLPSRSLDVDDDLKTEDGNNEEQMGIGKNA 886
             E S+S+H +   +    +D +K++  R+        D +D+ + ++     ++   K+ 
Sbjct: 121  KEDSNSIHSLAGNTETVDIDCLKVDGARDG-------DENDNEEEKEAEKNGEVIEEKSR 173

Query: 887  RIDVRALALSLQFARTANDVEEVLKDKGELPLQVYSSLIRGLGKEKKIDSAIALFEWLKR 1066
             +DVRALA  L+FA TA+DVEEVLKDK ELPLQVYS++IRG G +K++D+A+AL EWLKR
Sbjct: 174  NVDVRALAHGLEFATTADDVEEVLKDKVELPLQVYSTMIRGFGTDKRLDAAMALVEWLKR 233

Query: 1067 KSVDSNGAIRPNIFIYNSLLGAIKQSDKYDFVDSVLNEMTIEGVHPNVITYNTLMAIYIE 1246
            K  ++NG+  PN+F+YNSLLGA+KQS+K+  V+ V+N+M  EG+ PNV+TYNTLM+IY+E
Sbjct: 234  KK-ETNGSKGPNLFVYNSLLGAVKQSEKFALVEKVMNDMAREGILPNVVTYNTLMSIYLE 292

Query: 1247 HGRGVEALRLFEEIPKKGLYPTPASYSTALLAYQSLEDGFGALNFFIEAKDNYLKGEMRK 1426
             GR VEAL + EEI K GL P+P SYSTALL Y+ +EDG GAL FFIE ++NYLKGE+ K
Sbjct: 293  QGRSVEALNILEEIQKNGLCPSPVSYSTALLVYRRMEDGHGALKFFIELRENYLKGEIGK 352

Query: 1427 EVGGDLENEFVKFENLTIRICFWVMRQWLVKNENPCTNILKLLLEMDKVGLQAGRAEHER 1606
            +   D ENEFVK +N TIRIC+ VMR+WLVK  N    +LKLL +MD  GLQ GRAE+ER
Sbjct: 353  DADEDWENEFVKLKNFTIRICYQVMRRWLVKEGNQSPILLKLLADMDNAGLQPGRAEYER 412

Query: 1607 LVWACTHEEHHIVAKELYKRIRERETNISLSVCNHVIWLMGKAKKWWAALEIYEDLLDRG 1786
            LVWACT EEH++VAKELY RIRER T ISLSVCNH+IWLMGKAKKWWAALEIYEDLLD+G
Sbjct: 413  LVWACTREEHYVVAKELYTRIRERHTEISLSVCNHIIWLMGKAKKWWAALEIYEDLLDKG 472

Query: 1787 PKPNNMSYELIVSHFNILLTAARKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVSCSKA 1966
            PKPNN+SYEL+VSHFNILLTAARK+GIWRWGVRLLNKME+KGLKPGSREWNAVLV+CSKA
Sbjct: 473  PKPNNLSYELVVSHFNILLTAARKKGIWRWGVRLLNKMEDKGLKPGSREWNAVLVACSKA 532

Query: 1967 AETSAAVQIFKRMVERGEKPTVISYGALLSALEKGKLYEEALQVWKHMVKVGVKPNLYAY 2146
            AETSAAV+IF+RMVE+GEKPT+ISYGALLSALEKGKLY+EA +VW+HMVK+GV+PNLYAY
Sbjct: 533  AETSAAVEIFRRMVEQGEKPTIISYGALLSALEKGKLYDEASRVWEHMVKMGVEPNLYAY 592

Query: 2147 TIMASVYTAQGKFNIVNSIIKEMVTVGVEPTVVTFNAIISGCARNNLGNAAYEWFQRMKN 2326
            TIMAS+   QGK   V+SI++EM T+G++ TVVT+NAIISGCARN L +AA+EWF RMK 
Sbjct: 593  TIMASICVGQGKLQRVDSILREMETLGIDATVVTYNAIISGCARNGLSSAAFEWFHRMKV 652

Query: 2327 HNISPNEVTYEMLIEALASDGKPRLVFELYLRAHNEGLSLSSKAYDLVICCSETYGATID 2506
              I PNE+TYEMLIEALA DGKPRL FELY RA NEGL+LS+KAYD V+  S+ + ATID
Sbjct: 653  GKIQPNEITYEMLIEALAKDGKPRLAFELYSRAQNEGLNLSTKAYDAVVLSSQVHSATID 712

Query: 2507 VNALGPRPPERKKKVQIRKNLSEFCNLADVPRRSKPFDEKEIDSVHTQ 2650
            V+ LGPRPPE+KKK+  RK LS FCNLADVPRR+KPFD KEI S  T+
Sbjct: 713  VSLLGPRPPEKKKKLLARKTLSAFCNLADVPRRAKPFDRKEIYSQQTE 760


>ref|XP_004231824.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like
            [Solanum lycopersicum]
          Length = 742

 Score =  919 bits (2374), Expect = 0.0
 Identities = 460/673 (68%), Positives = 556/673 (82%), Gaps = 2/673 (0%)
 Frame = +2

Query: 638  RGSLGAPCALSWVL-EEAIDSHIVNEGSDSLHDVTEESANQSLDYVKLEVVRNTNLPSRS 814
            + S G  CAL+    E+ ID  IV +  +SL   + E   +    V+LE   +    + +
Sbjct: 78   KDSFGPSCALAQASGEKDIDCDIVKQ--NSLSFTSGEGGVEGFTCVQLEEKGDL---TNN 132

Query: 815  LDVDDDLKTEDGNNEEQMGIGKNARIDVRALALSLQFARTANDVEEVLKDKGELPLQVYS 994
            ++ DD +  ED     + GI K  ++DVRALA SL F +TA++V+EVLKDK ELPLQVYS
Sbjct: 133  VEYDDVVSEED-----EAGIVKGEKVDVRALAQSLHFVKTADEVDEVLKDKVELPLQVYS 187

Query: 995  SLIRGLGKEKKIDSAIALFEWLKRK-SVDSNGAIRPNIFIYNSLLGAIKQSDKYDFVDSV 1171
            S+IRG GK+KK++SA+AL EWL+R+   D+ G+I  N+FIYNSLLGAIK++ KYDFVD V
Sbjct: 188  SMIRGFGKDKKLNSAMALVEWLRRRRGKDNIGSISLNVFIYNSLLGAIKEAGKYDFVDKV 247

Query: 1172 LNEMTIEGVHPNVITYNTLMAIYIEHGRGVEALRLFEEIPKKGLYPTPASYSTALLAYQS 1351
            +++M  EGV PNV+TYNTLM  YIE GR +EAL+LF E+PKKGL P+PASYSTAL AY+ 
Sbjct: 248  MDDMVSEGVQPNVVTYNTLMRTYIEQGRELEALKLFREMPKKGLTPSPASYSTALFAYRR 307

Query: 1352 LEDGFGALNFFIEAKDNYLKGEMRKEVGGDLENEFVKFENLTIRICFWVMRQWLVKNENP 1531
            LEDGFGA+ FF+E ++ Y  GE+      + E+EF K EN  +RIC+ VMRQWLVK EN 
Sbjct: 308  LEDGFGAITFFVETRERYQNGEIGNIEEENWEDEFAKLENFIVRICYQVMRQWLVKGENA 367

Query: 1532 CTNILKLLLEMDKVGLQAGRAEHERLVWACTHEEHHIVAKELYKRIRERETNISLSVCNH 1711
             TN+LKLL +MD+  LQ  RAE+ERLVWACT EEH++VAKELY RIRER+T+ISLSVCNH
Sbjct: 368  NTNVLKLLTDMDRARLQLSRAEYERLVWACTREEHYVVAKELYNRIRERDTDISLSVCNH 427

Query: 1712 VIWLMGKAKKWWAALEIYEDLLDRGPKPNNMSYELIVSHFNILLTAARKRGIWRWGVRLL 1891
            +IWLMGKAKKWWAALEIYEDLLD+GP+PNNMSYELIVSHFNILL+AARKRGIWRWGVRLL
Sbjct: 428  IIWLMGKAKKWWAALEIYEDLLDKGPQPNNMSYELIVSHFNILLSAARKRGIWRWGVRLL 487

Query: 1892 NKMEEKGLKPGSREWNAVLVSCSKAAETSAAVQIFKRMVERGEKPTVISYGALLSALEKG 2071
            NKMEEKGLKP SREWNAVLV+CSKA+ETSAAVQIF+RMVE+GEKPTVISYGALLSALEKG
Sbjct: 488  NKMEEKGLKPSSREWNAVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSALEKG 547

Query: 2072 KLYEEALQVWKHMVKVGVKPNLYAYTIMASVYTAQGKFNIVNSIIKEMVTVGVEPTVVTF 2251
            KLY+EALQVWKHM+KVG++PNLYAYTIMAS+YTAQGKFNIV+SIIKEMVT GVEPTVVTF
Sbjct: 548  KLYDEALQVWKHMIKVGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVTTGVEPTVVTF 607

Query: 2252 NAIISGCARNNLGNAAYEWFQRMKNHNISPNEVTYEMLIEALASDGKPRLVFELYLRAHN 2431
            NAIISGCARN + + AYEWFQRMK  NI+PNEV+YE+LIEALA+DGKPRL +ELY+RA  
Sbjct: 608  NAIISGCARNGMESVAYEWFQRMKTQNITPNEVSYEVLIEALANDGKPRLAYELYVRALT 667

Query: 2432 EGLSLSSKAYDLVICCSETYGATIDVNALGPRPPERKKKVQIRKNLSEFCNLADVPRRSK 2611
            EGLSLS+KAYD VI  ++ YGA+ID++ LGPRPPE+KK+VQIRK+LSEFC++ADVPRRS+
Sbjct: 668  EGLSLSTKAYDAVISSTQAYGASIDLSILGPRPPEKKKRVQIRKSLSEFCHIADVPRRSR 727

Query: 2612 PFDEKEIDSVHTQ 2650
            PFD +EI +  T+
Sbjct: 728  PFDREEIFTAQTK 740


>gb|EOY02618.1| Pentatricopeptide repeat (PPR-like) superfamily protein, putative
            [Theobroma cacao]
          Length = 741

 Score =  891 bits (2302), Expect = 0.0
 Identities = 457/757 (60%), Positives = 567/757 (74%), Gaps = 23/757 (3%)
 Frame = +2

Query: 431  MQALSTWPSRNESWFVPQLDIELGSVSKLRKSGPQRKRLYLAGSR----LEIANYLPCFG 598
            MQALS WP    S  VP LD ELGS          RK   LA SR    L +++Y     
Sbjct: 1    MQALSIWPLNVGSLVVPHLDFELGSSCFASTKPSSRKTWSLAESRGPSFLLLSSYSRFSR 60

Query: 599  NRTYFR---CKLFTKF----------------RGSLGAPCALSWVLEEAIDSHIVNEGSD 721
            + T +R   C L   F                RGS     AL+W LE+     I NE   
Sbjct: 61   SGTCYRNLNCSLRCGFLCWYSELKVVLFCEPKRGSSRGLVALAWALEQ---QEIGNELE- 116

Query: 722  SLHDVTEESANQSLDYVKLEVVRNTNLPSRSLDVDDDLKTEDGNNEEQMGIGKNARIDVR 901
                  EES ++  D        N N        +D  +  D ++E ++ + ++AR+DVR
Sbjct: 117  -----REESHSRDGD--------NGN--------EDKNEEMDASSEGEVELEESARLDVR 155

Query: 902  ALALSLQFARTANDVEEVLKDKGELPLQVYSSLIRGLGKEKKIDSAIALFEWLKRKSVDS 1081
            ALA SLQFA+TA+D+E+VLKD  ELPLQV+SS+I+G G++  +D+A+AL EWLKRK  DS
Sbjct: 156  ALASSLQFAKTADDIEKVLKDMDELPLQVHSSMIKGFGRDNYMDAAMALVEWLKRKKNDS 215

Query: 1082 NGAIRPNIFIYNSLLGAIKQSDKYDFVDSVLNEMTIEGVHPNVITYNTLMAIYIEHGRGV 1261
             G++ PN+FIYNSLLGA+K S ++  ++ +L +M  EGV PN++TYN LMAIY+E G   
Sbjct: 216  GGSVGPNLFIYNSLLGAVKHSKQFREMEKILKDMEEEGVIPNIVTYNVLMAIYLEQGEAT 275

Query: 1262 EALRLFEEIPKKGLYPTPASYSTALLAYQSLEDGFGALNFFIEAKDNYLKGEMRKEVGGD 1441
            +AL + EEI +KG  P+P SYSTALLAY+ +EDG GAL FFIE ++ Y+KG++ K+   +
Sbjct: 276  KALNVLEEIQEKGFSPSPVSYSTALLAYRRMEDGNGALKFFIELREKYVKGDLGKDADEN 335

Query: 1442 LENEFVKFENLTIRICFWVMRQWLVKNENPCTNILKLLLEMDKVGLQAGRAEHERLVWAC 1621
             E EFVK EN T+RIC  VMR+WLVK+EN  TN+LKLL +MD  GL+  + ++ER++WAC
Sbjct: 336  WEYEFVKLENFTVRICQQVMRRWLVKDENLSTNVLKLLRDMDNAGLKLSKEDYERIIWAC 395

Query: 1622 THEEHHIVAKELYKRIRERETNISLSVCNHVIWLMGKAKKWWAALEIYEDLLDRGPKPNN 1801
            T EEH++VAKELY RIRER + ISLSVCNH+IWLMGKAKKWWAALE+YE+LLD+GP PNN
Sbjct: 396  TCEEHYVVAKELYSRIRERHSEISLSVCNHLIWLMGKAKKWWAALEVYEELLDKGPSPNN 455

Query: 1802 MSYELIVSHFNILLTAARKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVSCSKAAETSA 1981
            +SYEL++SHFNILLTAARKRGIWRWGVRLLNKME+KGLKPGSREWNAVLV+CSKA+ET+A
Sbjct: 456  LSYELVMSHFNILLTAARKRGIWRWGVRLLNKMEDKGLKPGSREWNAVLVACSKASETTA 515

Query: 1982 AVQIFKRMVERGEKPTVISYGALLSALEKGKLYEEALQVWKHMVKVGVKPNLYAYTIMAS 2161
            AVQIF+RMVE+GEKPT+ISYGALLSALEKGKLY+EAL+VW HM+KVGVKPNLYAYTIMAS
Sbjct: 516  AVQIFRRMVEQGEKPTIISYGALLSALEKGKLYDEALRVWDHMIKVGVKPNLYAYTIMAS 575

Query: 2162 VYTAQGKFNIVNSIIKEMVTVGVEPTVVTFNAIISGCARNNLGNAAYEWFQRMKNHNISP 2341
            + T +G F +VN++ +EM + G+EPTVVT+NAIISGCARN + +AAYEWF RMK  NISP
Sbjct: 576  IVTGKGNFRMVNAVFQEMASSGIEPTVVTYNAIISGCARNGMSSAAYEWFHRMKVQNISP 635

Query: 2342 NEVTYEMLIEALASDGKPRLVFELYLRAHNEGLSLSSKAYDLVICCSETYGATIDVNALG 2521
            NE+TY+MLIEALA DGKPRL +ELYLRAHNEGL+LSSKAYD V+  S+ YGAT D++ LG
Sbjct: 636  NEITYQMLIEALAKDGKPRLAYELYLRAHNEGLNLSSKAYDAVVQSSQVYGATTDLSVLG 695

Query: 2522 PRPPERKKKVQIRKNLSEFCNLADVPRRSKPFDEKEI 2632
            PRPP++K KVQIRK L+EFCNLADVPRRSKPFD KEI
Sbjct: 696  PRPPDKKMKVQIRKTLTEFCNLADVPRRSKPFDRKEI 732


>ref|XP_006447217.1| hypothetical protein CICLE_v10014357mg [Citrus clementina]
            gi|568831365|ref|XP_006469938.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g46610-like [Citrus sinensis]
            gi|557549828|gb|ESR60457.1| hypothetical protein
            CICLE_v10014357mg [Citrus clementina]
          Length = 768

 Score =  890 bits (2300), Expect = 0.0
 Identities = 458/770 (59%), Positives = 571/770 (74%), Gaps = 30/770 (3%)
 Frame = +2

Query: 431  MQALSTWPSRNESWFVPQLDIELGSVSKLRKSGPQRKRLYLAGSRLEIAN--YLPCFGNR 604
            MQ LS WP +     VPQL  ++ S S L     +RK+  L  S     N  +L    N 
Sbjct: 1    MQPLSVWPLKGGFAAVPQLHFDVVSSSFLSTRNRRRKKWSLVESVCHSRNTGFLLVSSNS 60

Query: 605  TYF-------------RCKLFTKF------------RGSLGAPCALSWVLEEA-IDSHIV 706
            T+              +C+  + F            +   GA    +W +E+  I + ++
Sbjct: 61   TFSCCGVCCRSIKLDSKCEFLSGFSSHKLVLFCEPKKSYFGASVMFAWSMEQQEIGNGLL 120

Query: 707  NEGSDSLHDVTEESANQSLDYVKLEVVRNTNLPSRSLDVDDDLKTEDGNNEEQMGIGK-- 880
             E  +S   +  E+ +  +DY  +  V +T       D  + +++E+     + G+GK  
Sbjct: 121  VEEPNSADGLLVETESDIVDYRSVHRVEDTG------DNGNQVESEEVEIIGERGVGKQK 174

Query: 881  NARIDVRALALSLQFARTANDVEEVLKDKGELPLQVYSSLIRGLGKEKKIDSAIALFEWL 1060
            + R+DV+ALA SL   +TA+DVEEVLKD GELP QV+SS+IRG GKEK+ D A+AL EWL
Sbjct: 175  SGRVDVKALAQSLWHTKTADDVEEVLKDMGELPPQVHSSMIRGFGKEKRTDCAMALVEWL 234

Query: 1061 KRKSVDSNGAIRPNIFIYNSLLGAIKQSDKYDFVDSVLNEMTIEGVHPNVITYNTLMAIY 1240
            KRK  ++ G I PN+F+YNSLLGA+KQS K++ +D ++N+M  EGV+PNV+TYNTLMAIY
Sbjct: 235  KRKKRETGGFIGPNLFVYNSLLGAVKQSQKFEEMDRIMNDMAEEGVNPNVVTYNTLMAIY 294

Query: 1241 IEHGRGVEALRLFEEIPKKGLYPTPASYSTALLAYQSLEDGFGALNFFIEAKDNYLKGEM 1420
            IE G G +AL + EEI KKGL P+  SYS ALLAY+ +EDG GAL FF+E ++ YLKGE+
Sbjct: 295  IEQGEGTKALNVLEEIKKKGLTPSAVSYSQALLAYRRMEDGNGALKFFVELREKYLKGEI 354

Query: 1421 RKEVGGDLENEFVKFENLTIRICFWVMRQWLVKNENPCTNILKLLLEMDKVGLQAGRAEH 1600
             K    + ENEFVK ++  IRIC+ VMR+WLVK+EN  TN+LKLL+EMDK GL+  +AE+
Sbjct: 355  GKGDDENWENEFVKLKDFIIRICYQVMRRWLVKDENLSTNVLKLLIEMDKAGLRPVKAEY 414

Query: 1601 ERLVWACTHEEHHIVAKELYKRIRERETNISLSVCNHVIWLMGKAKKWWAALEIYEDLLD 1780
            ERLVWACT EEH++VAKE Y RIRER   ISLSVCNH+IWLMGKAKKWWAALE+YEDLLD
Sbjct: 415  ERLVWACTREEHYVVAKEFYARIRERHDEISLSVCNHLIWLMGKAKKWWAALEVYEDLLD 474

Query: 1781 RGPKPNNMSYELIVSHFNILLTAARKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVSCS 1960
            +GPKPNNMSYELIVSHFNILL+AARKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLV+CS
Sbjct: 475  KGPKPNNMSYELIVSHFNILLSAARKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACS 534

Query: 1961 KAAETSAAVQIFKRMVERGEKPTVISYGALLSALEKGKLYEEALQVWKHMVKVGVKPNLY 2140
            KA+E +AAVQIFKRMVE+GEKPT+ISYGALLSALEKGKLY+EA +VW+HM+ VG +PNLY
Sbjct: 535  KASEYNAAVQIFKRMVEKGEKPTIISYGALLSALEKGKLYDEASRVWQHMLNVGAEPNLY 594

Query: 2141 AYTIMASVYTAQGKFNIVNSIIKEMVTVGVEPTVVTFNAIISGCARNNLGNAAYEWFQRM 2320
            AYTIMAS++TAQGKFN+V  I +EM +  +EPTVVT+NAIIS C +N + +AAYEWF RM
Sbjct: 595  AYTIMASIFTAQGKFNLVELIFREMASSRIEPTVVTYNAIISACGQNGMSSAAYEWFHRM 654

Query: 2321 KNHNISPNEVTYEMLIEALASDGKPRLVFELYLRAHNEGLSLSSKAYDLVICCSETYGAT 2500
            K  NISPNE+TYEMLIEALA DGKPRL ++LYLRA NE L+LSSKAYD ++  S+ YGAT
Sbjct: 655  KVQNISPNEITYEMLIEALAKDGKPRLAYDLYLRARNEELNLSSKAYDAILEFSQVYGAT 714

Query: 2501 IDVNALGPRPPERKKKVQIRKNLSEFCNLADVPRRSKPFDEKEIDSVHTQ 2650
            ID+  LGPRPP++KKKV IRKNLS FC+ ADVPRRSKPFD+KEI +  T+
Sbjct: 715  IDLTVLGPRPPDKKKKVVIRKNLSNFCHFADVPRRSKPFDKKEIYTPQTE 764


>ref|XP_002324000.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222867002|gb|EEF04133.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 709

 Score =  861 bits (2224), Expect = 0.0
 Identities = 448/742 (60%), Positives = 555/742 (74%), Gaps = 8/742 (1%)
 Frame = +2

Query: 431  MQALSTWPSRNESWFVPQLDIE------LGSVSKLRKSGPQRKRLYLAGSRLE-IANYLP 589
            MQ LS WP    S  VP L+ E      L +   +++ G        A S    ++  L 
Sbjct: 1    MQTLSVWPLSGGSCAVPHLEFEEDSSCFLSTRRGIKRWGLVDNVFQGASSGFPMVSGDLR 60

Query: 590  CFGNRTYFRCKLFTKFR-GSLGAPCALSWVLEEAIDSHIVNEGSDSLHDVTEESANQSLD 766
               N +  +   F + + GS G+  AL+  LE+     I NE     H V          
Sbjct: 61   FLSNHSKIKYVCFRETKEGSFGSSLALASALEQ---QKIGNE----FHRV---------- 103

Query: 767  YVKLEVVRNTNLPSRSLDVDDDLKTEDGNNEEQMGIGKNARIDVRALALSLQFARTANDV 946
                     ++L  RSL               + G  ++ +IDV ALA SL FA+T +D+
Sbjct: 104  --------ESSLDDRSLG--------------EAGEERDEKIDVPALAQSLYFAKTVDDI 141

Query: 947  EEVLKDKGELPLQVYSSLIRGLGKEKKIDSAIALFEWLKRKSVDSNGAIRPNIFIYNSLL 1126
            EEVLKDKGELP+QVY S+I+G G +KK++ AIAL +WLK K  +++G I PN+FIYNSLL
Sbjct: 142  EEVLKDKGELPVQVYLSMIKGFGWDKKMEPAIALVDWLKIKK-ETDGTIVPNLFIYNSLL 200

Query: 1127 GAIKQSDKYDFVDSVLNEMTIEGVHPNVITYNTLMAIYIEHGRGVEALRLFEEIPKKGLY 1306
             A+KQS++Y+  + +L  MT EGV PNV+TYN LM IY++ G+  +AL + EE+ + G  
Sbjct: 201  SAVKQSEQYEETEKILERMTQEGVAPNVVTYNILMVIYVKQGQAKKALDVLEEMRRNGFT 260

Query: 1307 PTPASYSTALLAYQSLEDGFGALNFFIEAKDNYLKGEMRKEVGGDLENEFVKFENLTIRI 1486
            P+ ASYS+ALLAY+ +EDG GAL FF+E KD Y+KGE+ K+   D E E+VK EN TIR+
Sbjct: 261  PSAASYSSALLAYRKMEDGDGALKFFVEIKDKYMKGEIGKDADEDWEREYVKLENFTIRV 320

Query: 1487 CFWVMRQWLVKNENPCTNILKLLLEMDKVGLQAGRAEHERLVWACTHEEHHIVAKELYKR 1666
            C+ VMR+WLV+ EN  TN+LKLL +MDK  LQ GR+++ERLVWACT EEH++VAKELY R
Sbjct: 321  CYQVMRRWLVRLENLNTNVLKLLTDMDKAELQPGRSDYERLVWACTREEHYVVAKELYIR 380

Query: 1667 IRERETNISLSVCNHVIWLMGKAKKWWAALEIYEDLLDRGPKPNNMSYELIVSHFNILLT 1846
            IRER ++ISLSVCNHVIWLMGKAKKWWAALE+YEDLLD+GPKPNN+SYELIVS+FN+LLT
Sbjct: 381  IRERCSDISLSVCNHVIWLMGKAKKWWAALEVYEDLLDKGPKPNNLSYELIVSYFNVLLT 440

Query: 1847 AARKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVSCSKAAETSAAVQIFKRMVERGEKP 2026
            AA+KRGIWRWGVRLLNKMEEKGLKPGS+EWNAVLV+CSKA+ET+AAVQIF+RMVE+GEKP
Sbjct: 441  AAKKRGIWRWGVRLLNKMEEKGLKPGSKEWNAVLVACSKASETAAAVQIFRRMVEQGEKP 500

Query: 2027 TVISYGALLSALEKGKLYEEALQVWKHMVKVGVKPNLYAYTIMASVYTAQGKFNIVNSII 2206
            TVISYGALLSALEKG+LY+EA++VW+HM+KVGVKPN+YAYTIMASV+T QG F +V++II
Sbjct: 501  TVISYGALLSALEKGRLYDEAVRVWEHMLKVGVKPNVYAYTIMASVFTRQGNFRLVDAII 560

Query: 2207 KEMVTVGVEPTVVTFNAIISGCARNNLGNAAYEWFQRMKNHNISPNEVTYEMLIEALASD 2386
             EMV+ G+EPTVVT+NAIISGCARNNL +AAYEWF RMK  NISPNE+TY+MLIEALA  
Sbjct: 561  NEMVSTGIEPTVVTYNAIISGCARNNLSSAAYEWFHRMKVQNISPNEITYDMLIEALAKS 620

Query: 2387 GKPRLVFELYLRAHNEGLSLSSKAYDLVICCSETYGATIDVNALGPRPPERKKKVQIRKN 2566
            GKPRL +ELYLRA NE L LS KAYD V+  SE YGATID + LGPRPP++KKKVQIRK 
Sbjct: 621  GKPRLAYELYLRAQNEDLQLSPKAYDAVMHSSEAYGATIDTSVLGPRPPDKKKKVQIRKT 680

Query: 2567 LSEFCNLADVPRRSKPFDEKEI 2632
            L+EFCNLADVPRRSKPF++KEI
Sbjct: 681  LTEFCNLADVPRRSKPFNKKEI 702


>ref|XP_002526948.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223533700|gb|EEF35435.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 671

 Score =  852 bits (2200), Expect = 0.0
 Identities = 426/672 (63%), Positives = 527/672 (78%), Gaps = 6/672 (0%)
 Frame = +2

Query: 635  FRGSLGAPCALSWVLEEAIDSHIVNEGSDSLHDVTEESANQSLDYVKLEVVRNTNLPSRS 814
            FR S+    A +W L++        + S   H V     +  L   + E V   NL  R 
Sbjct: 4    FRSSI----AFAWALQK-------QDISSEFHGVEPSLDDGLLGKSEKEDVNPHNL-GRL 51

Query: 815  LDVDDDLKTEDGNNE------EQMGIGKNARIDVRALALSLQFARTANDVEEVLKDKGEL 976
             D DDD   ++ N E      E +G  K   IDVR+LA SL  A+TA+DVEEVLKDKGEL
Sbjct: 52   EDSDDDNNNQEDNIELDLRSKEGVGEEKCRSIDVRSLARSLHSAQTADDVEEVLKDKGEL 111

Query: 977  PLQVYSSLIRGLGKEKKIDSAIALFEWLKRKSVDSNGAIRPNIFIYNSLLGAIKQSDKYD 1156
            PLQVYSS+I+  G + K++SA+AL EWLKR+  +   +I PN+FIYNSLL A+K+S  ++
Sbjct: 112  PLQVYSSMIKAFGWDNKMESALALVEWLKRRK-EIGSSIGPNLFIYNSLLSAVKKSKLFE 170

Query: 1157 FVDSVLNEMTIEGVHPNVITYNTLMAIYIEHGRGVEALRLFEEIPKKGLYPTPASYSTAL 1336
              + +LN+MT EG+ PNV+TYNTLM IY+E G+  +AL + E++ +KG  PT ASYSTAL
Sbjct: 171  EAEKILNDMTQEGIAPNVVTYNTLMGIYVEKGQATKALNILEQMHEKGFIPTAASYSTAL 230

Query: 1337 LAYQSLEDGFGALNFFIEAKDNYLKGEMRKEVGGDLENEFVKFENLTIRICFWVMRQWLV 1516
            LAY+ +EDG GAL FF++ KD YLKG++ K    + ENEFVK E   IRIC+ VMR+WLV
Sbjct: 231  LAYRGMEDGHGALAFFVDIKDKYLKGKIGKNSDENWENEFVKLETFIIRICYQVMRRWLV 290

Query: 1517 KNENPCTNILKLLLEMDKVGLQAGRAEHERLVWACTHEEHHIVAKELYKRIRERETNISL 1696
            +++N  T++LKLL +MDK GLQ  +AE+ERLVWACT E+H+ V KELY RIRER + ISL
Sbjct: 291  RHDNFSTDVLKLLTDMDKAGLQPSQAEYERLVWACTREDHYAVGKELYIRIRERHSKISL 350

Query: 1697 SVCNHVIWLMGKAKKWWAALEIYEDLLDRGPKPNNMSYELIVSHFNILLTAARKRGIWRW 1876
            SVCNH+IWLMGKAKKWWAALEIYEDLLD+GP PNNMSYELIVSHFNILLTAARKRGIWRW
Sbjct: 351  SVCNHLIWLMGKAKKWWAALEIYEDLLDKGPNPNNMSYELIVSHFNILLTAARKRGIWRW 410

Query: 1877 GVRLLNKMEEKGLKPGSREWNAVLVSCSKAAETSAAVQIFKRMVERGEKPTVISYGALLS 2056
            GVRLLNKME+KGLKPGSREWNAVLV+CSKA+ET+AAVQIF+RM+E+GEKPT++SYGALLS
Sbjct: 411  GVRLLNKMEDKGLKPGSREWNAVLVACSKASETTAAVQIFRRMIEQGEKPTIVSYGALLS 470

Query: 2057 ALEKGKLYEEALQVWKHMVKVGVKPNLYAYTIMASVYTAQGKFNIVNSIIKEMVTVGVEP 2236
            ALEKGKLY+EA++VW+HM+KV VKPNLYAYTIMASV+  QGKF  V++II++MV+ G+EP
Sbjct: 471  ALEKGKLYDEAVRVWEHMLKVDVKPNLYAYTIMASVFAGQGKFTYVDAIIQKMVSSGIEP 530

Query: 2237 TVVTFNAIISGCARNNLGNAAYEWFQRMKNHNISPNEVTYEMLIEALASDGKPRLVFELY 2416
            T++T+NAIISGC  NNL +AAYEWF RMK  N+ PN++TYEMLIEALA DGKPRL +ELY
Sbjct: 531  TIITYNAIISGCTHNNLSSAAYEWFHRMKVQNMPPNKITYEMLIEALAKDGKPRLAYELY 590

Query: 2417 LRAHNEGLSLSSKAYDLVICCSETYGATIDVNALGPRPPERKKKVQIRKNLSEFCNLADV 2596
            LRA  EGL LS+K YD V+  S+ YGATID+N LGPRPP++KK+V+IRK L+EFC+LADV
Sbjct: 591  LRAKYEGLDLSAKVYDAVLRSSQVYGATIDINVLGPRPPDKKKRVKIRKTLTEFCDLADV 650

Query: 2597 PRRSKPFDEKEI 2632
            PRRSKPF+  EI
Sbjct: 651  PRRSKPFERHEI 662


>gb|EMJ21432.1| hypothetical protein PRUPE_ppa001979mg [Prunus persica]
          Length = 734

 Score =  851 bits (2198), Expect = 0.0
 Identities = 447/743 (60%), Positives = 554/743 (74%), Gaps = 28/743 (3%)
 Frame = +2

Query: 431  MQALSTWPSRNESWFVPQLDIELGSVSKLRKSGPQRKRLYL--------AGSRLEIANYL 586
            MQAL TWPSR E+W VPQL  ELGS  K      ++K   L        +G+ L +++  
Sbjct: 1    MQALVTWPSRAETWAVPQLGFELGSSCKFSTRIRRKKMWSLGFPVCYGRSGAVLLLSSNS 60

Query: 587  PCFGNRTY-------FRCKLFTKF------------RGSLGAPCALSWVLEE-AIDSHIV 706
               G   +       F C  F+ +            + S GA   ++W LEE AI + IV
Sbjct: 61   GAIGAEAFSGSPKFDFGCGCFSGYSKLKPARICQSKKRSFGASFVVAWALEEQAIGNDIV 120

Query: 707  NEGSDSLHDVTEESANQSLDYVKLEVVRNTNLPSRSLDVDDDLKTEDGNNEEQMGIGKNA 886
             E S S H ++ E  ++ +D++ ++           +DV +      G N EQ    KN 
Sbjct: 121  IEESTSEHRLSGEGESKGVDHLIVDEAEGGE-DKNEVDVRNG-----GANWEQ----KNE 170

Query: 887  RIDVRALALSLQFARTANDVEEVLKDKGELPLQVYSSLIRGLGKEKKIDSAIALFEWLKR 1066
            +IDVRALALSLQFA+TA+DVE VLKDKG+LPLQV+SS+IRG G+++ +DSA A+ EWLKR
Sbjct: 171  KIDVRALALSLQFAKTADDVEVVLKDKGDLPLQVFSSMIRGFGRDRLMDSAFAVVEWLKR 230

Query: 1067 KSVDSNGAIRPNIFIYNSLLGAIKQSDKYDFVDSVLNEMTIEGVHPNVITYNTLMAIYIE 1246
            KS ++NG+I PN+FIYNSLLGA+KQS ++  +D VL+ MT EGV  NV+TYNT MAIYIE
Sbjct: 231  KSEETNGSITPNLFIYNSLLGAVKQSKQFGEMDKVLSAMTEEGVELNVVTYNTKMAIYIE 290

Query: 1247 HGRGVEALRLFEEIPKKGLYPTPASYSTALLAYQSLEDGFGALNFFIEAKDNYLKGEMRK 1426
             G   +AL + E+I KKGL P+  SYSTALLAYQ +EDG GAL FFIE ++ Y KG++ K
Sbjct: 291  QGLSTKALDVLEDIEKKGLIPSSVSYSTALLAYQRMEDGNGALQFFIEFREKYHKGDISK 350

Query: 1427 EVGGDLENEFVKFENLTIRICFWVMRQWLVKNENPCTNILKLLLEMDKVGLQAGRAEHER 1606
            E   D E+EF++ EN T R+C+ VMR+WLVK++N  TN+LKLL +MD  G+   RAEHER
Sbjct: 351  ESVEDWEHEFIQLENFTKRVCYQVMRRWLVKDDNLSTNVLKLLAQMDIAGVPLSRAEHER 410

Query: 1607 LVWACTHEEHHIVAKELYKRIRERETNISLSVCNHVIWLMGKAKKWWAALEIYEDLLDRG 1786
            L+WACT EEH+ VAKELY RIRER T I +SVCNHVIWLMGKAKKWWAALEIYED+LDRG
Sbjct: 411  LLWACTREEHYTVAKELYNRIRERHTEIGISVCNHVIWLMGKAKKWWAALEIYEDMLDRG 470

Query: 1787 PKPNNMSYELIVSHFNILLTAARKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVSCSKA 1966
            PKPNNMSYELIVSHFN+LLTAARKRGIWRWG+RLLNKMEEKGLKP S+EWNAVLV+CSKA
Sbjct: 471  PKPNNMSYELIVSHFNVLLTAARKRGIWRWGIRLLNKMEEKGLKPRSKEWNAVLVACSKA 530

Query: 1967 AETSAAVQIFKRMVERGEKPTVISYGALLSALEKGKLYEEALQVWKHMVKVGVKPNLYAY 2146
            AETSAAV+IFKRMVE+G+KPTV+SYGALLSALEKGKLY+EA QVW+HM+KVGVKPNLYAY
Sbjct: 531  AETSAAVKIFKRMVEQGQKPTVLSYGALLSALEKGKLYDEARQVWEHMLKVGVKPNLYAY 590

Query: 2147 TIMASVYTAQGKFNIVNSIIKEMVTVGVEPTVVTFNAIISGCARNNLGNAAYEWFQRMKN 2326
            TIMASV++  GK N+V++II EMV+ G+EPTVVT+NAIISG ARN   NAAYEWFQRMK+
Sbjct: 591  TIMASVFSGHGKLNMVDTIIHEMVSSGIEPTVVTYNAIISGFARNGSTNAAYEWFQRMKD 650

Query: 2327 HNISPNEVTYEMLIEALASDGKPRLVFELYLRAHNEGLSLSSKAYDLVICCSETYGATID 2506
             NISPN VTYEM+IE LA+ GKPRL ++LYL A N+GL LS K+YD+V+  S   G  I+
Sbjct: 651  QNISPNNVTYEMMIEGLANGGKPRLAYDLYLTAQNQGLDLSPKSYDIVVQSSLASGVAIE 710

Query: 2507 VNALGPRPPERKKKVQIRKNLSE 2575
               LG RPP++K++VQ RK+ ++
Sbjct: 711  -GFLGARPPDKKEEVQGRKSSTQ 732


>gb|EXC31403.1| hypothetical protein L484_017686 [Morus notabilis]
          Length = 737

 Score =  839 bits (2167), Expect = 0.0
 Identities = 438/747 (58%), Positives = 549/747 (73%), Gaps = 40/747 (5%)
 Frame = +2

Query: 431  MQALSTWPSRNESWFVPQLDIELGSVSKLRKSGPQRKRLYLAGSRLEIANYLP-CFGNRT 607
            MQALSTWP + + W VPQL  E  S   L+ S  +R++     + L+   + P C G  T
Sbjct: 1    MQALSTWPLKGDLWIVPQLSSEKSS--SLKTSSRRRRK-----NVLDFGFHFPVCHGRIT 53

Query: 608  YFR--------------------------------------CKLFTKFRGSLGAPCALSW 673
             F                                       CK   K + SLGA  AL+ 
Sbjct: 54   GFVLSTRNSRGVGYGGFCDRPKFDLGCGFLFGFSKLKVARFCK--PKKKSSLGASVALAG 111

Query: 674  VLEE-AIDSHIVNEGSDSLHDVTEESANQSLDYVKLEVVRNTNLPSRSLDVDDDLKTEDG 850
             LEE A+ S I  E  DS   ++ + ++  L   ++E   + N      +  ++   ED 
Sbjct: 112  ALEEQAVGSAIRIEELDSECSLSGKLSDGHLLLGRIESGDDNN----GDEEQENKVIEDV 167

Query: 851  NNEEQMGIGKNARIDVRALALSLQFARTANDVEEVLKDKGELPLQVYSSLIRGLGKEKKI 1030
             +EE+    K  ++DVR LA SL+FA+TA+DV+EVLKDKGELP QV+S++IRGLG+EK +
Sbjct: 168  GSEEKSREEKGGKVDVRELASSLRFAKTADDVDEVLKDKGELPPQVFSTMIRGLGREKLL 227

Query: 1031 DSAIALFEWLKRKSVDSNGAIRPNIFIYNSLLGAIKQSDKYDFVDSVLNEMTIEGVHPNV 1210
            D A AL EWLKRK  ++NG I  N+FIYNSLLGA+KQS+++  ++ VLN M  EGV PNV
Sbjct: 228  DPAFALLEWLKRKKEENNGLISLNLFIYNSLLGAVKQSEQFGEMEKVLNYMAQEGVVPNV 287

Query: 1211 ITYNTLMAIYIEHGRGVEALRLFEEIPKKGLYPTPASYSTALLAYQSLEDGFGALNFFIE 1390
            +TYNT+MAI++E+G G +AL + EEI KKGL P+P SYSTALLAY+ +EDG GAL FF+E
Sbjct: 288  VTYNTMMAIHLENGEGTKALSVLEEIRKKGLTPSPVSYSTALLAYRRMEDGHGALKFFVE 347

Query: 1391 AKDNYLKGEMRKEVGGDLENEFVKFENLTIRICFWVMRQWLVKNENPCTNILKLLLEMDK 1570
             ++ Y KGEM K+   D ENEFVK EN TIR+C+ VMR WLV  +N  TN+LKLL +MD 
Sbjct: 348  IREKYQKGEMGKDDDEDWENEFVKLENFTIRVCYQVMRHWLVNEDNLSTNVLKLLTKMDI 407

Query: 1571 VGLQAGRAEHERLVWACTHEEHHIVAKELYKRIRERETNISLSVCNHVIWLMGKAKKWWA 1750
             G+   R+EHERL+WACT EEHH+VAKELY RIRE  ++ISLSVCNH IWLMGKAK+WW 
Sbjct: 408  AGIPPSRSEHERLLWACTREEHHLVAKELYDRIREGYSDISLSVCNHTIWLMGKAKRWWT 467

Query: 1751 ALEIYEDLLDRGPKPNNMSYELIVSHFNILLTAARKRGIWRWGVRLLNKMEEKGLKPGSR 1930
            ALEIYEDLLD+GP+PNNMSYE+IVSHFNILLTAARKRGIW+WGVRLLNKMEEKGLKPGS+
Sbjct: 468  ALEIYEDLLDKGPQPNNMSYEIIVSHFNILLTAARKRGIWKWGVRLLNKMEEKGLKPGSK 527

Query: 1931 EWNAVLVSCSKAAETSAAVQIFKRMVERGEKPTVISYGALLSALEKGKLYEEALQVWKHM 2110
            EWNAVL++CSKA+ETSAAV+IFKRMVE+G+KPT +SYGALLSALEKGKLY+EA QVW+HM
Sbjct: 528  EWNAVLIACSKASETSAAVKIFKRMVEQGQKPTFLSYGALLSALEKGKLYDEARQVWEHM 587

Query: 2111 VKVGVKPNLYAYTIMASVYTAQGKFNIVNSIIKEMVTVGVEPTVVTFNAIISGCARNNLG 2290
            +KVG++PN+YAYTIMASV+   GKFN+V+++I EMV+ G+EPTVVT+NAIISGCARN++ 
Sbjct: 588  LKVGIRPNVYAYTIMASVFAGHGKFNMVDTVIHEMVSSGIEPTVVTYNAIISGCARNDMI 647

Query: 2291 NAAYEWFQRMKNHNISPNEVTYEMLIEALASDGKPRLVFELYLRAHNEGLSLSSKAYDLV 2470
            + A+EWF RMK  +I+PN VTYEMLIEALA+D KPRL +ELYLRA NEGL L+ KAYD+V
Sbjct: 648  DMAFEWFHRMKAQSITPNNVTYEMLIEALANDCKPRLAYELYLRAQNEGLRLAPKAYDIV 707

Query: 2471 ICCSETYGATIDVNALGPRPPERKKKV 2551
            +  S+ +GATID+  LGPRPPERK KV
Sbjct: 708  VESSQYHGATIDLRLLGPRPPERKGKV 734


>gb|ESW12830.1| hypothetical protein PHAVU_008G145600g [Phaseolus vulgaris]
          Length = 752

 Score =  830 bits (2145), Expect = 0.0
 Identities = 431/758 (56%), Positives = 548/758 (72%), Gaps = 21/758 (2%)
 Frame = +2

Query: 440  LSTWPSRNESWFVPQLDIELGSVSKLRKSGPQRKRLYLAGSRLEIANYLPC---FGNRTY 610
            +STWP +  +W V    I+    S L +    +       S     +   C   +G   +
Sbjct: 2    ISTWPFKLNNWVVSHFQIDHSGSSDLNRRRRVKLGCVFKVSHCAQISVFQCSRGYGTVVF 61

Query: 611  F-RCKLFTKFRGSLGAPCALSWVLEEAIDSHIVNEGSD---SLHD--VTEESANQSLD-Y 769
                KL  +    LG+P     ++ +   SHI +       +L D  V  E   +++D  
Sbjct: 62   SGHSKLDLRCGFLLGSPQPKFGIILKQNKSHIGDLAPPLGWALEDEGVVSELVEENIDSN 121

Query: 770  VKLEVVRNTNLPSRSLDVDDDLKTEDGNNEEQMGIGKNA----------RIDVRALALSL 919
             + EV+++ NL           + +D + E +MG+G+N+          ++DVRALAL L
Sbjct: 122  GESEVIKSLNLG----------QVQDSDCEPKMGVGENSKEGGKEESFGKVDVRALALRL 171

Query: 920  QFARTANDVEEVLKDKGELPLQVYSSLIRGLGKEKKIDSAIALFEWLKRKSVDSNGAIRP 1099
            Q A T +DV E+L DK +LPLQV+S++I   GKEK++DSA+ LFEW+K++ +++NG+  P
Sbjct: 172  QTALTVDDVREILVDKRDLPLQVFSTIINSFGKEKRMDSALILFEWMKKRKIETNGSFGP 231

Query: 1100 NIFIYNSLLGAIKQSDKYDFVDSVLNEMTIEGVHPNVITYNTLMAIYIEHGRGVEALRLF 1279
            N+FIYN LLG +KQS ++  ++++LNEM  +G+  NV+TYNTLMAIYIE G    AL + 
Sbjct: 232  NLFIYNGLLGVVKQSGQFAQMETILNEMAKDGISYNVVTYNTLMAIYIEKGEFDRALNVL 291

Query: 1280 EEIPKKGLYPTPASYSTALLAYQSLEDGFGALNFFIEAKDNYLKGEMRKEVGG-DLENEF 1456
            EEI   G  P+P SYS ALLAY+ +ED  GALNFF+E ++NY +GE+ ++  G D E E 
Sbjct: 292  EEIHGNGFTPSPVSYSQALLAYRRMEDCNGALNFFVELRENYHRGEIGEDDDGEDWEEEL 351

Query: 1457 VKFENLTIRICFWVMRQWLVKNENPCTNILKLLLEMDKVGLQAGRAEHERLVWACTHEEH 1636
            +K E  TIRIC+ VMR WLV ++N   N+LK L++MD  G+   RA+ ERLVWACT E+H
Sbjct: 352  MKLEKFTIRICYQVMRCWLVSSDNLSKNVLKFLVDMDNAGIPLTRADLERLVWACTREDH 411

Query: 1637 HIVAKELYKRIRERETNISLSVCNHVIWLMGKAKKWWAALEIYEDLLDRGPKPNNMSYEL 1816
            +IV KELY RIRER   ISLSVCNH IWLMGKAKKWWAALEIYEDLLD+GPKPNN+SYEL
Sbjct: 412  YIVVKELYTRIRERYDKISLSVCNHAIWLMGKAKKWWAALEIYEDLLDKGPKPNNLSYEL 471

Query: 1817 IVSHFNILLTAARKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVSCSKAAETSAAVQIF 1996
            IVSHFN LL AA+++GIWRWGVRLLNKMEEKGLKPGSREWNAVLV+CSKA+ET+AAVQIF
Sbjct: 472  IVSHFNFLLNAAKRKGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKASETTAAVQIF 531

Query: 1997 KRMVERGEKPTVISYGALLSALEKGKLYEEALQVWKHMVKVGVKPNLYAYTIMASVYTAQ 2176
            KRMVE GEKPTVISYGALLSALEKGKLY++AL+VW HMVKVGV+PN YAYTIMAS+YTAQ
Sbjct: 532  KRMVENGEKPTVISYGALLSALEKGKLYDDALRVWNHMVKVGVEPNAYAYTIMASIYTAQ 591

Query: 2177 GKFNIVNSIIKEMVTVGVEPTVVTFNAIISGCARNNLGNAAYEWFQRMKNHNISPNEVTY 2356
            G FN V++I++EMVT+G+E TVVT+NAIISGCARN + +AAYEWF RMK  NI+PNE+TY
Sbjct: 592  GNFNRVDAIVQEMVTIGIEVTVVTYNAIISGCARNGMSSAAYEWFHRMKVQNITPNEITY 651

Query: 2357 EMLIEALASDGKPRLVFELYLRAHNEGLSLSSKAYDLVICCSETYGATIDVNALGPRPPE 2536
            EMLIEALA+DGKPRL ++LY RA NEGL+LSSKAYD+V+  S+  GAT ++  LGPRP +
Sbjct: 652  EMLIEALANDGKPRLAYQLYTRAKNEGLTLSSKAYDVVVHSSQANGATTELGLLGPRPAD 711

Query: 2537 RKKKVQIRKNLSEFCNLADVPRRSKPFDEKEIDSVHTQ 2650
            +KKKVQIRK L+EF NLA VPRRS  FD  EI   HTQ
Sbjct: 712  KKKKVQIRKTLTEFYNLAGVPRRSNQFDTSEIYRSHTQ 749


>ref|XP_003548551.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like
            [Glycine max]
          Length = 808

 Score =  819 bits (2116), Expect = 0.0
 Identities = 434/812 (53%), Positives = 561/812 (69%), Gaps = 72/812 (8%)
 Frame = +2

Query: 440  LSTWPSRNESWFVPQLDIELGSVSKLRKSGPQRKRLYLAGS-----RLEIANYLPCFGNR 604
            +STWPS+     VP+ +I    V+   +   +R +L  A S     ++ +  +   +G  
Sbjct: 2    ISTWPSKVNHLVVPRFEIGPSGVTDQNRR--RRVKLGFAFSVSHSEKVSVFQFSRGYGTV 59

Query: 605  TY-------FRC---------------KLFTKFRGSLGAPCALSWVLEE-AIDSHIVNEG 715
             +        RC               K      G L  P  L W LEE  + S +V+E 
Sbjct: 60   VFSGHAKLDLRCGFLLGCSRPKLGIILKPHKSHVGDLAPP--LGWALEEDGVGSELVDEQ 117

Query: 716  SDSLHDVTEESANQSLDYVKLEVVRNTNLPSRSLDVDDDLKTEDGNN--EEQM------- 868
             DS +D +    ++ +  + L+ V++++   +    DDD K   GN   EEQ        
Sbjct: 118  IDS-NDASVNRESEGVKSLNLDQVQDSDFEGQIRGYDDDSKESGGNELVEEQTDSNDALV 176

Query: 869  -----GI--------------GK---------------NARIDVRALALSLQFARTANDV 946
                 G+              GK               + ++DVRALALSLQ  +T  DV
Sbjct: 177  NGDLEGVKSLNLDQVKDSDCEGKMCGDDNSKEGGEEESDGKVDVRALALSLQTVKTVEDV 236

Query: 947  EEVLKDKGELPLQVYSSLIRGLGKEKKIDSAIALFEWLKRKSVDSNGAIRPNIFIYNSLL 1126
              +LKDKG+LPLQV+S++I G GKEK++DSA+ LF W+K++ +++NG+  PN+FIYN LL
Sbjct: 237  GGILKDKGDLPLQVFSTIISGFGKEKRMDSALILFNWMKKRKIETNGSFGPNLFIYNGLL 296

Query: 1127 GAIKQSDKYDFVDSVLNEMTIEGVHPNVITYNTLMAIYIEHGRGVEALRLFEEIPKKGLY 1306
            G +KQS ++  ++ +LNEM  +G+  NV+TYNTLMAIYIE G   +AL + EEI + GL 
Sbjct: 297  GVVKQSGQFAEMEVILNEMAEDGIAYNVVTYNTLMAIYIEKGECDKALNMLEEIRRNGLT 356

Query: 1307 PTPASYSTALLAYQSLEDGFGALNFFIEAKDNYLKGEMRKEVGG-DLENEFVKFENLTIR 1483
            P+P SYS ALLAY+ +EDG+GALNFF+E ++ Y +GE+ K+  G D E E +K E  TIR
Sbjct: 357  PSPVSYSQALLAYRRMEDGYGALNFFVEFREKYRQGEIGKDDDGEDWEKECLKLEKFTIR 416

Query: 1484 ICFWVMRQWLVKNENPCTNILKLLLEMDKVGLQAGRAEHERLVWACTHEEHHIVAKELYK 1663
            +C+ VMR WLV  +N   N+LK L++MD VG+   RA+ ERL WACT E+H+IV KELY 
Sbjct: 417  VCYQVMRCWLVSRDNLSKNVLKFLVDMDNVGIPLPRADLERLAWACTREDHYIVVKELYN 476

Query: 1664 RIRERETNISLSVCNHVIWLMGKAKKWWAALEIYEDLLDRGPKPNNMSYELIVSHFNILL 1843
            RIRER   ISLSVCNH IWLMGKAKKWWAALEIYEDLLD+GPKPNN+SYELIVSHFN LL
Sbjct: 477  RIRERYDKISLSVCNHAIWLMGKAKKWWAALEIYEDLLDKGPKPNNLSYELIVSHFNFLL 536

Query: 1844 TAARKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVSCSKAAETSAAVQIFKRMVERGEK 2023
            +AA+++GIWRWGV+LLNKME+KGLKPG REWNAVLV+CSKA+ET+AAVQIFKRMVE GEK
Sbjct: 537  SAAKRKGIWRWGVKLLNKMEDKGLKPGCREWNAVLVACSKASETTAAVQIFKRMVENGEK 596

Query: 2024 PTVISYGALLSALEKGKLYEEALQVWKHMVKVGVKPNLYAYTIMASVYTAQGKFNIVNSI 2203
            PT+ISYGALLSALEKGKLY++AL+VW HM+KVGV+PN YAYTIMAS++TAQG FN V++I
Sbjct: 597  PTIISYGALLSALEKGKLYDDALRVWNHMIKVGVEPNAYAYTIMASIHTAQGNFNRVDAI 656

Query: 2204 IKEMVTVGVEPTVVTFNAIISGCARNNLGNAAYEWFQRMKNHNISPNEVTYEMLIEALAS 2383
            I+EMVT+G+E TVVT+NAII+GCA N + + AYEWF RMK  NISPNE+TYEMLI ALA+
Sbjct: 657  IQEMVTLGIEVTVVTYNAIITGCAHNGMSSVAYEWFHRMKVQNISPNEITYEMLIVALAN 716

Query: 2384 DGKPRLVFELYLRAHNEGLSLSSKAYDLVICCSETYGATIDVNALGPRPPERKKKVQIRK 2563
            DGKPRL ++LY RA NEGL+LSSKAYD V+  S+   ATI++  LGPRP ++KKKVQIRK
Sbjct: 717  DGKPRLAYQLYTRAKNEGLTLSSKAYDAVVQSSQANNATIELGLLGPRPVDKKKKVQIRK 776

Query: 2564 NLSEFCNLADVPRRSKPFDEKEIDSVHTQVHD 2659
             L+EF NLA VP+RS+PFD  EI   H+Q  +
Sbjct: 777  TLNEFYNLAGVPKRSQPFDRNEI--YHSQTEE 806


>ref|XP_004308618.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like
            [Fragaria vesca subsp. vesca]
          Length = 657

 Score =  784 bits (2024), Expect = 0.0
 Identities = 392/646 (60%), Positives = 495/646 (76%), Gaps = 10/646 (1%)
 Frame = +2

Query: 641  GSLGAPCALS--------WVLEEA-IDSHIVNEGSDSLHDVTEESANQSLDYVKLEVVRN 793
            GSL   C L+        W LEE  I   +  E S S + +  E  ++ +          
Sbjct: 24   GSLATSCELNKENTFVSAWALEEQDIGDEVSVENSTSGNGLLAECGSREVGM-------- 75

Query: 794  TNLPSRSLDVDDDLKTEDGNNEEQMGIGKNARIDVRALALSLQFARTANDVEEVLKDKGE 973
                  S +VD     E GN EE     K+  +DVRALA  LQFA+TA+DVEEVLK+ G+
Sbjct: 76   ----EGSDEVDGRSGGEGGNWEE-----KSEVVDVRALASRLQFAKTADDVEEVLKEMGD 126

Query: 974  LPLQVYSSLIRGLGKEKKIDSAIALFEWLKRKSVDSNGAIRPNIFIYNSLLGAIKQSDKY 1153
            LPLQV+SS+IRG G++K +DSA A+ EWLKR+  ++NG + PN+FI+NSLLGA+KQ  ++
Sbjct: 127  LPLQVFSSMIRGFGRDKLMDSAFAVVEWLKRRGEETNGMVAPNLFIFNSLLGAVKQCKQF 186

Query: 1154 DFVDSVLNEMTIEGVHPNVITYNTLMAIYIEHGRGVEALRLFEEIPKKGLYPTPASYSTA 1333
              +D VL +MT EGV PN++TYNT MAIY+E G   +AL + EEI KKG+  +P +YSTA
Sbjct: 187  GEMDKVLADMTQEGVEPNIVTYNTKMAIYVEQGLSTKALDVLEEIQKKGMIASPVTYSTA 246

Query: 1334 LLAYQSLEDGFGALNFFIEAKDNYLKGEMRKEVGGDLENEFVKFENLTIRICFWVMRQWL 1513
            L AYQ ++DG GAL FF+E ++ Y  G++      D E+EF+K E+ T R+C+ VMR WL
Sbjct: 247  LQAYQRMQDGIGALEFFVEFREKYRNGDICNVSEEDWESEFLKLESFTKRVCYQVMRWWL 306

Query: 1514 VKNENPCTNILKLLLEMDKVGLQAGRAEHERLVWACTHEEHHIVAKELYKRIRERETNIS 1693
            V +++   N+LKLL+ MD  G+  GRAEHERL+WACT E+H+ VAKELY RIRER + IS
Sbjct: 307  VMDDDLSINVLKLLVNMDNAGIPLGRAEHERLLWACTREDHYNVAKELYCRIRERHSEIS 366

Query: 1694 LSVCNHVIWLMGKAKKWWAALEIYEDLLDRGPKPNNMSYELIVSHFNILLTAARKRGIWR 1873
            LSVCNHVIW+MGKAKKWWAALEIYED+LD+GPKPNNMSYEL+VSHFN+LLTAARK+GIWR
Sbjct: 367  LSVCNHVIWVMGKAKKWWAALEIYEDMLDKGPKPNNMSYELVVSHFNVLLTAARKKGIWR 426

Query: 1874 WGVRLLNKMEEKGLKPGSREWNAVLVSCSKAAETSAAVQIFKRMVERGEKPTVISYGALL 2053
            WGVRLLNKMEEKGLKP S+EWNAVLV+CSKAAETSAAV+IF+RMVE+G+KPT++SYGALL
Sbjct: 427  WGVRLLNKMEEKGLKPRSKEWNAVLVACSKAAETSAAVKIFRRMVEQGQKPTILSYGALL 486

Query: 2054 SALEKGKLYEEALQVWKHMVKVGVKPNLYAYTIMASVYTAQGKFNIVNSIIKEMVTVGVE 2233
            SALEKGKLY+EA QVW+HM+KVGVKPNLYAYTIMASV++  GKFN+V +I++EMV+ G+E
Sbjct: 487  SALEKGKLYDEARQVWEHMIKVGVKPNLYAYTIMASVFSGHGKFNLVETILQEMVSSGIE 546

Query: 2234 PTVVTFNAIISGCARNNLGNA-AYEWFQRMKNHNISPNEVTYEMLIEALASDGKPRLVFE 2410
            PTVVT+NAIISGCARN+  +A AY+WF RMK +NI PN VTYEM+IEALA +GKPRL +E
Sbjct: 547  PTVVTYNAIISGCARNDSSSADAYDWFDRMKANNIPPNNVTYEMMIEALAKEGKPRLAYE 606

Query: 2411 LYLRAHNEGLSLSSKAYDLVICCSETYGATIDVNALGPRPPERKKK 2548
            LYLRA N+G+ LSSKAYD+++  S  +G + D+N LGPRPP   K+
Sbjct: 607  LYLRAQNQGIHLSSKAYDILVQSSIDFGDSFDLNLLGPRPPPHAKE 652


>gb|EPS70491.1| hypothetical protein M569_04265, partial [Genlisea aurea]
          Length = 557

 Score =  774 bits (1999), Expect = 0.0
 Identities = 376/553 (67%), Positives = 453/553 (81%)
 Frame = +2

Query: 887  RIDVRALALSLQFARTANDVEEVLKDKGELPLQVYSSLIRGLGKEKKIDSAIALFEWLKR 1066
            RIDVRALAL LQ A TA+DVE++LK K  LPLQVYS++IRGLGKEK+I SA+ALFEWL+R
Sbjct: 5    RIDVRALALKLQLATTADDVEQLLKGKENLPLQVYSTVIRGLGKEKRIQSAMALFEWLQR 64

Query: 1067 KSVDSNGAIRPNIFIYNSLLGAIKQSDKYDFVDSVLNEMTIEGVHPNVITYNTLMAIYIE 1246
            KS +S   ++ N+F+YNSLLGA+KQ++ +D V+ V+ +M  EGVHPNV+T+N LM I+IE
Sbjct: 65   KSKESGSKLKLNLFVYNSLLGAMKQAEAFDLVEEVMTKMGAEGVHPNVVTFNALMGIHIE 124

Query: 1247 HGRGVEALRLFEEIPKKGLYPTPASYSTALLAYQSLEDGFGALNFFIEAKDNYLKGEMRK 1426
             G  + AL LF E+   G+ P+PASYST L AY+ +E+G GA++FFIE ++ Y  G+M  
Sbjct: 125  QGNELRALELFREMLMMGISPSPASYSTVLNAYRRMENGSGAVSFFIETRNKYRNGDMAN 184

Query: 1427 EVGGDLENEFVKFENLTIRICFWVMRQWLVKNENPCTNILKLLLEMDKVGLQAGRAEHER 1606
            +   D E E  K EN T+RIC+ VMR+WLVK  N  T +LKLL EMD  GL       E+
Sbjct: 185  DDDEDWELEISKLENFTLRICYQVMRRWLVKRGNFSTEVLKLLKEMDNAGLNCDPENLEK 244

Query: 1607 LVWACTHEEHHIVAKELYKRIRERETNISLSVCNHVIWLMGKAKKWWAALEIYEDLLDRG 1786
            L+WACT E+H  VAKELY R+RE   +ISLSVCNH+IWLMGKAKKWWAALEIYE+LLD G
Sbjct: 245  LIWACTREDHCAVAKELYTRVREMGADISLSVCNHIIWLMGKAKKWWAALEIYEELLDTG 304

Query: 1787 PKPNNMSYELIVSHFNILLTAARKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVSCSKA 1966
            PKPNNMSYELIVSHFNILLTAARK+GIWRWGVRL+NKM+EKGLKPGSREWN+VLV+CSKA
Sbjct: 305  PKPNNMSYELIVSHFNILLTAARKKGIWRWGVRLINKMKEKGLKPGSREWNSVLVACSKA 364

Query: 1967 AETSAAVQIFKRMVERGEKPTVISYGALLSALEKGKLYEEALQVWKHMVKVGVKPNLYAY 2146
             ETS A++IFKRMVE G+KPT+ISYGALLSALEKGKLY+EA+QVWKHMVKVGV+ NLYAY
Sbjct: 365  GETSTAIEIFKRMVENGDKPTIISYGALLSALEKGKLYDEAIQVWKHMVKVGVEANLYAY 424

Query: 2147 TIMASVYTAQGKFNIVNSIIKEMVTVGVEPTVVTFNAIISGCARNNLGNAAYEWFQRMKN 2326
            TIMAS++ +QGK ++V+ II+EMV  GVEPTVVTFNA+ISG  +NNL +AAYEWF+RMK 
Sbjct: 425  TIMASIHASQGKIDLVDLIIREMVGAGVEPTVVTFNAVISGFVKNNLSSAAYEWFRRMKL 484

Query: 2327 HNISPNEVTYEMLIEALASDGKPRLVFELYLRAHNEGLSLSSKAYDLVICCSETYGATID 2506
             N++PNE+TYE LIEALA DGKPRL  EL+LRA NEGL LS+KAYD +I  S+ YGATID
Sbjct: 485  QNVTPNEITYETLIEALAKDGKPRLASELHLRAQNEGLMLSTKAYDAIIQSSDAYGATID 544

Query: 2507 VNALGPRPPERKK 2545
              ALGPRPPE KK
Sbjct: 545  YGALGPRPPEGKK 557


>ref|XP_006399946.1| hypothetical protein EUTSA_v10015672mg [Eutrema salsugineum]
            gi|557101036|gb|ESQ41399.1| hypothetical protein
            EUTSA_v10015672mg [Eutrema salsugineum]
          Length = 688

 Score =  766 bits (1978), Expect = 0.0
 Identities = 400/721 (55%), Positives = 506/721 (70%), Gaps = 15/721 (2%)
 Frame = +2

Query: 431  MQALSTWPSRNESWFVPQLDIELGSVSKLRKSGPQRKRLYLAGSRL--EIANYLPCFGNR 604
            MQALS WP +       +L+ EL   S    S   RKR Y         I+++L    NR
Sbjct: 1    MQALSIWPLKFGLLVGSRLEFEL-DCSCYVVSPKTRKRQYFVEQACFGSISSFLLVSSNR 59

Query: 605  TY------------FRCKLFTKFRGSLGAPCALSWVLEEA-IDSHIVNEGSDSLHDVTEE 745
             +            F C+      GS      + W  E+  +   +  E S S+      
Sbjct: 60   KFEGLAINPSTKVLFLCEPKKSLSGS---SVGVGWATEQRELGEEVSREDSSSV------ 110

Query: 746  SANQSLDYVKLEVVRNTNLPSRSLDVDDDLKTEDGNNEEQMGIGKNARIDVRALALSLQF 925
            +A+ S D+ K + V                           G   NAR+DVR LA SL+ 
Sbjct: 111  TASDS-DHSKSQAVTG-------------------------GEKTNARVDVRELAYSLRA 144

Query: 926  ARTANDVEEVLKDKGELPLQVYSSLIRGLGKEKKIDSAIALFEWLKRKSVDSNGAIRPNI 1105
            A+TA+DV+ VLK+KGELPLQVY ++IRG GK+K++  A+A+ +WLKRK ++S G I PN+
Sbjct: 145  AKTADDVDVVLKEKGELPLQVYCAMIRGFGKDKRLKPAMAVVDWLKRKKIESGGLIGPNL 204

Query: 1106 FIYNSLLGAIKQSDKYDFVDSVLNEMTIEGVHPNVITYNTLMAIYIEHGRGVEALRLFEE 1285
            FIYNSLLGA+K+S  +   + +L++M  EG+ PN++TYNTLM IY+E G   +AL + + 
Sbjct: 205  FIYNSLLGAMKESRGFGETEKILSDMEEEGIVPNIVTYNTLMVIYMEEGEFHKALGILDL 264

Query: 1286 IPKKGLYPTPASYSTALLAYQSLEDGFGALNFFIEAKDNYLKGEMRKEVGGDLENEFVKF 1465
            + +KG  P+P +YSTALL Y+ LEDG GAL FF E ++ Y K E+  +   D E EFVK 
Sbjct: 265  VKEKGFEPSPVTYSTALLVYRRLEDGMGALEFFAELREKYSKREIGNDADYDWEFEFVKL 324

Query: 1466 ENLTIRICFWVMRQWLVKNENPCTNILKLLLEMDKVGLQAGRAEHERLVWACTHEEHHIV 1645
            EN   RIC+ VMR+WLVK+EN  T +LKLL  MD  GL+  R EHERL+WACT EEH++V
Sbjct: 325  ENFIGRICYQVMRRWLVKDENLTTKMLKLLNAMDNAGLKPSREEHERLIWACTREEHYVV 384

Query: 1646 AKELYKRIRERETNISLSVCNHVIWLMGKAKKWWAALEIYEDLLDRGPKPNNMSYELIVS 1825
             KELYKRIRER   ISLSVCNH+IWLMGKAKKWWAALEIYEDLLD+GP+PNN+SYEL+VS
Sbjct: 385  GKELYKRIRERFPEISLSVCNHLIWLMGKAKKWWAALEIYEDLLDQGPEPNNLSYELVVS 444

Query: 1826 HFNILLTAARKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVSCSKAAETSAAVQIFKRM 2005
            HFNILL+AA +RGIWRWGVRLLNKME+KGLKP SR WNAVLV+CSKA+ET+AA+QIFK M
Sbjct: 445  HFNILLSAASRRGIWRWGVRLLNKMEDKGLKPQSRHWNAVLVACSKASETAAAIQIFKAM 504

Query: 2006 VERGEKPTVISYGALLSALEKGKLYEEALQVWKHMVKVGVKPNLYAYTIMASVYTAQGKF 2185
            VE GEKPTVISYGALLSALEKGKLY+EA +VW HM+KVG++PN++AYTIMASV T Q KF
Sbjct: 505  VENGEKPTVISYGALLSALEKGKLYDEAFRVWNHMIKVGIEPNVHAYTIMASVLTGQQKF 564

Query: 2186 NIVNSIIKEMVTVGVEPTVVTFNAIISGCARNNLGNAAYEWFQRMKNHNISPNEVTYEML 2365
            N++++++KEM + G+EP+VVT+NAIISGCARN L   AYEWF RM+  N+ PNE+TYEML
Sbjct: 565  NLLDTLLKEMSSKGIEPSVVTYNAIISGCARNELSGVAYEWFHRMRGENVEPNEITYEML 624

Query: 2366 IEALASDGKPRLVFELYLRAHNEGLSLSSKAYDLVICCSETYGATIDVNALGPRPPERKK 2545
            IEALA+D KPRL +EL+L+A NEGL LSSK YD V+  +E+YGATID+N LGPRP   KK
Sbjct: 625  IEALANDAKPRLAYELHLKAQNEGLKLSSKPYDAVVKSAESYGATIDLNLLGPRPVTPKK 684

Query: 2546 K 2548
            +
Sbjct: 685  E 685


>ref|XP_006292935.1| hypothetical protein CARUB_v10019206mg [Capsella rubella]
            gi|482561642|gb|EOA25833.1| hypothetical protein
            CARUB_v10019206mg [Capsella rubella]
          Length = 673

 Score =  763 bits (1970), Expect = 0.0
 Identities = 389/712 (54%), Positives = 505/712 (70%), Gaps = 7/712 (0%)
 Frame = +2

Query: 431  MQALSTWPSRNESWFVPQLDIEL-------GSVSKLRKSGPQRKRLYLAGSRLEIANYLP 589
            MQALS WP ++      +L+ EL        S ++ R S  ++       S + +++   
Sbjct: 1    MQALSFWPLKSGLLVGSRLEFELDCSCFVVSSKTRKRHSFVEQACFGSISSLVLVSSNRK 60

Query: 590  CFGNRTYFRCKLFTKFRGSLGAPCALSWVLEEAIDSHIVNEGSDSLHDVTEESANQSLDY 769
              G++  F C+    F   LG+   + W  E      +  E S      TE+S++ S+D+
Sbjct: 61   FEGSKFLFLCEPKRSF---LGSSVGVRWATE------LGEEVS------TEDSSSSSVDH 105

Query: 770  VKLEVVRNTNLPSRSLDVDDDLKTEDGNNEEQMGIGKNARIDVRALALSLQFARTANDVE 949
             + + V                           G   N+R++VR LA SL+ A+TA+DV+
Sbjct: 106  SEPQAVNG-------------------------GEKNNSRVNVRELAFSLRAAKTADDVD 140

Query: 950  EVLKDKGELPLQVYSSLIRGLGKEKKIDSAIALFEWLKRKSVDSNGAIRPNIFIYNSLLG 1129
             VLK+KGELPLQV+ ++I G GK+K+++ A+A+ +WLKRK  +S   I PN+FIYNSLLG
Sbjct: 141  AVLKEKGELPLQVFCAMISGFGKDKRLEPAVAVVDWLKRKKSESGSVIGPNLFIYNSLLG 200

Query: 1130 AIKQSDKYDFVDSVLNEMTIEGVHPNVITYNTLMAIYIEHGRGVEALRLFEEIPKKGLYP 1309
            A+KQ   +   + VL++M  EG+ PN++TYNTLM IY+E G  ++AL + + + +KG  P
Sbjct: 201  AMKQLSAFGEAEKVLSDMEEEGIVPNIVTYNTLMVIYMEEGEFLKALGILDLVKEKGFEP 260

Query: 1310 TPASYSTALLAYQSLEDGFGALNFFIEAKDNYLKGEMRKEVGGDLENEFVKFENLTIRIC 1489
             P +YSTALL Y+ +EDG GAL FF+E ++ Y K E+  +   D + EF K EN   RIC
Sbjct: 261  NPITYSTALLVYRRMEDGMGALEFFVELREKYSKREIGNDPDYDWKFEFFKLENFIGRIC 320

Query: 1490 FWVMRQWLVKNENPCTNILKLLLEMDKVGLQAGRAEHERLVWACTHEEHHIVAKELYKRI 1669
            + VMR+WLVKNEN  T +LKLL  MD  GL+  R EHERL+WACT EEH+IV KELYKRI
Sbjct: 321  YQVMRRWLVKNENWTTRVLKLLNAMDSAGLKPSREEHERLIWACTREEHYIVGKELYKRI 380

Query: 1670 RERETNISLSVCNHVIWLMGKAKKWWAALEIYEDLLDRGPKPNNMSYELIVSHFNILLTA 1849
            RER   ISLSVCNH+IWLMGKAKKWWAALEIYEDLLD GP+PNN+SYEL+VSHF+ILL+A
Sbjct: 381  RERFPEISLSVCNHLIWLMGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFSILLSA 440

Query: 1850 ARKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVSCSKAAETSAAVQIFKRMVERGEKPT 2029
            A +RGIWRWGVRLLNKME+K LKP SR WNAVLV+CSKA+ET+AA+QIFK MV+ GEKPT
Sbjct: 441  ASRRGIWRWGVRLLNKMEDKNLKPQSRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPT 500

Query: 2030 VISYGALLSALEKGKLYEEALQVWKHMVKVGVKPNLYAYTIMASVYTAQGKFNIVNSIIK 2209
            VISYGALLSALEKGKLY+EA +VW HMVKVG++PNLYAYT MASV T Q KFN++++++K
Sbjct: 501  VISYGALLSALEKGKLYDEAFRVWNHMVKVGIEPNLYAYTTMASVLTGQQKFNLLDTLLK 560

Query: 2210 EMVTVGVEPTVVTFNAIISGCARNNLGNAAYEWFQRMKNHNISPNEVTYEMLIEALASDG 2389
            EM + G+EP+VVT+NA+ISGCA+N L   AYEWF RMK+ N+ PNE+TYEMLIEALA+D 
Sbjct: 561  EMASKGIEPSVVTYNAVISGCAKNGLSGVAYEWFHRMKSENVEPNEITYEMLIEALANDA 620

Query: 2390 KPRLVFELYLRAHNEGLSLSSKAYDLVICCSETYGATIDVNALGPRPPERKK 2545
            KPRL +EL+L+A NEGL LSSK YD V+  +ETYGATID+N LGPRP  +K+
Sbjct: 621  KPRLAYELHLKAQNEGLKLSSKPYDAVVKSAETYGATIDLNLLGPRPDTKKR 672


>ref|NP_190245.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75206903|sp|Q9SNB7.1|PP264_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At3g46610 gi|6523064|emb|CAB62331.1| hypothetical protein
            [Arabidopsis thaliana] gi|332644660|gb|AEE78181.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 665

 Score =  762 bits (1967), Expect = 0.0
 Identities = 364/574 (63%), Positives = 460/574 (80%)
 Frame = +2

Query: 824  DDDLKTEDGNNEEQMGIGKNARIDVRALALSLQFARTANDVEEVLKDKGELPLQVYSSLI 1003
            ++++ TED ++    G   N R+DVR LA SL+ A+TA+DV+ VLKDKGELPLQV+ ++I
Sbjct: 95   EEEVSTEDLSSANG-GEKNNLRVDVRELAFSLRAAKTADDVDAVLKDKGELPLQVFCAMI 153

Query: 1004 RGLGKEKKIDSAIALFEWLKRKSVDSNGAIRPNIFIYNSLLGAIKQSDKYDFVDSVLNEM 1183
            +G GK+K++  A+A+ +WLKRK  +S G I PN+FIYNSLLGA++    +   + +L +M
Sbjct: 154  KGFGKDKRLKPAVAVVDWLKRKKSESGGVIGPNLFIYNSLLGAMRG---FGEAEKILKDM 210

Query: 1184 TIEGVHPNVITYNTLMAIYIEHGRGVEALRLFEEIPKKGLYPTPASYSTALLAYQSLEDG 1363
              EG+ PN++TYNTLM IY+E G  ++AL + +   +KG  P P +YSTALL Y+ +EDG
Sbjct: 211  EEEGIVPNIVTYNTLMVIYMEEGEFLKALGILDLTKEKGFEPNPITYSTALLVYRRMEDG 270

Query: 1364 FGALNFFIEAKDNYLKGEMRKEVGGDLENEFVKFENLTIRICFWVMRQWLVKNENPCTNI 1543
             GAL FF+E ++ Y K E+  +VG D E EFVK EN   RIC+ VMR+WLVK++N  T +
Sbjct: 271  MGALEFFVELREKYAKREIGNDVGYDWEFEFVKLENFIGRICYQVMRRWLVKDDNWTTRV 330

Query: 1544 LKLLLEMDKVGLQAGRAEHERLVWACTHEEHHIVAKELYKRIRERETNISLSVCNHVIWL 1723
            LKLL  MD  G++  R EHERL+WACT EEH+IV KELYKRIRER + ISLSVCNH+IWL
Sbjct: 331  LKLLNAMDSAGVRPSREEHERLIWACTREEHYIVGKELYKRIRERFSEISLSVCNHLIWL 390

Query: 1724 MGKAKKWWAALEIYEDLLDRGPKPNNMSYELIVSHFNILLTAARKRGIWRWGVRLLNKME 1903
            MGKAKKWWAALEIYEDLLD GP+PNN+SYEL+VSHFNILL+AA KRGIWRWGVRLLNKME
Sbjct: 391  MGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFNILLSAASKRGIWRWGVRLLNKME 450

Query: 1904 EKGLKPGSREWNAVLVSCSKAAETSAAVQIFKRMVERGEKPTVISYGALLSALEKGKLYE 2083
            +KGLKP  R WNAVLV+CSKA+ET+AA+QIFK MV+ GEKPTVISYGALLSALEKGKLY+
Sbjct: 451  DKGLKPQRRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSALEKGKLYD 510

Query: 2084 EALQVWKHMVKVGVKPNLYAYTIMASVYTAQGKFNIVNSIIKEMVTVGVEPTVVTFNAII 2263
            EA +VW HM+KVG++PNLYAYT MASV T Q KFN++++++KEM + G+EP+VVTFNA+I
Sbjct: 511  EAFRVWNHMIKVGIEPNLYAYTTMASVLTGQQKFNLLDTLLKEMASKGIEPSVVTFNAVI 570

Query: 2264 SGCARNNLGNAAYEWFQRMKNHNISPNEVTYEMLIEALASDGKPRLVFELYLRAHNEGLS 2443
            SGCARN L   AYEWF RMK+ N+ PNE+TYEMLIEALA+D KPRL +EL+++A NEGL 
Sbjct: 571  SGCARNGLSGVAYEWFHRMKSENVEPNEITYEMLIEALANDAKPRLAYELHVKAQNEGLK 630

Query: 2444 LSSKAYDLVICCSETYGATIDVNALGPRPPERKK 2545
            LSSK YD V+  +ETYGATID+N LGPRP ++ +
Sbjct: 631  LSSKPYDAVVKSAETYGATIDLNLLGPRPDKKNR 664


>ref|XP_002873660.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297319497|gb|EFH49919.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 674

 Score =  758 bits (1956), Expect = 0.0
 Identities = 390/707 (55%), Positives = 497/707 (70%), Gaps = 2/707 (0%)
 Frame = +2

Query: 431  MQALSTWPSRNESWFVPQLDIELGSVSKLRKSGPQRKRLYLAGSRLEIANYLPCFGNRTY 610
            MQALS WP                      KSG       L GSRLE      CF     
Sbjct: 1    MQALSIWPL---------------------KSG------LLVGSRLEFELDCSCFVVSHK 33

Query: 611  FRCKLFTKFRGSLGAPCALSWVLEEAIDSHIVNEGSDSLHDVTEESANQSLDYVKLEVVR 790
             R +  +  +G  G   +L  V        +    +  +  + E   N S   V      
Sbjct: 34   SRKRHCSAQQGCFGRISSLILVSSNRKFEGLAVNPTSKVLFLCEPKRNLSGSSV------ 87

Query: 791  NTNLPSRSLDVDDDLKTEDGNNEEQMGIGK--NARIDVRALALSLQFARTANDVEEVLKD 964
                 +   ++ +++ TED +  + +  G+  N+R+DVR LA SL+ A+TA+DV+ V+K+
Sbjct: 88   GVGWATEQRELGEEVSTEDSSYPQTVNGGEKTNSRVDVRELAYSLRAAKTADDVDIVIKE 147

Query: 965  KGELPLQVYSSLIRGLGKEKKIDSAIALFEWLKRKSVDSNGAIRPNIFIYNSLLGAIKQS 1144
             GELPLQVY ++IRG GK+K++  AIA+ +WL+RK  +S G I PN+FIYNSLLGA+KQS
Sbjct: 148  MGELPLQVYCAMIRGFGKDKRLKPAIAVVDWLRRKKSESGGVIGPNLFIYNSLLGAMKQS 207

Query: 1145 DKYDFVDSVLNEMTIEGVHPNVITYNTLMAIYIEHGRGVEALRLFEEIPKKGLYPTPASY 1324
               +  + +L++M  EG+ PN++TYNTLM IY+E G   +AL + + + +KG  P P +Y
Sbjct: 208  SVGE-AEKILSDMEEEGIVPNIVTYNTLMVIYMEKGEFHKALGILDLVKEKGFEPNPITY 266

Query: 1325 STALLAYQSLEDGFGALNFFIEAKDNYLKGEMRKEVGGDLENEFVKFENLTIRICFWVMR 1504
            STALL Y+ +EDG GAL FF+E ++ Y K E+  +   D E EFVK EN   RIC+ VMR
Sbjct: 267  STALLVYRRMEDGMGALEFFVELREKYSKREIGNDADYDWEFEFVKLENFIGRICYQVMR 326

Query: 1505 QWLVKNENPCTNILKLLLEMDKVGLQAGRAEHERLVWACTHEEHHIVAKELYKRIRERET 1684
            +WLVK+EN  T +LKLL  MD  G +  R EHERL+WACT EEH+IV KELYKRIRER  
Sbjct: 327  RWLVKDENWTTRVLKLLNAMDNAGPKPSREEHERLIWACTREEHYIVGKELYKRIRERFP 386

Query: 1685 NISLSVCNHVIWLMGKAKKWWAALEIYEDLLDRGPKPNNMSYELIVSHFNILLTAARKRG 1864
             ISLSVCNH+IWLMGKAKKWWAALEIYEDLLD GP+PNN+SYEL+VSHFNILL+AA +RG
Sbjct: 387  EISLSVCNHLIWLMGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFNILLSAASRRG 446

Query: 1865 IWRWGVRLLNKMEEKGLKPGSREWNAVLVSCSKAAETSAAVQIFKRMVERGEKPTVISYG 2044
            IWRWGVRLLNKME+KGLKP SR WNAVLV+CSKA+ET+AA+QIFK MV+ GEKPTVISYG
Sbjct: 447  IWRWGVRLLNKMEDKGLKPQSRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYG 506

Query: 2045 ALLSALEKGKLYEEALQVWKHMVKVGVKPNLYAYTIMASVYTAQGKFNIVNSIIKEMVTV 2224
            ALLSALEKGKLY+EA +VW HM+KVG++PNLYAYT MASV T Q KFN++++++KEM + 
Sbjct: 507  ALLSALEKGKLYDEAFRVWNHMIKVGIEPNLYAYTTMASVLTGQQKFNLLDTLLKEMASK 566

Query: 2225 GVEPTVVTFNAIISGCARNNLGNAAYEWFQRMKNHNISPNEVTYEMLIEALASDGKPRLV 2404
            G+EP+VVT+NA+ISGCARN L   AYEWF RM+   + PNE+TYEMLIEALA+D KPRL 
Sbjct: 567  GIEPSVVTYNAVISGCARNGLSGVAYEWFHRMRGEKVEPNEITYEMLIEALANDAKPRLA 626

Query: 2405 FELYLRAHNEGLSLSSKAYDLVICCSETYGATIDVNALGPRPPERKK 2545
            +EL+L+A N+GL LSSK YD V+  +ETYGATID+N LGPRP + K+
Sbjct: 627  YELHLKAQNDGLKLSSKPYDAVVKSAETYGATIDLNLLGPRPHKEKR 673


>ref|XP_006852234.1| hypothetical protein AMTR_s00049p00149530 [Amborella trichopoda]
            gi|548855838|gb|ERN13701.1| hypothetical protein
            AMTR_s00049p00149530 [Amborella trichopoda]
          Length = 754

 Score =  752 bits (1941), Expect = 0.0
 Identities = 382/686 (55%), Positives = 490/686 (71%), Gaps = 15/686 (2%)
 Frame = +2

Query: 620  KLFTKFRGSLGAPCALSWVLEE-------AIDSHIVNEGSDSLHDVTEESANQSLDYVKL 778
            K+   +  SL A   LSW LE+         ++ I N G +      E+   +    V  
Sbjct: 60   KVNLAYSSSLRAAFTLSWALEQNPLSNESEKETMIPNLGDEQF----EDQETERFVSVNS 115

Query: 779  EVVRNTNLPSRSLDVDDDLKTEDGNNE-------EQMGIGKNARIDVRALALSLQFARTA 937
            + +   N        D+D +  DG N        E+    +N R++V ALA+SLQFA  A
Sbjct: 116  KEINQNNKDFMVNCEDEDEREADGKNPSLVESEAEKASDIRNGRVNVHALAMSLQFAERA 175

Query: 938  NDVEEVLKDKGELPLQVYSSLIRGLGKEKKIDSAIALFEWLKRKSVDSNGAIRPNIFIYN 1117
            +DVEEVL D  +LP  VYSS+IRG G  +++  AIAL EWLKR    +NG    N++IYN
Sbjct: 176  DDVEEVLGDM-DLPPSVYSSMIRGFGMAERLKPAIALVEWLKRGKKSTNGGAILNLYIYN 234

Query: 1118 SLLGAIKQSDKYDFVDSVLNEMTIEGVHPNVITYNTLMAIYIEHGRGVEALRLFEEIPKK 1297
            SLLGA K S  Y+ V  ++ +M  +G+ PN++T NTLM++Y+E G+  EA  +F EIP+ 
Sbjct: 235  SLLGAAKASHSYEKVGKIIEDMEKQGILPNIVTLNTLMSVYLEQGKTQEARDIFSEIPRN 294

Query: 1298 GLYPTPASYSTALLAYQSLEDGFGALNFFIEAKDNYLKGEMRKEVGGDLENEFVKFENLT 1477
            GL P+P +YST L  Y+ +ED  GAL FF+E+++ Y KGE+  +   D ENEF K EN T
Sbjct: 295  GLSPSPVTYSTVLQIYRKMEDAKGALEFFVESREKYKKGEIENDSCEDWENEFAKLENFT 354

Query: 1478 IRICFWVMRQWLVKNEN-PCTNILKLLLEMDKVGLQAGRAEHERLVWACTHEEHHIVAKE 1654
            IRIC+ VMR WLVK      T++LKLL+E+DK GL+ GRA +ERL+WACT+E H+IVAKE
Sbjct: 355  IRICYQVMRGWLVKGGGREATDVLKLLIELDKAGLKPGRAIYERLIWACTNEGHYIVAKE 414

Query: 1655 LYKRIRERETNISLSVCNHVIWLMGKAKKWWAALEIYEDLLDRGPKPNNMSYELIVSHFN 1834
            LY+RIRE  T ISLSVCNHVIWLMGKAKKWWA+LE+YE++LD+GPKPNN+SYEL+VS FN
Sbjct: 415  LYQRIRENNTEISLSVCNHVIWLMGKAKKWWASLEVYEEMLDKGPKPNNLSYELMVSQFN 474

Query: 1835 ILLTAARKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVSCSKAAETSAAVQIFKRMVER 2014
            ILL+AA +RGIW W +RLLNKM+EKG+KP +REWNA LV+CS+A+E +AAVQIF RMVE+
Sbjct: 475  ILLSAASRRGIWNWAIRLLNKMQEKGIKPRTREWNAALVACSRASEAAAAVQIFMRMVEQ 534

Query: 2015 GEKPTVISYGALLSALEKGKLYEEALQVWKHMVKVGVKPNLYAYTIMASVYTAQGKFNIV 2194
            GEKPT++SYGALLSALEKGKLY++A QVW+HM+KVGV+PNLYAYT M S+Y  QG+   V
Sbjct: 535  GEKPTILSYGALLSALEKGKLYDKAHQVWEHMIKVGVQPNLYAYTTMLSIYIKQGRLKAV 594

Query: 2195 NSIIKEMVTVGVEPTVVTFNAIISGCARNNLGNAAYEWFQRMKNHNISPNEVTYEMLIEA 2374
            + +I+EM ++G+EPTVVTFNAIISGCA   +G AA+EWF RMK  NI PNE+TYEMLIEA
Sbjct: 595  DIVIREMNSLGIEPTVVTFNAIISGCAYKGMGGAAFEWFHRMKAKNIEPNEITYEMLIEA 654

Query: 2375 LASDGKPRLVFELYLRAHNEGLSLSSKAYDLVICCSETYGATIDVNALGPRPPERKKKVQ 2554
            LA+DGKPRL +E+YLRA NE L LS KAYD V+  S  Y A+ID++ LGPRPPE+ KK  
Sbjct: 655  LANDGKPRLAYEVYLRARNEDLLLSPKAYDSVLRSSYQYKASIDMSRLGPRPPEKTKK-- 712

Query: 2555 IRKNLSEFCNLADVPRRSKPFDEKEI 2632
              K  +EFC L D+ RR KP D   +
Sbjct: 713  RTKVSAEFCRLPDMSRREKPLDSNAV 738


>gb|EAZ20176.1| hypothetical protein OsJ_35776 [Oryza sativa Japonica Group]
          Length = 642

 Score =  690 bits (1781), Expect = 0.0
 Identities = 343/605 (56%), Positives = 452/605 (74%), Gaps = 13/605 (2%)
 Frame = +2

Query: 869  GIGKNAR----IDVRALALSLQFARTANDVEEVLK---DKG-----ELPLQVYSSLIRGL 1012
            G G+ AR    +DV A+  +L+ ARTA++VE ++K   D G      LPLQVY+S+IRGL
Sbjct: 30   GGGRRARGGGDVDVAAVGAALRDARTADEVETLVKGFLDDGGGGEEHLPLQVYTSVIRGL 89

Query: 1013 GKEKKIDSAIALFEWLKRKSVDSNGAIRPNIFIYNSLLGAIKQSDKYDFVDSVLNEMTIE 1192
            GKE+++D+A A+ E LKR S    G    N F+YN LLGA+K S ++  +  VL +M  +
Sbjct: 90   GKERRLDAAFAVVEHLKRGSGSGGGGGGVNQFVYNCLLGAVKNSGEFGRIHDVLADMEAQ 149

Query: 1193 GVHPNVITYNTLMAIYIEHGRGVEALRLFEEIPKKGLYPTPASYSTALLAYQSLEDGFGA 1372
            GV PN++T+NTLM+IY+E G+  E  R+F+ I   GL PT A+YST + AY+   D F A
Sbjct: 150  GVPPNIVTFNTLMSIYVEQGKIDEVFRVFDTIEGSGLVPTAATYSTVMSAYKKAGDAFAA 209

Query: 1373 LNFFIEAKDNYLKGEMRKEVGGDLENEFVKFENLTIRICFWVMRQWLVKNENPCTNILKL 1552
            L F  + ++ Y KGE+      D + EFVKFE LT+R+C+  MR+ LV  ENP   +LK+
Sbjct: 210  LKFITKLREMYNKGELAVN-HEDWDREFVKFEKLTVRVCYMAMRRSLVGGENPVGEVLKV 268

Query: 1553 LLEMDKVGLQAGRAEHERLVWACTHEEHHIVAKELYKRIRERETN-ISLSVCNHVIWLMG 1729
            LL MD+ G++  R ++ERLVWACT EEH+ +AKELY+RIRER    ISLSVCNH+IWLMG
Sbjct: 269  LLGMDEAGVKPDRRDYERLVWACTGEEHYTIAKELYQRIRERGDGVISLSVCNHLIWLMG 328

Query: 1730 KAKKWWAALEIYEDLLDRGPKPNNMSYELIVSHFNILLTAARKRGIWRWGVRLLNKMEEK 1909
            KAKKWWAALEIYEDLLD+GPKPNN+SYELI+SHFNILL AA++RGIWRWGVRLL+KM++K
Sbjct: 329  KAKKWWAALEIYEDLLDKGPKPNNLSYELIMSHFNILLNAAKRRGIWRWGVRLLDKMQQK 388

Query: 1910 GLKPGSREWNAVLVSCSKAAETSAAVQIFKRMVERGEKPTVISYGALLSALEKGKLYEEA 2089
            GLKPGSREWNAVL++CS+AAETSAAV IFKRM+++G  P V+SYGALLSALEKGKLY+EA
Sbjct: 389  GLKPGSREWNAVLLACSRAAETSAAVDIFKRMIDQGLTPDVVSYGALLSALEKGKLYDEA 448

Query: 2090 LQVWKHMVKVGVKPNLYAYTIMASVYTAQGKFNIVNSIIKEMVTVGVEPTVVTFNAIISG 2269
            L+VW+HM KVGVKPNL+AYTI+ S+Y  +G   +V+S+++ M++  +EPTVVTFNAIIS 
Sbjct: 449  LRVWEHMCKVGVKPNLHAYTILVSIYIGKGNHAMVDSVLRGMLSAKIEPTVVTFNAIISA 508

Query: 2270 CARNNLGNAAYEWFQRMKNHNISPNEVTYEMLIEALASDGKPRLVFELYLRAHNEGLSLS 2449
            C RNN G +A+EWF RMK  NI PNE+TY+MLIEAL  DGKPRL +E+Y+RA N+GL L 
Sbjct: 509  CVRNNKGGSAFEWFHRMKVQNIEPNEITYQMLIEALVQDGKPRLAYEMYMRACNQGLELP 568

Query: 2450 SKAYDLVICCSETYGATIDVNALGPRPPERKKKVQIRKNLSEFCNLADVPRRSKPFDEKE 2629
            +K+YD V+   + YG+ ID+N+LGPRP ++ + ++I    S    + D+P  +K F    
Sbjct: 569  AKSYDTVMEACQDYGSLIDLNSLGPRPVKKVEPIRIENKFSSSYYVGDLPSSTKHFGSTG 628

Query: 2630 IDSVH 2644
              S++
Sbjct: 629  TSSLY 633


>ref|NP_001066581.1| Os12g0283900 [Oryza sativa Japonica Group]
            gi|113649088|dbj|BAF29600.1| Os12g0283900 [Oryza sativa
            Japonica Group]
          Length = 675

 Score =  690 bits (1781), Expect = 0.0
 Identities = 343/605 (56%), Positives = 452/605 (74%), Gaps = 13/605 (2%)
 Frame = +2

Query: 869  GIGKNAR----IDVRALALSLQFARTANDVEEVLK---DKG-----ELPLQVYSSLIRGL 1012
            G G+ AR    +DV A+  +L+ ARTA++VE ++K   D G      LPLQVY+S+IRGL
Sbjct: 63   GGGRRARGGGDVDVAAVGAALRDARTADEVETLVKGFLDDGGGGEEHLPLQVYTSVIRGL 122

Query: 1013 GKEKKIDSAIALFEWLKRKSVDSNGAIRPNIFIYNSLLGAIKQSDKYDFVDSVLNEMTIE 1192
            GKE+++D+A A+ E LKR S    G    N F+YN LLGA+K S ++  +  VL +M  +
Sbjct: 123  GKERRLDAAFAVVEHLKRGSGSGGGGGGVNQFVYNCLLGAVKNSGEFGRIHDVLADMEAQ 182

Query: 1193 GVHPNVITYNTLMAIYIEHGRGVEALRLFEEIPKKGLYPTPASYSTALLAYQSLEDGFGA 1372
            GV PN++T+NTLM+IY+E G+  E  R+F+ I   GL PT A+YST + AY+   D F A
Sbjct: 183  GVPPNIVTFNTLMSIYVEQGKIDEVFRVFDTIEGSGLVPTAATYSTVMSAYKKAGDAFAA 242

Query: 1373 LNFFIEAKDNYLKGEMRKEVGGDLENEFVKFENLTIRICFWVMRQWLVKNENPCTNILKL 1552
            L F  + ++ Y KGE+      D + EFVKFE LT+R+C+  MR+ LV  ENP   +LK+
Sbjct: 243  LKFITKLREMYNKGELAVN-HEDWDREFVKFEKLTVRVCYMAMRRSLVGGENPVGEVLKV 301

Query: 1553 LLEMDKVGLQAGRAEHERLVWACTHEEHHIVAKELYKRIRERETN-ISLSVCNHVIWLMG 1729
            LL MD+ G++  R ++ERLVWACT EEH+ +AKELY+RIRER    ISLSVCNH+IWLMG
Sbjct: 302  LLGMDEAGVKPDRRDYERLVWACTGEEHYTIAKELYQRIRERGDGVISLSVCNHLIWLMG 361

Query: 1730 KAKKWWAALEIYEDLLDRGPKPNNMSYELIVSHFNILLTAARKRGIWRWGVRLLNKMEEK 1909
            KAKKWWAALEIYEDLLD+GPKPNN+SYELI+SHFNILL AA++RGIWRWGVRLL+KM++K
Sbjct: 362  KAKKWWAALEIYEDLLDKGPKPNNLSYELIMSHFNILLNAAKRRGIWRWGVRLLDKMQQK 421

Query: 1910 GLKPGSREWNAVLVSCSKAAETSAAVQIFKRMVERGEKPTVISYGALLSALEKGKLYEEA 2089
            GLKPGSREWNAVL++CS+AAETSAAV IFKRM+++G  P V+SYGALLSALEKGKLY+EA
Sbjct: 422  GLKPGSREWNAVLLACSRAAETSAAVDIFKRMIDQGLTPDVVSYGALLSALEKGKLYDEA 481

Query: 2090 LQVWKHMVKVGVKPNLYAYTIMASVYTAQGKFNIVNSIIKEMVTVGVEPTVVTFNAIISG 2269
            L+VW+HM KVGVKPNL+AYTI+ S+Y  +G   +V+S+++ M++  +EPTVVTFNAIIS 
Sbjct: 482  LRVWEHMCKVGVKPNLHAYTILVSIYIGKGNHAMVDSVLRGMLSAKIEPTVVTFNAIISA 541

Query: 2270 CARNNLGNAAYEWFQRMKNHNISPNEVTYEMLIEALASDGKPRLVFELYLRAHNEGLSLS 2449
            C RNN G +A+EWF RMK  NI PNE+TY+MLIEAL  DGKPRL +E+Y+RA N+GL L 
Sbjct: 542  CVRNNKGGSAFEWFHRMKVQNIEPNEITYQMLIEALVQDGKPRLAYEMYMRACNQGLELP 601

Query: 2450 SKAYDLVICCSETYGATIDVNALGPRPPERKKKVQIRKNLSEFCNLADVPRRSKPFDEKE 2629
            +K+YD V+   + YG+ ID+N+LGPRP ++ + ++I    S    + D+P  +K F    
Sbjct: 602  AKSYDTVMEACQDYGSLIDLNSLGPRPVKKVEPIRIENKFSSSYYVGDLPSSTKHFGSTG 661

Query: 2630 IDSVH 2644
              S++
Sbjct: 662  TSSLY 666


Top