BLASTX nr result

ID: Mentha22_contig00042866 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00042866
         (1635 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU23303.1| hypothetical protein MIMGU_mgv1a003335mg [Mimulus...   691   0.0  
ref|XP_004245945.1| PREDICTED: pentatricopeptide repeat-containi...   683   0.0  
ref|XP_006352928.1| PREDICTED: pentatricopeptide repeat-containi...   682   0.0  
ref|XP_007040736.1| Tetratricopeptide repeat-like superfamily pr...   661   0.0  
ref|XP_002275784.1| PREDICTED: pentatricopeptide repeat-containi...   657   0.0  
ref|XP_007210874.1| hypothetical protein PRUPE_ppa003110mg [Prun...   655   0.0  
ref|XP_006368339.1| pentatricopeptide repeat-containing family p...   648   0.0  
ref|XP_007158217.1| hypothetical protein PHAVU_002G134100g [Phas...   645   0.0  
ref|XP_004300183.1| PREDICTED: pentatricopeptide repeat-containi...   645   0.0  
gb|EXC26223.1| hypothetical protein L484_022794 [Morus notabilis]     644   0.0  
ref|XP_003533450.1| PREDICTED: pentatricopeptide repeat-containi...   642   0.0  
ref|XP_004512460.1| PREDICTED: pentatricopeptide repeat-containi...   636   e-180
ref|XP_006572946.1| PREDICTED: pentatricopeptide repeat-containi...   636   e-180
ref|XP_006572948.1| PREDICTED: pentatricopeptide repeat-containi...   634   e-179
ref|XP_006432677.1| hypothetical protein CICLE_v10000638mg [Citr...   634   e-179
emb|CAN70294.1| hypothetical protein VITISV_005974 [Vitis vinifera]   634   e-179
ref|XP_004136211.1| PREDICTED: pentatricopeptide repeat-containi...   631   e-178
ref|XP_003612704.1| Pentatricopeptide repeat-containing protein ...   617   e-174
ref|XP_006415279.1| hypothetical protein EUTSA_v10010030mg [Eutr...   597   e-168
ref|NP_174474.1| pentatricopeptide repeat-containing protein [Ar...   587   e-165

>gb|EYU23303.1| hypothetical protein MIMGU_mgv1a003335mg [Mimulus guttatus]
          Length = 592

 Score =  691 bits (1783), Expect = 0.0
 Identities = 342/551 (62%), Positives = 426/551 (77%), Gaps = 7/551 (1%)
 Frame = +2

Query: 2    EFLIQHQIHAKIPETEYSFYKEQECISLIKKCKNLEEFKQIHGQILKLGLLWSSFCASNL 181
            +F I    H K PE ++   KEQECISL+K C++++EFK++HG+ILKLGL WSSFCASNL
Sbjct: 12   QFSIPKDNHGKNPEIDFGV-KEQECISLVKTCRSMDEFKKVHGKILKLGLFWSSFCASNL 70

Query: 182  LATCALSQWGSMDYACSIFEEIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEIGVDA 361
            LATCALS+WGSMDYACSIF +++DP SF+ N MIRGY+KDM  E+A  TYL MLE GV+ 
Sbjct: 71   LATCALSEWGSMDYACSIFRQMDDPDSFEFNTMIRGYVKDMNSEEAFFTYLEMLEFGVEP 130

Query: 362  DNFTYPLLLKACAFLAASEEGMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDSCSVF 541
            DNFTYP LLKAC+ L+A  EG QIHGQ++K GFV DV VQNSLIN+              
Sbjct: 131  DNFTYPPLLKACSILSAFAEGAQIHGQIYKMGFVEDVMVQNSLINV-------------- 176

Query: 542  ERIEKKTVASWSSIISAHAKLGKWRECLNLFSHMLHEGQWRAXXXXXXXXLSACTHLGTL 721
             R++ KT+ASWS++I+AHA LG W+ECL LFS M  EG+WRA        LSACT LG L
Sbjct: 177  -RMDHKTIASWSALIAAHANLGMWKECLRLFSDMNWEGKWRAEESTLVSVLSACTRLGVL 235

Query: 722  DWGRSIHGYLLRNLSGLNVAVETAVIDMYIKCGSLEKGMSLFQEMA-NKNHKSYSVAISG 898
            D GR  HGYL+RNL+G NVAV+T+++DMY++ GSL+KGMSLF EM   KN KSYSV ISG
Sbjct: 236  DSGRCTHGYLIRNLTGFNVAVQTSLMDMYVRSGSLDKGMSLFLEMGEKKNRKSYSVVISG 295

Query: 899  LASHGRGEEALALFERMLLEGLKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHRIKP 1069
            LA+HG GEEAL +F+ ML  GLKPDD+ YVG LSAC+    VEEGKK F RM  EHR++P
Sbjct: 296  LATHGHGEEALKVFDEMLERGLKPDDVAYVGVLSACSHAGLVEEGKKYFDRMRIEHRVEP 355

Query: 1070 TIQHYGCMVDLMGRTGLLHEAYDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAAEDL 1249
            TIQH GCMVDLMGR GL+ EA + IK+M ++PN+V+WRSLLSSC++H+N+ELGE+AAE+L
Sbjct: 356  TIQHCGCMVDLMGRAGLIREALEFIKNMKIEPNEVIWRSLLSSCRVHQNVELGELAAENL 415

Query: 1250 VKLNSQNGGDYAMMCSIYAQAGRWDDVASSRVKMARLGLGQVAGSSAVEVKGKVHRFVSN 1429
             K+N++N GDY  +C+IYAQA RW++++ +RVKMA  GLGQ  GSS+VEVK KVH+FVS+
Sbjct: 416  FKMNTRNAGDYLNLCNIYAQARRWEEMSITRVKMASNGLGQEPGSSSVEVKRKVHKFVSS 475

Query: 1430 GVLNR---EVSEMMHQMEWQLRFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAFALV 1600
               +    E+ EM+HQMEWQL+FEGY AD S+VL  V EEEKR+RL  HSQK AIAF+L+
Sbjct: 476  DTSHSQCDEIYEMLHQMEWQLKFEGYSADTSQVLFDVSEEEKRQRLSSHSQKLAIAFSLI 535

Query: 1601 STCDGSVVRIV 1633
            +T +GS VRIV
Sbjct: 536  NTSEGSPVRIV 546


>ref|XP_004245945.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like
            [Solanum lycopersicum]
          Length = 605

 Score =  683 bits (1763), Expect = 0.0
 Identities = 337/549 (61%), Positives = 418/549 (76%), Gaps = 6/549 (1%)
 Frame = +2

Query: 5    FLIQHQIHAKIPETEYSFYKEQECISLIKKCKNLEEFKQIHGQILKLGLLWSSFCASNLL 184
            FLI  + HAK  E  +S  KEQE IS+IKKC N+ E KQ+HGQILKLG + SSFCA NLL
Sbjct: 12   FLIPKEYHAKAQELNFSL-KEQEWISMIKKCNNMRELKQVHGQILKLGFICSSFCAGNLL 70

Query: 185  ATCALSQWGSMDYACSIFEEIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEIGVDAD 364
            +TCALS+WGSMDYAC IF+EI+DP SF+ N +IRGY+KDM LE+ALL Y+HM+E  V+ D
Sbjct: 71   STCALSEWGSMDYACLIFDEIDDPGSFEYNTVIRGYVKDMNLEEALLWYVHMIEDEVEPD 130

Query: 365  NFTYPLLLKACAFLAASEEGMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDSCSVFE 544
            NF+YP LLK CA + A +EG QIHGQ+ K G   DVFVQNSLINMYGKCG +R SC VFE
Sbjct: 131  NFSYPTLLKVCARIRALKEGKQIHGQILKFGHEDDVFVQNSLINMYGKCGGVRQSCIVFE 190

Query: 545  RIEKKTVASWSSIISAHAKLGKWRECLNLFSHMLHEGQWRAXXXXXXXXLSACTHLGTLD 724
            +++++T+ASWS++I+A+A LG W ECL +F+ M  EG WRA        +SACTHL  LD
Sbjct: 191  QMDQRTIASWSALIAANANLGLWSECLRVFAEMNSEGCWRAEESTLVSVISACTHLNALD 250

Query: 725  WGRSIHGYLLRNLSGLNVAVETAVIDMYIKCGSLEKGMSLFQEMANKNHKSYSVAISGLA 904
            +G++ HGYLLRN++GLNV VET++IDMY+KCG LEKG+ LFQ MANKN  SYS  ISGLA
Sbjct: 251  FGKATHGYLLRNMTGLNVIVETSLIDMYVKCGCLEKGLFLFQRMANKNQMSYSAIISGLA 310

Query: 905  SHGRGEEALALFERMLLEGLKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHRIKPTI 1075
             HGRGEEAL ++  ML   ++PDD+VYVG LSAC+    VEEG K F RM  EHRI+PTI
Sbjct: 311  LHGRGEEALRIYHEMLKARIEPDDVVYVGVLSACSHAGLVEEGLKCFDRMRLEHRIEPTI 370

Query: 1076 QHYGCMVDLMGRTGLLHEAYDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAAEDLVK 1255
            QHYGCMVDL+GRTG L EA +LIK MPM+PNDV+WRSLLS+C++H+N+ELGEVAA++L  
Sbjct: 371  QHYGCMVDLLGRTGRLKEALELIKGMPMEPNDVLWRSLLSACRVHQNVELGEVAAKNLFM 430

Query: 1256 LNSQNGGDYAMMCSIYAQAGRWDDVASSRVKMARLGLGQVAGSSAVEVKGKVHRFVS--- 1426
            L S+N  DY M+C+IYAQA  W+ +++ R KM   G+ QV GS  VE   K+++FVS   
Sbjct: 431  LKSRNASDYVMLCNIYAQAKMWEKMSAIRTKMVNEGIIQVPGSCLVEADRKLYKFVSQDR 490

Query: 1427 NGVLNREVSEMMHQMEWQLRFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAFALVST 1606
            +   + EV +M+HQMEWQL+FEGY  D S VL  V EEEKR+RL  H QK AIAFAL+ T
Sbjct: 491  SHTCSDEVYDMIHQMEWQLKFEGYSPDTSLVLFDVDEEEKRQRLSTHCQKLAIAFALIKT 550

Query: 1607 CDGSVVRIV 1633
              GS +RIV
Sbjct: 551  SQGSPIRIV 559


>ref|XP_006352928.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like
            [Solanum tuberosum]
          Length = 605

 Score =  682 bits (1761), Expect = 0.0
 Identities = 338/549 (61%), Positives = 416/549 (75%), Gaps = 6/549 (1%)
 Frame = +2

Query: 5    FLIQHQIHAKIPETEYSFYKEQECISLIKKCKNLEEFKQIHGQILKLGLLWSSFCASNLL 184
            FLI  + HAK  E  +S  KEQE IS+IKKC ++ E KQ+HGQILKLG + SSFC+ NLL
Sbjct: 12   FLIPKEYHAKAQEFNFSL-KEQEWISMIKKCNSMRELKQVHGQILKLGFICSSFCSGNLL 70

Query: 185  ATCALSQWGSMDYACSIFEEIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEIGVDAD 364
            +TCALS+WGSMDYAC IF+EI+DP SF+ N +IRGY+KDM LE+ALL Y+HM+E  V+ D
Sbjct: 71   STCALSEWGSMDYACLIFDEIDDPRSFEYNTVIRGYVKDMNLEEALLWYVHMIEDEVEPD 130

Query: 365  NFTYPLLLKACAFLAASEEGMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDSCSVFE 544
            NF+YP LLK CA + A +EG QIHGQ+ K G   DVFVQNSLINMYGKCG +R SC VFE
Sbjct: 131  NFSYPTLLKVCARIRALKEGKQIHGQILKFGHEDDVFVQNSLINMYGKCGEVRQSCIVFE 190

Query: 545  RIEKKTVASWSSIISAHAKLGKWRECLNLFSHMLHEGQWRAXXXXXXXXLSACTHLGTLD 724
            +++++T+ASWS++I+A+A LG W ECL +F  M  EG WRA        +SACTHL  LD
Sbjct: 191  QMDQRTIASWSALIAANANLGLWSECLKVFGEMNSEGCWRAEESTLVSVISACTHLDALD 250

Query: 725  WGRSIHGYLLRNLSGLNVAVETAVIDMYIKCGSLEKGMSLFQEMANKNHKSYSVAISGLA 904
            +G++ HGYLLRN++GLNV VET++IDMY+KCG LEKG+ LFQ MANKN  SYS  ISGLA
Sbjct: 251  FGKATHGYLLRNMTGLNVIVETSLIDMYVKCGCLEKGLFLFQRMANKNQMSYSAIISGLA 310

Query: 905  SHGRGEEALALFERMLLEGLKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHRIKPTI 1075
             HGRGEEAL ++  ML E ++PDD+VYVG LSAC+    VEEG K F RM  EHRI+PTI
Sbjct: 311  LHGRGEEALRIYHEMLKERIEPDDVVYVGVLSACSHAGLVEEGLKCFDRMRLEHRIEPTI 370

Query: 1076 QHYGCMVDLMGRTGLLHEAYDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAAEDLVK 1255
            QHYGCMVDL+GR G L EA +LIK MPM+PNDV+WRSLLSSC++H+N+ELGEVAA++L  
Sbjct: 371  QHYGCMVDLLGRAGRLEEALELIKGMPMEPNDVLWRSLLSSCRVHQNVELGEVAAKNLFM 430

Query: 1256 LNSQNGGDYAMMCSIYAQAGRWDDVASSRVKMARLGLGQVAGSSAVEVKGKVHRFVS--- 1426
            L S+N  DY M+C+IYAQA  W+ +A  R KM   G+ QV GS  VE   K+++FVS   
Sbjct: 431  LKSRNASDYVMLCNIYAQAKMWEKMAVIRTKMVNEGIIQVPGSCLVEADRKLYKFVSQDR 490

Query: 1427 NGVLNREVSEMMHQMEWQLRFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAFALVST 1606
            +   + EV EM+HQMEWQL+FEGY  D S VL  V EEEKR+RL  H QK AIAFAL+ T
Sbjct: 491  SHTCSDEVYEMIHQMEWQLKFEGYSPDTSLVLFDVDEEEKRQRLSTHCQKLAIAFALIKT 550

Query: 1607 CDGSVVRIV 1633
              GS +RIV
Sbjct: 551  SQGSPIRIV 559


>ref|XP_007040736.1| Tetratricopeptide repeat-like superfamily protein [Theobroma cacao]
            gi|508777981|gb|EOY25237.1| Tetratricopeptide repeat-like
            superfamily protein [Theobroma cacao]
          Length = 703

 Score =  661 bits (1706), Expect = 0.0
 Identities = 317/529 (59%), Positives = 404/529 (76%), Gaps = 6/529 (1%)
 Frame = +2

Query: 62   KEQECISLIKKCKNLEEFKQIHGQILKLGLLWSSFCASNLLATCALSQWGSMDYACSIFE 241
            KEQEC S++K+CKN+EEF+Q H QI+K G  W+SFCASNL+A CALS  GSMDYACSIF+
Sbjct: 128  KEQECFSILKRCKNMEEFRQAHAQIVKWGFFWNSFCASNLVAACALSDGGSMDYACSIFQ 187

Query: 242  EIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEIGVDADNFTYPLLLKACAFLAASEE 421
            +I++P +F+ N MIR ++KDM  E+AL+ Y  MLE GV+ DNFTYP L KACA L A EE
Sbjct: 188  QIDEPGTFEFNTMIRAHVKDMTFEEALVFYYEMLEKGVEPDNFTYPALFKACACLQAQEE 247

Query: 422  GMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDSCSVFERIEKKTVASWSSIISAHAK 601
            G QIHG  FK G   D++VQNSLINMYGKCG +  SC++FE++++K+VASWS+II+AHA 
Sbjct: 248  GKQIHGHAFKLGLESDLYVQNSLINMYGKCGEIEHSCAIFEQMDQKSVASWSAIIAAHAS 307

Query: 602  LGKWRECLNLFSHMLHEGQWRAXXXXXXXXLSACTHLGTLDWGRSIHGYLLRNLSGLNVA 781
             GKW ECL +F +M  EG WR         LSACTHLG LD G+  HG LLRN+S LNV 
Sbjct: 308  FGKWYECLMMFGNMSSEGCWRPEESTLVTVLSACTHLGALDLGKCTHGSLLRNISELNVI 367

Query: 782  VETAVIDMYIKCGSLEKGMSLFQEMANKNHKSYSVAISGLASHGRGEEALALFERMLLEG 961
            V+T+++DMY+KCG LEKG+SLF++M N++  SY+V ISGLA HG GEEAL ++  ML +G
Sbjct: 368  VQTSLMDMYVKCGCLEKGLSLFRKMGNRSQMSYTVMISGLAMHGHGEEALRIYSEMLKDG 427

Query: 962  LKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHRIKPTIQHYGCMVDLMGRTGLLHEA 1132
            L PDD+VYVG LSAC+    V+EG + F RM  EH I PT+QHYGCMVDLMG+ G+++EA
Sbjct: 428  LDPDDVVYVGVLSACSHAGLVDEGFRCFDRMKSEHGITPTVQHYGCMVDLMGKAGMINEA 487

Query: 1133 YDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAAEDLVKLNSQNGGDYAMMCSIYAQA 1312
             + IKSMP+KPNDV WRSLLS+C++H NLE+GE+AA+ L +  SQN GDY ++ ++YA+A
Sbjct: 488  LEFIKSMPIKPNDVFWRSLLSACRVHCNLEIGEIAAKHLFQSKSQNPGDYVILSNMYARA 547

Query: 1313 GRWDDVASSRVKMARLGLGQVAGSSAVEVKGKVHRFVSNGVLNRE---VSEMMHQMEWQL 1483
             RW +VA  RV+MAR GL QV G S VEV  ++H+FVS    + +   V EM+HQMEWQL
Sbjct: 548  QRWQEVAKIRVEMARKGLHQVPGFSLVEVGRRIHKFVSQDTSHPQCVSVYEMIHQMEWQL 607

Query: 1484 RFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAFALVSTCDGSVVRI 1630
            +FEGY  D S+VL+ V EEEKR+RL+GHSQK AIAFAL+ T  GS +RI
Sbjct: 608  KFEGYSPDTSQVLLDVDEEEKRQRLKGHSQKLAIAFALIHTSQGSPIRI 656


>ref|XP_002275784.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Vitis vinifera] gi|297742017|emb|CBI33804.3| unnamed
            protein product [Vitis vinifera]
          Length = 605

 Score =  657 bits (1695), Expect = 0.0
 Identities = 329/554 (59%), Positives = 408/554 (73%), Gaps = 13/554 (2%)
 Frame = +2

Query: 11   IQHQIHAKI-----PETEYSFYK--EQECISLIKKCKNLEEFKQIHGQILKLGLLWSSFC 169
            + HQ H  +     P++    +K  E+EC+SL+KKC N+EEFKQ H +ILKLGL   SFC
Sbjct: 6    VLHQTHVLVSREDPPQSPELSFKLGEKECVSLLKKCSNMEEFKQSHARILKLGLFGDSFC 65

Query: 170  ASNLLATCALSQWGSMDYACSIFEEIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEI 349
            ASNL+ATCALS WGSMDYACSIF ++++  SF  N M+RG++KDM  E+AL+TY  M E 
Sbjct: 66   ASNLVATCALSDWGSMDYACSIFRQMDELGSFQFNTMMRGHVKDMNTEEALITYKEMAER 125

Query: 350  GVDADNFTYPLLLKACAFLAASEEGMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDS 529
            GV  DNFTYP LLKACA L A EEGMQ+H  + K G   DVFVQNSLI+MYGKCG +   
Sbjct: 126  GVKPDNFTYPTLLKACARLPAVEEGMQVHAHILKLGLENDVFVQNSLISMYGKCGEIGVC 185

Query: 530  CSVFERIEKKTVASWSSIISAHAKLGKWRECLNLFSHMLHEGQWRAXXXXXXXXLSACTH 709
            C+VFE++ +++VASWS++I+AHA LG W +CL L   M +EG WRA        LSACTH
Sbjct: 186  CAVFEQMNERSVASWSALITAHASLGMWSDCLRLLGDMSNEGYWRAEESILVSVLSACTH 245

Query: 710  LGTLDWGRSIHGYLLRNLSGLNVAVETAVIDMYIKCGSLEKGMSLFQEMANKNHKSYSVA 889
            LG LD GRS+HG+LLRN+SGLNV VET++I+MY+KCGSL KGM LFQ+MA KN  SYSV 
Sbjct: 246  LGALDLGRSVHGFLLRNVSGLNVIVETSLIEMYLKCGSLYKGMCLFQKMAKKNKLSYSVM 305

Query: 890  ISGLASHGRGEEALALFERMLLEGLKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHR 1060
            ISGLA HG G E L +F  ML +GL+PDDIVYVG L+AC+    V+EG + F RM  EH 
Sbjct: 306  ISGLAMHGYGREGLRIFTEMLEQGLEPDDIVYVGVLNACSHAGLVQEGLQCFNRMKLEHG 365

Query: 1061 IKPTIQHYGCMVDLMGRTGLLHEAYDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAA 1240
            I+PTIQHYGCMVDLMGR G + EA +LIKSMPM+PNDV+WRSLLS+ K+H NL+ GE+AA
Sbjct: 366  IEPTIQHYGCMVDLMGRAGKIDEALELIKSMPMEPNDVLWRSLLSASKVHNNLQAGEIAA 425

Query: 1241 EDLVKLNSQNGGDYAMMCSIYAQAGRWDDVASSRVKMARLGLGQVAGSSAVEVKGKVHRF 1420
            + L KL+SQ   DY ++ ++YAQA RW+DVA +R  M   GL Q  G S VEVK K+HRF
Sbjct: 426  KQLFKLDSQKASDYVVLSNMYAQAQRWEDVAKTRTNMFSKGLSQRPGFSLVEVKRKMHRF 485

Query: 1421 VSNGV---LNREVSEMMHQMEWQLRFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAF 1591
            VS       +  V EM++QMEWQL+FEGY  D ++VL  V EEEK++RL GHSQK AIA+
Sbjct: 486  VSQDAGHPQSESVYEMLYQMEWQLKFEGYSPDTTQVLCDVDEEEKKQRLSGHSQKLAIAY 545

Query: 1592 ALVSTCDGSVVRIV 1633
            AL+ T  GS +RIV
Sbjct: 546  ALIHTSQGSPIRIV 559


>ref|XP_007210874.1| hypothetical protein PRUPE_ppa003110mg [Prunus persica]
            gi|462406609|gb|EMJ12073.1| hypothetical protein
            PRUPE_ppa003110mg [Prunus persica]
          Length = 602

 Score =  655 bits (1689), Expect = 0.0
 Identities = 329/536 (61%), Positives = 406/536 (75%), Gaps = 4/536 (0%)
 Frame = +2

Query: 38   PETEYSFYKEQECISLIKKCKNLEEFKQIHGQILKLGLLWSSFCASNLLATCALSQWGSM 217
            PET  S  KEQE +SL+K+C+N+EE KQ+H  ILKLG    SFCA NL+AT ALS WGSM
Sbjct: 23   PETS-SRSKEQESLSLLKRCRNMEELKQVHAHILKLGHFCDSFCAGNLVATSALSAWGSM 81

Query: 218  DYACSIFEEIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEIGVDADNFTYPLLLKAC 397
            D+ACSIF++I +P +F CN MI+G++K M  ++ALL Y  MLE GV+ DNFTYP+LLKAC
Sbjct: 82   DHACSIFQQINEPGTFVCNTMIKGHVKAMNWDKALLLYCEMLETGVEPDNFTYPVLLKAC 141

Query: 398  AFLAASEEGMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDSCSVFERIEKKTVASWS 577
            A+L A EEGMQIHG + K G   DVFVQNSLI+MYGKCG L  SC+VFE++++K+VASWS
Sbjct: 142  AWLLAIEEGMQIHGHILKLGLENDVFVQNSLISMYGKCGELERSCTVFEQMDQKSVASWS 201

Query: 578  SIISAHAKLGKWRECLNLFSHMLHEGQWRAXXXXXXXXLSACTHLGTLDWGRSIHGYLLR 757
            +II+AHA LG W ECL LF  M  EG WRA        LSACTHLG LD GR  HG LLR
Sbjct: 202  AIIAAHANLGMWCECLMLFGDMRREG-WRAEESTLVSVLSACTHLGALDLGRCSHGSLLR 260

Query: 758  NLSGLNVAVETAVIDMYIKCGSLEKGMSLFQEMANKNHKSYSVAISGLASHGRGEEALAL 937
            N+S LNV V+T++IDMY+KCG LEKG+ LFQ+M  KN  SY+V ISGLA HG G +AL L
Sbjct: 261  NISALNVIVQTSLIDMYVKCGCLEKGLCLFQKMNKKNQLSYTVMISGLAVHGHGRKALEL 320

Query: 938  FERMLLEGLKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHRIKPTIQHYGCMVDLMG 1108
            F  ML EGL PD + ++G LSACT    V+EG + F RM  EH+I+PT+QHYGC+VDLMG
Sbjct: 321  FSAMLQEGLTPDAVAHLGVLSACTHAGLVDEGLRCFNRMKGEHKIQPTVQHYGCLVDLMG 380

Query: 1109 RTGLLHEAYDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAAEDLVKLNSQNGGDYAM 1288
            R G+L EA  LI SMP++PNDV+WRSLLS+C++HKNLE+GE+AA  L +LNSQN  DY +
Sbjct: 381  RAGMLKEALQLITSMPVRPNDVIWRSLLSACRVHKNLEIGEIAAHMLFQLNSQNPSDYVV 440

Query: 1289 MCSIYAQAGRWDDVASSRVKMARLGLGQVAGSSAVEVKGKVHRFVSNGVLNRE-VSEMMH 1465
            + ++YAQA RWD++A +R +MA  GL Q  G S VEVK +V++FVS      + V +M+H
Sbjct: 441  LSNMYAQAQRWDNMARTRTEMASKGLTQTPGISLVEVKRRVYKFVSQSHHQCDGVYKMVH 500

Query: 1466 QMEWQLRFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAFALVSTCDGSVVRIV 1633
            QMEWQLRFEGY AD S+VL+ V EEEKRERL+ HSQK AIAFAL+ T  GS +RIV
Sbjct: 501  QMEWQLRFEGYSADTSQVLLDVDEEEKRERLKYHSQKLAIAFALIHTSQGSPIRIV 556


>ref|XP_006368339.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550346246|gb|ERP64908.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 602

 Score =  648 bits (1672), Expect = 0.0
 Identities = 319/530 (60%), Positives = 398/530 (75%), Gaps = 7/530 (1%)
 Frame = +2

Query: 62   KEQECISLIKKCKNLEEFKQIHGQILKLGLLW-SSFCASNLLATCALSQWGSMDYACSIF 238
            KEQEC+SL+K+CKN+EEFKQ+H Q+LK    W +SFCASNL+ATCALS WGSMDYACSIF
Sbjct: 30   KEQECLSLMKRCKNMEEFKQVHAQVLK----WENSFCASNLVATCALSDWGSMDYACSIF 85

Query: 239  EEIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEIGVDADNFTYPLLLKACAFLAASE 418
             +I+ P +F+ N MIRGY+  M +E AL  Y  MLE GV++DNFTYP L KACA L + E
Sbjct: 86   RQIDQPGTFEFNTMIRGYVNVMNMENALFLYYEMLERGVESDNFTYPALFKACASLRSIE 145

Query: 419  EGMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDSCSVFERIEKKTVASWSSIISAHA 598
            EGMQIHG +FK G  GD+FVQNSLINMYGKCG +  SCSVFE ++++ VASWS+II+AHA
Sbjct: 146  EGMQIHGYIFKRGLEGDLFVQNSLINMYGKCGKIELSCSVFEHMDRRDVASWSAIIAAHA 205

Query: 599  KLGKWRECLNLFSHMLHEGQWRAXXXXXXXXLSACTHLGTLDWGRSIHGYLLRNLSGLNV 778
             LG W ECL++F  M  EG  R         LSACTHLG LD GR  H  LLRN+  +NV
Sbjct: 206  SLGMWSECLSVFGEMSREGSCRPEESILVSVLSACTHLGALDLGRCTHVTLLRNIREMNV 265

Query: 779  AVETAVIDMYIKCGSLEKGMSLFQEMANKNHKSYSVAISGLASHGRGEEALALFERMLLE 958
             V+T++IDMY+KCG +EKG+SLFQ M  KN  SYSV I+GLA HGRG EAL +F  ML E
Sbjct: 266  IVQTSLIDMYVKCGCIEKGLSLFQRMVKKNQLSYSVMITGLAMHGRGMEALQVFSDMLEE 325

Query: 959  GLKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHRIKPTIQHYGCMVDLMGRTGLLHE 1129
            GLKPDD+VY+G LSAC     V+EG + F RM  EH I+PTIQHYGC+V LMGR G+L+E
Sbjct: 326  GLKPDDVVYLGVLSACNHAGLVDEGLQCFNRMKLEHGIEPTIQHYGCIVHLMGRAGMLNE 385

Query: 1130 AYDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAAEDLVKLNSQNGGDYAMMCSIYAQ 1309
            A +LI+ MP+KPN+VVWR LLS+CK H NLE+GE+AA+ L +LNS N GDY ++ ++YA+
Sbjct: 386  ALELIRCMPIKPNEVVWRGLLSACKFHHNLEIGEIAAKSLGELNSSNPGDYVVLSNMYAR 445

Query: 1310 AGRWDDVASSRVKMARLGLGQVAGSSAVEVKGKVHRFVSNGVLN---REVSEMMHQMEWQ 1480
            A RW+DVA  R +MAR G  Q  G S VEV+ K+++FVS  + +   + + EM+HQMEWQ
Sbjct: 446  AKRWEDVAKIRTEMARKGFIQTPGFSLVEVERKIYKFVSQDMSHPQCKGIYEMIHQMEWQ 505

Query: 1481 LRFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAFALVSTCDGSVVRI 1630
            L+FEGY  D S+VL  V EEEKR+RL+ HSQK A+AFAL+ T  G+ +RI
Sbjct: 506  LKFEGYSPDTSQVLFDVDEEEKRQRLKAHSQKLAMAFALIHTSQGAPIRI 555


>ref|XP_007158217.1| hypothetical protein PHAVU_002G134100g [Phaseolus vulgaris]
            gi|561031632|gb|ESW30211.1| hypothetical protein
            PHAVU_002G134100g [Phaseolus vulgaris]
          Length = 605

 Score =  645 bits (1665), Expect = 0.0
 Identities = 320/530 (60%), Positives = 400/530 (75%), Gaps = 6/530 (1%)
 Frame = +2

Query: 59   YKEQECISLIKKCKNLEEFKQIHGQILKLGLLWSSFCASNLLATCALSQWGSMDYACSIF 238
            + EQ  +SL+K+CK++EEFKQ+H QILKLGL   SFC SNL+ATCALS+WGSM+YACSIF
Sbjct: 29   FNEQGWLSLLKRCKSMEEFKQVHAQILKLGLFLDSFCGSNLVATCALSRWGSMEYACSIF 88

Query: 239  EEIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEIGVDADNFTYPLLLKACAFLAASE 418
             +IE+P SF+ N MIRG + +M LE+ALL Y+ MLE G++ DNFTYP +LKAC+ L A +
Sbjct: 89   RQIEEPGSFEYNTMIRGNVNNMNLEKALLLYVEMLEKGIEHDNFTYPFVLKACSLLGALK 148

Query: 419  EGMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDSCSVFERIEKKTVASWSSIISAHA 598
            EG+QIHGQVFK G   D FVQN LI+MYGKCG +  +C++FE++++K+VASWSSII AHA
Sbjct: 149  EGVQIHGQVFKAGLEDDTFVQNGLISMYGKCGEINHACALFEQMDEKSVASWSSIIGAHA 208

Query: 599  KLGKWRECLNLFSHMLHEGQWRAXXXXXXXXLSACTHLGTLDWGRSIHGYLLRNLSGLNV 778
            ++  W++CL L   M  EG+ RA        LSACTHLG+ + GR IHG LLRN+S LNV
Sbjct: 209  RVELWQDCLMLLGDMSSEGRHRAEESILVTALSACTHLGSPNLGRCIHGILLRNISELNV 268

Query: 779  AVETAVIDMYIKCGSLEKGMSLFQEMANKNHKSYSVAISGLASHGRGEEALALFERMLLE 958
             V+T++IDMY+KCGSLEKG+ +FQ MA KN  SY+V ISGLA HGRG EAL +F  M+ E
Sbjct: 269  VVKTSLIDMYVKCGSLEKGLCVFQSMAVKNRYSYTVMISGLAFHGRGREALRVFSEMVEE 328

Query: 959  GLKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHRIKPTIQHYGCMVDLMGRTGLLHE 1129
            GL PDD+VYVG LSAC+    V EG + F  M   H+IKPTIQHYGCMVDLMGR G+L E
Sbjct: 329  GLAPDDVVYVGVLSACSHAGLVNEGLQCFNSMQLVHKIKPTIQHYGCMVDLMGRAGMLKE 388

Query: 1130 AYDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAAEDLVKLNSQNGGDYAMMCSIYAQ 1309
            A DLIK M +KPNDV+WRSLLS+CK+H NLE+GEVAAE++ KLN  N GDY ++ S+YA+
Sbjct: 389  ACDLIKGMQIKPNDVIWRSLLSACKVHLNLEIGEVAAENVFKLNQHNPGDYLVLASMYAR 448

Query: 1310 AGRWDDVASSRVKMARLGLGQVAGSSAVEVKGKVHRFVSNGVLNRE---VSEMMHQMEWQ 1480
            A +W DVA  R +MA   L Q  G S VE   KVH+FVS      +   + +M+HQMEWQ
Sbjct: 449  AQKWTDVARIRTEMAEKHLVQTPGFSLVEANRKVHKFVSQDKSQPQCDTIYDMIHQMEWQ 508

Query: 1481 LRFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAFALVSTCDGSVVRI 1630
            L+FEGY  D S+VL+ V EEEKR+RL+ HSQK AIAFAL+ T +GS VRI
Sbjct: 509  LKFEGYAPDTSQVLLDVDEEEKRQRLKYHSQKLAIAFALIQTSEGSPVRI 558


>ref|XP_004300183.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like
            [Fragaria vesca subsp. vesca]
          Length = 606

 Score =  645 bits (1664), Expect = 0.0
 Identities = 331/537 (61%), Positives = 401/537 (74%), Gaps = 8/537 (1%)
 Frame = +2

Query: 47   EYSF-YKEQECISLIKKCKNLEEFKQIHGQILKLGLLWSSFCASNLLATCALSQWGSMDY 223
            E SF  KEQE +SL+K+CKNLEEFKQ+H  ILKLG+   SF A NL+AT  LS WGSMDY
Sbjct: 25   ELSFRLKEQESLSLLKRCKNLEEFKQVHSHILKLGVSCDSFVAGNLVATNVLSAWGSMDY 84

Query: 224  ACSIFEEIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEIGVDADNFTYPLLLKACAF 403
            ACSIFE+IE+P SF CN MI+G++K +  +QALL Y  MLE GV  DNFTYP++LKACA+
Sbjct: 85   ACSIFEQIEEPGSFVCNTMIKGHVKALNWDQALLVYCEMLESGVRPDNFTYPIVLKACAW 144

Query: 404  LAASEEGMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDSCSVFERI-EKKTVASWSS 580
            L A EEG QIHG VFK G   DVFVQNSLI+MYGKCG ++ S SVFE++ ++K+VASWS+
Sbjct: 145  LVAIEEGKQIHGHVFKLGLENDVFVQNSLISMYGKCGKVQLSRSVFEQLMDQKSVASWSA 204

Query: 581  IISAHAKLGKWRECLNLFSHMLHEGQWRAXXXXXXXXLSACTHLGTLDWGRSIHGYLLRN 760
            IISAHA LG W ECL L+  M  EG  RA        LSACTHLG L+ GR  HGYLLRN
Sbjct: 205  IISAHASLGLWSECLKLYGDMRREG-LRAEESTLVSVLSACTHLGALNLGRCCHGYLLRN 263

Query: 761  LSGLNVAVETAVIDMYIKCGSLEKGMSLFQEMANKNHKSYSVAISGLASHGRGEEALALF 940
            +S LNV VET++IDMY+KCG LEKG+SLFQ+M  KN  SY+V I GLA HG G EAL L+
Sbjct: 264  ISALNVIVETSLIDMYVKCGCLEKGLSLFQKMIKKNRLSYTVVICGLAIHGHGREALELY 323

Query: 941  ERMLLEGLKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHRIKPTIQHYGCMVDLMGR 1111
              M  EGLKPDD V+V  LSAC     VEEG + FKRM  EH I+P I+HYGC+VDLMGR
Sbjct: 324  SEMFREGLKPDDAVHVSVLSACNHAGLVEEGLQCFKRMKYEHEIQPKIEHYGCLVDLMGR 383

Query: 1112 TGLLHEAYDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAAEDLVKLNSQNGGDYAMM 1291
             G L EA  LI SMP++PNDV+WRSLLS+ ++HKNL +GE+AAE L +LN  N  DY ++
Sbjct: 384  AGRLEEAMQLINSMPIRPNDVIWRSLLSASRVHKNLGIGEIAAEKLFQLNMHNPSDYVVL 443

Query: 1292 CSIYAQAGRWDDVASSRVKMARLGLGQVAGSSAVEVKGKVHRFVSNGVLN---REVSEMM 1462
             ++YAQA RWD+VA  R +MA  GL Q  GSS VEV+ +VH+FVS  + +   + + EM+
Sbjct: 444  SNLYAQAQRWDNVARIRTEMASKGLTQTPGSSLVEVRREVHKFVSQDMSHPQCKRIYEMI 503

Query: 1463 HQMEWQLRFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAFALVSTCDGSVVRIV 1633
            HQMEWQLRFEGY AD ++VL+ V EEE+RERL+ HSQK AIAFAL+ T  GS +RIV
Sbjct: 504  HQMEWQLRFEGYSADTTQVLLDVDEEERRERLKYHSQKLAIAFALIHTSQGSPIRIV 560


>gb|EXC26223.1| hypothetical protein L484_022794 [Morus notabilis]
          Length = 605

 Score =  644 bits (1661), Expect = 0.0
 Identities = 321/554 (57%), Positives = 410/554 (74%), Gaps = 13/554 (2%)
 Frame = +2

Query: 11   IQHQIHAKIPETE-------YSFYKEQECISLIKKCKNLEEFKQIHGQILKLGLLWSSFC 169
            + +Q H  +P  E       +   KEQEC+SL+K+CK++ E KQIH QILK+GLL  SFC
Sbjct: 6    VLNQTHLLLPAKEPIQSPEFHLSLKEQECLSLLKRCKSVRELKQIHVQILKIGLLGDSFC 65

Query: 170  ASNLLATCALSQWGSMDYACSIFEEIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEI 349
            A NL+ATCALS WGSMDYACSIF  +++P +F  N M+RG++KD    QAL+ Y  ML+ 
Sbjct: 66   AGNLVATCALSDWGSMDYACSIFRHVKEPQTFLFNTMMRGHVKDGNWGQALILYFDMLKS 125

Query: 350  GVDADNFTYPLLLKACAFLAASEEGMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDS 529
            GV+ DNFTYP+LLKACA L+A+EEGMQIHG   K G  GD+FVQNSLINMYGKCG +  +
Sbjct: 126  GVEPDNFTYPVLLKACARLSATEEGMQIHGHTSKLGLQGDLFVQNSLINMYGKCGKIELA 185

Query: 530  CSVFERIEKKTVASWSSIISAHAKLGKWRECLNLFSHMLHEGQWRAXXXXXXXXLSACTH 709
            C+VF+++++K+VASW +II+AHA LG W ECL LF  M  EG WRA        LSACTH
Sbjct: 186  CAVFDQMDQKSVASWGAIIAAHASLGMWWECLVLFGDMNREGCWRAEESTLVSVLSACTH 245

Query: 710  LGTLDWGRSIHGYLLRNLSGLNVAVETAVIDMYIKCGSLEKGMSLFQEMANKNHKSYSVA 889
            L   D GR  HG LLRN SG NV VET++IDMY+KCG LEKG+ LF  MA +N  S+SV 
Sbjct: 246  LRVFDMGRCTHGSLLRNFSGFNVIVETSLIDMYVKCGCLEKGLCLFHNMAKRNQLSFSVI 305

Query: 890  ISGLASHGRGEEALALFERMLLEGLKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHR 1060
            ISGLA HG G +AL +F +ML EGL PDD+VYVG LSAC+    V+EG + F RM  EH 
Sbjct: 306  ISGLAMHGHGRKALEVFSKMLEEGLLPDDVVYVGVLSACSHAGLVDEGLQCFNRMKFEHG 365

Query: 1061 IKPTIQHYGCMVDLMGRTGLLHEAYDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAA 1240
            I+PT+QHYGC+VDL+GR G +  A++LI+SMP++PNDV+WRSLLS+C+IH ++ELGE+AA
Sbjct: 366  IQPTVQHYGCLVDLLGRAGWVRAAFELIESMPIRPNDVIWRSLLSACRIHGDMELGEIAA 425

Query: 1241 EDLVKLNSQNGGDYAMMCSIYAQAGRWDDVASSRVKMARLGLGQVAGSSAVEVKGKVHRF 1420
             +L++ NS+N GDY ++ ++YA+A +WDD A  R +M   GL Q  G S VEV+ KV +F
Sbjct: 426  RNLMQSNSRNPGDYVVLSNMYAKAQKWDDFARVRTEMVSKGLVQTPGFSMVEVQRKVFKF 485

Query: 1421 VSNGVLNRE---VSEMMHQMEWQLRFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAF 1591
            VS+ + + +   V+EM+HQMEWQLRF+GY  D S+VL+ V EEEKRERL+ HSQK AIAF
Sbjct: 486  VSHDMSHPQCDGVNEMIHQMEWQLRFDGYVPDTSQVLLDVDEEEKRERLKYHSQKLAIAF 545

Query: 1592 ALVSTCDGSVVRIV 1633
            AL+ T  GS VRIV
Sbjct: 546  ALIHTSQGSPVRIV 559


>ref|XP_003533450.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like
            [Glycine max]
          Length = 604

 Score =  642 bits (1655), Expect = 0.0
 Identities = 316/524 (60%), Positives = 397/524 (75%), Gaps = 6/524 (1%)
 Frame = +2

Query: 77   ISLIKKCKNLEEFKQIHGQILKLGLLWSSFCASNLLATCALSQWGSMDYACSIFEEIEDP 256
            +SL+K+CK++EEFKQ+H  ILKLGL + SFC SNL+ATCALS+WGSM+YACSIF +IE+P
Sbjct: 34   LSLLKRCKSMEEFKQVHAHILKLGLFYDSFCGSNLVATCALSRWGSMEYACSIFRQIEEP 93

Query: 257  CSFDCNAMIRGYIKDMRLEQALLTYLHMLEIGVDADNFTYPLLLKACAFLAASEEGMQIH 436
             SF+ N MIRG +  M LE+ALL Y+ MLE G++ DNFTYP +LKAC+ L A +EG+QIH
Sbjct: 94   GSFEYNTMIRGNVNSMNLEEALLLYVEMLERGIEPDNFTYPFVLKACSLLGALKEGVQIH 153

Query: 437  GQVFKHGFVGDVFVQNSLINMYGKCGLLRDSCSVFERIEKKTVASWSSIISAHAKLGKWR 616
              VFK G  GDVFVQN LINMYGKCG +  +  VFE++++K+VASWSSII AHA +  W 
Sbjct: 154  AHVFKAGLEGDVFVQNGLINMYGKCGAIEHASVVFEQMDEKSVASWSSIIGAHASVEMWH 213

Query: 617  ECLNLFSHMLHEGQWRAXXXXXXXXLSACTHLGTLDWGRSIHGYLLRNLSGLNVAVETAV 796
            ECL L   M  EG+ RA        LSACTHLG+ ++GR IHG LLRN+S LNVAV+T++
Sbjct: 214  ECLMLLGDMSGEGRHRAEESILVSALSACTHLGSPNFGRCIHGILLRNISELNVAVKTSL 273

Query: 797  IDMYIKCGSLEKGMSLFQEMANKNHKSYSVAISGLASHGRGEEALALFERMLLEGLKPDD 976
            IDMY+K GSLEKG+ +FQ MA KN  SY+V I+GLA HGRG EAL++F  ML EGL PDD
Sbjct: 274  IDMYVKSGSLEKGLCVFQNMAQKNRYSYTVIITGLAIHGRGREALSVFSDMLEEGLAPDD 333

Query: 977  IVYVGTLSACTP---VEEGKKLFKRMWEEHRIKPTIQHYGCMVDLMGRTGLLHEAYDLIK 1147
            +VYVG LSAC+    V EG + F R+  EH+IKPTIQHYGCMVDLMGR G+L  AYDLIK
Sbjct: 334  VVYVGVLSACSHAGLVNEGLQCFNRLQFEHKIKPTIQHYGCMVDLMGRAGMLKGAYDLIK 393

Query: 1148 SMPMKPNDVVWRSLLSSCKIHKNLELGEVAAEDLVKLNSQNGGDYAMMCSIYAQAGRWDD 1327
            SMP+KPNDVVWRSLLS+CK+H NLE+GE+AAE++ KLN  N GDY ++ ++YA+A +W D
Sbjct: 394  SMPIKPNDVVWRSLLSACKVHHNLEIGEIAAENIFKLNQHNPGDYLVLANMYARAKKWAD 453

Query: 1328 VASSRVKMARLGLGQVAGSSAVEVKGKVHRFVSNGVLNRE---VSEMMHQMEWQLRFEGY 1498
            VA  R +MA   L Q  G S VE    V++FVS      +   + +M+ QMEWQL+FEGY
Sbjct: 454  VARIRTEMAEKHLVQTPGFSLVEANRNVYKFVSQDKSQPQCETIYDMIQQMEWQLKFEGY 513

Query: 1499 EADLSEVLIPVGEEEKRERLRGHSQKAAIAFALVSTCDGSVVRI 1630
              D+S+VL+ V E+EKR+RL+ HSQK AIAFAL+ T +GS +RI
Sbjct: 514  TPDMSQVLLDVDEDEKRQRLKHHSQKLAIAFALIQTSEGSRIRI 557


>ref|XP_004512460.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like
            [Cicer arietinum]
          Length = 606

 Score =  636 bits (1641), Expect = e-180
 Identities = 310/531 (58%), Positives = 403/531 (75%), Gaps = 7/531 (1%)
 Frame = +2

Query: 59   YKEQECISLIKKCKNLEEFKQIHGQILKLGLLWSSFCASNLLATCALSQWGSMDYACSIF 238
            + E+  + L+K+C N+EEFKQ+H   LK G+ + SFC SNL+ATCAL++WGSMDYACSIF
Sbjct: 29   FNEKGWLCLLKRCNNMEEFKQVHAYFLKCGIFFDSFCGSNLVATCALTKWGSMDYACSIF 88

Query: 239  EEIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEIGVDADNFTYPLLLKACAFLAASE 418
             +IE+PCSFD N MIRG + +M+L++ALL Y+ MLE G++ D FTYP +LKAC+ L A +
Sbjct: 89   TQIEEPCSFDYNTMIRGNVNNMKLDEALLLYVEMLERGIEPDKFTYPFVLKACSLLGALK 148

Query: 419  EGMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDSCSVFERIEKKTVASWSSIISAHA 598
            EG+QIHG V K G  GD+FV+NSLINMYGKCG ++D+C VF+++ +++VASWS+II AH 
Sbjct: 149  EGVQIHGHVLKTGLEGDLFVENSLINMYGKCGAIKDACDVFDKMGERSVASWSAIIGAHV 208

Query: 599  KLGKWRECLNLFSHML-HEGQWRAXXXXXXXXLSACTHLGTLDWGRSIHGYLLRNLSGLN 775
             +  W ECL L   M+  EG+ R         LSACTHLG+ + GR IHG LLRN+S LN
Sbjct: 209  CVEMWHECLVLLGDMMSSEGRCRPEESTLVSVLSACTHLGSYNLGRFIHGNLLRNISELN 268

Query: 776  VAVETAVIDMYIKCGSLEKGMSLFQEMANKNHKSYSVAISGLASHGRGEEALALFERMLL 955
            V V+T++IDMY+KCG LEKG+ +F+ M  KN  SY+V ISGLA HG G+EAL +F  M+ 
Sbjct: 269  VVVKTSLIDMYVKCGCLEKGLHVFRNMPEKNRYSYTVMISGLAVHGHGKEALEVFSEMVE 328

Query: 956  EGLKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHRIKPTIQHYGCMVDLMGRTGLLH 1126
            +GL+PDD+VYVG LSAC+    V+EG + FKRM  EH+IKPTIQHYGCMVDLMGR+G+L 
Sbjct: 329  QGLEPDDVVYVGVLSACSHAGLVDEGLQCFKRMQFEHKIKPTIQHYGCMVDLMGRSGMLK 388

Query: 1127 EAYDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAAEDLVKLNSQNGGDYAMMCSIYA 1306
            EAY+LIKSMP+KPNDVVWRSLLS+CK+H NLE+G++AA++L  LN  N GDY ++ ++YA
Sbjct: 389  EAYELIKSMPIKPNDVVWRSLLSACKVHLNLEIGQIAADNLFMLNPNNPGDYLVLANMYA 448

Query: 1307 QAGRWDDVASSRVKMARLGLGQVAGSSAVEVKGKVHRFVSNGVLNRE---VSEMMHQMEW 1477
            +  +WD+VA  R KMA   L Q  G S VE K KV++FVS    + +   V +M+HQMEW
Sbjct: 449  KVQKWDEVAKIRRKMADKHLVQTPGFSLVEAKRKVYKFVSLDKSSPQWNIVYDMIHQMEW 508

Query: 1478 QLRFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAFALVSTCDGSVVRI 1630
            QL+FEGY AD S+VL+ V EEEKRERL+ HSQK AIAFAL+ T +G  +RI
Sbjct: 509  QLKFEGYVADTSQVLLDVDEEEKRERLKCHSQKLAIAFALIHTSEGCPLRI 559


>ref|XP_006572946.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like
            [Glycine max]
          Length = 605

 Score =  636 bits (1641), Expect = e-180
 Identities = 314/530 (59%), Positives = 399/530 (75%), Gaps = 6/530 (1%)
 Frame = +2

Query: 59   YKEQECISLIKKCKNLEEFKQIHGQILKLGLLWSSFCASNLLATCALSQWGSMDYACSIF 238
            + EQ  +SL+K+CK++EEFKQ+H  ILKLGL + SFC SNL+A+CALS+WGSM+YACSIF
Sbjct: 29   FNEQGWLSLLKRCKSMEEFKQVHAHILKLGLFYDSFCGSNLVASCALSRWGSMEYACSIF 88

Query: 239  EEIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEIGVDADNFTYPLLLKACAFLAASE 418
             +IE+P SF+ N MIRG +  M LE+ALL Y+ MLE G++ DNFTYP +LKAC+ L A +
Sbjct: 89   SQIEEPGSFEYNTMIRGNVNSMDLEEALLLYVEMLERGIEPDNFTYPFVLKACSLLVALK 148

Query: 419  EGMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDSCSVFERIEKKTVASWSSIISAHA 598
            EG+QIH  VFK G   DVFVQN LI+MYGKCG +  +  VFE++++K+VASWSSII AHA
Sbjct: 149  EGVQIHAHVFKAGLEVDVFVQNGLISMYGKCGAIEHAGVVFEQMDEKSVASWSSIIGAHA 208

Query: 599  KLGKWRECLNLFSHMLHEGQWRAXXXXXXXXLSACTHLGTLDWGRSIHGYLLRNLSGLNV 778
             +  W ECL L   M  EG+ RA        LSACTHLG+ + GR IHG LLRN+S LNV
Sbjct: 209  SVEMWHECLMLLGDMSGEGRHRAEESILVSALSACTHLGSPNLGRCIHGILLRNISELNV 268

Query: 779  AVETAVIDMYIKCGSLEKGMSLFQEMANKNHKSYSVAISGLASHGRGEEALALFERMLLE 958
             V+T++IDMY+KCGSLEKG+ +FQ MA+KN  SY+V I+GLA HGRG EA+ +F  ML E
Sbjct: 269  VVKTSLIDMYVKCGSLEKGLCVFQNMAHKNRYSYTVMIAGLAIHGRGREAVRVFSDMLEE 328

Query: 959  GLKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHRIKPTIQHYGCMVDLMGRTGLLHE 1129
            GL PDD+VYVG LSAC+    V EG + F RM  EH IKPTIQHYGCMVDLMGR G+L E
Sbjct: 329  GLTPDDVVYVGVLSACSHAGLVNEGLQCFNRMQFEHMIKPTIQHYGCMVDLMGRAGMLKE 388

Query: 1130 AYDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAAEDLVKLNSQNGGDYAMMCSIYAQ 1309
            AYDLIKSMP+KPNDVVWRSLLS+CK+H NLE+GE+AAE++ +LN  N GDY ++ ++YA+
Sbjct: 389  AYDLIKSMPIKPNDVVWRSLLSACKVHHNLEIGEIAAENIFRLNKHNPGDYLVLANMYAR 448

Query: 1310 AGRWDDVASSRVKMARLGLGQVAGSSAVEVKGKVHRFVS---NGVLNREVSEMMHQMEWQ 1480
            A +W +VA  R +MA   L Q  G S VE    V++FVS   +  +   + +M+ QMEWQ
Sbjct: 449  AKKWANVARIRTEMAEKHLVQTPGFSLVEANRNVYKFVSQDKSQPICETIYDMIQQMEWQ 508

Query: 1481 LRFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAFALVSTCDGSVVRI 1630
            L+FEGY  D+S+VL+ V E+EKR+RL+ HSQK AIAFAL+ T +GS +RI
Sbjct: 509  LKFEGYTPDMSQVLLDVDEDEKRQRLKHHSQKLAIAFALIQTSEGSPIRI 558


>ref|XP_006572948.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like
            [Glycine max]
          Length = 605

 Score =  634 bits (1636), Expect = e-179
 Identities = 313/530 (59%), Positives = 397/530 (74%), Gaps = 6/530 (1%)
 Frame = +2

Query: 59   YKEQECISLIKKCKNLEEFKQIHGQILKLGLLWSSFCASNLLATCALSQWGSMDYACSIF 238
            + EQ  +SL+K+CK++EEFK++H  ILKLGL + SFC SNL+A+CALS+WGSM+YACSIF
Sbjct: 29   FNEQGWLSLLKRCKSMEEFKKVHAHILKLGLFYDSFCGSNLVASCALSRWGSMEYACSIF 88

Query: 239  EEIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEIGVDADNFTYPLLLKACAFLAASE 418
             +IE+P SF+ N MIRG +  M LE+ALL Y+ MLE G++ DNFTYP +LKAC+ L A +
Sbjct: 89   RQIEEPGSFEYNTMIRGNVNSMDLEEALLLYVEMLERGIEPDNFTYPFVLKACSLLVALK 148

Query: 419  EGMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDSCSVFERIEKKTVASWSSIISAHA 598
            EG+QIH  VF  G   DVFVQN LI+MYGKCG +  +  VFE++++K+VASWSSII AHA
Sbjct: 149  EGVQIHAHVFNAGLEVDVFVQNGLISMYGKCGAIEHAGVVFEQMDEKSVASWSSIIGAHA 208

Query: 599  KLGKWRECLNLFSHMLHEGQWRAXXXXXXXXLSACTHLGTLDWGRSIHGYLLRNLSGLNV 778
             +  W ECL L   M  EG+ RA        LSACTHLG+ + GR IHG LLRN+S LNV
Sbjct: 209  SVEMWHECLMLLGDMSREGRHRAEESILVSALSACTHLGSPNLGRCIHGILLRNISELNV 268

Query: 779  AVETAVIDMYIKCGSLEKGMSLFQEMANKNHKSYSVAISGLASHGRGEEALALFERMLLE 958
             V+T++IDMY+KCGSLEKG+ +FQ MA+KN  SY+V I+GLA HGRG EAL +F  ML E
Sbjct: 269  VVKTSLIDMYVKCGSLEKGLCVFQNMAHKNRYSYTVMIAGLAIHGRGREALRVFSDMLEE 328

Query: 959  GLKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHRIKPTIQHYGCMVDLMGRTGLLHE 1129
            GL PDD+VYVG LSAC+    V+EG + F RM  EH IKPTIQHYGCMVDLMGR G+L E
Sbjct: 329  GLTPDDVVYVGVLSACSHAGLVKEGFQCFNRMQFEHMIKPTIQHYGCMVDLMGRAGMLKE 388

Query: 1130 AYDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAAEDLVKLNSQNGGDYAMMCSIYAQ 1309
            AYDLIKSMP+KPNDVVWRSLLS+CK+H NLE+GE+AA+++ KLN  N GDY ++ ++YA+
Sbjct: 389  AYDLIKSMPIKPNDVVWRSLLSACKVHHNLEIGEIAADNIFKLNKHNPGDYLVLANMYAR 448

Query: 1310 AGRWDDVASSRVKMARLGLGQVAGSSAVEVKGKVHRFVSNGVLNRE---VSEMMHQMEWQ 1480
            A +W +VA  R +M    L Q  G S VE    V++FVS      +   + +M+ QMEWQ
Sbjct: 449  AQKWANVARIRTEMVEKNLVQTPGFSLVEANRNVYKFVSQDKSQPQCETIYDMIQQMEWQ 508

Query: 1481 LRFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAFALVSTCDGSVVRI 1630
            L+FEGY  D+S+VL+ V E+EKR+RL+ HSQK AIAFAL+ T +GS VRI
Sbjct: 509  LKFEGYTPDMSQVLLDVDEDEKRQRLKHHSQKLAIAFALIQTSEGSPVRI 558


>ref|XP_006432677.1| hypothetical protein CICLE_v10000638mg [Citrus clementina]
            gi|568834767|ref|XP_006471474.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At1g31920-like [Citrus sinensis]
            gi|557534799|gb|ESR45917.1| hypothetical protein
            CICLE_v10000638mg [Citrus clementina]
          Length = 605

 Score =  634 bits (1635), Expect = e-179
 Identities = 311/529 (58%), Positives = 400/529 (75%), Gaps = 6/529 (1%)
 Frame = +2

Query: 62   KEQECISLIKKCKNLEEFKQIHGQILKLGLLWSSFCASNLLATCALSQWGSMDYACSIFE 241
            KEQEC++++K CKNLEEFK++H  +LK G  W+ FCASNL+ATCALS WGSMDYACSIF 
Sbjct: 30   KEQECLTILKTCKNLEEFKKVHAHVLKWGFFWNPFCASNLVATCALSHWGSMDYACSIFR 89

Query: 242  EIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEIGVDADNFTYPLLLKACAFLAASEE 421
            +I++P +FD N++IRG++KD++ E+AL  Y  M E GV+ D+FT+P L KACA L A +E
Sbjct: 90   QIDEPGAFDFNSLIRGFVKDVKFEEALFLYNEMFERGVEPDHFTFPALFKACAKLQALKE 149

Query: 422  GMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDSCSVFERIEKKTVASWSSIISAHAK 601
            GMQIHG VFK GF  D+FVQNSLINMYGKC  +  + ++F+++++K+VASWS+II+AHA 
Sbjct: 150  GMQIHGHVFKLGFEYDLFVQNSLINMYGKCEKVEFASAIFKQMDQKSVASWSAIIAAHAS 209

Query: 602  LGKWRECLNLFSHMLHEGQWRAXXXXXXXXLSACTHLGTLDWGRSIHGYLLRNLSGLNVA 781
             G W ECL LF  M  E  WR         LSACTHLG LD G+  HG L+RN+S LNV 
Sbjct: 210  NGLWSECLKLFGEMNSEKCWRPEESILVSVLSACTHLGALDLGKCTHGSLIRNISALNVI 269

Query: 782  VETAVIDMYIKCGSLEKGMSLFQEMANKNHKSYSVAISGLASHGRGEEALALFERMLLEG 961
            VET++IDMY+KCG LEKG+ LF+ MA K+  + SV ISGLA HG+G+EAL++F  ML EG
Sbjct: 270  VETSLIDMYVKCGCLEKGLCLFRMMAEKSQLTDSVMISGLAMHGQGKEALSIFSEMLREG 329

Query: 962  LKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHRIKPTIQHYGCMVDLMGRTGLLHEA 1132
            L+PDD+VYVG LSAC+    V+EG   F RM  EHRI PT+QHYGC+VDLMGR G+L EA
Sbjct: 330  LEPDDVVYVGVLSACSHAGLVKEGLLCFDRMKLEHRIVPTVQHYGCVVDLMGRAGMLGEA 389

Query: 1133 YDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAAEDLVKLNSQNGGDYAMMCSIYAQA 1312
             +LI+SMP++ NDVVWRSLLS+ K+H NLE+GE AA++L ++NS +  DY ++ ++YA+A
Sbjct: 390  LELIQSMPIQQNDVVWRSLLSASKVHHNLEIGERAAKNLFQINSHHPSDYVVLSNMYARA 449

Query: 1313 GRWDDVASSRVKMARLGLGQVAGSSAVEVKGKVHRFVSNGVLN---REVSEMMHQMEWQL 1483
             RWDDVA  R +MA  GL Q  G S VEV  KV++FVS    +     + EM+HQMEWQL
Sbjct: 450  QRWDDVAKIRTEMASKGLTQSPGFSLVEVARKVYKFVSQDRSHPTWDNIYEMIHQMEWQL 509

Query: 1484 RFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAFALVSTCDGSVVRI 1630
            +FEGY  D+S+VL+ V E+EKRERL+GHSQK AIAFAL+ T  GS +RI
Sbjct: 510  KFEGYSPDISQVLLDVDEDEKRERLKGHSQKLAIAFALIHTSQGSPIRI 558


>emb|CAN70294.1| hypothetical protein VITISV_005974 [Vitis vinifera]
          Length = 562

 Score =  634 bits (1634), Expect = e-179
 Identities = 315/516 (61%), Positives = 386/516 (74%), Gaps = 6/516 (1%)
 Frame = +2

Query: 104  LEEFKQIHGQILKLGLLWSSFCASNLLATCALSQWGSMDYACSIFEEIEDPCSFDCNAMI 283
            +EEFKQ H +ILK GL   SFCASNL+ATCALS WGSMDYACSIF ++++P SF+ N M+
Sbjct: 1    MEEFKQSHARILKXGLFXDSFCASNLVATCALSDWGSMDYACSIFRQMDEPGSFZFNTMM 60

Query: 284  RGYIKDMRLEQALLTYLHMLEIGVDADNFTYPLLLKACAFLAASEEGMQIHGQVFKHGFV 463
            RG++KDM  E+AL+TY  M E GV  DNFTYP LLKACA L A EEGMQ+H  + K G  
Sbjct: 61   RGHVKDMNTEEALITYKEMAERGVKPDNFTYPTLLKACARLPAVEEGMQVHAHILKLGLE 120

Query: 464  GDVFVQNSLINMYGKCGLLRDSCSVFERIEKKTVASWSSIISAHAKLGKWRECLNLFSHM 643
             DVFVQNSLI+MYGKCG +   C+VFE++ +++VASWS++I+AHA LG W +CL L   M
Sbjct: 121  NDVFVQNSLISMYGKCGEIGVCCAVFEQMNERSVASWSALITAHASLGMWSDCLRLLGDM 180

Query: 644  LHEGQWRAXXXXXXXXLSACTHLGTLDWGRSIHGYLLRNLSGLNVAVETAVIDMYIKCGS 823
             +EG WRA        LSACTHLG LD GRS+HG+LLRN+SGLNV VET++I+MY+KCG 
Sbjct: 181  SNEGYWRAEESILVSVLSACTHLGALDLGRSVHGFLLRNVSGLNVIVETSLIEMYLKCGX 240

Query: 824  LEKGMSLFQEMANKNHKSYSVAISGLASHGRGEEALALFERMLLEGLKPDDIVYVGTLSA 1003
            L KGM LFQ+MA KN  SYSV ISGLA HG G E L +F  ML +GL+PDDIVYVG L+A
Sbjct: 241  LYKGMCLFQKMAKKNKLSYSVMISGLAMHGYGREGLRIFTEMLEQGLEPDDIVYVGVLNA 300

Query: 1004 CTP---VEEGKKLFKRMWEEHRIKPTIQHYGCMVDLMGRTGLLHEAYDLIKSMPMKPNDV 1174
            C+    V+EG + F RM  EH I+PTIQHYGCMVDLMGR G + EA +LIKSMPM+PNDV
Sbjct: 301  CSHAGLVQEGLQCFNRMKLEHGIEPTIQHYGCMVDLMGRAGKIDEALELIKSMPMEPNDV 360

Query: 1175 VWRSLLSSCKIHKNLELGEVAAEDLVKLNSQNGGDYAMMCSIYAQAGRWDDVASSRVKMA 1354
            +WRSLLS+ K+H NL+ GE+AA+ L KL+SQ   DY ++ ++YAQA RW+DVA +R  M 
Sbjct: 361  LWRSLLSASKVHNNLQAGEIAAKQLFKLDSQKASDYVVLSNMYAQAQRWEDVARTRTNMF 420

Query: 1355 RLGLGQVAGSSAVEVKGKVHRFVSNGV---LNREVSEMMHQMEWQLRFEGYEADLSEVLI 1525
              GL Q  G S VEVK K+HRFVS       +  V EM++QMEWQL+FEGY  D ++VL 
Sbjct: 421  SKGLSQRPGFSLVEVKRKMHRFVSQDAGHPQSESVYEMLYQMEWQLKFEGYXPDTTQVLC 480

Query: 1526 PVGEEEKRERLRGHSQKAAIAFALVSTCDGSVVRIV 1633
             V EEEK++RL GHSQK AIA+AL+ T  GS VRIV
Sbjct: 481  DVDEEEKKQRLSGHSQKLAIAYALIHTSQGSPVRIV 516


>ref|XP_004136211.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like
            [Cucumis sativus] gi|449508034|ref|XP_004163198.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At1g31920-like [Cucumis sativus]
          Length = 606

 Score =  631 bits (1627), Expect = e-178
 Identities = 310/530 (58%), Positives = 393/530 (74%), Gaps = 6/530 (1%)
 Frame = +2

Query: 62   KEQECISLIKKCKNLEEFKQIHGQILKLGLLWSSFCASNLLATCALSQWGSMDYACSIFE 241
            KEQE + L+KKCK+LEEFKQ+H QILK GL   SFC+S++LATCALS W SMDYACSIF+
Sbjct: 31   KEQEYLCLVKKCKSLEEFKQVHVQILKFGLFLDSFCSSSVLATCALSDWNSMDYACSIFQ 90

Query: 242  EIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEIGVDADNFTYPLLLKACAFLAASEE 421
            ++++P +FD N MIRGY+ +M  E A+  Y  ML+  V+ DNFTYP++LKACA LA  +E
Sbjct: 91   QLDEPTTFDFNTMIRGYVNNMNFENAIYLYNDMLQREVEPDNFTYPVVLKACARLAVIQE 150

Query: 422  GMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDSCSVFERIEKKTVASWSSIISAHAK 601
            GMQIHG VFK G   DV+VQNSLINMYGKC  +  SC++F R+E+K+VASWS+II+AHA 
Sbjct: 151  GMQIHGHVFKLGLEDDVYVQNSLINMYGKCRDIEMSCAIFRRMEQKSVASWSAIIAAHAS 210

Query: 602  LGKWRECLNLFSHMLHEGQWRAXXXXXXXXLSACTHLGTLDWGRSIHGYLLRNLSGLNVA 781
            L  W ECL LF  M  EG WRA        LSACTHLG    GR  HG LL+N++ LNVA
Sbjct: 211  LAMWWECLALFEDMSREGCWRAEESILVNVLSACTHLGAFHLGRCAHGSLLKNITELNVA 270

Query: 782  VETAVIDMYIKCGSLEKGMSLFQEMANKNHKSYSVAISGLASHGRGEEALALFERMLLEG 961
            V T+++DMY+KCGSL+KG+ LFQ M  KN  SYSV ISGL  HG G +AL +F  M+ EG
Sbjct: 271  VMTSLMDMYVKCGSLQKGLCLFQNMTRKNQLSYSVIISGLGLHGYGRQALQIFSEMVEEG 330

Query: 962  LKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHRIKPTIQHYGCMVDLMGRTGLLHEA 1132
            L+PDD+ YV  LSAC+    V+EG  LF +M  E+RI+PT+QHYGCMVDL GR GLL EA
Sbjct: 331  LEPDDVTYVSVLSACSHSGLVDEGLDLFDKMKFEYRIEPTMQHYGCMVDLKGRAGLLEEA 390

Query: 1133 YDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAAEDLVKLNSQNGGDYAMMCSIYAQA 1312
            + L++SMP+K NDV+WRSLLS+CK+H NL+LGE+AAE+L +L+S N  DY ++ ++YA+A
Sbjct: 391  FQLVQSMPIKANDVLWRSLLSACKVHDNLKLGEIAAENLFRLSSHNPSDYLVLSNMYARA 450

Query: 1313 GRWDDVASSRVKMARLGLGQVAGSSAVEVKGKVHRFVSNG---VLNREVSEMMHQMEWQL 1483
             +W++ A  R KM   GL Q  G S VEVK KV++FVS       +  + +M+HQMEWQL
Sbjct: 451  QQWENAAKIRTKMINRGLIQTPGYSLVEVKSKVYKFVSQDKSYCKSGNIYKMIHQMEWQL 510

Query: 1484 RFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAFALVSTCDGSVVRIV 1633
            RFEGY  D S+V++ V EEEK ERL+GHSQK AIAFAL+ T  GS +RI+
Sbjct: 511  RFEGYMPDTSQVMLDVDEEEKGERLKGHSQKLAIAFALIHTSQGSAIRII 560


>ref|XP_003612704.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355514039|gb|AES95662.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 572

 Score =  617 bits (1592), Expect = e-174
 Identities = 298/516 (57%), Positives = 387/516 (75%), Gaps = 6/516 (1%)
 Frame = +2

Query: 101  NLEEFKQIHGQILKLGLLWSSFCASNLLATCALSQWGSMDYACSIFEEIEDPCSFDCNAM 280
            ++EEFKQ+H  +LK G+ + +FC SNL+ATCAL++WGSMDYACSIF +I++P SFD N M
Sbjct: 10   HMEEFKQVHAHVLKCGIFFDTFCMSNLVATCALTKWGSMDYACSIFTQIDEPSSFDYNTM 69

Query: 281  IRGYIKDMRLEQALLTYLHMLEIGVDADNFTYPLLLKACAFLAASEEGMQIHGQVFKHGF 460
            IRG + DM+LE+ALL Y+ M+E GV+ D FTYP +LKAC+ L   +EG+Q+HG VFK G 
Sbjct: 70   IRGNVNDMKLEEALLLYVDMIERGVEPDKFTYPFVLKACSLLGVVDEGIQVHGHVFKMGL 129

Query: 461  VGDVFVQNSLINMYGKCGLLRDSCSVFERIEKKTVASWSSIISAHAKLGKWRECLNLFSH 640
             GDV VQNSLINMYGKCG ++++C VF  +++K+VASWS+II AHA +  W ECL L   
Sbjct: 130  EGDVIVQNSLINMYGKCGEIKNACDVFNGMDEKSVASWSAIIGAHACVEMWNECLMLLGK 189

Query: 641  MLHEGQWRAXXXXXXXXLSACTHLGTLDWGRSIHGYLLRNLSGLNVAVETAVIDMYIKCG 820
            M  EG+ R         LSACTHLG+ D G+ IHG LLRN+S LNV V+T++IDMY+K G
Sbjct: 190  MSSEGRCRVEESTLVNVLSACTHLGSPDLGKCIHGILLRNISELNVVVKTSLIDMYVKSG 249

Query: 821  SLEKGMSLFQEMANKNHKSYSVAISGLASHGRGEEALALFERMLLEGLKPDDIVYVGTLS 1000
             LEKG+ +F+ M+ KN  SY+V ISGLA HGRG+EAL +F  M+ EGL PDD+VYVG  S
Sbjct: 250  CLEKGLRVFKNMSEKNRYSYTVMISGLAIHGRGKEALKVFSEMIEEGLAPDDVVYVGVFS 309

Query: 1001 ACTP---VEEGKKLFKRMWEEHRIKPTIQHYGCMVDLMGRTGLLHEAYDLIKSMPMKPND 1171
            AC+    VEEG + FK M  EH+I+PT+QHYGCMVDL+GR G+L EAY+LIKSM +KPND
Sbjct: 310  ACSHAGLVEEGLQCFKSMQFEHKIEPTVQHYGCMVDLLGRFGMLKEAYELIKSMSIKPND 369

Query: 1172 VVWRSLLSSCKIHKNLELGEVAAEDLVKLNSQNGGDYAMMCSIYAQAGRWDDVASSRVKM 1351
            V+WRSLLS+CK+H NLE+G++AAE+L  LN  N GDY ++ ++YA+A +WDDVA  R K+
Sbjct: 370  VIWRSLLSACKVHHNLEIGKIAAENLFMLNQNNSGDYLVLANMYAKAQKWDDVAKIRTKL 429

Query: 1352 ARLGLGQVAGSSAVEVKGKVHRFVSNGVLNRE---VSEMMHQMEWQLRFEGYEADLSEVL 1522
            A   L Q  G S +E K KV++FVS      +   + EM+HQMEWQL+FEGY  D S+VL
Sbjct: 430  AERNLVQTPGFSLIEAKRKVYKFVSQDKSIPQWNIIYEMIHQMEWQLKFEGYIPDTSQVL 489

Query: 1523 IPVGEEEKRERLRGHSQKAAIAFALVSTCDGSVVRI 1630
            + V +EEK+ERL+ HSQK AIAF L+ T +GS +RI
Sbjct: 490  LDVDDEEKKERLKFHSQKLAIAFGLIHTSEGSPLRI 525


>ref|XP_006415279.1| hypothetical protein EUTSA_v10010030mg [Eutrema salsugineum]
            gi|557093050|gb|ESQ33632.1| hypothetical protein
            EUTSA_v10010030mg [Eutrema salsugineum]
          Length = 607

 Score =  597 bits (1539), Expect = e-168
 Identities = 293/540 (54%), Positives = 397/540 (73%), Gaps = 9/540 (1%)
 Frame = +2

Query: 38   PETEYSFYKEQECISLIKKCKNLEEFKQIHGQILKLGLLWSS-FCASNLLATCALSQWG- 211
            PE      KEQEC+ ++K+CKN++EFKQ+H + +KL L  SS F ASN+L+TCA S W  
Sbjct: 21   PEVNNYRAKEQECLYILKRCKNIKEFKQVHARFIKLSLFCSSSFSASNVLSTCAHSGWDK 80

Query: 212  SMDYACSIFEEIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEIGVDADNFTYPLLLK 391
            SM+YA SIF  I+DPC+FD N MIRGY+ +   E+AL  Y+ M++ G++ DNFTYP LLK
Sbjct: 81   SMNYAASIFRAIDDPCTFDFNTMIRGYVNETGYEEALWFYVEMVKRGIEPDNFTYPCLLK 140

Query: 392  ACAFLAASEEGMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDSCSVFERIEKKTVAS 571
            AC  L + +EG QIHG VFK GF  DVFVQNSLINMYG+CG +  S +VFE++E KT AS
Sbjct: 141  ACTRLRSIQEGKQIHGHVFKLGFEVDVFVQNSLINMYGRCGEMELSSAVFEKLESKTAAS 200

Query: 572  WSSIISAHAKLGKWRECLNLFSHMLHEGQWRAXXXXXXXXLSACTHLGTLDWGRSIHGYL 751
            WSS++SA A +G W ECL LF  M  E   +A        LSAC +   L+ G SIHG+L
Sbjct: 201  WSSMVSARAGMGMWSECLMLFREMCRETNLKAEESGMVSALSACANTNALNLGMSIHGFL 260

Query: 752  LRNLSGLNVAVETAVIDMYIKCGSLEKGMSLFQEMANKNHKSYSVAISGLASHGRGEEAL 931
            LRN+S LN+AV+T+++DMY KCG LEK + +F++M ++N+ +YS  ISGLA HG GE AL
Sbjct: 261  LRNISELNIAVQTSLVDMYAKCGCLEKALYIFRKMESRNNLTYSAMISGLALHGEGEAAL 320

Query: 932  ALFERMLLEGLKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHRIKPTIQHYGCMVDL 1102
             +F  M+ EGL+ D +VYV  L+AC+    V+EG+++F+ M +E  ++PT +HYGC+VDL
Sbjct: 321  RMFSEMIEEGLESDHVVYVSVLNACSHSGLVKEGRRVFEEMLKEGTVEPTAEHYGCLVDL 380

Query: 1103 MGRTGLLHEAYDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAAEDLVKLNSQNGGDY 1282
            +GR GLL EA + I++MP++ NDVVWRS LSSC++H+N+ELG++AA +L+KL+S N GDY
Sbjct: 381  LGRAGLLEEALETIQTMPIEQNDVVWRSFLSSCRVHQNVELGQIAARELLKLSSHNSGDY 440

Query: 1283 AMMCSIYAQAGRWDDVASSRVKMARL-GLGQVAGSSAVEVKGKVHRFVSNGVLN---REV 1450
             ++ ++YAQA  W+DVA +R +MA + GL Q+ G S VEV GK HRFVS    +   +E+
Sbjct: 441  LVISNMYAQAQMWEDVARARTEMAAIKGLKQIPGFSTVEVDGKTHRFVSQDRFHPNCKEI 500

Query: 1451 SEMMHQMEWQLRFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAFALVSTCDGSVVRI 1630
             +M+HQMEWQL+FEGY  D +++L+ V EEEKRERL+GHSQK AIAFAL+ T  GS++RI
Sbjct: 501  YKMLHQMEWQLKFEGYSPDTTQILLNVDEEEKRERLKGHSQKVAIAFALLYTPPGSIIRI 560


>ref|NP_174474.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75169173|sp|Q9C6T2.1|PPR68_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g31920 gi|12321292|gb|AAG50713.1|AC079041_6 PPR-repeat
            protein, putative [Arabidopsis thaliana]
            gi|332193295|gb|AEE31416.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 606

 Score =  587 bits (1513), Expect = e-165
 Identities = 285/539 (52%), Positives = 391/539 (72%), Gaps = 8/539 (1%)
 Frame = +2

Query: 38   PETEYSFYKEQECISLIKKCKNLEEFKQIHGQILKLGLLWSS-FCASNLLATCALSQW-G 211
            PE      KEQEC+ L+K+C N++EFKQ+H + +KL L +SS F AS++LA CA S W  
Sbjct: 21   PEVNNFGGKEQECLYLLKRCHNIDEFKQVHARFIKLSLFYSSSFSASSVLAKCAHSGWEN 80

Query: 212  SMDYACSIFEEIEDPCSFDCNAMIRGYIKDMRLEQALLTYLHMLEIGVDADNFTYPLLLK 391
            SM+YA SIF  I+DPC+FD N MIRGY+  M  E+AL  Y  M++ G + DNFTYP LLK
Sbjct: 81   SMNYAASIFRGIDDPCTFDFNTMIRGYVNVMSFEEALCFYNEMMQRGNEPDNFTYPCLLK 140

Query: 392  ACAFLAASEEGMQIHGQVFKHGFVGDVFVQNSLINMYGKCGLLRDSCSVFERIEKKTVAS 571
            AC  L +  EG QIHGQVFK G   DVFVQNSLINMYG+CG +  S +VFE++E KT AS
Sbjct: 141  ACTRLKSIREGKQIHGQVFKLGLEADVFVQNSLINMYGRCGEMELSSAVFEKLESKTAAS 200

Query: 572  WSSIISAHAKLGKWRECLNLFSHMLHEGQWRAXXXXXXXXLSACTHLGTLDWGRSIHGYL 751
            WSS++SA A +G W ECL LF  M  E   +A        L AC + G L+ G SIHG+L
Sbjct: 201  WSSMVSARAGMGMWSECLLLFRGMCSETNLKAEESGMVSALLACANTGALNLGMSIHGFL 260

Query: 752  LRNLSGLNVAVETAVIDMYIKCGSLEKGMSLFQEMANKNHKSYSVAISGLASHGRGEEAL 931
            LRN+S LN+ V+T+++DMY+KCG L+K + +FQ+M  +N+ +YS  ISGLA HG GE AL
Sbjct: 261  LRNISELNIIVQTSLVDMYVKCGCLDKALHIFQKMEKRNNLTYSAMISGLALHGEGESAL 320

Query: 932  ALFERMLLEGLKPDDIVYVGTLSACTP---VEEGKKLFKRMWEEHRIKPTIQHYGCMVDL 1102
             +F +M+ EGL+PD +VYV  L+AC+    V+EG+++F  M +E +++PT +HYGC+VDL
Sbjct: 321  RMFSKMIKEGLEPDHVVYVSVLNACSHSGLVKEGRRVFAEMLKEGKVEPTAEHYGCLVDL 380

Query: 1103 MGRTGLLHEAYDLIKSMPMKPNDVVWRSLLSSCKIHKNLELGEVAAEDLVKLNSQNGGDY 1282
            +GR GLL EA + I+S+P++ NDV+WR+ LS C++ +N+ELG++AA++L+KL+S N GDY
Sbjct: 381  LGRAGLLEEALETIQSIPIEKNDVIWRTFLSQCRVRQNIELGQIAAQELLKLSSHNPGDY 440

Query: 1283 AMMCSIYAQAGRWDDVASSRVKMARLGLGQVAGSSAVEVKGKVHRFVSNGVLN---REVS 1453
             ++ ++Y+Q   WDDVA +R ++A  GL Q  G S VE+KGK HRFVS    +   +E+ 
Sbjct: 441  LLISNLYSQGQMWDDVARTRTEIAIKGLKQTPGFSIVELKGKTHRFVSQDRSHPKCKEIY 500

Query: 1454 EMMHQMEWQLRFEGYEADLSEVLIPVGEEEKRERLRGHSQKAAIAFALVSTCDGSVVRI 1630
            +M+HQMEWQL+FEGY  DL+++L+ V EEEK+ERL+GHSQK AIAF L+ T  GS+++I
Sbjct: 501  KMLHQMEWQLKFEGYSPDLTQILLNVDEEEKKERLKGHSQKVAIAFGLLYTPPGSIIKI 559

