BLASTX nr result

ID: Catharanthus22_contig00014034 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00014034
         (2358 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006348275.1| PREDICTED: pentatricopeptide repeat-containi...   741   0.0  
ref|XP_004244369.1| PREDICTED: pentatricopeptide repeat-containi...   734   0.0  
gb|EOY16117.1| Pentatricopeptide repeat (PPR) superfamily protei...   702   0.0  
ref|XP_002284293.1| PREDICTED: pentatricopeptide repeat-containi...   702   0.0  
ref|XP_006472673.1| PREDICTED: pentatricopeptide repeat-containi...   671   0.0  
ref|XP_004301943.1| PREDICTED: pentatricopeptide repeat-containi...   668   0.0  
ref|XP_002513855.1| pentatricopeptide repeat-containing protein,...   667   0.0  
gb|EXC06699.1| hypothetical protein L484_021537 [Morus notabilis]     659   0.0  
gb|EMJ23235.1| hypothetical protein PRUPE_ppa003044mg [Prunus pe...   653   0.0  
ref|XP_004138810.1| PREDICTED: pentatricopeptide repeat-containi...   645   0.0  
ref|XP_002867987.1| binding protein [Arabidopsis lyrata subsp. l...   622   e-175
ref|NP_193587.4| pentatricopeptide repeat-containing protein [Ar...   621   e-175
ref|XP_006283342.1| hypothetical protein CARUB_v10004383mg [Caps...   620   e-174
ref|XP_002307403.1| pentatricopeptide repeat-containing family p...   616   e-173
ref|XP_004505335.1| PREDICTED: pentatricopeptide repeat-containi...   611   e-172
ref|XP_006605526.1| PREDICTED: pentatricopeptide repeat-containi...   610   e-172
ref|XP_006605525.1| PREDICTED: pentatricopeptide repeat-containi...   610   e-172
gb|ESW06749.1| hypothetical protein PHAVU_010G073100g [Phaseolus...   597   e-168
gb|ESW29807.1| hypothetical protein PHAVU_002G100300g [Phaseolus...   595   e-167
gb|EAY91181.1| hypothetical protein OsI_12790 [Oryza sativa Indi...   590   e-165

>ref|XP_006348275.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18520-like
            [Solanum tuberosum]
          Length = 621

 Score =  741 bits (1913), Expect = 0.0
 Identities = 375/608 (61%), Positives = 463/608 (76%), Gaps = 4/608 (0%)
 Frame = +3

Query: 318  IQFPNFDVQL-PKALEYGTNFTKKSSCTN-TLNRNISCFGSHDPFPSVHYDNSVTSSIPS 491
            I FP   +Q+ P      T+  +   C + T +R + CF   DP  S  +D S +    +
Sbjct: 6    ICFPTTILQIHPSYSSDNTSRFRTHKCKSYTDDRKLPCFSFKDPSFSARFDVSASFCRDN 65

Query: 492  NNIDNHKKAE--IFDPSSVSNGSTLLAYLLQSCTDKEEVRIIHAIVLKISKESNIFVDNN 665
              +   +  +  I   +S    S+LL  LLQS +  +EV+I+HAIVLK  + S IFV+NN
Sbjct: 66   ELVSQLEVPDDKICSVNSFVADSSLLGRLLQSSSSLKEVKILHAIVLKCLRSSTIFVENN 125

Query: 666  LISGYARYGGIVEARKVFDEMLERNVVSWTAMLNAYLRFGYDNEAIRHFINFVESGIPWS 845
            LIS   ++G + +ARKVFD M E NVVSWTAMLN YLR+G+D+EA+  F  FV+ G+ W+
Sbjct: 126  LISVLVKFGRLDDARKVFDHMPENNVVSWTAMLNGYLRYGFDDEAMDFFAEFVQRGLLWN 185

Query: 846  SKTFVCMLNMCSKILDLELGKQVHTCMVKGGFSGLILNSALLYFYVQCGDLENAVSVFDH 1025
            SKT+VC+L+M  +  D +LGKQVH  ++KGGFS LIL+S+++ FY QCGDLE+A  VFD 
Sbjct: 186  SKTYVCVLSMAGRCCDFDLGKQVHAGVIKGGFSNLILDSSIVSFYAQCGDLESAFRVFDV 245

Query: 1026 IKERDVICWTIMISAYSQHRRGEEAFKTFSRMLSDRVEPNEFTICTVLDACGEEKELKLG 1205
            IK  DV+CWT MI+A SQH RG+EA   F ++ SD  + NEFT+C++L+ACGEE+ELK G
Sbjct: 246  IKRPDVVCWTTMITACSQHGRGKEALLMFLQLFSDGFDANEFTVCSILNACGEERELKFG 305

Query: 1206 RQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARN 1385
            +QLH A IK  +  DVFIGT+LVDMYAKC E +D+R VFD M  RNTVTWT+IIAGYARN
Sbjct: 306  KQLHAAVIKNRFRMDVFIGTSLVDMYAKCSEIDDARTVFDGMGKRNTVTWTSIIAGYARN 365

Query: 1386 GFGEEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIG 1565
            G  EEAI LFR+M+RRKI ANNLTMVSILRACGLL  LPTGKEVHAQI KN  Q NI++G
Sbjct: 366  GHAEEAIRLFRIMKRRKIFANNLTMVSILRACGLLRTLPTGKEVHAQIIKNSLQDNIYLG 425

Query: 1566 SALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVS 1745
            S LVWLYC+C + SAA KVL+ MP+RDVVSWTAMISGC +LGHE+EALEYLKEMLGEGV+
Sbjct: 426  STLVWLYCKCSENSAAHKVLQDMPIRDVVSWTAMISGCAHLGHEYEALEYLKEMLGEGVA 485

Query: 1746 PNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHLSEAIRV 1925
            PNPFTYSSALKACA+LEDI +GKLIHSSI KTPA S+VFVGSALI+MYAKCGHL EAI++
Sbjct: 486  PNPFTYSSALKACAKLEDIERGKLIHSSISKTPALSNVFVGSALINMYAKCGHLPEAIQI 545

Query: 1926 FDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVLSTVLTECGDFERD 2105
            FDNMPE+NLVS KAMI+AYAKNG C EALKL+YRMQ EGIE+DDY+L+TVLT CG+++  
Sbjct: 546  FDNMPEKNLVSWKAMIVAYAKNGSCGEALKLMYRMQVEGIEVDDYILATVLTACGEYKGT 605

Query: 2106 KSTKELAF 2129
              +K   F
Sbjct: 606  IKSKSKYF 613



 Score =  199 bits (506), Expect = 5e-48
 Identities = 107/325 (32%), Positives = 177/325 (54%)
 Frame = +3

Query: 1164 VLDACGEEKELKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRN 1343
            +L +    KE+K+   LH   +K +    +F+   L+ +  K G  +D+R VFD M   N
Sbjct: 94   LLQSSSSLKEVKI---LHAIVLKCLRSSTIFVENNLISVLVKFGRLDDARKVFDHMPENN 150

Query: 1344 TVTWTTIIAGYARNGFGEEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHA 1523
             V+WT ++ GY R GF +EA++ F    +R +L N+ T V +L   G       GK+VHA
Sbjct: 151  VVSWTAMLNGYLRYGFDDEAMDFFAEFVQRGLLWNSKTYVCVLSMAGRCCDFDLGKQVHA 210

Query: 1524 QIFKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHE 1703
             + K    SN+ + S++V  Y +CGD  +A +V + +   DVV WT MI+ C+  G   E
Sbjct: 211  GVIKG-GFSNLILDSSIVSFYAQCGDLESAFRVFDVIKRPDVVCWTTMITACSQHGRGKE 269

Query: 1704 ALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIH 1883
            AL    ++  +G   N FT  S L AC +  +++ GK +H+++ K      VF+G++L+ 
Sbjct: 270  ALLMFLQLFSDGFDANEFTVCSILNACGEERELKFGKQLHAAVIKNRFRMDVFIGTSLVD 329

Query: 1884 MYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYV 2063
            MYAKC  + +A  VFD M ++N V+  ++I  YA+NG   EA++L   M+   I  ++  
Sbjct: 330  MYAKCSEIDDARTVFDGMGKRNTVTWTSIIAGYARNGHAEEAIRLFRIMKRRKIFANNLT 389

Query: 2064 LSTVLTECGDFERDKSTKELAFHLV 2138
            + ++L  CG      + KE+   ++
Sbjct: 390  MVSILRACGLLRTLPTGKEVHAQII 414


>ref|XP_004244369.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18520-like
            [Solanum lycopersicum]
          Length = 617

 Score =  734 bits (1894), Expect = 0.0
 Identities = 368/578 (63%), Positives = 447/578 (77%), Gaps = 2/578 (0%)
 Frame = +3

Query: 402  TLNRNISCFGSHDPFPSVHYDNSVTSSIPSNNIDNHKKA--EIFDPSSVSNGSTLLAYLL 575
            T +R + CF   DP  S  +D S +    +  +   +    +I   +S    S+LL  LL
Sbjct: 32   TDDRKLPCFSFKDPSSSARFDVSASFCRDNELVCQLEVPGDKICSVNSFVADSSLLGRLL 91

Query: 576  QSCTDKEEVRIIHAIVLKISKESNIFVDNNLISGYARYGGIVEARKVFDEMLERNVVSWT 755
            QS +  +EV+I+HAIVLK  + S IFV+NNLIS   ++G + +ARKVFD MLERNVVSWT
Sbjct: 92   QSSSSLKEVKILHAIVLKCLRSSTIFVENNLISVLVKFGRLDDARKVFDHMLERNVVSWT 151

Query: 756  AMLNAYLRFGYDNEAIRHFINFVESGIPWSSKTFVCMLNMCSKILDLELGKQVHTCMVKG 935
            AMLN YLR+G D+EA   F  FV  G+ W+SKT+VC+L+M  +    ELGKQVH  +VKG
Sbjct: 152  AMLNGYLRYGLDDEAFDFFAEFVRCGLLWNSKTYVCVLSMAGRCCYFELGKQVHAGVVKG 211

Query: 936  GFSGLILNSALLYFYVQCGDLENAVSVFDHIKERDVICWTIMISAYSQHRRGEEAFKTFS 1115
            G S LIL+S+++ FY QCGDL +A  VFD IK  DV+CWT MI+A SQH RG+EA   F 
Sbjct: 212  GLSNLILDSSVVSFYAQCGDLASAFRVFDVIKRPDVVCWTTMITACSQHGRGKEALLMFL 271

Query: 1116 RMLSDRVEPNEFTICTVLDACGEEKELKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCG 1295
            ++ SD  + NEFT+C++L+ACGEE+ELK G+QLH A IK  +  DVFIGT+LVDMYAKC 
Sbjct: 272  QLFSDGFDANEFTVCSILNACGEERELKFGKQLHAAVIKNRFRMDVFIGTSLVDMYAKCS 331

Query: 1296 ETEDSRLVFDCMRNRNTVTWTTIIAGYARNGFGEEAINLFRVMQRRKILANNLTMVSILR 1475
            + +D+R VFD M  RNTVTWT+IIAGYARNG  EEAI LFR+M+RRKI ANNLTMVSILR
Sbjct: 332  KIDDARTVFDGMGKRNTVTWTSIIAGYARNGHAEEAIRLFRIMKRRKIFANNLTMVSILR 391

Query: 1476 ACGLLGALPTGKEVHAQIFKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVS 1655
            ACGLL ALPTGKEVHAQI KN  Q NI++GS LVWLYC+C + S A KVL+ MP+RDVVS
Sbjct: 392  ACGLLRALPTGKEVHAQIIKNSLQDNIYLGSTLVWLYCKCSENSTAHKVLQEMPIRDVVS 451

Query: 1656 WTAMISGCTNLGHEHEALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIH 1835
            WTAMISGC +LGHE+EALEYLKEMLGEGV+PNPFTYSSALKACA+LEDI +GKLIHSSI 
Sbjct: 452  WTAMISGCAHLGHEYEALEYLKEMLGEGVAPNPFTYSSALKACAKLEDIERGKLIHSSIS 511

Query: 1836 KTPASSSVFVGSALIHMYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALK 2015
            KTPA S+VFVGSALI+MYAKCGHL EAI++FDNMPE+NLVS KAMI+AYAKNG C EALK
Sbjct: 512  KTPALSNVFVGSALINMYAKCGHLPEAIQIFDNMPEKNLVSWKAMIVAYAKNGNCGEALK 571

Query: 2016 LVYRMQAEGIELDDYVLSTVLTECGDFERDKSTKELAF 2129
            L+YRMQ EGIE+DDY+L+TVLT CG+++    +K   F
Sbjct: 572  LMYRMQVEGIEVDDYILATVLTACGEYKETIKSKSKYF 609



 Score =  196 bits (499), Expect = 3e-47
 Identities = 107/325 (32%), Positives = 176/325 (54%)
 Frame = +3

Query: 1164 VLDACGEEKELKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRN 1343
            +L +    KE+K+   LH   +K +    +F+   L+ +  K G  +D+R VFD M  RN
Sbjct: 90   LLQSSSSLKEVKI---LHAIVLKCLRSSTIFVENNLISVLVKFGRLDDARKVFDHMLERN 146

Query: 1344 TVTWTTIIAGYARNGFGEEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHA 1523
             V+WT ++ GY R G  +EA + F    R  +L N+ T V +L   G       GK+VHA
Sbjct: 147  VVSWTAMLNGYLRYGLDDEAFDFFAEFVRCGLLWNSKTYVCVLSMAGRCCYFELGKQVHA 206

Query: 1524 QIFKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHE 1703
             + K    SN+ + S++V  Y +CGD ++A +V + +   DVV WT MI+ C+  G   E
Sbjct: 207  GVVKG-GLSNLILDSSVVSFYAQCGDLASAFRVFDVIKRPDVVCWTTMITACSQHGRGKE 265

Query: 1704 ALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIH 1883
            AL    ++  +G   N FT  S L AC +  +++ GK +H+++ K      VF+G++L+ 
Sbjct: 266  ALLMFLQLFSDGFDANEFTVCSILNACGEERELKFGKQLHAAVIKNRFRMDVFIGTSLVD 325

Query: 1884 MYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYV 2063
            MYAKC  + +A  VFD M ++N V+  ++I  YA+NG   EA++L   M+   I  ++  
Sbjct: 326  MYAKCSKIDDARTVFDGMGKRNTVTWTSIIAGYARNGHAEEAIRLFRIMKRRKIFANNLT 385

Query: 2064 LSTVLTECGDFERDKSTKELAFHLV 2138
            + ++L  CG      + KE+   ++
Sbjct: 386  MVSILRACGLLRALPTGKEVHAQII 410


>gb|EOY16117.1| Pentatricopeptide repeat (PPR) superfamily protein, putative
            [Theobroma cacao]
          Length = 620

 Score =  702 bits (1812), Expect = 0.0
 Identities = 358/615 (58%), Positives = 464/615 (75%), Gaps = 3/615 (0%)
 Frame = +3

Query: 273  MLSPLLPSVNITIFRIQFPNFDVQLPKALEYGTNFTKKSSCTNTLNR-NISCFGSHDPFP 449
            MLS  +    +T+F+ Q  N  +Q P      +N   +S  T+T N    SCF S D   
Sbjct: 1    MLSLAVTLPQVTLFQ-QPSNSTIQKPSFR--CSNSKTQSKTTSTKNPPQFSCFCSTDSCF 57

Query: 450  SVHYDNSVTSSIPSNNIDNHKKAEIFDPSSVSNG--STLLAYLLQSCTDKEEVRIIHAIV 623
               +D+S+  S+ +++ D         P SVS    S  LA LLQSC +  + R +HA+V
Sbjct: 58   LPEFDHSI--SVSASHEDPDAGFMDITPPSVSRSVDSDDLAALLQSCYNVRQARRVHAVV 115

Query: 624  LKISKESNIFVDNNLISGYARYGGIVEARKVFDEMLERNVVSWTAMLNAYLRFGYDNEAI 803
            LK  K    +V+NNLIS Y+R+G ++EARKVFD+M ERNVVSWTAM+N Y + G+D+EA+
Sbjct: 116  LKRLKNPGTYVENNLISVYSRFGKLMEARKVFDKMAERNVVSWTAMINGYSKLGFDDEAL 175

Query: 804  RHFINFVESGIPWSSKTFVCMLNMCSKILDLELGKQVHTCMVKGGFSGLILNSALLYFYV 983
            R F + + SG+  + K FVC++N+CS+ +D ELG+++H C++KG +  LI++SA++ FY 
Sbjct: 176  RLFADSISSGVRGNGKMFVCLMNLCSRRMDFELGRRIHGCILKGNWRNLIVDSAVVNFYA 235

Query: 984  QCGDLENAVSVFDHIKERDVICWTIMISAYSQHRRGEEAFKTFSRMLSDRVEPNEFTICT 1163
            QCG+L  A  VF  + ++DV+CWT +I+A +Q   G+EAF  FSRMLS+   PNEFT+C+
Sbjct: 236  QCGELSKAFRVFCWMGKKDVVCWTTIITACAQQGNGKEAFSMFSRMLSEGFWPNEFTVCS 295

Query: 1164 VLDACGEEKELKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRN 1343
            VL ACGEEK LK GRQLHGA IK ++  DVF+GT+LVDMYAKCGE  D+R+VF+ M +RN
Sbjct: 296  VLKACGEEKALKSGRQLHGAIIKKIFKNDVFVGTSLVDMYAKCGEISDARIVFNGMGSRN 355

Query: 1344 TVTWTTIIAGYARNGFGEEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHA 1523
            TVTWT+IIAGYAR G GE+AI+LFRVM+RR I+ANNLT+VS+LRACG +G L  G+EVHA
Sbjct: 356  TVTWTSIIAGYARKGLGEDAISLFRVMKRRNIIANNLTIVSVLRACGSVGYLLMGREVHA 415

Query: 1524 QIFKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHE 1703
            QI K   Q+NI+IGS LVW YC+CG+Y+ ASKVL+ MPLRDVVSWTAMISGC +LGHE E
Sbjct: 416  QIVKISIQTNIYIGSTLVWFYCKCGEYNIASKVLQQMPLRDVVSWTAMISGCASLGHEAE 475

Query: 1704 ALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIH 1883
            AL++LKEM+ EGV PN FTYSSALKACA+LE + QGKLIHS  +KTPA S+VFVGSALIH
Sbjct: 476  ALDFLKEMMEEGVEPNSFTYSSALKACAKLEAVSQGKLIHSFANKTPALSNVFVGSALIH 535

Query: 1884 MYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYV 2063
            MYAKCG +SEA +VFD+MPE+NLVS KAMII YA+NGLCREAL+L+YRM+AEG E+DDY+
Sbjct: 536  MYAKCGFVSEAFQVFDSMPERNLVSWKAMIIGYARNGLCREALQLMYRMEAEGFEVDDYI 595

Query: 2064 LSTVLTECGDFERDK 2108
            L+TVL+ CGD E D+
Sbjct: 596  LTTVLSACGDIEWDE 610



 Score =  118 bits (295), Expect = 1e-23
 Identities = 69/205 (33%), Positives = 113/205 (55%)
 Frame = +3

Query: 1509 KEVHAQIFKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNL 1688
            + VHA + K       ++ + L+ +Y R G    A KV + M  R+VVSWTAMI+G + L
Sbjct: 109  RRVHAVVLKRLKNPGTYVENNLISVYSRFGKLMEARKVFDKMAERNVVSWTAMINGYSKL 168

Query: 1689 GHEHEALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVG 1868
            G + EAL    + +  GV  N   +   +  C++  D   G+ IH  I K    + + V 
Sbjct: 169  GFDDEALRLFADSISSGVRGNGKMFVCLMNLCSRRMDFELGRRIHGCILKGNWRNLI-VD 227

Query: 1869 SALIHMYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIE 2048
            SA+++ YA+CG LS+A RVF  M ++++V    +I A A+ G  +EA  +  RM +EG  
Sbjct: 228  SAVVNFYAQCGELSKAFRVFCWMGKKDVVCWTTIITACAQQGNGKEAFSMFSRMLSEGFW 287

Query: 2049 LDDYVLSTVLTECGDFERDKSTKEL 2123
             +++ + +VL  CG+ +  KS ++L
Sbjct: 288  PNEFTVCSVLKACGEEKALKSGRQL 312


>ref|XP_002284293.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18520-like
            [Vitis vinifera]
          Length = 595

 Score =  702 bits (1812), Expect = 0.0
 Identities = 354/553 (64%), Positives = 430/553 (77%), Gaps = 2/553 (0%)
 Frame = +3

Query: 447  PSVHYDNSVTSSIPSNNIDNHKKAEIFDPSSVSN--GSTLLAYLLQSCTDKEEVRIIHAI 620
            PS+       S  P  N    K A   +   +     + LLA+ LQSC    EVR +HA+
Sbjct: 29   PSLFTIRRSQSPEPRKNSKTWKNAGFLNVQPIVGHVNANLLAFWLQSCCTVREVRRVHAV 88

Query: 621  VLKISKESNIFVDNNLISGYARYGGIVEARKVFDEMLERNVVSWTAMLNAYLRFGYDNEA 800
            V K    S  +V+NNLIS Y+R+G +VEARKVFD+M ERNVVSWTA++N Y R+G+D+EA
Sbjct: 89   VFKCLDNSVTYVNNNLISAYSRFGKLVEARKVFDKMPERNVVSWTAVVNGYSRYGFDDEA 148

Query: 801  IRHFINFVESGIPWSSKTFVCMLNMCSKILDLELGKQVHTCMVKGGFSGLILNSALLYFY 980
            +R F + +E+G+  + KTFVC+LN+CSK LD ELG+Q+H C+VK  +  LI++SAL+ FY
Sbjct: 149  LRLFDDCIENGVRANGKTFVCVLNLCSKRLDFELGRQIHACIVKDNWRNLIVDSALVCFY 208

Query: 981  VQCGDLENAVSVFDHIKERDVICWTIMISAYSQHRRGEEAFKTFSRMLSDRVEPNEFTIC 1160
             QCGDL  A   FD + ERDV+CWT MI+A SQ  RG EA   FS+M+ +   PNEFT+C
Sbjct: 209  AQCGDLSGAFHAFDQMPERDVVCWTTMITACSQQGRGTEALSMFSQMMFNTSSPNEFTVC 268

Query: 1161 TVLDACGEEKELKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNR 1340
            +VL ACGEEK L+ G+QLHGA IK M+ +DVFIGT+LV MYAKCGE  DSR VFD M+ R
Sbjct: 269  SVLKACGEEKALEFGKQLHGAIIKKMFKEDVFIGTSLVGMYAKCGEILDSRKVFDGMKKR 328

Query: 1341 NTVTWTTIIAGYARNGFGEEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVH 1520
            NTVTWT+IIAGYARNG GEEAI+LFRVM+RRKI ANNLT+VSILRACG    L  GKEVH
Sbjct: 329  NTVTWTSIIAGYARNGQGEEAISLFRVMKRRKIFANNLTVVSILRACGSTRNLLMGKEVH 388

Query: 1521 AQIFKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEH 1700
            AQI KN  QSNI+IGS LVW YC+C ++  ASKVL+ MPLRDVVSWTA+ISG T+LGHE 
Sbjct: 389  AQIMKNSMQSNIYIGSTLVWFYCKCEEHPFASKVLQNMPLRDVVSWTAIISGYTSLGHEP 448

Query: 1701 EALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALI 1880
            EALE+LKEML EGV PNPFTYSSALKACA LE I QGKLIHSS++KT A S+VFVGSALI
Sbjct: 449  EALEFLKEMLEEGVEPNPFTYSSALKACAHLEAILQGKLIHSSVNKTLALSNVFVGSALI 508

Query: 1881 HMYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDY 2060
            +MYAKCG++SEAI+VFD+MP++NLVS KAMI+ YA+NGLC EALKL+YRMQAEGIE+DDY
Sbjct: 509  NMYAKCGYVSEAIQVFDSMPQRNLVSWKAMIVGYARNGLCGEALKLMYRMQAEGIEVDDY 568

Query: 2061 VLSTVLTECGDFE 2099
            +L+TVL+ CGD E
Sbjct: 569  ILTTVLSACGDVE 581



 Score =  118 bits (296), Expect = 1e-23
 Identities = 70/205 (34%), Positives = 110/205 (53%)
 Frame = +3

Query: 1509 KEVHAQIFKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNL 1688
            + VHA +FK    S  ++ + L+  Y R G    A KV + MP R+VVSWTA+++G +  
Sbjct: 83   RRVHAVVFKCLDNSVTYVNNNLISAYSRFGKLVEARKVFDKMPERNVVSWTAVVNGYSRY 142

Query: 1689 GHEHEALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVG 1868
            G + EAL    + +  GV  N  T+   L  C++  D   G+ IH+ I K    + + V 
Sbjct: 143  GFDDEALRLFDDCIENGVRANGKTFVCVLNLCSKRLDFELGRQIHACIVKDNWRNLI-VD 201

Query: 1869 SALIHMYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIE 2048
            SAL+  YA+CG LS A   FD MPE+++V    MI A ++ G   EAL +  +M      
Sbjct: 202  SALVCFYAQCGDLSGAFHAFDQMPERDVVCWTTMITACSQQGRGTEALSMFSQMMFNTSS 261

Query: 2049 LDDYVLSTVLTECGDFERDKSTKEL 2123
             +++ + +VL  CG+ +  +  K+L
Sbjct: 262  PNEFTVCSVLKACGEEKALEFGKQL 286


>ref|XP_006472673.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18520-like
            [Citrus sinensis]
          Length = 620

 Score =  671 bits (1731), Expect = 0.0
 Identities = 350/610 (57%), Positives = 438/610 (71%), Gaps = 9/610 (1%)
 Frame = +3

Query: 297  VNITIFRIQFPNFDVQLPKALEYGTNFTKKSSCTNTLNRNIS-------CFGSHDPFPSV 455
            +++T+F  QF  F       ++      K+   + T+N N         CF         
Sbjct: 2    LSLTLFSSQFSLFQQSSLLNIQ-PLQSPKQKHSSKTINSNYQKNPNDCRCFYRETTNSHT 60

Query: 456  HYDNSVTSSIPSNNIDNHKKAEIFDPSSVSNG--STLLAYLLQSCTDKEEVRIIHAIVLK 629
            ++  S       +N D    AE     SVS    S LLA  L+SC   ++V+ IHAIVLK
Sbjct: 61   NFHKSSDDFSLQDNAD----AESSHAPSVSQSVDSNLLAIWLRSCNTVKQVKRIHAIVLK 116

Query: 630  ISKESNIFVDNNLISGYARYGGIVEARKVFDEMLERNVVSWTAMLNAYLRFGYDNEAIRH 809
              ++S ++VDNNLIS Y + G +VEARKVFD+M ERNVVSWTAM+N Y RFG+ +EA   
Sbjct: 117  SLRKSVLYVDNNLISVYVKLGKLVEARKVFDKMSERNVVSWTAMVNGYSRFGFVDEAFML 176

Query: 810  FINFVESGIPWSSKTFVCMLNMCSKILDLELGKQVHTCMVKGGFSGLILNSALLYFYVQC 989
            F   + +G+  + K FVC++N+C + LD ELG+Q+H  ++K     LI++SA+L+FY Q 
Sbjct: 177  FSESIRTGVRGNEKMFVCVMNLCGRRLDFELGRQIHASILKCHCRNLIVDSAILHFYAQF 236

Query: 990  GDLENAVSVFDHIKERDVICWTIMISAYSQHRRGEEAFKTFSRMLSDRVEPNEFTICTVL 1169
            GD  +A   FD + ERDV+ WT MI+A SQ  RG+EA   FSRMLS+   PNEFT+C+VL
Sbjct: 237  GDFSSAFCAFDGMSERDVVSWTAMITACSQQARGDEAIFMFSRMLSEGFLPNEFTVCSVL 296

Query: 1170 DACGEEKELKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTV 1349
             ACGEE+ LKLGRQLH A +K +Y  DVFI T+LVDMYAKCGE  DSR VFD M NRNTV
Sbjct: 297  KACGEERALKLGRQLHAAIVKKLYKDDVFIRTSLVDMYAKCGEILDSRRVFDEMGNRNTV 356

Query: 1350 TWTTIIAGYARNGFGEEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHAQI 1529
            TWT+IIAGYAR G GEEAI LFRVM+RRKI ANNLT+VSIL+ACGL+GA   GKEVHAQI
Sbjct: 357  TWTSIIAGYAREGLGEEAIGLFRVMKRRKIHANNLTIVSILKACGLIGAFQLGKEVHAQI 416

Query: 1530 FKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEAL 1709
             KN+ QSN HIGS LVW YC+ G++  ASKVL+ MPLRDVVSWTA+ISG   LGHE EAL
Sbjct: 417  IKNFIQSNEHIGSTLVWFYCKNGEFPVASKVLQQMPLRDVVSWTAIISGYACLGHESEAL 476

Query: 1710 EYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHMY 1889
            ++LK+ML EGV PNPFTYSSALKACA+LE++ QG LIHSS+ KT + S+V+VGSALI+MY
Sbjct: 477  DFLKDMLEEGVEPNPFTYSSALKACAKLENVSQGMLIHSSVKKTTSLSNVYVGSALINMY 536

Query: 1890 AKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVLS 2069
            AKCG++SEA +VFDNMPE+NL + K+MI+ YA+NGLC+EALKL+YRMQAEG E+DDY+L 
Sbjct: 537  AKCGYISEASQVFDNMPERNLFTWKSMIVGYARNGLCQEALKLMYRMQAEGFEVDDYILI 596

Query: 2070 TVLTECGDFE 2099
            TV   CGD E
Sbjct: 597  TVYNACGDIE 606



 Score =  203 bits (516), Expect = 3e-49
 Identities = 116/360 (32%), Positives = 194/360 (53%)
 Frame = +3

Query: 1095 EAFKTFSRMLSDRVEPNEFTICTVLDACGEEKELKLGRQLHGATIKTMYIKDVFIGTALV 1274
            +A  + +  +S  V+ N   I   L +C   K++K   ++H   +K++    +++   L+
Sbjct: 76   DAESSHAPSVSQSVDSNLLAIW--LRSCNTVKQVK---RIHAIVLKSLRKSVLYVDNNLI 130

Query: 1275 DMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARNGFGEEAINLFRVMQRRKILANNL 1454
             +Y K G+  ++R VFD M  RN V+WT ++ GY+R GF +EA  LF    R  +  N  
Sbjct: 131  SVYVKLGKLVEARKVFDKMSERNVVSWTAMVNGYSRFGFVDEAFMLFSESIRTGVRGNEK 190

Query: 1455 TMVSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYM 1634
              V ++  CG       G+++HA I K + + N+ + SA++  Y + GD+S+A    + M
Sbjct: 191  MFVCVMNLCGRRLDFELGRQIHASILKCHCR-NLIVDSAILHFYAQFGDFSSAFCAFDGM 249

Query: 1635 PLRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGK 1814
              RDVVSWTAMI+ C+      EA+     ML EG  PN FT  S LKAC +   ++ G+
Sbjct: 250  SERDVVSWTAMITACSQQARGDEAIFMFSRMLSEGFLPNEFTVCSVLKACGEERALKLGR 309

Query: 1815 LIHSSIHKTPASSSVFVGSALIHMYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNG 1994
             +H++I K      VF+ ++L+ MYAKCG + ++ RVFD M  +N V+  ++I  YA+ G
Sbjct: 310  QLHAAIVKKLYKDDVFIRTSLVDMYAKCGEILDSRRVFDEMGNRNTVTWTSIIAGYAREG 369

Query: 1995 LCREALKLVYRMQAEGIELDDYVLSTVLTECGDFERDKSTKELAFHLV*IFFER*SNMAS 2174
            L  EA+ L   M+   I  ++  + ++L  CG     +  KE+   ++  F +   ++ S
Sbjct: 370  LGEEAIGLFRVMKRRKIHANNLTIVSILKACGLIGAFQLGKEVHAQIIKNFIQSNEHIGS 429


>ref|XP_004301943.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18520-like
            [Fragaria vesca subsp. vesca]
          Length = 583

 Score =  668 bits (1724), Expect = 0.0
 Identities = 335/577 (58%), Positives = 430/577 (74%), Gaps = 2/577 (0%)
 Frame = +3

Query: 381  KKSSCTNTLNRNISCFGSHDPFPSVHYDNSVTSSIPSNNIDNHKKAEIFDPSSVSNGST- 557
            +K +CTN  N+ +S   S+D       + S+T++  S     + +AE  +  +++     
Sbjct: 2    RKLTCTN--NKRLSSRISND-------ETSLTTTDLST--PENPEAEFSNTQTLTQSLRP 50

Query: 558  -LLAYLLQSCTDKEEVRIIHAIVLKISKESNIFVDNNLISGYARYGGIVEARKVFDEMLE 734
             LLA  L+SC   +EVR +HA++L+       +V NNL+  Y  +G +V ARKVFDEM  
Sbjct: 51   YLLALWLRSCRSLKEVRRVHALILRCICNPVTYVYNNLMCAYLGFGELVNARKVFDEMAV 110

Query: 735  RNVVSWTAMLNAYLRFGYDNEAIRHFINFVESGIPWSSKTFVCMLNMCSKILDLELGKQV 914
            RNVVSWTA++N YL FG D+EA+  F   V+ GI  +   FVC+LN+C K LD ELG+QV
Sbjct: 111  RNVVSWTAIVNGYLNFGLDDEALGLFSEAVDEGIQPNGNMFVCVLNLCCKRLDYELGRQV 170

Query: 915  HTCMVKGGFSGLILNSALLYFYVQCGDLENAVSVFDHIKERDVICWTIMISAYSQHRRGE 1094
            H  +VKGG+S +I++S ++  Y QCG+  +A   FD + + DVICWT MI+A SQ  RG 
Sbjct: 171  HAGVVKGGWSNMIVDSTIVKLYAQCGEFSSAFRAFDQMPKLDVICWTTMITACSQQGRGM 230

Query: 1095 EAFKTFSRMLSDRVEPNEFTICTVLDACGEEKELKLGRQLHGATIKTMYIKDVFIGTALV 1274
            EAF  F++MLSD   PNEFT+C VL ACGEEKEL+ GRQLHGA +K +Y  D+F+ TALV
Sbjct: 231  EAFSLFAQMLSDGFSPNEFTVCGVLKACGEEKELRFGRQLHGAIVKKIYKSDIFVATALV 290

Query: 1275 DMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARNGFGEEAINLFRVMQRRKILANNL 1454
            DMYAKCGE EDSR VFD MRNRNTVTWT+IIAGYAR G  EEAI LFR+M+RR I  NNL
Sbjct: 291  DMYAKCGEIEDSRYVFDGMRNRNTVTWTSIIAGYARKGLSEEAICLFRLMKRRNIHVNNL 350

Query: 1455 TMVSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYM 1634
            T+VSILRACGL+   P G+EVHA I KN  ++N+++GS LVW YC+CG+YS A+KVL+ M
Sbjct: 351  TIVSILRACGLIRCSPIGREVHAHIIKNSVETNLYLGSTLVWFYCKCGEYSTATKVLQQM 410

Query: 1635 PLRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGK 1814
            PLRDVVSWTA+ISGC +LGHE EA+E LKEM+ +GV PN FTYSSALKACA LE +  G+
Sbjct: 411  PLRDVVSWTAIISGCAHLGHESEAIELLKEMMEDGVEPNAFTYSSALKACANLETVLHGQ 470

Query: 1815 LIHSSIHKTPASSSVFVGSALIHMYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNG 1994
            L+HSS +K+PA S+V+VGSALI+MYAKCG+++EA +VFD+MPE+NLVS KAMI+ YA+NG
Sbjct: 471  LVHSSANKSPAMSNVYVGSALIYMYAKCGYVTEASQVFDSMPERNLVSWKAMIVGYARNG 530

Query: 1995 LCREALKLVYRMQAEGIELDDYVLSTVLTECGDFERD 2105
             C+EALKL+YRMQAEG ELDDY+++TVLT CGD E D
Sbjct: 531  HCQEALKLMYRMQAEGFELDDYIVATVLTACGDLEWD 567



 Score =  197 bits (501), Expect = 2e-47
 Identities = 107/317 (33%), Positives = 170/317 (53%)
 Frame = +3

Query: 1188 KELKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTII 1367
            + LK  R++H   ++ +     ++   L+  Y   GE  ++R VFD M  RN V+WT I+
Sbjct: 61   RSLKEVRRVHALILRCICNPVTYVYNNLMCAYLGFGELVNARKVFDEMAVRNVVSWTAIV 120

Query: 1368 AGYARNGFGEEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHAQIFKNYAQ 1547
             GY   G  +EA+ LF       I  N    V +L  C        G++VHA + K    
Sbjct: 121  NGYLNFGLDDEALGLFSEAVDEGIQPNGNMFVCVLNLCCKRLDYELGRQVHAGVVKG-GW 179

Query: 1548 SNIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEM 1727
            SN+ + S +V LY +CG++S+A +  + MP  DV+ WT MI+ C+  G   EA     +M
Sbjct: 180  SNMIVDSTIVKLYAQCGEFSSAFRAFDQMPKLDVICWTTMITACSQQGRGMEAFSLFAQM 239

Query: 1728 LGEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHL 1907
            L +G SPN FT    LKAC + +++R G+ +H +I K    S +FV +AL+ MYAKCG +
Sbjct: 240  LSDGFSPNEFTVCGVLKACGEEKELRFGRQLHGAIVKKIYKSDIFVATALVDMYAKCGEI 299

Query: 1908 SEAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVLSTVLTEC 2087
             ++  VFD M  +N V+  ++I  YA+ GL  EA+ L   M+   I +++  + ++L  C
Sbjct: 300  EDSRYVFDGMRNRNTVTWTSIIAGYARKGLSEEAICLFRLMKRRNIHVNNLTIVSILRAC 359

Query: 2088 GDFERDKSTKELAFHLV 2138
            G        +E+  H++
Sbjct: 360  GLIRCSPIGREVHAHII 376


>ref|XP_002513855.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223546941|gb|EEF48438.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 498

 Score =  667 bits (1722), Expect = 0.0
 Identities = 318/485 (65%), Positives = 401/485 (82%)
 Frame = +3

Query: 648  IFVDNNLISGYARYGGIVEARKVFDEMLERNVVSWTAMLNAYLRFGYDNEAIRHFINFVE 827
            ++VDNNLIS YAR G ++EARKVFD+M ER VVSWTAM+N Y+ FG D+EA+R F   +E
Sbjct: 1    MYVDNNLISVYARLGELIEARKVFDQMHERCVVSWTAMINGYVSFGLDDEALRLFSELIE 60

Query: 828  SGIPWSSKTFVCMLNMCSKILDLELGKQVHTCMVKGGFSGLILNSALLYFYVQCGDLENA 1007
            +G+  +++TFVC+LN+CSK LD ELG+Q+H C+VKG +  LI++SA++ FY QCGDLE+A
Sbjct: 61   NGVTANNRTFVCILNVCSKRLDFELGRQIHACVVKGNWRNLIVDSAIVSFYAQCGDLESA 120

Query: 1008 VSVFDHIKERDVICWTIMISAYSQHRRGEEAFKTFSRMLSDRVEPNEFTICTVLDACGEE 1187
               F  ++E+DV+CWT +ISA SQ  RGEEAF+ FS+ML +   PNEFT+C +L ACGE+
Sbjct: 121  FCAFFQVREKDVVCWTSVISACSQQGRGEEAFRMFSQMLGEGFLPNEFTVCAILKACGEK 180

Query: 1188 KELKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTII 1367
            K LK GRQLH A +K MY  DVFIGT+LVDMYAKCGE  DS+ VFD MR RNTVTWT+II
Sbjct: 181  KALKFGRQLHCAIVKGMYKDDVFIGTSLVDMYAKCGEMIDSKEVFDGMRKRNTVTWTSII 240

Query: 1368 AGYARNGFGEEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHAQIFKNYAQ 1547
            AGYAR G GEEAI LFRVM+RRKI++NNLT+VS+LRACG + A  TG+EVHAQI K+  Q
Sbjct: 241  AGYARKGLGEEAIRLFRVMKRRKIISNNLTVVSVLRACGSISASLTGREVHAQIIKSGIQ 300

Query: 1548 SNIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEM 1727
            SN+++GS LVW YC+CG+++ ASKVL+ M  R+VVSWTAMISG   LG+E EALE+LKEM
Sbjct: 301  SNVYLGSTLVWFYCKCGEFNIASKVLQQMSFRNVVSWTAMISGYIGLGYEFEALEFLKEM 360

Query: 1728 LGEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHL 1907
            + EGV PN FTYSSALKACA LE + QGKLIHS  +KTPASS+V+VGSALI+MY+KCG+L
Sbjct: 361  MDEGVEPNEFTYSSALKACANLESVLQGKLIHSFANKTPASSNVYVGSALIYMYSKCGYL 420

Query: 1908 SEAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVLSTVLTEC 2087
            S+AI+VFD+MPE+NL+S K MI++YA+NGLCREALKL+YRMQAEGIE+DDY+ ++V+  C
Sbjct: 421  SDAIQVFDSMPERNLISWKTMILSYARNGLCREALKLMYRMQAEGIEVDDYIYASVMGSC 480

Query: 2088 GDFER 2102
            GD +R
Sbjct: 481  GDVDR 485



 Score =  243 bits (619), Expect = 4e-61
 Identities = 138/431 (32%), Positives = 236/431 (54%), Gaps = 4/431 (0%)
 Frame = +3

Query: 516  AEIFDPSSVSNGSTLLAYLLQSCT---DKEEVRIIHAIVLKISKESNIFVDNNLISGYAR 686
            +E+ +    +N  T +  +L  C+   D E  R IHA V+K     N+ VD+ ++S YA+
Sbjct: 56   SELIENGVTANNRTFVC-ILNVCSKRLDFELGRQIHACVVK-GNWRNLIVDSAIVSFYAQ 113

Query: 687  YGGIVEARKVFDEMLERNVVSWTAMLNAYLRFGYDNEAIRHFINFVESGIPWSSKTFVCM 866
             G +  A   F ++ E++VV WT++++A  + G   EA R F   +  G   +  T   +
Sbjct: 114  CGDLESAFCAFFQVREKDVVCWTSVISACSQQGRGEEAFRMFSQMLGEGFLPNEFTVCAI 173

Query: 867  LNMCSKILDLELGKQVHTCMVKGGF-SGLILNSALLYFYVQCGDLENAVSVFDHIKERDV 1043
            L  C +   L+ G+Q+H  +VKG +   + + ++L+  Y +CG++ ++  VFD +++R+ 
Sbjct: 174  LKACGEKKALKFGRQLHCAIVKGMYKDDVFIGTSLVDMYAKCGEMIDSKEVFDGMRKRNT 233

Query: 1044 ICWTIMISAYSQHRRGEEAFKTFSRMLSDRVEPNEFTICTVLDACGEEKELKLGRQLHGA 1223
            + WT +I+ Y++   GEEA + F  M   ++  N  T+ +VL ACG       GR++H  
Sbjct: 234  VTWTSIIAGYARKGLGEEAIRLFRVMKRRKIISNNLTVVSVLRACGSISASLTGREVHAQ 293

Query: 1224 TIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARNGFGEEA 1403
             IK+    +V++G+ LV  Y KCGE   +  V   M  RN V+WT +I+GY   G+  EA
Sbjct: 294  IIKSGIQSNVYLGSTLVWFYCKCGEFNIASKVLQQMSFRNVVSWTAMISGYIGLGYEFEA 353

Query: 1404 INLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIGSALVWL 1583
            +   + M    +  N  T  S L+AC  L ++  GK +H+   K  A SN+++GSAL+++
Sbjct: 354  LEFLKEMMDEGVEPNEFTYSSALKACANLESVLQGKLIHSFANKTPASSNVYVGSALIYM 413

Query: 1584 YCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVSPNPFTY 1763
            Y +CG  S A +V + MP R+++SW  MI      G   EAL+ +  M  EG+  + + Y
Sbjct: 414  YSKCGYLSDAIQVFDSMPERNLISWKTMILSYARNGLCREALKLMYRMQAEGIEVDDYIY 473

Query: 1764 SSALKACAQLE 1796
            +S + +C  ++
Sbjct: 474  ASVMGSCGDVD 484



 Score =  194 bits (493), Expect = 1e-46
 Identities = 102/296 (34%), Positives = 165/296 (55%)
 Frame = +3

Query: 1251 VFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARNGFGEEAINLFRVMQR 1430
            +++   L+ +YA+ GE  ++R VFD M  R  V+WT +I GY   G  +EA+ LF  +  
Sbjct: 1    MYVDNNLISVYARLGELIEARKVFDQMHERCVVSWTAMINGYVSFGLDDEALRLFSELIE 60

Query: 1431 RKILANNLTMVSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIGSALVWLYCRCGDYSA 1610
              + ANN T V IL  C        G+++HA + K   + N+ + SA+V  Y +CGD  +
Sbjct: 61   NGVTANNRTFVCILNVCSKRLDFELGRQIHACVVKGNWR-NLIVDSAIVSFYAQCGDLES 119

Query: 1611 ASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVSPNPFTYSSALKACAQ 1790
            A      +  +DVV WT++IS C+  G   EA     +MLGEG  PN FT  + LKAC +
Sbjct: 120  AFCAFFQVREKDVVCWTSVISACSQQGRGEEAFRMFSQMLGEGFLPNEFTVCAILKACGE 179

Query: 1791 LEDIRQGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHLSEAIRVFDNMPEQNLVSCKAM 1970
             + ++ G+ +H +I K      VF+G++L+ MYAKCG + ++  VFD M ++N V+  ++
Sbjct: 180  KKALKFGRQLHCAIVKGMYKDDVFIGTSLVDMYAKCGEMIDSKEVFDGMRKRNTVTWTSI 239

Query: 1971 IIAYAKNGLCREALKLVYRMQAEGIELDDYVLSTVLTECGDFERDKSTKELAFHLV 2138
            I  YA+ GL  EA++L   M+   I  ++  + +VL  CG      + +E+   ++
Sbjct: 240  IAGYARKGLGEEAIRLFRVMKRRKIISNNLTVVSVLRACGSISASLTGREVHAQII 295


>gb|EXC06699.1| hypothetical protein L484_021537 [Morus notabilis]
          Length = 616

 Score =  659 bits (1699), Expect = 0.0
 Identities = 345/616 (56%), Positives = 441/616 (71%), Gaps = 5/616 (0%)
 Frame = +3

Query: 273  MLSPLLPSVNITIFRIQFPNFDVQLPKALEYGTNF----TKKSSCTNTLNRNISCFGSHD 440
            MLSP   S  I+    Q P         L   TN     TK S C +  N       S D
Sbjct: 1    MLSPTFLSPRISFNLSQPPPLFPLQRNKLTTTTNHKQVNTKNSRCFSRGNNP----SSSD 56

Query: 441  PFPSVHYDNSVTSSIPSNNIDNHKKAEIFDPS-SVSNGSTLLAYLLQSCTDKEEVRIIHA 617
             FPS            SN ++N       D S S S    LLA+ L+S    ++V+ +HA
Sbjct: 57   NFPSFEDF--------SNFLENPDSESPADQSFSQSLCPFLLAFWLRSSRTLKDVKRVHA 108

Query: 618  IVLKISKESNIFVDNNLISGYARYGGIVEARKVFDEMLERNVVSWTAMLNAYLRFGYDNE 797
            IVL+  +  + +V NNLI  Y R+  + EAR VFD+M  RNVV+WTA++N YL FG+D+E
Sbjct: 109  IVLRRLRNPDAYVYNNLICVYFRFEKLNEARNVFDKMSLRNVVTWTAVINGYLSFGFDDE 168

Query: 798  AIRHFINFVESGIPWSSKTFVCMLNMCSKILDLELGKQVHTCMVKGGFSGLILNSALLYF 977
            A+  F ++VESG+  + K FVC+LN+CSK  D ELG+Q+H  +VKG +S +I+ S+++ F
Sbjct: 169  ALSLFSDYVESGVRPNGKMFVCVLNLCSKRKDFELGRQIHAGVVKGRWSNMIVESSIVKF 228

Query: 978  YVQCGDLENAVSVFDHIKERDVICWTIMISAYSQHRRGEEAFKTFSRMLSDRVEPNEFTI 1157
            Y +CGD+ +A   FD + +RDV+CWT MI+A SQ  +G+EAF  FSRML++   PNEFT+
Sbjct: 229  YAKCGDMLSAFRKFDQMLKRDVVCWTTMITACSQQGKGKEAFSLFSRMLNEGFSPNEFTV 288

Query: 1158 CTVLDACGEEKELKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRN 1337
            C VL AC EEKEL  GRQLHGA +K MY  DVFIGT+LVDMYAKCGE  DSR VF+ MR+
Sbjct: 289  CGVLKACSEEKELNFGRQLHGAIVKKMYKNDVFIGTSLVDMYAKCGEILDSRNVFNKMRH 348

Query: 1338 RNTVTWTTIIAGYARNGFGEEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEV 1517
            RNTVTWT+IIAGYAR G G EA+ LFRVM++R IL NNLT+VSILRACGL+     G+EV
Sbjct: 349  RNTVTWTSIIAGYARKGLGHEALKLFRVMKKRNILTNNLTIVSILRACGLIRESLIGREV 408

Query: 1518 HAQIFKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHE 1697
            HAQI KN  ++N+++GS LVW YCRC +YS A+K L  MPLRDV SWTA+ISGC +LGHE
Sbjct: 409  HAQIIKNSIETNLYLGSTLVWFYCRCDEYSNATKALLQMPLRDVFSWTALISGCAHLGHE 468

Query: 1698 HEALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSAL 1877
             EALE+LK+M+ EGV PN FTYSSALKACA+LE I  G+LIHSS +KT + S+VFVGSAL
Sbjct: 469  TEALEFLKDMMEEGVEPNSFTYSSALKACARLEAILHGRLIHSSANKTSSMSNVFVGSAL 528

Query: 1878 IHMYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDD 2057
            I+MYAKCG+++EA++VFD+MPE+NLV+ K+MI+ YA+NGLCREAL+L+YRMQAEG ++DD
Sbjct: 529  IYMYAKCGYVAEALQVFDSMPERNLVAWKSMIVGYARNGLCREALRLMYRMQAEGFQVDD 588

Query: 2058 YVLSTVLTECGDFERD 2105
            Y+L+TVLT CGD E D
Sbjct: 589  YILTTVLTACGDIELD 604


>gb|EMJ23235.1| hypothetical protein PRUPE_ppa003044mg [Prunus persica]
          Length = 609

 Score =  653 bits (1685), Expect = 0.0
 Identities = 323/520 (62%), Positives = 401/520 (77%), Gaps = 2/520 (0%)
 Frame = +3

Query: 558  LLAYLLQSCTDKEEVRIIHAIVLKISKESNIFVDNNLISGYARYGGIVEARKVFDEMLER 737
            LLA  L+SC    EVR +HA+VL+       +V NNLI  Y  +G +V+ARKV D+M  R
Sbjct: 82   LLALWLRSCRSLNEVRRLHAVVLRCLANPVTYVFNNLICAYIVFGKLVDARKVLDKMTVR 141

Query: 738  NVVSWTAMLNAYLRFGYDNEAIRHFINFVESGIPWSSKTFVCMLNMCSKILDLELGKQVH 917
            NVVSWTA++N YL FG D+EA+  F   +  G+  +   FVC+LN+CSK +D ELG+QVH
Sbjct: 142  NVVSWTAIINGYLNFGLDDEALGLFSYAINEGVQPNGNMFVCVLNLCSKRVDYELGRQVH 201

Query: 918  TCMVKGGFSGLILNSALLYFYVQCGDLENAVSVFDHIKERDVICWTIMISAYSQHRRGEE 1097
              ++KGG+S LI++SA++  Y QCG+L +A   FD + + DV+CWT MI+A SQ   G+E
Sbjct: 202  GGVLKGGWSNLIVDSAVVKLYAQCGELSSAYRAFDQMPKSDVVCWTTMITACSQQGHGQE 261

Query: 1098 AFKTFSRMLSDRVEPNEFTICTVLDACGEEKELKLGRQLHGATIKTMYIKDVFIGTALVD 1277
            AF  FS+MLS+   PNEFT+C VL ACGEEKEL+ GRQLHGA +K +Y  DVFI T+LVD
Sbjct: 262  AFSLFSQMLSEGFSPNEFTVCGVLKACGEEKELRFGRQLHGAIVKKIYKNDVFIETSLVD 321

Query: 1278 MYAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARNGFGEEAINLFRVMQRRKILANNLT 1457
            MYAKCGE  DSR VFD MRNRNTVTWT+IIAGYAR GF EEAI LF+VM+RR I  NNLT
Sbjct: 322  MYAKCGEMIDSRTVFDGMRNRNTVTWTSIIAGYARKGFSEEAICLFQVMKRRNIFVNNLT 381

Query: 1458 MVSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYMP 1637
            +VSILRACG +     G+EVHAQI KN  ++N H+GS LVW YCRCG+YS A+KVL+ MP
Sbjct: 382  IVSILRACGSMRDSLMGREVHAQIVKNSTETNSHLGSTLVWFYCRCGEYSNATKVLQQMP 441

Query: 1638 LRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGKL 1817
            LRDVVSWTA+ISGC +LG E EALE+L EM+ +GV PN FTYSSALKACAQLE +  GKL
Sbjct: 442  LRDVVSWTAIISGCAHLGFESEALEFLNEMMEDGVEPNAFTYSSALKACAQLETVLHGKL 501

Query: 1818 IHSSIHKTPASSSVFVGSALIHMYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNGL 1997
            IHSS +K+ A S+VFVGSALI MYAKCG+++EA +VFD+MPE+NLVS KAMI+ YAKNGL
Sbjct: 502  IHSSANKSAAMSNVFVGSALISMYAKCGYVTEAFQVFDSMPERNLVSWKAMIVGYAKNGL 561

Query: 1998 CREALKLVYRMQAEGIELDDYVLSTVLTECGD--FERDKS 2111
            C+EA+KL+YRM+ EG E+DDY+L+TVLT CG+  +E D S
Sbjct: 562  CQEAMKLMYRMRTEGFEVDDYILATVLTACGELGWEMDPS 601



 Score =  197 bits (501), Expect = 2e-47
 Identities = 105/311 (33%), Positives = 164/311 (52%)
 Frame = +3

Query: 1206 RQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARN 1385
            R+LH   ++ +     ++   L+  Y   G+  D+R V D M  RN V+WT II GY   
Sbjct: 97   RRLHAVVLRCLANPVTYVFNNLICAYIVFGKLVDARKVLDKMTVRNVVSWTAIINGYLNF 156

Query: 1386 GFGEEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIG 1565
            G  +EA+ LF       +  N    V +L  C        G++VH  + K    SN+ + 
Sbjct: 157  GLDDEALGLFSYAINEGVQPNGNMFVCVLNLCSKRVDYELGRQVHGGVLKG-GWSNLIVD 215

Query: 1566 SALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVS 1745
            SA+V LY +CG+ S+A +  + MP  DVV WT MI+ C+  GH  EA     +ML EG S
Sbjct: 216  SAVVKLYAQCGELSSAYRAFDQMPKSDVVCWTTMITACSQQGHGQEAFSLFSQMLSEGFS 275

Query: 1746 PNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHLSEAIRV 1925
            PN FT    LKAC + +++R G+ +H +I K    + VF+ ++L+ MYAKCG + ++  V
Sbjct: 276  PNEFTVCGVLKACGEEKELRFGRQLHGAIVKKIYKNDVFIETSLVDMYAKCGEMIDSRTV 335

Query: 1926 FDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVLSTVLTECGDFERD 2105
            FD M  +N V+  ++I  YA+ G   EA+ L   M+   I +++  + ++L  CG     
Sbjct: 336  FDGMRNRNTVTWTSIIAGYARKGFSEEAICLFQVMKRRNIFVNNLTIVSILRACGSMRDS 395

Query: 2106 KSTKELAFHLV 2138
               +E+   +V
Sbjct: 396  LMGREVHAQIV 406


>ref|XP_004138810.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18520-like
            [Cucumis sativus] gi|449490224|ref|XP_004158542.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g18520-like [Cucumis sativus]
          Length = 619

 Score =  645 bits (1663), Expect = 0.0
 Identities = 324/565 (57%), Positives = 414/565 (73%), Gaps = 21/565 (3%)
 Frame = +3

Query: 474  TSSIPSNNIDNHKKA--------------EIFDPSSVSNGST-------LLAYLLQSCTD 590
            ++S P N +++H KA              E  D  S SN S        L+   L+S   
Sbjct: 43   STSFPFNFVEDHSKALPVACATGKCTTTEEYADVESCSNQSVSGCLSPYLIGVWLRSSRS 102

Query: 591  KEEVRIIHAIVLKISKESNIFVDNNLISGYARYGGIVEARKVFDEMLERNVVSWTAMLNA 770
             +++R +HA +L+      I+V NNL+S Y R G +V+ARKVFDEM  R+VV+WTA++N 
Sbjct: 103  VKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIING 162

Query: 771  YLRFGYDNEAIRHFINFVESGIPWSSKTFVCMLNMCSKILDLELGKQVHTCMVKGGFSGL 950
            Y+      EA+  F + V+SG+  + + FVC+LN+C+K LD ELG+Q+H  +VKG    L
Sbjct: 163  YIDLDLTEEALALFSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNL 222

Query: 951  ILNSALLYFYVQCGDLENAVSVFDHIKERDVICWTIMISAYSQHRRGEEAFKTFSRMLSD 1130
            I++SA++YFY QC D+ +A   F+ ++ RDV+CWT MI++ SQ   G EA   FS MLSD
Sbjct: 223  IVDSAIIYFYAQCKDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSD 282

Query: 1131 RVEPNEFTICTVLDACGEEKELKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDS 1310
               PNEF++C+VL ACGEE+ELK+GRQLHG  IK +   DVF+GT+LVDMYAKCG   DS
Sbjct: 283  EFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADS 342

Query: 1311 RLVFDCMRNRNTVTWTTIIAGYARNGFGEEAINLFRVMQRRKILANNLTMVSILRACGLL 1490
            R VFD MRNRNTVTWT+IIAGYAR G GEEA+NLFR+M+R++I ANNLT+VSILRACG +
Sbjct: 343  REVFDGMRNRNTVTWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSI 402

Query: 1491 GALPTGKEVHAQIFKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMI 1670
             A  TG+EVHAQI KN  Q+NIHIGS LVW YC+C +   AS VL+ MPLRDVVSWTA+I
Sbjct: 403  EASLTGREVHAQIVKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAII 462

Query: 1671 SGCTNLGHEHEALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPAS 1850
            SGC +LGHE EALE+LK M+ EGV PN FTYSS LKACA++E + QGK+IHSS +KT A 
Sbjct: 463  SGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSAL 522

Query: 1851 SSVFVGSALIHMYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRM 2030
            S+VFVGSALI+MYAKCG+++EA +VFD+MP +NLVS KAMI+ YA+NGLCREALKL+YRM
Sbjct: 523  SNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRM 582

Query: 2031 QAEGIELDDYVLSTVLTECGDFERD 2105
            QAEG E+DDY+L TV   CGD + D
Sbjct: 583  QAEGFEVDDYILGTVYGACGDVKCD 607



 Score =  198 bits (504), Expect = 8e-48
 Identities = 109/317 (34%), Positives = 171/317 (53%)
 Frame = +3

Query: 1188 KELKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTII 1367
            + +K  R +H   ++      +++G  L+  Y + G   D+R VFD M  R+ VTWT II
Sbjct: 101  RSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAII 160

Query: 1368 AGYARNGFGEEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHAQIFKNYAQ 1547
             GY      EEA+ LF    +  +LAN    V IL  C        G+++H  I K   +
Sbjct: 161  NGYIDLDLTEEALALFSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGN-R 219

Query: 1548 SNIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEM 1727
             N+ + SA+++ Y +C D S+A    E M  RDVV WT+MI+ C+  G   EA+     M
Sbjct: 220  GNLIVDSAIIYFYAQCKDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNM 279

Query: 1728 LGEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHL 1907
            L +   PN F+  S LKAC +  +++ G+ +H  I K    + VFVG++L+ MYAKCG+L
Sbjct: 280  LSDEFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNL 339

Query: 1908 SEAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVLSTVLTEC 2087
            +++  VFD M  +N V+  ++I  YA+ GL  EAL L   M+ + I  ++  + ++L  C
Sbjct: 340  ADSREVFDGMRNRNTVTWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRAC 399

Query: 2088 GDFERDKSTKELAFHLV 2138
            G  E   + +E+   +V
Sbjct: 400  GSIEASLTGREVHAQIV 416


>ref|XP_002867987.1| binding protein [Arabidopsis lyrata subsp. lyrata]
            gi|297313823|gb|EFH44246.1| binding protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 616

 Score =  622 bits (1603), Expect = e-175
 Identities = 313/554 (56%), Positives = 402/554 (72%), Gaps = 1/554 (0%)
 Frame = +3

Query: 477  SSIPSNNIDNHKKAEIFDPSSVSNGSTLLAYLLQSCTDKEEVRIIHAIVLKISKESNIFV 656
            S +   N+ N      FD   V     LLA  LQS      ++ IHA+ LK   +  I+ 
Sbjct: 63   SDLQGENV-NRDDLSSFDSERVDYA--LLAEWLQSSNGMRLIKRIHAMALKCFDDQVIYF 119

Query: 657  DNNLISGYARYGGIVEARKVFDEMLERNVVSWTAMLNAYLRFGYDNEAIRHFINFVESGI 836
             NNLIS   R G +V ARKVFD M +RN V+WTAM++ YL+FG ++EA   F ++V+ GI
Sbjct: 120  GNNLISSCVRLGDLVYARKVFDSMPDRNTVTWTAMIDGYLKFGLEDEAFSLFEDYVKHGI 179

Query: 837  PWSS-KTFVCMLNMCSKILDLELGKQVHTCMVKGGFSGLILNSALLYFYVQCGDLENAVS 1013
             +++ + FVC+LN+CS+  + ELG+QVH  MVK G   LI+ S+L+YFY QCG+L +A+ 
Sbjct: 180  RFTNERMFVCLLNLCSRRSEFELGRQVHGNMVKVGVGNLIVESSLVYFYAQCGELTSALR 239

Query: 1014 VFDHIKERDVICWTIMISAYSQHRRGEEAFKTFSRMLSDRVEPNEFTICTVLDACGEEKE 1193
             FD ++E+DVI WT +ISA S+   G +A   F  ML+    PNEFT+C++L AC EEK 
Sbjct: 240  AFDMMEEKDVISWTAVISACSRKGHGNKAIFMFIGMLNHGFLPNEFTVCSILKACSEEKA 299

Query: 1194 LKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAG 1373
            ++ GRQ+H   +K M   DVF+GT+L+DMYAKCGE  D R VFD M NRNTVTWT+IIA 
Sbjct: 300  IRFGRQVHSLVVKRMIKTDVFVGTSLMDMYAKCGEISDCRKVFDGMSNRNTVTWTSIIAA 359

Query: 1374 YARNGFGEEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHAQIFKNYAQSN 1553
            +AR GFGEEAI+LFRVM+RR ++ANNLT+VSILRACG +GAL  GKE+HAQI KN  + N
Sbjct: 360  HAREGFGEEAISLFRVMKRRHLIANNLTVVSILRACGSVGALLLGKELHAQIIKNSIEKN 419

Query: 1554 IHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEMLG 1733
            ++IGS LVWLYC+CG+   A  VL+ +P RDVVSWTAMISGC++LGHE EAL++LKEM+ 
Sbjct: 420  VYIGSTLVWLYCKCGESRDAFNVLQQLPSRDVVSWTAMISGCSSLGHESEALDFLKEMIQ 479

Query: 1734 EGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHLSE 1913
            EGV PNPFTYSSALKACA  E +  G+ IHS   K  A S+VFVGSALIHMYAKCG +SE
Sbjct: 480  EGVEPNPFTYSSALKACANSESLLIGRSIHSIAKKNHALSNVFVGSALIHMYAKCGFVSE 539

Query: 1914 AIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVLSTVLTECGD 2093
            A RVFD+MPE+NLVS KAMI+ YA+NG CREALKL+YRM+AEG E+DDY+ +T+L+ CGD
Sbjct: 540  AFRVFDSMPEKNLVSWKAMIMGYARNGFCREALKLMYRMEAEGFEVDDYIFATILSTCGD 599

Query: 2094 FERDKSTKELAFHL 2135
             E D++      +L
Sbjct: 600  IELDEAEPSATCYL 613



 Score =  196 bits (499), Expect = 3e-47
 Identities = 107/316 (33%), Positives = 176/316 (55%), Gaps = 1/316 (0%)
 Frame = +3

Query: 1194 LKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAG 1373
            ++L +++H   +K    + ++ G  L+    + G+   +R VFD M +RNTVTWT +I G
Sbjct: 98   MRLIKRIHAMALKCFDDQVIYFGNNLISSCVRLGDLVYARKVFDSMPDRNTVTWTAMIDG 157

Query: 1374 YARNGFGEEAINLFRVMQRRKILANNLTM-VSILRACGLLGALPTGKEVHAQIFKNYAQS 1550
            Y + G  +EA +LF    +  I   N  M V +L  C        G++VH  + K     
Sbjct: 158  YLKFGLEDEAFSLFEDYVKHGIRFTNERMFVCLLNLCSRRSEFELGRQVHGNMVK-VGVG 216

Query: 1551 NIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEML 1730
            N+ + S+LV+ Y +CG+ ++A +  + M  +DV+SWTA+IS C+  GH ++A+     ML
Sbjct: 217  NLIVESSLVYFYAQCGELTSALRAFDMMEEKDVISWTAVISACSRKGHGNKAIFMFIGML 276

Query: 1731 GEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHLS 1910
              G  PN FT  S LKAC++ + IR G+ +HS + K    + VFVG++L+ MYAKCG +S
Sbjct: 277  NHGFLPNEFTVCSILKACSEEKAIRFGRQVHSLVVKRMIKTDVFVGTSLMDMYAKCGEIS 336

Query: 1911 EAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVLSTVLTECG 2090
            +  +VFD M  +N V+  ++I A+A+ G   EA+ L   M+   +  ++  + ++L  CG
Sbjct: 337  DCRKVFDGMSNRNTVTWTSIIAAHAREGFGEEAISLFRVMKRRHLIANNLTVVSILRACG 396

Query: 2091 DFERDKSTKELAFHLV 2138
                    KEL   ++
Sbjct: 397  SVGALLLGKELHAQII 412


>ref|NP_193587.4| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|122236144|sp|Q0WNP3.1|PP319_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g18520 gi|110738662|dbj|BAF01256.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332658657|gb|AEE84057.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 617

 Score =  621 bits (1601), Expect = e-175
 Identities = 310/542 (57%), Positives = 399/542 (73%), Gaps = 1/542 (0%)
 Frame = +3

Query: 504  NHKKAEIFDPSSVSNGSTLLAYLLQSCTDKEEVRIIHAIVLKISKESNIFVDNNLISGYA 683
            N   +  FD   V     LLA  LQS      ++ IHA+ LK   +  I+  NNLIS   
Sbjct: 71   NQDDSSSFDSERVDYA--LLAEWLQSSNGMRLIKRIHAMALKCFDDQVIYFGNNLISSCV 128

Query: 684  RYGGIVEARKVFDEMLERNVVSWTAMLNAYLRFGYDNEAIRHFINFVESGIPWSS-KTFV 860
            R G +V ARKVFD M E+N V+WTAM++ YL++G ++EA   F ++V+ GI +++ + FV
Sbjct: 129  RLGDLVYARKVFDSMPEKNTVTWTAMIDGYLKYGLEDEAFALFEDYVKHGIRFTNERMFV 188

Query: 861  CMLNMCSKILDLELGKQVHTCMVKGGFSGLILNSALLYFYVQCGDLENAVSVFDHIKERD 1040
            C+LN+CS+  + ELG+QVH  MVK G   LI+ S+L+YFY QCG+L +A+  FD ++E+D
Sbjct: 189  CLLNLCSRRAEFELGRQVHGNMVKVGVGNLIVESSLVYFYAQCGELTSALRAFDMMEEKD 248

Query: 1041 VICWTIMISAYSQHRRGEEAFKTFSRMLSDRVEPNEFTICTVLDACGEEKELKLGRQLHG 1220
            VI WT +ISA S+   G +A   F  ML+    PNEFT+C++L AC EEK L+ GRQ+H 
Sbjct: 249  VISWTAVISACSRKGHGIKAIGMFIGMLNHWFLPNEFTVCSILKACSEEKALRFGRQVHS 308

Query: 1221 ATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARNGFGEE 1400
              +K M   DVF+GT+L+DMYAKCGE  D R VFD M NRNTVTWT+IIA +AR GFGEE
Sbjct: 309  LVVKRMIKTDVFVGTSLMDMYAKCGEISDCRKVFDGMSNRNTVTWTSIIAAHAREGFGEE 368

Query: 1401 AINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIGSALVW 1580
            AI+LFR+M+RR ++ANNLT+VSILRACG +GAL  GKE+HAQI KN  + N++IGS LVW
Sbjct: 369  AISLFRIMKRRHLIANNLTVVSILRACGSVGALLLGKELHAQIIKNSIEKNVYIGSTLVW 428

Query: 1581 LYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVSPNPFT 1760
            LYC+CG+   A  VL+ +P RDVVSWTAMISGC++LGHE EAL++LKEM+ EGV PNPFT
Sbjct: 429  LYCKCGESRDAFNVLQQLPSRDVVSWTAMISGCSSLGHESEALDFLKEMIQEGVEPNPFT 488

Query: 1761 YSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHLSEAIRVFDNMP 1940
            YSSALKACA  E +  G+ IHS   K  A S+VFVGSALIHMYAKCG +SEA RVFD+MP
Sbjct: 489  YSSALKACANSESLLIGRSIHSIAKKNHALSNVFVGSALIHMYAKCGFVSEAFRVFDSMP 548

Query: 1941 EQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVLSTVLTECGDFERDKSTKE 2120
            E+NLVS KAMI+ YA+NG CREALKL+YRM+AEG E+DDY+ +T+L+ CGD E D++ + 
Sbjct: 549  EKNLVSWKAMIMGYARNGFCREALKLMYRMEAEGFEVDDYIFATILSTCGDIELDEAVES 608

Query: 2121 LA 2126
             A
Sbjct: 609  SA 610



 Score =  189 bits (480), Expect = 5e-45
 Identities = 104/316 (32%), Positives = 172/316 (54%), Gaps = 1/316 (0%)
 Frame = +3

Query: 1194 LKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAG 1373
            ++L +++H   +K    + ++ G  L+    + G+   +R VFD M  +NTVTWT +I G
Sbjct: 98   MRLIKRIHAMALKCFDDQVIYFGNNLISSCVRLGDLVYARKVFDSMPEKNTVTWTAMIDG 157

Query: 1374 YARNGFGEEAINLFRVMQRRKILANNLTM-VSILRACGLLGALPTGKEVHAQIFKNYAQS 1550
            Y + G  +EA  LF    +  I   N  M V +L  C        G++VH  + K     
Sbjct: 158  YLKYGLEDEAFALFEDYVKHGIRFTNERMFVCLLNLCSRRAEFELGRQVHGNMVK-VGVG 216

Query: 1551 NIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEML 1730
            N+ + S+LV+ Y +CG+ ++A +  + M  +DV+SWTA+IS C+  GH  +A+     ML
Sbjct: 217  NLIVESSLVYFYAQCGELTSALRAFDMMEEKDVISWTAVISACSRKGHGIKAIGMFIGML 276

Query: 1731 GEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHLS 1910
                 PN FT  S LKAC++ + +R G+ +HS + K    + VFVG++L+ MYAKCG +S
Sbjct: 277  NHWFLPNEFTVCSILKACSEEKALRFGRQVHSLVVKRMIKTDVFVGTSLMDMYAKCGEIS 336

Query: 1911 EAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVLSTVLTECG 2090
            +  +VFD M  +N V+  ++I A+A+ G   EA+ L   M+   +  ++  + ++L  CG
Sbjct: 337  DCRKVFDGMSNRNTVTWTSIIAAHAREGFGEEAISLFRIMKRRHLIANNLTVVSILRACG 396

Query: 2091 DFERDKSTKELAFHLV 2138
                    KEL   ++
Sbjct: 397  SVGALLLGKELHAQII 412


>ref|XP_006283342.1| hypothetical protein CARUB_v10004383mg [Capsella rubella]
            gi|482552047|gb|EOA16240.1| hypothetical protein
            CARUB_v10004383mg [Capsella rubella]
          Length = 621

 Score =  620 bits (1598), Expect = e-174
 Identities = 304/518 (58%), Positives = 392/518 (75%), Gaps = 1/518 (0%)
 Frame = +3

Query: 558  LLAYLLQSCTDKEEVRIIHAIVLKISKESNIFVDNNLISGYARYGGIVEARKVFDEMLER 737
            LLA  LQS      ++ IHA+ LK   +  I+  NNLIS   R G +V ARKVFD M +R
Sbjct: 92   LLAEWLQSSNGMRLIKRIHAMALKCFDDQVIYFGNNLISSCVRLGDLVYARKVFDSMPDR 151

Query: 738  NVVSWTAMLNAYLRFGYDNEAIRHFINFVESGIPWSS-KTFVCMLNMCSKILDLELGKQV 914
            N V+WTAM++ YL++G ++EA   F ++V+ GI +++ + FVC+LN+CS+  + ELG+QV
Sbjct: 152  NTVTWTAMIDGYLKYGLEDEAFALFEDYVKHGIRFTNQRMFVCLLNLCSRRSEFELGRQV 211

Query: 915  HTCMVKGGFSGLILNSALLYFYVQCGDLENAVSVFDHIKERDVICWTIMISAYSQHRRGE 1094
            H  MVK G   LI+ S+L+YFY QCG+L +A+  FD ++E+DVI WT +ISA S+   G 
Sbjct: 212  HGNMVKVGVENLIVESSLVYFYAQCGELTSALRAFDMMEEKDVISWTAVISACSRKGHGN 271

Query: 1095 EAFKTFSRMLSDRVEPNEFTICTVLDACGEEKELKLGRQLHGATIKTMYIKDVFIGTALV 1274
            +A   F  ML+    PNEFT+C++L AC EEK ++ G+Q+H   +K M   DVF+GT+L+
Sbjct: 272  KAIVMFMGMLNHGFLPNEFTVCSILKACSEEKAIRFGKQVHSLIVKRMIKTDVFVGTSLM 331

Query: 1275 DMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARNGFGEEAINLFRVMQRRKILANNL 1454
            DMYAKCGE  D R VFD M NRNTVTWT+IIA +AR GFGE+AI+LFRVM+RR ++ANNL
Sbjct: 332  DMYAKCGEISDCRKVFDGMSNRNTVTWTSIIAAHAREGFGEDAISLFRVMKRRHLIANNL 391

Query: 1455 TMVSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYM 1634
            T+VSILRACG +GAL  GKE+HAQI KN  + N++IGS LVWLYC+CG+   AS VL+ +
Sbjct: 392  TVVSILRACGSVGALLLGKELHAQIIKNSIEKNVYIGSTLVWLYCKCGESRYASNVLQQL 451

Query: 1635 PLRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGK 1814
            P RDVVSWTAMISGC++LGHE EAL++LKEM+ EGV PNPFTYSSALKACA  E +  G+
Sbjct: 452  PSRDVVSWTAMISGCSSLGHESEALDFLKEMIQEGVEPNPFTYSSALKACANSESVLIGR 511

Query: 1815 LIHSSIHKTPASSSVFVGSALIHMYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNG 1994
             IHS   K  A S+VFVGSALIHMYAKCG +SEA RVFD+MPE+NLVS KAMI+ YA+NG
Sbjct: 512  SIHSIAKKNHALSNVFVGSALIHMYAKCGFVSEAFRVFDSMPEKNLVSWKAMIMGYARNG 571

Query: 1995 LCREALKLVYRMQAEGIELDDYVLSTVLTECGDFERDK 2108
             CREALKL+YRM+AEG E+DDY+ +T+L+ CGD E D+
Sbjct: 572  FCREALKLMYRMEAEGFEVDDYIFATILSTCGDIELDE 609



 Score =  197 bits (502), Expect = 1e-47
 Identities = 108/316 (34%), Positives = 175/316 (55%), Gaps = 1/316 (0%)
 Frame = +3

Query: 1194 LKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAG 1373
            ++L +++H   +K    + ++ G  L+    + G+   +R VFD M +RNTVTWT +I G
Sbjct: 103  MRLIKRIHAMALKCFDDQVIYFGNNLISSCVRLGDLVYARKVFDSMPDRNTVTWTAMIDG 162

Query: 1374 YARNGFGEEAINLFRVMQRRKILANNLTM-VSILRACGLLGALPTGKEVHAQIFKNYAQS 1550
            Y + G  +EA  LF    +  I   N  M V +L  C        G++VH  + K     
Sbjct: 163  YLKYGLEDEAFALFEDYVKHGIRFTNQRMFVCLLNLCSRRSEFELGRQVHGNMVK-VGVE 221

Query: 1551 NIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEML 1730
            N+ + S+LV+ Y +CG+ ++A +  + M  +DV+SWTA+IS C+  GH ++A+     ML
Sbjct: 222  NLIVESSLVYFYAQCGELTSALRAFDMMEEKDVISWTAVISACSRKGHGNKAIVMFMGML 281

Query: 1731 GEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHLS 1910
              G  PN FT  S LKAC++ + IR GK +HS I K    + VFVG++L+ MYAKCG +S
Sbjct: 282  NHGFLPNEFTVCSILKACSEEKAIRFGKQVHSLIVKRMIKTDVFVGTSLMDMYAKCGEIS 341

Query: 1911 EAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVLSTVLTECG 2090
            +  +VFD M  +N V+  ++I A+A+ G   +A+ L   M+   +  ++  + ++L  CG
Sbjct: 342  DCRKVFDGMSNRNTVTWTSIIAAHAREGFGEDAISLFRVMKRRHLIANNLTVVSILRACG 401

Query: 2091 DFERDKSTKELAFHLV 2138
                    KEL   ++
Sbjct: 402  SVGALLLGKELHAQII 417


>ref|XP_002307403.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222856852|gb|EEE94399.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 472

 Score =  616 bits (1589), Expect = e-173
 Identities = 299/460 (65%), Positives = 372/460 (80%)
 Frame = +3

Query: 726  MLERNVVSWTAMLNAYLRFGYDNEAIRHFINFVESGIPWSSKTFVCMLNMCSKILDLELG 905
            M ERNVVSWTAM+N Y +FG D+EA+ +F   ++ G+  +SKTFVC+LN+CS+ LD ELG
Sbjct: 1    MPERNVVSWTAMINGYFKFGLDDEALSYFSQAIKDGVVPNSKTFVCVLNLCSRRLDFELG 60

Query: 906  KQVHTCMVKGGFSGLILNSALLYFYVQCGDLENAVSVFDHIKERDVICWTIMISAYSQHR 1085
            +QVH  +VKG +  LI++SA++YFYVQCGDL++A  VFD + ERDV+ WT MI+A SQ  
Sbjct: 61   RQVHARVVKGNWRNLIVDSAVVYFYVQCGDLKSAFCVFDRMVERDVVSWTTMITACSQQG 120

Query: 1086 RGEEAFKTFSRMLSDRVEPNEFTICTVLDACGEEKELKLGRQLHGATIKTMYIKDVFIGT 1265
            R  EAF  F++ML+    PN FT   +L ACGEEK LK G+Q+HGA +K MY  DVF+GT
Sbjct: 121  RCGEAFGMFTQMLNGGFLPNGFTASGILKACGEEKALKFGKQIHGAIVKKMYKDDVFVGT 180

Query: 1266 ALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARNGFGEEAINLFRVMQRRKILA 1445
            +LVDMYAKCGE  DS  VF+ MR RNTVTWT+IIAGYAR G GEEAI LFR+M RR++++
Sbjct: 181  SLVDMYAKCGEVSDSSKVFNGMRRRNTVTWTSIIAGYARKGLGEEAICLFRLMMRRRVVS 240

Query: 1446 NNLTMVSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIGSALVWLYCRCGDYSAASKVL 1625
            NNLT+VS+LRACGL+GAL  G+EVHAQI KN +QSN ++GS LVW YC+CG+   ASKVL
Sbjct: 241  NNLTIVSMLRACGLIGALLAGREVHAQIIKNCSQSNEYLGSTLVWFYCKCGESRTASKVL 300

Query: 1626 EYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIR 1805
            + MP RDVVSWTA+ISG   LGHE EALE+LKEM+ EGV PN FTYSSALKACA LE + 
Sbjct: 301  QQMPFRDVVSWTAIISGHACLGHESEALEFLKEMMEEGVEPNSFTYSSALKACANLETVL 360

Query: 1806 QGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYA 1985
            QGKLIHSS +KTPASS+VFVGSALIHMYA+CG++SEAI+VFD+MPE+NLV+ +AMI+ Y 
Sbjct: 361  QGKLIHSSANKTPASSNVFVGSALIHMYARCGYVSEAIQVFDSMPERNLVTWRAMIMGYV 420

Query: 1986 KNGLCREALKLVYRMQAEGIELDDYVLSTVLTECGDFERD 2105
            +NGLC+EALKL+YRMQAEGI++DDY+ + VL  CG+ E D
Sbjct: 421  RNGLCQEALKLMYRMQAEGIQVDDYISAKVLGACGEIEWD 460



 Score =  242 bits (617), Expect = 6e-61
 Identities = 145/423 (34%), Positives = 227/423 (53%), Gaps = 4/423 (0%)
 Frame = +3

Query: 540  VSNGSTLLAYLLQSCT---DKEEVRIIHAIVLKISKESNIFVDNNLISGYARYGGIVEAR 710
            V N  T +  +L  C+   D E  R +HA V+K     N+ VD+ ++  Y + G +  A 
Sbjct: 38   VPNSKTFVC-VLNLCSRRLDFELGRQVHARVVK-GNWRNLIVDSAVVYFYVQCGDLKSAF 95

Query: 711  KVFDEMLERNVVSWTAMLNAYLRFGYDNEAIRHFINFVESGIPWSSKTFVCMLNMCSKIL 890
             VFD M+ER+VVSWT M+ A  + G   EA   F   +  G   +  T   +L  C +  
Sbjct: 96   CVFDRMVERDVVSWTTMITACSQQGRCGEAFGMFTQMLNGGFLPNGFTASGILKACGEEK 155

Query: 891  DLELGKQVHTCMVKGGF-SGLILNSALLYFYVQCGDLENAVSVFDHIKERDVICWTIMIS 1067
             L+ GKQ+H  +VK  +   + + ++L+  Y +CG++ ++  VF+ ++ R+ + WT +I+
Sbjct: 156  ALKFGKQIHGAIVKKMYKDDVFVGTSLVDMYAKCGEVSDSSKVFNGMRRRNTVTWTSIIA 215

Query: 1068 AYSQHRRGEEAFKTFSRMLSDRVEPNEFTICTVLDACGEEKELKLGRQLHGATIKTMYIK 1247
             Y++   GEEA   F  M+  RV  N  TI ++L ACG    L  GR++H   IK     
Sbjct: 216  GYARKGLGEEAICLFRLMMRRRVVSNNLTIVSMLRACGLIGALLAGREVHAQIIKNCSQS 275

Query: 1248 DVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARNGFGEEAINLFRVMQ 1427
            + ++G+ LV  Y KCGE+  +  V   M  R+ V+WT II+G+A  G   EA+   + M 
Sbjct: 276  NEYLGSTLVWFYCKCGESRTASKVLQQMPFRDVVSWTAIISGHACLGHESEALEFLKEMM 335

Query: 1428 RRKILANNLTMVSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIGSALVWLYCRCGDYS 1607
               +  N+ T  S L+AC  L  +  GK +H+   K  A SN+ +GSAL+ +Y RCG  S
Sbjct: 336  EEGVEPNSFTYSSALKACANLETVLQGKLIHSSANKTPASSNVFVGSALIHMYARCGYVS 395

Query: 1608 AASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVSPNPFTYSSALKACA 1787
             A +V + MP R++V+W AMI G    G   EAL+ +  M  EG+  + +  +  L AC 
Sbjct: 396  EAIQVFDSMPERNLVTWRAMIMGYVRNGLCQEALKLMYRMQAEGIQVDDYISAKVLGACG 455

Query: 1788 QLE 1796
            ++E
Sbjct: 456  EIE 458



 Score =  104 bits (259), Expect = 2e-19
 Identities = 58/164 (35%), Positives = 91/164 (55%)
 Frame = +3

Query: 1632 MPLRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQG 1811
            MP R+VVSWTAMI+G    G + EAL Y  + + +GV PN  T+   L  C++  D   G
Sbjct: 1    MPERNVVSWTAMINGYFKFGLDDEALSYFSQAIKDGVVPNSKTFVCVLNLCSRRLDFELG 60

Query: 1812 KLIHSSIHKTPASSSVFVGSALIHMYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKN 1991
            + +H+ + K    + + V SA+++ Y +CG L  A  VFD M E+++VS   MI A ++ 
Sbjct: 61   RQVHARVVKGNWRNLI-VDSAVVYFYVQCGDLKSAFCVFDRMVERDVVSWTTMITACSQQ 119

Query: 1992 GLCREALKLVYRMQAEGIELDDYVLSTVLTECGDFERDKSTKEL 2123
            G C EA  +  +M   G   + +  S +L  CG+ +  K  K++
Sbjct: 120  GRCGEAFGMFTQMLNGGFLPNGFTASGILKACGEEKALKFGKQI 163



 Score = 68.6 bits (166), Expect = 1e-08
 Identities = 42/121 (34%), Positives = 63/121 (52%), Gaps = 3/121 (2%)
 Frame = +3

Query: 573 LQSCTDKEEV---RIIHAIVLKISKESNIFVDNNLISGYARYGGIVEARKVFDEMLERNV 743
           L++C + E V   ++IH+   K    SN+FV + LI  YAR G + EA +VFD M ERN+
Sbjct: 350 LKACANLETVLQGKLIHSSANKTPASSNVFVGSALIHMYARCGYVSEAIQVFDSMPERNL 409

Query: 744 VSWTAMLNAYLRFGYDNEAIRHFINFVESGIPWSSKTFVCMLNMCSKILDLELGKQVHTC 923
           V+W AM+  Y+R G   EA++        GI         +L  C +I + + G     C
Sbjct: 410 VTWRAMIMGYVRNGLCQEALKLMYRMQAEGIQVDDYISAKVLGACGEI-EWDAGHSSEYC 468

Query: 924 M 926
           +
Sbjct: 469 L 469


>ref|XP_004505335.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18520-like
            [Cicer arietinum]
          Length = 787

 Score =  611 bits (1576), Expect = e-172
 Identities = 303/535 (56%), Positives = 393/535 (73%), Gaps = 1/535 (0%)
 Frame = +3

Query: 498  IDN-HKKAEIFDPSSVSNGSTLLAYLLQSCTDKEEVRIIHAIVLKISKESNIFVDNNLIS 674
            +DN  +K +  DP SV       A+ L+ C D EEV  +H IVLK  ++SN +V+NNLI 
Sbjct: 242  VDNIAEKGQCLDPDSV-------AHWLRLCNDVEEVGRVHTIVLKRFRDSNTYVNNNLIC 294

Query: 675  GYARYGGIVEARKVFDEMLERNVVSWTAMLNAYLRFGYDNEAIRHFINFVESGIPWSSKT 854
             Y R G + +ARKVFD M  R+ V+WTA+++ YL+F  D+EA + F   V+ G+  +SK 
Sbjct: 295  SYLRLGKLAQARKVFDGMPRRDTVTWTAIIDGYLKFNLDDEAFKLFHGSVKHGVQPNSKM 354

Query: 855  FVCMLNMCSKILDLELGKQVHTCMVKGGFSGLILNSALLYFYVQCGDLENAVSVFDHIKE 1034
            FVC +N+C K +DL LGKQ+H  ++K  +  LI++SA++ FY +CG + +A   FD + +
Sbjct: 355  FVCFMNLCCKRVDLALGKQIHARILKSNWRNLIVDSAVVNFYSKCGKILSAFRTFDRMAK 414

Query: 1035 RDVICWTIMISAYSQHRRGEEAFKTFSRMLSDRVEPNEFTICTVLDACGEEKELKLGRQL 1214
            RDVICWT MI+A SQ   G EA    S+ML D   PNE+TIC  L ACGE K LK G QL
Sbjct: 415  RDVICWTTMITACSQQGLGHEALLMLSQMLGDGFFPNEYTICAALKACGENKALKFGTQL 474

Query: 1215 HGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARNGFG 1394
            HGA IK +   DVFIGT+L+DMYAKCGE  +S+ VFD MR RNT TWT+II+GYARNGFG
Sbjct: 475  HGAVIKKICKSDVFIGTSLIDMYAKCGEIANSKNVFDRMRVRNTATWTSIISGYARNGFG 534

Query: 1395 EEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIGSAL 1574
            EEA+N FR+M+R+K+  N LT+V ++ ACG + A   G+EVHAQ  K+   +N++I S L
Sbjct: 535  EEAVNFFRLMKRKKVYVNKLTLVCVMMACGTMKASLIGREVHAQKIKSLIHTNMYIESTL 594

Query: 1575 VWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVSPNP 1754
            VW YCRC +YS A  VL++MP RDVVSWTA+ISGC  LG E EALE+L+EM+ EGVSPN 
Sbjct: 595  VWFYCRCKEYSKAFNVLKHMPFRDVVSWTAIISGCARLGLEAEALEFLREMMEEGVSPNS 654

Query: 1755 FTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHLSEAIRVFDN 1934
            +TYSSALKACA+LE   QGKLIHS+  KTPA S+VFV SALI+MYAKCG++++A +VFDN
Sbjct: 655  YTYSSALKACAKLEAPMQGKLIHSNASKTPALSNVFVNSALIYMYAKCGYIADAFQVFDN 714

Query: 1935 MPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVLSTVLTECGDFE 2099
            MPE+NLVS KAMI+ YA+NG CR+ALKL+YRM+AEG  +DDY+L+TVLT CG  +
Sbjct: 715  MPERNLVSWKAMILGYARNGHCRKALKLMYRMRAEGFVVDDYILATVLTACGGID 769



 Score =  193 bits (490), Expect = 3e-46
 Identities = 107/319 (33%), Positives = 173/319 (54%)
 Frame = +3

Query: 1167 LDACGEEKELKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNT 1346
            L  C + +E+  GR +H   +K     + ++   L+  Y + G+   +R VFD M  R+T
Sbjct: 261  LRLCNDVEEV--GR-VHTIVLKRFRDSNTYVNNNLICSYLRLGKLAQARKVFDGMPRRDT 317

Query: 1347 VTWTTIIAGYARNGFGEEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHAQ 1526
            VTWT II GY +    +EA  LF    +  +  N+   V  +  C     L  GK++HA+
Sbjct: 318  VTWTAIIDGYLKFNLDDEAFKLFHGSVKHGVQPNSKMFVCFMNLCCKRVDLALGKQIHAR 377

Query: 1527 IFKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEA 1706
            I K+  + N+ + SA+V  Y +CG   +A +  + M  RDV+ WT MI+ C+  G  HEA
Sbjct: 378  ILKSNWR-NLIVDSAVVNFYSKCGKILSAFRTFDRMAKRDVICWTTMITACSQQGLGHEA 436

Query: 1707 LEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHM 1886
            L  L +MLG+G  PN +T  +ALKAC + + ++ G  +H ++ K    S VF+G++LI M
Sbjct: 437  LLMLSQMLGDGFFPNEYTICAALKACGENKALKFGTQLHGAVIKKICKSDVFIGTSLIDM 496

Query: 1887 YAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVL 2066
            YAKCG ++ +  VFD M  +N  +  ++I  YA+NG   EA+     M+ + + ++   L
Sbjct: 497  YAKCGEIANSKNVFDRMRVRNTATWTSIISGYARNGFGEEAVNFFRLMKRKKVYVNKLTL 556

Query: 2067 STVLTECGDFERDKSTKEL 2123
              V+  CG  +     +E+
Sbjct: 557  VCVMMACGTMKASLIGREV 575


>ref|XP_006605526.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18520-like
            isoform X2 [Glycine max]
          Length = 817

 Score =  610 bits (1574), Expect = e-172
 Identities = 301/535 (56%), Positives = 401/535 (74%), Gaps = 1/535 (0%)
 Frame = +3

Query: 498  IDN-HKKAEIFDPSSVSNGSTLLAYLLQSCTDKEEVRIIHAIVLKISKESNIFVDNNLIS 674
            IDN  +K++ F+P        L+A+ L+ C + EEV  +H IVLK       +VDNNLI 
Sbjct: 273  IDNLAEKSQCFNPE-------LVAHWLRLCYNMEEVGRVHTIVLKFFIHPVTYVDNNLIC 325

Query: 675  GYARYGGIVEARKVFDEMLERNVVSWTAMLNAYLRFGYDNEAIRHFINFVESGIPWSSKT 854
             Y R G + +AR+VFD M  +N V+WTA+++ YL+F  D+EA + F + V+ G+P +SK 
Sbjct: 326  SYLRLGKLAQARRVFDGMSRKNTVTWTAIIDGYLKFNLDDEAFKLFQDCVKHGVPANSKM 385

Query: 855  FVCMLNMCSKILDLELGKQVHTCMVKGGFSGLILNSALLYFYVQCGDLENAVSVFDHIKE 1034
            FVC++N+C + +DLELGKQ+H  ++K  +  LI+++A+++FY +CG++ +A   FD + E
Sbjct: 386  FVCIMNLCGRRVDLELGKQIHARILKSRWRNLIVDNAVVHFYAKCGNISSAFRAFDCMAE 445

Query: 1035 RDVICWTIMISAYSQHRRGEEAFKTFSRMLSDRVEPNEFTICTVLDACGEEKELKLGRQL 1214
            RDVICWT MI+A SQ   G EA    S+MLSD   PNE+TIC+ L ACGE K LK G QL
Sbjct: 446  RDVICWTTMITACSQQGFGHEALSMLSQMLSDGFYPNEYTICSALKACGENKALKFGTQL 505

Query: 1215 HGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARNGFG 1394
            HGA IK +   DVFIGT+LVDMYAKCG   DS++VFD MR RNT TWT+II+GYARNGFG
Sbjct: 506  HGAIIKKICKSDVFIGTSLVDMYAKCGVMVDSKVVFDRMRIRNTATWTSIISGYARNGFG 565

Query: 1395 EEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIGSAL 1574
            EEA + FR+M+ ++I  N LT++S+L ACG + +L  G+EVHAQI K+   +NI++GS L
Sbjct: 566  EEATSFFRLMKMKRIHVNKLTVLSVLMACGTIKSLLFGREVHAQIIKSNIHTNIYVGSTL 625

Query: 1575 VWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVSPNP 1754
            VW YC+C +YS A KVL+YMP RDVVSWTA+ISGC  LG EHEALE+L+EM+ EGV PN 
Sbjct: 626  VWFYCKCKEYSYAFKVLQYMPFRDVVSWTAIISGCARLGLEHEALEFLQEMMEEGVLPNS 685

Query: 1755 FTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHLSEAIRVFDN 1934
            +TYSSALKACA+LE   QGKLIHS   KTPASS+VFV SALI+MY+KCG++++A +VFDN
Sbjct: 686  YTYSSALKACAELEAPIQGKLIHSYASKTPASSNVFVNSALIYMYSKCGYVADAFQVFDN 745

Query: 1935 MPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVLSTVLTECGDFE 2099
            MPE+N+VS ++MI+AYA+NG  REALKL++RMQAEG  +DDY+ +TV++ CG  E
Sbjct: 746  MPERNVVSWESMILAYARNGHAREALKLMHRMQAEGFVVDDYIHTTVISACGGVE 800


>ref|XP_006605525.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18520-like
            isoform X1 [Glycine max]
          Length = 833

 Score =  610 bits (1574), Expect = e-172
 Identities = 301/535 (56%), Positives = 401/535 (74%), Gaps = 1/535 (0%)
 Frame = +3

Query: 498  IDN-HKKAEIFDPSSVSNGSTLLAYLLQSCTDKEEVRIIHAIVLKISKESNIFVDNNLIS 674
            IDN  +K++ F+P        L+A+ L+ C + EEV  +H IVLK       +VDNNLI 
Sbjct: 289  IDNLAEKSQCFNPE-------LVAHWLRLCYNMEEVGRVHTIVLKFFIHPVTYVDNNLIC 341

Query: 675  GYARYGGIVEARKVFDEMLERNVVSWTAMLNAYLRFGYDNEAIRHFINFVESGIPWSSKT 854
             Y R G + +AR+VFD M  +N V+WTA+++ YL+F  D+EA + F + V+ G+P +SK 
Sbjct: 342  SYLRLGKLAQARRVFDGMSRKNTVTWTAIIDGYLKFNLDDEAFKLFQDCVKHGVPANSKM 401

Query: 855  FVCMLNMCSKILDLELGKQVHTCMVKGGFSGLILNSALLYFYVQCGDLENAVSVFDHIKE 1034
            FVC++N+C + +DLELGKQ+H  ++K  +  LI+++A+++FY +CG++ +A   FD + E
Sbjct: 402  FVCIMNLCGRRVDLELGKQIHARILKSRWRNLIVDNAVVHFYAKCGNISSAFRAFDCMAE 461

Query: 1035 RDVICWTIMISAYSQHRRGEEAFKTFSRMLSDRVEPNEFTICTVLDACGEEKELKLGRQL 1214
            RDVICWT MI+A SQ   G EA    S+MLSD   PNE+TIC+ L ACGE K LK G QL
Sbjct: 462  RDVICWTTMITACSQQGFGHEALSMLSQMLSDGFYPNEYTICSALKACGENKALKFGTQL 521

Query: 1215 HGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARNGFG 1394
            HGA IK +   DVFIGT+LVDMYAKCG   DS++VFD MR RNT TWT+II+GYARNGFG
Sbjct: 522  HGAIIKKICKSDVFIGTSLVDMYAKCGVMVDSKVVFDRMRIRNTATWTSIISGYARNGFG 581

Query: 1395 EEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIGSAL 1574
            EEA + FR+M+ ++I  N LT++S+L ACG + +L  G+EVHAQI K+   +NI++GS L
Sbjct: 582  EEATSFFRLMKMKRIHVNKLTVLSVLMACGTIKSLLFGREVHAQIIKSNIHTNIYVGSTL 641

Query: 1575 VWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVSPNP 1754
            VW YC+C +YS A KVL+YMP RDVVSWTA+ISGC  LG EHEALE+L+EM+ EGV PN 
Sbjct: 642  VWFYCKCKEYSYAFKVLQYMPFRDVVSWTAIISGCARLGLEHEALEFLQEMMEEGVLPNS 701

Query: 1755 FTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHLSEAIRVFDN 1934
            +TYSSALKACA+LE   QGKLIHS   KTPASS+VFV SALI+MY+KCG++++A +VFDN
Sbjct: 702  YTYSSALKACAELEAPIQGKLIHSYASKTPASSNVFVNSALIYMYSKCGYVADAFQVFDN 761

Query: 1935 MPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVLSTVLTECGDFE 2099
            MPE+N+VS ++MI+AYA+NG  REALKL++RMQAEG  +DDY+ +TV++ CG  E
Sbjct: 762  MPERNVVSWESMILAYARNGHAREALKLMHRMQAEGFVVDDYIHTTVISACGGVE 816


>gb|ESW06749.1| hypothetical protein PHAVU_010G073100g [Phaseolus vulgaris]
          Length = 814

 Score =  597 bits (1539), Expect = e-168
 Identities = 299/535 (55%), Positives = 398/535 (74%), Gaps = 1/535 (0%)
 Frame = +3

Query: 498  IDN-HKKAEIFDPSSVSNGSTLLAYLLQSCTDKEEVRIIHAIVLKISKESNIFVDNNLIS 674
            IDN  +K + F+P        L+A+ LQ C + EEV  +HAIVLK  + SN +VDNNLI 
Sbjct: 270  IDNIGEKNQCFNPE-------LVAHWLQLCYNVEEVGRVHAIVLKCFRHSNTYVDNNLIC 322

Query: 675  GYARYGGIVEARKVFDEMLERNVVSWTAMLNAYLRFGYDNEAIRHFINFVESGIPWSSKT 854
             Y R   +V AR+VFD M  +N V+WTA+++ YL+   D+EA + F + V+ G+P +SK 
Sbjct: 323  SYLRLVELVRARRVFDGMPRKNTVTWTAIIDGYLKCNLDDEAFKLFQDSVKHGVPANSKM 382

Query: 855  FVCMLNMCSKILDLELGKQVHTCMVKGGFSGLILNSALLYFYVQCGDLENAVSVFDHIKE 1034
            FVC++N+C K ++L+LGKQ+H  ++K  +  LI+++A+++FY QCGD+ +A   FD + E
Sbjct: 383  FVCIMNLCGKRVNLKLGKQIHARILKSRWRNLIVDNAVVHFYAQCGDISSAFRAFDCMAE 442

Query: 1035 RDVICWTIMISAYSQHRRGEEAFKTFSRMLSDRVEPNEFTICTVLDACGEEKELKLGRQL 1214
            RDVICWT MI+A SQ   G EA    S+ML +   PNE+TIC+ L ACG+ K LK G QL
Sbjct: 443  RDVICWTTMITACSQQGFGYEAMLLLSQMLGEGFFPNEYTICSALKACGKNKALKFGTQL 502

Query: 1215 HGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARNGFG 1394
            HGA IK +   DVFIGT+LVDMYAKCG  +DS+ VFD MR RNT TWT II+GYARNGFG
Sbjct: 503  HGAIIKNICKSDVFIGTSLVDMYAKCGLMKDSKDVFDRMRIRNTATWTCIISGYARNGFG 562

Query: 1395 EEAINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIGSAL 1574
            +EA+NLFR M+ +++  N LT++S+L ACG + AL  G+EVHAQI K    +N++IGS L
Sbjct: 563  KEAVNLFRSMESKRMHVNKLTVLSVLMACGTIKALLIGREVHAQIIKRIIHTNMYIGSTL 622

Query: 1575 VWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVSPNP 1754
            VW YC+C +YS A KVL++MP RDVVSWTA+ISGC  LG E EALE+L+EM+ +GV PN 
Sbjct: 623  VWFYCKCKEYSYAFKVLQHMPFRDVVSWTAIISGCARLGLELEALEFLQEMMEDGVLPNS 682

Query: 1755 FTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHLSEAIRVFDN 1934
            +TYSSALKACA+LE    GKLIHS   K+PAS++VFV SALI+MY+KCG++++A +VFDN
Sbjct: 683  YTYSSALKACAELEAPMLGKLIHSYASKSPASANVFVNSALIYMYSKCGYVADAFQVFDN 742

Query: 1935 MPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVLSTVLTECGDFE 2099
            MPE+NLVS ++MI+AYA NG  REALKLV+RMQAEG+ +DDY+ +TV++ CG  E
Sbjct: 743  MPERNLVSWESMILAYAWNGHAREALKLVHRMQAEGLVVDDYIHTTVVSACGGVE 797


>gb|ESW29807.1| hypothetical protein PHAVU_002G100300g [Phaseolus vulgaris]
          Length = 815

 Score =  595 bits (1534), Expect = e-167
 Identities = 298/533 (55%), Positives = 396/533 (74%)
 Frame = +3

Query: 501  DNHKKAEIFDPSSVSNGSTLLAYLLQSCTDKEEVRIIHAIVLKISKESNIFVDNNLISGY 680
            D  +K + F+P        L+A+ LQ C + EEVR +HAIVLK  + SN +VDNNLI  Y
Sbjct: 272  DLGEKNQSFNPE-------LVAHWLQLCYNVEEVRRVHAIVLKCFRHSNTYVDNNLICSY 324

Query: 681  ARYGGIVEARKVFDEMLERNVVSWTAMLNAYLRFGYDNEAIRHFINFVESGIPWSSKTFV 860
             R   +V AR+VFD M ++N V+WTA+++ YL+   D+EA + F + V+ G+P +SK FV
Sbjct: 325  LRLVELVRARRVFDGMPKKNTVTWTAIIDGYLKCNLDDEAFKLFQDSVKHGVPANSKMFV 384

Query: 861  CMLNMCSKILDLELGKQVHTCMVKGGFSGLILNSALLYFYVQCGDLENAVSVFDHIKERD 1040
            C++N+C K ++L+LGKQ+H  ++K  +  LI+++A+++FY +CG++ +A   FD + ERD
Sbjct: 385  CIMNLCGKRVNLKLGKQIHARILKSRWRNLIVDNAVVHFYAKCGEISSAFRAFDCMAERD 444

Query: 1041 VICWTIMISAYSQHRRGEEAFKTFSRMLSDRVEPNEFTICTVLDACGEEKELKLGRQLHG 1220
            VICWT MI+A SQ   G EA    S+ML +   PNE+TIC+ L ACG+ K LK G QLH 
Sbjct: 445  VICWTTMITACSQQGFGYEAMLMLSQMLGEGFFPNEYTICSALKACGKNKALKFGTQLHC 504

Query: 1221 ATIKTMYIKDVFIGTALVDMYAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARNGFGEE 1400
            A IK +   DVFIGT+LVDMYAKCG   DS+ VFD MR RNT TWT II+GYARNGFG+E
Sbjct: 505  AIIKNICKSDVFIGTSLVDMYAKCGLMRDSKDVFDGMRIRNTATWTCIISGYARNGFGKE 564

Query: 1401 AINLFRVMQRRKILANNLTMVSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIGSALVW 1580
            AINLFR M+ +++  N LT++S+L ACG + AL  G+EVHAQI K    +N++IGS LVW
Sbjct: 565  AINLFRSMESKRMHVNKLTVLSVLMACGTIKALLIGREVHAQIIKRIIHTNMYIGSTLVW 624

Query: 1581 LYCRCGDYSAASKVLEYMPLRDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVSPNPFT 1760
             YC+C +YS A KVL++MP RDVVSWTA+ISGC  LG E EALE+L+EM+ EGV PN +T
Sbjct: 625  FYCKCKEYSYAFKVLQHMPFRDVVSWTAIISGCARLGLELEALEFLQEMMQEGVLPNSYT 684

Query: 1761 YSSALKACAQLEDIRQGKLIHSSIHKTPASSSVFVGSALIHMYAKCGHLSEAIRVFDNMP 1940
            YSSALKACA+LE    GKLIHS   K+PASS+VFV SALI+MY+KCG++++A +VFDNMP
Sbjct: 685  YSSALKACAELEAPMLGKLIHSYASKSPASSNVFVNSALIYMYSKCGYVADAFQVFDNMP 744

Query: 1941 EQNLVSCKAMIIAYAKNGLCREALKLVYRMQAEGIELDDYVLSTVLTECGDFE 2099
            E+NLVS ++MI+AYA NG  REALKLV+RMQAEG+ +DDY+ +TV++ CG  E
Sbjct: 745  ERNLVSWESMILAYAWNGHAREALKLVHRMQAEGLVVDDYIHTTVVSACGGVE 797


>gb|EAY91181.1| hypothetical protein OsI_12790 [Oryza sativa Indica Group]
          Length = 885

 Score =  590 bits (1520), Expect = e-165
 Identities = 283/513 (55%), Positives = 378/513 (73%)
 Frame = +3

Query: 561  LAYLLQSCTDKEEVRIIHAIVLKISKESNIFVDNNLISGYARYGGIVEARKVFDEMLERN 740
            LA  L+ C   + VR +HA+ ++       FV NNLIS YAR+  + +ARKVFDEM ER+
Sbjct: 355  LASSLRDCGGADGVRRVHAVAVRSLDSLGTFVANNLISAYARFDEVSDARKVFDEMPERS 414

Query: 741  VVSWTAMLNAYLRFGYDNEAIRHFINFVESGIPWSSKTFVCMLNMCSKILDLELGKQVHT 920
            VVSWTAM+NAYL+ G+  E +R F + V SG+  +S TFVC+L  C +  D +LG+QVH 
Sbjct: 415  VVSWTAMMNAYLKLGHYGEVVRLFFDMVGSGVQGNSLTFVCLLKSCGERCDAKLGQQVHC 474

Query: 921  CMVKGGFSGLILNSALLYFYVQCGDLENAVSVFDHIKERDVICWTIMISAYSQHRRGEEA 1100
            C+VKGG+S +I++SA+ +FY QCGD+ +A ++FD +  RDVI WT MI+AY QH  G +A
Sbjct: 475  CIVKGGWSNVIVDSAIAHFYAQCGDVASASAIFDKMASRDVISWTTMITAYVQHGHGGQA 534

Query: 1101 FKTFSRMLSDRVEPNEFTICTVLDACGEEKELKLGRQLHGATIKTMYIKDVFIGTALVDM 1280
             + FS M+S+   PNEFT+C+VL AC EEK ++ G+QLH A +K MY  D+ IG+ALV M
Sbjct: 535  LRMFSEMVSEGFRPNEFTVCSVLKACAEEKAVRFGKQLHCAVLKKMYKNDIHIGSALVTM 594

Query: 1281 YAKCGETEDSRLVFDCMRNRNTVTWTTIIAGYARNGFGEEAINLFRVMQRRKILANNLTM 1460
            YA+CGE  D++ VFD M  RNT+TWT++I+GYA++G GE+AI LFR M+ R++  NNLT+
Sbjct: 595  YARCGEVFDAQAVFDMMPRRNTITWTSMISGYAQSGHGEKAIFLFRKMKMRRVFVNNLTI 654

Query: 1461 VSILRACGLLGALPTGKEVHAQIFKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYMPL 1640
            V +L ACG L +L  GKE+HAQI KN  + N+ IGS LVW YC+CG+Y+ A+++LE MP 
Sbjct: 655  VGLLSACGSLQSLYLGKELHAQIIKNSMEDNLQIGSTLVWFYCKCGEYTYAARILEAMPD 714

Query: 1641 RDVVSWTAMISGCTNLGHEHEALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGKLI 1820
            RD +SWTA+ISG  NLGH  EAL+ L +ML +GV PN +TYSSALKACA+LE ++ G+ I
Sbjct: 715  RDAISWTALISGYNNLGHNVEALKSLDDMLWDGVKPNTYTYSSALKACAKLEALQYGRKI 774

Query: 1821 HSSIHKTPASSSVFVGSALIHMYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNGLC 2000
            H  ++KT   S+VFVGS+LI MY +CG + EA RVFD MPE NLV+ K +I  +A+NGLC
Sbjct: 775  HGFVNKTQDFSNVFVGSSLIDMYMRCGKVDEARRVFDAMPEHNLVTWKVIITGFAQNGLC 834

Query: 2001 REALKLVYRMQAEGIELDDYVLSTVLTECGDFE 2099
             EALK +Y MQ EG E+DD+VLSTVLT CGD +
Sbjct: 835  EEALKYMYLMQQEGHEVDDFVLSTVLTSCGDLQ 867



 Score =  204 bits (520), Expect = 1e-49
 Identities = 111/333 (33%), Positives = 183/333 (54%)
 Frame = +3

Query: 1140 PNEFTICTVLDACGEEKELKLGRQLHGATIKTMYIKDVFIGTALVDMYAKCGETEDSRLV 1319
            P+   + + L  CG    +   R++H   ++++     F+   L+  YA+  E  D+R V
Sbjct: 350  PDAEALASSLRDCGGADGV---RRVHAVAVRSLDSLGTFVANNLISAYARFDEVSDARKV 406

Query: 1320 FDCMRNRNTVTWTTIIAGYARNGFGEEAINLFRVMQRRKILANNLTMVSILRACGLLGAL 1499
            FD M  R+ V+WT ++  Y + G   E + LF  M    +  N+LT V +L++CG     
Sbjct: 407  FDEMPERSVVSWTAMMNAYLKLGHYGEVVRLFFDMVGSGVQGNSLTFVCLLKSCGERCDA 466

Query: 1500 PTGKEVHAQIFKNYAQSNIHIGSALVWLYCRCGDYSAASKVLEYMPLRDVVSWTAMISGC 1679
              G++VH  I K    SN+ + SA+   Y +CGD ++AS + + M  RDV+SWT MI+  
Sbjct: 467  KLGQQVHCCIVKG-GWSNVIVDSAIAHFYAQCGDVASASAIFDKMASRDVISWTTMITAY 525

Query: 1680 TNLGHEHEALEYLKEMLGEGVSPNPFTYSSALKACAQLEDIRQGKLIHSSIHKTPASSSV 1859
               GH  +AL    EM+ EG  PN FT  S LKACA+ + +R GK +H ++ K    + +
Sbjct: 526  VQHGHGGQALRMFSEMVSEGFRPNEFTVCSVLKACAEEKAVRFGKQLHCAVLKKMYKNDI 585

Query: 1860 FVGSALIHMYAKCGHLSEAIRVFDNMPEQNLVSCKAMIIAYAKNGLCREALKLVYRMQAE 2039
             +GSAL+ MYA+CG + +A  VFD MP +N ++  +MI  YA++G   +A+ L  +M+  
Sbjct: 586  HIGSALVTMYARCGEVFDAQAVFDMMPRRNTITWTSMISGYAQSGHGEKAIFLFRKMKMR 645

Query: 2040 GIELDDYVLSTVLTECGDFERDKSTKELAFHLV 2138
             + +++  +  +L+ CG  +     KEL   ++
Sbjct: 646  RVFVNNLTIVGLLSACGSLQSLYLGKELHAQII 678

