BLASTX nr result

ID: Catharanthus22_contig00034818 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00034818
         (1965 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006339636.1| PREDICTED: pentatricopeptide repeat-containi...   851   0.0  
gb|EMJ04813.1| hypothetical protein PRUPE_ppa002292mg [Prunus pe...   841   0.0  
ref|XP_004229908.1| PREDICTED: pentatricopeptide repeat-containi...   835   0.0  
ref|XP_002324099.1| hypothetical protein POPTR_0017s12720g [Popu...   830   0.0  
ref|XP_003632466.1| PREDICTED: pentatricopeptide repeat-containi...   823   0.0  
ref|XP_006466653.1| PREDICTED: pentatricopeptide repeat-containi...   814   0.0  
ref|XP_006425825.1| hypothetical protein CICLE_v10024955mg [Citr...   814   0.0  
ref|XP_004301849.1| PREDICTED: pentatricopeptide repeat-containi...   801   0.0  
gb|EOX91314.1| Pentatricopeptide repeat (PPR) superfamily protei...   788   0.0  
ref|XP_003542017.1| PREDICTED: pentatricopeptide repeat-containi...   765   0.0  
gb|EXC24858.1| hypothetical protein L484_013224 [Morus notabilis]     756   0.0  
gb|ESW22273.1| hypothetical protein PHAVU_005G140500g [Phaseolus...   745   0.0  
ref|XP_004152308.1| PREDICTED: pentatricopeptide repeat-containi...   745   0.0  
ref|XP_004511479.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   739   0.0  
ref|XP_003610927.1| Pentatricopeptide repeat-containing protein ...   734   0.0  
ref|XP_006411944.1| hypothetical protein EUTSA_v10026762mg [Eutr...   713   0.0  
ref|XP_006283244.1| hypothetical protein CARUB_v10004277mg [Caps...   710   0.0  
ref|XP_002869013.1| pentatricopeptide repeat-containing protein ...   707   0.0  
ref|NP_195434.1| pentatricopeptide repeat-containing protein [Ar...   699   0.0  
gb|EPS71377.1| hypothetical protein M569_03380, partial [Genlise...   677   0.0  

>ref|XP_006339636.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like
            [Solanum tuberosum]
          Length = 695

 Score =  851 bits (2198), Expect = 0.0
 Identities = 415/612 (67%), Positives = 500/612 (81%)
 Frame = -3

Query: 1837 FQYHLHRAASSFSSPYPFQPKTTRNPPSPGKTFFKRDSNGDQQLINRLCRAYKFKEAIEI 1658
            F  H+H A +  SS       ++ +     KTFF+  +N DQ +I RLC   KF EA++I
Sbjct: 16   FSPHIHSACTIRSS-------SSFSTAQKEKTFFE-PANKDQ-MITRLCNENKFNEALQI 66

Query: 1657 LCQQNRLEDAIQLLEHQIDRPFADIYLVLLQSCIRQRALREGKRVHNHIRLSGFRPGIFV 1478
            LC+Q +L++AIQLLE    RP A ++  LL+ CI  RAL EGKRVH  ++ SGFRPG+ +
Sbjct: 67   LCEQRQLKEAIQLLERPETRPSATVFSTLLRICIDNRALEEGKRVHKSMKCSGFRPGVVI 126

Query: 1477 SNQILQLYCKCDSLEDAHQVFNEMGEKDVCSWNILISGYAKVGRVDLARRLFDEMPGRDN 1298
            SN+IL  YCKCD   DAH +F EM E+D+CSWNI++SG+AK+G +D AR+LFDEMP +DN
Sbjct: 127  SNRILDFYCKCDKPFDAHNLFVEMPERDLCSWNIMVSGFAKLGLIDEARKLFDEMPEKDN 186

Query: 1297 FSWTAMISGYVKHYMPKDALELYRLMQKQGAFVPSKFTVSSALAATASIQSLQLGMEIHG 1118
            FSWTAMISGYV+   P+ ALELYR+MQ+      +KFT+SSALAA+AS+QSL+LG EIHG
Sbjct: 187  FSWTAMISGYVRQNKPECALELYRVMQRDENVKCNKFTISSALAASASVQSLRLGKEIHG 246

Query: 1117 HINRTGLDSDAVVWSALSDMYGKCGSIDQARYVFDRTFEKDVVSWTAMIDRYFEGGRWEE 938
            HI RTGLDSDAVVWSALSDMYGKCGS+D+AR++FDRT +KDVVSWTAMIDRYF  GRWEE
Sbjct: 247  HIVRTGLDSDAVVWSALSDMYGKCGSVDEARHIFDRTKDKDVVSWTAMIDRYFGDGRWEE 306

Query: 937  GFLLFSDMLKSGIRPNEFTFAGILNACAHQTVEGLGKQVHGYMTRLEFDPVSFAASALLH 758
            G+LLFS +++SGIRPN+FTFAG+LNACAHQT E  GKQVHGYMTR+ FDP+SFAAS L+H
Sbjct: 307  GYLLFSCLMESGIRPNDFTFAGVLNACAHQTTEHFGKQVHGYMTRIGFDPLSFAASTLVH 366

Query: 757  MYSKCGNMDGAYKVFRLLKRPDLFSWTSLISGYAQNGQPHEALRLFELLLETDTKPDQIT 578
            MY+KCG++D AYKVF+ L RPD+ SWTSLI+GYAQNGQP EAL+LF+LLL++ T+PD IT
Sbjct: 367  MYAKCGSVDSAYKVFKRLPRPDVVSWTSLINGYAQNGQPSEALQLFDLLLKSGTQPDHIT 426

Query: 577  FVGVLSACTHAGLVDRGLEYFNSIKEKHGLTHTQDHYACVVDLLSRAGRFGEVEDLIKKM 398
            FVGVLSACTHAGLVD+GLEYF SIK+KH LTHT DHYACV+DLLSR GRF E E++I +M
Sbjct: 427  FVGVLSACTHAGLVDKGLEYFYSIKDKHCLTHTSDHYACVIDLLSRFGRFKEAEEIISQM 486

Query: 397  PMKPDKFLWASLLGGCRIHGXXXXXXXXXXXLFEIEPENAATYVTLANIYATAGNWGEVA 218
            PMKPDKFLWASLLGGCR+HG           LFEIEPENAATYVT+AN+YATAG W EVA
Sbjct: 487  PMKPDKFLWASLLGGCRVHGNVELAKRAAEALFEIEPENAATYVTIANVYATAGKWTEVA 546

Query: 217  KIRKIMDEKGLVKKPGMSWIHLKRKIHVFLVGDESHPRSDEIHAFLGELSKRMKAEGYVP 38
            KIR++M+EKG+VKKPG+SWI+L RK +VFLVGD+SHPRS EI+ FLGEL +RMK EGYVP
Sbjct: 547  KIRQVMEEKGVVKKPGISWINLLRKDYVFLVGDKSHPRSKEIYEFLGELWRRMKEEGYVP 606

Query: 37   DTDYVLHDVEEE 2
            D D VLHDVEEE
Sbjct: 607  DIDNVLHDVEEE 618


>gb|EMJ04813.1| hypothetical protein PRUPE_ppa002292mg [Prunus persica]
          Length = 691

 Score =  841 bits (2172), Expect = 0.0
 Identities = 411/601 (68%), Positives = 489/601 (81%)
 Frame = -3

Query: 1804 FSSPYPFQPKTTRNPPSPGKTFFKRDSNGDQQLINRLCRAYKFKEAIEILCQQNRLEDAI 1625
            F+  +    +T  N P   KTFFK  SN    LI+RLC+  KFKEAI+ILC+Q  L +AI
Sbjct: 17   FNRTHSSSSQTQLNKPLIEKTFFK--SNTKDGLISRLCKDGKFKEAIDILCEQKHLAEAI 74

Query: 1624 QLLEHQIDRPFADIYLVLLQSCIRQRALREGKRVHNHIRLSGFRPGIFVSNQILQLYCKC 1445
            QLL ++IDRP A IY  LLQ C++QRAL +GK VH H ++SGF PG+F+ N+++ LY KC
Sbjct: 75   QLL-NRIDRPSASIYSTLLQLCLQQRALVQGKLVHAHTKVSGFVPGLFICNRLIDLYAKC 133

Query: 1444 DSLEDAHQVFNEMGEKDVCSWNILISGYAKVGRVDLARRLFDEMPGRDNFSWTAMISGYV 1265
             SL DA +VF+EM E+D+CSWN +ISGYAKVG +  AR+LFDEMP +DNFSWTAMISGYV
Sbjct: 134  GSLVDAQKVFDEMSERDLCSWNTMISGYAKVGLLGEARKLFDEMPEKDNFSWTAMISGYV 193

Query: 1264 KHYMPKDALELYRLMQKQGAFVPSKFTVSSALAATASIQSLQLGMEIHGHINRTGLDSDA 1085
            +H  PK+AL+LYR+MQ+      +KFTVSSALAA+A+IQSL+LG EIHG I RTGLDSD 
Sbjct: 194  RHERPKEALQLYRMMQRHDNSKSNKFTVSSALAASAAIQSLRLGKEIHGFIMRTGLDSDE 253

Query: 1084 VVWSALSDMYGKCGSIDQARYVFDRTFEKDVVSWTAMIDRYFEGGRWEEGFLLFSDMLKS 905
            VVWSALSDMYGKCGSI++A+ +FD+   +DVVSWTAMIDRYFE G+ EEGF LFS+++KS
Sbjct: 254  VVWSALSDMYGKCGSIEEAKRIFDKMVNRDVVSWTAMIDRYFEDGKREEGFALFSELMKS 313

Query: 904  GIRPNEFTFAGILNACAHQTVEGLGKQVHGYMTRLEFDPVSFAASALLHMYSKCGNMDGA 725
            GIRPNEFTFAG+LNACAH   E LGKQVHGYMTR+ FDP+SFA+SAL+HMYSKCGN   A
Sbjct: 314  GIRPNEFTFAGVLNACAHHAAENLGKQVHGYMTRIGFDPLSFASSALVHMYSKCGNTVNA 373

Query: 724  YKVFRLLKRPDLFSWTSLISGYAQNGQPHEALRLFELLLETDTKPDQITFVGVLSACTHA 545
              VF+ +  PD+ SWTSLI GYAQNGQP+EAL+LFELLL++ TKPD ITFVGVLSACTHA
Sbjct: 374  NMVFKGMPHPDVVSWTSLIVGYAQNGQPYEALQLFELLLKSGTKPDHITFVGVLSACTHA 433

Query: 544  GLVDRGLEYFNSIKEKHGLTHTQDHYACVVDLLSRAGRFGEVEDLIKKMPMKPDKFLWAS 365
            GLV++GLEYF+SIK KHGL HT DHYACVVDLL+RAGRF E E+ I +MPMKPDKFLWAS
Sbjct: 434  GLVEKGLEYFHSIKAKHGLAHTADHYACVVDLLARAGRFEEAENFINEMPMKPDKFLWAS 493

Query: 364  LLGGCRIHGXXXXXXXXXXXLFEIEPENAATYVTLANIYATAGNWGEVAKIRKIMDEKGL 185
            L+GGCRIHG           LFEIEPEN ATY+TLANIYAT G W EV K+RK MDE+G+
Sbjct: 494  LIGGCRIHGNLKLAKRAAEALFEIEPENPATYITLANIYATGGMWDEVTKVRKTMDERGV 553

Query: 184  VKKPGMSWIHLKRKIHVFLVGDESHPRSDEIHAFLGELSKRMKAEGYVPDTDYVLHDVEE 5
            +KKPG+SWI +KR++HVFLVGD+SH R DEIH FL ELSKRMK EGYVPDT++VLHDVEE
Sbjct: 554  IKKPGLSWIEIKREVHVFLVGDKSHLRYDEIHFFLHELSKRMKEEGYVPDTNFVLHDVEE 613

Query: 4    E 2
            E
Sbjct: 614  E 614


>ref|XP_004229908.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like
            [Solanum lycopersicum]
          Length = 695

 Score =  835 bits (2158), Expect = 0.0
 Identities = 410/617 (66%), Positives = 498/617 (80%), Gaps = 5/617 (0%)
 Frame = -3

Query: 1837 FQYHLH-----RAASSFSSPYPFQPKTTRNPPSPGKTFFKRDSNGDQQLINRLCRAYKFK 1673
            F  H+H     R++SSFS+ +              KTFF+  +N DQ +I RLC   KF 
Sbjct: 16   FSPHIHSACTIRSSSSFSTAHK------------EKTFFQ-PANKDQ-MITRLCNENKFN 61

Query: 1672 EAIEILCQQNRLEDAIQLLEHQIDRPFADIYLVLLQSCIRQRALREGKRVHNHIRLSGFR 1493
            EA+++LC+Q RL++AIQLLE    RP A ++  LL+ CI  RAL EGKRVH  ++ SGFR
Sbjct: 62   EALQMLCEQRRLKEAIQLLERPETRPSATVFSTLLRICIDNRALEEGKRVHKIMKCSGFR 121

Query: 1492 PGIFVSNQILQLYCKCDSLEDAHQVFNEMGEKDVCSWNILISGYAKVGRVDLARRLFDEM 1313
            PG+ +SN++L  YCKCD   DA  +F EM E+D+CSWNI++SG+AK+G +D AR+LFDEM
Sbjct: 122  PGVVISNRVLDFYCKCDKPFDAQNLFVEMPERDLCSWNIMVSGFAKLGLIDEARKLFDEM 181

Query: 1312 PGRDNFSWTAMISGYVKHYMPKDALELYRLMQKQGAFVPSKFTVSSALAATASIQSLQLG 1133
            P +DNFSWTAMISGYV+H  P+ ALELYR+M +   F  +KFT+SSALAA+ASIQSL+LG
Sbjct: 182  PEKDNFSWTAMISGYVRHNKPECALELYRVMLRDENFKCNKFTISSALAASASIQSLRLG 241

Query: 1132 MEIHGHINRTGLDSDAVVWSALSDMYGKCGSIDQARYVFDRTFEKDVVSWTAMIDRYFEG 953
             EI+GHI RTGLDSDAVVWSALSDMYGKCGS+D+AR++FDRT +KDVVSWTAMIDRYF  
Sbjct: 242  KEIYGHIVRTGLDSDAVVWSALSDMYGKCGSVDEARHIFDRTKDKDVVSWTAMIDRYFGD 301

Query: 952  GRWEEGFLLFSDMLKSGIRPNEFTFAGILNACAHQTVEGLGKQVHGYMTRLEFDPVSFAA 773
            GRWEEG+LLFS ++ SGIRPN+FTFAG+LNACAHQT E  GKQVHGYM R+ FDP+SFAA
Sbjct: 302  GRWEEGYLLFSCLMYSGIRPNDFTFAGVLNACAHQTKEHFGKQVHGYMMRIGFDPLSFAA 361

Query: 772  SALLHMYSKCGNMDGAYKVFRLLKRPDLFSWTSLISGYAQNGQPHEALRLFELLLETDTK 593
            S L+HMY+KCG++D AYKVF+ L +PD+ SWTSLI+GYAQN QP EAL+L++ LL++ T+
Sbjct: 362  STLVHMYAKCGSVDSAYKVFKRLPKPDVVSWTSLINGYAQNSQPSEALQLYDSLLKSGTQ 421

Query: 592  PDQITFVGVLSACTHAGLVDRGLEYFNSIKEKHGLTHTQDHYACVVDLLSRAGRFGEVED 413
            PD ITFVGVLSACTHAGLVD+GLEYF SIK+KH LTHT DHYACV+DLLSR GRF E E+
Sbjct: 422  PDHITFVGVLSACTHAGLVDKGLEYFYSIKDKHCLTHTADHYACVIDLLSRFGRFKEAEE 481

Query: 412  LIKKMPMKPDKFLWASLLGGCRIHGXXXXXXXXXXXLFEIEPENAATYVTLANIYATAGN 233
            +I +MPMKPDKFLWASLLGGCR+HG           LFEIEPENAATYVT+AN+YATAG 
Sbjct: 482  IISQMPMKPDKFLWASLLGGCRVHGNVELAKRAAEALFEIEPENAATYVTIANVYATAGK 541

Query: 232  WGEVAKIRKIMDEKGLVKKPGMSWIHLKRKIHVFLVGDESHPRSDEIHAFLGELSKRMKA 53
            W EVAKIR++M+EKG+VKKPG+SWI+L+RK +VFLVGD+SHPRS EI+ FLGEL +RMK 
Sbjct: 542  WTEVAKIRRVMEEKGVVKKPGISWINLQRKDYVFLVGDKSHPRSKEIYEFLGELWRRMKE 601

Query: 52   EGYVPDTDYVLHDVEEE 2
            EGYVP  D VLHDVEEE
Sbjct: 602  EGYVPAIDNVLHDVEEE 618


>ref|XP_002324099.1| hypothetical protein POPTR_0017s12720g [Populus trichocarpa]
            gi|222867101|gb|EEF04232.1| hypothetical protein
            POPTR_0017s12720g [Populus trichocarpa]
          Length = 676

 Score =  830 bits (2145), Expect = 0.0
 Identities = 395/594 (66%), Positives = 497/594 (83%)
 Frame = -3

Query: 1783 QPKTTRNPPSPGKTFFKRDSNGDQQLINRLCRAYKFKEAIEILCQQNRLEDAIQLLEHQI 1604
            +P  + +P  P KTFFK ++  D  L+  LC   +F EAI ILCQQNRL++A+Q+L HQI
Sbjct: 9    KPSHSSSPFQP-KTFFKSNTK-DTTLVPHLCNHKRFDEAIHILCQQNRLKEALQIL-HQI 65

Query: 1603 DRPFADIYLVLLQSCIRQRALREGKRVHNHIRLSGFRPGIFVSNQILQLYCKCDSLEDAH 1424
            D+P A +Y  L+QSCI+ R L++GK+VH HI+LSGF PG+F+ N++L++Y KCDSL D+ 
Sbjct: 66   DKPSASVYSTLIQSCIKSRLLQQGKKVHQHIKLSGFVPGLFILNRLLEMYAKCDSLMDSQ 125

Query: 1423 QVFNEMGEKDVCSWNILISGYAKVGRVDLARRLFDEMPGRDNFSWTAMISGYVKHYMPKD 1244
            ++F+EM E+D+CSWNILISGYAK+G +  A+ LFD+MP RDNFSWTAMISGYV+H  P +
Sbjct: 126  KLFDEMPERDLCSWNILISGYAKMGLLQEAKSLFDKMPERDNFSWTAMISGYVRHDRPNE 185

Query: 1243 ALELYRLMQKQGAFVPSKFTVSSALAATASIQSLQLGMEIHGHINRTGLDSDAVVWSALS 1064
            ALEL+R+M++      +KFTVSSALAA A++  L++G EIHG+I RTGLDSD VVWSALS
Sbjct: 186  ALELFRMMKRSDNSKSNKFTVSSALAAAAAVPCLRIGKEIHGYIMRTGLDSDEVVWSALS 245

Query: 1063 DMYGKCGSIDQARYVFDRTFEKDVVSWTAMIDRYFEGGRWEEGFLLFSDMLKSGIRPNEF 884
            DMYGKCGSI++AR++FD+  ++D+V+WTAMIDRYF+ GR +EGF LF+D+L+SGIRPNEF
Sbjct: 246  DMYGKCGSIEEARHIFDKMVDRDIVTWTAMIDRYFQDGRRKEGFDLFADLLRSGIRPNEF 305

Query: 883  TFAGILNACAHQTVEGLGKQVHGYMTRLEFDPVSFAASALLHMYSKCGNMDGAYKVFRLL 704
            TF+G+LNACA+QT E LGK+VHGYMTR+ FDP SFAASAL+HMYSKCGNM  A +VF+  
Sbjct: 306  TFSGVLNACANQTSEELGKKVHGYMTRVGFDPFSFAASALVHMYSKCGNMVSAERVFKET 365

Query: 703  KRPDLFSWTSLISGYAQNGQPHEALRLFELLLETDTKPDQITFVGVLSACTHAGLVDRGL 524
             +PDLFSWTSLI+GYAQNGQP EA+R FELL+++ T+PD ITFVGVLSAC HAGLVD+GL
Sbjct: 366  PQPDLFSWTSLIAGYAQNGQPDEAIRYFELLVKSGTQPDHITFVGVLSACAHAGLVDKGL 425

Query: 523  EYFNSIKEKHGLTHTQDHYACVVDLLSRAGRFGEVEDLIKKMPMKPDKFLWASLLGGCRI 344
            +YF+SIKE++GLTHT DHYAC++DLL+R+G+F E E++I KM MKPDKFLWASLLGGCRI
Sbjct: 426  DYFHSIKEQYGLTHTADHYACIIDLLARSGQFDEAENIISKMSMKPDKFLWASLLGGCRI 485

Query: 343  HGXXXXXXXXXXXLFEIEPENAATYVTLANIYATAGNWGEVAKIRKIMDEKGLVKKPGMS 164
            HG           LFEIEPEN ATYVTLANIYATAG W EVAKIRK MD++G+VKKPG+S
Sbjct: 486  HGNLKLAQRAAEALFEIEPENPATYVTLANIYATAGMWSEVAKIRKTMDDRGVVKKPGLS 545

Query: 163  WIHLKRKIHVFLVGDESHPRSDEIHAFLGELSKRMKAEGYVPDTDYVLHDVEEE 2
            WI +KR +HVFLVGD+SHP+S EI+ FLG+LSKRMK EG+VPDT++VLHDVE+E
Sbjct: 546  WIAIKRDVHVFLVGDDSHPKSKEINEFLGKLSKRMKEEGFVPDTNFVLHDVEDE 599


>ref|XP_003632466.1| PREDICTED: pentatricopeptide repeat-containing protein
            At4g37170-like, partial [Vitis vinifera]
          Length = 621

 Score =  823 bits (2126), Expect = 0.0
 Identities = 403/600 (67%), Positives = 484/600 (80%)
 Frame = -3

Query: 1801 SSPYPFQPKTTRNPPSPGKTFFKRDSNGDQQLINRLCRAYKFKEAIEILCQQNRLEDAIQ 1622
            SS    QP+ ++ P     TFFK  S    +L+ RLC+   FKEAI+ILC+Q RL +AIQ
Sbjct: 24   SSSTTSQPQLSKPPIH--NTFFK--SGAKDELVKRLCKDNNFKEAIDILCEQKRLREAIQ 79

Query: 1621 LLEHQIDRPFADIYLVLLQSCIRQRALREGKRVHNHIRLSGFRPGIFVSNQILQLYCKCD 1442
            +L+H +DRP A  Y  LLQ C++ RAL EG +VH H + SGF PG+ +SN+IL +Y KC+
Sbjct: 80   ILDH-VDRPSAATYSTLLQLCLQLRALDEGMKVHAHTKTSGFVPGVVISNRILDMYIKCN 138

Query: 1441 SLEDAHQVFNEMGEKDVCSWNILISGYAKVGRVDLARRLFDEMPGRDNFSWTAMISGYVK 1262
            SL +A ++F+EM E+D+CSWNI+ISGYAK GR+  AR+LFD+M  RDNFSWTAM SGYV+
Sbjct: 139  SLVNAKRLFDEMAERDLCSWNIMISGYAKAGRLQEARKLFDQMTERDNFSWTAMTSGYVR 198

Query: 1261 HYMPKDALELYRLMQKQGAFVPSKFTVSSALAATASIQSLQLGMEIHGHINRTGLDSDAV 1082
            H   ++ALEL+R MQ+   F  +KFT+SSALAA+A+IQSL LG EIHGHI R GLD D V
Sbjct: 199  HDQHEEALELFRAMQRHENFKCNKFTMSSALAASAAIQSLHLGKEIHGHILRIGLDLDGV 258

Query: 1081 VWSALSDMYGKCGSIDQARYVFDRTFEKDVVSWTAMIDRYFEGGRWEEGFLLFSDMLKSG 902
            VWSALSDMYGKCGSI +AR++FD+T ++DVVSWTAMIDRYF+ GR EEGF LFSD+LKSG
Sbjct: 259  VWSALSDMYGKCGSIGEARHIFDKTVDRDVVSWTAMIDRYFKEGRREEGFALFSDLLKSG 318

Query: 901  IRPNEFTFAGILNACAHQTVEGLGKQVHGYMTRLEFDPVSFAASALLHMYSKCGNMDGAY 722
            I PNEFTF+G+LNACA    E LGKQVHGYMTR+ FDP SFAAS L+HMY+KCGN+  A 
Sbjct: 319  IWPNEFTFSGVLNACADHAAEELGKQVHGYMTRIGFDPSSFAASTLVHMYTKCGNIKNAR 378

Query: 721  KVFRLLKRPDLFSWTSLISGYAQNGQPHEALRLFELLLETDTKPDQITFVGVLSACTHAG 542
            +VF  + RPDL SWTSLISGYAQNGQP EAL+ FELLL++ T+PD ITFVGVLSACTHAG
Sbjct: 379  RVFNGMPRPDLVSWTSLISGYAQNGQPDEALQFFELLLKSGTQPDHITFVGVLSACTHAG 438

Query: 541  LVDRGLEYFNSIKEKHGLTHTQDHYACVVDLLSRAGRFGEVEDLIKKMPMKPDKFLWASL 362
            LVD+GLEYF+SIKEKHGLTHT DHYAC++DLLSR+GR  E ED+I KMP++PDKFLWASL
Sbjct: 439  LVDKGLEYFDSIKEKHGLTHTADHYACLIDLLSRSGRLQEAEDIIDKMPIEPDKFLWASL 498

Query: 361  LGGCRIHGXXXXXXXXXXXLFEIEPENAATYVTLANIYATAGNWGEVAKIRKIMDEKGLV 182
            LGGCRIHG           LFEIEPEN ATY TLANIYATAG WG VA++RK+MD +G+V
Sbjct: 499  LGGCRIHGNLKLAKRAAEALFEIEPENPATYTTLANIYATAGLWGGVAEVRKVMDARGVV 558

Query: 181  KKPGMSWIHLKRKIHVFLVGDESHPRSDEIHAFLGELSKRMKAEGYVPDTDYVLHDVEEE 2
            KKPG+SWI +KR++HVFLVGD SH +S EIH FLG+LSKRMK EGYVPDT++VLHDVEEE
Sbjct: 559  KKPGLSWIEIKREVHVFLVGDTSHAKSKEIHEFLGKLSKRMKEEGYVPDTNFVLHDVEEE 618


>ref|XP_006466653.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like
            [Citrus sinensis]
          Length = 695

 Score =  814 bits (2102), Expect = 0.0
 Identities = 392/582 (67%), Positives = 476/582 (81%)
 Frame = -3

Query: 1747 KTFFKRDSNGDQQLINRLCRAYKFKEAIEILCQQNRLEDAIQLLEHQIDRPFADIYLVLL 1568
            KTFFK ++N D  LI  LC+  KFKEAI+ILC Q RL++A+Q+L HQI  P   IY  L+
Sbjct: 40   KTFFKSNNNDD--LITDLCKHNKFKEAIDILCNQKRLKEALQIL-HQISHPSPSIYSSLI 96

Query: 1567 QSCIRQRALREGKRVHNHIRLSGFRPGIFVSNQILQLYCKCDSLEDAHQVFNEMGEKDVC 1388
            Q C + RAL EGK+VH+H++ SGF+PG+F+SN +L +Y KC +L DA  +F+EM E+DVC
Sbjct: 97   QFCRQNRALEEGKKVHSHLKSSGFKPGVFISNCLLDMYAKCGNLSDARTLFDEMHERDVC 156

Query: 1387 SWNILISGYAKVGRVDLARRLFDEMPGRDNFSWTAMISGYVKHYMPKDALELYRLMQKQG 1208
            S+N +ISGY KVG ++ AR LFDEMP RDNFSWTA+ISGYV++  P +AL+LYR+MQ   
Sbjct: 157  SYNTMISGYTKVGFLEQARNLFDEMPQRDNFSWTAIISGYVRYNQPIEALDLYRMMQNFE 216

Query: 1207 AFVPSKFTVSSALAATASIQSLQLGMEIHGHINRTGLDSDAVVWSALSDMYGKCGSIDQA 1028
              V +KFT+SSAL+A ++IQ L+LG EIHG+I RTG DSD VVWSALSDMYGKCGSI++A
Sbjct: 217  NSVSNKFTLSSALSAVSAIQCLRLGKEIHGYIMRTGFDSDEVVWSALSDMYGKCGSINEA 276

Query: 1027 RYVFDRTFEKDVVSWTAMIDRYFEGGRWEEGFLLFSDMLKSGIRPNEFTFAGILNACAHQ 848
            R +FD+  ++DVVSWTAMI RYF+ GR EEGF LFS+++KSGIRPN FTFAG+LNACA  
Sbjct: 277  RQIFDKMVDRDVVSWTAMIGRYFQEGRREEGFALFSELIKSGIRPNAFTFAGVLNACADH 336

Query: 847  TVEGLGKQVHGYMTRLEFDPVSFAASALLHMYSKCGNMDGAYKVFRLLKRPDLFSWTSLI 668
              E LGKQVHGYMTR+ +DP SFAASAL+HMYSKCGN++ + KVF  + RPDL SWTSLI
Sbjct: 337  AAEELGKQVHGYMTRIGYDPYSFAASALVHMYSKCGNVENSKKVFNGMPRPDLVSWTSLI 396

Query: 667  SGYAQNGQPHEALRLFELLLETDTKPDQITFVGVLSACTHAGLVDRGLEYFNSIKEKHGL 488
            +GYAQNG P +AL  FELLL++ T+PD I FVGVL+ACTHAGLVD+GL+YF+SIKEKHGL
Sbjct: 397  AGYAQNGMPDKALEYFELLLKSGTQPDNIVFVGVLTACTHAGLVDKGLQYFHSIKEKHGL 456

Query: 487  THTQDHYACVVDLLSRAGRFGEVEDLIKKMPMKPDKFLWASLLGGCRIHGXXXXXXXXXX 308
            T+T DHYAC+VDLL+R+GRF E ED+I KMPMKPDKFLWASLLGGCRIHG          
Sbjct: 457  TYTADHYACIVDLLARSGRFHEAEDVISKMPMKPDKFLWASLLGGCRIHGNLDLAKRAAE 516

Query: 307  XLFEIEPENAATYVTLANIYATAGNWGEVAKIRKIMDEKGLVKKPGMSWIHLKRKIHVFL 128
             LFEIEPEN ATYVT+ANIYA+AG W EVA IRK+MD++G+VKKPG+SWI ++R  HVFL
Sbjct: 517  ALFEIEPENPATYVTMANIYASAGKWSEVASIRKLMDDRGVVKKPGLSWIEIQRDAHVFL 576

Query: 127  VGDESHPRSDEIHAFLGELSKRMKAEGYVPDTDYVLHDVEEE 2
            VGD SHPRS EIH FLG+LSKRMK EGYVPDT++VLHDVEEE
Sbjct: 577  VGDTSHPRSKEIHEFLGKLSKRMKEEGYVPDTNFVLHDVEEE 618


>ref|XP_006425825.1| hypothetical protein CICLE_v10024955mg [Citrus clementina]
            gi|557527815|gb|ESR39065.1| hypothetical protein
            CICLE_v10024955mg [Citrus clementina]
          Length = 759

 Score =  814 bits (2102), Expect = 0.0
 Identities = 392/582 (67%), Positives = 476/582 (81%)
 Frame = -3

Query: 1747 KTFFKRDSNGDQQLINRLCRAYKFKEAIEILCQQNRLEDAIQLLEHQIDRPFADIYLVLL 1568
            KTFFK ++N D  LI  LC+  KFKEAI+ILC Q RL++A+Q+L HQI  P   IY  L+
Sbjct: 104  KTFFKSNNNDD--LITDLCKHNKFKEAIDILCNQKRLKEALQIL-HQISHPSPSIYSSLI 160

Query: 1567 QSCIRQRALREGKRVHNHIRLSGFRPGIFVSNQILQLYCKCDSLEDAHQVFNEMGEKDVC 1388
            Q C + RAL EGK+VH+H++ SGF+PG+F+SN +L +Y KC +L DA  +F+EM E+DVC
Sbjct: 161  QFCRQNRALEEGKKVHSHLKSSGFKPGVFISNCLLDMYAKCGNLSDARTLFDEMHERDVC 220

Query: 1387 SWNILISGYAKVGRVDLARRLFDEMPGRDNFSWTAMISGYVKHYMPKDALELYRLMQKQG 1208
            S+N +ISGY KVG ++ AR LFDEMP RDNFSWTA+ISGYV++  P +AL+LYR+MQ   
Sbjct: 221  SYNTMISGYTKVGFLEQARNLFDEMPQRDNFSWTAIISGYVRYNQPIEALDLYRMMQNFE 280

Query: 1207 AFVPSKFTVSSALAATASIQSLQLGMEIHGHINRTGLDSDAVVWSALSDMYGKCGSIDQA 1028
              V +KFT+SSAL+A ++IQ L+LG EIHG+I RTG DSD VVWSALSDMYGKCGSI++A
Sbjct: 281  NSVSNKFTLSSALSAVSAIQCLRLGKEIHGYIMRTGFDSDEVVWSALSDMYGKCGSINEA 340

Query: 1027 RYVFDRTFEKDVVSWTAMIDRYFEGGRWEEGFLLFSDMLKSGIRPNEFTFAGILNACAHQ 848
            R +FD+  ++DVVSWTAMI RYF+ GR EEGF LFS+++KSGIRPN FTFAG+LNACA  
Sbjct: 341  RQIFDKMVDRDVVSWTAMIGRYFQEGRREEGFALFSELIKSGIRPNAFTFAGVLNACADH 400

Query: 847  TVEGLGKQVHGYMTRLEFDPVSFAASALLHMYSKCGNMDGAYKVFRLLKRPDLFSWTSLI 668
              E LGKQVHGYMTR+ +DP SFAASAL+HMYSKCGN++ + KVF  + RPDL SWTSLI
Sbjct: 401  AAEELGKQVHGYMTRIGYDPYSFAASALVHMYSKCGNVENSKKVFNGMPRPDLVSWTSLI 460

Query: 667  SGYAQNGQPHEALRLFELLLETDTKPDQITFVGVLSACTHAGLVDRGLEYFNSIKEKHGL 488
            +GYAQNG P +AL  FELLL++ T+PD I FVGVL+ACTHAGLVD+GL+YF+SIKEKHGL
Sbjct: 461  AGYAQNGMPDKALEYFELLLKSGTQPDNIVFVGVLTACTHAGLVDKGLQYFHSIKEKHGL 520

Query: 487  THTQDHYACVVDLLSRAGRFGEVEDLIKKMPMKPDKFLWASLLGGCRIHGXXXXXXXXXX 308
            T+T DHYAC+VDLL+R+GRF E ED+I KMPMKPDKFLWASLLGGCRIHG          
Sbjct: 521  TYTADHYACIVDLLARSGRFHEAEDVISKMPMKPDKFLWASLLGGCRIHGNLDLAKRAAE 580

Query: 307  XLFEIEPENAATYVTLANIYATAGNWGEVAKIRKIMDEKGLVKKPGMSWIHLKRKIHVFL 128
             LFEIEPEN ATYVT+ANIYA+AG W EVA IRK+MD++G+VKKPG+SWI ++R  HVFL
Sbjct: 581  ALFEIEPENPATYVTMANIYASAGKWSEVASIRKLMDDRGVVKKPGLSWIEIQRDAHVFL 640

Query: 127  VGDESHPRSDEIHAFLGELSKRMKAEGYVPDTDYVLHDVEEE 2
            VGD SHPRS EIH FLG+LSKRMK EGYVPDT++VLHDVEEE
Sbjct: 641  VGDTSHPRSKEIHEFLGKLSKRMKEEGYVPDTNFVLHDVEEE 682


>ref|XP_004301849.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like
            [Fragaria vesca subsp. vesca]
          Length = 757

 Score =  801 bits (2070), Expect = 0.0
 Identities = 393/587 (66%), Positives = 467/587 (79%)
 Frame = -3

Query: 1762 PPSPGKTFFKRDSNGDQQLINRLCRAYKFKEAIEILCQQNRLEDAIQLLEHQIDRPFADI 1583
            PP   K FFK  SN    LI RLC+  KFKEAI ILC QNRL +A+QLL H I  P + +
Sbjct: 97   PPLSNKYFFK--SNTKDGLITRLCKDRKFKEAIHILCNQNRLPEAVQLLTH-IATPSSSL 153

Query: 1582 YLVLLQSCIRQRALREGKRVHNHIRLSGFRPGIFVSNQILQLYCKCDSLEDAHQVFNEMG 1403
            Y  LL  C++ RAL + K VH+H +L GF  G+F+SN+ + LY KC SL DA +VF+EM 
Sbjct: 154  YSTLLHHCLQHRALDQAKLVHSHTKLYGFDLGLFISNRFINLYAKCGSLVDAQKVFDEMP 213

Query: 1402 EKDVCSWNILISGYAKVGRVDLARRLFDEMPGRDNFSWTAMISGYVKHYMPKDALELYRL 1223
            ++D+CSWN +ISGYAK+G++  AR+LFDEMP RDNFSWTAMISGYV H  P +ALELYR+
Sbjct: 214  DRDLCSWNTMISGYAKLGKLGDARKLFDEMPHRDNFSWTAMISGYVWHERPDEALELYRV 273

Query: 1222 MQKQGAFVPSKFTVSSALAATASIQSLQLGMEIHGHINRTGLDSDAVVWSALSDMYGKCG 1043
            M+K+ +   SKFTVSS L A+A++QSL++G EIH +I RTGLDSD VVWSALSDMYGKCG
Sbjct: 274  MRKEESSKCSKFTVSSVLVASAAVQSLRMGKEIHCYIMRTGLDSDEVVWSALSDMYGKCG 333

Query: 1042 SIDQARYVFDRTFEKDVVSWTAMIDRYFEGGRWEEGFLLFSDMLKSGIRPNEFTFAGILN 863
            SI++AR VFD+   +DVV+WTAM+ RYFE G+ EEG  LFS+++++GIRPNEFTFAG+LN
Sbjct: 334  SIEEARRVFDKMVNRDVVTWTAMMGRYFEDGKREEGLALFSELMRTGIRPNEFTFAGVLN 393

Query: 862  ACAHQTVEGLGKQVHGYMTRLEFDPVSFAASALLHMYSKCGNMDGAYKVFRLLKRPDLFS 683
            ACA   +E LGKQVHGYMTR+EFDP SFAASAL+HMYSKCGN   A KVF+ +  PDL S
Sbjct: 394  ACADHAIENLGKQVHGYMTRIEFDPFSFAASALVHMYSKCGNTANANKVFKGMPSPDLVS 453

Query: 682  WTSLISGYAQNGQPHEALRLFELLLETDTKPDQITFVGVLSACTHAGLVDRGLEYFNSIK 503
            WTSLI GYAQNGQ  EAL+LFE LL++ T+PD +TFVGVLSACTHAGLVDRGLEYF+SIK
Sbjct: 454  WTSLIVGYAQNGQADEALQLFESLLKSGTRPDHVTFVGVLSACTHAGLVDRGLEYFHSIK 513

Query: 502  EKHGLTHTQDHYACVVDLLSRAGRFGEVEDLIKKMPMKPDKFLWASLLGGCRIHGXXXXX 323
            EKHGL HT DHYACVVDLL+RAG+F E E++I +MPMKP KFLWASL+GGCRIHG     
Sbjct: 514  EKHGLKHTADHYACVVDLLARAGQFDEAENIISEMPMKPSKFLWASLIGGCRIHGNVKLA 573

Query: 322  XXXXXXLFEIEPENAATYVTLANIYATAGNWGEVAKIRKIMDEKGLVKKPGMSWIHLKRK 143
                  LF IEPEN ATYVTLANIYATAG W EV  +RK MDE G+ KKPG+SWI +KR+
Sbjct: 574  KRAAEALFVIEPENPATYVTLANIYATAGMWSEVTNVRKKMDESGITKKPGLSWIEIKRE 633

Query: 142  IHVFLVGDESHPRSDEIHAFLGELSKRMKAEGYVPDTDYVLHDVEEE 2
            +HVFLVGD+SHPR +EI  FL ELSKRMK EGYVPDT +VLHDVEEE
Sbjct: 634  MHVFLVGDQSHPRYNEIDHFLSELSKRMKEEGYVPDTKFVLHDVEEE 680


>gb|EOX91314.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao]
          Length = 684

 Score =  788 bits (2035), Expect = 0.0
 Identities = 394/612 (64%), Positives = 476/612 (77%), Gaps = 2/612 (0%)
 Frame = -3

Query: 1831 YHLHRA--ASSFSSPYPFQPKTTRNPPSPGKTFFKRDSNGDQQLINRLCRAYKFKEAIEI 1658
            +H +RA  AS FSS      +T  +  SP KTFFK   N    LIN+L            
Sbjct: 15   FHKNRAIFASWFSS------QTQIHKHSPRKTFFK--PNTKDNLINQLHN---------- 56

Query: 1657 LCQQNRLEDAIQLLEHQIDRPFADIYLVLLQSCIRQRALREGKRVHNHIRLSGFRPGIFV 1478
            LC Q RL++AIQ+L +QI++P A +Y  L+Q C + RAL EGK VH HI++SGF  G+ +
Sbjct: 57   LCNQKRLKEAIQIL-NQIEKPPASLYSTLIQLCCQNRALNEGKSVHQHIKISGFSAGLVI 115

Query: 1477 SNQILQLYCKCDSLEDAHQVFNEMGEKDVCSWNILISGYAKVGRVDLARRLFDEMPGRDN 1298
             N++L +Y KC SL DA  VF+EM E+D+CSWN L+SGYAK+G +  A +LFDEMP RDN
Sbjct: 116  CNRLLDMYAKCGSLADAQNVFDEMSERDLCSWNTLMSGYAKMGMLKEANKLFDEMPERDN 175

Query: 1297 FSWTAMISGYVKHYMPKDALELYRLMQKQGAFVPSKFTVSSALAATASIQSLQLGMEIHG 1118
            FSWTAMISGYV+   PK+ALELYR+ +       +KFTVSSA+AA+A++  L  G EIHG
Sbjct: 176  FSWTAMISGYVRFDRPKEALELYRMKEMSMVSKLNKFTVSSAIAASAAMGCLTTGKEIHG 235

Query: 1117 HINRTGLDSDAVVWSALSDMYGKCGSIDQARYVFDRTFEKDVVSWTAMIDRYFEGGRWEE 938
             I R GLD D VVWSAL DMYGKCGSI++AR VFD+  ++D+VSWTAMIDRYFE GRWEE
Sbjct: 236  RITRAGLDLDEVVWSALMDMYGKCGSIEEARRVFDKIVDRDIVSWTAMIDRYFEDGRWEE 295

Query: 937  GFLLFSDMLKSGIRPNEFTFAGILNACAHQTVEGLGKQVHGYMTRLEFDPVSFAASALLH 758
            GF LFS+++KSGIRPNEFTFAG+LNACA    E +GKQVHG MTRL F+P SFAASAL+H
Sbjct: 296  GFELFSELMKSGIRPNEFTFAGVLNACADHAAEEIGKQVHGCMTRLGFNPFSFAASALVH 355

Query: 757  MYSKCGNMDGAYKVFRLLKRPDLFSWTSLISGYAQNGQPHEALRLFELLLETDTKPDQIT 578
            MYSKCGN++ A +VF  +  PDL SWTSLI+GYAQNGQP EAL  FELLL++ TKPD IT
Sbjct: 356  MYSKCGNVENAKRVFNGMPLPDLVSWTSLITGYAQNGQPEEALEYFELLLKSGTKPDHIT 415

Query: 577  FVGVLSACTHAGLVDRGLEYFNSIKEKHGLTHTQDHYACVVDLLSRAGRFGEVEDLIKKM 398
            FVGVLSACTHAGLVD+GLEYF+SIK++HGLTHT DHYAC++DLL+R+GRF E E++I KM
Sbjct: 416  FVGVLSACTHAGLVDKGLEYFHSIKDRHGLTHTADHYACIIDLLARSGRFQEAENIIVKM 475

Query: 397  PMKPDKFLWASLLGGCRIHGXXXXXXXXXXXLFEIEPENAATYVTLANIYATAGNWGEVA 218
            PMKPDKFLWASLLGGCRIHG           LFEIEPEN ATYVT+ANIYATAG W EVA
Sbjct: 476  PMKPDKFLWASLLGGCRIHGNLELAEKAAEALFEIEPENPATYVTMANIYATAGRWDEVA 535

Query: 217  KIRKIMDEKGLVKKPGMSWIHLKRKIHVFLVGDESHPRSDEIHAFLGELSKRMKAEGYVP 38
            KIRK MD+KG+VKKPG+SWI +KR++HVFLVGD +HP+S EI+ FL +LSKRM+ EGYVP
Sbjct: 536  KIRKKMDDKGVVKKPGLSWIEVKRELHVFLVGDTTHPKSKEINEFLVKLSKRMREEGYVP 595

Query: 37   DTDYVLHDVEEE 2
            +T++VLHDVEEE
Sbjct: 596  NTNFVLHDVEEE 607


>ref|XP_003542017.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like
            [Glycine max]
          Length = 693

 Score =  765 bits (1975), Expect = 0.0
 Identities = 360/560 (64%), Positives = 447/560 (79%)
 Frame = -3

Query: 1681 KFKEAIEILCQQNRLEDAIQLLEHQIDRPFADIYLVLLQSCIRQRALREGKRVHNHIRLS 1502
            KF+EA+++LCQQ R+++A++LL     RP A +Y  L+ +C+R RAL  G+RVH H + S
Sbjct: 57   KFEEAVDVLCQQKRVKEAVELLHRTDHRPSARVYSTLIAACVRHRALELGRRVHAHTKAS 116

Query: 1501 GFRPGIFVSNQILQLYCKCDSLEDAHQVFNEMGEKDVCSWNILISGYAKVGRVDLARRLF 1322
             F PG+F+SN++L +Y KC SL DA  +F+EMG +D+CSWN +I GYAK+GR++ AR+LF
Sbjct: 117  NFVPGVFISNRLLDMYAKCGSLVDAQMLFDEMGHRDLCSWNTMIVGYAKLGRLEQARKLF 176

Query: 1321 DEMPGRDNFSWTAMISGYVKHYMPKDALELYRLMQKQGAFVPSKFTVSSALAATASIQSL 1142
            DEMP RDNFSW A ISGYV H  P++ALEL+R+MQ+      +KFT+SSALAA+A+I  L
Sbjct: 177  DEMPQRDNFSWNAAISGYVTHNQPREALELFRVMQRHERSSSNKFTLSSALAASAAIPCL 236

Query: 1141 QLGMEIHGHINRTGLDSDAVVWSALSDMYGKCGSIDQARYVFDRTFEKDVVSWTAMIDRY 962
            +LG EIHG++ RT L+ D VVWSAL D+YGKCGS+D+AR +FD+  ++DVVSWT MI R 
Sbjct: 237  RLGKEIHGYLIRTELNLDEVVWSALLDLYGKCGSLDEARGIFDQMKDRDVVSWTTMIHRC 296

Query: 961  FEGGRWEEGFLLFSDMLKSGIRPNEFTFAGILNACAHQTVEGLGKQVHGYMTRLEFDPVS 782
            FE GR EEGFLLF D+++SG+RPNE+TFAG+LNACA    E LGK+VHGYM    +DP S
Sbjct: 297  FEDGRREEGFLLFRDLMQSGVRPNEYTFAGVLNACADHAAEHLGKEVHGYMMHAGYDPGS 356

Query: 781  FAASALLHMYSKCGNMDGAYKVFRLLKRPDLFSWTSLISGYAQNGQPHEALRLFELLLET 602
            FA SAL+HMYSKCGN   A +VF  + +PDL SWTSLI GYAQNGQP EAL  FELLL++
Sbjct: 357  FAISALVHMYSKCGNTRVARRVFNEMHQPDLVSWTSLIVGYAQNGQPDEALHFFELLLQS 416

Query: 601  DTKPDQITFVGVLSACTHAGLVDRGLEYFNSIKEKHGLTHTQDHYACVVDLLSRAGRFGE 422
             TKPDQ+T+VGVLSACTHAGLVD+GLEYF+SIKEKHGL HT DHYACV+DLL+R+GRF E
Sbjct: 417  GTKPDQVTYVGVLSACTHAGLVDKGLEYFHSIKEKHGLMHTADHYACVIDLLARSGRFKE 476

Query: 421  VEDLIKKMPMKPDKFLWASLLGGCRIHGXXXXXXXXXXXLFEIEPENAATYVTLANIYAT 242
             E++I  MP+KPDKFLWASLLGGCRIHG           L+EIEPEN ATY+TLANIYA 
Sbjct: 477  AENIIDNMPVKPDKFLWASLLGGCRIHGNLELAKRAAKALYEIEPENPATYITLANIYAN 536

Query: 241  AGNWGEVAKIRKIMDEKGLVKKPGMSWIHLKRKIHVFLVGDESHPRSDEIHAFLGELSKR 62
            AG W EVA +RK MD  G+VKKPG SWI +KR++HVFLVGD SHP++ +IH FLGELSK+
Sbjct: 537  AGLWSEVANVRKDMDNMGIVKKPGKSWIEIKRQVHVFLVGDTSHPKTSDIHEFLGELSKK 596

Query: 61   MKAEGYVPDTDYVLHDVEEE 2
            +K EGYVPDT++VLHDVEEE
Sbjct: 597  IKEEGYVPDTNFVLHDVEEE 616


>gb|EXC24858.1| hypothetical protein L484_013224 [Morus notabilis]
          Length = 742

 Score =  756 bits (1953), Expect = 0.0
 Identities = 369/595 (62%), Positives = 465/595 (78%)
 Frame = -3

Query: 1786 FQPKTTRNPPSPGKTFFKRDSNGDQQLINRLCRAYKFKEAIEILCQQNRLEDAIQLLEHQ 1607
            F  +T  + P   K+FFK  SN    LI+RLC   ++KEA++ILC Q RL++A+QLL ++
Sbjct: 26   FFSETQLSKPLIEKSFFK--SNSKDGLISRLCIDKRYKEAVDILCDQKRLKEAVQLL-NR 82

Query: 1606 IDRPFADIYLVLLQSCIRQRALREGKRVHNHIRLSGFRPGIFVSNQILQLYCKCDSLEDA 1427
            I+RP A IY  +L  C+ +RAL EGK VH H + SG  PG+F+SN+ + LY KC  L DA
Sbjct: 83   IERPSALIYSTILCHCLHERALEEGKLVHAHTKASGLVPGLFISNRFIDLYAKCGCLGDA 142

Query: 1426 HQVFNEMGEKDVCSWNILISGYAKVGRVDLARRLFDEMPGRDNFSWTAMISGYVKHYMPK 1247
             +VF+EM +KD+CSWN +ISGYAKVG++D ARRLFDEMP RD++SW+AMISGYV+    K
Sbjct: 143  RKVFDEMPDKDLCSWNTMISGYAKVGKLDEARRLFDEMPDRDHYSWSAMISGYVRQDWAK 202

Query: 1246 DALELYRLMQKQGAFVPSKFTVSSALAATASIQSLQLGMEIHGHINRTGLDSDAVVWSAL 1067
            + LELYR+MQ+       KFTVSS LAA A+I SL++G EIHG++ RTGLDSD VV SAL
Sbjct: 203  EGLELYRMMQRCEKSRCDKFTVSSVLAAAAAIPSLRVGKEIHGYVMRTGLDSDEVVLSAL 262

Query: 1066 SDMYGKCGSIDQARYVFDRTFEKDVVSWTAMIDRYFEGGRWEEGFLLFSDMLKSGIRPNE 887
             DMYGKCG+ID+AR VFD+  E+DVV+WTAMIDR F  GR ++GF LF +++ SG RPN 
Sbjct: 263  LDMYGKCGNIDEARRVFDKMVERDVVTWTAMIDRCFRSGRSKDGFSLFMELMSSGTRPNG 322

Query: 886  FTFAGILNACAHQTVEGLGKQVHGYMTRLEFDPVSFAASALLHMYSKCGNMDGAYKVFRL 707
            FTF+G+LNACA      LGKQVHGYMTR+ FDP+SFAASAL+HMY+KCGN++ A +VF+ 
Sbjct: 323  FTFSGVLNACADHAAGDLGKQVHGYMTRIGFDPLSFAASALVHMYAKCGNIENAKRVFKG 382

Query: 706  LKRPDLFSWTSLISGYAQNGQPHEALRLFELLLETDTKPDQITFVGVLSACTHAGLVDRG 527
            + +PDL SWTSLI GYAQ+GQP+EAL++FE L ++  KPD +TFVGVLSACTHAGLVD+G
Sbjct: 383  MPKPDLVSWTSLIVGYAQHGQPNEALQMFESLHKSGIKPDHVTFVGVLSACTHAGLVDKG 442

Query: 526  LEYFNSIKEKHGLTHTQDHYACVVDLLSRAGRFGEVEDLIKKMPMKPDKFLWASLLGGCR 347
            LEYF+SIK KHGL +T DHYAC+VD+L+RAGRF E E++I  MP++PDKFLWASLLGGCR
Sbjct: 443  LEYFHSIKTKHGLGYTADHYACIVDILARAGRFKEAEEIINGMPIRPDKFLWASLLGGCR 502

Query: 346  IHGXXXXXXXXXXXLFEIEPENAATYVTLANIYATAGNWGEVAKIRKIMDEKGLVKKPGM 167
            I+G           LFEIEPEN ATYVTL NIYA AG WGEVAK+RK MD++ + KKPG+
Sbjct: 503  IYGNLELAKRAAEALFEIEPENPATYVTLGNIYAAAGMWGEVAKVRKTMDKRKVAKKPGL 562

Query: 166  SWIHLKRKIHVFLVGDESHPRSDEIHAFLGELSKRMKAEGYVPDTDYVLHDVEEE 2
            SWI +KR+ H F +GD SHPR +EI  FL +LSKRM+ EGY+P+T++VLHDVEEE
Sbjct: 563  SWIEIKREKHAFTIGDMSHPRFNEIDMFLQKLSKRMREEGYIPNTNFVLHDVEEE 617


>gb|ESW22273.1| hypothetical protein PHAVU_005G140500g [Phaseolus vulgaris]
          Length = 681

 Score =  745 bits (1924), Expect = 0.0
 Identities = 361/580 (62%), Positives = 450/580 (77%), Gaps = 1/580 (0%)
 Frame = -3

Query: 1738 FKRDSNGDQQLINRLCRAYKFKEAIEILCQQNRLEDAIQLLEHQID-RPFADIYLVLLQS 1562
            F+ D   + +L + +     F+EAI++LCQQ R+++A++LL H+ID RP A  Y  L+ +
Sbjct: 26   FRNDIRNNLKLKDPISENNNFEEAIDVLCQQKRVKEAVELL-HRIDHRPSARAYSTLIAA 84

Query: 1561 CIRQRALREGKRVHNHIRLSGFRPGIFVSNQILQLYCKCDSLEDAHQVFNEMGEKDVCSW 1382
            C+R RAL  G+RVH H + S F  G+F+ N++L +Y KC SL DA  +F+EMG +D+CSW
Sbjct: 85   CVRHRALELGRRVHAHTKGSNFVLGVFICNRLLDMYAKCGSLVDAQMLFDEMGHRDLCSW 144

Query: 1381 NILISGYAKVGRVDLARRLFDEMPGRDNFSWTAMISGYVKHYMPKDALELYRLMQKQGAF 1202
            N +I+GYAK+GR++ AR+LFDEMP RDNFSW A ISGYV H  P +ALEL+R+MQ+    
Sbjct: 145  NTMIAGYAKLGRLEQARKLFDEMPRRDNFSWNAAISGYVSHDRPWEALELFRVMQRCERS 204

Query: 1201 VPSKFTVSSALAATASIQSLQLGMEIHGHINRTGLDSDAVVWSALSDMYGKCGSIDQARY 1022
              +KFT+SSALAA+A+I  L+LG EIHG++ RT L+ D VVWSAL D+YGKCGS+D+AR 
Sbjct: 205  NSNKFTLSSALAASAAIPCLRLGKEIHGYLMRTELNLDEVVWSALLDLYGKCGSLDEARG 264

Query: 1021 VFDRTFEKDVVSWTAMIDRYFEGGRWEEGFLLFSDMLKSGIRPNEFTFAGILNACAHQTV 842
            +FD+   KDVVSWT MI R FE GR EEG  LF D++ SG+RPNE+TFAG+LN CA    
Sbjct: 265  IFDQMKSKDVVSWTTMIHRCFEDGRKEEGLSLFRDLMWSGVRPNEYTFAGVLNECADHAA 324

Query: 841  EGLGKQVHGYMTRLEFDPVSFAASALLHMYSKCGNMDGAYKVFRLLKRPDLFSWTSLISG 662
            E LGK+VHGYM R+ +DP SFA SAL+HMYSKCGN   A +VF  +   DL SWTSLI G
Sbjct: 325  EHLGKEVHGYMMRVGYDPCSFAVSALVHMYSKCGNTRVARRVFNHMPHKDLVSWTSLIVG 384

Query: 661  YAQNGQPHEALRLFELLLETDTKPDQITFVGVLSACTHAGLVDRGLEYFNSIKEKHGLTH 482
            YAQNG+P EAL  FELLL++ TKPDQITFVGVLSACTHAGLVD+GLEYF+SI+EKHGL H
Sbjct: 385  YAQNGEPEEALHFFELLLQSGTKPDQITFVGVLSACTHAGLVDKGLEYFHSIREKHGLMH 444

Query: 481  TQDHYACVVDLLSRAGRFGEVEDLIKKMPMKPDKFLWASLLGGCRIHGXXXXXXXXXXXL 302
            + DHYACV+DLL+R+GRF E E++I  MP+KPDKFLWASLLGGCRIHG           L
Sbjct: 445  SADHYACVIDLLARSGRFKEAENIIDNMPIKPDKFLWASLLGGCRIHGNLELAKRAAKAL 504

Query: 301  FEIEPENAATYVTLANIYATAGNWGEVAKIRKIMDEKGLVKKPGMSWIHLKRKIHVFLVG 122
            ++IEPEN ATY+TLANIYA AG W EVAK+RK MD +G+VKKPG SWI +KR++HVFLVG
Sbjct: 505  YDIEPENPATYITLANIYANAGLWTEVAKVRKDMDNRGIVKKPGKSWIEIKRQVHVFLVG 564

Query: 121  DESHPRSDEIHAFLGELSKRMKAEGYVPDTDYVLHDVEEE 2
            D SHP++  IH FL ELSK+MK EGYVPDT++VLHDVEEE
Sbjct: 565  DTSHPKTSHIHEFLVELSKKMKEEGYVPDTNFVLHDVEEE 604


>ref|XP_004152308.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like
            [Cucumis sativus] gi|449484855|ref|XP_004156999.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g37170-like [Cucumis sativus]
          Length = 724

 Score =  745 bits (1924), Expect = 0.0
 Identities = 365/593 (61%), Positives = 458/593 (77%), Gaps = 3/593 (0%)
 Frame = -3

Query: 1771 TRNPPSPGKTFFKRDS---NGDQQLINRLCRAYKFKEAIEILCQQNRLEDAIQLLEHQID 1601
            TR+  +P +T   +     +G    INRLC +  FKEAI+ILC Q+RL +A+QLL ++I+
Sbjct: 57   TRDLTTPSQTHIGKKIILFDGKDAYINRLCDSKLFKEAIDILCGQSRLREAVQLL-YRIE 115

Query: 1600 RPFADIYLVLLQSCIRQRALREGKRVHNHIRLSGFRPGIFVSNQILQLYCKCDSLEDAHQ 1421
            +P+A IYL LL+ C++QRAL+EGK+VH HI+ SG   G+++SN++L +Y KC SL DA +
Sbjct: 116  KPYASIYLTLLKFCLKQRALKEGKQVHAHIKTSG-SIGLYISNRLLDMYAKCGSLVDAEK 174

Query: 1420 VFNEMGEKDVCSWNILISGYAKVGRVDLARRLFDEMPGRDNFSWTAMISGYVKHYMPKDA 1241
            VF+EM  +D+CSWNI+ISGY K G  + AR LFD+MP RDNFSWTA+ISG V+H  P++A
Sbjct: 175  VFDEMVHRDLCSWNIMISGYVKGGNFEKARNLFDKMPNRDNFSWTAIISGCVQHNRPEEA 234

Query: 1240 LELYRLMQKQGAFVPSKFTVSSALAATASIQSLQLGMEIHGHINRTGLDSDAVVWSALSD 1061
            LELYRLMQK      +K T+SSALAA+A+I SL +G +IHGHI R GLDSD VVW +L D
Sbjct: 235  LELYRLMQKHDYSKSNKCTISSALAASAAIPSLHMGKKIHGHIMRMGLDSDEVVWCSLLD 294

Query: 1060 MYGKCGSIDQARYVFDRTFEKDVVSWTAMIDRYFEGGRWEEGFLLFSDMLKSGIRPNEFT 881
            MYGKCGSI++ARY+FD+  E+DVVSWT MI  Y + GR EEGF LF  ++ S I PN+FT
Sbjct: 295  MYGKCGSIEEARYIFDKMEERDVVSWTTMIHTYLKNGRREEGFALFRHLMNSNIMPNDFT 354

Query: 880  FAGILNACAHQTVEGLGKQVHGYMTRLEFDPVSFAASALLHMYSKCGNMDGAYKVFRLLK 701
            FAG+LNACA    E LGKQ+H YM R+ FD  S AASAL+HMYSKCG+++ A  VF +L 
Sbjct: 355  FAGVLNACADLAAEDLGKQIHAYMVRVGFDSFSSAASALVHMYSKCGDIENAKSVFEILP 414

Query: 700  RPDLFSWTSLISGYAQNGQPHEALRLFELLLETDTKPDQITFVGVLSACTHAGLVDRGLE 521
            +PDLFSWTSL+ GYAQ+GQ  +AL  FELLL++ TKPD I F+GVLSAC HAGLVD+GLE
Sbjct: 415  QPDLFSWTSLLVGYAQHGQHDKALHFFELLLKSGTKPDGIAFIGVLSACAHAGLVDKGLE 474

Query: 520  YFNSIKEKHGLTHTQDHYACVVDLLSRAGRFGEVEDLIKKMPMKPDKFLWASLLGGCRIH 341
            YF+SIKEKHGLT T DHYAC++DLL+RAG+F E E +I +MP+KPDK++WA+LLGGCRIH
Sbjct: 475  YFHSIKEKHGLTRTIDHYACIIDLLARAGQFTEAESIINEMPIKPDKYIWAALLGGCRIH 534

Query: 340  GXXXXXXXXXXXLFEIEPENAATYVTLANIYATAGNWGEVAKIRKIMDEKGLVKKPGMSW 161
            G           LFEIEPEN ATYVTLANIYA+AG   E A IR+ MD +G+VKKPGMSW
Sbjct: 535  GNLELAKRAAKSLFEIEPENPATYVTLANIYASAGMRAEEANIRETMDSRGIVKKPGMSW 594

Query: 160  IHLKRKIHVFLVGDESHPRSDEIHAFLGELSKRMKAEGYVPDTDYVLHDVEEE 2
            I ++R++HVF VGD SHP+S EI  +L ELSKRMK  GYVPDT++VLHDVE E
Sbjct: 595  IEIRREVHVFSVGDNSHPKSKEILEYLSELSKRMKEVGYVPDTNFVLHDVELE 647


>ref|XP_004511479.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At4g37170-like [Cicer arietinum]
          Length = 700

 Score =  739 bits (1907), Expect = 0.0
 Identities = 353/586 (60%), Positives = 452/586 (77%), Gaps = 7/586 (1%)
 Frame = -3

Query: 1738 FKRDSNGDQQLIN-------RLCRAYKFKEAIEILCQQNRLEDAIQLLEHQIDRPFADIY 1580
            F+R  +   Q +N       RL    K +E +++LCQQ RL++A+  L H+I +P   +Y
Sbjct: 4    FRRAFSSSSQFLNTKDTTLSRLSEHRKLEEIVDVLCQQKRLKEAVDFL-HRIHQPSPRLY 62

Query: 1579 LVLLQSCIRQRALREGKRVHNHIRLSGFRPGIFVSNQILQLYCKCDSLEDAHQVFNEMGE 1400
              L+ +C+  R+L  G++VH H + S F PGIF+SN++L +Y KC  L DA  +F+EM +
Sbjct: 63   SNLIAACLHHRSLELGRKVHAHTKASNFIPGIFISNRLLHMYVKCGGLIDAQSLFDEMSQ 122

Query: 1399 KDVCSWNILISGYAKVGRVDLARRLFDEMPGRDNFSWTAMISGYVKHYMPKDALELYRLM 1220
            KD+CSWN +I+GYA +G ++ AR+LFDEMP RDNFSW A ISGYV H+  ++AL+L+R M
Sbjct: 123  KDLCSWNTMIAGYANLGHLEQARKLFDEMPQRDNFSWNAAISGYVSHHRHREALDLFRTM 182

Query: 1219 QKQGAFVPSKFTVSSALAATASIQSLQLGMEIHGHINRTGLDSDAVVWSALSDMYGKCGS 1040
            Q+  +   + FT+SSALAA A+I+SL+LG EIHG++ RT L+ D VVWSAL D+YGKCGS
Sbjct: 183  QEHESSNSNMFTLSSALAAAAAIRSLRLGKEIHGYLVRTELNLDEVVWSALLDLYGKCGS 242

Query: 1039 IDQARYVFDRTFEKDVVSWTAMIDRYFEGGRWEEGFLLFSDMLKSGIRPNEFTFAGILNA 860
            +D+AR +FD+  ++DVVSWT MI RYFE GR E G  LF +++ SG+RPNE+TFAG+LNA
Sbjct: 243  LDEARGIFDQMVDRDVVSWTTMIHRYFEDGRKEGGLSLFRNLMGSGVRPNEYTFAGVLNA 302

Query: 859  CAHQTVEGLGKQVHGYMTRLEFDPVSFAASALLHMYSKCGNMDGAYKVFRLLKRPDLFSW 680
            CA   +E +GK+VHGYM R+ ++P SFAASAL+H+YSKCGN + A +VF  + RPDL S 
Sbjct: 303  CADLAIERIGKEVHGYMIRVGYNPCSFAASALVHLYSKCGNTEIARRVFNKMPRPDLVSC 362

Query: 679  TSLISGYAQNGQPHEALRLFELLLETDTKPDQITFVGVLSACTHAGLVDRGLEYFNSIKE 500
            TSLI GYAQNGQP  AL  FELLL + TKPD+ITFVGVLSACTHAGLVD+GLEYF+S+KE
Sbjct: 363  TSLIVGYAQNGQPDMALNFFELLLRSGTKPDEITFVGVLSACTHAGLVDKGLEYFHSVKE 422

Query: 499  KHGLTHTQDHYACVVDLLSRAGRFGEVEDLIKKMPMKPDKFLWASLLGGCRIHGXXXXXX 320
            KHGL HT DHYACV+DLL+R+GRF E E++I  MPMKPDKFLWASLLGGCRIHG      
Sbjct: 423  KHGLMHTADHYACVIDLLARSGRFKEAENIIDNMPMKPDKFLWASLLGGCRIHGNIELAE 482

Query: 319  XXXXXLFEIEPENAATYVTLANIYATAGNWGEVAKIRKIMDEKGLVKKPGMSWIHLKRKI 140
                 LFEIEPEN ATY+TLANIYA AG W +VAK+RK M+ +G+VKKPG SWI +KR++
Sbjct: 483  RAAKALFEIEPENPATYITLANIYANAGLWTKVAKVRKDMENRGIVKKPGKSWIEIKRQV 542

Query: 139  HVFLVGDESHPRSDEIHAFLGELSKRMKAEGYVPDTDYVLHDVEEE 2
            HVFLVGD+SHP++ +IH FLGELS +MK EGYVPDT++VLHDVEEE
Sbjct: 543  HVFLVGDKSHPKTSDIHEFLGELSTKMKEEGYVPDTNFVLHDVEEE 588


>ref|XP_003610927.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355512262|gb|AES93885.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 802

 Score =  734 bits (1894), Expect = 0.0
 Identities = 348/560 (62%), Positives = 439/560 (78%)
 Frame = -3

Query: 1681 KFKEAIEILCQQNRLEDAIQLLEHQIDRPFADIYLVLLQSCIRQRALREGKRVHNHIRLS 1502
            +F+E IE+ CQQNRL++A+  L H+I +P   +Y  L+ +C+R R L  GKRVH H + S
Sbjct: 34   RFEEIIELFCQQNRLKEAVDYL-HRIPQPSPRLYSTLIAACLRHRKLELGKRVHAHTKAS 92

Query: 1501 GFRPGIFVSNQILQLYCKCDSLEDAHQVFNEMGEKDVCSWNILISGYAKVGRVDLARRLF 1322
             F PGI +SN+++ +Y KC SL DA  +F+E+ +KD+CSWN +ISGYA VGR++ AR+LF
Sbjct: 93   NFIPGIVISNRLIHMYAKCGSLVDAQMLFDEIPQKDLCSWNTMISGYANVGRIEQARKLF 152

Query: 1321 DEMPGRDNFSWTAMISGYVKHYMPKDALELYRLMQKQGAFVPSKFTVSSALAATASIQSL 1142
            DEMP RDNFSW A+ISGYV      +AL+L+R+MQ+  +   + FT+SSALAA A+I SL
Sbjct: 153  DEMPHRDNFSWNAVISGYVSQGWYMEALDLFRMMQENESSNCNMFTLSSALAAAAAISSL 212

Query: 1141 QLGMEIHGHINRTGLDSDAVVWSALSDMYGKCGSIDQARYVFDRTFEKDVVSWTAMIDRY 962
            + G EIHG++ R+GL+ D VVW+AL D+YGKCGS+++AR +FD+  +KD+VSWT MI R 
Sbjct: 213  RRGKEIHGYLIRSGLELDEVVWTALLDLYGKCGSLNEARGIFDQMADKDIVSWTTMIHRC 272

Query: 961  FEGGRWEEGFLLFSDMLKSGIRPNEFTFAGILNACAHQTVEGLGKQVHGYMTRLEFDPVS 782
            FE GR +EGF LF D++ SG+RPNE+TFAG+LNACA    E +GK+VHGYMTR+ +DP S
Sbjct: 273  FEDGRKKEGFSLFRDLMGSGVRPNEYTFAGVLNACADLAAEQMGKEVHGYMTRVGYDPFS 332

Query: 781  FAASALLHMYSKCGNMDGAYKVFRLLKRPDLFSWTSLISGYAQNGQPHEALRLFELLLET 602
            FAASAL+H+YSKCGN + A +VF  + RPDL SWTSLI GYAQNGQP  AL+ FE LL +
Sbjct: 333  FAASALVHVYSKCGNTETARRVFNQMPRPDLVSWTSLIVGYAQNGQPDMALQFFESLLRS 392

Query: 601  DTKPDQITFVGVLSACTHAGLVDRGLEYFNSIKEKHGLTHTQDHYACVVDLLSRAGRFGE 422
             TKPD+ITFVGVLSACTHAGLVD GLEYF+S+KEKHGL HT DHYACV+DLL+R+GRF E
Sbjct: 393  GTKPDEITFVGVLSACTHAGLVDIGLEYFHSVKEKHGLVHTADHYACVIDLLARSGRFKE 452

Query: 421  VEDLIKKMPMKPDKFLWASLLGGCRIHGXXXXXXXXXXXLFEIEPENAATYVTLANIYAT 242
             E++I  MPMKPDKFLWASLLGGCRIHG           LFE+EPEN ATY+TL+NIYA 
Sbjct: 453  AENIIDNMPMKPDKFLWASLLGGCRIHGNIELAERAAKALFELEPENPATYITLSNIYAN 512

Query: 241  AGNWGEVAKIRKIMDEKGLVKKPGMSWIHLKRKIHVFLVGDESHPRSDEIHAFLGELSKR 62
            AG W E  K+R  MD +G+VKKPG SWI +KR++HVFLVGD SHP+  +IH +LGELSK+
Sbjct: 513  AGLWTEETKVRNDMDNRGIVKKPGKSWIEIKRQVHVFLVGDTSHPKISDIHEYLGELSKK 572

Query: 61   MKAEGYVPDTDYVLHDVEEE 2
            MK EGYV DT++VLHDVEEE
Sbjct: 573  MKEEGYVADTNFVLHDVEEE 592


>ref|XP_006411944.1| hypothetical protein EUTSA_v10026762mg [Eutrema salsugineum]
            gi|557113114|gb|ESQ53397.1| hypothetical protein
            EUTSA_v10026762mg [Eutrema salsugineum]
          Length = 694

 Score =  713 bits (1841), Expect = 0.0
 Identities = 347/604 (57%), Positives = 447/604 (74%)
 Frame = -3

Query: 1813 ASSFSSPYPFQPKTTRNPPSPGKTFFKRDSNGDQQLINRLCRAYKFKEAIEILCQQNRLE 1634
            +SS++S    Q +       P K FFK + + D  ++ RLC+  +F EAI++LC Q  L 
Sbjct: 22   SSSYASQIDIQKRF------PEKKFFKSNRD-DVGVVERLCKDKRFGEAIDVLCAQKLLG 74

Query: 1633 DAIQLLEHQIDRPFADIYLVLLQSCIRQRALREGKRVHNHIRLSGFRPGIFVSNQILQLY 1454
            +A+ LL  +  +P A  Y  L+Q C ++RAL EGK+VH HI+ SGF PG+ + N++L +Y
Sbjct: 75   EAVHLLG-RAKKPPASTYCNLIQVCSQKRALEEGKKVHEHIKNSGFVPGVVICNRLLGMY 133

Query: 1453 CKCDSLEDAHQVFNEMGEKDVCSWNILISGYAKVGRVDLARRLFDEMPGRDNFSWTAMIS 1274
             KC SL DA ++F+EM  KDVCSWNI+++GYA+VG ++ AR+LFDEMP RD++SWTAM++
Sbjct: 134  AKCGSLIDARKLFDEMPNKDVCSWNIMVNGYAEVGLLEEARKLFDEMPERDSYSWTAMVT 193

Query: 1273 GYVKHYMPKDALELYRLMQKQGAFVPSKFTVSSALAATASIQSLQLGMEIHGHINRTGLD 1094
            GY K   P+DAL +Y LMQK     P+ FTVSSA+AA A+I  ++ G EIHGHI R GLD
Sbjct: 194  GYAKKNQPEDALVMYSLMQKPPKSKPNIFTVSSAVAAAAAIPCIRRGKEIHGHIFRAGLD 253

Query: 1093 SDAVVWSALSDMYGKCGSIDQARYVFDRTFEKDVVSWTAMIDRYFEGGRWEEGFLLFSDM 914
            SD V+WS+L DMYGKCG ID+AR++FD+  +KDVVSWT+MIDRYF+  RW EGF LFS++
Sbjct: 254  SDEVLWSSLMDMYGKCGCIDEARHIFDKIVDKDVVSWTSMIDRYFKSRRWREGFCLFSEL 313

Query: 913  LKSGIRPNEFTFAGILNACAHQTVEGLGKQVHGYMTRLEFDPVSFAASALLHMYSKCGNM 734
            + S  RPNE+TFAG+LNAC   T E LGKQVHGYMTR+ +DP SFA+S+L+ MY+KCGN+
Sbjct: 314  VSSCERPNEYTFAGVLNACTDLTTEELGKQVHGYMTRIGYDPYSFASSSLVDMYTKCGNI 373

Query: 733  DGAYKVFRLLKRPDLFSWTSLISGYAQNGQPHEALRLFELLLETDTKPDQITFVGVLSAC 554
              A  V     +PDLFSWTSLI GYAQNG+P +AL+ F+LLLE+ TKPD ITFV VLSAC
Sbjct: 374  QSAKHVVDGCPKPDLFSWTSLIGGYAQNGEPEKALKYFDLLLESGTKPDHITFVNVLSAC 433

Query: 553  THAGLVDRGLEYFNSIKEKHGLTHTQDHYACVVDLLSRAGRFGEVEDLIKKMPMKPDKFL 374
            THAGLV++GLEYF+SI EKHGL+HT DHY C+VDLL+R+GRF +++ +I +MPMKP+KFL
Sbjct: 434  THAGLVEKGLEYFHSITEKHGLSHTDDHYTCLVDLLARSGRFEQLKGIISEMPMKPNKFL 493

Query: 373  WASLLGGCRIHGXXXXXXXXXXXLFEIEPENAATYVTLANIYATAGNWGEVAKIRKIMDE 194
            WAS+LGGC  HG           LF+IEPEN ATYVT+ANIYA AG W +  ++RK M E
Sbjct: 494  WASVLGGCSTHGNVDLAEEAAQELFKIEPENPATYVTMANIYAAAGKWEDEGRVRKRMRE 553

Query: 193  KGLVKKPGMSWIHLKRKIHVFLVGDESHPRSDEIHAFLGELSKRMKAEGYVPDTDYVLHD 14
             G+ K PG SW  +KRK HVF+  D SHP  + +  FL EL K+M  EGYVP TD VLHD
Sbjct: 554  IGVTKSPGSSWTEIKRKRHVFIASDTSHPMYNRVVEFLAELRKKMIEEGYVPATDLVLHD 613

Query: 13   VEEE 2
            VE+E
Sbjct: 614  VEDE 617


>ref|XP_006283244.1| hypothetical protein CARUB_v10004277mg [Capsella rubella]
            gi|482551949|gb|EOA16142.1| hypothetical protein
            CARUB_v10004277mg [Capsella rubella]
          Length = 690

 Score =  710 bits (1832), Expect = 0.0
 Identities = 347/582 (59%), Positives = 439/582 (75%)
 Frame = -3

Query: 1747 KTFFKRDSNGDQQLINRLCRAYKFKEAIEILCQQNRLEDAIQLLEHQIDRPFADIYLVLL 1568
            K FF   +N D  ++ RLCRA KF EAI++LC Q  L +A+Q+L  +  +P A  Y  L+
Sbjct: 34   KKFFN-SNNEDVGVVERLCRANKFGEAIDVLCGQKLLGEAVQIL-CRAKKPPASTYCNLI 91

Query: 1567 QSCIRQRALREGKRVHNHIRLSGFRPGIFVSNQILQLYCKCDSLEDAHQVFNEMGEKDVC 1388
            Q C + RAL EGK+VH HIR SGF PGI + N++L +Y KC SL DA +VF++M ++DVC
Sbjct: 92   QVCSQTRALEEGKKVHEHIRTSGFVPGIVIWNRLLGMYAKCGSLVDARKVFDDMPKRDVC 151

Query: 1387 SWNILISGYAKVGRVDLARRLFDEMPGRDNFSWTAMISGYVKHYMPKDALELYRLMQKQG 1208
            SWN++++GYA+VG VD AR+LFDEMP RD++SWTAM++GYVK   P++AL LY LMQ+  
Sbjct: 152  SWNLMVNGYAEVGLVDEARKLFDEMPQRDSYSWTAMVAGYVKKDQPEEALVLYSLMQRVP 211

Query: 1207 AFVPSKFTVSSALAATASIQSLQLGMEIHGHINRTGLDSDAVVWSALSDMYGKCGSIDQA 1028
               P+ FTVSSA+AA A+I  ++ G EIHGHI R GLDSD V+WS+L DMYGKCG ID+A
Sbjct: 212  NSRPNIFTVSSAVAAAAAIPCIRRGKEIHGHIVRAGLDSDEVLWSSLIDMYGKCGCIDEA 271

Query: 1027 RYVFDRTFEKDVVSWTAMIDRYFEGGRWEEGFLLFSDMLKSGIRPNEFTFAGILNACAHQ 848
            R +FD+   KDVVSWT+MIDRYF+  RW EGF LFS+++ S  RPNE+TFAGILNACA  
Sbjct: 272  RNIFDKILVKDVVSWTSMIDRYFKSRRWREGFSLFSELIGSCERPNEYTFAGILNACADL 331

Query: 847  TVEGLGKQVHGYMTRLEFDPVSFAASALLHMYSKCGNMDGAYKVFRLLKRPDLFSWTSLI 668
            T E LGKQVHGYMTR+ FDP SFA+S+L+ MY+KCGN++ A  V     +PDL SWTSLI
Sbjct: 332  TKEDLGKQVHGYMTRIGFDPYSFASSSLVDMYTKCGNIESAKHVVDGCPKPDLVSWTSLI 391

Query: 667  SGYAQNGQPHEALRLFELLLETDTKPDQITFVGVLSACTHAGLVDRGLEYFNSIKEKHGL 488
             GYAQNG+P EAL+ F+LLL++ TKPD +TFV VLSACTHAGLV++GLEYF+SI EKHGL
Sbjct: 392  GGYAQNGKPDEALKYFDLLLKSGTKPDHVTFVNVLSACTHAGLVEKGLEYFHSITEKHGL 451

Query: 487  THTQDHYACVVDLLSRAGRFGEVEDLIKKMPMKPDKFLWASLLGGCRIHGXXXXXXXXXX 308
            +HT DHY C+VDLL+R+GRF +++ +  +MPMKP K+LWAS+LGGC  +G          
Sbjct: 452  SHTSDHYTCLVDLLARSGRFEQLKSITSEMPMKPSKYLWASVLGGCNTYGNTDLAEEAAQ 511

Query: 307  XLFEIEPENAATYVTLANIYATAGNWGEVAKIRKIMDEKGLVKKPGMSWIHLKRKIHVFL 128
             LF+IEPEN  TYVT+ANIYA+AG W E  K+RK M E G+ K+PG SW  +KRK HVF+
Sbjct: 512  ELFKIEPENPVTYVTMANIYASAGKWEEEGKMRKRMQEIGVTKRPGSSWTEIKRKRHVFI 571

Query: 127  VGDESHPRSDEIHAFLGELSKRMKAEGYVPDTDYVLHDVEEE 2
              D SHP  ++I  FL EL K+MK EGYVP T  VLHDVE+E
Sbjct: 572  AADTSHPMYNQIVEFLCELRKKMKEEGYVPATSLVLHDVEDE 613


>ref|XP_002869013.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297314849|gb|EFH45272.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 693

 Score =  707 bits (1826), Expect = 0.0
 Identities = 344/582 (59%), Positives = 435/582 (74%)
 Frame = -3

Query: 1747 KTFFKRDSNGDQQLINRLCRAYKFKEAIEILCQQNRLEDAIQLLEHQIDRPFADIYLVLL 1568
            K FF         ++ RLCRA +F EAI++LC Q  L +A+QLL  +  +P A  Y  L+
Sbjct: 36   KKFFDSKLEDGGVVVERLCRANRFGEAIDVLCGQKLLREAVQLLG-RAKKPPASTYCNLI 94

Query: 1567 QSCIRQRALREGKRVHNHIRLSGFRPGIFVSNQILQLYCKCDSLEDAHQVFNEMGEKDVC 1388
            Q C + RAL EGK+VH HIR SGF PGI + N+IL +Y KC SL DA +VF+EM E+DVC
Sbjct: 95   QVCSQTRALEEGKKVHEHIRTSGFVPGIVIWNRILGMYAKCGSLVDARKVFDEMPERDVC 154

Query: 1387 SWNILISGYAKVGRVDLARRLFDEMPGRDNFSWTAMISGYVKHYMPKDALELYRLMQKQG 1208
            SWN++++GYA+VG ++ AR LFDEMP RD++SWTAM++GYVK   P++AL LY LMQ+  
Sbjct: 155  SWNVMVNGYAEVGLLEEARNLFDEMPERDSYSWTAMVTGYVKKDQPEEALVLYSLMQRVP 214

Query: 1207 AFVPSKFTVSSALAATASIQSLQLGMEIHGHINRTGLDSDAVVWSALSDMYGKCGSIDQA 1028
               P+ FTVSSA+AA A+I+ ++ G EIHGHI R GLDSD V+WS+L DMYGKCG ID+A
Sbjct: 215  NSKPNIFTVSSAVAAAAAIKCIRRGKEIHGHIVRAGLDSDEVLWSSLMDMYGKCGCIDEA 274

Query: 1027 RYVFDRTFEKDVVSWTAMIDRYFEGGRWEEGFLLFSDMLKSGIRPNEFTFAGILNACAHQ 848
            R +FD+  +KDVVSWT+MIDRYF+  RW EGF LFS+++ S  RPNE+TF+G+LNACA  
Sbjct: 275  RNIFDKIIDKDVVSWTSMIDRYFKSSRWREGFSLFSELIGSCERPNEYTFSGVLNACADL 334

Query: 847  TVEGLGKQVHGYMTRLEFDPVSFAASALLHMYSKCGNMDGAYKVFRLLKRPDLFSWTSLI 668
            T E LG+QVHGYMTR+ FDP SFA+S+L+ MY+KCGN++ A  V     +PDL S TSLI
Sbjct: 335  TTEELGRQVHGYMTRVGFDPYSFASSSLIDMYTKCGNIESARHVVDGCPKPDLVSLTSLI 394

Query: 667  SGYAQNGQPHEALRLFELLLETDTKPDQITFVGVLSACTHAGLVDRGLEYFNSIKEKHGL 488
             GYAQNG+P EAL+ F+LLL++ TKPD +TFV VLSACTHAGLV++GLE+F SI EKH L
Sbjct: 395  GGYAQNGKPDEALKYFDLLLKSGTKPDHVTFVNVLSACTHAGLVEKGLEFFYSITEKHDL 454

Query: 487  THTQDHYACVVDLLSRAGRFGEVEDLIKKMPMKPDKFLWASLLGGCRIHGXXXXXXXXXX 308
            THT DHY C+VDLL+R+GRF +++ ++ +MPMKP KFLWAS+LGGC  +G          
Sbjct: 455  THTSDHYTCLVDLLARSGRFEQLKSVLSEMPMKPSKFLWASVLGGCSTYGNIDLAEEAAQ 514

Query: 307  XLFEIEPENAATYVTLANIYATAGNWGEVAKIRKIMDEKGLVKKPGMSWIHLKRKIHVFL 128
             LF+IEPEN  TYVT+ANIYA AG W E  K+RK M E G+ KKPG SW  +KRK HVF+
Sbjct: 515  ELFKIEPENPVTYVTMANIYAAAGKWEEEGKMRKRMQEIGITKKPGSSWTEIKRKRHVFI 574

Query: 127  VGDESHPRSDEIHAFLGELSKRMKAEGYVPDTDYVLHDVEEE 2
              D SHP  ++I  FLGEL K+MK EGYVP T  VLHDVE+E
Sbjct: 575  AADTSHPMYNQIIEFLGELRKKMKEEGYVPATSLVLHDVEDE 616


>ref|NP_195434.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75097747|sp|O23169.1|PP353_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g37170 gi|2464864|emb|CAB16758.1| putative protein
            [Arabidopsis thaliana] gi|7270666|emb|CAB80383.1|
            putative protein [Arabidopsis thaliana]
            gi|332661361|gb|AEE86761.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 691

 Score =  699 bits (1804), Expect = 0.0
 Identities = 339/582 (58%), Positives = 434/582 (74%)
 Frame = -3

Query: 1747 KTFFKRDSNGDQQLINRLCRAYKFKEAIEILCQQNRLEDAIQLLEHQIDRPFADIYLVLL 1568
            K FF  +      ++ RLCRA +F EAI++LC Q  L +A+QLL  +  +P A  Y  L+
Sbjct: 34   KKFFNPNHEDGGVVVERLCRANRFGEAIDVLCGQKLLREAVQLLG-RAKKPPASTYCNLI 92

Query: 1567 QSCIRQRALREGKRVHNHIRLSGFRPGIFVSNQILQLYCKCDSLEDAHQVFNEMGEKDVC 1388
            Q C + RAL EGK+VH HIR SGF PGI + N++L++Y KC SL DA +VF+EM  +D+C
Sbjct: 93   QVCSQTRALEEGKKVHEHIRTSGFVPGIVIWNRLLRMYAKCGSLVDARKVFDEMPNRDLC 152

Query: 1387 SWNILISGYAKVGRVDLARRLFDEMPGRDNFSWTAMISGYVKHYMPKDALELYRLMQKQG 1208
            SWN++++GYA+VG ++ AR+LFDEM  +D++SWTAM++GYVK   P++AL LY LMQ+  
Sbjct: 153  SWNVMVNGYAEVGLLEEARKLFDEMTEKDSYSWTAMVTGYVKKDQPEEALVLYSLMQRVP 212

Query: 1207 AFVPSKFTVSSALAATASIQSLQLGMEIHGHINRTGLDSDAVVWSALSDMYGKCGSIDQA 1028
               P+ FTVS A+AA A+++ ++ G EIHGHI R GLDSD V+WS+L DMYGKCG ID+A
Sbjct: 213  NSRPNIFTVSIAVAAAAAVKCIRRGKEIHGHIVRAGLDSDEVLWSSLMDMYGKCGCIDEA 272

Query: 1027 RYVFDRTFEKDVVSWTAMIDRYFEGGRWEEGFLLFSDMLKSGIRPNEFTFAGILNACAHQ 848
            R +FD+  EKDVVSWT+MIDRYF+  RW EGF LFS+++ S  RPNE+TFAG+LNACA  
Sbjct: 273  RNIFDKIVEKDVVSWTSMIDRYFKSSRWREGFSLFSELVGSCERPNEYTFAGVLNACADL 332

Query: 847  TVEGLGKQVHGYMTRLEFDPVSFAASALLHMYSKCGNMDGAYKVFRLLKRPDLFSWTSLI 668
            T E LGKQVHGYMTR+ FDP SFA+S+L+ MY+KCGN++ A  V     +PDL SWTSLI
Sbjct: 333  TTEELGKQVHGYMTRVGFDPYSFASSSLVDMYTKCGNIESAKHVVDGCPKPDLVSWTSLI 392

Query: 667  SGYAQNGQPHEALRLFELLLETDTKPDQITFVGVLSACTHAGLVDRGLEYFNSIKEKHGL 488
             G AQNGQP EAL+ F+LLL++ TKPD +TFV VLSACTHAGLV++GLE+F SI EKH L
Sbjct: 393  GGCAQNGQPDEALKYFDLLLKSGTKPDHVTFVNVLSACTHAGLVEKGLEFFYSITEKHRL 452

Query: 487  THTQDHYACVVDLLSRAGRFGEVEDLIKKMPMKPDKFLWASLLGGCRIHGXXXXXXXXXX 308
            +HT DHY C+VDLL+R+GRF +++ +I +MPMKP KFLWAS+LGGC  +G          
Sbjct: 453  SHTSDHYTCLVDLLARSGRFEQLKSVISEMPMKPSKFLWASVLGGCSTYGNIDLAEEAAQ 512

Query: 307  XLFEIEPENAATYVTLANIYATAGNWGEVAKIRKIMDEKGLVKKPGMSWIHLKRKIHVFL 128
             LF+IEPEN  TYVT+ANIYA AG W E  K+RK M E G+ K+PG SW  +KRK HVF+
Sbjct: 513  ELFKIEPENPVTYVTMANIYAAAGKWEEEGKMRKRMQEIGVTKRPGSSWTEIKRKRHVFI 572

Query: 127  VGDESHPRSDEIHAFLGELSKRMKAEGYVPDTDYVLHDVEEE 2
              D SHP  ++I  FL EL K+MK EGYVP T  VLHDVE+E
Sbjct: 573  AADTSHPMYNQIVEFLRELRKKMKEEGYVPATSLVLHDVEDE 614


>gb|EPS71377.1| hypothetical protein M569_03380, partial [Genlisea aurea]
          Length = 639

 Score =  677 bits (1747), Expect = 0.0
 Identities = 335/556 (60%), Positives = 416/556 (74%), Gaps = 3/556 (0%)
 Frame = -3

Query: 1660 ILCQQNRLEDAIQLLEHQIDRPFADIYLVLLQSCIRQRALREGKRVHNHIRLSGFRPGIF 1481
            +LC++ RL+  + L+E+Q+ R  A  Y  +LQ CI ++AL EGK V   I+ SGF PG F
Sbjct: 11   LLCRERRLDAVVNLVENQVQRFSAASYATVLQLCIEKKALDEGKIVVAQIKASGFVPGTF 70

Query: 1480 VSNQILQLYCKCDSLEDAHQVFNEMGEKDVCSWNILISGYAKVGRVDLARRLFDEMPGRD 1301
            VSN+IL L+CKC SL +A  +F+EM  +D+CSWNI++SGY K G +  AR +FDEMP RD
Sbjct: 71   VSNRILDLFCKCGSLFEARTLFDEMNCRDLCSWNIMLSGYTKCGLISDARNMFDEMPQRD 130

Query: 1300 NFSWTAMISGYVKHYMPKDALELYRLMQKQ--GAFVPSKFTVSSALAATASIQSLQLGME 1127
            NF+WTA+ISGYVKH  P+ ALELYRLM ++   +   +KFT+S ALAA+AS++SL  G E
Sbjct: 131  NFTWTALISGYVKHNEPEHALELYRLMHEEEISSACDNKFTISIALAASASLKSLCSGKE 190

Query: 1126 IHGHINRTGLDSDAVVWSALSDMYGKCGSIDQARYVFDRTFEKDVVSWTAMIDRYFEGGR 947
            IHG I RT  DSDAVVWSAL DMYGKCGS+++AR++FD T +KD+VSWT M+D YF  G+
Sbjct: 191  IHGRIIRTSRDSDAVVWSALLDMYGKCGSVNEARHIFDTTPDKDIVSWTTMMDCYFGDGK 250

Query: 946  WEEGFLLFSDMLK-SGIRPNEFTFAGILNACAHQTVEGLGKQVHGYMTRLEFDPVSFAAS 770
            W EGF LFS +L  SG  PN+FT +G+L AC   T E +G+QVH  M    F P SFAAS
Sbjct: 251  WTEGFSLFSHLLSCSGNEPNDFTISGVLKACTFCTAEEIGRQVHARMMLTGFSPDSFAAS 310

Query: 769  ALLHMYSKCGNMDGAYKVFRLLKRPDLFSWTSLISGYAQNGQPHEALRLFELLLETDTKP 590
             L+HMY+KCG+++ A KVF ++  PDL SWTSLI+GYAQNGQ  EALRLF+LLLE+  +P
Sbjct: 311  TLVHMYTKCGSIESARKVFSMIPEPDLVSWTSLINGYAQNGQHREALRLFDLLLESGNRP 370

Query: 589  DQITFVGVLSACTHAGLVDRGLEYFNSIKEKHGLTHTQDHYACVVDLLSRAGRFGEVEDL 410
            D ITFVGVLSACTHAGLV  GLEYF+SI EKHGL+HT DHYACVVDLLSRAGRF E E++
Sbjct: 371  DHITFVGVLSACTHAGLVSEGLEYFHSITEKHGLSHTPDHYACVVDLLSRAGRFEEAENV 430

Query: 409  IKKMPMKPDKFLWASLLGGCRIHGXXXXXXXXXXXLFEIEPENAATYVTLANIYATAGNW 230
            I +MPMKPD+F+W SLL GCRIHG           L  +EP+NAATYVTLANIYA+ G W
Sbjct: 431  INEMPMKPDRFIWGSLLNGCRIHGNYVLAKEAAEALLRLEPDNAATYVTLANIYASEGKW 490

Query: 229  GEVAKIRKIMDEKGLVKKPGMSWIHLKRKIHVFLVGDESHPRSDEIHAFLGELSKRMKAE 50
             E  ++RK+M+E   VKKPGMSWI +KRK HVF  GD+     +EI  FL E+SKRMK E
Sbjct: 491  DEAGEMRKVMEEGKAVKKPGMSWISMKRKTHVFSAGDQ----PEEIVEFLKEVSKRMKEE 546

Query: 49   GYVPDTDYVLHDVEEE 2
            GY+P+T+ V  DVEEE
Sbjct: 547  GYIPETNLVAQDVEEE 562


Top