BLASTX nr result

ID: Papaver25_contig00029206 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver25_contig00029206
         (1825 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN67401.1| hypothetical protein VITISV_025967 [Vitis vinifera]   730   0.0  
ref|XP_002266822.1| PREDICTED: pentatricopeptide repeat-containi...   726   0.0  
ref|XP_007030407.1| Pentatricopeptide repeat (PPR-like) superfam...   721   0.0  
ref|XP_004304772.1| PREDICTED: pentatricopeptide repeat-containi...   698   0.0  
ref|XP_002521980.1| pentatricopeptide repeat-containing protein,...   689   0.0  
gb|EXB93167.1| hypothetical protein L484_024505 [Morus notabilis]     684   0.0  
ref|XP_006442665.1| hypothetical protein CICLE_v10019446mg [Citr...   681   0.0  
ref|XP_006487702.1| PREDICTED: pentatricopeptide repeat-containi...   681   0.0  
ref|XP_006371094.1| hypothetical protein POPTR_0019s03630g [Popu...   672   0.0  
ref|XP_004142590.1| PREDICTED: pentatricopeptide repeat-containi...   662   0.0  
ref|XP_007205118.1| hypothetical protein PRUPE_ppa004835mg [Prun...   660   0.0  
ref|XP_006361415.1| PREDICTED: pentatricopeptide repeat-containi...   639   e-180
ref|XP_002884468.1| pentatricopeptide repeat-containing protein ...   638   e-180
ref|XP_006296608.1| hypothetical protein CARUB_v10013258mg [Caps...   637   e-180
ref|NP_566237.1| pentatricopeptide repeat-containing protein [Ar...   636   e-179
dbj|BAD95034.1| hypothetical protein [Arabidopsis thaliana]           632   e-178
ref|XP_004236781.1| PREDICTED: pentatricopeptide repeat-containi...   628   e-177
ref|XP_007144014.1| hypothetical protein PHAVU_007G121900g [Phas...   610   e-172
ref|XP_003555568.1| PREDICTED: pentatricopeptide repeat-containi...   608   e-171
ref|XP_003590960.1| Pentatricopeptide repeat-containing protein ...   608   e-171

>emb|CAN67401.1| hypothetical protein VITISV_025967 [Vitis vinifera]
          Length = 592

 Score =  730 bits (1884), Expect = 0.0
 Identities = 366/593 (61%), Positives = 446/593 (75%), Gaps = 5/593 (0%)
 Frame = +2

Query: 20   TQRSH*SMAVLY-TEFFPYCYSLNIHSKASPLPCHHQKSIIISCKNSTTND----RISSK 184
            TQ +  S+  +Y T+FFP+C   +   K +    H   + I++C+N   ND    R S K
Sbjct: 3    TQHNAGSLMTIYSTDFFPHCPPFSPQLKPTS---HSHHTSIVTCRNPNPNDGYNSRNSPK 59

Query: 185  IRVSAETRSIHFQSLDFQETHFIKLLNRSCKAGKYSETLYFLECMVNKGYKPDVILCTKL 364
            + VSAE R  H QS DF+ETH +KLLNRSCKAGK++E+LYFLEC+VNKGY PDVILCTKL
Sbjct: 60   VGVSAEARPAHLQSYDFRETHLMKLLNRSCKAGKFNESLYFLECLVNKGYTPDVILCTKL 119

Query: 365  VKGFLNARNVAKAVRVMAILESHGEPDVFAYNALISGFCKMNQIESANKVLHRMRNRGCL 544
            +KGF N +N+ KA RVM ILESH EPDVFAYNA+ISGFCK+NQIE+A +VL+RM+ RG L
Sbjct: 120  IKGFFNFKNIEKASRVMEILESHTEPDVFAYNAVISGFCKVNQIEAATQVLNRMKARGFL 179

Query: 545  PDIVTYNILIGSLCSRGKLELAREVFNQFALYDCHPXXXXXXXXXXXXXXXXXXXEAXXX 724
            PDIVTYNI+IGSLC+R KL LA  V +Q  L +C P                   EA   
Sbjct: 180  PDIVTYNIMIGSLCNRRKLGLALTVLDQLLLDNCMPTVITYTILIEATIVEGGINEAMKL 239

Query: 725  XXXXXXXXXQPDMYTYNAVVRGMCKEGMVDQAFDFIMNLAELGCEPDEISYNIVLRAFLS 904
                      PDMYTYNA++RGMCKEGMV++A + I +L   GCEPD ISYNI+LRAFL+
Sbjct: 240  LEEMLARGLLPDMYTYNAIIRGMCKEGMVERAAELITSLTSKGCEPDVISYNILLRAFLN 299

Query: 905  RGRWIDGERLIEKMISRGCEPNVVTYSILISSLCHDGKLKDAIAVLKTMMAKGLTPDAYS 1084
            +G+W +GE+L+ +M SRGCEPN VTYSILISSLC  G++ +AI+VLK M+ K LTPD YS
Sbjct: 300  QGKWDEGEKLVAEMFSRGCEPNKVTYSILISSLCRFGRIDEAISVLKVMIEKELTPDTYS 359

Query: 1085 FDPVISALCKDGKLDLAIEFLDFMISNGCSPDIVNYNTILAALCKSGKAEEALEIFEKLS 1264
            +DP+ISALCK+G+LDLAI  +D+MISNGC PDIVNYNTILAALCK+G A +ALEIF KL 
Sbjct: 360  YDPLISALCKEGRLDLAIGIMDYMISNGCLPDIVNYNTILAALCKNGNANQALEIFNKLR 419

Query: 1265 QTDSPPNVSSYNTMISGLWNSGDPKRALNMVSEMVDKGIDADQITYNSLISCLCRDAMVE 1444
                PPNVSSYNTMIS LW+ GD  RAL MV  M+ KGID D+ITYNSLISCLCRD +VE
Sbjct: 420  GMGCPPNVSSYNTMISALWSCGDRSRALGMVPAMISKGIDPDEITYNSLISCLCRDGLVE 479

Query: 1445 EAIELLVDMQNRGFKPTIISYNTMLLGLCKIHNIDTAIDILAVMVENGVRPNETSYILLI 1624
            EAI LL DM+  GF+PT+ISYN +LLGLCK+  ID AI + A M+E G RPNET+YILLI
Sbjct: 480  EAIGLLDDMEQSGFRPTVISYNIVLLGLCKVRRIDDAIGMFAEMIEKGCRPNETTYILLI 539

Query: 1625 EGMAYSGWRAEAMELANSLVSIGVVSKDFFKRLNRTFPMINVYKDLSNSDIKK 1783
            EG+ ++GWR EAMELANSL S  V+S+D FKRLN+TFPM++VYK+LSNS+ KK
Sbjct: 540  EGIGFAGWRTEAMELANSLFSRDVISQDSFKRLNKTFPMLDVYKELSNSETKK 592


>ref|XP_002266822.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Vitis vinifera]
          Length = 582

 Score =  726 bits (1874), Expect = 0.0
 Identities = 360/585 (61%), Positives = 441/585 (75%), Gaps = 4/585 (0%)
 Frame = +2

Query: 41   MAVLYTEFFPYCYSLNIHSKASPLPCHHQKSIIISCKNSTTNDRISS----KIRVSAETR 208
            M +  T+FFP C   N   K +    H   + I++C+N   ND  +S    K+ VSAE R
Sbjct: 1    MTIYSTDFFPRCPPFNPQLKPTS---HSHHTSIVTCRNPNPNDGFNSRNAPKVGVSAEAR 57

Query: 209  SIHFQSLDFQETHFIKLLNRSCKAGKYSETLYFLECMVNKGYKPDVILCTKLVKGFLNAR 388
              H QS DF+ETH +KLLNRSCKAGK++E+LYFLEC+VNKGY PDVILCTKL+KGF N +
Sbjct: 58   PAHLQSYDFRETHLMKLLNRSCKAGKFNESLYFLECLVNKGYTPDVILCTKLIKGFFNFK 117

Query: 389  NVAKAVRVMAILESHGEPDVFAYNALISGFCKMNQIESANKVLHRMRNRGCLPDIVTYNI 568
            N+ KA RVM ILESH EPDVFAYNA+ISGFCK+N+IE+A +VL+RM+ RG LPDIVTYNI
Sbjct: 118  NIEKASRVMEILESHTEPDVFAYNAVISGFCKVNRIEAATQVLNRMKARGFLPDIVTYNI 177

Query: 569  LIGSLCSRGKLELAREVFNQFALYDCHPXXXXXXXXXXXXXXXXXXXEAXXXXXXXXXXX 748
            +IGSLC+R KL LA +V +Q  L +C P                   EA           
Sbjct: 178  MIGSLCNRRKLGLALKVLDQLLLDNCMPTVITYTILIEATIVEGGINEAMKLLEEMLARG 237

Query: 749  XQPDMYTYNAVVRGMCKEGMVDQAFDFIMNLAELGCEPDEISYNIVLRAFLSRGRWIDGE 928
              PDMYTYNA++RGMCKEGMV++A + I +L   GC+PD ISYNI+LRAFL++G+W +GE
Sbjct: 238  LLPDMYTYNAIIRGMCKEGMVERAAELITSLTSKGCKPDVISYNILLRAFLNQGKWDEGE 297

Query: 929  RLIEKMISRGCEPNVVTYSILISSLCHDGKLKDAIAVLKTMMAKGLTPDAYSFDPVISAL 1108
            +L+ +M SRGCEPN VTYSILISSLC  G++ +AI+VLK M+ K LTPD YS+DP+ISAL
Sbjct: 298  KLVAEMFSRGCEPNKVTYSILISSLCRFGRIDEAISVLKVMIEKELTPDTYSYDPLISAL 357

Query: 1109 CKDGKLDLAIEFLDFMISNGCSPDIVNYNTILAALCKSGKAEEALEIFEKLSQTDSPPNV 1288
            CK+G+LDLAI  +D+MISNGC PDIVNYNTILAALCK+G A +ALEIF KL     PPNV
Sbjct: 358  CKEGRLDLAIGIMDYMISNGCLPDIVNYNTILAALCKNGNANQALEIFNKLRGMGCPPNV 417

Query: 1289 SSYNTMISGLWNSGDPKRALNMVSEMVDKGIDADQITYNSLISCLCRDAMVEEAIELLVD 1468
            SSYNTMIS LW+ GD  RAL MV  M+ KG+D D+ITYNSLISCLCRD +VEEAI LL D
Sbjct: 418  SSYNTMISALWSCGDRSRALGMVPAMISKGVDPDEITYNSLISCLCRDGLVEEAIGLLDD 477

Query: 1469 MQNRGFKPTIISYNTMLLGLCKIHNIDTAIDILAVMVENGVRPNETSYILLIEGMAYSGW 1648
            M+  GF+PT+ISYN +LLGLCK+  ID AI + A M+E G RPNET+YILLIEG+ ++GW
Sbjct: 478  MEQSGFRPTVISYNIVLLGLCKVRRIDDAIGMFAEMIEKGCRPNETTYILLIEGIGFAGW 537

Query: 1649 RAEAMELANSLVSIGVVSKDFFKRLNRTFPMINVYKDLSNSDIKK 1783
            R EAMELANSL S  V+S+D FKRLN+TFPM++VYK+LSNS+ KK
Sbjct: 538  RTEAMELANSLFSRDVISQDSFKRLNKTFPMLDVYKELSNSETKK 582


>ref|XP_007030407.1| Pentatricopeptide repeat (PPR-like) superfamily protein [Theobroma
            cacao] gi|508719012|gb|EOY10909.1| Pentatricopeptide
            repeat (PPR-like) superfamily protein [Theobroma cacao]
          Length = 586

 Score =  721 bits (1862), Expect = 0.0
 Identities = 347/560 (61%), Positives = 432/560 (77%), Gaps = 5/560 (0%)
 Frame = +2

Query: 119  HHQKSIIISCKN-----STTNDRISSKIRVSAETRSIHFQSLDFQETHFIKLLNRSCKAG 283
            H   + ++SC N     S++  R + K+RVSAETR  H  S DF+ETH +KLLNRSCKAG
Sbjct: 27   HSHHTSLVSCLNHESQDSSSKSRNNQKVRVSAETRPTHLLSFDFKETHLMKLLNRSCKAG 86

Query: 284  KYSETLYFLECMVNKGYKPDVILCTKLVKGFLNARNVAKAVRVMAILESHGEPDVFAYNA 463
            KY+E  YFLECMV KGYKPDV+LCTK++KGF N RNV KA RV+ ILE +GEPDVFAYNA
Sbjct: 87   KYNEAFYFLECMVGKGYKPDVVLCTKMIKGFFNGRNVEKATRVIEILEKYGEPDVFAYNA 146

Query: 464  LISGFCKMNQIESANKVLHRMRNRGCLPDIVTYNILIGSLCSRGKLELAREVFNQFALYD 643
            +ISGFCKMN+++ ANKVL RMR+RG  PD+VTYNI+IGS CSRGKL+ A +V NQ    +
Sbjct: 147  IISGFCKMNRLDFANKVLDRMRSRGFSPDVVTYNIMIGSFCSRGKLDSAYKVINQLLKDN 206

Query: 644  CHPXXXXXXXXXXXXXXXXXXXEAXXXXXXXXXXXXQPDMYTYNAVVRGMCKEGMVDQAF 823
            C P                   EA            +PDM+TYNA++RGMCK+GMV++AF
Sbjct: 207  CKPSVITYTILIEATMLQGEINEAMKLLDEMLSKGLRPDMFTYNAIIRGMCKDGMVNRAF 266

Query: 824  DFIMNLAELGCEPDEISYNIVLRAFLSRGRWIDGERLIEKMISRGCEPNVVTYSILISSL 1003
             F+ +L   GC+PD ISYNI+LR  L++G+W +GE+L+ +M+SRGCEPNVVTYSILISSL
Sbjct: 267  KFVRSLKARGCQPDVISYNILLRVLLNQGKWAEGEKLVTEMVSRGCEPNVVTYSILISSL 326

Query: 1004 CHDGKLKDAIAVLKTMMAKGLTPDAYSFDPVISALCKDGKLDLAIEFLDFMISNGCSPDI 1183
            C +GKL++A+ VLK M  +GLTPDAYS+DP+ISA CK+G+LDLAIEFLD MIS+GC PDI
Sbjct: 327  CREGKLEEAVNVLKMMKERGLTPDAYSYDPLISAFCKEGRLDLAIEFLDCMISDGCLPDI 386

Query: 1184 VNYNTILAALCKSGKAEEALEIFEKLSQTDSPPNVSSYNTMISGLWNSGDPKRALNMVSE 1363
            VNYNT+LA LCK+GKAE+ALEIFEKL +   PPNVSSYNTM S LW+SGD  +AL M+SE
Sbjct: 387  VNYNTVLATLCKNGKAEQALEIFEKLREVGCPPNVSSYNTMFSALWSSGDKVKALEMISE 446

Query: 1364 MVDKGIDADQITYNSLISCLCRDAMVEEAIELLVDMQNRGFKPTIISYNTMLLGLCKIHN 1543
            M+ K I  D+ITYNSLISCLCRD MV+EAIELLVDM   G  PT+ISYN +LLGLCK+H 
Sbjct: 447  MLSKRIGPDEITYNSLISCLCRDGMVDEAIELLVDMGCSGIPPTVISYNIVLLGLCKVHR 506

Query: 1544 IDTAIDILAVMVENGVRPNETSYILLIEGMAYSGWRAEAMELANSLVSIGVVSKDFFKRL 1723
            I+ AI++LA MV+   +PNET+YILLIEG+ ++GWR+EAMELAN+L  +  +SKD FKRL
Sbjct: 507  INDAIEVLAAMVDKRCQPNETTYILLIEGIGFAGWRSEAMELANALFRMEAISKDSFKRL 566

Query: 1724 NRTFPMINVYKDLSNSDIKK 1783
            NRTFP+++VYK+ + SD  K
Sbjct: 567  NRTFPLLDVYKEFAGSDSNK 586


>ref|XP_004304772.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 586

 Score =  698 bits (1802), Expect = 0.0
 Identities = 347/588 (59%), Positives = 436/588 (74%), Gaps = 8/588 (1%)
 Frame = +2

Query: 41   MAVLYTEFFPYCYSLNIHSKASPLPCHHQKSIIISCK----NSTTNDRISSK----IRVS 196
            MA++ TE  P+ +      K +    H      +SC+    +S +N R SS+    + VS
Sbjct: 1    MAIVSTELLPHSFHTTSQLKPTS---HSHHPTALSCRASSASSISNGRNSSRNPTRVSVS 57

Query: 197  AETRSIHFQSLDFQETHFIKLLNRSCKAGKYSETLYFLECMVNKGYKPDVILCTKLVKGF 376
            AE +S   Q+ DF++TH +K+LNRSCKAG+Y+E +YFLE MVNKGYKPDVILCTKL+KGF
Sbjct: 58   AEPKSTQLQNYDFKDTHLMKVLNRSCKAGQYNEAIYFLELMVNKGYKPDVILCTKLIKGF 117

Query: 377  LNARNVAKAVRVMAILESHGEPDVFAYNALISGFCKMNQIESANKVLHRMRNRGCLPDIV 556
             N+RN+ KA+RVM ILE +GEPD+FAYNALISGFCK N+IESANKVL RM+++G  PD+V
Sbjct: 118  FNSRNIEKAIRVMQILEQYGEPDLFAYNALISGFCKANRIESANKVLDRMKSQGFKPDVV 177

Query: 557  TYNILIGSLCSRGKLELAREVFNQFALYDCHPXXXXXXXXXXXXXXXXXXXEAXXXXXXX 736
            TYNI+IGSLCSRGKL LA +V ++    +C P                   EA       
Sbjct: 178  TYNIMIGSLCSRGKLGLALQVMDRLVRDNCKPTVITYTILIEAIILDGGINEAMKLLDEM 237

Query: 737  XXXXXQPDMYTYNAVVRGMCKEGMVDQAFDFIMNLAELGCEPDEISYNIVLRAFLSRGRW 916
                 +PDMYTYNA+VRGMC+EGM+D+AF+F+      GC P+ ISYNI+LRA L+RG+W
Sbjct: 238  LSRGLKPDMYTYNAIVRGMCREGMLDRAFEFVKCFDAKGCAPNVISYNILLRALLNRGKW 297

Query: 917  IDGERLIEKMISRGCEPNVVTYSILISSLCHDGKLKDAIAVLKTMMAKGLTPDAYSFDPV 1096
             +GE L+  M +RGCEPNVVTYSILIS+LC DGK++D + VLK M  KGLTPDAYS+DP+
Sbjct: 298  EEGENLVANMCARGCEPNVVTYSILISTLCRDGKVEDGMNVLKIMKEKGLTPDAYSYDPL 357

Query: 1097 ISALCKDGKLDLAIEFLDFMISNGCSPDIVNYNTILAALCKSGKAEEALEIFEKLSQTDS 1276
            IS  CK+G+LDLAIE LD MIS+GC PDIVNYNT+LAALCK+G A++ALEIFE L +   
Sbjct: 358  ISCFCKEGRLDLAIELLDCMISDGCLPDIVNYNTVLAALCKNGSADQALEIFENLGEVGC 417

Query: 1277 PPNVSSYNTMISGLWNSGDPKRALNMVSEMVDKGIDADQITYNSLISCLCRDAMVEEAIE 1456
            PPNVSSYNTM S LWN GD  RAL MVS+MV KGI+ D+ITYNSLISCLCRD MV EAI 
Sbjct: 418  PPNVSSYNTMFSALWNCGDRVRALGMVSDMVSKGIEPDEITYNSLISCLCRDGMVNEAIG 477

Query: 1457 LLVDMQNRGFKPTIISYNTMLLGLCKIHNIDTAIDILAVMVENGVRPNETSYILLIEGMA 1636
            LLVDM+  GF+PT+I+YN +LLGL K   I  AI++   MVE G RPNET+YILLIEG+ 
Sbjct: 478  LLVDMEAGGFQPTVITYNIVLLGLSKARRIVDAIEVFTAMVEKGCRPNETTYILLIEGIG 537

Query: 1637 YSGWRAEAMELANSLVSIGVVSKDFFKRLNRTFPMINVYKDLSNSDIK 1780
            ++GWRAEAMELA S+ S+  + +D FKRL+RTFPM++VYK+L+ S+I+
Sbjct: 538  FAGWRAEAMELAKSVYSLSAICEDSFKRLSRTFPMLDVYKELTLSEIE 585


>ref|XP_002521980.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223538784|gb|EEF40384.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 584

 Score =  689 bits (1779), Expect = 0.0
 Identities = 335/585 (57%), Positives = 435/585 (74%), Gaps = 4/585 (0%)
 Frame = +2

Query: 41   MAVLYTEFFPYCYSLNIHSKASPLPCHHQKSIIISCKNSTTND----RISSKIRVSAETR 208
            M +  TEF P+  S             H  S I+SC     ND    R   K+RVSAETR
Sbjct: 1    MTLFSTEFLPHSISFTTQPLKPTSNSLH--STIVSCIRPELNDANKVRNPQKVRVSAETR 58

Query: 209  SIHFQSLDFQETHFIKLLNRSCKAGKYSETLYFLECMVNKGYKPDVILCTKLVKGFLNAR 388
              H  S DF+E H +KLLNRSC+AGKY+E+LYFLECMV+KGY PDVILCTKL+KGF N+R
Sbjct: 59   QTHVLSFDFKEVHLMKLLNRSCRAGKYNESLYFLECMVDKGYTPDVILCTKLIKGFFNSR 118

Query: 389  NVAKAVRVMAILESHGEPDVFAYNALISGFCKMNQIESANKVLHRMRNRGCLPDIVTYNI 568
            N+ KA RVM ILE +G+PDVFAYNALISGF K NQ+E+AN+VL RM++RG LPD+VTYNI
Sbjct: 119  NIGKATRVMEILERYGKPDVFAYNALISGFIKANQLENANRVLDRMKSRGFLPDVVTYNI 178

Query: 569  LIGSLCSRGKLELAREVFNQFALYDCHPXXXXXXXXXXXXXXXXXXXEAXXXXXXXXXXX 748
            +IGS CSRGKL+LA E+F +    +C P                    A           
Sbjct: 179  MIGSFCSRGKLDLALEIFEELLKDNCEPTVITYTILIEATILDGGIDVAMKLLDEMLSKG 238

Query: 749  XQPDMYTYNAVVRGMCKEGMVDQAFDFIMNLAELGCEPDEISYNIVLRAFLSRGRWIDGE 928
             +PD  TYNA++RGMCKE MVD+AF+ + +L+  GC+PD I+YNI+LR  LSRG+W +GE
Sbjct: 239  LEPDTLTYNAIIRGMCKEMMVDKAFELLRSLSSRGCKPDIITYNILLRTLLSRGKWSEGE 298

Query: 929  RLIEKMISRGCEPNVVTYSILISSLCHDGKLKDAIAVLKTMMAKGLTPDAYSFDPVISAL 1108
            +LI +MIS GC+PNVVT+SILI +LC DGK+++A+ +L++M  KGL PDAY +DP+I+  
Sbjct: 299  KLISEMISIGCKPNVVTHSILIGTLCRDGKVEEAVNLLRSMKEKGLKPDAYCYDPLIAGF 358

Query: 1109 CKDGKLDLAIEFLDFMISNGCSPDIVNYNTILAALCKSGKAEEALEIFEKLSQTDSPPNV 1288
            C++G+LDLA EFL++MIS+GC PDIVNYNTI+A LC++GKA++ALE+FEKL +   PPNV
Sbjct: 359  CREGRLDLATEFLEYMISDGCLPDIVNYNTIMAGLCRTGKADQALEVFEKLDEVGCPPNV 418

Query: 1289 SSYNTMISGLWNSGDPKRALNMVSEMVDKGIDADQITYNSLISCLCRDAMVEEAIELLVD 1468
            SSYNT+ S LW+SGD  RAL M+ +++++GID D+ITYNSLISCLCRD MV+EAIELLVD
Sbjct: 419  SSYNTLFSALWSSGDRYRALEMILKLLNQGIDPDEITYNSLISCLCRDGMVDEAIELLVD 478

Query: 1469 MQNRGFKPTIISYNTMLLGLCKIHNIDTAIDILAVMVENGVRPNETSYILLIEGMAYSGW 1648
            MQ+  ++P ++SYN +LLGLCK++  + AI++LA M E G +PNET+YILLIEG+ +SG 
Sbjct: 479  MQSGRYRPNVVSYNIILLGLCKVNRANDAIEVLAAMTEKGCQPNETTYILLIEGIGFSGL 538

Query: 1649 RAEAMELANSLVSIGVVSKDFFKRLNRTFPMINVYKDLSNSDIKK 1783
            RAEAMELANSL  +  +S+D F RLN+TFP+++VYKDL+ SD  K
Sbjct: 539  RAEAMELANSLHGMNAISEDSFNRLNKTFPLLDVYKDLTFSDGSK 583


>gb|EXB93167.1| hypothetical protein L484_024505 [Morus notabilis]
          Length = 587

 Score =  684 bits (1765), Expect = 0.0
 Identities = 339/592 (57%), Positives = 437/592 (73%), Gaps = 12/592 (2%)
 Frame = +2

Query: 41   MAVLYTEFFPYCYSLNIHSKASPLPCHH---QKSIIISCKNSTTN--------DRISSKI 187
            MA++ TEF P           SP P  H   Q    +SC+N + +        ++   ++
Sbjct: 1    MAIISTEFLPQTLPF------SPQPKQHTSRQSHTCLSCRNPSQSSTDIYRKKNKKPLRV 54

Query: 188  RVSAETRSIHFQS-LDFQETHFIKLLNRSCKAGKYSETLYFLECMVNKGYKPDVILCTKL 364
            RVS ET+S + QS  DF E+H +K++NRSCK+GKY+E LYFLE MV+KG+KPDVILCTK+
Sbjct: 55   RVSVETKSPNSQSNSDFSESHLLKVINRSCKSGKYNEALYFLELMVSKGFKPDVILCTKV 114

Query: 365  VKGFLNARNVAKAVRVMAILESHGEPDVFAYNALISGFCKMNQIESANKVLHRMRNRGCL 544
            ++GF N+RN+ KA+RVM ILE HGEPD+F+YNA+ISGFCK N++E ANKVL RMR +G  
Sbjct: 115  MRGFFNSRNIPKAIRVMEILEKHGEPDLFSYNAMISGFCKANRVELANKVLDRMRVQGFS 174

Query: 545  PDIVTYNILIGSLCSRGKLELAREVFNQFALYDCHPXXXXXXXXXXXXXXXXXXXEAXXX 724
            PD +TYNI+IGSLCSRGK+++A +V ++    +C P                   +A   
Sbjct: 175  PDTITYNIMIGSLCSRGKVDMAFKVLDELLRDNCKPSVITYTILIEATISEGGVDKAMEV 234

Query: 725  XXXXXXXXXQPDMYTYNAVVRGMCKEGMVDQAFDFIMNLAELGCEPDEISYNIVLRAFLS 904
                      PDM+TYNA+VRGMC+EGM+D+AF+F+ +L   GC P+ ISYNI+LRA L+
Sbjct: 235  LEEMLSRGLLPDMFTYNAIVRGMCREGMLDRAFEFVRSLEAKGCSPNVISYNILLRALLN 294

Query: 905  RGRWIDGERLIEKMISRGCEPNVVTYSILISSLCHDGKLKDAIAVLKTMMAKGLTPDAYS 1084
            RG+W DGE+++  M+SRGCEPNVVTYSILIS+LC DGK++DA+ VLK M  KG+TPDAYS
Sbjct: 295  RGKWSDGEKILSDMVSRGCEPNVVTYSILISTLCRDGKVEDAVNVLKAMKEKGITPDAYS 354

Query: 1085 FDPVISALCKDGKLDLAIEFLDFMISNGCSPDIVNYNTILAALCKSGKAEEALEIFEKLS 1264
            +DP+ISA CK+G+LDLAIEF+D+MIS+G  PDIVNYNTILAALCK+G A+ ALEIFEKL 
Sbjct: 355  YDPLISAFCKEGRLDLAIEFMDYMISDGSLPDIVNYNTILAALCKNGNADHALEIFEKLG 414

Query: 1265 QTDSPPNVSSYNTMISGLWNSGDPKRALNMVSEMVDKGIDADQITYNSLISCLCRDAMVE 1444
            +   PP VSSYNTM S LWN G+  +AL M+SEMV K I+ D+ITYNSLISCLCR+ MV 
Sbjct: 415  EVGCPPTVSSYNTMFSALWNCGERIKALEMISEMVSKRINPDEITYNSLISCLCREGMVN 474

Query: 1445 EAIELLVDMQNRGFKPTIISYNTMLLGLCKIHNIDTAIDILAVMVENGVRPNETSYILLI 1624
            EAI LL+DM+  GFK ++ISYN +LLGLCK   ID AI++LA MVE G RPNET+Y LLI
Sbjct: 475  EAIGLLIDMEAGGFKLSVISYNIVLLGLCKARRIDDAIELLAAMVEKGCRPNETTYTLLI 534

Query: 1625 EGMAYSGWRAEAMELANSLVSIGVVSKDFFKRLNRTFPMINVYKDLSNSDIK 1780
            EG+ ++GWR EAM LAN L  I  +S+  FKRLN+TFPM++VYK+L+ S+IK
Sbjct: 535  EGIGFAGWRVEAMGLANLLFDIEAISEHSFKRLNKTFPMLDVYKELTLSEIK 586


>ref|XP_006442665.1| hypothetical protein CICLE_v10019446mg [Citrus clementina]
            gi|557544927|gb|ESR55905.1| hypothetical protein
            CICLE_v10019446mg [Citrus clementina]
          Length = 583

 Score =  681 bits (1757), Expect = 0.0
 Identities = 333/560 (59%), Positives = 420/560 (75%), Gaps = 1/560 (0%)
 Frame = +2

Query: 92   HSKASPLPCHHQKSIIISCKNSTTNDRISSKIRVSAETR-SIHFQSLDFQETHFIKLLNR 268
            H +  P   H  +S ++SC N  +N+R+  ++  SAETR + H  S D +ET F+KL+ R
Sbjct: 21   HQQQKPTTSHSVQSTVVSCINPKSNERV--RVSSSAETRPNTHLLSFDVKETQFMKLIKR 78

Query: 269  SCKAGKYSETLYFLECMVNKGYKPDVILCTKLVKGFLNARNVAKAVRVMAILESHGEPDV 448
            S +AGK+ E+LYF+E MV  G KPDV++CTKL+K F   R   KAVRVM ILE +GEPDV
Sbjct: 79   SFRAGKFDESLYFIESMVANGCKPDVVMCTKLIKKFFQERKSNKAVRVMEILEKYGEPDV 138

Query: 449  FAYNALISGFCKMNQIESANKVLHRMRNRGCLPDIVTYNILIGSLCSRGKLELAREVFNQ 628
            FAYNALISGFCK NQIE ANKVL R+R+RG  PD+VTYNI+IGSLCSRG +E   +VF+Q
Sbjct: 139  FAYNALISGFCKANQIELANKVLDRLRSRGFSPDVVTYNIMIGSLCSRGMIESGFKVFDQ 198

Query: 629  FALYDCHPXXXXXXXXXXXXXXXXXXXEAXXXXXXXXXXXXQPDMYTYNAVVRGMCKEGM 808
                +C P                   +A             PDM+T NA++RGMCK+GM
Sbjct: 199  LLRDNCKPTVITYTILIQATMLEGQTDKAMKLLDEMFARGLIPDMFTNNAIIRGMCKKGM 258

Query: 809  VDQAFDFIMNLAELGCEPDEISYNIVLRAFLSRGRWIDGERLIEKMISRGCEPNVVTYSI 988
            V QAF F+ +L   GC+PD ISYN++LR  L+ G+W +GE+L+ +MISRG EPNVVTYSI
Sbjct: 259  VGQAFQFVRSLESRGCQPDVISYNMLLRTLLNMGKWEEGEKLMTEMISRGLEPNVVTYSI 318

Query: 989  LISSLCHDGKLKDAIAVLKTMMAKGLTPDAYSFDPVISALCKDGKLDLAIEFLDFMISNG 1168
            LISSLC DGK +DA+ VL+    KGLTPDAYS+DP+ISA CKDG+LDLAIEFLD+MIS+G
Sbjct: 319  LISSLCRDGKTEDAVDVLRAAKEKGLTPDAYSYDPLISAYCKDGRLDLAIEFLDYMISDG 378

Query: 1169 CSPDIVNYNTILAALCKSGKAEEALEIFEKLSQTDSPPNVSSYNTMISGLWNSGDPKRAL 1348
            C PDIVNYNTILAA CK+G A++ALEIFEKLS    PPNVSSYNTM S LW+SGD  RAL
Sbjct: 379  CLPDIVNYNTILAAFCKNGNADQALEIFEKLSDVGCPPNVSSYNTMFSALWSSGDKIRAL 438

Query: 1349 NMVSEMVDKGIDADQITYNSLISCLCRDAMVEEAIELLVDMQNRGFKPTIISYNTMLLGL 1528
             M+SEM+ KGI+ D+ITYNSLISCLCRD MV+EA+ LLVDM++  F+PT++SYN ++LG 
Sbjct: 439  GMISEMLSKGIEPDEITYNSLISCLCRDGMVDEAVGLLVDMESTRFRPTVVSYNIIILGF 498

Query: 1529 CKIHNIDTAIDILAVMVENGVRPNETSYILLIEGMAYSGWRAEAMELANSLVSIGVVSKD 1708
            CK   I+ AI++LA M E G +PNET+Y+LLIEG+ Y GWRAEAMELAN+LVS+  +S+D
Sbjct: 499  CKTRRINEAIEVLAAMFEKGCKPNETTYVLLIEGIGYGGWRAEAMELANALVSMHAISRD 558

Query: 1709 FFKRLNRTFPMINVYKDLSN 1768
             FKRLNRTFP+++VYK++S+
Sbjct: 559  TFKRLNRTFPLLDVYKEISH 578


>ref|XP_006487702.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Citrus sinensis]
          Length = 583

 Score =  681 bits (1756), Expect = 0.0
 Identities = 333/560 (59%), Positives = 421/560 (75%), Gaps = 1/560 (0%)
 Frame = +2

Query: 92   HSKASPLPCHHQKSIIISCKNSTTNDRISSKIRVSAETR-SIHFQSLDFQETHFIKLLNR 268
            H +  P   H  +S ++SC N  +N+R+  ++  SAETR + H  S D +ET F+KL+ +
Sbjct: 21   HQQLKPTTSHSVQSTVVSCINPKSNERV--RVSSSAETRPNTHLLSFDVKETQFMKLIKK 78

Query: 269  SCKAGKYSETLYFLECMVNKGYKPDVILCTKLVKGFLNARNVAKAVRVMAILESHGEPDV 448
            S +AGK+ E+LYF+E MV  G KPDV++CTKL+K F   R   KAVRVM ILE +GEPDV
Sbjct: 79   SFRAGKFDESLYFIESMVANGCKPDVVMCTKLIKKFFQERKSNKAVRVMEILEKYGEPDV 138

Query: 449  FAYNALISGFCKMNQIESANKVLHRMRNRGCLPDIVTYNILIGSLCSRGKLELAREVFNQ 628
            FAYNALISGFCK NQIE ANKVL R+R+RG  PD+VTYNI+IGSLCSRG +E A +VF+Q
Sbjct: 139  FAYNALISGFCKANQIELANKVLDRLRSRGFSPDVVTYNIMIGSLCSRGMIESAFKVFDQ 198

Query: 629  FALYDCHPXXXXXXXXXXXXXXXXXXXEAXXXXXXXXXXXXQPDMYTYNAVVRGMCKEGM 808
                +C P                   +A             PDM+T NA++RGMCK+GM
Sbjct: 199  LLRDNCKPTVITYTILIQATMLEGQTDKAMKLLDEMLARGLIPDMFTNNAIIRGMCKKGM 258

Query: 809  VDQAFDFIMNLAELGCEPDEISYNIVLRAFLSRGRWIDGERLIEKMISRGCEPNVVTYSI 988
            V QAF F+ +L   GC+PD ISYN++LR  L+ G+W +GE+L+ +MISRG EPNVVTYSI
Sbjct: 259  VGQAFQFVRSLESRGCQPDVISYNMLLRTLLNMGKWEEGEKLMTEMISRGLEPNVVTYSI 318

Query: 989  LISSLCHDGKLKDAIAVLKTMMAKGLTPDAYSFDPVISALCKDGKLDLAIEFLDFMISNG 1168
            LISSLC DGK +DA+ VL+    KGLTPDAYS+DP+ISA CKDG+LDLAIEFLD+MIS+G
Sbjct: 319  LISSLCRDGKTEDAVDVLRAAKEKGLTPDAYSYDPLISAYCKDGRLDLAIEFLDYMISDG 378

Query: 1169 CSPDIVNYNTILAALCKSGKAEEALEIFEKLSQTDSPPNVSSYNTMISGLWNSGDPKRAL 1348
            C PDIVNYNTILAA CK+G A++ALEIFEKLS    PPNVSSYNTM S LW+SGD  RAL
Sbjct: 379  CLPDIVNYNTILAAFCKNGNADQALEIFEKLSDVGCPPNVSSYNTMFSALWSSGDKIRAL 438

Query: 1349 NMVSEMVDKGIDADQITYNSLISCLCRDAMVEEAIELLVDMQNRGFKPTIISYNTMLLGL 1528
             M+SEM+ KGI+ D+ITYNSLISCLCRD MV+EA+ LLVDM++  F+PT+ISYN ++LG 
Sbjct: 439  GMISEMLSKGIEPDEITYNSLISCLCRDGMVDEAVGLLVDMESTRFRPTVISYNIIILGF 498

Query: 1529 CKIHNIDTAIDILAVMVENGVRPNETSYILLIEGMAYSGWRAEAMELANSLVSIGVVSKD 1708
            CK   I+ +I++LA M E G +PNET+Y+LLIEG+ Y GWRAEAMELAN+LVS+  +S+D
Sbjct: 499  CKTRRINESIEVLAAMFEKGCKPNETTYVLLIEGIGYGGWRAEAMELANALVSMHAISRD 558

Query: 1709 FFKRLNRTFPMINVYKDLSN 1768
             FKRLNRTFP+++VYK++S+
Sbjct: 559  TFKRLNRTFPLLDVYKEISH 578


>ref|XP_006371094.1| hypothetical protein POPTR_0019s03630g [Populus trichocarpa]
            gi|550316702|gb|ERP48891.1| hypothetical protein
            POPTR_0019s03630g [Populus trichocarpa]
          Length = 586

 Score =  672 bits (1734), Expect = 0.0
 Identities = 331/582 (56%), Positives = 427/582 (73%), Gaps = 7/582 (1%)
 Frame = +2

Query: 41   MAVLYTEFFPYCYSLNIHSKASPLPCHHQKSIIISCKNSTTNDRISS-----KIR-VSAE 202
            M +  TEF  +  S    SK   L  H  +S ++SC N T ND  S+     K+R V  E
Sbjct: 1    MTMFSTEFISHSCSFPFTSKHFKLSLHSLQSNVVSCINPTHNDTNSNLGNPPKLRRVLPE 60

Query: 203  TRSIHFQSLDFQETHFIKLLNRSCKAGKYSETLYFLECMVNKGYKPDVILCTKLVKGFLN 382
            T+  H  S DF+ETH +KLLNRSCKAGK +E+LYFLECMV KGY+PDVI+CTKL+KGF N
Sbjct: 61   TKPTHVLSYDFKETHLMKLLNRSCKAGKCNESLYFLECMVAKGYQPDVIMCTKLIKGFFN 120

Query: 383  ARNVAKAVRVMAILESHGEPDVFAYNALISGFCKMNQIESANKVLHRMRNRGCLPDIVTY 562
            +RN+ KA RVM ILE HGEPDVFAYNA+ISGFCK N+IESA KVL RM+ +G   D+VTY
Sbjct: 121  SRNIEKATRVMEILEKHGEPDVFAYNAVISGFCKANRIESAKKVLDRMKRKGFSQDVVTY 180

Query: 563  NILIGSLCSRGKLELAREVFNQFAL-YDCHPXXXXXXXXXXXXXXXXXXXEAXXXXXXXX 739
            NI+IG+ CS+GK++LA +VF +     +C P                   E         
Sbjct: 181  NIMIGTFCSKGKIDLALKVFEELLKDNNCKPTLITYTILIEAHILEGGIDEGLKLLDEML 240

Query: 740  XXXXQPDMYTYNAVVRGMCKEGMVDQAFDFIMNLAELGCEPDEISYNIVLRAFLSRGRWI 919
                +PD +TYN +VRG+ KEG V+QAF+ +  L   GC+PD I+YNI+LRA L +G+W 
Sbjct: 241  SRGLEPDTFTYNVIVRGLGKEGKVNQAFELVRTLNSRGCKPDVITYNILLRALLDQGKWY 300

Query: 920  DGERLIEKMISRGCEPNVVTYSILISSLCHDGKLKDAIAVLKTMMAKGLTPDAYSFDPVI 1099
            +GE+L+++M SRGCEPNVVTYSILISSLC DGK+++++ ++K M  KGLTPDAY +DP+I
Sbjct: 301  EGEKLMDEMFSRGCEPNVVTYSILISSLCRDGKIEESVNLVKVMKEKGLTPDAYCYDPLI 360

Query: 1100 SALCKDGKLDLAIEFLDFMISNGCSPDIVNYNTILAALCKSGKAEEALEIFEKLSQTDSP 1279
            +A C++GKLD+AI+FLD+MIS+G  PDIVNYNTI+AALCK+G ++ A+EIF KL +   P
Sbjct: 361  AAFCREGKLDMAIKFLDYMISDGFLPDIVNYNTIMAALCKNGNSDHAVEIFGKLEEVGCP 420

Query: 1280 PNVSSYNTMISGLWNSGDPKRALNMVSEMVDKGIDADQITYNSLISCLCRDAMVEEAIEL 1459
            PNVSSYNTM+S LW SGD  RAL M+S+M+  GID D ITYNSLISCLCRD MV+EAI L
Sbjct: 421  PNVSSYNTMLSALWGSGDRYRALGMISQMLSTGIDPDGITYNSLISCLCRDGMVDEAIGL 480

Query: 1460 LVDMQNRGFKPTIISYNTMLLGLCKIHNIDTAIDILAVMVENGVRPNETSYILLIEGMAY 1639
            L DM +  F+P I+SYN +LLGLCK+H ID AI++L  M+ENG +PNET+Y LLIEG+ +
Sbjct: 481  LADMLSGRFQPNIVSYNIVLLGLCKVHRIDDAIEVLTAMIENGCQPNETTYTLLIEGIGF 540

Query: 1640 SGWRAEAMELANSLVSIGVVSKDFFKRLNRTFPMINVYKDLS 1765
            SG RA+AMELANSL S+  +S+  +KRLN+ FP+++VYKDL+
Sbjct: 541  SGSRAQAMELANSLYSMNAISEGSYKRLNKVFPLLDVYKDLT 582


>ref|XP_004142590.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Cucumis sativus]
          Length = 581

 Score =  662 bits (1708), Expect = 0.0
 Identities = 325/575 (56%), Positives = 425/575 (73%)
 Frame = +2

Query: 56   TEFFPYCYSLNIHSKASPLPCHHQKSIIISCKNSTTNDRISSKIRVSAETRSIHFQSLDF 235
            +EF P       +  A P     +   I +C+ S  N      +  SAE R  HF +LD 
Sbjct: 4    SEFLPQSLHFT-NPLAKPTIPQSRSDSIPACRFS--NKTHLRNVTSSAEFRQPHFPNLDN 60

Query: 236  QETHFIKLLNRSCKAGKYSETLYFLECMVNKGYKPDVILCTKLVKGFLNARNVAKAVRVM 415
            ++ H +KLLNRSC+AGK++E+LYFLE +V+KG+KPDV+LCTKL+KGF N+RN+ KA+RVM
Sbjct: 61   RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 416  AILESHGEPDVFAYNALISGFCKMNQIESANKVLHRMRNRGCLPDIVTYNILIGSLCSRG 595
             ILE++G+PDV++YNA+ISGF K NQI+SAN+V  RMR+RG  PD+VTYNI+IGSLCSRG
Sbjct: 121  EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180

Query: 596  KLELAREVFNQFALYDCHPXXXXXXXXXXXXXXXXXXXEAXXXXXXXXXXXXQPDMYTYN 775
            KLELA EV ++     C P                   EA            +PD+YTYN
Sbjct: 181  KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240

Query: 776  AVVRGMCKEGMVDQAFDFIMNLAELGCEPDEISYNIVLRAFLSRGRWIDGERLIEKMISR 955
            A++RG+CKEGM D+A DF+ +L+  GC PD +SYNI+LR+FL++ RW DGERL++ M+  
Sbjct: 241  AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300

Query: 956  GCEPNVVTYSILISSLCHDGKLKDAIAVLKTMMAKGLTPDAYSFDPVISALCKDGKLDLA 1135
            GCEPNVVT+SILISS C +G++++A+ VL+ M  KGLTPD+YS+DP+ISA CK+G+LDLA
Sbjct: 301  GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360

Query: 1136 IEFLDFMISNGCSPDIVNYNTILAALCKSGKAEEALEIFEKLSQTDSPPNVSSYNTMISG 1315
            IE+L+ M+S+GC PDIVNYNTILA LCK G A+ AL++FEKL +   PP V +YNTM S 
Sbjct: 361  IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420

Query: 1316 LWNSGDPKRALNMVSEMVDKGIDADQITYNSLISCLCRDAMVEEAIELLVDMQNRGFKPT 1495
            LW+ G+  +AL M+SEM+ KGID D+ITYNSLISCLCRD +V+EAI LLVDM+   F+PT
Sbjct: 421  LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480

Query: 1496 IISYNTMLLGLCKIHNIDTAIDILAVMVENGVRPNETSYILLIEGMAYSGWRAEAMELAN 1675
            +IS+N +LLG+CK H +   I++L  MVE G  PNETSY+LLIEG+AY+GWRAEAMELAN
Sbjct: 481  VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 1676 SLVSIGVVSKDFFKRLNRTFPMINVYKDLSNSDIK 1780
            SL  +GV+S D  KRLN+TFPM++VYK LS S+ K
Sbjct: 541  SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESK 575


>ref|XP_007205118.1| hypothetical protein PRUPE_ppa004835mg [Prunus persica]
            gi|462400760|gb|EMJ06317.1| hypothetical protein
            PRUPE_ppa004835mg [Prunus persica]
          Length = 489

 Score =  660 bits (1702), Expect = 0.0
 Identities = 321/488 (65%), Positives = 387/488 (79%)
 Frame = +2

Query: 317  MVNKGYKPDVILCTKLVKGFLNARNVAKAVRVMAILESHGEPDVFAYNALISGFCKMNQI 496
            MVNKGYKPDVILCTKL+KGF N+RN+ KA+RVM ILE +GEPD+F+YNALISGFCK N+I
Sbjct: 1    MVNKGYKPDVILCTKLIKGFFNSRNIEKAIRVMQILEKYGEPDLFSYNALISGFCKANRI 60

Query: 497  ESANKVLHRMRNRGCLPDIVTYNILIGSLCSRGKLELAREVFNQFALYDCHPXXXXXXXX 676
            ESANKVL RMR++G  PD+VTYNI+IGSLCSRGKL LA +V +Q    +C P        
Sbjct: 61   ESANKVLDRMRSQGFSPDVVTYNIMIGSLCSRGKLGLALKVMDQLVKDNCRPTVITYTIL 120

Query: 677  XXXXXXXXXXXEAXXXXXXXXXXXXQPDMYTYNAVVRGMCKEGMVDQAFDFIMNLAELGC 856
                       EA            +PDMYTYNAV+RGMC+EGM+D+AF F+ +L   GC
Sbjct: 121  IEATIVDGGIDEAMKLLDEMLSRGLKPDMYTYNAVIRGMCREGMLDRAFQFVRSLDSKGC 180

Query: 857  EPDEISYNIVLRAFLSRGRWIDGERLIEKMISRGCEPNVVTYSILISSLCHDGKLKDAIA 1036
             P+ ISYNI+LRA L+RG+W +GE+L+  M SRGCEPNVVTYSILIS+LC DGK++DA+ 
Sbjct: 181  PPNVISYNILLRALLNRGKWEEGEKLVTNMCSRGCEPNVVTYSILISTLCRDGKVEDAVN 240

Query: 1037 VLKTMMAKGLTPDAYSFDPVISALCKDGKLDLAIEFLDFMISNGCSPDIVNYNTILAALC 1216
            VLK M  KGLTPDAYS+DP++SA CK+G+LDLAIEFLD+MIS+GC PDIVNYNTILAALC
Sbjct: 241  VLKIMKKKGLTPDAYSYDPLVSAFCKEGRLDLAIEFLDYMISDGCLPDIVNYNTILAALC 300

Query: 1217 KSGKAEEALEIFEKLSQTDSPPNVSSYNTMISGLWNSGDPKRALNMVSEMVDKGIDADQI 1396
            KSGKA++AL+IFE L +   PPNVSSYNTM S LWN GD  RAL MVSEMV KGI  D+I
Sbjct: 301  KSGKADQALQIFENLGEVGCPPNVSSYNTMFSALWNCGDRVRALGMVSEMVGKGIKPDEI 360

Query: 1397 TYNSLISCLCRDAMVEEAIELLVDMQNRGFKPTIISYNTMLLGLCKIHNIDTAIDILAVM 1576
            TYNSLISCLCRD MV+EAI LLVDM+  GF+PT+ISYN +LLGLCK   +  AI +L  M
Sbjct: 361  TYNSLISCLCRDGMVDEAIGLLVDMETGGFQPTVISYNIILLGLCKTRRVVDAIQVLTEM 420

Query: 1577 VENGVRPNETSYILLIEGMAYSGWRAEAMELANSLVSIGVVSKDFFKRLNRTFPMINVYK 1756
            VE G RPNET+YILLIEG+ ++GWRAEAMELANS+ S+  +S+D FKRLNRTFPM++V+K
Sbjct: 421  VEKGCRPNETTYILLIEGIGFAGWRAEAMELANSVFSLRAISEDSFKRLNRTFPMLDVFK 480

Query: 1757 DLSNSDIK 1780
            +L+ S+IK
Sbjct: 481  ELTLSEIK 488



 Score =  186 bits (473), Expect = 2e-44
 Identities = 105/406 (25%), Positives = 199/406 (49%), Gaps = 36/406 (8%)
 Frame = +2

Query: 257  LLNRSCKAGKYSETLYFLECMVNKGYKPDVI--------LCTK----------------- 361
            L++  CKA +       L+ M ++G+ PDV+        LC++                 
Sbjct: 50   LISGFCKANRIESANKVLDRMRSQGFSPDVVTYNIMIGSLCSRGKLGLALKVMDQLVKDN 109

Query: 362  ----------LVKGFLNARNVAKAVRVMAILESHG-EPDVFAYNALISGFCKMNQIESAN 508
                      L++  +    + +A++++  + S G +PD++ YNA+I G C+   ++ A 
Sbjct: 110  CRPTVITYTILIEATIVDGGIDEAMKLLDEMLSRGLKPDMYTYNAVIRGMCREGMLDRAF 169

Query: 509  KVLHRMRNRGCLPDIVTYNILIGSLCSRGKLELAREVFNQFALYDCHPXXXXXXXXXXXX 688
            + +  + ++GC P++++YNIL+ +L +RGK E   ++        C P            
Sbjct: 170  QFVRSLDSKGCPPNVISYNILLRALLNRGKWEEGEKLVTNMCSRGCEPNVVTYSILISTL 229

Query: 689  XXXXXXXEAXXXXXXXXXXXXQPDMYTYNAVVRGMCKEGMVDQAFDFIMNLAELGCEPDE 868
                   +A             PD Y+Y+ +V   CKEG +D A +F+  +   GC PD 
Sbjct: 230  CRDGKVEDAVNVLKIMKKKGLTPDAYSYDPLVSAFCKEGRLDLAIEFLDYMISDGCLPDI 289

Query: 869  ISYNIVLRAFLSRGRWIDGERLIEKMISRGCEPNVVTYSILISSLCHDGKLKDAIAVLKT 1048
            ++YN +L A    G+     ++ E +   GC PNV +Y+ + S+L + G    A+ ++  
Sbjct: 290  VNYNTILAALCKSGKADQALQIFENLGEVGCPPNVSSYNTMFSALWNCGDRVRALGMVSE 349

Query: 1049 MMAKGLTPDAYSFDPVISALCKDGKLDLAIEFLDFMISNGCSPDIVNYNTILAALCKSGK 1228
            M+ KG+ PD  +++ +IS LC+DG +D AI  L  M + G  P +++YN IL  LCK+ +
Sbjct: 350  MVGKGIKPDEITYNSLISCLCRDGMVDEAIGLLVDMETGGFQPTVISYNIILLGLCKTRR 409

Query: 1229 AEEALEIFEKLSQTDSPPNVSSYNTMISGLWNSGDPKRALNMVSEM 1366
              +A+++  ++ +    PN ++Y  +I G+  +G    A+ + + +
Sbjct: 410  VVDAIQVLTEMVEKGCRPNETTYILLIEGIGFAGWRAEAMELANSV 455


>ref|XP_006361415.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Solanum tuberosum]
          Length = 583

 Score =  639 bits (1648), Expect = e-180
 Identities = 321/578 (55%), Positives = 423/578 (73%), Gaps = 5/578 (0%)
 Frame = +2

Query: 47   VLYTEFFPYC--YSLNIHSKASPLPCHHQKSIIISCKNSTTNDRISSKIRVSAETRSIHF 220
            ++  E FP C  +S N+  K+       + + ++ C +S++ND+  SK++     R +  
Sbjct: 4    IIPAEIFPQCPFFSNNLKPKSQS----SKHNFVVRC-SSSSNDQ--SKVKTRNPLR-VKI 55

Query: 221  QSLDFQETHFIKLLNRSCKAGKYSETLYFLECMVNKGYKPDVILCTKLVKGFLNARNVAK 400
             S +++ TH +K+LN SCK GKY ETLY LEC +  GYKPDVILCTKL+KGF N++N  K
Sbjct: 56   SSENYRPTHDMKVLNWSCKVGKYDETLYLLECKLKSGYKPDVILCTKLIKGFCNSKNSDK 115

Query: 401  AVRVMAILESHGEPDVFAYNALISGFCKMNQIESANKVLHRMRNRGCLPDIVTYNILIGS 580
             V+VM ILE  GEPDVFAYNALISGFCKMN+IE ANKVL+RM+ RG  PD VTYNILIGS
Sbjct: 116  GVKVMQILEQFGEPDVFAYNALISGFCKMNKIEEANKVLNRMKARGFPPDSVTYNILIGS 175

Query: 581  LCSRGKLELAREVFNQFALYD-CHPXXXXXXXXXXXXXXXXXXXEAXXXXXXXXXXXXQP 757
            LC RGKL  A ++ +Q    + C P                   EA            QP
Sbjct: 176  LCDRGKLGSALKLLDQLKEENNCKPTVITYTILIEATILEGGIHEAMKLLDEMLSRGLQP 235

Query: 758  DMYTYNAVVRGMCKEGMVDQAFDFIMNLAELGCEPDEISYNIVLRAFLS-RGRWIDGERL 934
            DMYTYNA++RGMC+E M+DQA++F+ +L   GC+PD ISYNI+LRA L  +G+W DGE+L
Sbjct: 236  DMYTYNAIIRGMCREKMMDQAYEFVRSLPSKGCKPDVISYNILLRALLHHKGKWSDGEKL 295

Query: 935  IEKMISRGCEPNVVTYSILISSLCHDGKLKDAIAVLKTMMAKGLTPDAYSFDPVISALCK 1114
            + +M+S GCEPNVVTYSIL+S+LC DGKL +AI +LK MM KGLTPD +++DP+ISA CK
Sbjct: 296  MNEMLSAGCEPNVVTYSILMSALCRDGKLDEAINLLKIMMDKGLTPDTFTYDPLISAFCK 355

Query: 1115 DGKLDLAIEFLDFMISNGCSPDIVNYNTILAALCKSGKAEEALEIFEKLSQTDSPPNVSS 1294
             G+LDLAI+FLD+MISNGC PDIVNYNTIL+ +CK GKA+EA+E+FEKL++   PP+VS+
Sbjct: 356  GGRLDLAIKFLDYMISNGCLPDIVNYNTILSTMCKKGKADEAMEVFEKLAEIGCPPDVST 415

Query: 1295 YNTMISGLWNSGDPKRALNMVSEMVDKGIDADQITYNSLISCLCRDAMVEEAIELLVDMQ 1474
            YNT++S LWN+G   RAL MVSEM++KG+D D+ITYN+LISCLCRD MV EA++LL DM+
Sbjct: 416  YNTLMSALWNNGGRARALKMVSEMIEKGVDPDEITYNALISCLCRDGMVNEALDLLGDME 475

Query: 1475 NRGFKPTIISYNTMLLGLCKIHNIDTAIDILAVMVENGVRPNETSYILLIEGMAYSGWRA 1654
              GF PT+I+YN +LLGLCK H +  AI++LA MVE G RPNET+YILLIEG+ +SG R 
Sbjct: 476  GNGFPPTVITYNILLLGLCKAHRVVEAIEVLAEMVEKGRRPNETTYILLIEGIGFSGRRV 535

Query: 1655 EAMELANSLVSIGVVSKDFFKRLNRTFPMINVY-KDLS 1765
            +AME+A+++     +SK+  +RL +TF + +VY KD++
Sbjct: 536  QAMEMASAIYHKNAISKESLQRLRKTFQVPDVYNKDIT 573


>ref|XP_002884468.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297330308|gb|EFH60727.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 598

 Score =  638 bits (1645), Expect = e-180
 Identities = 314/564 (55%), Positives = 408/564 (72%)
 Frame = +2

Query: 80   SLNIHSKASPLPCHHQKSIIISCKNSTTNDRISSKIRVSAETRSIHFQSLDFQETHFIKL 259
            SL   S ++P   +H      S   +      ++   +  E R  H QSL F++T  +K+
Sbjct: 35   SLLTFSNSNP---NHDNGKSFSSSGARNLQATTTDAAIPTERRQQHSQSLGFRDTQMLKI 91

Query: 260  LNRSCKAGKYSETLYFLECMVNKGYKPDVILCTKLVKGFLNARNVAKAVRVMAILESHGE 439
             +RSC++G Y E+L+ LE MV KGY PDVILCTKL+KGF   RNV KAVRVM ILE  G+
Sbjct: 92   FHRSCRSGNYIESLHLLETMVRKGYNPDVILCTKLIKGFFTLRNVPKAVRVMEILEKFGQ 151

Query: 440  PDVFAYNALISGFCKMNQIESANKVLHRMRNRGCLPDIVTYNILIGSLCSRGKLELAREV 619
            PDVFAYNALI+GFCKMN+I+ A +VL RMR++   PD VTYNI+IGSLCSRGKL+LA +V
Sbjct: 152  PDVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKV 211

Query: 620  FNQFALYDCHPXXXXXXXXXXXXXXXXXXXEAXXXXXXXXXXXXQPDMYTYNAVVRGMCK 799
             +Q    +C P                   EA            +PDM+TYN ++RGMCK
Sbjct: 212  LDQLLSDNCQPTVITYTILIEATMLEGGVDEALKLLDEMLSRGLKPDMFTYNTIIRGMCK 271

Query: 800  EGMVDQAFDFIMNLAELGCEPDEISYNIVLRAFLSRGRWIDGERLIEKMISRGCEPNVVT 979
            EGMVD+AF+ I NL   GCEPD ISYNI+LRA L++G+W +GE+L+ KM S  C+PNVVT
Sbjct: 272  EGMVDRAFEMIRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVT 331

Query: 980  YSILISSLCHDGKLKDAIAVLKTMMAKGLTPDAYSFDPVISALCKDGKLDLAIEFLDFMI 1159
            YSILI++LC DGK+++A+ +LK M  KGLTPDAYS+DP+I+A C++G+LD+AIEFL+ MI
Sbjct: 332  YSILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMI 391

Query: 1160 SNGCSPDIVNYNTILAALCKSGKAEEALEIFEKLSQTDSPPNVSSYNTMISGLWNSGDPK 1339
            S+GC PDIVNYNT+LA LCK+GKA++ALEIF KL +    PN SSYNTM S LW+SGD  
Sbjct: 392  SDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKI 451

Query: 1340 RALNMVSEMVDKGIDADQITYNSLISCLCRDAMVEEAIELLVDMQNRGFKPTIISYNTML 1519
            RAL+M+ EMV  GID D+ITYNS+ISCLCR+ MV++A ELLVDM++  F P++++YN +L
Sbjct: 452  RALHMILEMVSNGIDPDEITYNSMISCLCREGMVDKAFELLVDMRSCEFHPSVVTYNIVL 511

Query: 1520 LGLCKIHNIDTAIDILAVMVENGVRPNETSYILLIEGMAYSGWRAEAMELANSLVSIGVV 1699
            LG CK H I+ AID+L  MV NG RPNET+Y +LIEG+ ++G+RAEAMELAN LV I  +
Sbjct: 512  LGFCKAHRIEDAIDVLDSMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRINAI 571

Query: 1700 SKDFFKRLNRTFPMINVYKDLSNS 1771
            S+  FKRL+RTFP++NV +  S +
Sbjct: 572  SEYSFKRLHRTFPLLNVLQRSSQT 595


>ref|XP_006296608.1| hypothetical protein CARUB_v10013258mg [Capsella rubella]
            gi|482565317|gb|EOA29506.1| hypothetical protein
            CARUB_v10013258mg [Capsella rubella]
          Length = 607

 Score =  637 bits (1643), Expect = e-180
 Identities = 316/588 (53%), Positives = 413/588 (70%), Gaps = 19/588 (3%)
 Frame = +2

Query: 62   FFPYCYSLNIHSKASPLPCHHQKSIIISCKNSTTNDRISSKIRVSAETRSI--------- 214
            F+      N +S    L   H +S ++S  NS  N         S   R++         
Sbjct: 15   FYSKTQKHNSNSSHGLLLLSHNRSSLLSFSNSNPNHDNVKSFSSSGAARNLQAATTQDAT 74

Query: 215  ----------HFQSLDFQETHFIKLLNRSCKAGKYSETLYFLECMVNKGYKPDVILCTKL 364
                      H  SL F++T  +K+ +RSC++G Y E+L+ LE MV KGY PDVILCTKL
Sbjct: 75   VPTERRQHQTHSHSLGFRDTQMLKIFHRSCRSGNYIESLHLLESMVRKGYNPDVILCTKL 134

Query: 365  VKGFLNARNVAKAVRVMAILESHGEPDVFAYNALISGFCKMNQIESANKVLHRMRNRGCL 544
            +KGF   RN+ KAVRVM ILE  G+PDVFAYNALI+GFCKMN+I+ A +VL RMR++G  
Sbjct: 135  IKGFFTLRNIPKAVRVMEILEKFGQPDVFAYNALINGFCKMNRIDDATRVLDRMRSKGFS 194

Query: 545  PDIVTYNILIGSLCSRGKLELAREVFNQFALYDCHPXXXXXXXXXXXXXXXXXXXEAXXX 724
            PD VTYNI+IGSLCSRGKL LA +V +Q    +C P                   EA   
Sbjct: 195  PDTVTYNIMIGSLCSRGKLVLALKVLDQLLSDNCQPTVITYTILIEATMLEGGVDEALKL 254

Query: 725  XXXXXXXXXQPDMYTYNAVVRGMCKEGMVDQAFDFIMNLAELGCEPDEISYNIVLRAFLS 904
                     +PDM+TYN ++RGMCKEGMVD+AF+ + NL   GCEPD ISYNI+LRA L+
Sbjct: 255  LDEMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLELRGCEPDVISYNILLRALLN 314

Query: 905  RGRWIDGERLIEKMISRGCEPNVVTYSILISSLCHDGKLKDAIAVLKTMMAKGLTPDAYS 1084
            +G+W +GE+L+ KM S  C+PNVVTYSILI++LC DGK+++A+ +LK M  KGL+PDAYS
Sbjct: 315  QGKWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIEEALNLLKLMKEKGLSPDAYS 374

Query: 1085 FDPVISALCKDGKLDLAIEFLDFMISNGCSPDIVNYNTILAALCKSGKAEEALEIFEKLS 1264
            +DP+I+A C++G+LDLAIEFL+ MIS+GC PDIVNYNT+LA LCK+GKA++ALEIF KL 
Sbjct: 375  YDPLIAAFCREGRLDLAIEFLETMISDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLG 434

Query: 1265 QTDSPPNVSSYNTMISGLWNSGDPKRALNMVSEMVDKGIDADQITYNSLISCLCRDAMVE 1444
            +    PN SSYNTM S LW+SGD  RAL+M+SEMV +GID D+ITYNS+ISCLCR+ MV+
Sbjct: 435  EVGCSPNSSSYNTMFSALWSSGDKIRALHMISEMVSQGIDPDEITYNSMISCLCREGMVD 494

Query: 1445 EAIELLVDMQNRGFKPTIISYNTMLLGLCKIHNIDTAIDILAVMVENGVRPNETSYILLI 1624
            EA +LLVDM++  F P++++YN +LLG CK H I+ AID+L  MV NG RPNE++Y +LI
Sbjct: 495  EAFDLLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAIDVLESMVGNGCRPNESTYTVLI 554

Query: 1625 EGMAYSGWRAEAMELANSLVSIGVVSKDFFKRLNRTFPMINVYKDLSN 1768
            EG+ ++G+RAEAMELAN LV I  +S+  FKRL+RTFP++NV +  S+
Sbjct: 555  EGIGFAGYRAEAMELANDLVRIDAISEHSFKRLHRTFPLLNVLQRSSH 602


>ref|NP_566237.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75207286|sp|Q9SR00.1|PP213_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At3g04760, chloroplastic; Flags: Precursor
            gi|6175176|gb|AAF04902.1|AC011437_17 hypothetical protein
            [Arabidopsis thaliana] gi|15810359|gb|AAL07067.1| unknown
            protein [Arabidopsis thaliana] gi|22136960|gb|AAM91709.1|
            unknown protein [Arabidopsis thaliana]
            gi|332640611|gb|AEE74132.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 602

 Score =  636 bits (1640), Expect = e-179
 Identities = 311/566 (54%), Positives = 408/566 (72%), Gaps = 16/566 (2%)
 Frame = +2

Query: 122  HQKSIIISCKNSTTND----------------RISSKIRVSAETRSIHFQSLDFQETHFI 253
            H +S +++  NS  N+                  ++   +  E R  H QSL F++T  +
Sbjct: 34   HNRSSLLTFSNSNPNNDNGRSFSSSGARNLQTTTTTDATLPTERRQQHSQSLGFRDTQML 93

Query: 254  KLLNRSCKAGKYSETLYFLECMVNKGYKPDVILCTKLVKGFLNARNVAKAVRVMAILESH 433
            K+ +RSC++G Y E+L+ LE MV KGY PDVILCTKL+KGF   RN+ KAVRVM ILE  
Sbjct: 94   KIFHRSCRSGNYIESLHLLETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKF 153

Query: 434  GEPDVFAYNALISGFCKMNQIESANKVLHRMRNRGCLPDIVTYNILIGSLCSRGKLELAR 613
            G+PDVFAYNALI+GFCKMN+I+ A +VL RMR++   PD VTYNI+IGSLCSRGKL+LA 
Sbjct: 154  GQPDVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLAL 213

Query: 614  EVFNQFALYDCHPXXXXXXXXXXXXXXXXXXXEAXXXXXXXXXXXXQPDMYTYNAVVRGM 793
            +V NQ    +C P                   EA            +PDM+TYN ++RGM
Sbjct: 214  KVLNQLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGM 273

Query: 794  CKEGMVDQAFDFIMNLAELGCEPDEISYNIVLRAFLSRGRWIDGERLIEKMISRGCEPNV 973
            CKEGMVD+AF+ + NL   GCEPD ISYNI+LRA L++G+W +GE+L+ KM S  C+PNV
Sbjct: 274  CKEGMVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNV 333

Query: 974  VTYSILISSLCHDGKLKDAIAVLKTMMAKGLTPDAYSFDPVISALCKDGKLDLAIEFLDF 1153
            VTYSILI++LC DGK+++A+ +LK M  KGLTPDAYS+DP+I+A C++G+LD+AIEFL+ 
Sbjct: 334  VTYSILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLET 393

Query: 1154 MISNGCSPDIVNYNTILAALCKSGKAEEALEIFEKLSQTDSPPNVSSYNTMISGLWNSGD 1333
            MIS+GC PDIVNYNT+LA LCK+GKA++ALEIF KL +    PN SSYNTM S LW+SGD
Sbjct: 394  MISDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGD 453

Query: 1334 PKRALNMVSEMVDKGIDADQITYNSLISCLCRDAMVEEAIELLVDMQNRGFKPTIISYNT 1513
              RAL+M+ EM+  GID D+ITYNS+ISCLCR+ MV+EA ELLVDM++  F P++++YN 
Sbjct: 454  KIRALHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNI 513

Query: 1514 MLLGLCKIHNIDTAIDILAVMVENGVRPNETSYILLIEGMAYSGWRAEAMELANSLVSIG 1693
            +LLG CK H I+ AI++L  MV NG RPNET+Y +LIEG+ ++G+RAEAMELAN LV I 
Sbjct: 514  VLLGFCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRID 573

Query: 1694 VVSKDFFKRLNRTFPMINVYKDLSNS 1771
             +S+  FKRL+RTFP++NV +  S +
Sbjct: 574  AISEYSFKRLHRTFPLLNVLQRSSQT 599


>dbj|BAD95034.1| hypothetical protein [Arabidopsis thaliana]
          Length = 602

 Score =  632 bits (1630), Expect = e-178
 Identities = 310/566 (54%), Positives = 407/566 (71%), Gaps = 16/566 (2%)
 Frame = +2

Query: 122  HQKSIIISCKNSTTND----------------RISSKIRVSAETRSIHFQSLDFQETHFI 253
            H +S +++  NS  N+                  ++   +  E R  H QSL F++T  +
Sbjct: 34   HNRSSLLTFSNSNPNNDNGRSFSSSGARNLQTTTTTDATLPTERRQQHSQSLGFRDTQML 93

Query: 254  KLLNRSCKAGKYSETLYFLECMVNKGYKPDVILCTKLVKGFLNARNVAKAVRVMAILESH 433
            K+ +RSC++G Y E+L+ LE MV KGY PDVILCTKL+KGF   RN+ KAVRVM ILE  
Sbjct: 94   KIFHRSCRSGNYIESLHLLETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKF 153

Query: 434  GEPDVFAYNALISGFCKMNQIESANKVLHRMRNRGCLPDIVTYNILIGSLCSRGKLELAR 613
            G+PDVFAYNALI+GFCKMN+I+ A +VL RMR++   PD VTYNI+IGSLCSRGKL+LA 
Sbjct: 154  GQPDVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLAL 213

Query: 614  EVFNQFALYDCHPXXXXXXXXXXXXXXXXXXXEAXXXXXXXXXXXXQPDMYTYNAVVRGM 793
            +V NQ    +C P                   EA            +PDM+TYN ++RGM
Sbjct: 214  KVLNQLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGM 273

Query: 794  CKEGMVDQAFDFIMNLAELGCEPDEISYNIVLRAFLSRGRWIDGERLIEKMISRGCEPNV 973
            CKEGMVD+AF+ + NL   G EPD ISYNI+LRA L++G+W +GE+L+ KM S  C+PNV
Sbjct: 274  CKEGMVDRAFEMVRNLELKGSEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNV 333

Query: 974  VTYSILISSLCHDGKLKDAIAVLKTMMAKGLTPDAYSFDPVISALCKDGKLDLAIEFLDF 1153
            VTYSILI++LC DGK+++A+ +LK M  KGLTPDAYS+DP+I+A C++G+LD+AIEFL+ 
Sbjct: 334  VTYSILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLET 393

Query: 1154 MISNGCSPDIVNYNTILAALCKSGKAEEALEIFEKLSQTDSPPNVSSYNTMISGLWNSGD 1333
            MIS+GC PDIVNYNT+LA LCK+GKA++ALEIF KL +    PN SSYNTM S LW+SGD
Sbjct: 394  MISDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGD 453

Query: 1334 PKRALNMVSEMVDKGIDADQITYNSLISCLCRDAMVEEAIELLVDMQNRGFKPTIISYNT 1513
              RAL+M+ EM+  GID D+ITYNS+ISCLCR+ MV+EA ELLVDM++  F P++++YN 
Sbjct: 454  KIRALHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNI 513

Query: 1514 MLLGLCKIHNIDTAIDILAVMVENGVRPNETSYILLIEGMAYSGWRAEAMELANSLVSIG 1693
            +LLG CK H I+ AI++L  MV NG RPNET+Y +LIEG+ ++G+RAEAMELAN LV I 
Sbjct: 514  VLLGFCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRID 573

Query: 1694 VVSKDFFKRLNRTFPMINVYKDLSNS 1771
             +S+  FKRL+RTFP++NV +  S +
Sbjct: 574  AISEYSFKRLHRTFPLLNVLQRSSQT 599


>ref|XP_004236781.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Solanum lycopersicum]
          Length = 575

 Score =  628 bits (1620), Expect = e-177
 Identities = 312/566 (55%), Positives = 407/566 (71%), Gaps = 3/566 (0%)
 Frame = +2

Query: 77   YSLNIHSKASPLPCHHQKSIIISCKNSTTNDRISSKIRVSAETRSIHFQSLDFQETHFIK 256
            +S N+  K+     +      IS + S  N R   ++++S+E R           +H +K
Sbjct: 13   FSNNLKPKSESSKHNFVVRCSISNEESRVNIRNPQRVKISSENRG----------SHDMK 62

Query: 257  LLNRSCKAGKYSETLYFLECMVNKGYKPDVILCTKLVKGFLNARNVAKAVRVMAILESHG 436
            +LN SCK GKY ETLY LEC V  GYKPDVILCTKL+KGF N++N  K V+VM ILE  G
Sbjct: 63   VLNWSCKVGKYDETLYLLECKVKSGYKPDVILCTKLIKGFFNSKNSDKGVKVMQILEQFG 122

Query: 437  EPDVFAYNALISGFCKMNQIESANKVLHRMRNRGCLPDIVTYNILIGSLCSRGKLELARE 616
            EPDVFAYNAL+SGFCKMN+IE ANKVL+RM+  G  PD VTYNILIGSLC RGKL  A  
Sbjct: 123  EPDVFAYNALVSGFCKMNKIEEANKVLNRMKTHGFPPDSVTYNILIGSLCDRGKLGSALM 182

Query: 617  VFNQFAL-YDCHPXXXXXXXXXXXXXXXXXXXEAXXXXXXXXXXXXQPDMYTYNAVVRGM 793
            + +Q    ++C P                   EA            QPDMYTYNA++RGM
Sbjct: 183  LLDQLKEEHNCKPTVITYTILIEATILEGGIHEAMKLLDEMLSIGLQPDMYTYNAIIRGM 242

Query: 794  CKEGMVDQAFDFIMNLAELGCEPDEISYNIVLRAFLS-RGRWIDGERLIEKMISRGCEPN 970
            C+E M+DQA++F+ +L   GC+PD ISYNI+LRA L  RG+W DGE+L+ +M+  GCEPN
Sbjct: 243  CREKMMDQAYEFVRSLPSKGCKPDVISYNILLRALLHHRGKWSDGEKLMNEMLCAGCEPN 302

Query: 971  VVTYSILISSLCHDGKLKDAIAVLKTMMAKGLTPDAYSFDPVISALCKDGKLDLAIEFLD 1150
            VVTYSIL+S+LC DGKL +AI +LK M+ KGLTPD +++DP+ISA CK G+LD+AI+FLD
Sbjct: 303  VVTYSILMSALCRDGKLDEAINLLKIMVDKGLTPDTFTYDPLISAFCKGGRLDMAIKFLD 362

Query: 1151 FMISNGCSPDIVNYNTILAALCKSGKAEEALEIFEKLSQTDSPPNVSSYNTMISGLWNSG 1330
            +MI+NGC PDIVNYNTIL+ +CK GKA+EA+E+FEKL++   PP+VS+YNT++S LWN+G
Sbjct: 363  YMITNGCLPDIVNYNTILSTMCKKGKADEAMEVFEKLAEIGCPPDVSTYNTLMSALWNNG 422

Query: 1331 DPKRALNMVSEMVDKGIDADQITYNSLISCLCRDAMVEEAIELLVDMQNRGFKPTIISYN 1510
               RAL MVSEM++KG+D D+ITYN+LISCLCRD MV EA++LL DM+  GF PT+I+YN
Sbjct: 423  GRARALKMVSEMIEKGVDPDEITYNALISCLCRDGMVNEALDLLGDMEGNGFPPTVITYN 482

Query: 1511 TMLLGLCKIHNIDTAIDILAVMVENGVRPNETSYILLIEGMAYSGWRAEAMELANSLVSI 1690
             +LLGLCK H +  AI++LA MVE G RPNET+YILLIEG+ +SG R +AME+A ++   
Sbjct: 483  ILLLGLCKAHRVVEAIEVLAEMVEKGCRPNETTYILLIEGIGFSGRRVQAMEMATAIYHK 542

Query: 1691 GVVSKDFFKRLNRTFPMINVY-KDLS 1765
              +SK+  +RL +TF + +VY KD++
Sbjct: 543  NAISKESLQRLRKTFQVPDVYSKDIT 568


>ref|XP_007144014.1| hypothetical protein PHAVU_007G121900g [Phaseolus vulgaris]
            gi|561017204|gb|ESW16008.1| hypothetical protein
            PHAVU_007G121900g [Phaseolus vulgaris]
          Length = 570

 Score =  610 bits (1574), Expect = e-172
 Identities = 303/573 (52%), Positives = 405/573 (70%), Gaps = 10/573 (1%)
 Frame = +2

Query: 41   MAVLYTEFFPYCYSLNIHSKASPLPCHHQKSIIISCKNSTTNDRISSKIR---------- 190
            M  + TEF  +  SL  +SK +    H + + +I+C+    N+   SK +          
Sbjct: 1    MTTVSTEFLSHTLSLRTNSKGA---WHPKPNTVITCRIPVLNEDNPSKRKNNYNKGNGRV 57

Query: 191  VSAETRSIHFQSLDFQETHFIKLLNRSCKAGKYSETLYFLECMVNKGYKPDVILCTKLVK 370
             S++TR  H+   DF++TH ++ LNR C+ GKY+E LYFLE MV +GYKPDVILCTKL+K
Sbjct: 58   SSSDTRPRHY---DFRDTHHMRALNRLCRTGKYTEALYFLEQMVKRGYKPDVILCTKLIK 114

Query: 371  GFLNARNVAKAVRVMAILESHGEPDVFAYNALISGFCKMNQIESANKVLHRMRNRGCLPD 550
            G   ++   KAV+VM ILE HG+PD FAYNA+ISGFC+ ++ ++AN VL RM+NRG  PD
Sbjct: 115  GLFTSKKTEKAVQVMEILEQHGDPDAFAYNAVISGFCRSDRFDAANGVLLRMKNRGFSPD 174

Query: 551  IVTYNILIGSLCSRGKLELAREVFNQFALYDCHPXXXXXXXXXXXXXXXXXXXEAXXXXX 730
            +VTYNILIGSLC+RGKL+LA +V +Q    +C+P                   +A     
Sbjct: 175  VVTYNILIGSLCARGKLDLAMKVMDQLMKDNCNPTVITYTILIEATIIHGVIDKAMKLLD 234

Query: 731  XXXXXXXQPDMYTYNAVVRGMCKEGMVDQAFDFIMNLAELGCEPDEISYNIVLRAFLSRG 910
                   QPDMYTYN +VRGMCK G+VD+AF+F+ NL+     P    YN+VL+  L+ G
Sbjct: 235  EMVSRGLQPDMYTYNVIVRGMCKRGLVDRAFEFVCNLSTT---PSLNLYNLVLKGLLNEG 291

Query: 911  RWIDGERLIEKMISRGCEPNVVTYSILISSLCHDGKLKDAIAVLKTMMAKGLTPDAYSFD 1090
            RW  GERL+  M+ +GCEPNVVTYS+LI+SLC DGK  +A+ +LK M  KGL+PDAY +D
Sbjct: 292  RWKTGERLMSDMMVKGCEPNVVTYSVLINSLCRDGKTGEAVDLLKVMKEKGLSPDAYCYD 351

Query: 1091 PVISALCKDGKLDLAIEFLDFMISNGCSPDIVNYNTILAALCKSGKAEEALEIFEKLSQT 1270
            P+ISA CK+GK+DLAI F+D M+S G  PDI+NYNTI+ +LCK G+ +EAL IF+KL + 
Sbjct: 352  PLISAFCKEGKVDLAIGFVDDMVSAGWLPDIINYNTIMGSLCKKGRGDEALSIFKKLDEV 411

Query: 1271 DSPPNVSSYNTMISGLWNSGDPKRALNMVSEMVDKGIDADQITYNSLISCLCRDAMVEEA 1450
              PPNVSSYNTM+  LW+SGD  RAL MV EM++ G+D D+ITYNSLISCLCRD MV+EA
Sbjct: 412  GCPPNVSSYNTMLGALWSSGDKIRALRMVLEMLNNGLDPDRITYNSLISCLCRDGMVDEA 471

Query: 1451 IELLVDMQNRGFKPTIISYNTMLLGLCKIHNIDTAIDILAVMVENGVRPNETSYILLIEG 1630
            I LLVDM+   ++PT+ISYN +LLGLCK H I  AI++LAVMV+NG +PN+T+Y LL+EG
Sbjct: 472  IGLLVDMERSEWQPTVISYNIVLLGLCKAHRIVDAIEVLAVMVDNGCQPNQTTYTLLVEG 531

Query: 1631 MAYSGWRAEAMELANSLVSIGVVSKDFFKRLNR 1729
            ++Y+GW ++A+ELA SL S+  +S+D F+RLN+
Sbjct: 532  ISYAGWPSDAVELAKSLSSMKAISQDLFRRLNK 564


>ref|XP_003555568.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Glycine max]
          Length = 576

 Score =  608 bits (1568), Expect = e-171
 Identities = 307/576 (53%), Positives = 401/576 (69%), Gaps = 13/576 (2%)
 Frame = +2

Query: 41   MAVLYTEFFPYCYSLNIHSKASPLPCHHQKSIIISCKNSTTNDRISSKIRV--------- 193
            MA + +EF  +C  L  +SK + LP  +  + +I+C+    N+   SK R+         
Sbjct: 1    MATVSSEFLSHCLPLGTNSKRAWLP--NPSNTVITCRIPLLNEDNPSKRRLNNNNNNKGH 58

Query: 194  ----SAETRSIHFQSLDFQETHFIKLLNRSCKAGKYSETLYFLECMVNKGYKPDVILCTK 361
                S++TR    Q  DF++TH +K LNR CK GKY+E LYFLE MV +GYKPDVILCTK
Sbjct: 59   TRVTSSDTRPQQ-QHYDFRDTHHMKALNRLCKTGKYTEALYFLEQMVKRGYKPDVILCTK 117

Query: 362  LVKGFLNARNVAKAVRVMAILESHGEPDVFAYNALISGFCKMNQIESANKVLHRMRNRGC 541
            L+KG   ++   KAVRVM ILE +G+PD FAYNA+ISGFC+ ++ ++AN+V+ RM+ RG 
Sbjct: 118  LIKGLFTSKRTEKAVRVMEILEQYGDPDSFAYNAVISGFCRSDRFDAANRVILRMKYRGF 177

Query: 542  LPDIVTYNILIGSLCSRGKLELAREVFNQFALYDCHPXXXXXXXXXXXXXXXXXXXEAXX 721
             PD+VTYNILIGSLC+RGKL+LA +V +Q    +C+P                   +A  
Sbjct: 178  SPDVVTYNILIGSLCARGKLDLALKVMDQLLEDNCNPTVITYTILIEATIIHGSIDDAMR 237

Query: 722  XXXXXXXXXXQPDMYTYNAVVRGMCKEGMVDQAFDFIMNLAELGCEPDEISYNIVLRAFL 901
                      QPDMYTYN +VRGMCK G+VD+AF+F+ NL      P    YN++L+  L
Sbjct: 238  LLDEMMSRGLQPDMYTYNVIVRGMCKRGLVDRAFEFVSNL---NTTPSLNLYNLLLKGLL 294

Query: 902  SRGRWIDGERLIEKMISRGCEPNVVTYSILISSLCHDGKLKDAIAVLKTMMAKGLTPDAY 1081
            + GRW  GERL+  MI +GCEPN+VTYS+LISSLC DGK  +A+ VL+ M  KGL PDAY
Sbjct: 295  NEGRWEAGERLMSDMIVKGCEPNIVTYSVLISSLCRDGKAGEAVDVLRVMKEKGLNPDAY 354

Query: 1082 SFDPVISALCKDGKLDLAIEFLDFMISNGCSPDIVNYNTILAALCKSGKAEEALEIFEKL 1261
             +DP+ISA CK+GK+DLAI F+D MIS G  PDIVNYNTI+ +LCK G+A+EAL IF+KL
Sbjct: 355  CYDPLISAFCKEGKVDLAIGFVDDMISAGWLPDIVNYNTIMGSLCKKGRADEALNIFKKL 414

Query: 1262 SQTDSPPNVSSYNTMISGLWNSGDPKRALNMVSEMVDKGIDADQITYNSLISCLCRDAMV 1441
             +   PPN SSYNTM   LW+SGD  RAL M+ EM+  G+D D+ITYNSLIS LCRD MV
Sbjct: 415  EEVGCPPNASSYNTMFGALWSSGDKIRALTMILEMLSNGVDPDRITYNSLISSLCRDGMV 474

Query: 1442 EEAIELLVDMQNRGFKPTIISYNTMLLGLCKIHNIDTAIDILAVMVENGVRPNETSYILL 1621
            +EAI LLVDM+   ++PT+ISYN +LLGLCK H I  AI++LAVMV+NG +PNET+Y LL
Sbjct: 475  DEAIGLLVDMERTEWQPTVISYNIVLLGLCKAHRIVDAIEVLAVMVDNGCQPNETTYTLL 534

Query: 1622 IEGMAYSGWRAEAMELANSLVSIGVVSKDFFKRLNR 1729
            +EG+ Y+GWR+ A+ELA SLVS+  +S+D F+RL +
Sbjct: 535  VEGVGYAGWRSYAVELAKSLVSMNAISQDLFRRLQK 570


>ref|XP_003590960.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355480008|gb|AES61211.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 590

 Score =  608 bits (1567), Expect = e-171
 Identities = 302/592 (51%), Positives = 405/592 (68%), Gaps = 15/592 (2%)
 Frame = +2

Query: 41   MAVLYTEFFPYCYSLNIHSKASPLPCHHQKSIIISC--------------KNSTTNDRIS 178
            M    TEF  +  +  IH+ +   P     +III+               +   TN+   
Sbjct: 1    MTTFSTEFLSHTLNFRIHTSSHSKP----NTIIITSSILFLNEANNNNNKRRRRTNNNEQ 56

Query: 179  SKIRVSAETRSIHFQSLDFQETHFIKLLNRSCKAGKYSETLYFLECMVNKGYKPDVILCT 358
             + RV+    + H Q  DF++T+F+K LNRSCK+ KY E+LYFL+ MVN+GYKPDVILCT
Sbjct: 57   QQFRVNETKPTKHDQDYDFRDTNFMKTLNRSCKSAKYDESLYFLQHMVNRGYKPDVILCT 116

Query: 359  KLVKGFLNARNVAKAVRVMAILESHGEPDVFAYNALISGFCKMNQIESANKVLHRMRNRG 538
            KL+KGF N + + KA++VM ILE HG+PDVFAYNA+ISGFCK ++++ A+KVL RM+ RG
Sbjct: 117  KLIKGFFNMKKIEKAIQVMEILEKHGKPDVFAYNAVISGFCKADRVDHASKVLDRMKKRG 176

Query: 539  CLPDIVTYNILIGSLCSRGKLELAREVFNQFALYDCHPXXXXXXXXXXXXXXXXXXXEAX 718
              PD+VTYNILIG+ C RG+L+LA  V +Q    +C P                   EA 
Sbjct: 177  FEPDVVTYNILIGNFCGRGRLDLALRVMDQLLKDNCKPTVITYTILIEATITQGGIDEAM 236

Query: 719  XXXXXXXXXXXQPDMYTYNAVVRGMCKEGMVDQAFDFIMNLAELGCEPDEISYNIVLRAF 898
                       +PD YTYN VV GMCKEGM+D+AF+F+  +++ GC     +YNI+LR  
Sbjct: 237  KLLDEMLSRGLRPDRYTYNVVVNGMCKEGMLDRAFEFLSRISKNGCVAGVSTYNILLRDL 296

Query: 899  LSRGRWIDGERLIEKMISRGCEPNVVTYSILISSLCHDGKLKDAIAVLKTMMAKGLTPDA 1078
            L+ G+W  GE+L+  M+ +GCEPN +TYS LI++LC DGK+ +A  VLK M  K L PD 
Sbjct: 297  LNEGKWEYGEKLMSDMLVKGCEPNPITYSTLITALCRDGKIDEAKNVLKVMKEKALAPDG 356

Query: 1079 YSFDPVISALCKDGKLDLAIEFLDFMISNGCSPDIVNYNTILAALCKSGKAEEALEIFEK 1258
            YS+DP+ISALC++GK+DLAIEFLD MIS G  PDI++YN+ILA+LCK+G A+EAL IFEK
Sbjct: 357  YSYDPLISALCREGKVDLAIEFLDDMISGGHLPDILSYNSILASLCKNGNADEALNIFEK 416

Query: 1259 LSQTDSPPNVSSYNTMISGLWNSGDPKRALNMVSEMVDKGIDADQITYNSLISCLCRDAM 1438
            L +   PPN  SYNT+   LW+SGD  RAL M+ EM+  GID D+ITYNSLISCLCRD +
Sbjct: 417  LGEVGCPPNAGSYNTLFGALWSSGDKIRALGMILEMLSNGIDPDEITYNSLISCLCRDGL 476

Query: 1439 VEEAIELLVDM-QNRGFKPTIISYNTMLLGLCKIHNIDTAIDILAVMVENGVRPNETSYI 1615
            V++AIELLVDM ++   +PT+ISYNT+LLGLCK+  I  AI++LA MV  G  PNET+Y 
Sbjct: 477  VDQAIELLVDMFESEKCQPTVISYNTVLLGLCKVQRIIDAIEVLAAMVNEGCLPNETTYT 536

Query: 1616 LLIEGMAYSGWRAEAMELANSLVSIGVVSKDFFKRLNRTFPMINVYKDLSNS 1771
            LLI+G+ ++GWR +AMELAN LV++  +S+D FKR  + FP+ + +K+L+ S
Sbjct: 537  LLIQGIGFAGWRYDAMELANLLVNMDAISEDSFKRFQKIFPVFDAHKELALS 588

