BLASTX nr result

ID: Magnolia22_contig00015381

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Magnolia22_contig00015381
         (1443 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_006478887.1 PREDICTED: pentatricopeptide repeat-containing pr...   336   e-110
XP_010274657.1 PREDICTED: pentatricopeptide repeat-containing pr...   335   e-108
KDO50539.1 hypothetical protein CISIN_1g047178mg [Citrus sinensis]    332   e-108
XP_006443149.1 hypothetical protein CICLE_v10021498mg [Citrus cl...   332   e-108
XP_008804661.1 PREDICTED: pentatricopeptide repeat-containing pr...   331   e-107
XP_008804662.1 PREDICTED: pentatricopeptide repeat-containing pr...   330   e-107
CBI39461.3 unnamed protein product, partial [Vitis vinifera]          326   e-106
XP_002269673.1 PREDICTED: pentatricopeptide repeat-containing pr...   326   e-106
XP_011015613.1 PREDICTED: pentatricopeptide repeat-containing pr...   325   e-106
XP_012483420.1 PREDICTED: pentatricopeptide repeat-containing pr...   325   e-105
XP_011007631.1 PREDICTED: pentatricopeptide repeat-containing pr...   324   e-105
XP_017608435.1 PREDICTED: pentatricopeptide repeat-containing pr...   323   e-105
XP_002526313.1 PREDICTED: pentatricopeptide repeat-containing pr...   323   e-105
XP_018842584.1 PREDICTED: pentatricopeptide repeat-containing pr...   322   e-104
XP_006373907.1 hypothetical protein POPTR_0016s10300g [Populus t...   320   e-103
XP_017984164.1 PREDICTED: pentatricopeptide repeat-containing pr...   318   e-103
XP_007048491.1 PREDICTED: pentatricopeptide repeat-containing pr...   317   e-102
XP_020102208.1 pentatricopeptide repeat-containing protein At4g2...   320   e-102
XP_015572754.1 PREDICTED: pentatricopeptide repeat-containing pr...   315   e-102
GAV62919.1 hypothetical protein CFOL_v3_06441 [Cephalotus follic...   316   e-102

>XP_006478887.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic isoform X1 [Citrus sinensis]
          Length = 288

 Score =  336 bits (862), Expect = e-110
 Identities = 164/251 (65%), Positives = 205/251 (81%), Gaps = 1/251 (0%)
 Frame = +2

Query: 302  SQKIVGDECNSRAFTSSEVRHGNQAKDQY-SKNARDLRNFQIGENVSRKDKMNFLVKTLF 478
            S +I+G    + + +S E +  NQ+ DQY  +NA   RNF+IGENV RKDK+NFLV TL 
Sbjct: 28   SNQIIG---KAMSMSSLEGQRTNQSVDQYPERNAASTRNFRIGENVPRKDKINFLVNTLL 84

Query: 479  DLKDSKEAVYSALDAWVAWEQNFPIVSLKRALITLEKEEQWHRIIQVLKWMLSKGQGTTR 658
            DLK+SKE VY  LDAWVAWEQNFP+ SLK+AL+ LEKE+QWHR++QV+KWMLSKGQG+T 
Sbjct: 85   DLKNSKEDVYGTLDAWVAWEQNFPVGSLKKALLALEKEQQWHRVVQVIKWMLSKGQGSTM 144

Query: 659  GTYAQLIRALDKDRRAEEAHNLWVKKIGHDLHSVPWQLCHLMISIYYRNNMPERLVKLFK 838
            GT  QLIRALD D RAEEAH  W K+IG DLHSVPWQLC  MI+IYYRNNM ERL+KLFK
Sbjct: 145  GTCGQLIRALDMDHRAEEAHKFWEKRIGIDLHSVPWQLCKSMIAIYYRNNMLERLIKLFK 204

Query: 839  GLEAFDRQPPDKSIVQKVADAYEMLGLMEEQKRVVEKYKDLFAKSDQGNSKKSRRASLKT 1018
            GLEAFDR+PP+KSIVQ+VADAYE+LGL+EE++RV+EKYKDLF + ++ ++KKS+ +S+K 
Sbjct: 205  GLEAFDRKPPEKSIVQRVADAYEVLGLLEEKERVLEKYKDLFTEKEKRSNKKSKSSSMKG 264

Query: 1019 EKKEKRKQGRP 1051
            +KK+ R +  P
Sbjct: 265  KKKKGRIRDTP 275


>XP_010274657.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic [Nelumbo nucifera] XP_010274658.1
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g18975, chloroplastic [Nelumbo nucifera]
            XP_010274659.1 PREDICTED: pentatricopeptide
            repeat-containing protein At4g18975, chloroplastic
            [Nelumbo nucifera] XP_010274660.1 PREDICTED:
            pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic [Nelumbo nucifera] XP_010274661.1
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g18975, chloroplastic [Nelumbo nucifera]
          Length = 346

 Score =  335 bits (858), Expect = e-108
 Identities = 164/258 (63%), Positives = 202/258 (78%), Gaps = 1/258 (0%)
 Frame = +2

Query: 314  VGDECNSRAFTSSEVRHGNQAK-DQYSKNARDLRNFQIGENVSRKDKMNFLVKTLFDLKD 490
            + ++C+  A  SSEV+ GNQ K +  S +   +  FQIGENVS+KDK+ FLV TL DLKD
Sbjct: 85   IPNDCSRGAIISSEVQLGNQTKHEDLSDDNLHIHKFQIGENVSKKDKIKFLVNTLSDLKD 144

Query: 491  SKEAVYSALDAWVAWEQNFPIVSLKRALITLEKEEQWHRIIQVLKWMLSKGQGTTRGTYA 670
            SKEA+Y ALDAWVAWE+NFPI SLK+ L+ LEKE+QWHR+IQV+KWMLSKGQG T GTY 
Sbjct: 145  SKEAIYGALDAWVAWERNFPIASLKQVLLALEKEQQWHRVIQVIKWMLSKGQGNTLGTYR 204

Query: 671  QLIRALDKDRRAEEAHNLWVKKIGHDLHSVPWQLCHLMISIYYRNNMPERLVKLFKGLEA 850
            QLIRALD D RAEEAHN W+KKIG DLHSVPWQLC LMISIYYRNNM +RLVKLFKGLEA
Sbjct: 205  QLIRALDMDHRAEEAHNFWMKKIGTDLHSVPWQLCSLMISIYYRNNMLDRLVKLFKGLEA 264

Query: 851  FDRQPPDKSIVQKVADAYEMLGLMEEQKRVVEKYKDLFAKSDQGNSKKSRRASLKTEKKE 1030
            FDR+PP+K+IVQKVADAYE+LG  EE++R+++KY  LF ++ +G  K+S++AS K  +K 
Sbjct: 265  FDRKPPEKAIVQKVADAYEILGRPEEKERILDKYNHLFTETWKGKPKRSQKASQKKTRKS 324

Query: 1031 KRKQGRPQASQDKCLVQD 1084
              ++    +   K  V D
Sbjct: 325  GERKNTDTSDNLKPAVDD 342


>KDO50539.1 hypothetical protein CISIN_1g047178mg [Citrus sinensis]
          Length = 287

 Score =  332 bits (851), Expect = e-108
 Identities = 161/247 (65%), Positives = 202/247 (81%), Gaps = 1/247 (0%)
 Frame = +2

Query: 302  SQKIVGDECNSRAFTSSEVRHGNQAKDQY-SKNARDLRNFQIGENVSRKDKMNFLVKTLF 478
            S +I+G    + + +S E +  NQ+ DQY  +NA   RNF+IGENV RKDK+NFLV TL 
Sbjct: 28   SNQIIG---KAMSMSSLEGQRTNQSVDQYPERNAASTRNFRIGENVPRKDKINFLVNTLL 84

Query: 479  DLKDSKEAVYSALDAWVAWEQNFPIVSLKRALITLEKEEQWHRIIQVLKWMLSKGQGTTR 658
            DLK+SKE VY  LDAWVAWEQNFP+ SLK+AL+ LEKE+QWHR++QV+KWMLSKGQG+T 
Sbjct: 85   DLKNSKEDVYGTLDAWVAWEQNFPVGSLKKALLALEKEQQWHRVVQVIKWMLSKGQGSTM 144

Query: 659  GTYAQLIRALDKDRRAEEAHNLWVKKIGHDLHSVPWQLCHLMISIYYRNNMPERLVKLFK 838
            GT  QLIRALD D RAEEAH  W K+IG DLHSVPWQLC  MI+IYYRNNM ERL+KLFK
Sbjct: 145  GTCGQLIRALDMDHRAEEAHKFWEKRIGIDLHSVPWQLCKSMIAIYYRNNMLERLIKLFK 204

Query: 839  GLEAFDRQPPDKSIVQKVADAYEMLGLMEEQKRVVEKYKDLFAKSDQGNSKKSRRASLKT 1018
            GLEAFDR+PP+KSIVQ+VADAYE+LGL+EE++RV+EKYKDLF + ++ ++KKS+ +S+K 
Sbjct: 205  GLEAFDRKPPEKSIVQRVADAYEVLGLLEEKERVLEKYKDLFTEKEKRSNKKSKSSSMKG 264

Query: 1019 EKKEKRK 1039
            +K  + +
Sbjct: 265  KKSGRTR 271


>XP_006443149.1 hypothetical protein CICLE_v10021498mg [Citrus clementina]
            XP_006478888.1 PREDICTED: pentatricopeptide
            repeat-containing protein At4g18975, chloroplastic
            isoform X2 [Citrus sinensis] ESR56389.1 hypothetical
            protein CICLE_v10021498mg [Citrus clementina]
          Length = 287

 Score =  332 bits (850), Expect = e-108
 Identities = 161/242 (66%), Positives = 200/242 (82%), Gaps = 1/242 (0%)
 Frame = +2

Query: 302  SQKIVGDECNSRAFTSSEVRHGNQAKDQY-SKNARDLRNFQIGENVSRKDKMNFLVKTLF 478
            S +I+G    + + +S E +  NQ+ DQY  +NA   RNF+IGENV RKDK+NFLV TL 
Sbjct: 28   SNQIIG---KAMSMSSLEGQRTNQSVDQYPERNAASTRNFRIGENVPRKDKINFLVNTLL 84

Query: 479  DLKDSKEAVYSALDAWVAWEQNFPIVSLKRALITLEKEEQWHRIIQVLKWMLSKGQGTTR 658
            DLK+SKE VY  LDAWVAWEQNFP+ SLK+AL+ LEKE+QWHR++QV+KWMLSKGQG+T 
Sbjct: 85   DLKNSKEDVYGTLDAWVAWEQNFPVGSLKKALLALEKEQQWHRVVQVIKWMLSKGQGSTM 144

Query: 659  GTYAQLIRALDKDRRAEEAHNLWVKKIGHDLHSVPWQLCHLMISIYYRNNMPERLVKLFK 838
            GT  QLIRALD D RAEEAH  W K+IG DLHSVPWQLC  MI+IYYRNNM ERL+KLFK
Sbjct: 145  GTCGQLIRALDMDHRAEEAHKFWEKRIGIDLHSVPWQLCKSMIAIYYRNNMLERLIKLFK 204

Query: 839  GLEAFDRQPPDKSIVQKVADAYEMLGLMEEQKRVVEKYKDLFAKSDQGNSKKSRRASLKT 1018
            GLEAFDR+PP+KSIVQ+VADAYE+LGL+EE++RV+EKYKDLF + ++ ++KKS+ +S+K 
Sbjct: 205  GLEAFDRKPPEKSIVQRVADAYEVLGLLEEKERVLEKYKDLFTEKEKRSNKKSKSSSMKG 264

Query: 1019 EK 1024
            +K
Sbjct: 265  KK 266


>XP_008804661.1 PREDICTED: pentatricopeptide repeat-containing protein At4g21190
            isoform X1 [Phoenix dactylifera]
          Length = 317

 Score =  331 bits (848), Expect = e-107
 Identities = 170/307 (55%), Positives = 213/307 (69%), Gaps = 11/307 (3%)
 Frame = +2

Query: 182  SRNILHRINIDRH-----LRCEIEVSRCCYCYSTGLVPFQNEDSDSQKIVGDECNSRAFT 346
            S  ++ + NI  H      R E+    CC  Y+T    FQ+  S+  +IV D+   +   
Sbjct: 11   SLQLIIKSNIGEHSWLWSARTELVAGHCCSSYATSTFGFQSRYSNDSRIVEDQVFEQRGL 70

Query: 347  SSEV--RHGNQAKDQYSKN-ARDLRNFQIGENVSRKDKMNFLVKTLFDLKDSKEAVYSAL 517
             ++    H N   +Q  ++    L   QIG+N+S  +K  FL+ TL DLK+SKEAVY  L
Sbjct: 71   KAKFPSEHDNSMVNQKQRSDIEPLPRQQIGKNISSAEKAKFLINTLLDLKNSKEAVYGTL 130

Query: 518  DAWVAWEQNFPIVSLKRALITLEKEEQWHRIIQVLKWMLSKGQGTTRGTYAQLIRALDKD 697
            DAWVAWEQNFP+  LKRALI LEK+EQWHR++QV+KWMLSKGQGTT GTY QLIRAL+KD
Sbjct: 131  DAWVAWEQNFPLAMLKRALIVLEKQEQWHRVVQVVKWMLSKGQGTTMGTYEQLIRALEKD 190

Query: 698  RRAEEAHNLWVKKIGHDLHSVPWQLCHLMISIYYRNNMPERLVKLFKGLEAFDRQPPDKS 877
             RAEEAH +WVKKIGHDLHSVPW+ C LM+SIYYRNNM ERLVKLFKGLE FDR+PP KS
Sbjct: 191  NRAEEAHKIWVKKIGHDLHSVPWRFCDLMLSIYYRNNMLERLVKLFKGLEEFDRKPPKKS 250

Query: 878  IVQKVADAYEMLGLMEEQKRVVEKYKDLFAKSDQGNSK---KSRRASLKTEKKEKRKQGR 1048
            IV+KVADAYE+LGL+EE+ +++EKY  LF KS +  S+   KS++AS K +KK   +   
Sbjct: 251  IVRKVADAYELLGLLEEKNKLLEKYSHLFIKSSEERSRKSQKSKKASRKNDKKTGTETNE 310

Query: 1049 PQASQDK 1069
               S DK
Sbjct: 311  SIESSDK 317


>XP_008804662.1 PREDICTED: pentatricopeptide repeat-containing protein At4g21190
            isoform X2 [Phoenix dactylifera]
          Length = 315

 Score =  330 bits (846), Expect = e-107
 Identities = 164/295 (55%), Positives = 210/295 (71%), Gaps = 8/295 (2%)
 Frame = +2

Query: 182  SRNILHRINIDRH-----LRCEIEVSRCCYCYSTGLVPFQNEDSDSQKIVGDECNSRAFT 346
            S  ++ + NI  H      R E+    CC  Y+T    FQ+  S+  +IV D+   +   
Sbjct: 11   SLQLIIKSNIGEHSWLWSARTELVAGHCCSSYATSTFGFQSRYSNDSRIVEDQVFEQRGL 70

Query: 347  SSEV--RHGNQAKDQYSKN-ARDLRNFQIGENVSRKDKMNFLVKTLFDLKDSKEAVYSAL 517
             ++    H N   +Q  ++    L   QIG+N+S  +K  FL+ TL DLK+SKEAVY  L
Sbjct: 71   KAKFPSEHDNSMVNQKQRSDIEPLPRQQIGKNISSAEKAKFLINTLLDLKNSKEAVYGTL 130

Query: 518  DAWVAWEQNFPIVSLKRALITLEKEEQWHRIIQVLKWMLSKGQGTTRGTYAQLIRALDKD 697
            DAWVAWEQNFP+  LKRALI LEK+EQWHR++QV+KWMLSKGQGTT GTY QLIRAL+KD
Sbjct: 131  DAWVAWEQNFPLAMLKRALIVLEKQEQWHRVVQVVKWMLSKGQGTTMGTYEQLIRALEKD 190

Query: 698  RRAEEAHNLWVKKIGHDLHSVPWQLCHLMISIYYRNNMPERLVKLFKGLEAFDRQPPDKS 877
             RAEEAH +WVKKIGHDLHSVPW+ C LM+SIYYRNNM ERLVKLFKGLE FDR+PP KS
Sbjct: 191  NRAEEAHKIWVKKIGHDLHSVPWRFCDLMLSIYYRNNMLERLVKLFKGLEEFDRKPPKKS 250

Query: 878  IVQKVADAYEMLGLMEEQKRVVEKYKDLFAKSDQGNSKKSRRASLKTEKKEKRKQ 1042
            IV+KVADAYE+LGL+EE+ +++EKY  LF KS +  S+KS+++   + K +K+ +
Sbjct: 251  IVRKVADAYELLGLLEEKNKLLEKYSHLFIKSSEERSRKSQKSKKASRKNDKKTE 305


>CBI39461.3 unnamed protein product, partial [Vitis vinifera]
          Length = 296

 Score =  326 bits (836), Expect = e-106
 Identities = 164/257 (63%), Positives = 192/257 (74%), Gaps = 5/257 (1%)
 Frame = +2

Query: 284  QNEDSDSQKI-----VGDECNSRAFTSSEVRHGNQAKDQYSKNARDLRNFQIGENVSRKD 448
            Q + SD+  +     +G +CN++                  K+A  +   QIGENVSRKD
Sbjct: 34   QTQMSDTSNVGEVAFLGGQCNNQPMYHDS-----------GKDAASVHKHQIGENVSRKD 82

Query: 449  KMNFLVKTLFDLKDSKEAVYSALDAWVAWEQNFPIVSLKRALITLEKEEQWHRIIQVLKW 628
            K+NFLV TL DLKDSKEAVY ALDAWVAWEQNFPI SLKR LITLEKE+QWHR+IQV+KW
Sbjct: 83   KINFLVTTLLDLKDSKEAVYGALDAWVAWEQNFPIASLKRVLITLEKEQQWHRVIQVVKW 142

Query: 629  MLSKGQGTTRGTYAQLIRALDKDRRAEEAHNLWVKKIGHDLHSVPWQLCHLMISIYYRNN 808
            MLSKGQGTT GTY QLIRALD D RAEEAH  WVKKIG DLHSVPW LCH MIS+YYRNN
Sbjct: 143  MLSKGQGTTMGTYGQLIRALDMDHRAEEAHEFWVKKIGTDLHSVPWHLCHRMISVYYRNN 202

Query: 809  MPERLVKLFKGLEAFDRQPPDKSIVQKVADAYEMLGLMEEQKRVVEKYKDLFAKSDQGNS 988
            M E LVKLFKGLEAFDR+P DK +V+KVADAYEMLGL+EE++R+ EKY  LF ++  G  
Sbjct: 203  MLENLVKLFKGLEAFDRKPQDKLVVKKVADAYEMLGLLEEKERIFEKYDYLFTETVAGKP 262

Query: 989  KKSRRASLKTEKKEKRK 1039
            KKS++   + +K  +RK
Sbjct: 263  KKSKKFLSEKKKSGRRK 279


>XP_002269673.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic isoform X1 [Vitis vinifera]
          Length = 300

 Score =  326 bits (836), Expect = e-106
 Identities = 164/257 (63%), Positives = 192/257 (74%), Gaps = 5/257 (1%)
 Frame = +2

Query: 284  QNEDSDSQKI-----VGDECNSRAFTSSEVRHGNQAKDQYSKNARDLRNFQIGENVSRKD 448
            Q + SD+  +     +G +CN++                  K+A  +   QIGENVSRKD
Sbjct: 38   QTQMSDTSNVGEVAFLGGQCNNQPMYHDS-----------GKDAASVHKHQIGENVSRKD 86

Query: 449  KMNFLVKTLFDLKDSKEAVYSALDAWVAWEQNFPIVSLKRALITLEKEEQWHRIIQVLKW 628
            K+NFLV TL DLKDSKEAVY ALDAWVAWEQNFPI SLKR LITLEKE+QWHR+IQV+KW
Sbjct: 87   KINFLVTTLLDLKDSKEAVYGALDAWVAWEQNFPIASLKRVLITLEKEQQWHRVIQVVKW 146

Query: 629  MLSKGQGTTRGTYAQLIRALDKDRRAEEAHNLWVKKIGHDLHSVPWQLCHLMISIYYRNN 808
            MLSKGQGTT GTY QLIRALD D RAEEAH  WVKKIG DLHSVPW LCH MIS+YYRNN
Sbjct: 147  MLSKGQGTTMGTYGQLIRALDMDHRAEEAHEFWVKKIGTDLHSVPWHLCHRMISVYYRNN 206

Query: 809  MPERLVKLFKGLEAFDRQPPDKSIVQKVADAYEMLGLMEEQKRVVEKYKDLFAKSDQGNS 988
            M E LVKLFKGLEAFDR+P DK +V+KVADAYEMLGL+EE++R+ EKY  LF ++  G  
Sbjct: 207  MLENLVKLFKGLEAFDRKPQDKLVVKKVADAYEMLGLLEEKERIFEKYDYLFTETVAGKP 266

Query: 989  KKSRRASLKTEKKEKRK 1039
            KKS++   + +K  +RK
Sbjct: 267  KKSKKFLSEKKKSGRRK 283


>XP_011015613.1 PREDICTED: pentatricopeptide repeat-containing protein At4g21190-like
            [Populus euphratica]
          Length = 294

 Score =  325 bits (834), Expect = e-106
 Identities = 158/246 (64%), Positives = 193/246 (78%), Gaps = 2/246 (0%)
 Frame = +2

Query: 401  RDLRNFQIGENVSRKDKMNFLVKTLFDLKDSKEAVYSALDAWVAWEQNFPIVSLKRALIT 580
            ++LR  QIG+NVS+KDK+ FL+ TL DL DSK++VY ALDAWVAWEQ FPI S+K+ LI 
Sbjct: 56   QNLRRNQIGDNVSKKDKIKFLITTLLDLNDSKDSVYGALDAWVAWEQKFPIASIKQVLIA 115

Query: 581  LEKEEQWHRIIQVLKWMLSKGQGTTRGTYAQLIRALDKDRRAEEAHNLWVKKIGHDLHSV 760
            LEKE+QWHRI+QV+KWMLSKGQGTT GTY+Q IRALD D RA+EAH  W+KKIG DLHSV
Sbjct: 116  LEKEQQWHRIVQVIKWMLSKGQGTTMGTYSQFIRALDMDHRAKEAHEFWLKKIGRDLHSV 175

Query: 761  PWQLCHLMISIYYRNNMPERLVKLFKGLEAFDRQPPDKSIVQKVADAYEMLGLMEEQKRV 940
            PWQLC+ MISIYYRNNM E L+KLFKGLEAFDRQPP+KSIVQKVADAYEMLGL+EE++RV
Sbjct: 176  PWQLCNRMISIYYRNNMLENLIKLFKGLEAFDRQPPEKSIVQKVADAYEMLGLLEEKERV 235

Query: 941  VEKYKDLFAKSDQGNSKKSRRASLKTEKKEKRKQGRPQASQDKCLVQ--DDSAIGGNDME 1114
            +EKY  +F ++ +G +KK R AS     K+ +K G+P+      L    DD  +     +
Sbjct: 236  LEKYNHIFVEAGKGRNKKLRNAS----SKKNKKSGKPKNESSDTLADAVDDKKLS----Q 287

Query: 1115 TSEEHC 1132
            +S EHC
Sbjct: 288  SSSEHC 293


>XP_012483420.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic isoform X1 [Gossypium raimondii] KJB33324.1
            hypothetical protein B456_006G006600 [Gossypium
            raimondii]
          Length = 304

 Score =  325 bits (833), Expect = e-105
 Identities = 174/305 (57%), Positives = 214/305 (70%), Gaps = 2/305 (0%)
 Frame = +2

Query: 155  MLRAVTSRLSRNILHRINIDRHLRCEIEVSRCCYCYSTGLVPF-QNEDSDSQKIVGDECN 331
            MLR    ++S +   RI     L     + R C   +  ++P  Q  ++    +V D+CN
Sbjct: 1    MLRFALQKISGHSTQRI----FLPATSTLMRGCSFATYEVIPKGQGREAHCDHVVKDQCN 56

Query: 332  SRAFTSSEVRHGNQAKDQYSK-NARDLRNFQIGENVSRKDKMNFLVKTLFDLKDSKEAVY 508
                        NQ  + Y K N    +  QIG+NVSRKDK+ FLV TL DLKDSKEA+Y
Sbjct: 57   ------------NQVANLYPKPNVGGQQKLQIGQNVSRKDKIKFLVTTLLDLKDSKEAIY 104

Query: 509  SALDAWVAWEQNFPIVSLKRALITLEKEEQWHRIIQVLKWMLSKGQGTTRGTYAQLIRAL 688
            SALDAWVAWEQNFPI  LK  ++ LEKE QWHRI+QV+KWMLSKGQG T GTY QL+RAL
Sbjct: 105  SALDAWVAWEQNFPIGPLKNVILALEKEHQWHRIVQVIKWMLSKGQGNTMGTYGQLLRAL 164

Query: 689  DKDRRAEEAHNLWVKKIGHDLHSVPWQLCHLMISIYYRNNMPERLVKLFKGLEAFDRQPP 868
            D D RA+EAH  WVKK+G DLHSVPWQLC LMIS+YYRNNM E LVKLFKGLEAF R+P 
Sbjct: 165  DMDNRADEAHQFWVKKVGADLHSVPWQLCGLMISVYYRNNMLENLVKLFKGLEAFGRKPT 224

Query: 869  DKSIVQKVADAYEMLGLMEEQKRVVEKYKDLFAKSDQGNSKKSRRASLKTEKKEKRKQGR 1048
            DKSIVQ+VADAYEMLGL+EE++RV+EKY+D+  K ++G+ KKS++ SLK  KK+   +GR
Sbjct: 225  DKSIVQRVADAYEMLGLLEEKERVLEKYEDICTKIEKGH-KKSKQTSLK--KKKDSGRGR 281

Query: 1049 PQASQ 1063
            P+  Q
Sbjct: 282  PRQRQ 286


>XP_011007631.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like [Populus euphratica]
          Length = 294

 Score =  324 bits (830), Expect = e-105
 Identities = 158/246 (64%), Positives = 192/246 (78%), Gaps = 2/246 (0%)
 Frame = +2

Query: 401  RDLRNFQIGENVSRKDKMNFLVKTLFDLKDSKEAVYSALDAWVAWEQNFPIVSLKRALIT 580
            ++LR  QIG+NVS+KDK+ FL+ TL DL DSK+AVY ALDAWVAWEQ FPI S+K+ LI 
Sbjct: 56   QNLRRNQIGDNVSKKDKIKFLITTLLDLNDSKDAVYGALDAWVAWEQKFPIASIKQVLIA 115

Query: 581  LEKEEQWHRIIQVLKWMLSKGQGTTRGTYAQLIRALDKDRRAEEAHNLWVKKIGHDLHSV 760
            LEKE+QWHRI+QV+KWMLSKGQGTT GTY+Q IRALD D RA+EAH  W+KKIG DLHSV
Sbjct: 116  LEKEQQWHRIVQVIKWMLSKGQGTTMGTYSQFIRALDMDHRAKEAHEFWLKKIGRDLHSV 175

Query: 761  PWQLCHLMISIYYRNNMPERLVKLFKGLEAFDRQPPDKSIVQKVADAYEMLGLMEEQKRV 940
            PWQLC+ MISIYYRNNM E L+KLFKGLEAFDRQPP+KSIVQKVADAYEMLGL+ E++RV
Sbjct: 176  PWQLCNRMISIYYRNNMLENLIKLFKGLEAFDRQPPEKSIVQKVADAYEMLGLLYEKERV 235

Query: 941  VEKYKDLFAKSDQGNSKKSRRASLKTEKKEKRKQGRPQASQDKCLVQ--DDSAIGGNDME 1114
            +EKY  +F ++ +G +KK R AS     K+ +K G+P+      L    DD  +     +
Sbjct: 236  LEKYNHIFVEAGKGRNKKLRNAS----SKKNKKSGKPKNESSDTLADAVDDKKLS----Q 287

Query: 1115 TSEEHC 1132
            +S EHC
Sbjct: 288  SSSEHC 293


>XP_017608435.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic isoform X1 [Gossypium arboreum]
          Length = 304

 Score =  323 bits (829), Expect = e-105
 Identities = 174/305 (57%), Positives = 213/305 (69%), Gaps = 2/305 (0%)
 Frame = +2

Query: 155  MLRAVTSRLSRNILHRINIDRHLRCEIEVSRCCYCYSTGLVPF-QNEDSDSQKIVGDECN 331
            MLR    ++S +   RI     L     + R C   +  ++P  Q  ++    +V D+CN
Sbjct: 1    MLRFALQKISGHSTQRI----FLPATSTLMRGCSFATYEVIPKGQGREAHCDHVVKDQCN 56

Query: 332  SRAFTSSEVRHGNQAKDQYSK-NARDLRNFQIGENVSRKDKMNFLVKTLFDLKDSKEAVY 508
                        NQ  + Y K N    +  QIG+NVSRKDK+ FLV TL DLKDSKEAVY
Sbjct: 57   ------------NQVANLYPKPNVGGQQKLQIGQNVSRKDKIKFLVTTLLDLKDSKEAVY 104

Query: 509  SALDAWVAWEQNFPIVSLKRALITLEKEEQWHRIIQVLKWMLSKGQGTTRGTYAQLIRAL 688
            SALDAWVAWEQNFPI  LK  ++ LEKE QWHR++QV+KWMLSKGQG T GTY QL+RAL
Sbjct: 105  SALDAWVAWEQNFPIGPLKNVILALEKEHQWHRVVQVVKWMLSKGQGNTMGTYGQLLRAL 164

Query: 689  DKDRRAEEAHNLWVKKIGHDLHSVPWQLCHLMISIYYRNNMPERLVKLFKGLEAFDRQPP 868
            D D RA+EAH  WVKK+  DLHSVPWQLC LMIS+YYRNNM E LVKLFKGLEAF R+P 
Sbjct: 165  DMDNRADEAHQFWVKKVSADLHSVPWQLCGLMISVYYRNNMLENLVKLFKGLEAFGRKPT 224

Query: 869  DKSIVQKVADAYEMLGLMEEQKRVVEKYKDLFAKSDQGNSKKSRRASLKTEKKEKRKQGR 1048
            DKSIVQ+VADAYEMLGL+EE++RV+EKYKD+  K ++G+ KKS++ SLK  KK+   +GR
Sbjct: 225  DKSIVQRVADAYEMLGLLEEKERVLEKYKDVCTKIEKGH-KKSKQTSLK--KKKDSGRGR 281

Query: 1049 PQASQ 1063
            P+  Q
Sbjct: 282  PRQRQ 286


>XP_002526313.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic [Ricinus communis] EEF36102.1 conserved
            hypothetical protein [Ricinus communis]
          Length = 300

 Score =  323 bits (828), Expect = e-105
 Identities = 155/222 (69%), Positives = 189/222 (85%)
 Frame = +2

Query: 389  SKNARDLRNFQIGENVSRKDKMNFLVKTLFDLKDSKEAVYSALDAWVAWEQNFPIVSLKR 568
            +++A  ++  QIG+NVSRK+K++FL+KTL DLKDSKEAVY ALDAWVAWE NFPI SLKR
Sbjct: 62   NQSAGGVQKNQIGKNVSRKEKIDFLLKTLLDLKDSKEAVYGALDAWVAWEHNFPIASLKR 121

Query: 569  ALITLEKEEQWHRIIQVLKWMLSKGQGTTRGTYAQLIRALDKDRRAEEAHNLWVKKIGHD 748
             LI LEKE+QWH+++QV+KWMLSKGQG T GTY QLIRALD D RA EAH  W+KKIG D
Sbjct: 122  VLILLEKEQQWHKVVQVIKWMLSKGQGNTMGTYGQLIRALDMDHRANEAHMFWLKKIGLD 181

Query: 749  LHSVPWQLCHLMISIYYRNNMPERLVKLFKGLEAFDRQPPDKSIVQKVADAYEMLGLMEE 928
            LHSVPWQLCH MIS+YYRNNM E LVKLFKGLEAFDR+PPDKSI+QKVADAYEMLG++EE
Sbjct: 182  LHSVPWQLCHRMISVYYRNNMLESLVKLFKGLEAFDRKPPDKSILQKVADAYEMLGMLEE 241

Query: 929  QKRVVEKYKDLFAKSDQGNSKKSRRASLKTEKKEKRKQGRPQ 1054
            ++RV++KYKDLF ++++G  KKS R++L  +K  +RK  + Q
Sbjct: 242  KERVLQKYKDLFKETEKGRPKKS-RSTLAKKKSGERKMHKIQ 282


>XP_018842584.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic [Juglans regia]
          Length = 273

 Score =  322 bits (825), Expect = e-104
 Identities = 155/212 (73%), Positives = 183/212 (86%)
 Frame = +2

Query: 392  KNARDLRNFQIGENVSRKDKMNFLVKTLFDLKDSKEAVYSALDAWVAWEQNFPIVSLKRA 571
            KNA D++  +IGENVSRKDK+NFLV TL D+KDSKEAVY ALDAWVAWEQNFPIVS+KRA
Sbjct: 65   KNAGDVQESRIGENVSRKDKVNFLVNTLLDIKDSKEAVYGALDAWVAWEQNFPIVSIKRA 124

Query: 572  LITLEKEEQWHRIIQVLKWMLSKGQGTTRGTYAQLIRALDKDRRAEEAHNLWVKKIGHDL 751
            L+ LEKE+QWH+++QV+KWMLSKGQGTT GTY QLIRALD D RAEEAH +W +KIG DL
Sbjct: 125  LLALEKEQQWHKVVQVIKWMLSKGQGTTMGTYGQLIRALDMDHRAEEAHKIWERKIGMDL 184

Query: 752  HSVPWQLCHLMISIYYRNNMPERLVKLFKGLEAFDRQPPDKSIVQKVADAYEMLGLMEEQ 931
            HSVPWQLC  MISIYYRNNM + LVKLFK LEAFDR+PP+KSIVQ+VADAYEMLGL+EE+
Sbjct: 185  HSVPWQLCRQMISIYYRNNMLKSLVKLFKDLEAFDRKPPEKSIVQRVADAYEMLGLLEEK 244

Query: 932  KRVVEKYKDLFAKSDQGNSKKSRRASLKTEKK 1027
            +RV+EKY DLF  ++   S+K ++A  K +KK
Sbjct: 245  ERVLEKYNDLFTGNE---SEKHKKAPSKRKKK 273


>XP_006373907.1 hypothetical protein POPTR_0016s10300g [Populus trichocarpa]
            ERP51704.1 hypothetical protein POPTR_0016s10300g
            [Populus trichocarpa]
          Length = 295

 Score =  320 bits (820), Expect = e-103
 Identities = 156/241 (64%), Positives = 187/241 (77%)
 Frame = +2

Query: 410  RNFQIGENVSRKDKMNFLVKTLFDLKDSKEAVYSALDAWVAWEQNFPIVSLKRALITLEK 589
            R  QIG+NVS+KDK+ FL+ TL DL DSK++VY ALDAWVAWEQ FPI S+K+ LI LEK
Sbjct: 59   RRNQIGDNVSKKDKIKFLITTLLDLNDSKDSVYGALDAWVAWEQKFPIASIKQVLIALEK 118

Query: 590  EEQWHRIIQVLKWMLSKGQGTTRGTYAQLIRALDKDRRAEEAHNLWVKKIGHDLHSVPWQ 769
            E+QWHRI+QV+KWMLSKGQGTT GTYAQ IRALD D RA+EAH  W+KKIG DLHSVPWQ
Sbjct: 119  EQQWHRIVQVIKWMLSKGQGTTMGTYAQFIRALDMDHRAKEAHEFWLKKIGRDLHSVPWQ 178

Query: 770  LCHLMISIYYRNNMPERLVKLFKGLEAFDRQPPDKSIVQKVADAYEMLGLMEEQKRVVEK 949
            LC+ MISIYYRNNM E L+KLFKGLEAFDRQPP+KSIVQKVAD+YEMLGL+EE++RV+EK
Sbjct: 179  LCNRMISIYYRNNMLENLIKLFKGLEAFDRQPPEKSIVQKVADSYEMLGLLEEKERVLEK 238

Query: 950  YKDLFAKSDQGNSKKSRRASLKTEKKEKRKQGRPQASQDKCLVQDDSAIGGNDMETSEEH 1129
            Y  +F ++ +G +KK R AS K  KK  + +    AS       DD  +     ++  EH
Sbjct: 239  YNHIFVEAGKGQNKKLRNASSKKNKKSGKPKNE-SASDTLADAVDDKKLS----QSLSEH 293

Query: 1130 C 1132
            C
Sbjct: 294  C 294


>XP_017984164.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic isoform X1 [Theobroma cacao]
          Length = 312

 Score =  318 bits (816), Expect = e-103
 Identities = 159/245 (64%), Positives = 192/245 (78%), Gaps = 1/245 (0%)
 Frame = +2

Query: 311  IVGDECNSRAFTSSEVRHGNQAKDQYSK-NARDLRNFQIGENVSRKDKMNFLVKTLFDLK 487
            I G    +R  ++ +   GNQA++  SK N   +   QIG+NVSRKDK+ FLV TL DLK
Sbjct: 65   ISGHFKRTRKRSTPDCEGGNQAENLSSKPNIGGILKHQIGQNVSRKDKIKFLVTTLLDLK 124

Query: 488  DSKEAVYSALDAWVAWEQNFPIVSLKRALITLEKEEQWHRIIQVLKWMLSKGQGTTRGTY 667
            D KEAVY ALDAWVAWEQNFPI  LK  ++ LEKE QWHR++QV+KWMLSKGQG T GTY
Sbjct: 125  DGKEAVYGALDAWVAWEQNFPIGPLKNVILALEKEHQWHRVVQVIKWMLSKGQGNTMGTY 184

Query: 668  AQLIRALDKDRRAEEAHNLWVKKIGHDLHSVPWQLCHLMISIYYRNNMPERLVKLFKGLE 847
             QLIRALD D RAEEAH  W+KK+  DLHSVPWQLC  MIS+YYRNNM E LVKLFKGLE
Sbjct: 185  VQLIRALDMDNRAEEAHQFWLKKVSADLHSVPWQLCRQMISVYYRNNMLENLVKLFKGLE 244

Query: 848  AFDRQPPDKSIVQKVADAYEMLGLMEEQKRVVEKYKDLFAKSDQGNSKKSRRASLKTEKK 1027
            AFDR+PP+KSIVQ+VADAYEMLGL+EE++RV+EKYKD+  K+D+ + KKS++AS K +K 
Sbjct: 245  AFDRKPPEKSIVQRVADAYEMLGLLEEKERVLEKYKDIPTKTDKVH-KKSKQASSKRKKN 303

Query: 1028 EKRKQ 1042
              R++
Sbjct: 304  SGRRK 308


>XP_007048491.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic isoform X2 [Theobroma cacao] EOX92648.1
            Uncharacterized protein TCM_046974 [Theobroma cacao]
          Length = 285

 Score =  317 bits (813), Expect = e-102
 Identities = 156/227 (68%), Positives = 185/227 (81%), Gaps = 1/227 (0%)
 Frame = +2

Query: 365  GNQAKDQYSK-NARDLRNFQIGENVSRKDKMNFLVKTLFDLKDSKEAVYSALDAWVAWEQ 541
            GNQA++  SK N   +   QIG+NVSRKDK+ FLV TL DLKD KEAVY ALDAWVAWEQ
Sbjct: 56   GNQAENLSSKPNIGGILKHQIGQNVSRKDKIKFLVTTLLDLKDGKEAVYGALDAWVAWEQ 115

Query: 542  NFPIVSLKRALITLEKEEQWHRIIQVLKWMLSKGQGTTRGTYAQLIRALDKDRRAEEAHN 721
            NFPI  LK  ++ LEKE QWHR++QV+KWMLSKGQG T GTY QLIRALD D RAEEAH 
Sbjct: 116  NFPIGPLKNVILALEKEHQWHRVVQVIKWMLSKGQGNTMGTYVQLIRALDMDNRAEEAHQ 175

Query: 722  LWVKKIGHDLHSVPWQLCHLMISIYYRNNMPERLVKLFKGLEAFDRQPPDKSIVQKVADA 901
             W+KK+  DLHSVPWQLC  MIS+YYRNNM E LVKLFKGLEAFDR+PP+KSIVQ+VADA
Sbjct: 176  FWLKKVSADLHSVPWQLCRQMISVYYRNNMLENLVKLFKGLEAFDRKPPEKSIVQRVADA 235

Query: 902  YEMLGLMEEQKRVVEKYKDLFAKSDQGNSKKSRRASLKTEKKEKRKQ 1042
            YEMLGL+EE++RV+EKYKD+  K+D+ + KKS++AS K +K   R++
Sbjct: 236  YEMLGLLEEKERVLEKYKDIPTKTDKVH-KKSKQASSKRKKNSGRRK 281


>XP_020102208.1 pentatricopeptide repeat-containing protein At4g21190 [Ananas
            comosus]
          Length = 354

 Score =  320 bits (819), Expect = e-102
 Identities = 162/300 (54%), Positives = 208/300 (69%), Gaps = 10/300 (3%)
 Frame = +2

Query: 248  CCYCYSTGLVPFQNEDSDSQKIVGDECNSRAFTSSEVRHGNQAK--DQYSKNARDLRNFQ 421
            CC  Y T  +  Q+  S  +++V D+   +  + + +     A+  +Q  ++  +    Q
Sbjct: 60   CCRSYVTSTLVLQSRSSAYREVVEDQVYGQRGSKATLPAERNAEMINQIQRSENEPVIKQ 119

Query: 422  IGENVSRKDKMNFLVKTLFDLKDSKEAVYSALDAWVAWEQNFPIVSLKRALITLEKEEQW 601
            +G+N++  DK  FLV TL DLKDSKEAVY  LDAWVAWEQ FP+ SLK+AL+ LEKEEQW
Sbjct: 120  VGKNITSTDKRRFLVNTLLDLKDSKEAVYGTLDAWVAWEQTFPLSSLKKALLVLEKEEQW 179

Query: 602  HRIIQVLKWMLSKGQGTTRGTYAQLIRALDKDRRAEEAHNLWVKKIGHDLHSVPWQLCHL 781
            H+++QV+KWMLSKGQGTT GTY QLIRAL+KD RAEEAH +W KK+ HDLHSVPW+ C L
Sbjct: 180  HKVVQVVKWMLSKGQGTTMGTYEQLIRALEKDNRAEEAHKIWEKKMSHDLHSVPWRFCDL 239

Query: 782  MISIYYRNNMPERLVKLFKGLEAFDRQPPDKSIVQKVADAYEMLGLMEEQKRVVEKYKDL 961
            M+SIYYRNNM ERLVKLFKGLEA+DR+PP KS+V+K ADAYEMLGL+EE+ RV+EKY  L
Sbjct: 240  MLSIYYRNNMLERLVKLFKGLEAYDRKPPSKSVVRKAADAYEMLGLIEEKNRVLEKYGHL 299

Query: 962  FAKSDQGNSKKSRRASLKTEK--------KEKRKQGRPQASQDKCLVQDDSAIGGNDMET 1117
            F+KS     KKSR+A    +K        K+KRK+   + S D          G +D+ET
Sbjct: 300  FSKSSDERQKKSRKARKVAQKVDDKADNSKQKRKEASDETSADS---------GPSDVET 350


>XP_015572754.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic [Ricinus communis]
          Length = 247

 Score =  315 bits (808), Expect = e-102
 Identities = 149/216 (68%), Positives = 181/216 (83%)
 Frame = +2

Query: 389  SKNARDLRNFQIGENVSRKDKMNFLVKTLFDLKDSKEAVYSALDAWVAWEQNFPIVSLKR 568
            +++A  ++  QIG+NVSRK+K++FL+KTL DLKDSKEAVY A+DAWVAWE NFPI SLKR
Sbjct: 30   NQSAGGVQKNQIGKNVSRKEKIDFLLKTLLDLKDSKEAVYGAVDAWVAWEHNFPIASLKR 89

Query: 569  ALITLEKEEQWHRIIQVLKWMLSKGQGTTRGTYAQLIRALDKDRRAEEAHNLWVKKIGHD 748
             LI LEKE+QWHR++QV+KW++SKGQG T GTY QLIRALD D RA EAH  W+KKIG D
Sbjct: 90   VLILLEKEQQWHRVVQVIKWIISKGQGNTMGTYGQLIRALDMDHRANEAHMFWLKKIGLD 149

Query: 749  LHSVPWQLCHLMISIYYRNNMPERLVKLFKGLEAFDRQPPDKSIVQKVADAYEMLGLMEE 928
            LHSVPWQLCH MIS+YYRNNM E LVKL KGLEAFD +PPDKSIVQKVADAYEMLG++EE
Sbjct: 150  LHSVPWQLCHRMISVYYRNNMLESLVKLSKGLEAFDHKPPDKSIVQKVADAYEMLGMLEE 209

Query: 929  QKRVVEKYKDLFAKSDQGNSKKSRRASLKTEKKEKR 1036
            ++RV++KYKDLF ++++G  KKSR    K +  + R
Sbjct: 210  KERVLQKYKDLFKETEKGRPKKSRSTLAKKKSGDTR 245


>GAV62919.1 hypothetical protein CFOL_v3_06441 [Cephalotus follicularis]
          Length = 276

 Score =  316 bits (809), Expect = e-102
 Identities = 156/229 (68%), Positives = 182/229 (79%), Gaps = 1/229 (0%)
 Frame = +2

Query: 344  TSSEVRHGNQAKDQYS-KNARDLRNFQIGENVSRKDKMNFLVKTLFDLKDSKEAVYSALD 520
            T+ E R+ + A  Q+  KN    +    G NVS KDK+ FL  TL +L DSKEAVY ALD
Sbjct: 46   TTPEDRYNSPATCQHEEKNVGGTQKNHTGANVSGKDKITFLTNTLLELNDSKEAVYGALD 105

Query: 521  AWVAWEQNFPIVSLKRALITLEKEEQWHRIIQVLKWMLSKGQGTTRGTYAQLIRALDKDR 700
            AWVAWEQNFPI  LK  L+ LEKE+QWHR+IQV+KWMLSKGQGTT GTY QLI+ALD D 
Sbjct: 106  AWVAWEQNFPIARLKNVLLALEKEQQWHRVIQVIKWMLSKGQGTTMGTYGQLIKALDMDH 165

Query: 701  RAEEAHNLWVKKIGHDLHSVPWQLCHLMISIYYRNNMPERLVKLFKGLEAFDRQPPDKSI 880
            R EEAH LW KKIG DLHSVPWQLC+ MISIYYRNNM E+LVKLFKGLEAFDR+PP+KSI
Sbjct: 166  RTEEAHKLWEKKIGSDLHSVPWQLCNRMISIYYRNNMLEKLVKLFKGLEAFDRKPPEKSI 225

Query: 881  VQKVADAYEMLGLMEEQKRVVEKYKDLFAKSDQGNSKKSRRASLKTEKK 1027
            VQKVA+AYEMLGL+EE+ RV+EKYKDLF ++ +GN KK  ++S K +KK
Sbjct: 226  VQKVANAYEMLGLLEEKDRVLEKYKDLFTQTGKGNLKKFGKSSSKKKKK 274

