BLASTX nr result

ID: Chrysanthemum21_contig00047872

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00047872
         (521 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_023748048.1| pentatricopeptide repeat-containing protein ...   308   e-101
gb|PLY82906.1| hypothetical protein LSAT_6X62421 [Lactuca sativa]     306   4e-98
gb|KVI01112.1| Pentatricopeptide repeat-containing protein [Cyna...   303   1e-95
ref|XP_020533355.1| pentatricopeptide repeat-containing protein ...   276   2e-89
ref|XP_020533354.1| pentatricopeptide repeat-containing protein ...   276   4e-88
ref|XP_010271319.1| PREDICTED: pentatricopeptide repeat-containi...   270   7e-87
gb|OVA15118.1| Pentatricopeptide repeat [Macleaya cordata]            264   2e-84
ref|XP_006373596.1| hypothetical protein POPTR_0016s01140g [Popu...   265   5e-84
gb|PNS97242.1| hypothetical protein POPTR_016G010100v3 [Populus ...   265   1e-83
ref|XP_021642592.1| pentatricopeptide repeat-containing protein ...   260   5e-83
ref|XP_009773982.1| PREDICTED: pentatricopeptide repeat-containi...   256   6e-83
ref|XP_015575478.1| PREDICTED: pentatricopeptide repeat-containi...   259   1e-82
gb|PPD66143.1| hypothetical protein GOBAR_DD36981 [Gossypium bar...   259   2e-82
gb|KJB67393.1| hypothetical protein B456_010G188400 [Gossypium r...   259   2e-82
ref|XP_016683464.1| PREDICTED: pentatricopeptide repeat-containi...   259   3e-82
ref|XP_012448866.1| PREDICTED: pentatricopeptide repeat-containi...   259   3e-82
gb|PPR88264.1| hypothetical protein GOBAR_AA32422 [Gossypium bar...   258   7e-82
ref|XP_017648738.1| PREDICTED: pentatricopeptide repeat-containi...   258   7e-82
ref|XP_021642588.1| pentatricopeptide repeat-containing protein ...   260   1e-81
ref|XP_015166393.1| PREDICTED: pentatricopeptide repeat-containi...   257   1e-81

>ref|XP_023748048.1| pentatricopeptide repeat-containing protein At4g16470 [Lactuca
           sativa]
          Length = 521

 Score =  308 bits (790), Expect = e-101
 Identities = 139/173 (80%), Positives = 158/173 (91%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A LE+GKRAHGV+IK  +NGN+VVNSALIDMYFKCSCPYDGHLVF KA ++N++TWTSLI
Sbjct: 254 ATLEQGKRAHGVMIKTQMNGNLVVNSALIDMYFKCSCPYDGHLVFNKASDKNIVTWTSLI 313

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQHGRVKEVLD+F RM + GF+PN VTFL VLSAC+HGG VKEGW YF SMR++YGI
Sbjct: 314 SGYGQHGRVKEVLDAFHRMINEGFRPNSVTFLVVLSACSHGGLVKEGWEYFQSMRRNYGI 373

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            PGEKHYAAMVD+LGR GRLDEA+EFVK+APCK+HPVIWGAL+QACKVYGNMD
Sbjct: 374 TPGEKHYAAMVDILGRSGRLDEAFEFVKSAPCKDHPVIWGALIQACKVYGNMD 426



 Score = 63.5 bits (153), Expect = 1e-08
 Identities = 44/170 (25%), Positives = 81/170 (47%), Gaps = 3/170 (1%)
 Frame = -3

Query: 504 GKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLISGYGQ 325
           GKR H  +I +    N  +N  L+ +Y K       H++FKK    NVI+W ++ISGY Q
Sbjct: 158 GKRIHSQMIISGFVPNEYLNIKLLILYAKSGDLVTAHILFKKLLIPNVISWNAMISGYVQ 217

Query: 324 HGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEG---WGYFNSMRKDYGIIP 154
            G  ++ L+ + +M   G  P+  TF +V  AC     +++G    G     + +  ++ 
Sbjct: 218 KGLEEQGLNLYYKMRQNGLTPDQFTFSSVFRACATLATLEQGKRAHGVMIKTQMNGNLVV 277

Query: 153 GEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNM 4
                +A++D+  +C    + +  V N    ++ V W +L+     +G +
Sbjct: 278 N----SALIDMYFKCSCPYDGH-LVFNKASDKNIVTWTSLISGYGQHGRV 322


>gb|PLY82906.1| hypothetical protein LSAT_6X62421 [Lactuca sativa]
          Length = 684

 Score =  306 bits (785), Expect = 4e-98
 Identities = 139/173 (80%), Positives = 157/173 (90%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A LE+GK+AHGV+IK  INGN+VVNSALIDMYFKCSCPYDGHLVF KA ++N++TWTSLI
Sbjct: 254 ATLEQGKQAHGVMIKTQINGNLVVNSALIDMYFKCSCPYDGHLVFNKASDKNIVTWTSLI 313

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQHGRVKEVLD+F RM + GF+PN VTFL VLSAC+HGG VKEGW YF SMR++Y I
Sbjct: 314 SGYGQHGRVKEVLDAFHRMINEGFRPNSVTFLVVLSACSHGGLVKEGWEYFQSMRRNYVI 373

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            PGEKHYAAMVD+LGR GRLDEA+EFVKNAPCK+HPVIWGAL+QACKVYGNMD
Sbjct: 374 TPGEKHYAAMVDILGRSGRLDEAFEFVKNAPCKDHPVIWGALIQACKVYGNMD 426



 Score = 63.9 bits (154), Expect = 1e-08
 Identities = 46/167 (27%), Positives = 79/167 (47%)
 Frame = -3

Query: 504 GKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLISGYGQ 325
           GKR H  +I +    N  +N  L+ +Y K       H++FKK    NVI+W ++ISGY Q
Sbjct: 158 GKRIHSQMIISGFVPNEYLNIKLLILYAKSGDLVTAHILFKKLLIPNVISWNAMISGYVQ 217

Query: 324 HGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGIIPGEK 145
            G  ++ L+ + +M   G  P+  TF +V  AC     +++G      M K   I     
Sbjct: 218 KGLEEQGLNLYYKMRQNGLTPDQFTFSSVFRACATLATLEQGKQAHGVMIKTQ-INGNLV 276

Query: 144 HYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNM 4
             +A++D+  +C    + +  V N    ++ V W +L+     +G +
Sbjct: 277 VNSALIDMYFKCSCPYDGH-LVFNKASDKNIVTWTSLISGYGQHGRV 322


>gb|KVI01112.1| Pentatricopeptide repeat-containing protein [Cynara cardunculus
           var. scolymus]
          Length = 791

 Score =  303 bits (775), Expect = 1e-95
 Identities = 141/173 (81%), Positives = 154/173 (89%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           AMLE+GKR H VLIKN I+GN+VVNSALIDMYFKCSCPYDGHLVF KA ++NV+TWTSLI
Sbjct: 258 AMLEQGKRVHAVLIKNQISGNVVVNSALIDMYFKCSCPYDGHLVFDKALDKNVVTWTSLI 317

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQHGRVKEVLD F RM   GF+PN +TFLAVLSAC+HGG V EGW YF +MR+DYGI
Sbjct: 318 SGYGQHGRVKEVLDVFHRMIDEGFRPNNITFLAVLSACSHGGLVGEGWNYFRAMRRDYGI 377

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P EKHYAAMVDLLGR GRLDEAYEFV+NAP K+HPVIWGALLQACKVYGNMD
Sbjct: 378 QPREKHYAAMVDLLGRSGRLDEAYEFVRNAPFKDHPVIWGALLQACKVYGNMD 430


>ref|XP_020533355.1| pentatricopeptide repeat-containing protein At4g16470 isoform X2
           [Jatropha curcas]
          Length = 433

 Score =  276 bits (707), Expect = 2e-89
 Identities = 129/173 (74%), Positives = 147/173 (84%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A LE GK+AHGV+IK H++ N++VNSALIDMYFKCS   DGH VF K+  RNV+TWTSLI
Sbjct: 159 ATLEHGKKAHGVMIKCHLSENVIVNSALIDMYFKCSNLSDGHKVFSKSVIRNVVTWTSLI 218

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQHGRV EVL+SF RM   GF+PNYVTFLAVLSAC+HGG + EGWGYF+SM+KDYGI
Sbjct: 219 SGYGQHGRVSEVLESFHRMKDEGFRPNYVTFLAVLSACSHGGLIDEGWGYFSSMKKDYGI 278

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  +HYAAMVDLLGR GRL EAYEFV NAPCKEH VIWGALL AC+++GNMD
Sbjct: 279 QPRGQHYAAMVDLLGRAGRLQEAYEFVLNAPCKEHSVIWGALLGACRIHGNMD 331



 Score = 58.2 bits (139), Expect = 1e-06
 Identities = 40/175 (22%), Positives = 79/175 (45%), Gaps = 6/175 (3%)
 Frame = -3

Query: 510 EEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLISGY 331
           + GKR H  ++      N  +N+ L+ +Y K       H++F K  E+++I+W ++I+GY
Sbjct: 61  KNGKRIHAQMVVVGYVSNEYLNTKLLILYAKSGDLKAMHMLFDKLMEKSLISWNAIIAGY 120

Query: 330 GQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACT------HGGFVKEGWGYFNSMRKD 169
            Q G  +  +  + +    G  P+  TF +V  AC       HG   K+  G        
Sbjct: 121 VQKGLEELGISFYYKKRETGLVPDQYTFASVFRACAALATLEHG---KKAHGVMIKCHLS 177

Query: 168 YGIIPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNM 4
             +I      +A++D+  +C  L + ++    +  + + V W +L+     +G +
Sbjct: 178 ENVIVN----SALIDMYFKCSNLSDGHKVFSKSVIR-NVVTWTSLISGYGQHGRV 227


>ref|XP_020533354.1| pentatricopeptide repeat-containing protein At4g16470 isoform X1
           [Jatropha curcas]
          Length = 535

 Score =  276 bits (707), Expect = 4e-88
 Identities = 129/173 (74%), Positives = 147/173 (84%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A LE GK+AHGV+IK H++ N++VNSALIDMYFKCS   DGH VF K+  RNV+TWTSLI
Sbjct: 261 ATLEHGKKAHGVMIKCHLSENVIVNSALIDMYFKCSNLSDGHKVFSKSVIRNVVTWTSLI 320

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQHGRV EVL+SF RM   GF+PNYVTFLAVLSAC+HGG + EGWGYF+SM+KDYGI
Sbjct: 321 SGYGQHGRVSEVLESFHRMKDEGFRPNYVTFLAVLSACSHGGLIDEGWGYFSSMKKDYGI 380

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  +HYAAMVDLLGR GRL EAYEFV NAPCKEH VIWGALL AC+++GNMD
Sbjct: 381 QPRGQHYAAMVDLLGRAGRLQEAYEFVLNAPCKEHSVIWGALLGACRIHGNMD 433



 Score = 58.2 bits (139), Expect = 1e-06
 Identities = 40/175 (22%), Positives = 79/175 (45%), Gaps = 6/175 (3%)
 Frame = -3

Query: 510 EEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLISGY 331
           + GKR H  ++      N  +N+ L+ +Y K       H++F K  E+++I+W ++I+GY
Sbjct: 163 KNGKRIHAQMVVVGYVSNEYLNTKLLILYAKSGDLKAMHMLFDKLMEKSLISWNAIIAGY 222

Query: 330 GQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACT------HGGFVKEGWGYFNSMRKD 169
            Q G  +  +  + +    G  P+  TF +V  AC       HG   K+  G        
Sbjct: 223 VQKGLEELGISFYYKKRETGLVPDQYTFASVFRACAALATLEHG---KKAHGVMIKCHLS 279

Query: 168 YGIIPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNM 4
             +I      +A++D+  +C  L + ++    +  + + V W +L+     +G +
Sbjct: 280 ENVIVN----SALIDMYFKCSNLSDGHKVFSKSVIR-NVVTWTSLISGYGQHGRV 329


>ref|XP_010271319.1| PREDICTED: pentatricopeptide repeat-containing protein At4g16470
           [Nelumbo nucifera]
          Length = 439

 Score =  270 bits (691), Expect = 7e-87
 Identities = 122/173 (70%), Positives = 144/173 (83%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A LE+GKR H V+IK+ +  N+VVNSAL DMYFKCS P+DGH VF K  ERNV+TWT+LI
Sbjct: 166 ATLEQGKRVHAVMIKSQLTENVVVNSALTDMYFKCSSPHDGHRVFDKTSERNVVTWTALI 225

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQHGRV EVLD F RM H GF+PNYVTFLAVL+AC+HGG + EGW YF SM +DYGI
Sbjct: 226 SGYGQHGRVNEVLDLFNRMLHEGFRPNYVTFLAVLTACSHGGLINEGWKYFTSMSRDYGI 285

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  KHYAAMVDLLGR GRL+EAYEFV N+PC++H VIWGALL AC+++GN++
Sbjct: 286 RPRGKHYAAMVDLLGRAGRLNEAYEFVLNSPCEDHSVIWGALLGACRIHGNLE 338



 Score = 56.2 bits (134), Expect = 5e-06
 Identities = 38/170 (22%), Positives = 78/170 (45%)
 Frame = -3

Query: 510 EEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLISGY 331
           ++G+R H  ++      +  + + L  +Y K       H++F K   R++++W ++ISGY
Sbjct: 68  KKGRRIHAQMVIVGFAPDEYLQTKLAILYAKNGDLETAHIMFDKISNRSLVSWNAMISGY 127

Query: 330 GQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGIIPG 151
            Q G  +  L+ + +M   G  P+  TF +V  AC     +++G      M K   +   
Sbjct: 128 VQKGLDETGLNLYHKMRQSGLIPDQYTFASVFRACASLATLEQGKRVHAVMIKSQ-LTEN 186

Query: 150 EKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
               +A+ D+  +C    + +  V +   + + V W AL+     +G ++
Sbjct: 187 VVVNSALTDMYFKCSSPHDGHR-VFDKTSERNVVTWTALISGYGQHGRVN 235


>gb|OVA15118.1| Pentatricopeptide repeat [Macleaya cordata]
          Length = 446

 Score =  264 bits (675), Expect = 2e-84
 Identities = 118/173 (68%), Positives = 148/173 (85%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A LE+G+R HGV++K+ I+ N+VVNSAL+DMYFKCS P+D H VF KA ERNV+TWTSLI
Sbjct: 166 ASLEQGRRIHGVMLKSKISDNVVVNSALMDMYFKCSNPHDAHKVFDKAEERNVVTWTSLI 225

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQHG+V+EVL+ F RM   GF+PNYVTFLAVLSAC+HGG + EGW YF+SM +++GI
Sbjct: 226 SGYGQHGQVQEVLELFHRMIDEGFRPNYVTFLAVLSACSHGGLISEGWKYFSSMTREFGI 285

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  KHYAAMVDLLGR GRL +AYEFVKN+PC+EH V+WGALL AC+++G+++
Sbjct: 286 RPRGKHYAAMVDLLGRAGRLQDAYEFVKNSPCEEHSVVWGALLGACRIHGDLE 338



 Score = 65.1 bits (157), Expect = 4e-09
 Identities = 43/169 (25%), Positives = 81/169 (47%)
 Frame = -3

Query: 510 EEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLISGY 331
           ++G+R HG +I      +  + + L+ +Y K       HLVF K P  ++++W S+I+GY
Sbjct: 68  KKGRRIHGHMIVLGFTPDEYLQTKLVILYSKSGDLETAHLVFDKNPNPSLVSWNSMIAGY 127

Query: 330 GQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGIIPG 151
            Q G  +  L+ + +M   G  P+  TF +V  AC     +++G      M K   I   
Sbjct: 128 VQKGLEEMGLNLYYKMRLSGLIPDQFTFASVFRACASVASLEQGRRIHGVMLKS-KISDN 186

Query: 150 EKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNM 4
               +A++D+  +C    +A++    A  + + V W +L+     +G +
Sbjct: 187 VVVNSALMDMYFKCSNPHDAHKVFDKAE-ERNVVTWTSLISGYGQHGQV 234


>ref|XP_006373596.1| hypothetical protein POPTR_0016s01140g [Populus trichocarpa]
          Length = 509

 Score =  265 bits (678), Expect = 5e-84
 Identities = 125/173 (72%), Positives = 143/173 (82%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A LE GKRAH V++K  +  N+VV+SAL+DMYFKCS   DGHLVF K+  RNV+TWTSLI
Sbjct: 235 ATLEHGKRAHCVMMKCFLKENVVVSSALMDMYFKCSSLSDGHLVFDKSSNRNVVTWTSLI 294

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYG HGRV EV++SF RM   GF+PNYVTFLAVLSAC+HGG V EGW YF+SMR+DYGI
Sbjct: 295 SGYGHHGRVSEVIESFHRMKDEGFQPNYVTFLAVLSACSHGGLVDEGWAYFSSMRRDYGI 354

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  KHYAAMVDLLGR GRL EAYEFV NAPCKEH V+WGALL ACK++G+MD
Sbjct: 355 QPRGKHYAAMVDLLGRAGRLKEAYEFVVNAPCKEHSVLWGALLGACKIHGDMD 407



 Score = 56.6 bits (135), Expect = 4e-06
 Identities = 41/171 (23%), Positives = 77/171 (45%)
 Frame = -3

Query: 516 MLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLIS 337
           +  +GKR H  ++      N  + + L+ +Y K       HL+F    E+++I+W +LI+
Sbjct: 135 LYNKGKRIHAQMVVVGYVPNEYLKTKLMILYAKSGDLKTMHLLFDMLMEKSLISWNALIA 194

Query: 336 GYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGII 157
           GY Q G  +  L  +  M   G  P+  TF +V  AC     ++ G      M K + + 
Sbjct: 195 GYVQKGLEEMGLSFYYEMRQNGLTPDQYTFASVFRACATLATLEHGKRAHCVMMKCF-LK 253

Query: 156 PGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNM 4
                 +A++D+  +C  L + +  V +     + V W +L+     +G +
Sbjct: 254 ENVVVSSALMDMYFKCSSLSDGH-LVFDKSSNRNVVTWTSLISGYGHHGRV 303


>gb|PNS97242.1| hypothetical protein POPTR_016G010100v3 [Populus trichocarpa]
          Length = 538

 Score =  265 bits (678), Expect = 1e-83
 Identities = 125/173 (72%), Positives = 143/173 (82%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A LE GKRAH V++K  +  N+VV+SAL+DMYFKCS   DGHLVF K+  RNV+TWTSLI
Sbjct: 264 ATLEHGKRAHCVMMKCFLKENVVVSSALMDMYFKCSSLSDGHLVFDKSSNRNVVTWTSLI 323

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYG HGRV EV++SF RM   GF+PNYVTFLAVLSAC+HGG V EGW YF+SMR+DYGI
Sbjct: 324 SGYGHHGRVSEVIESFHRMKDEGFQPNYVTFLAVLSACSHGGLVDEGWAYFSSMRRDYGI 383

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  KHYAAMVDLLGR GRL EAYEFV NAPCKEH V+WGALL ACK++G+MD
Sbjct: 384 QPRGKHYAAMVDLLGRAGRLKEAYEFVVNAPCKEHSVLWGALLGACKIHGDMD 436



 Score = 56.6 bits (135), Expect = 4e-06
 Identities = 41/171 (23%), Positives = 77/171 (45%)
 Frame = -3

Query: 516 MLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLIS 337
           +  +GKR H  ++      N  + + L+ +Y K       HL+F    E+++I+W +LI+
Sbjct: 164 LYNKGKRIHAQMVVVGYVPNEYLKTKLMILYAKSGDLKTMHLLFDMLMEKSLISWNALIA 223

Query: 336 GYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGII 157
           GY Q G  +  L  +  M   G  P+  TF +V  AC     ++ G      M K + + 
Sbjct: 224 GYVQKGLEEMGLSFYYEMRQNGLTPDQYTFASVFRACATLATLEHGKRAHCVMMKCF-LK 282

Query: 156 PGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNM 4
                 +A++D+  +C  L + +  V +     + V W +L+     +G +
Sbjct: 283 ENVVVSSALMDMYFKCSSLSDGH-LVFDKSSNRNVVTWTSLISGYGHHGRV 332


>ref|XP_021642592.1| pentatricopeptide repeat-containing protein At4g16470 isoform X2
           [Hevea brasiliensis]
          Length = 422

 Score =  260 bits (664), Expect = 5e-83
 Identities = 122/173 (70%), Positives = 141/173 (81%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A LE GK+AHGV+IK  +  N+VVNSALIDMYFKCS   DGH VF K+  RNV+TWTSLI
Sbjct: 148 ATLEHGKKAHGVMIKCRLRENVVVNSALIDMYFKCSNLSDGHKVFSKSLNRNVVTWTSLI 207

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQHGRV EVL+SF RM   GF PNYVTFLAVLSAC+HGG + E W YF+SM++DYGI
Sbjct: 208 SGYGQHGRVAEVLESFHRMKDEGFGPNYVTFLAVLSACSHGGLIDEAWDYFSSMKRDYGI 267

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  +HYAAMVDLLGR GRL EAYEFV  APC+EH V+WGALL AC+++G+MD
Sbjct: 268 QPRGQHYAAMVDLLGRAGRLQEAYEFVLEAPCQEHSVVWGALLGACRIHGDMD 320



 Score = 57.8 bits (138), Expect = 1e-06
 Identities = 40/174 (22%), Positives = 80/174 (45%), Gaps = 6/174 (3%)
 Frame = -3

Query: 507 EGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLISGYG 328
           +GKR H  ++      N  +N+ L+ +Y K       +++F     +++I+W ++I+GY 
Sbjct: 51  KGKRIHAQMVVVGYVANEYLNTKLLILYAKSGDLKAANVLFDMVVGKSLISWNAIIAGYV 110

Query: 327 QHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACT------HGGFVKEGWGYFNSMRKDY 166
           Q G+ +  L  + +M   G  P+  TF +V  AC       HG   K+  G     R   
Sbjct: 111 QKGQEEIGLTFYYKMRENGLTPDQYTFASVFRACATLATLEHG---KKAHGVMIKCRLRE 167

Query: 165 GIIPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNM 4
            ++      +A++D+  +C  L + ++ V +     + V W +L+     +G +
Sbjct: 168 NVVVN----SALIDMYFKCSNLSDGHK-VFSKSLNRNVVTWTSLISGYGQHGRV 216


>ref|XP_009773982.1| PREDICTED: pentatricopeptide repeat-containing protein At4g16470
           isoform X2 [Nicotiana sylvestris]
          Length = 320

 Score =  256 bits (655), Expect = 6e-83
 Identities = 118/173 (68%), Positives = 149/173 (86%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A+LE+GK+AH +LIK+ I+GNIVVNSAL+DMYFKCSCP DG+LVF K+ ERNVITWT+LI
Sbjct: 47  AVLEQGKQAHALLIKSQISGNIVVNSALMDMYFKCSCPSDGYLVFCKSLERNVITWTALI 106

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQ+GR+K+VL+SF RM   GF+PN++TFLAVLSAC+HGG V  G  YF+ M +DYG+
Sbjct: 107 SGYGQNGRIKDVLESFHRMIDEGFRPNHITFLAVLSACSHGGLVDRGKEYFSLMMRDYGL 166

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  KHYAA+VDLLGR GRL EA+EFV+N+ C EHPV+WGALL ACK++G+++
Sbjct: 167 RPRGKHYAAIVDLLGRAGRLQEAHEFVQNSRCGEHPVLWGALLGACKIHGDIE 219


>ref|XP_015575478.1| PREDICTED: pentatricopeptide repeat-containing protein At4g16470,
           partial [Ricinus communis]
          Length = 439

 Score =  259 bits (663), Expect = 1e-82
 Identities = 120/173 (69%), Positives = 141/173 (81%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A L+ GK+AHGV+IK ++  N+VVNSALIDMYFKCS   DGH  F K+  RN++TWTSLI
Sbjct: 165 ATLQHGKKAHGVMIKCNLRENVVVNSALIDMYFKCSSLTDGHKAFNKSVNRNIVTWTSLI 224

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQHGRV EVL+SF RM   GF+PNYVTFLA L AC+HGG + EGW YF SM+++YGI
Sbjct: 225 SGYGQHGRVTEVLESFHRMKDEGFRPNYVTFLAALCACSHGGLIDEGWDYFLSMKRNYGI 284

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  +HYAAMVDLLGR GRL EAYEFV NAPCKEH VIWGALL AC+++G+MD
Sbjct: 285 QPRGQHYAAMVDLLGRAGRLQEAYEFVLNAPCKEHSVIWGALLGACRIHGDMD 337


>gb|PPD66143.1| hypothetical protein GOBAR_DD36981 [Gossypium barbadense]
          Length = 451

 Score =  259 bits (662), Expect = 2e-82
 Identities = 121/173 (69%), Positives = 143/173 (82%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A LE GKRAHGVLIK+HI  N+VV+SAL+DMYFKCS   D H VF +   RNV+TWTSLI
Sbjct: 177 ASLEHGKRAHGVLIKSHIRENVVVSSALMDMYFKCSSLTDAHRVFNEVVNRNVVTWTSLI 236

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQHGRV EVL+SF +M + GF+PNYVTFLAVLSAC+HGG V EGW YF SM++DYGI
Sbjct: 237 SGYGQHGRVNEVLESFDKMINEGFRPNYVTFLAVLSACSHGGLVNEGWHYFLSMKRDYGI 296

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  +HY+AMVDLLGR G+L EAYEFV N+P KEHP IWGALL AC+++G++D
Sbjct: 297 QPRGQHYSAMVDLLGRSGKLHEAYEFVLNSPFKEHPAIWGALLGACRIHGDLD 349


>gb|KJB67393.1| hypothetical protein B456_010G188400 [Gossypium raimondii]
          Length = 451

 Score =  259 bits (662), Expect = 2e-82
 Identities = 122/173 (70%), Positives = 142/173 (82%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A LE GKRAHGVLIK+HI  N+VV+SAL+DMYFKCS   D H VF +   RNV TWTSLI
Sbjct: 177 ASLEHGKRAHGVLIKSHIRENVVVSSALMDMYFKCSSLTDAHRVFNEVVNRNVFTWTSLI 236

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQHGRV EVL+SF +M + GF+PNYVTFLAVLSAC+HGG V EGW YF SM++DYGI
Sbjct: 237 SGYGQHGRVNEVLESFDKMINEGFRPNYVTFLAVLSACSHGGLVNEGWHYFLSMKRDYGI 296

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  +HY+AMVDLLGR G+L EAYEFV N+P KEHP IWGALL AC+++G+MD
Sbjct: 297 QPRGQHYSAMVDLLGRSGKLHEAYEFVLNSPFKEHPAIWGALLGACRIHGDMD 349


>ref|XP_016683464.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g16470-like, partial [Gossypium hirsutum]
          Length = 462

 Score =  259 bits (662), Expect = 3e-82
 Identities = 121/173 (69%), Positives = 143/173 (82%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A LE GKRAHGVLIK+HI  N+VV+SAL+DMYFKCS   D H VF +   RNV+TWTSLI
Sbjct: 188 ASLEHGKRAHGVLIKSHIRENVVVSSALMDMYFKCSSLTDAHRVFNEVVNRNVVTWTSLI 247

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQHGRV EVL+SF +M + GF+PNYVTFLAVLSAC+HGG V EGW YF SM++DYGI
Sbjct: 248 SGYGQHGRVNEVLESFDKMINEGFRPNYVTFLAVLSACSHGGLVNEGWHYFLSMKRDYGI 307

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  +HY+AMVDLLGR G+L EAYEFV N+P KEHP IWGALL AC+++G++D
Sbjct: 308 QPRGQHYSAMVDLLGRSGKLHEAYEFVLNSPFKEHPAIWGALLGACRIHGDLD 360


>ref|XP_012448866.1| PREDICTED: pentatricopeptide repeat-containing protein At4g16470,
           partial [Gossypium raimondii]
          Length = 462

 Score =  259 bits (662), Expect = 3e-82
 Identities = 122/173 (70%), Positives = 142/173 (82%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A LE GKRAHGVLIK+HI  N+VV+SAL+DMYFKCS   D H VF +   RNV TWTSLI
Sbjct: 188 ASLEHGKRAHGVLIKSHIRENVVVSSALMDMYFKCSSLTDAHRVFNEVVNRNVFTWTSLI 247

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQHGRV EVL+SF +M + GF+PNYVTFLAVLSAC+HGG V EGW YF SM++DYGI
Sbjct: 248 SGYGQHGRVNEVLESFDKMINEGFRPNYVTFLAVLSACSHGGLVNEGWHYFLSMKRDYGI 307

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  +HY+AMVDLLGR G+L EAYEFV N+P KEHP IWGALL AC+++G+MD
Sbjct: 308 QPRGQHYSAMVDLLGRSGKLHEAYEFVLNSPFKEHPAIWGALLGACRIHGDMD 360


>gb|PPR88264.1| hypothetical protein GOBAR_AA32422 [Gossypium barbadense]
          Length = 451

 Score =  258 bits (659), Expect = 7e-82
 Identities = 121/173 (69%), Positives = 142/173 (82%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A LE GKRAHGVLIK+HI  N+VV+SAL+DMYFKCS   D H VF +   RNV+TWTSLI
Sbjct: 177 ASLEHGKRAHGVLIKSHIRENVVVSSALMDMYFKCSSLTDAHQVFNEVVNRNVVTWTSLI 236

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQHGRV EVL+ F +M + GF+PNYVTFLAVLSAC+HGG V EGW YF SM++DYGI
Sbjct: 237 SGYGQHGRVNEVLELFDKMINEGFRPNYVTFLAVLSACSHGGLVNEGWHYFLSMKRDYGI 296

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  +HY+AMVDLLGR G+L EAYEFV N+P KEHP IWGALL AC+++G+MD
Sbjct: 297 QPRGQHYSAMVDLLGRSGKLHEAYEFVLNSPFKEHPAIWGALLGACRIHGDMD 349


>ref|XP_017648738.1| PREDICTED: pentatricopeptide repeat-containing protein At4g16470
           [Gossypium arboreum]
          Length = 451

 Score =  258 bits (659), Expect = 7e-82
 Identities = 121/173 (69%), Positives = 142/173 (82%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A LE GKRAHGVLIK+HI  N+VV+SAL+DMYFKCS   D H VF +   RNV+TWTSLI
Sbjct: 177 ASLEHGKRAHGVLIKSHIRENVVVSSALMDMYFKCSSLTDAHQVFNEVVNRNVVTWTSLI 236

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQHGRV EVL+ F +M + GF+PNYVTFLAVLSAC+HGG V EGW YF SM++DYGI
Sbjct: 237 SGYGQHGRVNEVLELFDKMINEGFRPNYVTFLAVLSACSHGGLVNEGWHYFLSMKRDYGI 296

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  +HY+AMVDLLGR G+L EAYEFV N+P KEHP IWGALL AC+++G+MD
Sbjct: 297 QPRGQHYSAMVDLLGRSGKLHEAYEFVLNSPFKEHPAIWGALLGACRIHGDMD 349


>ref|XP_021642588.1| pentatricopeptide repeat-containing protein At4g16470 isoform X1
           [Hevea brasiliensis]
 ref|XP_021642590.1| pentatricopeptide repeat-containing protein At4g16470 isoform X1
           [Hevea brasiliensis]
 ref|XP_021642591.1| pentatricopeptide repeat-containing protein At4g16470 isoform X1
           [Hevea brasiliensis]
          Length = 535

 Score =  260 bits (664), Expect = 1e-81
 Identities = 122/173 (70%), Positives = 141/173 (81%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A LE GK+AHGV+IK  +  N+VVNSALIDMYFKCS   DGH VF K+  RNV+TWTSLI
Sbjct: 261 ATLEHGKKAHGVMIKCRLRENVVVNSALIDMYFKCSNLSDGHKVFSKSLNRNVVTWTSLI 320

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQHGRV EVL+SF RM   GF PNYVTFLAVLSAC+HGG + E W YF+SM++DYGI
Sbjct: 321 SGYGQHGRVAEVLESFHRMKDEGFGPNYVTFLAVLSACSHGGLIDEAWDYFSSMKRDYGI 380

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  +HYAAMVDLLGR GRL EAYEFV  APC+EH V+WGALL AC+++G+MD
Sbjct: 381 QPRGQHYAAMVDLLGRAGRLQEAYEFVLEAPCQEHSVVWGALLGACRIHGDMD 433



 Score = 57.8 bits (138), Expect = 1e-06
 Identities = 40/174 (22%), Positives = 80/174 (45%), Gaps = 6/174 (3%)
 Frame = -3

Query: 507 EGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLISGYG 328
           +GKR H  ++      N  +N+ L+ +Y K       +++F     +++I+W ++I+GY 
Sbjct: 164 KGKRIHAQMVVVGYVANEYLNTKLLILYAKSGDLKAANVLFDMVVGKSLISWNAIIAGYV 223

Query: 327 QHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACT------HGGFVKEGWGYFNSMRKDY 166
           Q G+ +  L  + +M   G  P+  TF +V  AC       HG   K+  G     R   
Sbjct: 224 QKGQEEIGLTFYYKMRENGLTPDQYTFASVFRACATLATLEHG---KKAHGVMIKCRLRE 280

Query: 165 GIIPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNM 4
            ++      +A++D+  +C  L + ++ V +     + V W +L+     +G +
Sbjct: 281 NVVVN----SALIDMYFKCSNLSDGHK-VFSKSLNRNVVTWTSLISGYGQHGRV 329


>ref|XP_015166393.1| PREDICTED: pentatricopeptide repeat-containing protein At4g16470
           [Solanum tuberosum]
          Length = 436

 Score =  257 bits (656), Expect = 1e-81
 Identities = 118/173 (68%), Positives = 150/173 (86%)
 Frame = -3

Query: 519 AMLEEGKRAHGVLIKNHINGNIVVNSALIDMYFKCSCPYDGHLVFKKAPERNVITWTSLI 340
           A+LE+GK+AH +LIK+ I+GNIVVNSAL+DMYFKCS P DG+LVF K+ ERNVITWT+LI
Sbjct: 163 AVLEQGKQAHALLIKSQISGNIVVNSALMDMYFKCSSPSDGYLVFSKSLERNVITWTALI 222

Query: 339 SGYGQHGRVKEVLDSFRRMTHGGFKPNYVTFLAVLSACTHGGFVKEGWGYFNSMRKDYGI 160
           SGYGQ+GR+K+VL+SF RM   G++PN+VTFLAVLSAC+HGG V  G  YF+SM +DYG+
Sbjct: 223 SGYGQNGRIKDVLESFHRMIDEGYRPNHVTFLAVLSACSHGGLVDRGKEYFSSMMRDYGL 282

Query: 159 IPGEKHYAAMVDLLGRCGRLDEAYEFVKNAPCKEHPVIWGALLQACKVYGNMD 1
            P  KHYAA+VDLLGR GRL EA+EFVKN+ C+EHPV+WG+LL ACK++G+++
Sbjct: 283 QPRGKHYAAIVDLLGRAGRLQEAHEFVKNSRCEEHPVLWGSLLGACKIHGDIE 335

