BLASTX nr result

ID: Akebia23_contig00043532 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00043532
         (683 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007203772.1| hypothetical protein PRUPE_ppa002597mg [Prun...   360   2e-97
ref|XP_002268784.1| PREDICTED: pentatricopeptide repeat-containi...   354   1e-95
ref|XP_004169636.1| PREDICTED: pentatricopeptide repeat-containi...   349   4e-94
ref|XP_004146494.1| PREDICTED: pentatricopeptide repeat-containi...   349   4e-94
ref|XP_006466854.1| PREDICTED: pentatricopeptide repeat-containi...   348   1e-93
ref|XP_006425612.1| hypothetical protein CICLE_v10025108mg [Citr...   346   5e-93
ref|XP_006383156.1| hypothetical protein POPTR_0005s12100g [Popu...   342   7e-92
ref|XP_007046822.1| Pentatricopeptide repeat (PPR) superfamily p...   337   2e-90
ref|XP_004233728.1| PREDICTED: pentatricopeptide repeat-containi...   335   6e-90
ref|XP_006340666.1| PREDICTED: pentatricopeptide repeat-containi...   335   8e-90
ref|XP_003524199.1| PREDICTED: pentatricopeptide repeat-containi...   333   2e-89
ref|XP_003629742.1| Pentatricopeptide repeat-containing protein ...   333   2e-89
ref|XP_007159438.1| hypothetical protein PHAVU_002G237800g [Phas...   330   3e-88
ref|XP_004289114.1| PREDICTED: pentatricopeptide repeat-containi...   325   6e-87
gb|EXC24885.1| hypothetical protein L484_013254 [Morus notabilis]     314   2e-83
ref|XP_006403127.1| hypothetical protein EUTSA_v10003396mg [Eutr...   312   7e-83
ref|XP_006280148.1| hypothetical protein CARUB_v10026047mg [Caps...   311   2e-82
ref|XP_006280147.1| hypothetical protein CARUB_v10026047mg [Caps...   311   2e-82
ref|XP_002865376.1| pentatricopeptide repeat-containing protein ...   311   2e-82
ref|NP_199236.1| pentatricopeptide repeat-containing protein [Ar...   305   1e-80

>ref|XP_007203772.1| hypothetical protein PRUPE_ppa002597mg [Prunus persica]
           gi|462399303|gb|EMJ04971.1| hypothetical protein
           PRUPE_ppa002597mg [Prunus persica]
          Length = 654

 Score =  360 bits (924), Expect = 2e-97
 Identities = 170/227 (74%), Positives = 199/227 (87%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY QNA+P +AL  FE M+ AGV TDE+TLVG +SACAQLGA KYA W+RD+AEK G  P
Sbjct: 253 GYAQNARPRDALDCFERMQGAGVGTDEITLVGLISACAQLGASKYANWVRDIAEKSGFGP 312

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
             NV++GSALIDMYSKCGS++EA++VF+GMKERNVFSYS+MI+GFAMHGRA+AA++LF+E
Sbjct: 313 TENVLVGSALIDMYSKCGSLDEAYKVFQGMKERNVFSYSSMILGFAMHGRANAAIELFHE 372

Query: 323 MVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRAG 144
           M+ TE+RPN VTFIGVLTACSH GMV QG+++FA+M K Y V PSADHY CMVDLLGRAG
Sbjct: 373 MLTTEIRPNRVTFIGVLTACSHAGMVDQGRQLFATMEKYYNVVPSADHYTCMVDLLGRAG 432

Query: 143 HLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
            LEEALELV+TMP+  HGGVWGALLGAC IHGNPDIA+IAANHLFEL
Sbjct: 433 RLEEALELVETMPIAAHGGVWGALLGACHIHGNPDIAQIAANHLFEL 479



 Score = 64.7 bits (156), Expect = 3e-08
 Identities = 52/224 (23%), Positives = 91/224 (40%), Gaps = 32/224 (14%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY       EAL  +  MR AG      T      AC  +  V     I       G   
Sbjct: 120 GYTVQGPISEALNFYTCMRSAGTGPVSFTFSALFKACGDVLDVNLGRQIHAQTILVG-GF 178

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYS--------------------- 387
           A ++ +G+ +IDMY KCG ++   +VF  M +R+V S++                     
Sbjct: 179 AADLYVGNTMIDMYVKCGFLDCGRKVFDEMPDRDVVSWTELIVAYTKIGDMGSARELFEG 238

Query: 386 ----------AMIVGFAMHGRADAAMQLFYEMVETEVRPNWVTFIGVLTACSHVGMVQQG 237
                     AM+ G+A + R   A+  F  M    V  + +T +G+++AC+ +G  +  
Sbjct: 239 LPVKDMVAWTAMVTGYAQNARPRDALDCFERMQGAGVGTDEITLVGLISACAQLGASKYA 298

Query: 236 QEIFASMHKD-YGVAPSADHYACMVDLLGRAGHLEEALELVKTM 108
             +     K  +G   +    + ++D+  + G L+EA ++ + M
Sbjct: 299 NWVRDIAEKSGFGPTENVLVGSALIDMYSKCGSLDEAYKVFQGM 342


>ref|XP_002268784.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g44230-like [Vitis vinifera]
          Length = 647

 Score =  354 bits (909), Expect = 1e-95
 Identities = 169/227 (74%), Positives = 199/227 (87%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY QNA+P EAL +FE M+ AGV TDEVTLVG +SACAQLGA KYA W+RDVAE+ G  P
Sbjct: 246 GYAQNARPREALEVFERMQAAGVKTDEVTLVGVISACAQLGAAKYANWVRDVAEQSGFGP 305

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
            +NVV+GSALIDMY+KCGSVE+A++VF+ M+ERNV+SYS+MIVGFAMHG A AAM+LF E
Sbjct: 306 TSNVVVGSALIDMYAKCGSVEDAYKVFERMEERNVYSYSSMIVGFAMHGLAGAAMELFDE 365

Query: 323 MVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRAG 144
           M++TE++PN VTFIGVLTACSH GMV+QGQ++FA M + +GVAPS DHYACMVDLLGRAG
Sbjct: 366 MLKTEIKPNRVTFIGVLTACSHAGMVEQGQQLFAMMEECHGVAPSEDHYACMVDLLGRAG 425

Query: 143 HLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
            LEEAL LVK MP+ PHGGVWGALLGAC IHGNPD+A+IAA+HLFEL
Sbjct: 426 RLEEALNLVKMMPMNPHGGVWGALLGACRIHGNPDMAQIAASHLFEL 472



 Score = 72.0 bits (175), Expect = 2e-10
 Identities = 52/225 (23%), Positives = 100/225 (44%), Gaps = 33/225 (14%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY       E++ L+ SMR  G+     T    + AC+    V     +       G   
Sbjct: 113 GYALQGPFMESVLLYNSMRRQGIGPVSFTFTALLKACSAALDVNLGRQVHTQTILIG-GF 171

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYS--------------------- 387
            +++ +G+ LIDMY KCG +   H+VF  M +R+V S++                     
Sbjct: 172 GSDLYVGNTLIDMYVKCGCLGCGHRVFDEMLDRDVISWTSLIVAYAKVGNMEAASELFDG 231

Query: 386 ----------AMIVGFAMHGRADAAMQLFYEMVETEVRPNWVTFIGVLTACSHVGMVQQG 237
                     AM+ G+A + R   A+++F  M    V+ + VT +GV++AC+ +G  +  
Sbjct: 232 LPMKDMVAWTAMVTGYAQNARPREALEVFERMQAAGVKTDEVTLVGVISACAQLGAAKYA 291

Query: 236 QEIFASMHKDYGVAPSADHY--ACMVDLLGRAGHLEEALELVKTM 108
             +   + +  G  P+++    + ++D+  + G +E+A ++ + M
Sbjct: 292 NWV-RDVAEQSGFGPTSNVVVGSALIDMYAKCGSVEDAYKVFERM 335


>ref|XP_004169636.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g44230-like [Cucumis sativus]
          Length = 650

 Score =  349 bits (896), Expect = 4e-94
 Identities = 163/227 (71%), Positives = 200/227 (88%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY QN +P EAL  F+ M+D G+ TDEVTL G +SACAQLGAVK+A WIRD+AE+ G  P
Sbjct: 249 GYAQNGRPKEALEYFQKMQDVGMETDEVTLAGVISACAQLGAVKHANWIRDIAERSGFGP 308

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
           + NVV+GSALIDMYSKCGS +EA++VF+ MKERNVFSYS+MI+G+AMHGRA +A+QLF++
Sbjct: 309 SGNVVVGSALIDMYSKCGSPDEAYKVFEVMKERNVFSYSSMILGYAMHGRAHSALQLFHD 368

Query: 323 MVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRAG 144
           M++TE+RPN VTFIG+L+ACSH G+V+QG+++FA M K +GVAPS DHYACMVDLLGRAG
Sbjct: 369 MLKTEIRPNKVTFIGILSACSHAGLVEQGRQLFAKMEKFFGVAPSPDHYACMVDLLGRAG 428

Query: 143 HLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
            LEEAL+LVKTMP+EP+GGVWGALLGAC IHGNPDIA+IAAN LF+L
Sbjct: 429 CLEEALDLVKTMPMEPNGGVWGALLGACRIHGNPDIAQIAANELFKL 475



 Score = 65.9 bits (159), Expect = 1e-08
 Identities = 36/132 (27%), Positives = 74/132 (56%), Gaps = 2/132 (1%)
 Frame = -1

Query: 497 NVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYEMV 318
           +VV  + LI  Y+K G +E A  +F  +  +++ +++AM+ G+A +GR   A++ F +M 
Sbjct: 208 DVVSWTELIVAYAKYGDMESASGLFDDLPSKDMVAWTAMVTGYAQNGRPKEALEYFQKMQ 267

Query: 317 ETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHY--ACMVDLLGRAG 144
           +  +  + VT  GV++AC+ +G V+    I   + +  G  PS +    + ++D+  + G
Sbjct: 268 DVGMETDEVTLAGVISACAQLGAVKHANWI-RDIAERSGFGPSGNVVVGSALIDMYSKCG 326

Query: 143 HLEEALELVKTM 108
             +EA ++ + M
Sbjct: 327 SPDEAYKVFEVM 338


>ref|XP_004146494.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g44230-like [Cucumis sativus]
          Length = 650

 Score =  349 bits (896), Expect = 4e-94
 Identities = 163/227 (71%), Positives = 200/227 (88%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY QN +P EAL  F+ M+D G+ TDEVTL G +SACAQLGAVK+A WIRD+AE+ G  P
Sbjct: 249 GYAQNGRPKEALEYFQKMQDVGMETDEVTLAGVISACAQLGAVKHANWIRDIAERSGFGP 308

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
           + NVV+GSALIDMYSKCGS +EA++VF+ MKERNVFSYS+MI+G+AMHGRA +A+QLF++
Sbjct: 309 SGNVVVGSALIDMYSKCGSPDEAYKVFEVMKERNVFSYSSMILGYAMHGRAHSALQLFHD 368

Query: 323 MVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRAG 144
           M++TE+RPN VTFIG+L+ACSH G+V+QG+++FA M K +GVAPS DHYACMVDLLGRAG
Sbjct: 369 MLKTEIRPNKVTFIGILSACSHAGLVEQGRQLFAKMEKFFGVAPSPDHYACMVDLLGRAG 428

Query: 143 HLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
            LEEAL+LVKTMP+EP+GGVWGALLGAC IHGNPDIA+IAAN LF+L
Sbjct: 429 CLEEALDLVKTMPMEPNGGVWGALLGACRIHGNPDIAQIAANELFKL 475



 Score = 65.1 bits (157), Expect = 2e-08
 Identities = 56/227 (24%), Positives = 99/227 (43%), Gaps = 35/227 (15%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY       E+   +  MR  GV     T      AC   GA       + V  +  L  
Sbjct: 116 GYALQGLLSESTNFYTRMRRDGVGPVSFTFSALFKAC---GAALNMDLGKQVHAQTILIG 172

Query: 503 --ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYS------------------- 387
             A+++ +G+++ID+Y KCG +  A +VF  M ER+V S++                   
Sbjct: 173 GFASDLYVGNSMIDLYVKCGFLGCARKVFDEMSERDVVSWTELIVAYAKYGDMESASGLF 232

Query: 386 ------------AMIVGFAMHGRADAAMQLFYEMVETEVRPNWVTFIGVLTACSHVGMVQ 243
                       AM+ G+A +GR   A++ F +M +  +  + VT  GV++AC+ +G V+
Sbjct: 233 DDLPLKDMVAWTAMVTGYAQNGRPKEALEYFQKMQDVGMETDEVTLAGVISACAQLGAVK 292

Query: 242 QGQEIFASMHKDYGVAPSADHY--ACMVDLLGRAGHLEEALELVKTM 108
               I   + +  G  PS +    + ++D+  + G  +EA ++ + M
Sbjct: 293 HANWI-RDIAERSGFGPSGNVVVGSALIDMYSKCGSPDEAYKVFEVM 338


>ref|XP_006466854.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g44230-like isoform X1 [Citrus sinensis]
           gi|568824952|ref|XP_006466855.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At5g44230-like isoform X2 [Citrus sinensis]
           gi|568824954|ref|XP_006466856.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At5g44230-like isoform X3 [Citrus sinensis]
           gi|568824956|ref|XP_006466857.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At5g44230-like isoform X4 [Citrus sinensis]
          Length = 653

 Score =  348 bits (892), Expect = 1e-93
 Identities = 165/227 (72%), Positives = 198/227 (87%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GYVQNAKP EA+  FE M+ AGV TD VTLVG +SACAQLG +KYA W+ ++AE  G  P
Sbjct: 252 GYVQNAKPREAIEYFERMQYAGVETDYVTLVGVISACAQLGVIKYANWVCEIAEGSGFGP 311

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
             NVV+GSALIDMYSKCGS+++A+++F GMK+RNVFSYS+MI+GFAMHGRA AA+QLF +
Sbjct: 312 INNVVVGSALIDMYSKCGSIDDAYRIFVGMKQRNVFSYSSMILGFAMHGRAHAAIQLFGD 371

Query: 323 MVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRAG 144
           MV+TE +PN VTFIGVLTACSHVG+V+QG+++FASM K YGV+PS DHYACMVDLLGRAG
Sbjct: 372 MVKTETKPNGVTFIGVLTACSHVGLVEQGRKLFASMEKCYGVSPSTDHYACMVDLLGRAG 431

Query: 143 HLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
            LEEAL++V+ MPVEP+GGVWGALLGAC IH NP+IA+IAANHLFEL
Sbjct: 432 CLEEALKMVEKMPVEPNGGVWGALLGACQIHRNPEIAQIAANHLFEL 478



 Score = 62.4 bits (150), Expect = 1e-07
 Identities = 49/220 (22%), Positives = 97/220 (44%), Gaps = 32/220 (14%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY+      +++ L+ SMR  G+     TL     AC ++  V     I       G   
Sbjct: 119 GYILQGHLKDSISLYCSMRREGIGPVSFTLSALFKACTEVLDVSLGQQIHAQTILLG-GF 177

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
            +++ +G+ +I MY KCG +  + +VF  M ER+V S++ +IV +A +G  ++A  LF E
Sbjct: 178 TSDLYVGNTMIGMYVKCGFLGCSRKVFDEMPERDVVSWTELIVAYANNGDMESAGGLFNE 237

Query: 323 -------------------------------MVETEVRPNWVTFIGVLTACSHVGMVQQG 237
                                          M    V  ++VT +GV++AC+ +G+++  
Sbjct: 238 LPLKDKVAWTAMVTGYVQNAKPREAIEYFERMQYAGVETDYVTLVGVISACAQLGVIKYA 297

Query: 236 QEIF-ASMHKDYGVAPSADHYACMVDLLGRAGHLEEALEL 120
             +   +    +G   +    + ++D+  + G +++A  +
Sbjct: 298 NWVCEIAEGSGFGPINNVVVGSALIDMYSKCGSIDDAYRI 337


>ref|XP_006425612.1| hypothetical protein CICLE_v10025108mg [Citrus clementina]
           gi|557527602|gb|ESR38852.1| hypothetical protein
           CICLE_v10025108mg [Citrus clementina]
          Length = 653

 Score =  346 bits (887), Expect = 5e-93
 Identities = 166/227 (73%), Positives = 197/227 (86%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GYVQNAKP EA+  FE M+ AGV TD VTLVG +SACAQLG VKYA W+ ++AE  G  P
Sbjct: 252 GYVQNAKPREAIEYFERMQYAGVETDYVTLVGVISACAQLGVVKYANWVCEIAEGSGFGP 311

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
             NVV+GSALIDMYSKCGS+++A++VF  MK+RNVFSYS+MI+GFAMHGRA AA+QLF E
Sbjct: 312 INNVVVGSALIDMYSKCGSIDDAYRVFVDMKQRNVFSYSSMILGFAMHGRAHAAIQLFGE 371

Query: 323 MVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRAG 144
           MV+TE +PN VTFIGVLTACSHVG+V+QG+++FASM K YGV+PS DHYACMVDLLGRAG
Sbjct: 372 MVKTETKPNGVTFIGVLTACSHVGLVEQGRKLFASMEKCYGVSPSTDHYACMVDLLGRAG 431

Query: 143 HLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
            LEEAL++V+ MPVEP+GGVWGALLGAC IH NP+IA+IAANHLF+L
Sbjct: 432 CLEEALKMVEKMPVEPNGGVWGALLGACQIHRNPEIAQIAANHLFQL 478



 Score = 60.5 bits (145), Expect = 5e-07
 Identities = 50/224 (22%), Positives = 97/224 (43%), Gaps = 32/224 (14%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY+      +++ L+ SMR  G+     TL     AC ++  V     I       G   
Sbjct: 119 GYILQGHLKDSISLYCSMRREGIGPVSFTLSALFKACTEVLDVSLGQQIHAQTILLG-GF 177

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
            +++ + + +I MY KCG +  + +VF  M ER+V S++ +IV +A +G  ++A  LF E
Sbjct: 178 TSDLYVANTMIGMYVKCGFLGCSRKVFDEMPERDVVSWTELIVAYANNGDMESAGGLFNE 237

Query: 323 -------------------------------MVETEVRPNWVTFIGVLTACSHVGMVQQG 237
                                          M    V  ++VT +GV++AC+ +G+V+  
Sbjct: 238 LPLKDKVAWTAMVTGYVQNAKPREAIEYFERMQYAGVETDYVTLVGVISACAQLGVVKYA 297

Query: 236 QEIF-ASMHKDYGVAPSADHYACMVDLLGRAGHLEEALELVKTM 108
             +   +    +G   +    + ++D+  + G +++A  +   M
Sbjct: 298 NWVCEIAEGSGFGPINNVVVGSALIDMYSKCGSIDDAYRVFVDM 341


>ref|XP_006383156.1| hypothetical protein POPTR_0005s12100g [Populus trichocarpa]
           gi|550338737|gb|ERP60953.1| hypothetical protein
           POPTR_0005s12100g [Populus trichocarpa]
          Length = 654

 Score =  342 bits (877), Expect = 7e-92
 Identities = 165/227 (72%), Positives = 191/227 (84%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           G+ QNAKP EA+  FE M++ GV TDE+TL+G +SACAQLGA KYA WIRDVAEK     
Sbjct: 253 GFAQNAKPREAIMFFEKMQEFGVETDEITLIGVISACAQLGAAKYADWIRDVAEKSEFGG 312

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
             +VV+GSALIDMYSKCGSV +A++VF+GMKERNV+SYS+MI+GFAMHGR   AM+LF E
Sbjct: 313 KHSVVVGSALIDMYSKCGSVGDAYRVFQGMKERNVYSYSSMILGFAMHGRVHDAMKLFDE 372

Query: 323 MVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRAG 144
           MV+TE++PN VTFIGVLTACSH GMV+QG +IF  M K YG+ PSADHY CMVDLLGRAG
Sbjct: 373 MVKTEIKPNRVTFIGVLTACSHAGMVEQGWQIFELMEKCYGIKPSADHYTCMVDLLGRAG 432

Query: 143 HLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
            L+EA ELVKTMP+EPHGGVWGALLGAC IH +PDIA IAANHLFEL
Sbjct: 433 RLQEAHELVKTMPIEPHGGVWGALLGACRIHKSPDIAAIAANHLFEL 479



 Score = 64.3 bits (155), Expect = 3e-08
 Identities = 35/134 (26%), Positives = 72/134 (53%), Gaps = 1/134 (0%)
 Frame = -1

Query: 506 PATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFY 327
           P  +V+  + LI  Y K G++E A ++F G+  +++ +++ M+ GFA + +   A+  F 
Sbjct: 209 PNRDVISWTELISAYVKSGNMESAGELFDGLPVKDMVAWTVMVSGFAQNAKPREAIMFFE 268

Query: 326 EMVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHK-DYGVAPSADHYACMVDLLGR 150
           +M E  V  + +T IGV++AC+ +G  +    I     K ++G   S    + ++D+  +
Sbjct: 269 KMQEFGVETDEITLIGVISACAQLGAAKYADWIRDVAEKSEFGGKHSVVVGSALIDMYSK 328

Query: 149 AGHLEEALELVKTM 108
            G + +A  + + M
Sbjct: 329 CGSVGDAYRVFQGM 342


>ref|XP_007046822.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma
           cacao] gi|508699083|gb|EOX90979.1| Pentatricopeptide
           repeat (PPR) superfamily protein [Theobroma cacao]
          Length = 658

 Score =  337 bits (865), Expect = 2e-90
 Identities = 161/227 (70%), Positives = 192/227 (84%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY QNAKP EAL  FE M++ GV TDEVTLVG +SACAQLG  KYA W+R +AE  G  P
Sbjct: 257 GYAQNAKPREALEFFERMQNEGVETDEVTLVGVISACAQLGTAKYANWVRGIAENSGFDP 316

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
              VV+GSALIDMYSKCGSVE+A++VF+ M+ERNVFSYS+MI GFAMHG A AA++LF E
Sbjct: 317 TRCVVVGSALIDMYSKCGSVEDAYKVFEAMEERNVFSYSSMIAGFAMHGCAYAALELFRE 376

Query: 323 MVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRAG 144
           MV+T ++PN VTFIGVLTACSH GMV+QG++IFASM +++GV+P+ DHYAC+VDLLGRAG
Sbjct: 377 MVKTGIKPNRVTFIGVLTACSHSGMVEQGRQIFASMEEEFGVSPAVDHYACIVDLLGRAG 436

Query: 143 HLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
            LEEAL L +TMPVEP+GGVWGALLGAC  +GNPD+A+I ANHLFEL
Sbjct: 437 CLEEALNLAETMPVEPNGGVWGALLGACRTYGNPDMAQIGANHLFEL 483



 Score = 58.5 bits (140), Expect = 2e-06
 Identities = 30/135 (22%), Positives = 73/135 (54%), Gaps = 2/135 (1%)
 Frame = -1

Query: 506 PATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFY 327
           P  +++  + LI  Y+K G +E A ++F  +  +++ +++ M+ G+A + +   A++ F 
Sbjct: 213 PERDLISWTELIVAYAKLGDMESAGELFDELPIKDMVAWTTMVTGYAQNAKPREALEFFE 272

Query: 326 EMVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHY--ACMVDLLG 153
            M    V  + VT +GV++AC+ +G  +    +   + ++ G  P+      + ++D+  
Sbjct: 273 RMQNEGVETDEVTLVGVISACAQLGTAKYANWV-RGIAENSGFDPTRCVVVGSALIDMYS 331

Query: 152 RAGHLEEALELVKTM 108
           + G +E+A ++ + M
Sbjct: 332 KCGSVEDAYKVFEAM 346


>ref|XP_004233728.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g44230-like [Solanum lycopersicum]
          Length = 651

 Score =  335 bits (860), Expect = 6e-90
 Identities = 158/227 (69%), Positives = 191/227 (84%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           G+ QNAKP EAL  F  M+  GV TDE+TLVG +SACAQLGA KYA W+RD+AE  G  P
Sbjct: 249 GFAQNAKPREALEFFHRMQSEGVETDELTLVGVISACAQLGAAKYANWVRDMAEGYGFGP 308

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
           A +V++GSALIDMYSKCG+VEEA++VF+ MKE+NVFSYS+MI+GFAMHG A+AA+ LF E
Sbjct: 309 ANHVMVGSALIDMYSKCGNVEEAYKVFEKMKEKNVFSYSSMIMGFAMHGCANAALDLFEE 368

Query: 323 MVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRAG 144
           MV+TEV+PN VTFIGVL AC+H G+V++G+ +F +M K Y V PS +HYACM+DLLGRAG
Sbjct: 369 MVKTEVKPNKVTFIGVLMACTHAGLVERGRNLFDTMEKHYSVEPSVEHYACMIDLLGRAG 428

Query: 143 HLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
            LEEA EL+K MP+EP+ GVWGALLGAC IHGNPDIAE+AANHLFEL
Sbjct: 429 QLEEARELIKAMPMEPNSGVWGALLGACRIHGNPDIAEVAANHLFEL 475



 Score = 69.7 bits (169), Expect = 8e-10
 Identities = 36/133 (27%), Positives = 78/133 (58%), Gaps = 3/133 (2%)
 Frame = -1

Query: 497 NVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYEMV 318
           +V+  ++LI  YSK G +  A ++F+ +  +++ +++AM+ GFA + +   A++ F+ M 
Sbjct: 208 DVISWTSLIVAYSKAGDMAAAAEMFERLPVKDLVAWTAMVSGFAQNAKPREALEFFHRMQ 267

Query: 317 ETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHY---ACMVDLLGRA 147
              V  + +T +GV++AC+ +G  +    +   M + YG  P A+H    + ++D+  + 
Sbjct: 268 SEGVETDELTLVGVISACAQLGAAKYANWV-RDMAEGYGFGP-ANHVMVGSALIDMYSKC 325

Query: 146 GHLEEALELVKTM 108
           G++EEA ++ + M
Sbjct: 326 GNVEEAYKVFEKM 338


>ref|XP_006340666.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g44230-like [Solanum tuberosum]
          Length = 654

 Score =  335 bits (859), Expect = 8e-90
 Identities = 157/227 (69%), Positives = 191/227 (84%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           G+ QNAKP EAL  F  M+  GV TDE+TLVG +SACAQLGA KYA W+RD+AE  G+ P
Sbjct: 252 GFAQNAKPREALEFFHRMQSEGVETDELTLVGVISACAQLGAAKYANWVRDMAEGYGIGP 311

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
             +V++GSALIDMYSKCG+VEEA++VFK MKE+NVFSYS+MI+GFAMHG A+AA+ LF E
Sbjct: 312 VNHVMVGSALIDMYSKCGNVEEAYKVFKKMKEKNVFSYSSMIMGFAMHGCANAALDLFEE 371

Query: 323 MVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRAG 144
           MV+TEV+PN VTFIGVL AC+H G+V++G+ +F  M K YGV PS +HYACM+DLLGRAG
Sbjct: 372 MVKTEVKPNKVTFIGVLMACTHAGLVERGRHLFDKMEKHYGVEPSVEHYACMIDLLGRAG 431

Query: 143 HLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
            L+EALEL+K MP++P+ GVWGALLGAC IHGNPDIAE+AAN LFEL
Sbjct: 432 QLQEALELIKAMPMDPNSGVWGALLGACRIHGNPDIAEVAANRLFEL 478



 Score = 70.9 bits (172), Expect = 4e-10
 Identities = 36/133 (27%), Positives = 78/133 (58%), Gaps = 3/133 (2%)
 Frame = -1

Query: 497 NVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYEMV 318
           +V+  ++LI  YSK G +  A ++F+ +  +++ +++AM+ GFA + +   A++ F+ M 
Sbjct: 211 DVISWTSLIVAYSKSGDMAAAAELFERLPVKDLVAWTAMVSGFAQNAKPREALEFFHRMQ 270

Query: 317 ETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHY---ACMVDLLGRA 147
              V  + +T +GV++AC+ +G  +    +   M + YG+ P  +H    + ++D+  + 
Sbjct: 271 SEGVETDELTLVGVISACAQLGAAKYANWV-RDMAEGYGIGP-VNHVMVGSALIDMYSKC 328

Query: 146 GHLEEALELVKTM 108
           G++EEA ++ K M
Sbjct: 329 GNVEEAYKVFKKM 341


>ref|XP_003524199.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g44230-like [Glycine max]
          Length = 617

 Score =  333 bits (855), Expect = 2e-89
 Identities = 159/227 (70%), Positives = 190/227 (83%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY QNA P +AL +F  +RD GV  DEVTLVG +SACAQLGA KYA WIRD+AE  G   
Sbjct: 216 GYAQNAMPMDALEVFRRLRDEGVEIDEVTLVGVISACAQLGASKYANWIRDIAESSGFGV 275

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
             NV++GSALIDMYSKCG+VEEA+ VFKGM+ERNVFSYS+MIVGFA+HGRA AA++LFY+
Sbjct: 276 GDNVLVGSALIDMYSKCGNVEEAYDVFKGMRERNVFSYSSMIVGFAIHGRARAAIKLFYD 335

Query: 323 MVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRAG 144
           M+ET V+PN VTF+GVLTACSH G+V QGQ++FASM K YGVAP+A+ YACM DLL RAG
Sbjct: 336 MLETGVKPNHVTFVGVLTACSHAGLVDQGQQLFASMEKCYGVAPTAELYACMTDLLSRAG 395

Query: 143 HLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
           +LE+AL+LV+TMP+E  G VWGALLGA  +HGNPD+AEIA+  LFEL
Sbjct: 396 YLEKALQLVETMPMESDGAVWGALLGASHVHGNPDVAEIASKRLFEL 442



 Score = 67.8 bits (164), Expect = 3e-09
 Identities = 58/220 (26%), Positives = 99/220 (45%), Gaps = 37/220 (16%)
 Frame = -1

Query: 656 EALRLFESMRDAGVHTDEVTLVGAVSACAQ-----LGAVKYATWIRDVAEKEGLSPATNV 492
           +AL  + SMR   V     T     SACA      LGA  +A  +       G S  +++
Sbjct: 92  QALSFYSSMRKRRVSPISFTFSALFSACAAVRHSALGAQLHAQTLL----LGGFS--SDL 145

Query: 491 VMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYS------------------------- 387
            + +A+IDMY KCGS+  A  VF  M ER+V S++                         
Sbjct: 146 YVNNAVIDMYVKCGSLRCARMVFDEMPERDVISWTGLIVAYTRIGDMRAARDLFDGLPVK 205

Query: 386 ------AMIVGFAMHGRADAAMQLFYEMVETEVRPNWVTFIGVLTACSHVGMVQQGQEI- 228
                 AM+ G+A +     A+++F  + +  V  + VT +GV++AC+ +G  +    I 
Sbjct: 206 DMVTWTAMVTGYAQNAMPMDALEVFRRLRDEGVEIDEVTLVGVISACAQLGASKYANWIR 265

Query: 227 FASMHKDYGVAPSADHYACMVDLLGRAGHLEEALELVKTM 108
             +    +GV  +    + ++D+  + G++EEA ++ K M
Sbjct: 266 DIAESSGFGVGDNVLVGSALIDMYSKCGNVEEAYDVFKGM 305


>ref|XP_003629742.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355523764|gb|AET04218.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 616

 Score =  333 bits (855), Expect = 2e-89
 Identities = 157/227 (69%), Positives = 192/227 (84%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY QNA P +AL+ F  MR+AGV TDE+TLVGA+SACAQLG   YA WIR++AE      
Sbjct: 215 GYSQNAMPKKALQFFRKMREAGVVTDEITLVGAISACAQLGVSGYADWIREIAESSRFGS 274

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
            +NV +GSALIDMYSKCG+VEEA+ VFKGMKE NVFSYS+MIVGFA+HGRA +A++LFYE
Sbjct: 275 GSNVFVGSALIDMYSKCGNVEEAYNVFKGMKEMNVFSYSSMIVGFAVHGRARSAIKLFYE 334

Query: 323 MVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRAG 144
           M+E  ++PN VTF+G+ TACSH GMV+QGQ++F +M + YGV+P+ADHYACM DLLGRAG
Sbjct: 335 MLENGIKPNHVTFVGLFTACSHAGMVEQGQQLFGAMKECYGVSPTADHYACMADLLGRAG 394

Query: 143 HLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
           HLE+AL+LV+TMP+EP+GGVWGALLGA  IHGNPD+AEIA+  LFEL
Sbjct: 395 HLEKALQLVQTMPMEPNGGVWGALLGASHIHGNPDVAEIASRSLFEL 441


>ref|XP_007159438.1| hypothetical protein PHAVU_002G237800g [Phaseolus vulgaris]
           gi|561032853|gb|ESW31432.1| hypothetical protein
           PHAVU_002G237800g [Phaseolus vulgaris]
          Length = 617

 Score =  330 bits (846), Expect = 3e-88
 Identities = 153/227 (67%), Positives = 191/227 (84%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY QNA P +A+ +F  + D GV  DEVTLVG +SACAQLGA  YA WIRD+AE  G  P
Sbjct: 216 GYAQNAMPKDAVEVFRRLLDEGVEIDEVTLVGVISACAQLGASVYAKWIRDIAESSGFGP 275

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
            ++V++GSALIDMYSKCG+VEEA+ VFKGM+ERNVFSYS+MIVGFA+HGR  AA++LFY+
Sbjct: 276 GSSVLVGSALIDMYSKCGNVEEAYNVFKGMRERNVFSYSSMIVGFAIHGRVHAAIKLFYD 335

Query: 323 MVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRAG 144
           M+ETEV+PN VTF+GVLTAC+H G+V  GQ++FA+M K YGVAP+A+ YACM DLLGRAG
Sbjct: 336 MLETEVKPNHVTFVGVLTACTHAGLVDLGQQLFATMEKCYGVAPTAELYACMADLLGRAG 395

Query: 143 HLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
           +LE+ + LV+TMP++P G VWGALLGA ++HGNPD+AEIA+ HLFEL
Sbjct: 396 YLEKVIRLVETMPMKPDGAVWGALLGASYVHGNPDVAEIASKHLFEL 442



 Score = 65.1 bits (157), Expect = 2e-08
 Identities = 36/139 (25%), Positives = 76/139 (54%), Gaps = 6/139 (4%)
 Frame = -1

Query: 506 PATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFY 327
           P  +VV  + LI  Y++ G ++ A  +F G+  +++ +++AM+ G+A +     A+++F 
Sbjct: 172 PERDVVSWTELIVAYARRGDMKAAQDLFDGLHVKDMVAWTAMVTGYAQNAMPKDAVEVFR 231

Query: 326 EMVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKD------YGVAPSADHYACMV 165
            +++  V  + VT +GV++AC+     Q G  ++A   +D      +G   S    + ++
Sbjct: 232 RLLDEGVEIDEVTLVGVISACA-----QLGASVYAKWIRDIAESSGFGPGSSVLVGSALI 286

Query: 164 DLLGRAGHLEEALELVKTM 108
           D+  + G++EEA  + K M
Sbjct: 287 DMYSKCGNVEEAYNVFKGM 305


>ref|XP_004289114.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g44230-like [Fragaria vesca subsp. vesca]
          Length = 650

 Score =  325 bits (834), Expect = 6e-87
 Identities = 157/228 (68%), Positives = 189/228 (82%), Gaps = 1/228 (0%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY QN  P EAL  FE MR AGV  DEVTL+G VSACAQLGA +YA W+R +A + G  P
Sbjct: 248 GYAQNLMPREALDCFERMRGAGVGIDEVTLLGVVSACAQLGACRYANWVRGIAGESGFGP 307

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
           A NV++GSALIDMY+KCGS+++A+ +F+GMK+RNVFSYS+MI GFA+HG A+AA++LF+E
Sbjct: 308 AENVLVGSALIDMYAKCGSLDDAYDIFRGMKQRNVFSYSSMIWGFAVHGNANAAIELFHE 367

Query: 323 MVETEV-RPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRA 147
           M+ T+V RPN VTFIGVLTACSH GMV QG+++FA+M K Y V PSA+HY CMVDLLGRA
Sbjct: 368 MLTTDVIRPNRVTFIGVLTACSHAGMVDQGRQLFATMEKYYNVTPSAEHYTCMVDLLGRA 427

Query: 146 GHLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
           G LEEALEL +TMP+  HGGVWGALLGAC IH NP IA+IAA+HLFEL
Sbjct: 428 GRLEEALELAETMPIVAHGGVWGALLGACRIHKNPGIAQIAASHLFEL 475



 Score = 63.2 bits (152), Expect = 8e-08
 Identities = 55/249 (22%), Positives = 105/249 (42%), Gaps = 32/249 (12%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           G V   +  EA+R +  MR  G      T       C  +G V     +       G   
Sbjct: 115 GCVIEGQVSEAVRFYGLMRREGTGPVSFTFSSLFKGCGSVGDVSLGRQVHAQTVVIG-GF 173

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
           A ++ +G+ +IDMY KCG +    +VF  M ER+V S++ +I  +A  G   +A +LF E
Sbjct: 174 AADLYVGNTMIDMYLKCGELGCGRKVFDEMPERDVVSWTELIAVYAKSGDMGSARELFEE 233

Query: 323 MVETE-------------------------------VRPNWVTFIGVLTACSHVGMVQQG 237
           +   +                               V  + VT +GV++AC+ +G  +  
Sbjct: 234 LSLKDMVAWTAMVTGYAQNLMPREALDCFERMRGAGVGIDEVTLLGVVSACAQLGACRYA 293

Query: 236 QEIFA-SMHKDYGVAPSADHYACMVDLLGRAGHLEEALELVKTMPVEPHGGVWGALLGAC 60
             +   +    +G A +    + ++D+  + G L++A ++ + M  + +   + +++   
Sbjct: 294 NWVRGIAGESGFGPAENVLVGSALIDMYAKCGSLDDAYDIFRGMK-QRNVFSYSSMIWGF 352

Query: 59  WIHGNPDIA 33
            +HGN + A
Sbjct: 353 AVHGNANAA 361


>gb|EXC24885.1| hypothetical protein L484_013254 [Morus notabilis]
          Length = 615

 Score =  314 bits (804), Expect = 2e-83
 Identities = 150/227 (66%), Positives = 187/227 (82%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY QNA+P EAL +FE M++AGV TDEVTL+  +SACAQLG  +YA  +R VAE  G   
Sbjct: 213 GYAQNARPREALYIFERMKNAGVLTDEVTLISVISACAQLGVSRYANSVRVVAETSGFGA 272

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
             +V++GSAL+DMYSKCGSV+ A +VF+ MK+RNV+SYS+MI GFAMHGRA AA+QLF++
Sbjct: 273 CESVLVGSALVDMYSKCGSVDVAFEVFEKMKQRNVYSYSSMIAGFAMHGRAHAAIQLFHD 332

Query: 323 MVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRAG 144
           MV+ E++PN VTFIGVLTACSH  +V+QG+++FASM K YGVAPS +HY C+VDLLGRAG
Sbjct: 333 MVKLEIKPNHVTFIGVLTACSHASLVEQGRQVFASMEKHYGVAPSVEHYTCIVDLLGRAG 392

Query: 143 HLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
            LEEAL++++TMP+EP+GGVWGALLGAC    NP IA  AANHLFEL
Sbjct: 393 RLEEALKVIETMPMEPNGGVWGALLGACRRLKNPSIARHAANHLFEL 439



 Score = 59.7 bits (143), Expect = 9e-07
 Identities = 52/209 (24%), Positives = 88/209 (42%), Gaps = 34/209 (16%)
 Frame = -1

Query: 632 MRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSPA--TNVVMGSALIDMYS 459
           MR AG      T       C   GAVK     R +  +  L     +++ +G+ L+DMY 
Sbjct: 97  MRRAGSRPLSFTFSALFKTC---GAVKEMGLGRQIHAQTILVGGFVSDLYVGNTLMDMYV 153

Query: 458 KCGSVEEAHQVFKGMKERNVFSYS-------------------------------AMIVG 372
           KCG +  A +VF  M ER+V S++                               AM+ G
Sbjct: 154 KCGVLGCARRVFDEMPERDVVSWTELIVAHAKGGDMELAEDLFGELPVKDKVAWTAMVTG 213

Query: 371 FAMHGRADAAMQLFYEMVETEVRPNWVTFIGVLTACSHVGMVQQGQEI-FASMHKDYGVA 195
           +A + R   A+ +F  M    V  + VT I V++AC+ +G+ +    +   +    +G  
Sbjct: 214 YAQNARPREALYIFERMKNAGVLTDEVTLISVISACAQLGVSRYANSVRVVAETSGFGAC 273

Query: 194 PSADHYACMVDLLGRAGHLEEALELVKTM 108
            S    + +VD+  + G ++ A E+ + M
Sbjct: 274 ESVLVGSALVDMYSKCGSVDVAFEVFEKM 302


>ref|XP_006403127.1| hypothetical protein EUTSA_v10003396mg [Eutrema salsugineum]
           gi|557104240|gb|ESQ44580.1| hypothetical protein
           EUTSA_v10003396mg [Eutrema salsugineum]
          Length = 656

 Score =  312 bits (799), Expect = 7e-83
 Identities = 151/228 (66%), Positives = 182/228 (79%), Gaps = 1/228 (0%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           G+ QNAKP EAL  F+ M  +G+  DEVT+ G +SACAQLGA KYA    ++A K G SP
Sbjct: 254 GFAQNAKPQEALEYFDRMEKSGIRADEVTVAGFISACAQLGASKYADRAVEIARKCGYSP 313

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLF-Y 327
             +VV+GSALIDMYSKCG+VEEA  VF+ M  +NVFSYS+MI+G AMHGRA  A+ LF Y
Sbjct: 314 RDHVVIGSALIDMYSKCGNVEEALHVFESMNNKNVFSYSSMILGLAMHGRAQEALDLFNY 373

Query: 326 EMVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRA 147
            +  T V+PN VTF+GVLTACSH G+V QG++IFASMH+ +GV P++DHY CMVDLLGRA
Sbjct: 374 MVTHTTVKPNTVTFVGVLTACSHAGLVDQGRQIFASMHQTFGVKPTSDHYTCMVDLLGRA 433

Query: 146 GHLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
           G L+EALE++KTM VEPHGGVWGALLGAC IH +PD+AEIAA HLFEL
Sbjct: 434 GRLQEALEVIKTMSVEPHGGVWGALLGACRIHKDPDVAEIAAKHLFEL 481



 Score = 76.6 bits (187), Expect = 7e-12
 Identities = 57/226 (25%), Positives = 101/226 (44%), Gaps = 34/226 (15%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY    K  EA+ ++  MR  G+     T    + AC  +  +         A+   L  
Sbjct: 122 GYAIQGKLLEAISMYGCMRKEGITPVSFTFSALLKACGFVRDLNLGRQFH--AQTFRLRG 179

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYS--------------------- 387
              V +G+ +IDMY KCGS++ A +VF  M ER+V S++                     
Sbjct: 180 FCFVYVGNTMIDMYVKCGSIDCARKVFDEMPERDVISWTELIAAYARAGNMESASELFES 239

Query: 386 ----------AMIVGFAMHGRADAAMQLFYEMVETEVRPNWVTFIGVLTACSHVGMVQQG 237
                     AM+ GFA + +   A++ F  M ++ +R + VT  G ++AC+ +G  +  
Sbjct: 240 LPTKDMVAWTAMVTGFAQNAKPQEALEYFDRMEKSGIRADEVTVAGFISACAQLGASKYA 299

Query: 236 QEIFASMHKDYGVAPSADHY---ACMVDLLGRAGHLEEALELVKTM 108
                 + +  G +P  DH    + ++D+  + G++EEAL + ++M
Sbjct: 300 DRA-VEIARKCGYSP-RDHVVIGSALIDMYSKCGNVEEALHVFESM 343


>ref|XP_006280148.1| hypothetical protein CARUB_v10026047mg [Capsella rubella]
           gi|482548852|gb|EOA13046.1| hypothetical protein
           CARUB_v10026047mg [Capsella rubella]
          Length = 657

 Score =  311 bits (796), Expect = 2e-82
 Identities = 151/228 (66%), Positives = 182/228 (79%), Gaps = 1/228 (0%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           G+ QNAKP EAL  F+ M  +G+  DEVT+ G +SACAQLGA KYA     +A+K G SP
Sbjct: 255 GFAQNAKPQEALEYFDRMEKSGIRADEVTVAGFISACAQLGASKYADRAVQIAQKSGYSP 314

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
           + +VV+GSALIDMYSKCG+VEEA  VF  M ++NVFSYS+MI+G A+HGRA  A+ LF+ 
Sbjct: 315 SDHVVIGSALIDMYSKCGNVEEAVNVFASMNKKNVFSYSSMILGLAIHGRAQEALDLFHY 374

Query: 323 MV-ETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRA 147
           MV +T ++PN VTFIG LTACSH G+V QG+ +FASM++ +GV P+ DHY CMVDLLGRA
Sbjct: 375 MVTQTAIKPNTVTFIGALTACSHSGLVDQGRLVFASMYQTFGVKPTQDHYTCMVDLLGRA 434

Query: 146 GHLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
           G L+EALEL+KTM VEPHGGVWGALLGAC IH NPDIAEIAA HLFEL
Sbjct: 435 GRLQEALELIKTMSVEPHGGVWGALLGACRIHNNPDIAEIAAEHLFEL 482



 Score = 80.1 bits (196), Expect = 6e-13
 Identities = 65/246 (26%), Positives = 108/246 (43%), Gaps = 34/246 (13%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY    K  EA+ ++  MR   +     T    + AC  +G +K        A+   L  
Sbjct: 123 GYAIEGKFDEAVSMYGCMRKEEITPVSFTFSALLKACGSMGDLKLGRQFH--AQTFRLRG 180

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYS--------------------- 387
              V +G+ +IDMY KCGS++ A +VF  M ER+V S++                     
Sbjct: 181 FCFVYVGNTMIDMYVKCGSIDCARKVFDEMPERDVISWTELIAAYGRVGNMESAAELFES 240

Query: 386 ----------AMIVGFAMHGRADAAMQLFYEMVETEVRPNWVTFIGVLTACSHVGMVQQG 237
                     AMI GFA + +   A++ F  M ++ +R + VT  G ++AC+ +G  +  
Sbjct: 241 LPTKDMVAWTAMITGFAQNAKPQEALEYFDRMEKSGIRADEVTVAGFISACAQLGASKYA 300

Query: 236 QEIFASMHKDYGVAPSADHY---ACMVDLLGRAGHLEEALELVKTMPVEPHGGVWGALLG 66
                   K  G +PS DH    + ++D+  + G++EEA+ +  +M  +        +LG
Sbjct: 301 DRAVQIAQKS-GYSPS-DHVVIGSALIDMYSKCGNVEEAVNVFASMNKKNVFSYSSMILG 358

Query: 65  ACWIHG 48
              IHG
Sbjct: 359 LA-IHG 363


>ref|XP_006280147.1| hypothetical protein CARUB_v10026047mg [Capsella rubella]
           gi|482548851|gb|EOA13045.1| hypothetical protein
           CARUB_v10026047mg [Capsella rubella]
          Length = 565

 Score =  311 bits (796), Expect = 2e-82
 Identities = 151/228 (66%), Positives = 182/228 (79%), Gaps = 1/228 (0%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           G+ QNAKP EAL  F+ M  +G+  DEVT+ G +SACAQLGA KYA     +A+K G SP
Sbjct: 163 GFAQNAKPQEALEYFDRMEKSGIRADEVTVAGFISACAQLGASKYADRAVQIAQKSGYSP 222

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
           + +VV+GSALIDMYSKCG+VEEA  VF  M ++NVFSYS+MI+G A+HGRA  A+ LF+ 
Sbjct: 223 SDHVVIGSALIDMYSKCGNVEEAVNVFASMNKKNVFSYSSMILGLAIHGRAQEALDLFHY 282

Query: 323 MV-ETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRA 147
           MV +T ++PN VTFIG LTACSH G+V QG+ +FASM++ +GV P+ DHY CMVDLLGRA
Sbjct: 283 MVTQTAIKPNTVTFIGALTACSHSGLVDQGRLVFASMYQTFGVKPTQDHYTCMVDLLGRA 342

Query: 146 GHLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
           G L+EALEL+KTM VEPHGGVWGALLGAC IH NPDIAEIAA HLFEL
Sbjct: 343 GRLQEALELIKTMSVEPHGGVWGALLGACRIHNNPDIAEIAAEHLFEL 390



 Score = 67.8 bits (164), Expect = 3e-09
 Identities = 44/157 (28%), Positives = 84/157 (53%), Gaps = 6/157 (3%)
 Frame = -1

Query: 500 TNVVMGSA---LIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLF 330
           T V+ G A   LI  Y + G++E A ++F+ +  +++ +++AMI GFA + +   A++ F
Sbjct: 118 TAVIRGYAIEELIAAYGRVGNMESAAELFESLPTKDMVAWTAMITGFAQNAKPQEALEYF 177

Query: 329 YEMVETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHY---ACMVDL 159
             M ++ +R + VT  G ++AC+ +G  +          K  G +PS DH    + ++D+
Sbjct: 178 DRMEKSGIRADEVTVAGFISACAQLGASKYADRAVQIAQKS-GYSPS-DHVVIGSALIDM 235

Query: 158 LGRAGHLEEALELVKTMPVEPHGGVWGALLGACWIHG 48
             + G++EEA+ +  +M  +        +LG   IHG
Sbjct: 236 YSKCGNVEEAVNVFASMNKKNVFSYSSMILGLA-IHG 271


>ref|XP_002865376.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297311211|gb|EFH41635.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 658

 Score =  311 bits (796), Expect = 2e-82
 Identities = 150/228 (65%), Positives = 181/228 (79%), Gaps = 1/228 (0%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           G+ QNAKP EAL  F+ M  +G+  DEVT+ G +SACAQLGA KYA     +A+K G SP
Sbjct: 256 GFAQNAKPQEALEYFDRMEKSGIRADEVTVAGYISACAQLGASKYADRAVQIAQKSGYSP 315

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
           + +VV+GSALIDMYSKCG+VEEA  VF  M  +NVFSYS+MI+G A HGRA  A+ LF+ 
Sbjct: 316 SDHVVIGSALIDMYSKCGNVEEAVNVFVSMNNKNVFSYSSMILGLATHGRAQEALDLFHY 375

Query: 323 MV-ETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRA 147
           MV +T ++PN VTF+G LTACSH G+V QG+++FASM++ +GV P+ DHY CMVDLLGRA
Sbjct: 376 MVTQTAIKPNTVTFVGALTACSHSGLVDQGRQVFASMYQTFGVEPTRDHYTCMVDLLGRA 435

Query: 146 GHLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
           G L+EALEL+KTM VEPHGGVWGALLGAC IH NPDIAEIAA HLFEL
Sbjct: 436 GRLQEALELIKTMSVEPHGGVWGALLGACRIHNNPDIAEIAAEHLFEL 483



 Score = 75.9 bits (185), Expect = 1e-11
 Identities = 58/226 (25%), Positives = 99/226 (43%), Gaps = 34/226 (15%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY    K  EA+ ++  MR   +     T    + AC  +G +         A+   L  
Sbjct: 124 GYTIEGKFDEAIAMYGCMRKEEITPVSFTFSALLKACGSMGDLNLGRQFH--AQTFRLRG 181

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYS--------------------- 387
              V +G+ +IDMY KCGS+  A +VF  M ER+V S++                     
Sbjct: 182 FCFVYVGNTMIDMYVKCGSIVCARKVFDEMPERDVISWTELIAAYARVGNMESAADLFES 241

Query: 386 ----------AMIVGFAMHGRADAAMQLFYEMVETEVRPNWVTFIGVLTACSHVGMVQQG 237
                     AM+ GFA + +   A++ F  M ++ +R + VT  G ++AC+ +G  +  
Sbjct: 242 LPTKDMVAWTAMVTGFAQNAKPQEALEYFDRMEKSGIRADEVTVAGYISACAQLGASKYA 301

Query: 236 QEIFASMHKDYGVAPSADHY---ACMVDLLGRAGHLEEALELVKTM 108
                   K  G +PS DH    + ++D+  + G++EEA+ +  +M
Sbjct: 302 DRAVQIAQKS-GYSPS-DHVVIGSALIDMYSKCGNVEEAVNVFVSM 345


>ref|NP_199236.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75170229|sp|Q9FFG8.1|PP417_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At5g44230 gi|9759524|dbj|BAB10990.1| selenium-binding
           protein-like [Arabidopsis thaliana]
           gi|91806984|gb|ABE66219.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|332007694|gb|AED95077.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 657

 Score =  305 bits (780), Expect = 1e-80
 Identities = 146/228 (64%), Positives = 179/228 (78%), Gaps = 1/228 (0%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           G+ QNAKP EAL  F+ M  +G+  DEVT+ G +SACAQLGA KYA     +A+K G SP
Sbjct: 255 GFAQNAKPQEALEYFDRMEKSGIRADEVTVAGYISACAQLGASKYADRAVQIAQKSGYSP 314

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYSAMIVGFAMHGRADAAMQLFYE 324
           + +VV+GSALIDMYSKCG+VEEA  VF  M  +NVF+YS+MI+G A HGRA  A+ LF+ 
Sbjct: 315 SDHVVIGSALIDMYSKCGNVEEAVNVFMSMNNKNVFTYSSMILGLATHGRAQEALHLFHY 374

Query: 323 MV-ETEVRPNWVTFIGVLTACSHVGMVQQGQEIFASMHKDYGVAPSADHYACMVDLLGRA 147
           MV +TE++PN VTF+G L ACSH G+V QG+++F SM++ +GV P+ DHY CMVDLLGR 
Sbjct: 375 MVTQTEIKPNTVTFVGALMACSHSGLVDQGRQVFDSMYQTFGVQPTRDHYTCMVDLLGRT 434

Query: 146 GHLEEALELVKTMPVEPHGGVWGALLGACWIHGNPDIAEIAANHLFEL 3
           G L+EALEL+KTM VEPHGGVWGALLGAC IH NP+IAEIAA HLFEL
Sbjct: 435 GRLQEALELIKTMSVEPHGGVWGALLGACRIHNNPEIAEIAAEHLFEL 482



 Score = 71.2 bits (173), Expect = 3e-10
 Identities = 56/226 (24%), Positives = 98/226 (43%), Gaps = 34/226 (15%)
 Frame = -1

Query: 683 GYVQNAKPWEALRLFESMRDAGVHTDEVTLVGAVSACAQLGAVKYATWIRDVAEKEGLSP 504
           GY    K  EA+ ++  MR   +     T    + AC  +  +         A+   L  
Sbjct: 123 GYAIEGKFDEAIAMYGCMRKEEITPVSFTFSALLKACGTMKDLNLGRQFH--AQTFRLRG 180

Query: 503 ATNVVMGSALIDMYSKCGSVEEAHQVFKGMKERNVFSYS--------------------- 387
              V +G+ +IDMY KC S++ A +VF  M ER+V S++                     
Sbjct: 181 FCFVYVGNTMIDMYVKCESIDCARKVFDEMPERDVISWTELIAAYARVGNMECAAELFES 240

Query: 386 ----------AMIVGFAMHGRADAAMQLFYEMVETEVRPNWVTFIGVLTACSHVGMVQQG 237
                     AM+ GFA + +   A++ F  M ++ +R + VT  G ++AC+ +G  +  
Sbjct: 241 LPTKDMVAWTAMVTGFAQNAKPQEALEYFDRMEKSGIRADEVTVAGYISACAQLGASKYA 300

Query: 236 QEIFASMHKDYGVAPSADHY---ACMVDLLGRAGHLEEALELVKTM 108
                   K  G +PS DH    + ++D+  + G++EEA+ +  +M
Sbjct: 301 DRAVQIAQKS-GYSPS-DHVVIGSALIDMYSKCGNVEEAVNVFMSM 344

