BLASTX nr result

ID: Anemarrhena21_contig00012618 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Anemarrhena21_contig00012618
         (1199 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009410349.1| PREDICTED: pentatricopeptide repeat-containi...   413   e-112
ref|XP_008796414.1| PREDICTED: pentatricopeptide repeat-containi...   405   e-110
ref|XP_010270624.1| PREDICTED: pentatricopeptide repeat-containi...   354   8e-95
ref|XP_009415576.1| PREDICTED: pentatricopeptide repeat-containi...   354   8e-95
ref|XP_008799153.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   338   6e-90
ref|XP_009759999.1| PREDICTED: pentatricopeptide repeat-containi...   327   1e-86
ref|XP_009624485.1| PREDICTED: pentatricopeptide repeat-containi...   324   7e-86
ref|XP_007043123.1| Pentatricopeptide repeat superfamily protein...   324   9e-86
ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containi...   323   2e-85
ref|XP_012462310.1| PREDICTED: pentatricopeptide repeat-containi...   320   2e-84
ref|XP_006342489.1| PREDICTED: pentatricopeptide repeat-containi...   320   2e-84
ref|XP_009626842.1| PREDICTED: pentatricopeptide repeat-containi...   319   2e-84
ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containi...   317   1e-83
gb|KCW55702.1| hypothetical protein EUGRSUZ_I01549 [Eucalyptus g...   316   2e-83
emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera]   316   2e-83
ref|XP_010030710.1| PREDICTED: pentatricopeptide repeat-containi...   315   4e-83
ref|XP_006573589.1| PREDICTED: pentatricopeptide repeat-containi...   313   1e-82
ref|XP_006573588.1| PREDICTED: pentatricopeptide repeat-containi...   313   1e-82
ref|XP_003516576.2| PREDICTED: pentatricopeptide repeat-containi...   313   1e-82
ref|XP_007156913.1| hypothetical protein PHAVU_002G027800g [Phas...   313   1e-82

>ref|XP_009410349.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Musa acuminata subsp. malaccensis]
          Length = 413

 Score =  413 bits (1062), Expect = e-112
 Identities = 232/420 (55%), Positives = 271/420 (64%), Gaps = 45/420 (10%)
 Frame = -1

Query: 1133 MQPRISRALFSKLLRKFSHVPKSPIADRSPSLPLAHFSTI--PNRPMRGNRRRDDGSEDD 960
            MQ  +S+ LF+ L R+  H+PK+ I  R   LP   FST   P  P RG RRRDDGSED 
Sbjct: 1    MQRTLSKLLFNNLARQLCHLPKASILGRC--LPAIDFSTNNDPGGPTRGGRRRDDGSEDL 58

Query: 959  FLRNLNFGGERDGDNAENTHQNSPSRMPHR--PLRGDQRPIKREGRDGDDRFH------- 807
            FLR+LNFG +  G+  E THQ +PSR P    PL G Q+  K     GDD          
Sbjct: 59   FLRSLNFGDD-GGEEQEMTHQEAPSRRPSARPPLGGGQQSGKEPSFRGDDSIDIASGDLF 117

Query: 806  --QKFKGGNLAFGGLRENGDERID--------------------------------ESLK 729
               +F  G+    G   NG  R D                                E L 
Sbjct: 118  PGLEFGDGSRGLRGRSRNGPVRRDTPREDFGRQDNMDGFGTARQRSPSRSAGGFRGEELD 177

Query: 728  TGNGEKERNQSGDTLLDKLNFGYMGSRKSAEETDTNSSTQVPGTEPSVPEKPPEDADEIF 549
             G  E+  ++ GD+L  K+NFG  G R   EE D   +      E +  E PPEDADEIF
Sbjct: 178  DGGEERRSDRIGDSLAQKINFGEAGRRNRVEEADQKPAV----AESAAQEAPPEDADEIF 233

Query: 548  KKMKATGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKF 369
            KKMK TGLIPNAVAMLDGLCKDGLVQ+AMKLFGLMREKGTIPEVVIYTAVVEGFCK AKF
Sbjct: 234  KKMKETGLIPNAVAMLDGLCKDGLVQDAMKLFGLMREKGTIPEVVIYTAVVEGFCKGAKF 293

Query: 368  DDAKRIFRKMQKNGIVPNAFSYTVLIQGLCRGKRLEDAVEFCMEMVDAGHSLNAATLTGL 189
            DDAKRIFRKMQKNGIVPNAFS+ VLIQGLC+GK+LED+VEFCMEM+DAGH+ + ATL GL
Sbjct: 294  DDAKRIFRKMQKNGIVPNAFSFKVLIQGLCKGKKLEDSVEFCMEMLDAGHAPSVATLIGL 353

Query: 188  VNGFCKEKGVEEAEGFVRSLRERGFVIDDKAVSEYLDKKGPYSPMVWEAILGKKNSQRRF 9
            V+GFC+EKGVEE E  +  LRERGFV+D++AV E+L+KKGP+SP VW+A  GKKNS+  F
Sbjct: 354  VDGFCQEKGVEEGENVIIRLRERGFVLDERAVREHLNKKGPFSPKVWDAFFGKKNSRGPF 413


>ref|XP_008796414.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Phoenix dactylifera]
          Length = 424

 Score =  405 bits (1042), Expect = e-110
 Identities = 230/411 (55%), Positives = 273/411 (66%), Gaps = 53/411 (12%)
 Frame = -1

Query: 1082 SHVPKSPIADRSP-------SLPLAHFSTIPNRPMRGNRRRDDGSEDDFLRNLNFG---- 936
            SH+PK  I +RS        S+   HFST PNRPMRG RRR+D SED FL++LNFG    
Sbjct: 16   SHLPKDSILERSSPGALLLRSVSNTHFSTSPNRPMRGERRREDPSEDLFLKSLNFGDDGE 75

Query: 935  -----------GER--DGDNAENTH-------------QNSPSRMPHRPLRGDQRP---- 846
                       GER  DG + +  H             +   SR+P RP+RG++R     
Sbjct: 76   EERTVNTRPLRGERRLDGGSGDLFHGLKDEDKILGRGRRTPESRIPERPMRGERREDSGY 135

Query: 845  ----IKREGRDGDDRFHQKF-KGGNLAFGGLRENGDERIDESLKTGNG-------EKERN 702
                 +  G D ++ F     K  +L   G +   +E+  ES+ TG+        EK  +
Sbjct: 136  SRQRFRNHGEDYEENFGIPGPKSASLFSDGPKS--EEKNKESIDTGDQLKDSAEIEKGGD 193

Query: 701  QSGDTLLDKLNFGYMGSRKSAEETDTNSSTQVPGTEPSVPEKPPEDADEIFKKMKATGLI 522
            ++GDTL  KLN G  G     EE     + Q  G +  VP+   EDADEIFK+MK TGLI
Sbjct: 194  KTGDTLFKKLNLGDAGRGGKVEEAPQKQTKQSYGPDSMVPKSQSEDADEIFKEMKETGLI 253

Query: 521  PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFDDAKRIFRK 342
            PNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFDDAKRIFRK
Sbjct: 254  PNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFDDAKRIFRK 313

Query: 341  MQKNGIVPNAFSYTVLIQGLCRGKRLEDAVEFCMEMVDAGHSLNAATLTGLVNGFCKEKG 162
            MQKNGI+PNAFSY VLIQGLC+G +LED+VE+CMEM+ AGH  NAAT TGLV+ +CKEKG
Sbjct: 314  MQKNGIMPNAFSYAVLIQGLCKGGKLEDSVEYCMEMLGAGHLPNAATFTGLVDRYCKEKG 373

Query: 161  VEEAEGFVRSLRERGFVIDDKAVSEYLDKKGPYSPMVWEAILGKKNSQRRF 9
            VEEA   VR+LRERGF +D+KAV E+LDKKGP+SPMVWEAI GKKN QR F
Sbjct: 374  VEEAGSLVRTLRERGFAMDEKAVREHLDKKGPFSPMVWEAIFGKKNLQRPF 424


>ref|XP_010270624.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Nelumbo nucifera] gi|720046844|ref|XP_010270625.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g38150-like [Nelumbo nucifera]
          Length = 407

 Score =  354 bits (908), Expect = 8e-95
 Identities = 204/421 (48%), Positives = 260/421 (61%), Gaps = 51/421 (12%)
 Frame = -1

Query: 1118 SRALFSKLLRKFSHVPKSPIADRSPSLPLAHFSTIPNRPMRGNRRRDDGSEDDFLRNLNF 939
            S+  FS LL+  SH  +S      P +   HFS+I + P+RG  R  + S+D F   L  
Sbjct: 3    SKLRFSNLLKFLSH--RSETVSTFPIVH--HFSSIRDTPIRGEIR-SNASQDPFFSKLES 57

Query: 938  GGERDGDNAEN---THQNSPSRMPHRPLRGDQRPI------------------------- 843
            G  +DG + E    T+QN P+ +P+RP+RG++R                           
Sbjct: 58   GYGQDGKDEERSNRTYQNPPNPIPNRPMRGEKRREPSEYHFNGKFKLGDDEDDEKMRKPD 117

Query: 842  -------------KREGRDGDDRFHQKFKGGNLAFGGLRENGDERIDESLKTGN------ 720
                         KREGR   D F +KF  G+       +  DER  ES ++ +      
Sbjct: 118  QIRQTHFGSSREGKREGRFNGDTFARKFDFGS-------DIVDERTSESQQSPSVQFPNR 170

Query: 719  ---GEKERNQSGDTLLDKLNFGYMGSRKSAEETDTNSSTQVPGTEPSV-PEKPPEDADEI 552
               GEK  +   ++ L+KL        +  + T+  S TQV  T+    P+  P+DADEI
Sbjct: 171  PMKGEKRGSPLDESFLEKLRL----CEEKKKNTNETSPTQVTETDVKAEPDSTPQDADEI 226

Query: 551  FKKMKATGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAK 372
            F+KMK TGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGT PEVVIYTAVVEGFCKA K
Sbjct: 227  FRKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTFPEVVIYTAVVEGFCKAEK 286

Query: 371  FDDAKRIFRKMQKNGIVPNAFSYTVLIQGLCRGKRLEDAVEFCMEMVDAGHSLNAATLTG 192
             DDAKRIFRKMQ NGI PNAFSYTV IQGL +GKRLEDA++ C+EM++AGHS N  T TG
Sbjct: 287  LDDAKRIFRKMQNNGISPNAFSYTVFIQGLYKGKRLEDAIDICVEMLEAGHSPNVTTFTG 346

Query: 191  LVNGFCKEKGVEEAEGFVRSLRERGFVIDDKAVSEYLDKKGPYSPMVWEAILGKKNSQRR 12
            LV+  C++KGVEEA+  +  LRE+G+ +D+KA+ EYLDKKGP+SP++WEA+ GKKNS+  
Sbjct: 347  LVDAICRDKGVEEAKSTIERLREKGYFVDEKAIREYLDKKGPFSPLIWEAVFGKKNSKLS 406

Query: 11   F 9
            F
Sbjct: 407  F 407


>ref|XP_009415576.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
            [Musa acuminata subsp. malaccensis]
          Length = 390

 Score =  354 bits (908), Expect = 8e-95
 Identities = 216/397 (54%), Positives = 261/397 (65%), Gaps = 27/397 (6%)
 Frame = -1

Query: 1133 MQPRISRALFSKLLRKFSHVPKSPIADRSPSLPLAHFSTI--PNRPMRGNRRRDDGSEDD 960
            M+  +S+ L S   R+  +  K+ I +R   +P   FST   P RPMRG RRRDD SED 
Sbjct: 1    MRTALSKLLLSNPWRRICYQHKAYILERC--VPSNDFSTRNNPRRPMRGERRRDDRSEDI 58

Query: 959  FLRNLNFGGERDGDNAENTHQNSPSRMPHRP-----LRGDQR-----PIKRE-GRDG--D 819
            FLR LNFG +   +  +  H+ +    P RP     LRG Q+     P++ E G DG  D
Sbjct: 59   FLRGLNFGDDDGVNGPQRAHREA---FPDRPYDGPSLRGAQQRKKEPPLREEDGSDGAAD 115

Query: 818  DR---FHQKFKGGNLAFGGLRENGDERIDESLKTGNGEKERNQSGDTLLDKLNFGYMGSR 648
            D    F    + G +  G  R N   R  +  + G G   ++Q  D   D         +
Sbjct: 116  DLLVDFDLADRTGRVPPGHTR-NSVRR--DPPREGFGPSPQSQFKDFGGDYFEGSGSPQQ 172

Query: 647  K----SAEETDTNSSTQVPGTEPSVP-----EKPPEDADEIFKKMKATGLIPNAVAMLDG 495
            K    SA+    + S  V  T P+V      E PPEDADEIFKKMK TGLIPNAVAMLDG
Sbjct: 173  KARPPSADGHRVDKSDVVDQTPPTVAKSAAEEAPPEDADEIFKKMKETGLIPNAVAMLDG 232

Query: 494  LCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFDDAKRIFRKMQKNGIVPN 315
            LCKDGL+QEAMKLFG MREKGT+PEVVIYTA VEGFCKAA+FDDAKRIFRKMQKNG  PN
Sbjct: 233  LCKDGLIQEAMKLFGSMREKGTMPEVVIYTAAVEGFCKAARFDDAKRIFRKMQKNGTAPN 292

Query: 314  AFSYTVLIQGLCRGKRLEDAVEFCMEMVDAGHSLNAATLTGLVNGFCKEKGVEEAEGFVR 135
            AFSY VLIQGLC+GK+L+D+VEFCMEM+DAGHS +  T+  +V+GFC+EKGVEEA   V+
Sbjct: 293  AFSYKVLIQGLCKGKKLDDSVEFCMEMLDAGHSPSVTTVVDVVDGFCREKGVEEAADVVK 352

Query: 134  SLRERGFVIDDKAVSEYLDKKGPYSPMVWEAILGKKN 24
             LRERGFV+D KAVSE+LDKKGP+SPMV+EAI GKK+
Sbjct: 353  RLRERGFVLDLKAVSEHLDKKGPFSPMVFEAISGKKD 389


>ref|XP_008799153.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At4g38150 [Phoenix dactylifera]
          Length = 357

 Score =  338 bits (866), Expect = 6e-90
 Identities = 192/342 (56%), Positives = 224/342 (65%), Gaps = 20/342 (5%)
 Frame = -1

Query: 1004 PMRGNRRRDDGSEDDFLRNLNFGGERDGDNAENTHQNSP-SRMPHRPLRGDQRP------ 846
            P+RG  R D+GS+D F         +D D        +P SR+P+RPLRG  R       
Sbjct: 12   PLRGEGRLDEGSKDLFPEL------KDQDGIMGRGGRAPESRIPNRPLRGVGREDFGHFR 65

Query: 845  --IKREGRDGDDRFHQKFKGGNLAFGGLRENGDERI----DESLKTGNGEKERNQSGDTL 684
              I   G D  + F         A G   E  ++      D+S  +   EK  +++GDTL
Sbjct: 66   WKIGNRGEDYKEFFLGPKSASLFADGPKSEEKNKESTDIGDQSKDSAKIEKGGDKTGDTL 125

Query: 683  LDKLNFGYMGSRKSAEETDTNSSTQVPGTEPSVPEKPPEDADEIFKKMKATGL------- 525
              KLN G  GS  + E      S Q  G +    E  PEDADEIF+KMK TGL       
Sbjct: 126  FKKLNLGDAGSGGNVEVAPQKKSKQSSGPDSVALESVPEDADEIFRKMKETGLSLFSFFF 185

Query: 524  IPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFDDAKRIFR 345
            IPNAVAMLDGLCKDGL+QEAMKLFGL+REKGT+PEVVIYTAVVEGFCKAAKFDDAKRIFR
Sbjct: 186  IPNAVAMLDGLCKDGLIQEAMKLFGLLREKGTVPEVVIYTAVVEGFCKAAKFDDAKRIFR 245

Query: 344  KMQKNGIVPNAFSYTVLIQGLCRGKRLEDAVEFCMEMVDAGHSLNAATLTGLVNGFCKEK 165
            KMQKNGIVPNAFSY VLIQGLC+G +LED VEFCMEM+D GH  NAAT TGLV+G CK+K
Sbjct: 246  KMQKNGIVPNAFSYAVLIQGLCKGGKLEDFVEFCMEMLDVGHLPNAATFTGLVDGCCKDK 305

Query: 164  GVEEAEGFVRSLRERGFVIDDKAVSEYLDKKGPYSPMVWEAI 39
            GVEEA   VR+LRERGF +D+KA   +LDKKGP SP+VWE +
Sbjct: 306  GVEEAGSLVRTLRERGFAVDEKAARVHLDKKGPLSPVVWEEL 347


>ref|XP_009759999.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like [Nicotiana sylvestris]
           gi|698526340|ref|XP_009760000.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At4g38150-like [Nicotiana sylvestris]
          Length = 342

 Score =  327 bits (837), Expect = 1e-86
 Identities = 172/324 (53%), Positives = 221/324 (68%), Gaps = 9/324 (2%)
 Frame = -1

Query: 956 LRNLNFGGERDGDNAENTHQNSPSRMPHRPLRGDQR----PIKREGRDGDDRFHQKFKGG 789
           LR+ +   +   ++ ++++   P  +P+RPLR D R    P +R+     +  H      
Sbjct: 37  LRSFSSSNKYSDESTQSSYPPPPDPIPNRPLRADPRRPFNPSQRQRPGSSNPTH------ 90

Query: 788 NLAFGGLRENGDERIDESLKTGNGEKERNQSGDTLLDKLNFGYMGSRKSAEETDTNSSTQ 609
           +  F    EN + +I            ++Q     L +   G+   RK  E T+TN +  
Sbjct: 91  STTFRKPGENNENQI------------KSQDSQDFLKRFQLGF--DRKD-ENTNTNPALH 135

Query: 608 VPGTEPSVPEK-----PPEDADEIFKKMKATGLIPNAVAMLDGLCKDGLVQEAMKLFGLM 444
             G     P       PPED+DEIFKKMK TGLIPNAVAMLDGLCKDGLVQEAMKLFGLM
Sbjct: 136 PEGERSDAPASEAPPAPPEDSDEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLM 195

Query: 443 REKGTIPEVVIYTAVVEGFCKAAKFDDAKRIFRKMQKNGIVPNAFSYTVLIQGLCRGKRL 264
           REKG IPEVVIYTAVVEGFCKA K+DDA RIFRKMQ NGI+PNAFSY +LI+GLC+GKRL
Sbjct: 196 REKGAIPEVVIYTAVVEGFCKAHKYDDAVRIFRKMQGNGIIPNAFSYGILIRGLCQGKRL 255

Query: 263 EDAVEFCMEMVDAGHSLNAATLTGLVNGFCKEKGVEEAEGFVRSLRERGFVIDDKAVSEY 84
           EDA+EFC+EM++AGHS N  T  GLV+G+CKEK +E+A+  ++++R++GF +D+KAV EY
Sbjct: 256 EDALEFCLEMLEAGHSPNLMTFVGLVDGYCKEKSLEDAQSMIKAVRQKGFTLDEKAVREY 315

Query: 83  LDKKGPYSPMVWEAILGKKNSQRR 12
           LDKKGP+ P+VWEAILGKK SQR+
Sbjct: 316 LDKKGPFLPLVWEAILGKKASQRQ 339


>ref|XP_009624485.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like [Nicotiana tomentosiformis]
          Length = 342

 Score =  324 bits (831), Expect = 7e-86
 Identities = 170/324 (52%), Positives = 221/324 (68%), Gaps = 9/324 (2%)
 Frame = -1

Query: 956 LRNLNFGGERDGDNAENTHQNSPSRMPHRPLRGDQR----PIKREGRDGDDRFHQKFKGG 789
           LR+ +   +   ++ ++ +   P  +P+RPLR D R    P +R+     +  H      
Sbjct: 37  LRSFSSSNKYVDESTQSNYPPPPDPIPNRPLRADSRRPFNPSQRQRPSSSNPTH------ 90

Query: 788 NLAFGGLRENGDERIDESLKTGNGEKERNQSGDTLLDKLNFGYMGSRKSAEETDTNSSTQ 609
           +  F    EN + +I            ++Q     L +   G+   RK  E  +TN +  
Sbjct: 91  STTFRRPGENNENQI------------KSQDSQDFLKRFQLGF--DRKD-ENPNTNPALH 135

Query: 608 VPGTEPSVPEK-----PPEDADEIFKKMKATGLIPNAVAMLDGLCKDGLVQEAMKLFGLM 444
             G     P       PPED+DEIFKKMK TGLIPNAVAMLDGLCKDGLVQEAMKLFGLM
Sbjct: 136 PKGEMSDTPASESSPAPPEDSDEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLM 195

Query: 443 REKGTIPEVVIYTAVVEGFCKAAKFDDAKRIFRKMQKNGIVPNAFSYTVLIQGLCRGKRL 264
           REKGTIPEVVIYTAVVEGFCKA K+DD  RIFRKMQ NGI+PNAFSY++LI+GLC+G+RL
Sbjct: 196 REKGTIPEVVIYTAVVEGFCKAHKYDDGVRIFRKMQGNGIIPNAFSYSILIRGLCQGRRL 255

Query: 263 EDAVEFCMEMVDAGHSLNAATLTGLVNGFCKEKGVEEAEGFVRSLRERGFVIDDKAVSEY 84
           EDA+EFC+EM++AGHS N  T  GLV+G+CKEK +E+A+  ++++R++GF++D+KAV EY
Sbjct: 256 EDALEFCLEMLEAGHSPNLMTFVGLVDGYCKEKSLEDAQSMIKAVRQKGFILDEKAVREY 315

Query: 83  LDKKGPYSPMVWEAILGKKNSQRR 12
           LDKKGP+ P+VWEAILGKK SQR+
Sbjct: 316 LDKKGPFLPLVWEAILGKKASQRQ 339


>ref|XP_007043123.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao] gi|508707058|gb|EOX98954.1| Pentatricopeptide
            repeat superfamily protein, putative [Theobroma cacao]
          Length = 345

 Score =  324 bits (830), Expect = 9e-86
 Identities = 182/370 (49%), Positives = 235/370 (63%), Gaps = 3/370 (0%)
 Frame = -1

Query: 1109 LFSKLLRKFSHVPKSPIADRSPSLPLAHFSTIPNRPMRGNRRRDDGSEDDFLRNLNFGGE 930
            +F+KL++  +  P  PI    PSL L       +  MRG  R +D         ++F   
Sbjct: 8    VFTKLMKILTLKPHHPILGSPPSLSLLQTRLFSD--MRGPFRDNDP--------ISFNSN 57

Query: 929  RDGDNAENTHQNSPSRMPHRPLRGDQRPIKREGRDGDDRFHQKFKGGNLAFGGLRENGDE 750
             DGD         P  +P+R L G QRP     R+         KG  L       NG  
Sbjct: 58   GDGDKP-------PEPIPNRSLEG-QRPFNPSFRET--------KGATL-----NSNGSS 96

Query: 749  RIDESLKTG---NGEKERNQSGDTLLDKLNFGYMGSRKSAEETDTNSSTQVPGTEPSVPE 579
                + K     N ++E +QS +  L+K   G + +++  + +D+ ++  +   E     
Sbjct: 97   FQSFNTKFASDPNRKREDSQSDENFLEKFKLG-LDNKRGKQPSDSEAAALLRRKEQEEKP 155

Query: 578  KPPEDADEIFKKMKATGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAV 399
             PP+DADEIFKKMK TGLIPNAVAMLDGLCKDGL+QEAMKLFG MREKGTIPEVVIYTAV
Sbjct: 156  SPPQDADEIFKKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGSMREKGTIPEVVIYTAV 215

Query: 398  VEGFCKAAKFDDAKRIFRKMQKNGIVPNAFSYTVLIQGLCRGKRLEDAVEFCMEMVDAGH 219
            V+GFCKA K DDAKRIFRKMQ  G+ PN+FSY VLIQGL R  +L+DA+EFC+EM++AGH
Sbjct: 216  VDGFCKAHKLDDAKRIFRKMQSKGVTPNSFSYIVLIQGLYRCNKLDDAIEFCLEMLEAGH 275

Query: 218  SLNAATLTGLVNGFCKEKGVEEAEGFVRSLRERGFVIDDKAVSEYLDKKGPYSPMVWEAI 39
            S N  T  GLV+G CKEKGVEEA+  + +L+++GFV++DKAV ++LDKK P+SP+VWEAI
Sbjct: 276  SPNVTTFVGLVDGLCKEKGVEEAQSVIGTLKQKGFVLNDKAVRQFLDKKAPFSPLVWEAI 335

Query: 38   LGKKNSQRRF 9
             GKK SQ+ F
Sbjct: 336  FGKKPSQKTF 345


>ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
           [Solanum lycopersicum]
          Length = 340

 Score =  323 bits (828), Expect = 2e-85
 Identities = 170/328 (51%), Positives = 220/328 (67%), Gaps = 1/328 (0%)
 Frame = -1

Query: 992 NRRRDDGSEDDFLRNLNFGGERDGDNAENTHQNSPSRMPHRPLRGD-QRPIKREGRDGDD 816
           N  R   +   F  +  F    D ++AE+ +   P  +P+RPLR D +RP     R    
Sbjct: 27  NETRSSTNLRSFSSSNKFSDYSD-ESAESNYPPPPEPIPNRPLRADSRRPFNPSQRQHPS 85

Query: 815 RFHQKFKGGNLAFGGLRENGDERIDESLKTGNGEKERNQSGDTLLDKLNFGYMGSRKSAE 636
             ++     +  F    EN + ++            ++Q  +  L +   G+    ++  
Sbjct: 86  --NRSSPNHSTTFRRSSENNESQM------------KSQDSEDFLKRFQLGFDRKEENPN 131

Query: 635 ETDTNSSTQVPGTEPSVPEKPPEDADEIFKKMKATGLIPNAVAMLDGLCKDGLVQEAMKL 456
                 S   P +E   P  PPEDADEIFKKMK TGLIPNAVAMLDGLCKDGLVQEAMKL
Sbjct: 132 TNPKAESRDCPVSE--APPAPPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKL 189

Query: 455 FGLMREKGTIPEVVIYTAVVEGFCKAAKFDDAKRIFRKMQKNGIVPNAFSYTVLIQGLCR 276
           FGLMREKGTIPEVVIYTAVV+GFCKA KFDDA RIFRKMQ NGI+PNAFSY ++I+GL +
Sbjct: 190 FGLMREKGTIPEVVIYTAVVDGFCKAQKFDDAVRIFRKMQGNGIIPNAFSYGIIIRGLSQ 249

Query: 275 GKRLEDAVEFCMEMVDAGHSLNAATLTGLVNGFCKEKGVEEAEGFVRSLRERGFVIDDKA 96
           GKRL+DA+EFC+EM++AGHS N  T   LV+GFCKEK +E+A+  ++++R++GF++DDKA
Sbjct: 250 GKRLDDALEFCLEMLEAGHSPNVVTFVTLVDGFCKEKSLEDAQNMIKTVRQKGFIVDDKA 309

Query: 95  VSEYLDKKGPYSPMVWEAILGKKNSQRR 12
           V E+LDKKGP+ P+VWEAILGKK SQR+
Sbjct: 310 VREFLDKKGPFLPVVWEAILGKKASQRQ 337


>ref|XP_012462310.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
            isoform X1 [Gossypium raimondii]
            gi|763812905|gb|KJB79757.1| hypothetical protein
            B456_013G065500 [Gossypium raimondii]
          Length = 341

 Score =  320 bits (819), Expect = 2e-84
 Identities = 184/382 (48%), Positives = 233/382 (60%), Gaps = 7/382 (1%)
 Frame = -1

Query: 1133 MQPRISRALFSKLLRKFSHVPKSPIADRSPS---LPLAHFSTIPNRPMRGNRRRDDGSED 963
            M+   +R + +KL++  +  P  P++  +PS   L    FS I  RP+  N       +D
Sbjct: 1    MESSQARRVLTKLMKVLTLKPHHPVSRAAPSSCVLQTRFFSDI-KRPITENESIRSNEDD 59

Query: 962  DFLRNLNFGGERDGDNAENTHQNSPSRMPHRPLRGDQRPIKREGRDGD----DRFHQKFK 795
            D                      +   +P RPLRG +RP     R+ +    DR    F+
Sbjct: 60   D---------------------GATEHIPKRPLRG-RRPFNPSFRETEGASFDRNRSSFQ 97

Query: 794  GGNLAFGGLRENGDERIDESLKTGNGEKERNQSGDTLLDKLNFGYMGSRKSAEETDTNSS 615
              N  F           D + K     +E +QS    L+K   G    R    E   + S
Sbjct: 98   SPNAKFAS---------DPTKK-----REDSQSDVNFLEKFKLGLENKR----ERVPSES 139

Query: 614  TQVPGTEPSVPEKPPEDADEIFKKMKATGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREK 435
              +   E      PPEDADEIFKKMK TGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREK
Sbjct: 140  EAMHRKEHEEKLSPPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREK 199

Query: 434  GTIPEVVIYTAVVEGFCKAAKFDDAKRIFRKMQKNGIVPNAFSYTVLIQGLCRGKRLEDA 255
            GTIPEVVIYTAVV+GFCKA K +DAKRIFRKMQ  G++PNAFSYTVLIQGL + K L+DA
Sbjct: 200  GTIPEVVIYTAVVDGFCKAHKLEDAKRIFRKMQSKGVIPNAFSYTVLIQGLYKCKHLDDA 259

Query: 254  VEFCMEMVDAGHSLNAATLTGLVNGFCKEKGVEEAEGFVRSLRERGFVIDDKAVSEYLDK 75
            +EFC+EMV+AGHS N  T  GLV+G CKEKGVEEA   + +L+++GF+++DKAV ++LDK
Sbjct: 260  IEFCLEMVEAGHSPNVTTFVGLVDGLCKEKGVEEAVNVIGTLKQKGFLVNDKAVRQFLDK 319

Query: 74   KGPYSPMVWEAILGKKNSQRRF 9
            + P+SP+VWEAI GKK SQ+ F
Sbjct: 320  RAPFSPLVWEAIFGKKTSQKAF 341


>ref|XP_006342489.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like [Solanum tuberosum]
          Length = 354

 Score =  320 bits (819), Expect = 2e-84
 Identities = 168/309 (54%), Positives = 215/309 (69%), Gaps = 9/309 (2%)
 Frame = -1

Query: 911 ENTHQNSPSRMPHRPLRGD-QRPIKREGRDG-DDRFHQKFKGGNLAFGGLRENGDERIDE 738
           ++ +   P  +P+RPLRGD +RP++ + R    D F +  +  +        N       
Sbjct: 53  QSNYPPPPDPIPNRPLRGDSKRPLRDDSRRPLRDDFRRPLRADS-------SNNPTHSTT 105

Query: 737 SLKTG--NGEKERNQSGDTLLDKLNFGYMGSRKSAEETDTNSSTQVPGTEPS-----VPE 579
             ++G  NG + ++Q  +  L +   G+    +  E  +TN +    G          P 
Sbjct: 106 LRRSGENNGGQMKSQDSEDFLKRFQLGF---DRKEENPNTNPALHPKGESSDSPVSEAPP 162

Query: 578 KPPEDADEIFKKMKATGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAV 399
            PPEDADEIFKKMK TGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAV
Sbjct: 163 APPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAV 222

Query: 398 VEGFCKAAKFDDAKRIFRKMQKNGIVPNAFSYTVLIQGLCRGKRLEDAVEFCMEMVDAGH 219
           V+GF KA KFDDA RIFRKMQ NGI+PNAFSY +LI+GL +G RL+DA EFC+EM++AGH
Sbjct: 223 VDGFFKAQKFDDAVRIFRKMQGNGIIPNAFSYGILIRGLSQGNRLDDAFEFCLEMLEAGH 282

Query: 218 SLNAATLTGLVNGFCKEKGVEEAEGFVRSLRERGFVIDDKAVSEYLDKKGPYSPMVWEAI 39
           S N  T   LV+GFCKEK +E+A+  ++++R++GF++DDKAV EYLDKKGP+ P+VWEAI
Sbjct: 283 SPNVVTFVTLVDGFCKEKSLEDAQNMIKTVRQKGFIVDDKAVREYLDKKGPFLPVVWEAI 342

Query: 38  LGKKNSQRR 12
           LGKK SQR+
Sbjct: 343 LGKKASQRQ 351


>ref|XP_009626842.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like [Nicotiana tomentosiformis]
           gi|697098468|ref|XP_009626850.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At4g38150-like [Nicotiana tomentosiformis]
          Length = 336

 Score =  319 bits (818), Expect = 2e-84
 Identities = 165/307 (53%), Positives = 212/307 (69%), Gaps = 4/307 (1%)
 Frame = -1

Query: 920 DNAENTHQNSPSRMPHRPLRGDQR----PIKREGRDGDDRFHQKFKGGNLAFGGLRENGD 753
           ++ ++ +   P  +P+RPLR D R    P +R+     +  H      +  F    EN +
Sbjct: 49  ESTQSNYPPPPDPIPNRPLRADSRRPFNPSQRQRPSSSNPTH------STTFRRPGENNE 102

Query: 752 ERIDESLKTGNGEKERNQSGDTLLDKLNFGYMGSRKSAEETDTNSSTQVPGTEPSVPEKP 573
            +I+             Q     L +   G+   RK        + +  P +E   P  P
Sbjct: 103 NQIE------------CQDSQDFLKRFQLGF--DRKDENPNTNPARSDTPASES--PPAP 146

Query: 572 PEDADEIFKKMKATGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVE 393
           PED+DEIFKKMK TGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIP VVIYTAVV+
Sbjct: 147 PEDSDEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPAVVIYTAVVQ 206

Query: 392 GFCKAAKFDDAKRIFRKMQKNGIVPNAFSYTVLIQGLCRGKRLEDAVEFCMEMVDAGHSL 213
           GFCKA K+DDA RIFRKMQ NGI+PNAFSY+ LI+GLC+GKRLEDA+EFC+EM++AGHS 
Sbjct: 207 GFCKAHKYDDAVRIFRKMQGNGIIPNAFSYSSLIRGLCQGKRLEDALEFCLEMLEAGHSP 266

Query: 212 NAATLTGLVNGFCKEKGVEEAEGFVRSLRERGFVIDDKAVSEYLDKKGPYSPMVWEAILG 33
           N  T   LV+G+CKEK +E+A+  ++++R++GF++D+KAV EYLDKKGP+ P+VWEAILG
Sbjct: 267 NMTTFVDLVDGYCKEKSLEDAQSMIKAVRQKGFILDEKAVREYLDKKGPFLPLVWEAILG 326

Query: 32  KKNSQRR 12
           KK SQR+
Sbjct: 327 KKASQRQ 333


>ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
            [Vitis vinifera]
          Length = 380

 Score =  317 bits (811), Expect = 1e-83
 Identities = 183/391 (46%), Positives = 243/391 (62%), Gaps = 14/391 (3%)
 Frame = -1

Query: 1139 KRMQPRISRALFSKLLRKFSHVPKSPIADRSP---SLPLAHFSTIP-NRPMRGNRRRDDG 972
            + +Q ++S+ +FS  L+   H   S  ++ SP    L L  FS+I  +   RG  RR+D 
Sbjct: 3    RALQGKVSKVVFSDCLKDLLHSSHSSPSNPSPLPLPLLLRRFSSIDASSSTRGASRREDL 62

Query: 971  SEDDFLRNLN-------FGGERDGDNAENTHQNSPSRMPHRPLRGDQRPIKREGRDGDDR 813
            + +  L + +       +G +        +  N P+ +P+RPLRG+QR + R       R
Sbjct: 63   ANNSDLFSPSTEPDDDTYGRKSSSSCGGGSSSNPPNPIPNRPLRGEQR-MNRPPPHIPQR 121

Query: 812  FHQKFKGGNLAFGGLRENGDERIDESL---KTGNGEKERNQSGDTLLDKLNFGYMGSRKS 642
                        G  ++ G +R  ++    +    EK      D  L++   G     + 
Sbjct: 122  ----------KLGLPKDEGVDRASQASPFNQPSPAEKVGATLEDGFLERFKLGVQKKERP 171

Query: 641  AEETDTNSSTQVPGTEPSVPEKPPEDADEIFKKMKATGLIPNAVAMLDGLCKDGLVQEAM 462
             E      S +         E+PP++ADEIF+KMK +GLIPNAVAMLDGLCKDGLVQEAM
Sbjct: 172  QESAAAQPSREQDANHGK--EQPPQNADEIFRKMKESGLIPNAVAMLDGLCKDGLVQEAM 229

Query: 461  KLFGLMREKGTIPEVVIYTAVVEGFCKAAKFDDAKRIFRKMQKNGIVPNAFSYTVLIQGL 282
            KLFGLMREKGTIPEVVIYTAVVEGFCKA + +DA RIFRKMQ NGI PNAFSYTVLI+G+
Sbjct: 230  KLFGLMREKGTIPEVVIYTAVVEGFCKARQLNDAVRIFRKMQNNGISPNAFSYTVLIRGM 289

Query: 281  CRGKRLEDAVEFCMEMVDAGHSLNAATLTGLVNGFCKEKGVEEAEGFVRSLRERGFVIDD 102
             +G RL+ AV+FC+EM++AGHS N ATL  L++ FCKEKGVEEA+  + +L+++G  +DD
Sbjct: 290  YKGNRLDIAVDFCVEMLEAGHSPNVATLVDLIHEFCKEKGVEEAKNVIVTLKQKGLFVDD 349

Query: 101  KAVSEYLDKKGPYSPMVWEAILGKKNSQRRF 9
            KAV EYLDKKGP SP+VWEA  GKK+ QR F
Sbjct: 350  KAVREYLDKKGPQSPLVWEAFFGKKSPQRSF 380


>gb|KCW55702.1| hypothetical protein EUGRSUZ_I01549 [Eucalyptus grandis]
           gi|629089450|gb|KCW55703.1| hypothetical protein
           EUGRSUZ_I01549 [Eucalyptus grandis]
           gi|629089451|gb|KCW55704.1| hypothetical protein
           EUGRSUZ_I01549 [Eucalyptus grandis]
          Length = 349

 Score =  316 bits (810), Expect = 2e-83
 Identities = 178/325 (54%), Positives = 216/325 (66%), Gaps = 1/325 (0%)
 Frame = -1

Query: 980 DDGSEDDFLRNLNFGGERDGDNAENTHQNSP-SRMPHRPLRGDQRPIKREGRDGDDRFHQ 804
           + G ED          +  G N +N ++  P   +P+RPLRG Q+  +  G +G +    
Sbjct: 46  ESGLEDARTDQSQSRSQSPGYNNDNGNETPPPDPIPNRPLRGLQQSQRIIGNNGPN---- 101

Query: 803 KFKGGNLAFGGLRENGDERIDESLKTGNGEKERNQSGDTLLDKLNFGYMGSRKSAEETDT 624
            F+G     G  R+  D+   E  K    ++++ + GD            ++ S EE   
Sbjct: 102 -FRGE----GVRRDPSDDSFLEKFKLSFDKRDKPE-GDV-------ASATTQPSQEENKV 148

Query: 623 NSSTQVPGTEPSVPEKPPEDADEIFKKMKATGLIPNAVAMLDGLCKDGLVQEAMKLFGLM 444
           NS+      +P +PE    DADEIFKKMK TGLIPNAVAMLDGLCKDGLVQEAMKLFGLM
Sbjct: 149 NSNQMANEGQPPLPE----DADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLM 204

Query: 443 REKGTIPEVVIYTAVVEGFCKAAKFDDAKRIFRKMQKNGIVPNAFSYTVLIQGLCRGKRL 264
           REKG+IPEVVIYTAVVEGFCKA KFDDAKRIFRKMQ NGI PNAFS+TVLIQGL R  RL
Sbjct: 205 REKGSIPEVVIYTAVVEGFCKAQKFDDAKRIFRKMQNNGITPNAFSFTVLIQGLYRCDRL 264

Query: 263 EDAVEFCMEMVDAGHSLNAATLTGLVNGFCKEKGVEEAEGFVRSLRERGFVIDDKAVSEY 84
           EDA+EFC EM+DAGHS N  T  GLVNG CK+KGVEEA+  +  LRE+G+ I++KAV E+
Sbjct: 265 EDALEFCQEMIDAGHSPNVMTFVGLVNGVCKQKGVEEAQTVINRLREKGYFINEKAVREF 324

Query: 83  LDKKGPYSPMVWEAILGKKNSQRRF 9
           L+KK P+S MVWEAI GKK S   F
Sbjct: 325 LEKKAPFSSMVWEAIFGKKQSHSLF 349


>emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera]
          Length = 381

 Score =  316 bits (810), Expect = 2e-83
 Identities = 185/392 (47%), Positives = 243/392 (61%), Gaps = 15/392 (3%)
 Frame = -1

Query: 1139 KRMQPRISRALFSKLLRKFSHVPKSPIADRSP---SLPLAHFSTIP-NRPMRGNRRRDD- 975
            + +Q ++S+ +FS  L+   H   S  ++ SP    L L  FS+I  +   RG  RR+D 
Sbjct: 3    RALQGKVSKVVFSDCLKDLLHSSHSSPSNPSPLPLPLLLRRFSSIDASSSTRGASRREDL 62

Query: 974  -GSEDDFLRNLNFGGERDGDNAENT------HQNSPSRMPHRPLRGDQRPIKREGRDGDD 816
              + D F  +     +  G  + ++        N P+ +P+RPLRG+QR + R       
Sbjct: 63   ANNSDLFSPSTEPDDDTYGRKSSSSCGGGGSSSNPPNPIPNRPLRGEQR-MNRPPPHIPQ 121

Query: 815  RFHQKFKGGNLAFGGLRENGDERIDESL---KTGNGEKERNQSGDTLLDKLNFGYMGSRK 645
            R            G  ++ G +R  ++    +    EK      D  L++   G     +
Sbjct: 122  R----------KLGLPKDEGVDRASQASPFNQPSPAEKVGATLEDGFLERFKLGVQKKER 171

Query: 644  SAEETDTNSSTQVPGTEPSVPEKPPEDADEIFKKMKATGLIPNAVAMLDGLCKDGLVQEA 465
              E      S +         E+PP++ADEIF+KMK +GLIPNAVAMLDGLCKDGLVQEA
Sbjct: 172  PQESAAAQPSREQDANHGK--EQPPQNADEIFRKMKESGLIPNAVAMLDGLCKDGLVQEA 229

Query: 464  MKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFDDAKRIFRKMQKNGIVPNAFSYTVLIQG 285
            MKLFGLMREKGTIPEVVIYTAVVEGFCKA + DDA RIFRKMQ NGI PNAFSYTVLI+G
Sbjct: 230  MKLFGLMREKGTIPEVVIYTAVVEGFCKARQLDDAVRIFRKMQNNGISPNAFSYTVLIRG 289

Query: 284  LCRGKRLEDAVEFCMEMVDAGHSLNAATLTGLVNGFCKEKGVEEAEGFVRSLRERGFVID 105
            + +G RL+ AV+FC+EM++AGHS N ATL  L++ FCKEKGVEEA+  + +L+++G  +D
Sbjct: 290  MYKGNRLDIAVDFCVEMLEAGHSPNVATLVDLIHEFCKEKGVEEAKNVIVTLKQKGLFVD 349

Query: 104  DKAVSEYLDKKGPYSPMVWEAILGKKNSQRRF 9
            DKAV EYLDKKGP SP+VWEA  GKK+ QR F
Sbjct: 350  DKAVREYLDKKGPQSPLVWEAFFGKKSPQRSF 381


>ref|XP_010030710.1| PREDICTED: pentatricopeptide repeat-containing protein At5g48910
           [Eucalyptus grandis]
          Length = 1024

 Score =  315 bits (807), Expect = 4e-83
 Identities = 177/321 (55%), Positives = 215/321 (66%), Gaps = 1/321 (0%)
 Frame = -1

Query: 980 DDGSEDDFLRNLNFGGERDGDNAENTHQNSP-SRMPHRPLRGDQRPIKREGRDGDDRFHQ 804
           + G ED          +  G N +N ++  P   +P+RPLRG Q+  +  G +G +    
Sbjct: 46  ESGLEDARTDQSQSRSQSPGYNNDNGNETPPPDPIPNRPLRGLQQSQRIIGNNGPN---- 101

Query: 803 KFKGGNLAFGGLRENGDERIDESLKTGNGEKERNQSGDTLLDKLNFGYMGSRKSAEETDT 624
            F+G     G  R+  D+   E  K    ++++ + GD            ++ S EE   
Sbjct: 102 -FRGE----GVRRDPSDDSFLEKFKLSFDKRDKPE-GDV-------ASATTQPSQEENKV 148

Query: 623 NSSTQVPGTEPSVPEKPPEDADEIFKKMKATGLIPNAVAMLDGLCKDGLVQEAMKLFGLM 444
           NS+      +P +PE    DADEIFKKMK TGLIPNAVAMLDGLCKDGLVQEAMKLFGLM
Sbjct: 149 NSNQMANEGQPPLPE----DADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLM 204

Query: 443 REKGTIPEVVIYTAVVEGFCKAAKFDDAKRIFRKMQKNGIVPNAFSYTVLIQGLCRGKRL 264
           REKG+IPEVVIYTAVVEGFCKA KFDDAKRIFRKMQ NGI PNAFS+TVLIQGL R  RL
Sbjct: 205 REKGSIPEVVIYTAVVEGFCKAQKFDDAKRIFRKMQNNGITPNAFSFTVLIQGLYRCDRL 264

Query: 263 EDAVEFCMEMVDAGHSLNAATLTGLVNGFCKEKGVEEAEGFVRSLRERGFVIDDKAVSEY 84
           EDA+EFC EM+DAGHS N  T  GLVNG CK+KGVEEA+  +  LRE+G+ I++KAV E+
Sbjct: 265 EDALEFCQEMIDAGHSPNVMTFVGLVNGVCKQKGVEEAQTVINRLREKGYFINEKAVREF 324

Query: 83  LDKKGPYSPMVWEAILGKKNS 21
           L+KK P+S MVWEAI GKK S
Sbjct: 325 LEKKAPFSSMVWEAIFGKKQS 345


>ref|XP_006573589.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X3 [Glycine max] gi|571435834|ref|XP_006573590.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g38150-like isoform X4 [Glycine max]
          Length = 403

 Score =  313 bits (803), Expect = 1e-82
 Identities = 185/364 (50%), Positives = 232/364 (63%), Gaps = 24/364 (6%)
 Frame = -1

Query: 1028 HFSTIPNRPMRGNRRRDDGSEDDFLRNLNFGGERD-GDN----AENTHQNSPSRMPHRPL 864
            HFS   +R   G  ++  G  DDF R  +    +D G N    + N  Q+    +P RPL
Sbjct: 48   HFSFTDDRS--GRSKQPVGESDDFFREQSDSSFKDNGSNRTQESYNVEQSLSEPIPSRPL 105

Query: 863  RGDQRPIK------REGRDGDDRFHQKFKGGNLAFGGLRE----NGDERIDESLKTGNGE 714
            RG ++PI       RE   G   F  +F   +   GG  E    N   +ID + +     
Sbjct: 106  RG-KKPINQPPPRFREYDRGSHSFPPRFDDNH---GGPDELDKINKSSQIDLAFQGTTNV 161

Query: 713  KERNQ----SGDTLLDKLNFGYMGSRKSAEETDTNSSTQVPGTEPSVPEKP-----PEDA 561
             E N+    SG + LDK   G+    K+   ++  +S Q    + S P +P     P+DA
Sbjct: 162  AETNRDVGKSGGSFLDKFKLGF--DDKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDA 219

Query: 560  DEIFKKMKATGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCK 381
            +EIFKKMK TGLIPNAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+VIYTAVVEG+ K
Sbjct: 220  NEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTK 279

Query: 380  AAKFDDAKRIFRKMQKNGIVPNAFSYTVLIQGLCRGKRLEDAVEFCMEMVDAGHSLNAAT 201
            A K DDAKRIFRKMQ +GI PNAFSYTVLIQGL +  RL DA EFC+EM++AGHS N   
Sbjct: 280  AHKADDAKRIFRKMQSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTA 339

Query: 200  LTGLVNGFCKEKGVEEAEGFVRSLRERGFVIDDKAVSEYLDKKGPYSPMVWEAILGKKNS 21
              GLV+GFC EKGVEEA+  +++L E+GFV+++KAV ++LDKK P+SP VWEAI GKK  
Sbjct: 340  FVGLVDGFCNEKGVEEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGKKAP 399

Query: 20   QRRF 9
            QR F
Sbjct: 400  QRPF 403


>ref|XP_006573588.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X2 [Glycine max]
          Length = 431

 Score =  313 bits (803), Expect = 1e-82
 Identities = 185/364 (50%), Positives = 232/364 (63%), Gaps = 24/364 (6%)
 Frame = -1

Query: 1028 HFSTIPNRPMRGNRRRDDGSEDDFLRNLNFGGERD-GDN----AENTHQNSPSRMPHRPL 864
            HFS   +R   G  ++  G  DDF R  +    +D G N    + N  Q+    +P RPL
Sbjct: 76   HFSFTDDRS--GRSKQPVGESDDFFREQSDSSFKDNGSNRTQESYNVEQSLSEPIPSRPL 133

Query: 863  RGDQRPIK------REGRDGDDRFHQKFKGGNLAFGGLRE----NGDERIDESLKTGNGE 714
            RG ++PI       RE   G   F  +F   +   GG  E    N   +ID + +     
Sbjct: 134  RG-KKPINQPPPRFREYDRGSHSFPPRFDDNH---GGPDELDKINKSSQIDLAFQGTTNV 189

Query: 713  KERNQ----SGDTLLDKLNFGYMGSRKSAEETDTNSSTQVPGTEPSVPEKP-----PEDA 561
             E N+    SG + LDK   G+    K+   ++  +S Q    + S P +P     P+DA
Sbjct: 190  AETNRDVGKSGGSFLDKFKLGF--DDKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDA 247

Query: 560  DEIFKKMKATGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCK 381
            +EIFKKMK TGLIPNAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+VIYTAVVEG+ K
Sbjct: 248  NEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTK 307

Query: 380  AAKFDDAKRIFRKMQKNGIVPNAFSYTVLIQGLCRGKRLEDAVEFCMEMVDAGHSLNAAT 201
            A K DDAKRIFRKMQ +GI PNAFSYTVLIQGL +  RL DA EFC+EM++AGHS N   
Sbjct: 308  AHKADDAKRIFRKMQSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTA 367

Query: 200  LTGLVNGFCKEKGVEEAEGFVRSLRERGFVIDDKAVSEYLDKKGPYSPMVWEAILGKKNS 21
              GLV+GFC EKGVEEA+  +++L E+GFV+++KAV ++LDKK P+SP VWEAI GKK  
Sbjct: 368  FVGLVDGFCNEKGVEEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGKKAP 427

Query: 20   QRRF 9
            QR F
Sbjct: 428  QRPF 431


>ref|XP_003516576.2| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X1 [Glycine max] gi|734411753|gb|KHN36139.1|
            Pentatricopeptide repeat-containing protein [Glycine
            soja]
          Length = 457

 Score =  313 bits (803), Expect = 1e-82
 Identities = 185/364 (50%), Positives = 232/364 (63%), Gaps = 24/364 (6%)
 Frame = -1

Query: 1028 HFSTIPNRPMRGNRRRDDGSEDDFLRNLNFGGERD-GDN----AENTHQNSPSRMPHRPL 864
            HFS   +R   G  ++  G  DDF R  +    +D G N    + N  Q+    +P RPL
Sbjct: 102  HFSFTDDRS--GRSKQPVGESDDFFREQSDSSFKDNGSNRTQESYNVEQSLSEPIPSRPL 159

Query: 863  RGDQRPIK------REGRDGDDRFHQKFKGGNLAFGGLRE----NGDERIDESLKTGNGE 714
            RG ++PI       RE   G   F  +F   +   GG  E    N   +ID + +     
Sbjct: 160  RG-KKPINQPPPRFREYDRGSHSFPPRFDDNH---GGPDELDKINKSSQIDLAFQGTTNV 215

Query: 713  KERNQ----SGDTLLDKLNFGYMGSRKSAEETDTNSSTQVPGTEPSVPEKP-----PEDA 561
             E N+    SG + LDK   G+    K+   ++  +S Q    + S P +P     P+DA
Sbjct: 216  AETNRDVGKSGGSFLDKFKLGF--DDKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDA 273

Query: 560  DEIFKKMKATGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCK 381
            +EIFKKMK TGLIPNAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+VIYTAVVEG+ K
Sbjct: 274  NEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTK 333

Query: 380  AAKFDDAKRIFRKMQKNGIVPNAFSYTVLIQGLCRGKRLEDAVEFCMEMVDAGHSLNAAT 201
            A K DDAKRIFRKMQ +GI PNAFSYTVLIQGL +  RL DA EFC+EM++AGHS N   
Sbjct: 334  AHKADDAKRIFRKMQSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTA 393

Query: 200  LTGLVNGFCKEKGVEEAEGFVRSLRERGFVIDDKAVSEYLDKKGPYSPMVWEAILGKKNS 21
              GLV+GFC EKGVEEA+  +++L E+GFV+++KAV ++LDKK P+SP VWEAI GKK  
Sbjct: 394  FVGLVDGFCNEKGVEEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGKKAP 453

Query: 20   QRRF 9
            QR F
Sbjct: 454  QRPF 457


>ref|XP_007156913.1| hypothetical protein PHAVU_002G027800g [Phaseolus vulgaris]
            gi|593787750|ref|XP_007156914.1| hypothetical protein
            PHAVU_002G027800g [Phaseolus vulgaris]
            gi|561030328|gb|ESW28907.1| hypothetical protein
            PHAVU_002G027800g [Phaseolus vulgaris]
            gi|561030329|gb|ESW28908.1| hypothetical protein
            PHAVU_002G027800g [Phaseolus vulgaris]
          Length = 451

 Score =  313 bits (803), Expect = 1e-82
 Identities = 190/399 (47%), Positives = 242/399 (60%), Gaps = 17/399 (4%)
 Frame = -1

Query: 1154 RSIGKKRMQPRISRALFSKLLRKFSHVPKSPIADRSPSLPLAHFSTIPNRPMRGNRRRDD 975
            R + K     RI +   + LLR   H+P  P  +      + HFS   +    G  ++  
Sbjct: 66   RGVHKLVSSSRIEK--LASLLRSKQHLP--PWVET-----VRHFSFADD--FSGRSKQYA 114

Query: 974  GSEDDFLRNLNFGG-ERDGDNAE----NTHQNSPSRMPHRPLRGDQ---RPIKREGRDGD 819
               DDFLR  +    E +G N      N  Q S   +P RPLRG +   +P  R    G 
Sbjct: 115  WERDDFLRQQSDSSFEDNGSNRTHEEYNVEQGSSESIPSRPLRGRKPINQPPPRFRESGR 174

Query: 818  DRFHQKFKGGNLAFGGL-RENGDERID---ESLKTGNGEKERNQSGDTLLDKLNFGYMGS 651
              F   F   +     L R N   +ID   + +   +  ++  QSGD+ LDK    +   
Sbjct: 175  GSFPPTFDDNHRGPDALDRTNKSSKIDLAFQGMNVADTNRDFEQSGDSFLDKFKLAF--D 232

Query: 650  RKSAEETDTNSSTQVPGTEPSVPEKP-----PEDADEIFKKMKATGLIPNAVAMLDGLCK 486
             K+   ++  +S Q    + S P++      P+DADEIFKKMK TGLIPNAVAMLDGLCK
Sbjct: 233  DKTVNLSEVAASKQSEEAKRSNPDQQAQEPVPQDADEIFKKMKETGLIPNAVAMLDGLCK 292

Query: 485  DGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFDDAKRIFRKMQKNGIVPNAFS 306
            DGLVQEA+KLF LMREKGTIPE+VIYTAVVEG+ KA K DDAKRIFRKMQ +GI PNAFS
Sbjct: 293  DGLVQEALKLFALMREKGTIPEIVIYTAVVEGYTKADKADDAKRIFRKMQSSGISPNAFS 352

Query: 305  YTVLIQGLCRGKRLEDAVEFCMEMVDAGHSLNAATLTGLVNGFCKEKGVEEAEGFVRSLR 126
            YTV++QGL + +RL+DA EFC+EM++AGHS N  T   LV+GFCKEKGVEEA+  V++L 
Sbjct: 353  YTVIVQGLYKCRRLQDAFEFCVEMLEAGHSPNVTTFVSLVDGFCKEKGVEEAKDAVKTLT 412

Query: 125  ERGFVIDDKAVSEYLDKKGPYSPMVWEAILGKKNSQRRF 9
             +GF  D+KAV ++LDKK P+SP VWEAI GKK  QR F
Sbjct: 413  GKGFAFDEKAVRQFLDKKTPFSPSVWEAIFGKKAPQRPF 451


Top