BLASTX nr result

ID: Akebia25_contig00040687 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00040687
         (830 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI22109.3| unnamed protein product [Vitis vinifera]              294   3e-77
ref|XP_002279448.1| PREDICTED: pentatricopeptide repeat-containi...   294   3e-77
ref|XP_006469838.1| PREDICTED: pentatricopeptide repeat-containi...   269   8e-70
ref|XP_006447383.1| hypothetical protein CICLE_v10016159mg [Citr...   269   8e-70
ref|XP_002533283.1| pentatricopeptide repeat-containing protein,...   267   3e-69
ref|XP_002313272.1| hypothetical protein POPTR_0009s07170g [Popu...   265   2e-68
ref|XP_002299968.2| hypothetical protein POPTR_0001s27990g, part...   263   8e-68
ref|XP_007043483.1| Tetratricopeptide repeat (TPR)-like superfam...   262   1e-67
ref|XP_007043481.1| Tetratricopeptide repeat-like superfamily pr...   262   1e-67
ref|XP_006357531.1| PREDICTED: pentatricopeptide repeat-containi...   253   5e-65
ref|XP_004243315.1| PREDICTED: pentatricopeptide repeat-containi...   248   3e-63
ref|XP_006842606.1| hypothetical protein AMTR_s00077p00171190 [A...   238   3e-60
ref|XP_006594863.1| PREDICTED: pentatricopeptide repeat-containi...   229   1e-57
ref|XP_003543355.1| PREDICTED: pentatricopeptide repeat-containi...   229   1e-57
ref|XP_002439972.1| hypothetical protein SORBIDRAFT_09g023670 [S...   220   4e-55
ref|XP_007149892.1| hypothetical protein PHAVU_005G107600g [Phas...   219   1e-54
ref|NP_001143211.1| uncharacterized protein LOC100275714 [Zea ma...   217   4e-54
ref|XP_006417295.1| hypothetical protein EUTSA_v10007999mg [Eutr...   216   6e-54
gb|ACR38556.1| unknown [Zea mays] gi|413945770|gb|AFW78419.1| hy...   216   1e-53
ref|XP_006654552.1| PREDICTED: pentatricopeptide repeat-containi...   215   1e-53

>emb|CBI22109.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  294 bits (752), Expect = 3e-77
 Identities = 144/214 (67%), Positives = 180/214 (84%)
 Frame = +2

Query: 5   LAKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLK 184
           +AK      DSV L KFV ++SEL FPR ATI+N+II  FAE RQI+K+L+IFD+MK+LK
Sbjct: 174 VAKVFTKTDDSV-LLKFVREVSELTFPRNATILNRIIHAFAECRQIEKSLIIFDHMKSLK 232

Query: 185 CKPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCL 364
           CKPD++TYNT+L  LGRAGR+D MLHEF+SMK AN+APDIISYNT++N+L+K+GRLDLCL
Sbjct: 233 CKPDLITYNTVLGFLGRAGRLDEMLHEFSSMKVANIAPDIISYNTLLNSLQKVGRLDLCL 292

Query: 365 VFLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNL 544
           VF  EMGE  L+PDLRTY ALIE FG+SG +EEAL LF E+K+  I PS+YIYR+LI+  
Sbjct: 293 VFFREMGENGLKPDLRTYRALIEGFGQSGNLEEALRLFSEMKQGQICPSIYIYRSLINYS 352

Query: 545 KKVGKLELAISLLEEMNSCISDLIGPKDFKRRNR 646
           KK+GK+ELA+SL EEMN+C+ DLIGPKDFK++NR
Sbjct: 353 KKMGKVELAMSLSEEMNACLPDLIGPKDFKQKNR 386


>ref|XP_002279448.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g11900-like [Vitis vinifera]
          Length = 357

 Score =  294 bits (752), Expect = 3e-77
 Identities = 144/214 (67%), Positives = 180/214 (84%)
 Frame = +2

Query: 5   LAKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLK 184
           +AK      DSV L KFV ++SEL FPR ATI+N+II  FAE RQI+K+L+IFD+MK+LK
Sbjct: 145 VAKVFTKTDDSV-LLKFVREVSELTFPRNATILNRIIHAFAECRQIEKSLIIFDHMKSLK 203

Query: 185 CKPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCL 364
           CKPD++TYNT+L  LGRAGR+D MLHEF+SMK AN+APDIISYNT++N+L+K+GRLDLCL
Sbjct: 204 CKPDLITYNTVLGFLGRAGRLDEMLHEFSSMKVANIAPDIISYNTLLNSLQKVGRLDLCL 263

Query: 365 VFLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNL 544
           VF  EMGE  L+PDLRTY ALIE FG+SG +EEAL LF E+K+  I PS+YIYR+LI+  
Sbjct: 264 VFFREMGENGLKPDLRTYRALIEGFGQSGNLEEALRLFSEMKQGQICPSIYIYRSLINYS 323

Query: 545 KKVGKLELAISLLEEMNSCISDLIGPKDFKRRNR 646
           KK+GK+ELA+SL EEMN+C+ DLIGPKDFK++NR
Sbjct: 324 KKMGKVELAMSLSEEMNACLPDLIGPKDFKQKNR 357


>ref|XP_006469838.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g11900-like isoform X1 [Citrus sinensis]
          Length = 390

 Score =  269 bits (688), Expect = 8e-70
 Identities = 124/213 (58%), Positives = 172/213 (80%)
 Frame = +2

Query: 8   AKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLKC 187
           A+A +   D   L  F+ ++ ++  P    ++N+II  FA+SRQI+KAL+IFD++K LKC
Sbjct: 178 ARAFIMTDDCTQLLIFIEEVVQIASPESIIVVNRIIFAFAKSRQIEKALLIFDHIKGLKC 237

Query: 188 KPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCLV 367
           KPD++TYN +LDILGR GRV+ ML+EFASMKEA + PD ISYNT++NNLRK+ RLDLCL+
Sbjct: 238 KPDLITYNIVLDILGRVGRVNDMLNEFASMKEAGVVPDFISYNTLLNNLRKIRRLDLCLI 297

Query: 368 FLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNLK 547
           +  EMGE  ++PDL TYTALI+SFGR+G +EE+L LF+++K++ I PS+Y+YR+LI NLK
Sbjct: 298 YFREMGESGIKPDLLTYTALIDSFGRTGNIEESLRLFNDMKQQQIRPSIYVYRSLIDNLK 357

Query: 548 KVGKLELAISLLEEMNSCISDLIGPKDFKRRNR 646
           K+GK++LA+++ EEMNS +SDL GPKDFKR+ R
Sbjct: 358 KMGKVDLAMTIFEEMNSSLSDLAGPKDFKRKAR 390


>ref|XP_006447383.1| hypothetical protein CICLE_v10016159mg [Citrus clementina]
           gi|557549994|gb|ESR60623.1| hypothetical protein
           CICLE_v10016159mg [Citrus clementina]
          Length = 287

 Score =  269 bits (688), Expect = 8e-70
 Identities = 124/213 (58%), Positives = 172/213 (80%)
 Frame = +2

Query: 8   AKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLKC 187
           A+A +   D   L  F+ ++ ++  P    ++N+II  FA+SRQI+KAL+IFD++K LKC
Sbjct: 75  ARAFIMTDDCTQLLIFIEEVVQIASPESIIVVNRIIFAFAKSRQIEKALLIFDHIKGLKC 134

Query: 188 KPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCLV 367
           KPD++TYN +LDILGR GRV+ ML+EFASMKEA + PD ISYNT++NNLRK+ RLDLCL+
Sbjct: 135 KPDLITYNIVLDILGRVGRVNDMLNEFASMKEAGVVPDFISYNTLLNNLRKIRRLDLCLI 194

Query: 368 FLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNLK 547
           +  EMGE  ++PDL TYTALI+SFGR+G +EE+L LF+++K++ I PS+Y+YR+LI NLK
Sbjct: 195 YFREMGESGIKPDLLTYTALIDSFGRTGNIEESLRLFNDMKQQQIRPSIYVYRSLIDNLK 254

Query: 548 KVGKLELAISLLEEMNSCISDLIGPKDFKRRNR 646
           K+GK++LA+++ EEMNS +SDL GPKDFKR+ R
Sbjct: 255 KMGKVDLAMTIFEEMNSSLSDLAGPKDFKRKAR 287


>ref|XP_002533283.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223526886|gb|EEF29094.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 405

 Score =  267 bits (683), Expect = 3e-69
 Identities = 129/214 (60%), Positives = 168/214 (78%)
 Frame = +2

Query: 5   LAKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLK 184
           LA+  +N  D V L KFV ++ EL FPR   +IN+II  FAE RQ DKAL+IFD +K+L+
Sbjct: 192 LARGFINTNDHVLLMKFVKEVLELAFPRSMVVINRIIFAFAECRQFDKALLIFDQIKDLE 251

Query: 185 CKPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCL 364
            KPD++TYN +L ILGRAGRVD ML+EF+SMKEA + PD I YNT++N L+K GRLDLCL
Sbjct: 252 YKPDLITYNMVLHILGRAGRVDEMLYEFSSMKEAGIVPDFICYNTLLNQLQKAGRLDLCL 311

Query: 365 VFLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNL 544
           V++ EMGE  +E DL TYTALI+SFG+SG +EE+L LFD++K + I PS+YIYR+LI+  
Sbjct: 312 VYIREMGESGIEADLLTYTALIQSFGKSGHIEESLRLFDDMKTRQIRPSIYIYRSLINTA 371

Query: 545 KKVGKLELAISLLEEMNSCISDLIGPKDFKRRNR 646
           KK+GK+ELA++LLEEMN+   +L GP DFKR+ R
Sbjct: 372 KKMGKVELAMTLLEEMNASPPNLAGPNDFKRKRR 405


>ref|XP_002313272.1| hypothetical protein POPTR_0009s07170g [Populus trichocarpa]
           gi|222849680|gb|EEE87227.1| hypothetical protein
           POPTR_0009s07170g [Populus trichocarpa]
          Length = 382

 Score =  265 bits (676), Expect = 2e-68
 Identities = 127/214 (59%), Positives = 168/214 (78%)
 Frame = +2

Query: 5   LAKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLK 184
           LA+  + + D V L + V ++SEL FP    ++N+ I  FAE  Q DKA++IF+ M+NLK
Sbjct: 169 LARGFVKSNDDVQLLRLVKEVSELTFPSSTKVVNRFIFAFAECGQFDKAILIFEQMENLK 228

Query: 185 CKPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCL 364
           CKPD+VTYNT+LD+LGRAGR+D ML EFASMKEA + PD ISYNT++N L K+GRLDLC 
Sbjct: 229 CKPDLVTYNTVLDLLGRAGRIDEMLGEFASMKEAGILPDFISYNTLLNQLTKVGRLDLCS 288

Query: 365 VFLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNL 544
           V+  +M    +EPDL TYTALI SFG+SG +EE+L LF+E+K K I PS+YIYR+LI++L
Sbjct: 289 VYFRDMVGNGIEPDLLTYTALIWSFGQSGNIEESLRLFNEMKTKQIRPSIYIYRSLIASL 348

Query: 545 KKVGKLELAISLLEEMNSCISDLIGPKDFKRRNR 646
           KK+GK+ELA++ LEEMN+ +S+L GPKDFKR +R
Sbjct: 349 KKMGKIELAMTFLEEMNASMSNLAGPKDFKRTHR 382


>ref|XP_002299968.2| hypothetical protein POPTR_0001s27990g, partial [Populus
           trichocarpa] gi|550348348|gb|EEE84773.2| hypothetical
           protein POPTR_0001s27990g, partial [Populus trichocarpa]
          Length = 348

 Score =  263 bits (671), Expect = 8e-68
 Identities = 128/214 (59%), Positives = 167/214 (78%)
 Frame = +2

Query: 5   LAKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLK 184
           LA+  +   D V L + V ++SE+ FP    ++N+II  FAE  Q DKAL+IF  M+NLK
Sbjct: 135 LARGFVKTNDDVQLLRLVKEVSEMTFPSSMMVVNRIIFAFAECGQFDKALLIFKQMENLK 194

Query: 185 CKPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCL 364
           CKPD+VTYNT+LD+LG AGR+D ML EFASMKEA + PD ISYNT++N LRK+GRLDLC 
Sbjct: 195 CKPDLVTYNTVLDLLGHAGRIDEMLCEFASMKEAGILPDFISYNTLLNQLRKVGRLDLCS 254

Query: 365 VFLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNL 544
           V+  +M E  +EPDL TYTALI SFG+SG +EE+L LF+E+K K I PS+YIYR+LI++L
Sbjct: 255 VYSRDMVESGIEPDLLTYTALIGSFGQSGNIEESLRLFNEMKTKQIRPSIYIYRSLIASL 314

Query: 545 KKVGKLELAISLLEEMNSCISDLIGPKDFKRRNR 646
           KK+GK+ELA++LLEEMN+ +S+L G KDFKR  +
Sbjct: 315 KKMGKVELAMTLLEEMNASMSNLAGHKDFKRTRK 348


>ref|XP_007043483.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
           isoform 3 [Theobroma cacao] gi|508707418|gb|EOX99314.1|
           Tetratricopeptide repeat (TPR)-like superfamily protein,
           putative isoform 3 [Theobroma cacao]
          Length = 272

 Score =  262 bits (669), Expect = 1e-67
 Identities = 128/215 (59%), Positives = 170/215 (79%), Gaps = 1/215 (0%)
 Frame = +2

Query: 5   LAKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLK 184
           LA++ + + D   L +FV ++SEL FP   T+IN+II  FAE  QI+KAL++F+ +K+  
Sbjct: 58  LARSFVKSNDCTALIRFVKQVSELAFPSSTTVINRIILAFAECWQIEKALLVFNQIKSFG 117

Query: 185 CKPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCL 364
           CKPD++TYNTILDILGRAGRVD M+HEFASMKEA + PDII+YNT++NNLRK+GRLD+CL
Sbjct: 118 CKPDVITYNTILDILGRAGRVDEMVHEFASMKEAGLVPDIITYNTLLNNLRKLGRLDMCL 177

Query: 365 VFLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNL 544
           VF  EM +  +EPDL TY A+IE+FGRSG + EAL LF E+K++ I PSVYIYR+LI  L
Sbjct: 178 VFFREMSDTGVEPDLLTYRAMIETFGRSGNINEALRLFREMKQRQIYPSVYIYRSLICIL 237

Query: 545 KKVGKLELAISLLEEMN-SCISDLIGPKDFKRRNR 646
           KK GK++LA+S  EEMN S +SD+   ++FKR++R
Sbjct: 238 KKAGKVDLAMSFSEEMNSSSLSDIAATENFKRKHR 272


>ref|XP_007043481.1| Tetratricopeptide repeat-like superfamily protein, putative isoform
           1 [Theobroma cacao] gi|590690339|ref|XP_007043482.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508707416|gb|EOX99312.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508707417|gb|EOX99313.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
          Length = 387

 Score =  262 bits (669), Expect = 1e-67
 Identities = 128/215 (59%), Positives = 170/215 (79%), Gaps = 1/215 (0%)
 Frame = +2

Query: 5   LAKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLK 184
           LA++ + + D   L +FV ++SEL FP   T+IN+II  FAE  QI+KAL++F+ +K+  
Sbjct: 173 LARSFVKSNDCTALIRFVKQVSELAFPSSTTVINRIILAFAECWQIEKALLVFNQIKSFG 232

Query: 185 CKPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCL 364
           CKPD++TYNTILDILGRAGRVD M+HEFASMKEA + PDII+YNT++NNLRK+GRLD+CL
Sbjct: 233 CKPDVITYNTILDILGRAGRVDEMVHEFASMKEAGLVPDIITYNTLLNNLRKLGRLDMCL 292

Query: 365 VFLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNL 544
           VF  EM +  +EPDL TY A+IE+FGRSG + EAL LF E+K++ I PSVYIYR+LI  L
Sbjct: 293 VFFREMSDTGVEPDLLTYRAMIETFGRSGNINEALRLFREMKQRQIYPSVYIYRSLICIL 352

Query: 545 KKVGKLELAISLLEEMN-SCISDLIGPKDFKRRNR 646
           KK GK++LA+S  EEMN S +SD+   ++FKR++R
Sbjct: 353 KKAGKVDLAMSFSEEMNSSSLSDIAATENFKRKHR 387


>ref|XP_006357531.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g11900-like isoform X1 [Solanum tuberosum]
          Length = 403

 Score =  253 bits (647), Expect = 5e-65
 Identities = 125/215 (58%), Positives = 164/215 (76%)
 Frame = +2

Query: 2   ILAKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNL 181
           I A+A +   D   L +FV +MSELIFP   T++N+II  FAE  QIDKAL+IFD MK+L
Sbjct: 189 IFAQAFIKENDVPCLLRFVREMSELIFPSSTTVMNRIIFAFAECGQIDKALLIFDQMKSL 248

Query: 182 KCKPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLC 361
           K KPD++TYNTIL ILG+ GR+D ML+EF +MKE  + PDI+SYNT+I  LRK+GRL+ C
Sbjct: 249 KSKPDVITYNTILGILGKCGRIDEMLNEFLAMKEDGLIPDIVSYNTLITGLRKVGRLESC 308

Query: 362 LVFLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISN 541
           LVF  EM E+ +EPDLRTY+ALI+SFG+SG +EE+L LF+E+K KGI PS+++Y+ LISN
Sbjct: 309 LVFFREMCEREIEPDLRTYSALIDSFGKSGNIEESLRLFNEMKHKGICPSIHVYKLLISN 368

Query: 542 LKKVGKLELAISLLEEMNSCISDLIGPKDFKRRNR 646
           LKK+GK ELAI+   EM   IS+  G    +++NR
Sbjct: 369 LKKMGKFELAIAFSNEMKESISNNHGSNYNRQKNR 403



 Score = 65.5 bits (158), Expect = 2e-08
 Identities = 44/169 (26%), Positives = 72/169 (42%), Gaps = 1/169 (0%)
 Frame = +2

Query: 89  RATIINKIISGFAESRQIDKALMIFDYMKNLKCKP-DIVTYNTILDILGRAGRVDSMLHE 265
           R +  N+++    E   I      F  +  + CK  +  TY        +   V  +L  
Sbjct: 148 RPSAHNRLLKAAIEENDIGLLCQCFKDLL-VSCKSLNSSTYLIFAQAFIKENDVPCLLRF 206

Query: 266 FASMKEANMAPDIISYNTIINNLRKMGRLDLCLVFLHEMGEKNLEPDLRTYTALIESFGR 445
              M E          N II    + G++D  L+   +M     +PD+ TY  ++   G+
Sbjct: 207 VREMSELIFPSSTTVMNRIIFAFAECGQIDKALLIFDQMKSLKSKPDVITYNTILGILGK 266

Query: 446 SGRVEEALGLFDELKRKGISPSVYIYRALISNLKKVGKLELAISLLEEM 592
            GR++E L  F  +K  G+ P +  Y  LI+ L+KVG+LE  +    EM
Sbjct: 267 CGRIDEMLNEFLAMKEDGLIPDIVSYNTLITGLRKVGRLESCLVFFREM 315


>ref|XP_004243315.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g11900-like [Solanum lycopersicum]
          Length = 403

 Score =  248 bits (632), Expect = 3e-63
 Identities = 118/215 (54%), Positives = 164/215 (76%)
 Frame = +2

Query: 2   ILAKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNL 181
           I A+A +   D   L +FV ++SELIFP    ++N+II  FAE  Q+DK+L+IFD MK+L
Sbjct: 189 IFAQAFIKENDVACLLRFVREISELIFPSSTPVMNRIIFAFAECGQLDKSLLIFDQMKSL 248

Query: 182 KCKPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLC 361
           K KPD++TYNTIL +LG+ GR+D ML++F +MKE  + PDI+SYNT+I  LRK+GRL+LC
Sbjct: 249 KSKPDVITYNTILGLLGKCGRIDEMLNQFVAMKEDGLIPDIVSYNTLITGLRKVGRLELC 308

Query: 362 LVFLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISN 541
           LVF  EM E+ +EPDLRTY+ALI+SFG+SG +EE+L LF+E+K +GI PS+++Y+ LISN
Sbjct: 309 LVFFREMCEREIEPDLRTYSALIDSFGKSGNIEESLRLFNEMKHRGICPSIHVYKLLISN 368

Query: 542 LKKVGKLELAISLLEEMNSCISDLIGPKDFKRRNR 646
           LKK+GK ELAI+   EM   +S+  G    +++NR
Sbjct: 369 LKKMGKFELAIAFSNEMKESVSNHRGSNYNRQKNR 403


>ref|XP_006842606.1| hypothetical protein AMTR_s00077p00171190 [Amborella trichopoda]
           gi|548844692|gb|ERN04281.1| hypothetical protein
           AMTR_s00077p00171190 [Amborella trichopoda]
          Length = 308

 Score =  238 bits (606), Expect = 3e-60
 Identities = 117/201 (58%), Positives = 151/201 (75%)
 Frame = +2

Query: 44  LQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLKCKPDIVTYNTILD 223
           L K   ++SE+ FPR A ++N+II  FAE+ Q  KAL+IF+ MKN KCKPD +TYNT++ 
Sbjct: 108 LLKLSRELSEITFPRSALVMNRIIYAFAETGQNKKALLIFEDMKNAKCKPDQITYNTVIA 167

Query: 224 ILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCLVFLHEMGEKNLEP 403
           ILG+ G++D ML EF+SMKE+   PDII+YNT+IN+ R+MGRLDLC+  + EM    +EP
Sbjct: 168 ILGKMGKIDGMLQEFSSMKESGHLPDIITYNTLINSFRQMGRLDLCINLMREMVRNGIEP 227

Query: 404 DLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNLKKVGKLELAISLL 583
           DLR+YTA+I+  GR G+V+EAL LF E+  K   PSVYIYR+LISNLKK GK ELA  L 
Sbjct: 228 DLRSYTAMIDCLGRVGQVDEALELFSEMTNKRQKPSVYIYRSLISNLKKAGKWELAKRLS 287

Query: 584 EEMNSCISDLIGPKDFKRRNR 646
           EE  SC ++LIGP DFKR+ R
Sbjct: 288 EEHRSCSANLIGPSDFKRKKR 308


>ref|XP_006594863.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g11900-like isoform X2 [Glycine max]
          Length = 329

 Score =  229 bits (583), Expect = 1e-57
 Identities = 116/213 (54%), Positives = 150/213 (70%)
 Frame = +2

Query: 8   AKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLKC 187
           A+A     D V L +F+ ++SE+     ++ INKII  FA+  Q DK+L+IFD++K    
Sbjct: 110 AQAFSKVNDCVELLRFLEEISEITCSSTSSFINKIIFAFAKCGQRDKSLVIFDHLKRQGY 169

Query: 188 KPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCLV 367
             D+VTYN +LDILGR GRVD ML  FAS+K+    PD +SYNT+IN LRK GR D+C V
Sbjct: 170 GLDLVTYNIVLDILGRTGRVDEMLDVFASIKDTGFVPDTVSYNTLINGLRKAGRFDMCFV 229

Query: 368 FLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNLK 547
           +  EM EK +EPDL TYTA+IE FGRSG VEE+L  F E+K KG+ PS+YIYR+LI NL 
Sbjct: 230 YFKEMTEKGVEPDLLTYTAIIEIFGRSGNVEESLKCFREMKLKGVLPSIYIYRSLIHNLN 289

Query: 548 KVGKLELAISLLEEMNSCISDLIGPKDFKRRNR 646
           K GK+ELA  LLEE+NS  + L GP DFK++ +
Sbjct: 290 KTGKVELATELLEELNSSSTCLAGPADFKQKRK 322


>ref|XP_003543355.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g11900-like isoform X1 [Glycine max]
          Length = 366

 Score =  229 bits (583), Expect = 1e-57
 Identities = 116/213 (54%), Positives = 150/213 (70%)
 Frame = +2

Query: 8   AKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLKC 187
           A+A     D V L +F+ ++SE+     ++ INKII  FA+  Q DK+L+IFD++K    
Sbjct: 147 AQAFSKVNDCVELLRFLEEISEITCSSTSSFINKIIFAFAKCGQRDKSLVIFDHLKRQGY 206

Query: 188 KPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCLV 367
             D+VTYN +LDILGR GRVD ML  FAS+K+    PD +SYNT+IN LRK GR D+C V
Sbjct: 207 GLDLVTYNIVLDILGRTGRVDEMLDVFASIKDTGFVPDTVSYNTLINGLRKAGRFDMCFV 266

Query: 368 FLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNLK 547
           +  EM EK +EPDL TYTA+IE FGRSG VEE+L  F E+K KG+ PS+YIYR+LI NL 
Sbjct: 267 YFKEMTEKGVEPDLLTYTAIIEIFGRSGNVEESLKCFREMKLKGVLPSIYIYRSLIHNLN 326

Query: 548 KVGKLELAISLLEEMNSCISDLIGPKDFKRRNR 646
           K GK+ELA  LLEE+NS  + L GP DFK++ +
Sbjct: 327 KTGKVELATELLEELNSSSTCLAGPADFKQKRK 359


>ref|XP_002439972.1| hypothetical protein SORBIDRAFT_09g023670 [Sorghum bicolor]
           gi|241945257|gb|EES18402.1| hypothetical protein
           SORBIDRAFT_09g023670 [Sorghum bicolor]
          Length = 377

 Score =  220 bits (561), Expect = 4e-55
 Identities = 113/214 (52%), Positives = 154/214 (71%)
 Frame = +2

Query: 5   LAKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLK 184
           +AKAL    D   + KFV ++ E+   R  T++N+II   A+   IDK+L+IF+ +K  +
Sbjct: 157 VAKALQKLDDCELILKFVRELLEITHHRDPTVMNRIIFATAQYGHIDKSLVIFEELKKYQ 216

Query: 185 CKPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCL 364
              D+VT+NT+LD+LG+AGRVD MLHE   M+E    PDI++YNT+ N LR++GRLDLC 
Sbjct: 217 TSLDVVTFNTVLDMLGKAGRVDEMLHEVKLMEELGHFPDIVTYNTLTNCLRRLGRLDLCK 276

Query: 365 VFLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNL 544
            F  EM E+ + PDLRTYTALI+SFGRSG + +AL +F ++K K   PSVY+YRALISNL
Sbjct: 277 RFFGEMLERGIAPDLRTYTALIDSFGRSGHITDALEMFQKMK-KSHQPSVYVYRALISNL 335

Query: 545 KKVGKLELAISLLEEMNSCISDLIGPKDFKRRNR 646
           KK G+ ELA  L E+M+S  S+LIGP+DFK +N+
Sbjct: 336 KKAGQFELAEKLTEDMSSSASELIGPEDFKPKNK 369


>ref|XP_007149892.1| hypothetical protein PHAVU_005G107600g [Phaseolus vulgaris]
           gi|561023156|gb|ESW21886.1| hypothetical protein
           PHAVU_005G107600g [Phaseolus vulgaris]
          Length = 360

 Score =  219 bits (557), Expect = 1e-54
 Identities = 113/214 (52%), Positives = 152/214 (71%), Gaps = 1/214 (0%)
 Frame = +2

Query: 8   AKALLNAPDSVPLQKFVGKMSELIFPRRAT-IINKIISGFAESRQIDKALMIFDYMKNLK 184
           A+A     D V L +F+ ++SEL+    ++  INKII  FA+  Q DK+L+IFD+++   
Sbjct: 140 AQAFTKENDCVQLLRFLEEISELMSSSTSSSFINKIIFAFAKCGQKDKSLVIFDHLRRQS 199

Query: 185 CKPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCL 364
              D+VTYN +L+ILG  GRVD ML  FAS+K+  + PD +SYNT++N LRK+GR D+C 
Sbjct: 200 YGIDLVTYNIVLNILGHMGRVDEMLDVFASIKDTGLIPDTVSYNTLMNCLRKVGRFDMCF 259

Query: 365 VFLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNL 544
           V+  EM E  +EPDL TYTALIE FGRSG VE++L  F E+K KGI PS+YIYR+LI NL
Sbjct: 260 VYYKEMTENGIEPDLLTYTALIEIFGRSGNVEDSLKCFREMKLKGILPSIYIYRSLIQNL 319

Query: 545 KKVGKLELAISLLEEMNSCISDLIGPKDFKRRNR 646
            K GK+ELA  LLEE++S  + L GP+DFK++ R
Sbjct: 320 NKTGKVELATELLEELSSSSTCLAGPEDFKKKTR 353


>ref|NP_001143211.1| uncharacterized protein LOC100275714 [Zea mays]
           gi|195615844|gb|ACG29752.1| hypothetical protein [Zea
           mays]
          Length = 377

 Score =  217 bits (553), Expect = 4e-54
 Identities = 114/214 (53%), Positives = 152/214 (71%)
 Frame = +2

Query: 5   LAKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLK 184
           +A AL    D   + KFV ++ E+   R AT++N+II   A+   IDK+L+IF+ +K  +
Sbjct: 157 VALALQKLDDCELILKFVREILEITHSRDATVMNRIIFATAKYGHIDKSLVIFEELKKYE 216

Query: 185 CKPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCL 364
              D+VT+NT+LD+LG+AGRVD ML E   M++    PDI++YNT+IN LR++GRLDLC 
Sbjct: 217 TSLDVVTFNTVLDMLGKAGRVDQMLGEVKLMEKLRHFPDIVTYNTLINCLRRLGRLDLCK 276

Query: 365 VFLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNL 544
            F  EM E+ + PDLRTYTALI+SFGRSG + EAL +F E+K K   PSVY+YRAL SNL
Sbjct: 277 RFALEMVERGITPDLRTYTALIDSFGRSGHITEALEMFHEMK-KSHQPSVYVYRALTSNL 335

Query: 545 KKVGKLELAISLLEEMNSCISDLIGPKDFKRRNR 646
           KK G+ ELA  L E+MNS  S+LIGP+DFK  N+
Sbjct: 336 KKAGQFELAQKLTEDMNSSASELIGPEDFKANNK 369


>ref|XP_006417295.1| hypothetical protein EUTSA_v10007999mg [Eutrema salsugineum]
           gi|557095066|gb|ESQ35648.1| hypothetical protein
           EUTSA_v10007999mg [Eutrema salsugineum]
          Length = 363

 Score =  216 bits (551), Expect = 6e-54
 Identities = 111/216 (51%), Positives = 155/216 (71%), Gaps = 2/216 (0%)
 Frame = +2

Query: 5   LAKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLK 184
           LA+A +N  D + L   + ++SE   P R  ++N+ I  FAE+RQIDK LMI + MK  +
Sbjct: 143 LARAFINTDDCIHLLSLLKEVSESSLPCRLIVLNRTILAFAETRQIDKVLMILEQMKEWQ 202

Query: 185 CKPDIVTYNTILDILGRAGRVDSMLHEFASMKE-ANMAPDIISYNTIINNLRKMGRLDLC 361
           CKPD +TYN++LDILGRAG V+ ML   +SMKE  +++ +II+YNT++N LRK  R D+C
Sbjct: 203 CKPDAITYNSVLDILGRAGLVNEMLRLLSSMKEDCHVSLNIITYNTVLNGLRKACRFDMC 262

Query: 362 LVFLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISN 541
           LV  +EM +  +EPDL +YTA+I+S GRSG ++E+L LFDE+K++ I PSVY+YRALI  
Sbjct: 263 LVLYNEMVQGGIEPDLLSYTAVIDSLGRSGNIKESLRLFDEMKQREIRPSVYVYRALIDC 322

Query: 542 LKKVGKLELAISLLEEM-NSCISDLIGPKDFKRRNR 646
           LKK G  + A+ L +E+ N+  SDL GP+DFKR  R
Sbjct: 323 LKKSGDFQRALQLSDELKNTSSSDLAGPQDFKRHLR 358


>gb|ACR38556.1| unknown [Zea mays] gi|413945770|gb|AFW78419.1| hypothetical protein
           ZEAMMB73_401277 [Zea mays]
          Length = 377

 Score =  216 bits (549), Expect = 1e-53
 Identities = 113/214 (52%), Positives = 151/214 (70%)
 Frame = +2

Query: 5   LAKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLK 184
           +A AL    D   + KFV ++ E+   R AT++N+II   A+   IDK+L+IF+ +K  +
Sbjct: 157 VALALQKLDDCEMILKFVREILEITHSRDATVMNRIIFATAKYGHIDKSLVIFEELKKYE 216

Query: 185 CKPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCL 364
              D+VT+NT+LD+LG+AGRVD ML E   M++    PDI++YNT+IN +R++GRLDLC 
Sbjct: 217 TSLDVVTFNTVLDMLGKAGRVDQMLGEVKLMEKLRHFPDIVTYNTLINCMRRLGRLDLCK 276

Query: 365 VFLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNL 544
            F  EM E+ + PDLRTYTALI+SFGRSG + EAL +F E+K K   PSVY+YRAL SNL
Sbjct: 277 RFALEMVERGITPDLRTYTALIDSFGRSGHITEALEMFHEMK-KSHQPSVYVYRALTSNL 335

Query: 545 KKVGKLELAISLLEEMNSCISDLIGPKDFKRRNR 646
           KK G  ELA  L E+MNS  S+LIGP+DFK  N+
Sbjct: 336 KKAGHFELAQKLTEDMNSSASELIGPEDFKPNNK 369


>ref|XP_006654552.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g11900-like [Oryza brachyantha]
          Length = 338

 Score =  215 bits (548), Expect = 1e-53
 Identities = 112/212 (52%), Positives = 150/212 (70%)
 Frame = +2

Query: 5   LAKALLNAPDSVPLQKFVGKMSELIFPRRATIINKIISGFAESRQIDKALMIFDYMKNLK 184
           +AKAL    +   + KFV +  E+   R  T++N II   A+   IDK+L+IF  +K  +
Sbjct: 113 VAKALQTLDEYELILKFVRQTLEITHDRDPTVMNCIIFAMAKYGHIDKSLIIFKELKKDQ 172

Query: 185 CKPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLRKMGRLDLCL 364
              D+VT+NTILD+LG+AGRVD MLHE   M E   +PDII+YNT+IN LR++GRLD C 
Sbjct: 173 RGLDVVTFNTILDMLGKAGRVDQMLHEMTLMDELGHSPDIITYNTVINCLRRLGRLDQCK 232

Query: 365 VFLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVYIYRALISNL 544
           +F  EM E+ + PDLRTYTALI+ FGR+G + EAL +FD++KR    PS+Y+YRALISNL
Sbjct: 233 IFAREMIERGINPDLRTYTALIDIFGRTGDITEALEMFDQMKR-SYQPSIYVYRALISNL 291

Query: 545 KKVGKLELAISLLEEMNSCISDLIGPKDFKRR 640
           +K G+ ELA  L EEM S  S+L+GP+DFKR+
Sbjct: 292 RKAGQFELAEKLSEEMKSSASELLGPEDFKRK 323



 Score = 59.3 bits (142), Expect = 2e-06
 Identities = 35/145 (24%), Positives = 65/145 (44%)
 Frame = +2

Query: 158 IFDYMKNLKCKPDIVTYNTILDILGRAGRVDSMLHEFASMKEANMAPDIISYNTIINNLR 337
           +F Y+   K  PD+ +Y  +   L      + +L       E     D    N II  + 
Sbjct: 94  VFRYLLLSKIAPDLTSYKNVAKALQTLDEYELILKFVRQTLEITHDRDPTVMNCIIFAMA 153

Query: 338 KMGRLDLCLVFLHEMGEKNLEPDLRTYTALIESFGRSGRVEEALGLFDELKRKGISPSVY 517
           K G +D  L+   E+ +     D+ T+  +++  G++GRV++ L     +   G SP + 
Sbjct: 154 KYGHIDKSLIIFKELKKDQRGLDVVTFNTILDMLGKAGRVDQMLHEMTLMDELGHSPDII 213

Query: 518 IYRALISNLKKVGKLELAISLLEEM 592
            Y  +I+ L+++G+L+       EM
Sbjct: 214 TYNTVINCLRRLGRLDQCKIFAREM 238

