BLASTX nr result

ID: Akebia24_contig00025238 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00025238
         (453 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004508296.1| PREDICTED: pentatricopeptide repeat-containi...   147   1e-33
ref|XP_004147606.1| PREDICTED: pentatricopeptide repeat-containi...   144   1e-32
ref|XP_007040009.1| Tetratricopeptide repeat-like superfamily pr...   144   1e-32
ref|XP_003609609.1| Pentatricopeptide repeat-containing protein ...   142   4e-32
ref|XP_007210066.1| hypothetical protein PRUPE_ppa021101mg [Prun...   142   6e-32
ref|XP_002303536.1| pentatricopeptide repeat-containing family p...   139   3e-31
ref|XP_002270938.2| PREDICTED: pentatricopeptide repeat-containi...   139   5e-31
ref|XP_003635554.1| PREDICTED: pentatricopeptide repeat-containi...   135   8e-30
emb|CAN64316.1| hypothetical protein VITISV_027915 [Vitis vinifera]   135   8e-30
gb|EYU42170.1| hypothetical protein MIMGU_mgv1a003558mg [Mimulus...   134   1e-29
ref|XP_003550529.1| PREDICTED: pentatricopeptide repeat-containi...   132   6e-29
ref|XP_006362924.1| PREDICTED: pentatricopeptide repeat-containi...   131   1e-28
gb|EXC17845.1| hypothetical protein L484_023201 [Morus notabilis]     130   1e-28
ref|XP_007154241.1| hypothetical protein PHAVU_003G102300g [Phas...   130   2e-28
ref|XP_004248284.1| PREDICTED: pentatricopeptide repeat-containi...   126   3e-27
ref|XP_003634255.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   120   2e-25
gb|EPS61154.1| hypothetical protein M569_13643, partial [Genlise...   116   3e-24
ref|XP_002276196.1| PREDICTED: pentatricopeptide repeat-containi...   105   5e-21
ref|NP_194007.2| pentatricopeptide repeat-containing protein [Ar...   103   3e-20
emb|CAA16561.1| predicted protein [Arabidopsis thaliana] gi|7269...   103   3e-20

>ref|XP_004508296.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g22760-like [Cicer arietinum]
          Length = 579

 Score =  147 bits (372), Expect = 1e-33
 Identities = 73/150 (48%), Positives = 95/150 (63%)
 Frame = +3

Query: 3   QIHAQIXXXXXXXXXXXXXXXXXXXXSNSSQISSHYVQQIFDKLPNPDVFSWACTIRYFS 182
           QIHAQI                      +    SHYV  I   L NPD FSW C IR+FS
Sbjct: 21  QIHAQIFINGLTRLEPLFIHHILLWDITNYNTISHYVLSILHHLRNPDSFSWGCVIRFFS 80

Query: 183 QHNRFKEAISLYVRMQRFGSHPSTYSVSATLKACARISEIFVGMEIHSQVHKYGFNCDVY 362
           Q   F EA+SLYV+M++ G  PS++S+S+ LK+CAR+ + F G+ IH QVHK+GFN  VY
Sbjct: 81  QKGLFIEAVSLYVQMRKMGLCPSSHSISSALKSCARVHDNFCGVSIHGQVHKFGFNTCVY 140

Query: 363 VQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           VQT+L+DLY K+G++ TA++VFDE   RNV
Sbjct: 141 VQTSLLDLYCKIGDVRTARKVFDEMTDRNV 170



 Score = 62.0 bits (149), Expect = 8e-08
 Identities = 34/126 (26%), Positives = 72/126 (57%), Gaps = 2/126 (1%)
 Frame = +3

Query: 81  SNSSQISSHYVQQIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQR--FGSHPST 254
           S S  + S   +++FD++ + D+ S+   I  ++Q+++ KEA+ L+  M +     HP  
Sbjct: 274 SKSGDVDS--ARELFDQMDDKDLLSYNAIIACYAQNSKPKEALDLFNLMLKPDILVHPDK 331

Query: 255 YSVSATLKACARISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDE 434
            ++++ + AC+++  +     + S+++++    D ++ TALIDLY+K G ++ A   F  
Sbjct: 332 MTLASIISACSQLGSLEHWRWVESRINEFRIVLDDHLATALIDLYAKCGSIDKAYEQFHG 391

Query: 435 FPKRNV 452
             KR+V
Sbjct: 392 LRKRDV 397


>ref|XP_004147606.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g22760-like [Cucumis sativus]
           gi|449529850|ref|XP_004171911.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At4g22760-like [Cucumis sativus]
          Length = 580

 Score =  144 bits (364), Expect = 1e-32
 Identities = 75/150 (50%), Positives = 97/150 (64%)
 Frame = +3

Query: 3   QIHAQIXXXXXXXXXXXXXXXXXXXXSNSSQISSHYVQQIFDKLPNPDVFSWACTIRYFS 182
           QIHAQI                       ++I S Y+Q+I     NPD F+WAC +R+FS
Sbjct: 21  QIHAQILVNGLPNLESCLVRQITRSQFTCARIVSRYLQRILHHSRNPDAFTWACAVRFFS 80

Query: 183 QHNRFKEAISLYVRMQRFGSHPSTYSVSATLKACARISEIFVGMEIHSQVHKYGFNCDVY 362
           Q+ +F EAI+ YV+MQR G HPST++VS+TL+AC RI   F G  IH+QV+K GF   VY
Sbjct: 81  QNGQFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCKFRGWCIHAQVYKLGFCRCVY 140

Query: 363 VQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           VQTAL+D YSKLG+M  AQ+VFDE  ++NV
Sbjct: 141 VQTALVDFYSKLGDMGFAQKVFDEMTEKNV 170



 Score = 55.8 bits (133), Expect = 6e-06
 Identities = 34/128 (26%), Positives = 67/128 (52%), Gaps = 4/128 (3%)
 Frame = +3

Query: 81  SNSSQISSHYVQQIFDKLPNPD--VFSWACTIRYFSQHNRFKEAISLYVRMQR--FGSHP 248
           S   +++S Y  ++FDK+   +  + S+   I  +SQ++   +A+ L+  M +      P
Sbjct: 274 SKLGEVNSAY--ELFDKMEESEKELLSFNAMIACYSQNSMPNKALELFNLMLQPHVNIQP 331

Query: 249 STYSVSATLKACARISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVF 428
              + ++ + AC ++  +  G  I S + K G   D ++ TAL+DLY+K G +  A  +F
Sbjct: 332 DEMTFASVISACTQLGNLSYGTWIESYMEKLGIELDDHLATALVDLYAKSGNINRAFELF 391

Query: 429 DEFPKRNV 452
           +   KR++
Sbjct: 392 NGLKKRDL 399


>ref|XP_007040009.1| Tetratricopeptide repeat-like superfamily protein [Theobroma cacao]
           gi|508777254|gb|EOY24510.1| Tetratricopeptide
           repeat-like superfamily protein [Theobroma cacao]
          Length = 576

 Score =  144 bits (363), Expect = 1e-32
 Identities = 71/150 (47%), Positives = 97/150 (64%)
 Frame = +3

Query: 3   QIHAQIXXXXXXXXXXXXXXXXXXXXSNSSQISSHYVQQIFDKLPNPDVFSWACTIRYFS 182
           QIHAQI                    +N S     YV+QI   L  PD FSW C +R+ S
Sbjct: 21  QIHAQILISSLNHLQPLLVHQFLLSTNNYSSSVFLYVRQILYHLQKPDAFSWGCAVRFLS 80

Query: 183 QHNRFKEAISLYVRMQRFGSHPSTYSVSATLKACARISEIFVGMEIHSQVHKYGFNCDVY 362
           QH +F EA SLYV+MQ+ G +P+T+++S+ L+ACAR      G+ IH+QVH+YG    V+
Sbjct: 81  QHGQFAEAFSLYVKMQKLGLYPTTHAISSALRACARTECKTGGISIHAQVHRYGVCNCVF 140

Query: 363 VQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           VQTAL+DLY+KLG+M+TA++VFDE P++NV
Sbjct: 141 VQTALVDLYTKLGDMDTAKKVFDEMPEKNV 170



 Score = 65.5 bits (158), Expect = 7e-09
 Identities = 34/115 (29%), Positives = 66/115 (57%), Gaps = 2/115 (1%)
 Frame = +3

Query: 114 QQIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQRFGS--HPSTYSVSATLKACA 287
           +++FDK+   D  ++   I  ++Q+++  EA+ L+  M + G    P   ++++ + AC+
Sbjct: 283 RELFDKMGEKDHLAFNAMISCYAQNSQPTEALKLFDEMLKAGVCIQPDGITLASVISACS 342

Query: 288 RISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           ++ E+  G  I S + K G   D ++ TALIDLY+K G ++ A  +F    K++V
Sbjct: 343 QLRELRFGSWIESYISKLGIQMDDHMATALIDLYAKCGSIDKAYHLFHGLRKKDV 397


>ref|XP_003609609.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355510664|gb|AES91806.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 576

 Score =  142 bits (359), Expect = 4e-32
 Identities = 70/150 (46%), Positives = 96/150 (64%)
 Frame = +3

Query: 3   QIHAQIXXXXXXXXXXXXXXXXXXXXSNSSQISSHYVQQIFDKLPNPDVFSWACTIRYFS 182
           QIHAQI                      + +  S+Y+  I   L NPD FSW C IR+FS
Sbjct: 21  QIHAQIITNNLTHLEPIFIHRILLCDITNYKTISNYILSILHHLRNPDSFSWGCVIRFFS 80

Query: 183 QHNRFKEAISLYVRMQRFGSHPSTYSVSATLKACARISEIFVGMEIHSQVHKYGFNCDVY 362
           Q  +F EA+SLYV+M+R G  PS+++VS+ LK+CAR+ +   G+ IH  VHK+GF+  VY
Sbjct: 81  QKGQFVEAVSLYVQMRRIGLCPSSHAVSSILKSCARVEDDLCGLLIHGHVHKFGFDACVY 140

Query: 363 VQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           VQTAL+DLY K+G++ TA++VFDE P +NV
Sbjct: 141 VQTALLDLYCKIGDVVTARKVFDEMPDKNV 170



 Score = 70.5 bits (171), Expect = 2e-10
 Identities = 36/118 (30%), Positives = 70/118 (59%), Gaps = 2/118 (1%)
 Frame = +3

Query: 105 HYVQQIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQRFGS--HPSTYSVSATLK 278
           H  +++FD++ + D+ S+   I  ++Q ++ KEA+ L+  M +  S  HP   ++++ + 
Sbjct: 280 HSARELFDQMDDKDLLSYNAMIACYAQSSKPKEALDLFNVMLKPDSSLHPDKMTLASVIS 339

Query: 279 ACARISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           AC+++  +     I SQ++ +G   D ++ TALIDLY+K G ++ A  +F    KR+V
Sbjct: 340 ACSQLGNLEHWRWIESQINNFGIVLDDHLATALIDLYAKCGSIDKAYELFHGLRKRDV 397


>ref|XP_007210066.1| hypothetical protein PRUPE_ppa021101mg [Prunus persica]
           gi|462405801|gb|EMJ11265.1| hypothetical protein
           PRUPE_ppa021101mg [Prunus persica]
          Length = 578

 Score =  142 bits (357), Expect = 6e-32
 Identities = 68/121 (56%), Positives = 91/121 (75%)
 Frame = +3

Query: 90  SQISSHYVQQIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQRFGSHPSTYSVSA 269
           S+  + YV QI   L NPD FSW C IR+FS H ++K A+SLYV+++R G  P++++VS+
Sbjct: 52  SRTIAQYVHQILHNLQNPDDFSWGCAIRFFSLHFQYKTALSLYVQIKRLGLCPTSFAVSS 111

Query: 270 TLKACARISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPKRN 449
            L+ACARI     G+ IH+QVHKYGF   VYVQTAL+DLYSKLG+META++VFD   ++N
Sbjct: 112 ALRACARIVHKVGGISIHAQVHKYGFCGCVYVQTALVDLYSKLGDMETARKVFDGMTEKN 171

Query: 450 V 452
           V
Sbjct: 172 V 172



 Score = 66.6 bits (161), Expect = 3e-09
 Identities = 32/115 (27%), Positives = 70/115 (60%), Gaps = 2/115 (1%)
 Frame = +3

Query: 114 QQIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLY--VRMQRFGSHPSTYSVSATLKACA 287
           +Q+FD++   DV S+   I  ++Q+++ K+A+ L+  +  +     P+  ++++ + A +
Sbjct: 285 RQVFDQMSEKDVLSFNAMIACYAQNSQPKDALELFNQILKREVNIQPNEMTLASVISASS 344

Query: 288 RISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           ++ ++  G+ I + + K G   D ++ TAL+DLY+K G++E A ++F    KR+V
Sbjct: 345 QLGDLKFGLWIETYMSKDGIELDDHLATALLDLYTKCGDIERAYKLFHGLKKRDV 399


>ref|XP_002303536.1| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|222840968|gb|EEE78515.1|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 596

 Score =  139 bits (351), Expect = 3e-31
 Identities = 70/123 (56%), Positives = 90/123 (73%)
 Frame = +3

Query: 84  NSSQISSHYVQQIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQRFGSHPSTYSV 263
           N+ +  S YV  I   LP+PD FSW   IRYFSQ  +FKEA+ LYV+MQR G  PST++V
Sbjct: 68  NNPRRISQYVHSILYHLPHPDSFSWGWAIRYFSQQGQFKEALYLYVQMQRQGLCPSTFAV 127

Query: 264 SATLKACARISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPK 443
           S+ L+A AR +    GM IH++ +KYGF+  VYVQTAL+DLYSKLG+M TAQ+VFDE  +
Sbjct: 128 SSALRAYARTTYKMGGMSIHAESYKYGFSNCVYVQTALVDLYSKLGDMNTAQKVFDELAE 187

Query: 444 RNV 452
           +NV
Sbjct: 188 KNV 190



 Score = 55.8 bits (133), Expect = 6e-06
 Identities = 29/114 (25%), Positives = 66/114 (57%), Gaps = 2/114 (1%)
 Frame = +3

Query: 117 QIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQRFGSH--PSTYSVSATLKACAR 290
           ++FD++   D+ ++   I  F+Q+++ ++A+ L+  M +  ++  P   ++++ + AC++
Sbjct: 304 KLFDQIAKKDLLTFNAMISCFAQNSQPRKALWLFSEMLKAYANIQPDQMTLASVVSACSQ 363

Query: 291 ISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           + ++     I S V+  G   D  + TAL+DLY+K G ++ A  +F    K++V
Sbjct: 364 LGDLRFASWIESYVNDLGTEIDDQLVTALLDLYAKCGSVDKAYELFHGLNKKDV 417


>ref|XP_002270938.2| PREDICTED: pentatricopeptide repeat-containing protein
           At4g22760-like [Vitis vinifera]
          Length = 580

 Score =  139 bits (349), Expect = 5e-31
 Identities = 69/153 (45%), Positives = 96/153 (62%), Gaps = 3/153 (1%)
 Frame = +3

Query: 3   QIHAQIXXXXXXXXXXXXXXXXXXXXSNSSQISSHYVQQIFDKLPNPDVFSWACTIRYFS 182
           Q+HA I                    SN S   + YV  +     +PD FSWAC IR+ +
Sbjct: 22  QVHALILIHGLSHLEPILARQILISASNYSATVAQYVHSVLHHSKSPDSFSWACAIRFST 81

Query: 183 QHNRFKEAISLYVRMQRFGSHPSTYSVSATLKACARISEIFVGMEIHSQVHKYGFNC--- 353
           QH +FKEA +LYV+MQR+G  P+T+++S+ LKACARI+    G+ IH QV K+GF+C   
Sbjct: 82  QHGQFKEAFALYVQMQRWGLCPTTFALSSALKACARIAYRMGGISIHGQVQKFGFSCGGD 141

Query: 354 DVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
            +YV+TAL+D Y KLG+ME A+++FDE  +RNV
Sbjct: 142 GIYVETALVDFYCKLGDMEIARKMFDEMAERNV 174



 Score = 58.9 bits (141), Expect = 7e-07
 Identities = 29/114 (25%), Positives = 64/114 (56%), Gaps = 2/114 (1%)
 Frame = +3

Query: 117 QIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQR--FGSHPSTYSVSATLKACAR 290
           ++FD++   D+  +   I  ++Q++R  EA++L+  M        P   ++++ + AC++
Sbjct: 288 ELFDQVGGKDLLLFNAMIACYAQNSRPNEALNLFNNMLNPYVNVQPDEMTLASVISACSQ 347

Query: 291 ISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           + ++  G  I S + + G   D ++ TAL+DLY+K G ++ A  +F    K+++
Sbjct: 348 LGDLRFGPWIESYMRRLGIEMDGHLATALLDLYAKCGSIDKAYELFHGLRKKDL 401


>ref|XP_003635554.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g22760-like [Vitis vinifera]
           gi|296083555|emb|CBI23551.3| unnamed protein product
           [Vitis vinifera]
          Length = 580

 Score =  135 bits (339), Expect = 8e-30
 Identities = 68/153 (44%), Positives = 95/153 (62%), Gaps = 3/153 (1%)
 Frame = +3

Query: 3   QIHAQIXXXXXXXXXXXXXXXXXXXXSNSSQISSHYVQQIFDKLPNPDVFSWACTIRYFS 182
           Q+HA I                    SN S   + YV  +     +PD FSWAC IR+ +
Sbjct: 22  QVHALILIHGLSHLEPILARQILLSASNYSATVAQYVHSVLHHSKSPDSFSWACAIRFST 81

Query: 183 QHNRFKEAISLYVRMQRFGSHPSTYSVSATLKACARISEIFVGMEIHSQVHKYGFN---C 353
           QH +FKEA +LYV+MQR+G  P+T+++S+ LKACARI+    G+ IH QV K+GF+    
Sbjct: 82  QHGQFKEAFALYVQMQRWGLWPTTFALSSALKACARIAYRMGGLSIHGQVQKFGFSGGGD 141

Query: 354 DVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
            +YV+TAL+D Y KLG+ME A+++FDE  +RNV
Sbjct: 142 GIYVETALVDFYCKLGDMEIARKMFDEMAERNV 174



 Score = 60.5 bits (145), Expect = 2e-07
 Identities = 30/114 (26%), Positives = 64/114 (56%), Gaps = 2/114 (1%)
 Frame = +3

Query: 117 QIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQR--FGSHPSTYSVSATLKACAR 290
           ++FD++   D+  +   I  ++Q++R KEA+ L+  M        P   ++++ + AC++
Sbjct: 288 ELFDQVGGKDLLLFNAMIACYAQNSRPKEALKLFNNMLNPDVNVQPDEMTLASVISACSQ 347

Query: 291 ISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           + ++  G  I S + + G   D ++ TAL+DLY+K G ++ A  +F    K+++
Sbjct: 348 LGDLRFGPWIESYMRRLGIEMDGHLATALLDLYAKCGSIDKAYELFHGLRKKDL 401


>emb|CAN64316.1| hypothetical protein VITISV_027915 [Vitis vinifera]
          Length = 841

 Score =  135 bits (339), Expect = 8e-30
 Identities = 68/153 (44%), Positives = 95/153 (62%), Gaps = 3/153 (1%)
 Frame = +3

Query: 3   QIHAQIXXXXXXXXXXXXXXXXXXXXSNSSQISSHYVQQIFDKLPNPDVFSWACTIRYFS 182
           Q+HA I                    SN S   + YV  +     +PD FSWAC IR+ +
Sbjct: 18  QVHALILIHGLSHLEPILARQILJSASNYSATVAQYVHSVLHHSKSPDSFSWACAIRFST 77

Query: 183 QHNRFKEAISLYVRMQRFGSHPSTYSVSATLKACARISEIFVGMEIHSQVHKYGFN---C 353
           QH +FKEA +LYV+MQR+G  P+T+++S+ LKACARI+    G+ IH QV K+GF+    
Sbjct: 78  QHGQFKEAFALYVQMQRWGLWPTTFALSSALKACARIAYRMGGLSIHGQVQKFGFSGGGD 137

Query: 354 DVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
            +YV+TAL+D Y KLG+ME A+++FDE  +RNV
Sbjct: 138 GIYVETALVDFYCKLGDMEIARKMFDEMAERNV 170



 Score = 60.5 bits (145), Expect = 2e-07
 Identities = 30/114 (26%), Positives = 64/114 (56%), Gaps = 2/114 (1%)
 Frame = +3

Query: 117 QIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQR--FGSHPSTYSVSATLKACAR 290
           ++FD++   D+  +   I  ++Q++R KEA+ L+  M        P   ++++ + AC++
Sbjct: 284 ELFDQVGGKDLLLFNAMIACYAQNSRPKEALKLFNNMLNPDVNVQPDEMTLASVISACSQ 343

Query: 291 ISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           + ++  G  I S + + G   D ++ TAL+DLY+K G ++ A  +F    K+++
Sbjct: 344 LGDLRFGPWIESYMRRLGIEMDGHLATALLDLYAKCGSIDKAYELFHGLRKKDL 397


>gb|EYU42170.1| hypothetical protein MIMGU_mgv1a003558mg [Mimulus guttatus]
          Length = 578

 Score =  134 bits (337), Expect = 1e-29
 Identities = 68/150 (45%), Positives = 94/150 (62%)
 Frame = +3

Query: 3   QIHAQIXXXXXXXXXXXXXXXXXXXXSNSSQISSHYVQQIFDKLPNPDVFSWACTIRYFS 182
           ++H QI                    S+ S   SHYV+ I  ++ NPDVFS +CTIRY S
Sbjct: 21  ELHCQILINSLKNLEPLLIRQTLKYSSHYSPRISHYVKSILRRMQNPDVFSTSCTIRYLS 80

Query: 183 QHNRFKEAISLYVRMQRFGSHPSTYSVSATLKACARISEIFVGMEIHSQVHKYGFNCDVY 362
           QH +F+EA SLYV+M   G  PST++VS+ LKACA+I     G+ IH QV K+GF   V+
Sbjct: 81  QHGQFREAFSLYVQMHSSGLFPSTFAVSSALKACAKILCQIGGLMIHGQVEKFGFRNVVF 140

Query: 363 VQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           VQTAL+D Y K+G++  A+++FDE  ++NV
Sbjct: 141 VQTALVDFYLKMGDLMIARKLFDEITEKNV 170



 Score = 59.7 bits (143), Expect = 4e-07
 Identities = 29/115 (25%), Positives = 65/115 (56%), Gaps = 2/115 (1%)
 Frame = +3

Query: 114 QQIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQR--FGSHPSTYSVSATLKACA 287
           + ++D++   ++  +   I  F+Q+NR KEA+ L+  M +      P   ++++ + +C+
Sbjct: 283 KDLYDQVIEKNLLLYNAMIACFAQNNRAKEALQLFDEMLKPNVNFQPDKMTLASVISSCS 342

Query: 288 RISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           ++ +   G  I S + + G   D ++ T+ IDLY+K G++E A ++F+    R++
Sbjct: 343 QLGDSKYGAWIESYMRESGIRMDDHLATSFIDLYAKCGDIEKAYKLFNGLQNRDL 397


>ref|XP_003550529.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g22760-like [Glycine max]
          Length = 576

 Score =  132 bits (331), Expect = 6e-29
 Identities = 67/150 (44%), Positives = 91/150 (60%)
 Frame = +3

Query: 3   QIHAQIXXXXXXXXXXXXXXXXXXXXSNSSQISSHYVQQIFDKLPNPDVFSWACTIRYFS 182
           QIHA I                      + +  ++Y   +   L  PD FSW C IR+FS
Sbjct: 21  QIHAHILINGFTFLRPLLIHRMLLWDVTNYRTMANYAYSMLHHLHIPDSFSWGCVIRFFS 80

Query: 183 QHNRFKEAISLYVRMQRFGSHPSTYSVSATLKACARISEIFVGMEIHSQVHKYGFNCDVY 362
           Q   F EA+SLYV+M R    P++++VS+ LK+CARI ++  GM IH QVH +GFN  VY
Sbjct: 81  QKCLFTEAVSLYVQMHRTSLCPTSHAVSSALKSCARIHDMLCGMSIHGQVHVFGFNTCVY 140

Query: 363 VQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           VQTAL+DLYSK+G+M TA++VFDE   ++V
Sbjct: 141 VQTALLDLYSKIGDMGTARKVFDEMANKSV 170



 Score = 65.9 bits (159), Expect = 6e-09
 Identities = 33/115 (28%), Positives = 69/115 (60%), Gaps = 2/115 (1%)
 Frame = +3

Query: 114 QQIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRM--QRFGSHPSTYSVSATLKACA 287
           +++FD++ + D+ S+   I  ++Q+++ KEA+ L+  M  Q    HP   ++++ + AC+
Sbjct: 283 RKLFDQMDHKDLLSYNAMIACYAQNSKPKEALELFNDMLKQDIYVHPDKMTLASVISACS 342

Query: 288 RISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           ++ ++     I S ++ +G   D ++ TALIDLY+K G ++ A  +F    KR++
Sbjct: 343 QLGDLEHWWWIESHMNDFGIVLDDHLATALIDLYAKCGSIDKAYELFHNLRKRDL 397


>ref|XP_006362924.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g22760-like [Solanum tuberosum]
          Length = 576

 Score =  131 bits (329), Expect = 1e-28
 Identities = 67/150 (44%), Positives = 98/150 (65%)
 Frame = +3

Query: 3   QIHAQIXXXXXXXXXXXXXXXXXXXXSNSSQISSHYVQQIFDKLPNPDVFSWACTIRYFS 182
           QIHA I                    S+ S  +SHY++ +   + N DVFS A TIR++S
Sbjct: 21  QIHAAILVNGLFDLESLLVQQIINSASSYSYYTSHYIKLVLSHVQNLDVFSVASTIRFYS 80

Query: 183 QHNRFKEAISLYVRMQRFGSHPSTYSVSATLKACARISEIFVGMEIHSQVHKYGFNCDVY 362
           +H +F+EA++LY ++QR G  PST++VS++LKACARI     G+ IH+QV+KYGF   VY
Sbjct: 81  RHCQFREAVNLYGKLQRCGLSPSTFAVSSSLKACARILYRSGGISIHAQVYKYGFCNVVY 140

Query: 363 VQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           VQTAL+D YSK+G M+ A+ +F+E  ++N+
Sbjct: 141 VQTALVDFYSKVGNMDFARSIFNEMVEKNI 170



 Score = 71.2 bits (173), Expect = 1e-10
 Identities = 35/124 (28%), Positives = 70/124 (56%)
 Frame = +3

Query: 81  SNSSQISSHYVQQIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQRFGSHPSTYS 260
           S S  + S   +++F +L   D   +   I  ++Q++R KEA+ L+  M R    P   +
Sbjct: 274 SKSGDVES--AEELFGQLHKKDQLVYNSMIACYAQNSRAKEALQLFNEMLRLDLQPDEMT 331

Query: 261 VSATLKACARISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFP 440
           +++ + AC+++ ++  G  I S +H+ G   D ++ T+LIDLY+K G ++ A ++F    
Sbjct: 332 LASAISACSQLGDLKFGSWIESFIHETGIQMDDFLATSLIDLYAKCGSIDKAHKLFHGLK 391

Query: 441 KRNV 452
           K+++
Sbjct: 392 KKDL 395


>gb|EXC17845.1| hypothetical protein L484_023201 [Morus notabilis]
          Length = 577

 Score =  130 bits (328), Expect = 1e-28
 Identities = 62/123 (50%), Positives = 88/123 (71%)
 Frame = +3

Query: 84  NSSQISSHYVQQIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQRFGSHPSTYSV 263
           NS  I+ H    +F  + NPD  SW   +R+FS + +FK A SLYV +QR G  PS+++ 
Sbjct: 50  NSRVIAQHIYGTLFH-MQNPDALSWGFAVRFFSANGQFKTAFSLYVHLQRLGLCPSSFAF 108

Query: 264 SATLKACARISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPK 443
           S++LKACARI     G+ +H+QV+KYG    VYVQTAL+DLYSK+G++++AQ++FDE P+
Sbjct: 109 SSSLKACARIGHKLGGLSVHAQVYKYGVCGSVYVQTALLDLYSKVGDVKSAQKLFDEMPE 168

Query: 444 RNV 452
           RNV
Sbjct: 169 RNV 171


>ref|XP_007154241.1| hypothetical protein PHAVU_003G102300g [Phaseolus vulgaris]
           gi|561027595|gb|ESW26235.1| hypothetical protein
           PHAVU_003G102300g [Phaseolus vulgaris]
          Length = 579

 Score =  130 bits (327), Expect = 2e-28
 Identities = 60/117 (51%), Positives = 82/117 (70%)
 Frame = +3

Query: 102 SHYVQQIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQRFGSHPSTYSVSATLKA 281
           +HY   +   L NPD FSW C IR+ SQ   F EA+SLYV M R   +P+++++S+ LK+
Sbjct: 54  AHYAYSMLHHLQNPDSFSWGCVIRFLSQKGLFTEAVSLYVEMHRRRLYPTSHAISSALKS 113

Query: 282 CARISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           CARI  +  G+ IH QVH +GF+  VYVQTAL+DLYSK+G+M TA+ VFD   +R+V
Sbjct: 114 CARIQNMLGGVSIHGQVHVFGFDTCVYVQTALLDLYSKMGDMGTARHVFDGMAERSV 170



 Score = 61.6 bits (148), Expect = 1e-07
 Identities = 32/114 (28%), Positives = 67/114 (58%), Gaps = 2/114 (1%)
 Frame = +3

Query: 117 QIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQRFGS--HPSTYSVSATLKACAR 290
           ++FD++ + D+ S+   I  ++Q+++ KEA+ L+  M +     HP   + ++ + AC++
Sbjct: 284 KLFDQMNHKDLLSYNAMIACYAQNSKPKEALELFNDMLKHNIYIHPDKMTFASVISACSQ 343

Query: 291 ISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           + ++     I S+++ +G   D ++ TALIDLY+K G +  A  +F    KR++
Sbjct: 344 LGDLERWCWIESRMNDFGIVLDDHLATALIDLYAKSGSIGMAYELFHGLRKRDL 397


>ref|XP_004248284.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g22760-like [Solanum lycopersicum]
          Length = 577

 Score =  126 bits (317), Expect = 3e-27
 Identities = 68/151 (45%), Positives = 95/151 (62%), Gaps = 1/151 (0%)
 Frame = +3

Query: 3   QIHAQIXXXXXXXXXXXXXXXXXXXXSNSSQISSHYVQQIFDKLPNPDVFSWACTIRYFS 182
           QIHA I                    S+ +  +SHY++ +   L N DVFS A TIR++S
Sbjct: 21  QIHAAILVNGLFDLESLLVQQIINSASSCTYYTSHYIKLVLSHLQNLDVFSVASTIRFYS 80

Query: 183 QHNRFKEAISLYVRMQRFGSHPSTYSVSATLKACARISEIFVGMEIHSQVHKYGFNCDVY 362
           +H +F+EA++LY  +QR G  PST++VS+ LKACARI     G+ IH+QV KYGF   VY
Sbjct: 81  RHCQFREAVNLYGELQRCGLSPSTFAVSSALKACARILYRSGGISIHAQVFKYGFCNVVY 140

Query: 363 VQTALIDLYSKLGEMETAQRVF-DEFPKRNV 452
           VQTAL+D YSK+G M+ A+ +F DE  ++N+
Sbjct: 141 VQTALVDFYSKVGNMDFARSIFDDEMVEKNI 171



 Score = 70.5 bits (171), Expect = 2e-10
 Identities = 32/113 (28%), Positives = 66/113 (58%)
 Frame = +3

Query: 114 QQIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQRFGSHPSTYSVSATLKACARI 293
           +++F KL   D   +   I  ++Q++R KEA+ L+  M +    P   ++++ + AC+++
Sbjct: 284 EELFGKLRKKDQVVYNAMIACYAQNSRAKEALQLFNEMLQLDLQPDEMTLASAISACSQL 343

Query: 294 SEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
            ++  G  I S +H+ G   D ++ T+LIDLY+K G ++ A ++F    K+++
Sbjct: 344 GDLKFGSWIESFIHETGIQMDDFLATSLIDLYAKCGSIDKAHKLFHGLKKKDL 396


>ref|XP_003634255.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
           protein At4g22760-like [Vitis vinifera]
          Length = 583

 Score =  120 bits (301), Expect = 2e-25
 Identities = 59/121 (48%), Positives = 82/121 (67%), Gaps = 3/121 (2%)
 Frame = +3

Query: 81  SNSSQISSHYVQQIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQRFGSHPSTYS 260
           SN S   +HYV  I     +PD FSWACTIR+F+QH +FK A +LYV+MQR+G  P+T++
Sbjct: 36  SNCSATIAHYVHSILHHSKSPDSFSWACTIRFFTQHGQFKGAFALYVQMQRWGLCPTTFA 95

Query: 261 VSATLKACARISEIFVGMEIHSQVHKYGF---NCDVYVQTALIDLYSKLGEMETAQRVFD 431
           +S+ LKAC RI      + IH QV K+GF      +Y++TAL+D   KLG+ME + ++FD
Sbjct: 96  LSSALKACDRIVYRMGVLSIHGQVQKFGFCGGGDGIYMETALVDFDCKLGDMEISSKMFD 155

Query: 432 E 434
           E
Sbjct: 156 E 156


>gb|EPS61154.1| hypothetical protein M569_13643, partial [Genlisea aurea]
          Length = 563

 Score =  116 bits (291), Expect = 3e-24
 Identities = 57/116 (49%), Positives = 81/116 (69%)
 Frame = +3

Query: 105 HYVQQIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQRFGSHPSTYSVSATLKAC 284
           +Y   I  ++  PD FS +  IRY SQH +F+EA+S+YV++ R G  P+TY++S+ L+AC
Sbjct: 42  NYATGILSRMQCPDPFSASRAIRYLSQHGQFREALSIYVQIHRLGFSPNTYALSSVLRAC 101

Query: 285 ARISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
           ARI     G+ IH QV K GF+  ++VQT+L+D YSK+GEME AQR+FD    +NV
Sbjct: 102 ARILCKAGGLMIHGQVKKLGFSGVIFVQTSLLDFYSKIGEMEIAQRLFDGIRGKNV 157


>ref|XP_002276196.1| PREDICTED: pentatricopeptide repeat-containing protein At3g12770
           [Vitis vinifera] gi|296081235|emb|CBI17979.3| unnamed
           protein product [Vitis vinifera]
          Length = 742

 Score =  105 bits (263), Expect = 5e-21
 Identities = 47/115 (40%), Positives = 71/115 (61%)
 Frame = +3

Query: 108 YVQQIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQRFGSHPSTYSVSATLKACA 287
           Y +++FD+ P P VF W   IR +S HN F +AI +Y RMQ  G +P  +++   LKAC+
Sbjct: 121 YARKVFDEFPEPSVFLWNAIIRGYSSHNFFGDAIEMYSRMQASGVNPDGFTLPCVLKACS 180

Query: 288 RISEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
            +  + VG  +H Q+ + GF  DV+VQ  L+ LY+K G +E A+ VF+    RN+
Sbjct: 181 GVPVLEVGKRVHGQIFRLGFESDVFVQNGLVALYAKCGRVEQARIVFEGLDDRNI 235



 Score = 62.4 bits (150), Expect = 6e-08
 Identities = 30/113 (26%), Positives = 61/113 (53%)
 Frame = +3

Query: 114 QQIFDKLPNPDVFSWACTIRYFSQHNRFKEAISLYVRMQRFGSHPSTYSVSATLKACARI 293
           +  FD++  P+V  W   I  ++++    EA+ L+  M        + +V + + ACA++
Sbjct: 325 RSFFDQMEIPNVMMWNAMISGYAKNGYTNEAVGLFQEMISKNIRTDSITVRSAILACAQV 384

Query: 294 SEIFVGMEIHSQVHKYGFNCDVYVQTALIDLYSKLGEMETAQRVFDEFPKRNV 452
             + +   +   ++K  +  DV+V TALID+++K G ++ A+ VFD    ++V
Sbjct: 385 GSLDLAKWMGDYINKTEYRNDVFVNTALIDMFAKCGSVDLAREVFDRTLDKDV 437


>ref|NP_194007.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|223635615|sp|P0C8Q5.1|PP336_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At4g22760 gi|332659255|gb|AEE84655.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 578

 Score =  103 bits (256), Expect = 3e-20
 Identities = 51/149 (34%), Positives = 80/149 (53%)
 Frame = +3

Query: 3   QIHAQIXXXXXXXXXXXXXXXXXXXXSNSSQISSHYVQQIFDKLPNPDVFSWACTIRYFS 182
           Q+HAQ+                       S+    YV++I       D FSW C +R+ S
Sbjct: 21  QVHAQLVVNRYNHLEPILVHQTLHFTKEFSRNIVTYVKRILKGFNGHDSFSWGCLVRFLS 80

Query: 183 QHNRFKEAISLYVRMQRFGSHPSTYSVSATLKACARISEIFVGMEIHSQVHKYGFNCDVY 362
           QH +FKE + +Y+ M   G  PS+++V++ L+AC ++  +  G  IH+Q  K G    VY
Sbjct: 81  QHRKFKETVDVYIDMHNSGIPPSSHAVTSVLRACGKMENMVDGKPIHAQALKNGLCGCVY 140

Query: 363 VQTALIDLYSKLGEMETAQRVFDEFPKRN 449
           VQT L+ LYS+LG +E A++ FD+  ++N
Sbjct: 141 VQTGLVGLYSRLGYIELAKKAFDDIAEKN 169


>emb|CAA16561.1| predicted protein [Arabidopsis thaliana] gi|7269122|emb|CAB79231.1|
           predicted protein [Arabidopsis thaliana]
          Length = 889

 Score =  103 bits (256), Expect = 3e-20
 Identities = 51/149 (34%), Positives = 80/149 (53%)
 Frame = +3

Query: 3   QIHAQIXXXXXXXXXXXXXXXXXXXXSNSSQISSHYVQQIFDKLPNPDVFSWACTIRYFS 182
           Q+HAQ+                       S+    YV++I       D FSW C +R+ S
Sbjct: 290 QVHAQLVVNRYNHLEPILVHQTLHFTKEFSRNIVTYVKRILKGFNGHDSFSWGCLVRFLS 349

Query: 183 QHNRFKEAISLYVRMQRFGSHPSTYSVSATLKACARISEIFVGMEIHSQVHKYGFNCDVY 362
           QH +FKE + +Y+ M   G  PS+++V++ L+AC ++  +  G  IH+Q  K G    VY
Sbjct: 350 QHRKFKETVDVYIDMHNSGIPPSSHAVTSVLRACGKMENMVDGKPIHAQALKNGLCGCVY 409

Query: 363 VQTALIDLYSKLGEMETAQRVFDEFPKRN 449
           VQT L+ LYS+LG +E A++ FD+  ++N
Sbjct: 410 VQTGLVGLYSRLGYIELAKKAFDDIAEKN 438

