BLASTX nr result

ID: Cinnamomum25_contig00002075 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum25_contig00002075
         (1104 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010270624.1| PREDICTED: pentatricopeptide repeat-containi...   343   1e-91
ref|XP_008796414.1| PREDICTED: pentatricopeptide repeat-containi...   319   2e-84
ref|XP_009410349.1| PREDICTED: pentatricopeptide repeat-containi...   306   2e-80
ref|XP_009415576.1| PREDICTED: pentatricopeptide repeat-containi...   302   3e-79
ref|XP_008799153.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   301   4e-79
gb|KDO68436.1| hypothetical protein CISIN_1g047934mg [Citrus sin...   300   2e-78
ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containi...   300   2e-78
ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citr...   300   2e-78
ref|XP_010030710.1| PREDICTED: pentatricopeptide repeat-containi...   298   6e-78
gb|KCW55702.1| hypothetical protein EUGRSUZ_I01549 [Eucalyptus g...   298   6e-78
ref|XP_010090734.1| hypothetical protein L484_013756 [Morus nota...   297   1e-77
ref|XP_007043123.1| Pentatricopeptide repeat superfamily protein...   295   5e-77
ref|XP_008235841.1| PREDICTED: pentatricopeptide repeat-containi...   292   3e-76
ref|XP_012462310.1| PREDICTED: pentatricopeptide repeat-containi...   291   8e-76
ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containi...   289   2e-75
emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera]   288   4e-75
ref|XP_009759999.1| PREDICTED: pentatricopeptide repeat-containi...   287   8e-75
ref|XP_009624485.1| PREDICTED: pentatricopeptide repeat-containi...   287   8e-75
ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containi...   287   1e-74
ref|XP_008382522.1| PREDICTED: pentatricopeptide repeat-containi...   286   1e-74

>ref|XP_010270624.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Nelumbo nucifera] gi|720046844|ref|XP_010270625.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g38150-like [Nelumbo nucifera]
          Length = 407

 Score =  343 bits (881), Expect = 1e-91
 Identities = 191/371 (51%), Positives = 238/371 (64%), Gaps = 4/371 (1%)
 Frame = -3

Query: 1102 GEKRDGLSEDRSPRKFXXXXXXXXXXDQNMRNAQKNPSLQFMKRPLRGEKR---DDFNHL 932
            GE R   S+D    K           ++      +NP      RP+RGEKR    +++  
Sbjct: 40   GEIRSNASQDPFFSKLESGYGQDGKDEERSNRTYQNPPNPIPNRPMRGEKRREPSEYHFN 99

Query: 931  KKFDFGDEDDGVEKASKIQQGFSTQLPXXXXXXXXXXXXESDDLFNEKLFKVDNDVEDQN 752
             KF  GD++D  EK  K  Q   T                + D F  K F   +D+ D+ 
Sbjct: 100  GKFKLGDDEDD-EKMRKPDQIRQTHF--GSSREGKREGRFNGDTFARK-FDFGSDIVDER 155

Query: 751  KEEPERNLQSQMQSRTMKEDNRGDQFSDLFSKQSGLRGDVANKGGAKSTHQNPSIQIPKT 572
              E +++   Q  +R MK + RG    + F ++  L  +       K+T++    Q+ +T
Sbjct: 156  TSESQQSPSVQFPNRPMKGEKRGSPLDESFLEKLRLCEEKK-----KNTNETSPTQVTET 210

Query: 571  EADAL-HSSPQDADEIFRKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPE 395
            +  A   S+PQDADEIFRKMKETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREKGT PE
Sbjct: 211  DVKAEPDSTPQDADEIFRKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTFPE 270

Query: 394  VVIYTAVVEGFCKAAKFHDAEKIFRKMQNNGISPNAFSYTVLIQGLCRGRKLEDSVSFCV 215
            VVIYTAVVEGFCKA K  DA++IFRKMQNNGISPNAFSYTV IQGL +G++LED++  CV
Sbjct: 271  VVIYTAVVEGFCKAEKLDDAKRIFRKMQNNGISPNAFSYTVFIQGLYKGKRLEDAIDICV 330

Query: 214  EMLEAGHSPNVATFVTLIDLICREKGQEEAASVIGRLREKGFAMDDKAVREHLDKKGPFS 35
            EMLEAGHSPNV TF  L+D ICR+KG EEA S I RLREKG+ +D+KA+RE+LDKKGPFS
Sbjct: 331  EMLEAGHSPNVTTFTGLVDAICRDKGVEEAKSTIERLREKGYFVDEKAIREYLDKKGPFS 390

Query: 34   SSLWEVIFGKK 2
              +WE +FGKK
Sbjct: 391  PLIWEAVFGKK 401


>ref|XP_008796414.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Phoenix dactylifera]
          Length = 424

 Score =  319 bits (818), Expect = 2e-84
 Identities = 179/334 (53%), Positives = 217/334 (64%)
 Frame = -3

Query: 1003 QKNPSLQFMKRPLRGEKRDDFNHLKKFDFGDEDDGVEKASKIQQGFSTQLPXXXXXXXXX 824
            ++ P  +  +RP+RGE+R+D  + ++  F +  +  E+   I    S  L          
Sbjct: 113  RRTPESRIPERPMRGERREDSGYSRQ-RFRNHGEDYEENFGIPGPKSASL---------- 161

Query: 823  XXXESDDLFNEKLFKVDNDVEDQNKEEPERNLQSQMQSRTMKEDNRGDQFSDLFSKQSGL 644
                SD   +E+  K   D  DQ K+  E              +  GD+  D   K+  L
Sbjct: 162  ---FSDGPKSEEKNKESIDTGDQLKDSAEI-------------EKGGDKTGDTLFKKLNL 205

Query: 643  RGDVANKGGAKSTHQNPSIQIPKTEADALHSSPQDADEIFRKMKETGLIPNAVAMLDGLC 464
             GD    G  +   Q  + Q    ++    S  +DADEIF++MKETGLIPNAVAMLDGLC
Sbjct: 206  -GDAGRGGKVEEAPQKQTKQSYGPDSMVPKSQSEDADEIFKEMKETGLIPNAVAMLDGLC 264

Query: 463  KDGLIQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFHDAEKIFRKMQNNGISPNAF 284
            KDGLIQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKF DA++IFRKMQ NGI PNAF
Sbjct: 265  KDGLIQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFDDAKRIFRKMQKNGIMPNAF 324

Query: 283  SYTVLIQGLCRGRKLEDSVSFCVEMLEAGHSPNVATFVTLIDLICREKGQEEAASVIGRL 104
            SY VLIQGLC+G KLEDSV +C+EML AGH PN ATF  L+D  C+EKG EEA S++  L
Sbjct: 325  SYAVLIQGLCKGGKLEDSVEYCMEMLGAGHLPNAATFTGLVDRYCKEKGVEEAGSLVRTL 384

Query: 103  REKGFAMDDKAVREHLDKKGPFSSSLWEVIFGKK 2
            RE+GFAMD+KAVREHLDKKGPFS  +WE IFGKK
Sbjct: 385  RERGFAMDEKAVREHLDKKGPFSPMVWEAIFGKK 418


>ref|XP_009410349.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like [Musa acuminata subsp. malaccensis]
          Length = 413

 Score =  306 bits (784), Expect = 2e-80
 Identities = 148/232 (63%), Positives = 184/232 (79%)
 Frame = -3

Query: 697 EDNRGDQFSDLFSKQSGLRGDVANKGGAKSTHQNPSIQIPKTEADALHSSPQDADEIFRK 518
           E+ R D+  D  +++    G+   +   +   Q P++     E+ A  + P+DADEIF+K
Sbjct: 181 EERRSDRIGDSLAQKINF-GEAGRRNRVEEADQKPAV----AESAAQEAPPEDADEIFKK 235

Query: 517 MKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFHD 338
           MKETGLIPNAVAMLDGLCKDGL+Q+AMKLFGLMREKGTIPEVVIYTAVVEGFCK AKF D
Sbjct: 236 MKETGLIPNAVAMLDGLCKDGLVQDAMKLFGLMREKGTIPEVVIYTAVVEGFCKGAKFDD 295

Query: 337 AEKIFRKMQNNGISPNAFSYTVLIQGLCRGRKLEDSVSFCVEMLEAGHSPNVATFVTLID 158
           A++IFRKMQ NGI PNAFS+ VLIQGLC+G+KLEDSV FC+EML+AGH+P+VAT + L+D
Sbjct: 296 AKRIFRKMQKNGIVPNAFSFKVLIQGLCKGKKLEDSVEFCMEMLDAGHAPSVATLIGLVD 355

Query: 157 LICREKGQEEAASVIGRLREKGFAMDDKAVREHLDKKGPFSSSLWEVIFGKK 2
             C+EKG EE  +VI RLRE+GF +D++AVREHL+KKGPFS  +W+  FGKK
Sbjct: 356 GFCQEKGVEEGENVIIRLRERGFVLDERAVREHLNKKGPFSPKVWDAFFGKK 407


>ref|XP_009415576.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
            [Musa acuminata subsp. malaccensis]
          Length = 390

 Score =  302 bits (774), Expect = 3e-79
 Identities = 177/350 (50%), Positives = 219/350 (62%), Gaps = 25/350 (7%)
 Frame = -3

Query: 976  KRPLRGEKRDDFNH----LKKFDFGDEDDGVEKASKIQQGFSTQLPXXXXXXXXXXXXES 809
            +RP+RGE+R D       L+  +FGD DDGV    +  +      P            + 
Sbjct: 42   RRPMRGERRRDDRSEDIFLRGLNFGD-DDGVNGPQRAHREAFPDRPYDGPSLRGAQQRKK 100

Query: 808  DDLFNEKL--------FKVDNDVEDQNKEEPERNLQSQMQSRTMKE-----------DNR 686
            +    E+           VD D+ D+    P  + ++ ++    +E           D  
Sbjct: 101  EPPLREEDGSDGAADDLLVDFDLADRTGRVPPGHTRNSVRRDPPREGFGPSPQSQFKDFG 160

Query: 685  GDQFSDLFSKQSGLRGDVANKGGAKSTHQNPSIQIPKTEAD--ALHSSPQDADEIFRKMK 512
            GD F    S Q   R   A+  G +    +   Q P T A   A  + P+DADEIF+KMK
Sbjct: 161  GDYFEGSGSPQQKARPPSAD--GHRVDKSDVVDQTPPTVAKSAAEEAPPEDADEIFKKMK 218

Query: 511  ETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFHDAE 332
            ETGLIPNAVAMLDGLCKDGLIQEAMKLFG MREKGT+PEVVIYTA VEGFCKAA+F DA+
Sbjct: 219  ETGLIPNAVAMLDGLCKDGLIQEAMKLFGSMREKGTMPEVVIYTAAVEGFCKAARFDDAK 278

Query: 331  KIFRKMQNNGISPNAFSYTVLIQGLCRGRKLEDSVSFCVEMLEAGHSPNVATFVTLIDLI 152
            +IFRKMQ NG +PNAFSY VLIQGLC+G+KL+DSV FC+EML+AGHSP+V T V ++D  
Sbjct: 279  RIFRKMQKNGTAPNAFSYKVLIQGLCKGKKLDDSVEFCMEMLDAGHSPSVTTVVDVVDGF 338

Query: 151  CREKGQEEAASVIGRLREKGFAMDDKAVREHLDKKGPFSSSLWEVIFGKK 2
            CREKG EEAA V+ RLRE+GF +D KAV EHLDKKGPFS  ++E I GKK
Sbjct: 339  CREKGVEEAADVVKRLRERGFVLDLKAVSEHLDKKGPFSPMVFEAISGKK 388


>ref|XP_008799153.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At4g38150 [Phoenix dactylifera]
          Length = 357

 Score =  301 bits (772), Expect = 4e-79
 Identities = 171/339 (50%), Positives = 212/339 (62%), Gaps = 7/339 (2%)
 Frame = -3

Query: 1015 MRNAQKNPSLQFMKRPLRGEKRDDFNHLKKFDFGDEDDGVEKASKIQQGFSTQLPXXXXX 836
            M    + P  +   RPLRG  R+DF H + +  G+  +  ++     +  S         
Sbjct: 37   MGRGGRAPESRIPNRPLRGVGREDFGHFR-WKIGNRGEDYKEFFLGPKSASL-------- 87

Query: 835  XXXXXXXESDDLFNEKLFKVDNDVEDQNKEEPERNLQSQMQSRTMKEDNRGDQFSDLFSK 656
                    +D   +E+  K   D+ DQ+K+             + K +  GD+  D   K
Sbjct: 88   -------FADGPKSEEKNKESTDIGDQSKD-------------SAKIEKGGDKTGDTLFK 127

Query: 655  QSGLRGDVANKGGAKSTHQNPSIQIPKTEADALHSSPQDADEIFRKMKETGL-------I 497
            +  L GD  + G  +   Q  S Q    ++ AL S P+DADEIFRKMKETGL       I
Sbjct: 128  KLNL-GDAGSGGNVEVAPQKKSKQSSGPDSVALESVPEDADEIFRKMKETGLSLFSFFFI 186

Query: 496  PNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFHDAEKIFRK 317
            PNAVAMLDGLCKDGLIQEAMKLFGL+REKGT+PEVVIYTAVVEGFCKAAKF DA++IFRK
Sbjct: 187  PNAVAMLDGLCKDGLIQEAMKLFGLLREKGTVPEVVIYTAVVEGFCKAAKFDDAKRIFRK 246

Query: 316  MQNNGISPNAFSYTVLIQGLCRGRKLEDSVSFCVEMLEAGHSPNVATFVTLIDLICREKG 137
            MQ NGI PNAFSY VLIQGLC+G KLED V FC+EML+ GH PN ATF  L+D  C++KG
Sbjct: 247  MQKNGIVPNAFSYAVLIQGLCKGGKLEDFVEFCMEMLDVGHLPNAATFTGLVDGCCKDKG 306

Query: 136  QEEAASVIGRLREKGFAMDDKAVREHLDKKGPFSSSLWE 20
             EEA S++  LRE+GFA+D+KA R HLDKKGP S  +WE
Sbjct: 307  VEEAGSLVRTLRERGFAVDEKAARVHLDKKGPLSPVVWE 345


>gb|KDO68436.1| hypothetical protein CISIN_1g047934mg [Citrus sinensis]
          Length = 344

 Score =  300 bits (767), Expect = 2e-78
 Identities = 159/261 (60%), Positives = 190/261 (72%), Gaps = 10/261 (3%)
 Frame = -3

Query: 754 NKEEPERNLQSQMQSRT----------MKEDNRGDQFSDLFSKQSGLRGDVANKGGAKST 605
           N ++ +R  Q   QS              ++N  DQF     K+ G      N+   +  
Sbjct: 86  NYQQQQRPQQQSFQSPNGPRPKSPDGVQSDENFLDQFKLAIDKKPG--NPQQNESLGQRQ 143

Query: 604 HQNPSIQIPKTEADALHSSPQDADEIFRKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFG 425
            Q P+   P +E       PQ+ADEIF+KMKETGLIPNAVAMLDGLCKDGLIQEAMKLFG
Sbjct: 144 EQKPNRNEPISEP------PQEADEIFKKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFG 197

Query: 424 LMREKGTIPEVVIYTAVVEGFCKAAKFHDAEKIFRKMQNNGISPNAFSYTVLIQGLCRGR 245
           LMREKGTIPEVVIYTAVV+GFCKA KF DA++IFRKMQ+NGI+PNAFSY +LIQGL +  
Sbjct: 198 LMREKGTIPEVVIYTAVVDGFCKAQKFDDAKRIFRKMQSNGIAPNAFSYNLLIQGLYKCN 257

Query: 244 KLEDSVSFCVEMLEAGHSPNVATFVTLIDLICREKGQEEAASVIGRLREKGFAMDDKAVR 65
           KLE++V +C+EMLEAGHSPNV TFV L+D +CRE+G E+A SVI  L+EKGF ++DKAVR
Sbjct: 258 KLEEAVEYCIEMLEAGHSPNVTTFVGLVDGLCRERGVEKAQSVIATLKEKGFLVNDKAVR 317

Query: 64  EHLDKKGPFSSSLWEVIFGKK 2
           E LDKK PFSSS+WE IFGKK
Sbjct: 318 EFLDKKAPFSSSVWEAIFGKK 338


>ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like [Citrus sinensis]
          Length = 387

 Score =  300 bits (767), Expect = 2e-78
 Identities = 159/261 (60%), Positives = 190/261 (72%), Gaps = 10/261 (3%)
 Frame = -3

Query: 754 NKEEPERNLQSQMQSRT----------MKEDNRGDQFSDLFSKQSGLRGDVANKGGAKST 605
           N ++ +R  Q   QS              ++N  DQF     K+ G      N+   +  
Sbjct: 129 NYQQQQRPQQQSFQSPNGPRPKSPDGVQSDENFLDQFKLAIDKKPG--NPQQNESLGQRQ 186

Query: 604 HQNPSIQIPKTEADALHSSPQDADEIFRKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFG 425
            Q P+   P +E       PQ+ADEIF+KMKETGLIPNAVAMLDGLCKDGLIQEAMKLFG
Sbjct: 187 EQKPNRNEPISEP------PQEADEIFKKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFG 240

Query: 424 LMREKGTIPEVVIYTAVVEGFCKAAKFHDAEKIFRKMQNNGISPNAFSYTVLIQGLCRGR 245
           LMREKGTIPEVVIYTAVV+GFCKA KF DA++IFRKMQ+NGI+PNAFSY +LIQGL +  
Sbjct: 241 LMREKGTIPEVVIYTAVVDGFCKAQKFDDAKRIFRKMQSNGIAPNAFSYNLLIQGLYKCN 300

Query: 244 KLEDSVSFCVEMLEAGHSPNVATFVTLIDLICREKGQEEAASVIGRLREKGFAMDDKAVR 65
           KLE++V +C+EMLEAGHSPNV TFV L+D +CRE+G E+A SVI  L+EKGF ++DKAVR
Sbjct: 301 KLEEAVEYCIEMLEAGHSPNVTTFVGLVDGLCRERGVEKAQSVIATLKEKGFLVNDKAVR 360

Query: 64  EHLDKKGPFSSSLWEVIFGKK 2
           E LDKK PFSSS+WE IFGKK
Sbjct: 361 EFLDKKAPFSSSVWEAIFGKK 381


>ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citrus clementina]
           gi|557524309|gb|ESR35615.1| hypothetical protein
           CICLE_v10028759mg [Citrus clementina]
          Length = 344

 Score =  300 bits (767), Expect = 2e-78
 Identities = 160/269 (59%), Positives = 194/269 (72%), Gaps = 4/269 (1%)
 Frame = -3

Query: 796 NEKLFKVDNDVEDQNKEEPERNLQSQMQSRTMKEDN--RGDQFSDLFSKQSGLRGD--VA 629
           N + F+   +   Q +   +++ QS  + R    D     + F D F      + D    
Sbjct: 76  NRRSFQPRFNNYQQQQRPQQQSFQSPNRPRPKSPDGVQSDENFLDQFKLAIDKKPDNPQQ 135

Query: 628 NKGGAKSTHQNPSIQIPKTEADALHSSPQDADEIFRKMKETGLIPNAVAMLDGLCKDGLI 449
           N+   +   Q P+   P +E       PQ+ADEIF+KMKETGLIPNAVAMLDGLCKDGLI
Sbjct: 136 NESLGERQEQKPNRNEPISEP------PQEADEIFKKMKETGLIPNAVAMLDGLCKDGLI 189

Query: 448 QEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFHDAEKIFRKMQNNGISPNAFSYTVL 269
           QEAMKLFGLMREKGTIPEVVIYTAVV+GFCKA KF DA++IFRKMQ+NGI+PNAFSY +L
Sbjct: 190 QEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKFDDAKRIFRKMQSNGIAPNAFSYNLL 249

Query: 268 IQGLCRGRKLEDSVSFCVEMLEAGHSPNVATFVTLIDLICREKGQEEAASVIGRLREKGF 89
           IQGL +  KLE++V +C+EMLEAGHSPNV TFV L+D +CREKG E+A SVI  L+EKGF
Sbjct: 250 IQGLYKCNKLEEAVEYCIEMLEAGHSPNVTTFVGLVDGLCREKGVEKAQSVIATLKEKGF 309

Query: 88  AMDDKAVREHLDKKGPFSSSLWEVIFGKK 2
            ++DKAVRE LDKK PFSSS+WE IFGKK
Sbjct: 310 LVNDKAVREFLDKKAPFSSSVWEAIFGKK 338


>ref|XP_010030710.1| PREDICTED: pentatricopeptide repeat-containing protein At5g48910
           [Eucalyptus grandis]
          Length = 1024

 Score =  298 bits (762), Expect = 6e-78
 Identities = 151/244 (61%), Positives = 188/244 (77%)
 Frame = -3

Query: 733 NLQSQMQSRTMKEDNRGDQFSDLFSKQSGLRGDVANKGGAKSTHQNPSIQIPKTEADALH 554
           N + +   R   +D+  ++F   F K+    GDVA+     S  +N  +   +   +   
Sbjct: 101 NFRGEGVRRDPSDDSFLEKFKLSFDKRDKPEGDVASATTQPSQEEN-KVNSNQMANEGQP 159

Query: 553 SSPQDADEIFRKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAV 374
             P+DADEIF+KMKETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREKG+IPEVVIYTAV
Sbjct: 160 PLPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGSIPEVVIYTAV 219

Query: 373 VEGFCKAAKFHDAEKIFRKMQNNGISPNAFSYTVLIQGLCRGRKLEDSVSFCVEMLEAGH 194
           VEGFCKA KF DA++IFRKMQNNGI+PNAFS+TVLIQGL R  +LED++ FC EM++AGH
Sbjct: 220 VEGFCKAQKFDDAKRIFRKMQNNGITPNAFSFTVLIQGLYRCDRLEDALEFCQEMIDAGH 279

Query: 193 SPNVATFVTLIDLICREKGQEEAASVIGRLREKGFAMDDKAVREHLDKKGPFSSSLWEVI 14
           SPNV TFV L++ +C++KG EEA +VI RLREKG+ +++KAVRE L+KK PFSS +WE I
Sbjct: 280 SPNVMTFVGLVNGVCKQKGVEEAQTVINRLREKGYFINEKAVREFLEKKAPFSSMVWEAI 339

Query: 13  FGKK 2
           FGKK
Sbjct: 340 FGKK 343


>gb|KCW55702.1| hypothetical protein EUGRSUZ_I01549 [Eucalyptus grandis]
           gi|629089450|gb|KCW55703.1| hypothetical protein
           EUGRSUZ_I01549 [Eucalyptus grandis]
           gi|629089451|gb|KCW55704.1| hypothetical protein
           EUGRSUZ_I01549 [Eucalyptus grandis]
          Length = 349

 Score =  298 bits (762), Expect = 6e-78
 Identities = 151/244 (61%), Positives = 188/244 (77%)
 Frame = -3

Query: 733 NLQSQMQSRTMKEDNRGDQFSDLFSKQSGLRGDVANKGGAKSTHQNPSIQIPKTEADALH 554
           N + +   R   +D+  ++F   F K+    GDVA+     S  +N  +   +   +   
Sbjct: 101 NFRGEGVRRDPSDDSFLEKFKLSFDKRDKPEGDVASATTQPSQEEN-KVNSNQMANEGQP 159

Query: 553 SSPQDADEIFRKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAV 374
             P+DADEIF+KMKETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREKG+IPEVVIYTAV
Sbjct: 160 PLPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGSIPEVVIYTAV 219

Query: 373 VEGFCKAAKFHDAEKIFRKMQNNGISPNAFSYTVLIQGLCRGRKLEDSVSFCVEMLEAGH 194
           VEGFCKA KF DA++IFRKMQNNGI+PNAFS+TVLIQGL R  +LED++ FC EM++AGH
Sbjct: 220 VEGFCKAQKFDDAKRIFRKMQNNGITPNAFSFTVLIQGLYRCDRLEDALEFCQEMIDAGH 279

Query: 193 SPNVATFVTLIDLICREKGQEEAASVIGRLREKGFAMDDKAVREHLDKKGPFSSSLWEVI 14
           SPNV TFV L++ +C++KG EEA +VI RLREKG+ +++KAVRE L+KK PFSS +WE I
Sbjct: 280 SPNVMTFVGLVNGVCKQKGVEEAQTVINRLREKGYFINEKAVREFLEKKAPFSSMVWEAI 339

Query: 13  FGKK 2
           FGKK
Sbjct: 340 FGKK 343


>ref|XP_010090734.1| hypothetical protein L484_013756 [Morus notabilis]
           gi|587850267|gb|EXB40453.1| hypothetical protein
           L484_013756 [Morus notabilis]
          Length = 306

 Score =  297 bits (760), Expect = 1e-77
 Identities = 142/182 (78%), Positives = 162/182 (89%)
 Frame = -3

Query: 547 PQDADEIFRKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVE 368
           P+DADEIF+KMKETGLIPNAVAMLDGLCKDGL+QEAMKLFGLM+EKGTIPEVVIYTAVV+
Sbjct: 119 PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMKEKGTIPEVVIYTAVVD 178

Query: 367 GFCKAAKFHDAEKIFRKMQNNGISPNAFSYTVLIQGLCRGRKLEDSVSFCVEMLEAGHSP 188
           GFCKA K  DA +IFRKMQ+NGI PNAFSY+VL+QGLC G++LED + FCVEMLEAGHSP
Sbjct: 179 GFCKAQKLDDAVRIFRKMQSNGIEPNAFSYSVLVQGLCGGKRLEDGLEFCVEMLEAGHSP 238

Query: 187 NVATFVTLIDLICREKGQEEAASVIGRLREKGFAMDDKAVREHLDKKGPFSSSLWEVIFG 8
           NVATFV L+D +C EKG EEA  VIG+LR+KGF +++KAVRE LDKK  FS S+WE IFG
Sbjct: 239 NVATFVGLVDGLCEEKGVEEAQGVIGKLRDKGFLLNEKAVREFLDKKASFSPSVWEAIFG 298

Query: 7   KK 2
           KK
Sbjct: 299 KK 300


>ref|XP_007043123.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
           cacao] gi|508707058|gb|EOX98954.1| Pentatricopeptide
           repeat superfamily protein, putative [Theobroma cacao]
          Length = 345

 Score =  295 bits (754), Expect = 5e-77
 Identities = 144/209 (68%), Positives = 171/209 (81%)
 Frame = -3

Query: 628 NKGGAKSTHQNPSIQIPKTEADALHSSPQDADEIFRKMKETGLIPNAVAMLDGLCKDGLI 449
           NK G + +    +  + + E +   S PQDADEIF+KMKETGLIPNAVAMLDGLCKDGLI
Sbjct: 131 NKRGKQPSDSEAAALLRRKEQEEKPSPPQDADEIFKKMKETGLIPNAVAMLDGLCKDGLI 190

Query: 448 QEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFHDAEKIFRKMQNNGISPNAFSYTVL 269
           QEAMKLFG MREKGTIPEVVIYTAVV+GFCKA K  DA++IFRKMQ+ G++PN+FSY VL
Sbjct: 191 QEAMKLFGSMREKGTIPEVVIYTAVVDGFCKAHKLDDAKRIFRKMQSKGVTPNSFSYIVL 250

Query: 268 IQGLCRGRKLEDSVSFCVEMLEAGHSPNVATFVTLIDLICREKGQEEAASVIGRLREKGF 89
           IQGL R  KL+D++ FC+EMLEAGHSPNV TFV L+D +C+EKG EEA SVIG L++KGF
Sbjct: 251 IQGLYRCNKLDDAIEFCLEMLEAGHSPNVTTFVGLVDGLCKEKGVEEAQSVIGTLKQKGF 310

Query: 88  AMDDKAVREHLDKKGPFSSSLWEVIFGKK 2
            ++DKAVR+ LDKK PFS  +WE IFGKK
Sbjct: 311 VLNDKAVRQFLDKKAPFSPLVWEAIFGKK 339


>ref|XP_008235841.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
           [Prunus mume]
          Length = 317

 Score =  292 bits (748), Expect = 3e-76
 Identities = 141/190 (74%), Positives = 163/190 (85%)
 Frame = -3

Query: 571 EADALHSSPQDADEIFRKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEV 392
           E D     P++ADEIF+KMKETGLIPNAVAMLDGLCKDGL+Q+AMKLFG MREKGTIPEV
Sbjct: 122 EVDEPPQPPEEADEIFKKMKETGLIPNAVAMLDGLCKDGLVQDAMKLFGSMREKGTIPEV 181

Query: 391 VIYTAVVEGFCKAAKFHDAEKIFRKMQNNGISPNAFSYTVLIQGLCRGRKLEDSVSFCVE 212
           VIYTAVV+GFCKA K  DA++IFRKMQ+NGI PNAFSYTVLIQGL R  KLED+V FC E
Sbjct: 182 VIYTAVVDGFCKAQKLEDAKRIFRKMQSNGIIPNAFSYTVLIQGLYRSNKLEDAVEFCAE 241

Query: 211 MLEAGHSPNVATFVTLIDLICREKGQEEAASVIGRLREKGFAMDDKAVREHLDKKGPFSS 32
           MLEAGHSPNVATFV L+D IC+E   EEA SV+G+L++KG+ +++KAVRE LDKK PFS 
Sbjct: 242 MLEAGHSPNVATFVGLVDTICKENDLEEAESVVGKLKQKGYLVNEKAVREFLDKKAPFSP 301

Query: 31  SLWEVIFGKK 2
           ++WE IFGKK
Sbjct: 302 TVWEAIFGKK 311


>ref|XP_012462310.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
           isoform X1 [Gossypium raimondii]
           gi|763812905|gb|KJB79757.1| hypothetical protein
           B456_013G065500 [Gossypium raimondii]
          Length = 341

 Score =  291 bits (744), Expect = 8e-76
 Identities = 148/232 (63%), Positives = 182/232 (78%), Gaps = 8/232 (3%)
 Frame = -3

Query: 673 SDLFSKQSGLRGDVANKGGAKSTHQNPSIQIPKTEADALH--------SSPQDADEIFRK 518
           SD   K+   + DV      K   +N   ++P +E++A+H        S P+DADEIF+K
Sbjct: 105 SDPTKKREDSQSDVNFLEKFKLGLENKRERVP-SESEAMHRKEHEEKLSPPEDADEIFKK 163

Query: 517 MKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFHD 338
           MKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVV+GFCKA K  D
Sbjct: 164 MKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAHKLED 223

Query: 337 AEKIFRKMQNNGISPNAFSYTVLIQGLCRGRKLEDSVSFCVEMLEAGHSPNVATFVTLID 158
           A++IFRKMQ+ G+ PNAFSYTVLIQGL + + L+D++ FC+EM+EAGHSPNV TFV L+D
Sbjct: 224 AKRIFRKMQSKGVIPNAFSYTVLIQGLYKCKHLDDAIEFCLEMVEAGHSPNVTTFVGLVD 283

Query: 157 LICREKGQEEAASVIGRLREKGFAMDDKAVREHLDKKGPFSSSLWEVIFGKK 2
            +C+EKG EEA +VIG L++KGF ++DKAVR+ LDK+ PFS  +WE IFGKK
Sbjct: 284 GLCKEKGVEEAVNVIGTLKQKGFLVNDKAVRQFLDKRAPFSPLVWEAIFGKK 335


>ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
           [Vitis vinifera]
          Length = 380

 Score =  289 bits (740), Expect = 2e-75
 Identities = 147/207 (71%), Positives = 167/207 (80%), Gaps = 3/207 (1%)
 Frame = -3

Query: 613 KSTHQNPSIQIPKTEADALHSS---PQDADEIFRKMKETGLIPNAVAMLDGLCKDGLIQE 443
           K   Q  +   P  E DA H     PQ+ADEIFRKMKE+GLIPNAVAMLDGLCKDGL+QE
Sbjct: 168 KERPQESAAAQPSREQDANHGKEQPPQNADEIFRKMKESGLIPNAVAMLDGLCKDGLVQE 227

Query: 442 AMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFHDAEKIFRKMQNNGISPNAFSYTVLIQ 263
           AMKLFGLMREKGTIPEVVIYTAVVEGFCKA + +DA +IFRKMQNNGISPNAFSYTVLI+
Sbjct: 228 AMKLFGLMREKGTIPEVVIYTAVVEGFCKARQLNDAVRIFRKMQNNGISPNAFSYTVLIR 287

Query: 262 GLCRGRKLEDSVSFCVEMLEAGHSPNVATFVTLIDLICREKGQEEAASVIGRLREKGFAM 83
           G+ +G +L+ +V FCVEMLEAGHSPNVAT V LI   C+EKG EEA +VI  L++KG  +
Sbjct: 288 GMYKGNRLDIAVDFCVEMLEAGHSPNVATLVDLIHEFCKEKGVEEAKNVIVTLKQKGLFV 347

Query: 82  DDKAVREHLDKKGPFSSSLWEVIFGKK 2
           DDKAVRE+LDKKGP S  +WE  FGKK
Sbjct: 348 DDKAVREYLDKKGPQSPLVWEAFFGKK 374


>emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera]
          Length = 381

 Score =  288 bits (738), Expect = 4e-75
 Identities = 147/207 (71%), Positives = 166/207 (80%), Gaps = 3/207 (1%)
 Frame = -3

Query: 613 KSTHQNPSIQIPKTEADALHSS---PQDADEIFRKMKETGLIPNAVAMLDGLCKDGLIQE 443
           K   Q  +   P  E DA H     PQ+ADEIFRKMKE+GLIPNAVAMLDGLCKDGL+QE
Sbjct: 169 KERPQESAAAQPSREQDANHGKEQPPQNADEIFRKMKESGLIPNAVAMLDGLCKDGLVQE 228

Query: 442 AMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFHDAEKIFRKMQNNGISPNAFSYTVLIQ 263
           AMKLFGLMREKGTIPEVVIYTAVVEGFCKA +  DA +IFRKMQNNGISPNAFSYTVLI+
Sbjct: 229 AMKLFGLMREKGTIPEVVIYTAVVEGFCKARQLDDAVRIFRKMQNNGISPNAFSYTVLIR 288

Query: 262 GLCRGRKLEDSVSFCVEMLEAGHSPNVATFVTLIDLICREKGQEEAASVIGRLREKGFAM 83
           G+ +G +L+ +V FCVEMLEAGHSPNVAT V LI   C+EKG EEA +VI  L++KG  +
Sbjct: 289 GMYKGNRLDIAVDFCVEMLEAGHSPNVATLVDLIHEFCKEKGVEEAKNVIVTLKQKGLFV 348

Query: 82  DDKAVREHLDKKGPFSSSLWEVIFGKK 2
           DDKAVRE+LDKKGP S  +WE  FGKK
Sbjct: 349 DDKAVREYLDKKGPQSPLVWEAFFGKK 375


>ref|XP_009759999.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like [Nicotiana sylvestris]
           gi|698526340|ref|XP_009760000.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At4g38150-like [Nicotiana sylvestris]
          Length = 342

 Score =  287 bits (735), Expect = 8e-75
 Identities = 140/209 (66%), Positives = 171/209 (81%), Gaps = 5/209 (2%)
 Frame = -3

Query: 613 KSTHQNPSIQIPKTEADALHSS-----PQDADEIFRKMKETGLIPNAVAMLDGLCKDGLI 449
           ++T+ NP++      +DA  S      P+D+DEIF+KMKETGLIPNAVAMLDGLCKDGL+
Sbjct: 126 ENTNTNPALHPEGERSDAPASEAPPAPPEDSDEIFKKMKETGLIPNAVAMLDGLCKDGLV 185

Query: 448 QEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAAKFHDAEKIFRKMQNNGISPNAFSYTVL 269
           QEAMKLFGLMREKG IPEVVIYTAVVEGFCKA K+ DA +IFRKMQ NGI PNAFSY +L
Sbjct: 186 QEAMKLFGLMREKGAIPEVVIYTAVVEGFCKAHKYDDAVRIFRKMQGNGIIPNAFSYGIL 245

Query: 268 IQGLCRGRKLEDSVSFCVEMLEAGHSPNVATFVTLIDLICREKGQEEAASVIGRLREKGF 89
           I+GLC+G++LED++ FC+EMLEAGHSPN+ TFV L+D  C+EK  E+A S+I  +R+KGF
Sbjct: 246 IRGLCQGKRLEDALEFCLEMLEAGHSPNLMTFVGLVDGYCKEKSLEDAQSMIKAVRQKGF 305

Query: 88  AMDDKAVREHLDKKGPFSSSLWEVIFGKK 2
            +D+KAVRE+LDKKGPF   +WE I GKK
Sbjct: 306 TLDEKAVREYLDKKGPFLPLVWEAILGKK 334


>ref|XP_009624485.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like [Nicotiana tomentosiformis]
          Length = 342

 Score =  287 bits (735), Expect = 8e-75
 Identities = 147/258 (56%), Positives = 189/258 (73%), Gaps = 1/258 (0%)
 Frame = -3

Query: 772 NDVEDQNKEEPERNLQSQMQSRTMKEDNRGDQFSDLFSKQSGLRGDVANKGGAKSTHQNP 593
           N         P  N ++Q++S+  ++          F K+  L  D  ++    +   +P
Sbjct: 87  NPTHSTTFRRPGENNENQIKSQDSQD----------FLKRFQLGFDRKDENPNTNPALHP 136

Query: 592 SIQIPKTEA-DALHSSPQDADEIFRKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMR 416
             ++  T A ++  + P+D+DEIF+KMKETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMR
Sbjct: 137 KGEMSDTPASESSPAPPEDSDEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMR 196

Query: 415 EKGTIPEVVIYTAVVEGFCKAAKFHDAEKIFRKMQNNGISPNAFSYTVLIQGLCRGRKLE 236
           EKGTIPEVVIYTAVVEGFCKA K+ D  +IFRKMQ NGI PNAFSY++LI+GLC+GR+LE
Sbjct: 197 EKGTIPEVVIYTAVVEGFCKAHKYDDGVRIFRKMQGNGIIPNAFSYSILIRGLCQGRRLE 256

Query: 235 DSVSFCVEMLEAGHSPNVATFVTLIDLICREKGQEEAASVIGRLREKGFAMDDKAVREHL 56
           D++ FC+EMLEAGHSPN+ TFV L+D  C+EK  E+A S+I  +R+KGF +D+KAVRE+L
Sbjct: 257 DALEFCLEMLEAGHSPNLMTFVGLVDGYCKEKSLEDAQSMIKAVRQKGFILDEKAVREYL 316

Query: 55  DKKGPFSSSLWEVIFGKK 2
           DKKGPF   +WE I GKK
Sbjct: 317 DKKGPFLPLVWEAILGKK 334


>ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
           [Solanum lycopersicum]
          Length = 340

 Score =  287 bits (734), Expect = 1e-74
 Identities = 151/257 (58%), Positives = 185/257 (71%), Gaps = 6/257 (2%)
 Frame = -3

Query: 754 NKEEPERNLQSQMQSRTMKEDNRGDQFSDLFSKQSGLRGDVANKGGAKSTHQNPSIQIPK 575
           N+  P  +   +  S    E     Q S+ F K+  L        G     +NP+   PK
Sbjct: 86  NRSSPNHSTTFRRSSEN-NESQMKSQDSEDFLKRFQL--------GFDRKEENPNTN-PK 135

Query: 574 TEA------DALHSSPQDADEIFRKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMRE 413
            E+      +A  + P+DADEIF+KMKETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMRE
Sbjct: 136 AESRDCPVSEAPPAPPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRE 195

Query: 412 KGTIPEVVIYTAVVEGFCKAAKFHDAEKIFRKMQNNGISPNAFSYTVLIQGLCRGRKLED 233
           KGTIPEVVIYTAVV+GFCKA KF DA +IFRKMQ NGI PNAFSY ++I+GL +G++L+D
Sbjct: 196 KGTIPEVVIYTAVVDGFCKAQKFDDAVRIFRKMQGNGIIPNAFSYGIIIRGLSQGKRLDD 255

Query: 232 SVSFCVEMLEAGHSPNVATFVTLIDLICREKGQEEAASVIGRLREKGFAMDDKAVREHLD 53
           ++ FC+EMLEAGHSPNV TFVTL+D  C+EK  E+A ++I  +R+KGF +DDKAVRE LD
Sbjct: 256 ALEFCLEMLEAGHSPNVVTFVTLVDGFCKEKSLEDAQNMIKTVRQKGFIVDDKAVREFLD 315

Query: 52  KKGPFSSSLWEVIFGKK 2
           KKGPF   +WE I GKK
Sbjct: 316 KKGPFLPVVWEAILGKK 332


>ref|XP_008382522.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
           [Malus domestica]
          Length = 313

 Score =  286 bits (733), Expect = 1e-74
 Identities = 137/181 (75%), Positives = 159/181 (87%)
 Frame = -3

Query: 547 PQDADEIFRKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVE 368
           P++AD+IF+KMKETGLIPNAVAMLDGLCKDGL+QEAMKLFG MREKGTIPEVVIYTAVV+
Sbjct: 126 PEEADQIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGSMREKGTIPEVVIYTAVVD 185

Query: 367 GFCKAAKFHDAEKIFRKMQNNGISPNAFSYTVLIQGLCRGRKLEDSVSFCVEMLEAGHSP 188
           GFCKA K  DA++IFRKMQ+NGI PNAFSYTVLIQGL R   LED+V FC EMLEAGHSP
Sbjct: 186 GFCKAQKLEDAKRIFRKMQSNGIVPNAFSYTVLIQGLYRANMLEDAVEFCSEMLEAGHSP 245

Query: 187 NVATFVTLIDLICREKGQEEAASVIGRLREKGFAMDDKAVREHLDKKGPFSSSLWEVIFG 8
           NV TFV LID++C+EK  EEA SVIG+L++KG+ +++KAV+E LDKK PFS  +WE IFG
Sbjct: 246 NVTTFVGLIDMVCKEKDMEEAESVIGKLKQKGYLVNEKAVKEFLDKKAPFSPRVWEAIFG 305

Query: 7   K 5
           K
Sbjct: 306 K 306

