BLASTX nr result

ID: Forsythia22_contig00029242 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00029242
         (1034 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011070580.1| PREDICTED: pentatricopeptide repeat-containi...   383   e-103
ref|XP_012843466.1| PREDICTED: pentatricopeptide repeat-containi...   374   e-101
gb|EYU32378.1| hypothetical protein MIMGU_mgv1a026042mg, partial...   374   e-101
ref|XP_009759999.1| PREDICTED: pentatricopeptide repeat-containi...   354   7e-95
ref|XP_009624485.1| PREDICTED: pentatricopeptide repeat-containi...   347   8e-93
ref|XP_009626842.1| PREDICTED: pentatricopeptide repeat-containi...   343   9e-92
ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containi...   341   4e-91
ref|XP_006342489.1| PREDICTED: pentatricopeptide repeat-containi...   340   1e-90
ref|XP_009784631.1| PREDICTED: pentatricopeptide repeat-containi...   337   8e-90
gb|EPS62438.1| hypothetical protein M569_12351, partial [Genlise...   335   4e-89
ref|XP_010090734.1| hypothetical protein L484_013756 [Morus nota...   330   8e-88
ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citr...   321   6e-85
ref|XP_012462310.1| PREDICTED: pentatricopeptide repeat-containi...   320   8e-85
gb|KDO68436.1| hypothetical protein CISIN_1g047934mg [Citrus sin...   320   1e-84
ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containi...   320   1e-84
ref|XP_007043123.1| Pentatricopeptide repeat superfamily protein...   318   5e-84
ref|XP_010270624.1| PREDICTED: pentatricopeptide repeat-containi...   316   2e-83
gb|KCW55702.1| hypothetical protein EUGRSUZ_I01549 [Eucalyptus g...   315   3e-83
ref|XP_010030710.1| PREDICTED: pentatricopeptide repeat-containi...   315   4e-83
ref|XP_008235841.1| PREDICTED: pentatricopeptide repeat-containi...   309   2e-81

>ref|XP_011070580.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
            [Sesamum indicum]
          Length = 305

 Score =  383 bits (984), Expect = e-103
 Identities = 199/253 (78%), Positives = 218/253 (86%), Gaps = 1/253 (0%)
 Frame = -3

Query: 1032 RFPKFNRVGENGNPSSVRSEGD-NFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIENVE 856
            RFPK NRV EN N +S R+E D +FLERFKLGFD K+  ++  SG+ N+    EK +N E
Sbjct: 57   RFPKPNRVRENENLNSFRAETDADFLERFKLGFDRKL-ESQTDSGDKNIQY--EKAKNPE 113

Query: 855  PISPPGDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYT 676
            P SPP DADEIFKKMKETGLI NAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYT
Sbjct: 114  P-SPPEDADEIFKKMKETGLIANAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYT 172

Query: 675  AVVEGFCKAQKFEDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEA 496
            AVVEGFCKA KF+DA+RIFKKMQSNGVVPN FSY +L+ GLC GKRLEDAY   IEMLEA
Sbjct: 173  AVVEGFCKAHKFDDAIRIFKKMQSNGVVPNVFSYQVLVLGLCSGKRLEDAYLFTIEMLEA 232

Query: 495  GHSPNTATFIGLVDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWE 316
            GHSPN ATF GLVDGYCREKGL EAQ VI AMRQKG+F++EKAVREYL+KKGPFLPL+WE
Sbjct: 233  GHSPNLATFTGLVDGYCREKGLGEAQFVIQAMRQKGYFVEEKAVREYLDKKGPFLPLVWE 292

Query: 315  AILGKKASKKSLF 277
            AILGKKASK+SLF
Sbjct: 293  AILGKKASKRSLF 305


>ref|XP_012843466.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
            [Erythranthe guttatus]
          Length = 301

 Score =  374 bits (960), Expect = e-101
 Identities = 193/253 (76%), Positives = 209/253 (82%), Gaps = 1/253 (0%)
 Frame = -3

Query: 1032 RFPKFNRVGENGNPSSVRSEGD-NFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIENVE 856
            RFP    V EN NP+S ++E D +FLE+FKLGFD K       +   N S   EK ENVE
Sbjct: 51   RFPNSRGVRENENPNSFKAETDADFLEKFKLGFDRKSETL--TTDSINKSIQPEKKENVE 108

Query: 855  PISPPGDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYT 676
            PISPP DADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYT
Sbjct: 109  PISPPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYT 168

Query: 675  AVVEGFCKAQKFEDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEA 496
            AVV+GFCKA K EDAVRIFKKMQSNG+VPNAFSY +LI+GLC G RL+D Y   IEMLEA
Sbjct: 169  AVVDGFCKAHKLEDAVRIFKKMQSNGIVPNAFSYQVLIRGLCSGNRLDDVYGFTIEMLEA 228

Query: 495  GHSPNTATFIGLVDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWE 316
            GHSPN ATF GLVD YCREK LEEAQ VI AMR KGFF +EKAVRE+L+KKGPFLPL+WE
Sbjct: 229  GHSPNLATFTGLVDVYCREKDLEEAQNVIKAMRHKGFFFEEKAVREHLDKKGPFLPLVWE 288

Query: 315  AILGKKASKKSLF 277
            AILG KASK+SLF
Sbjct: 289  AILGNKASKRSLF 301


>gb|EYU32378.1| hypothetical protein MIMGU_mgv1a026042mg, partial [Erythranthe
            guttata]
          Length = 269

 Score =  374 bits (960), Expect = e-101
 Identities = 193/253 (76%), Positives = 209/253 (82%), Gaps = 1/253 (0%)
 Frame = -3

Query: 1032 RFPKFNRVGENGNPSSVRSEGD-NFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIENVE 856
            RFP    V EN NP+S ++E D +FLE+FKLGFD K       +   N S   EK ENVE
Sbjct: 19   RFPNSRGVRENENPNSFKAETDADFLEKFKLGFDRKSETL--TTDSINKSIQPEKKENVE 76

Query: 855  PISPPGDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYT 676
            PISPP DADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYT
Sbjct: 77   PISPPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYT 136

Query: 675  AVVEGFCKAQKFEDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEA 496
            AVV+GFCKA K EDAVRIFKKMQSNG+VPNAFSY +LI+GLC G RL+D Y   IEMLEA
Sbjct: 137  AVVDGFCKAHKLEDAVRIFKKMQSNGIVPNAFSYQVLIRGLCSGNRLDDVYGFTIEMLEA 196

Query: 495  GHSPNTATFIGLVDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWE 316
            GHSPN ATF GLVD YCREK LEEAQ VI AMR KGFF +EKAVRE+L+KKGPFLPL+WE
Sbjct: 197  GHSPNLATFTGLVDVYCREKDLEEAQNVIKAMRHKGFFFEEKAVREHLDKKGPFLPLVWE 256

Query: 315  AILGKKASKKSLF 277
            AILG KASK+SLF
Sbjct: 257  AILGNKASKRSLF 269


>ref|XP_009759999.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Nicotiana sylvestris] gi|698526340|ref|XP_009760000.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g38150-like [Nicotiana sylvestris]
          Length = 342

 Score =  354 bits (908), Expect = 7e-95
 Identities = 174/249 (69%), Positives = 203/249 (81%), Gaps = 1/249 (0%)
 Frame = -3

Query: 1020 FNRVGENGNPSSVRSEGDNFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIENVEPISPP 841
            F + GEN        +  +FL+RF+LGFD K  NT            ++   +  P +PP
Sbjct: 94   FRKPGENNENQIKSQDSQDFLKRFQLGFDRKDENTNTNPALHPEGERSDAPASEAPPAPP 153

Query: 840  GDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEG 661
             D+DEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKG IPEVV+YTAVVEG
Sbjct: 154  EDSDEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGAIPEVVIYTAVVEG 213

Query: 660  FCKAQKFEDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEAGHSPN 481
            FCKA K++DAVRIF+KMQ NG++PNAFSYGILI+GLC+GKRLEDA E C+EMLEAGHSPN
Sbjct: 214  FCKAHKYDDAVRIFRKMQGNGIIPNAFSYGILIRGLCQGKRLEDALEFCLEMLEAGHSPN 273

Query: 480  TATFIGLVDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWEAILGK 301
              TF+GLVDGYC+EK LE+AQ +I A+RQKGF +DEKAVREYL+KKGPFLPL+WEAILGK
Sbjct: 274  LMTFVGLVDGYCKEKSLEDAQSMIKAVRQKGFTLDEKAVREYLDKKGPFLPLVWEAILGK 333

Query: 300  KAS-KKSLF 277
            KAS ++SLF
Sbjct: 334  KASQRQSLF 342


>ref|XP_009624485.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Nicotiana tomentosiformis]
          Length = 342

 Score =  347 bits (890), Expect = 8e-93
 Identities = 171/249 (68%), Positives = 200/249 (80%), Gaps = 1/249 (0%)
 Frame = -3

Query: 1020 FNRVGENGNPSSVRSEGDNFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIENVEPISPP 841
            F R GEN        +  +FL+RF+LGFD K  N             ++   +    +PP
Sbjct: 94   FRRPGENNENQIKSQDSQDFLKRFQLGFDRKDENPNTNPALHPKGEMSDTPASESSPAPP 153

Query: 840  GDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEG 661
             D+DEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVV+YTAVVEG
Sbjct: 154  EDSDEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEG 213

Query: 660  FCKAQKFEDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEAGHSPN 481
            FCKA K++D VRIF+KMQ NG++PNAFSY ILI+GLC+G+RLEDA E C+EMLEAGHSPN
Sbjct: 214  FCKAHKYDDGVRIFRKMQGNGIIPNAFSYSILIRGLCQGRRLEDALEFCLEMLEAGHSPN 273

Query: 480  TATFIGLVDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWEAILGK 301
              TF+GLVDGYC+EK LE+AQ +I A+RQKGF +DEKAVREYL+KKGPFLPL+WEAILGK
Sbjct: 274  LMTFVGLVDGYCKEKSLEDAQSMIKAVRQKGFILDEKAVREYLDKKGPFLPLVWEAILGK 333

Query: 300  KAS-KKSLF 277
            KAS ++SLF
Sbjct: 334  KASQRQSLF 342


>ref|XP_009626842.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Nicotiana tomentosiformis]
            gi|697098468|ref|XP_009626850.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g38150-like [Nicotiana tomentosiformis]
          Length = 336

 Score =  343 bits (881), Expect = 9e-92
 Identities = 171/252 (67%), Positives = 204/252 (80%), Gaps = 4/252 (1%)
 Frame = -3

Query: 1020 FNRVGENGNPSSVRSEGDNFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIE---NVEPI 850
            F R GEN        +  +FL+RF+LGFD K         ++N ++N  + +   +  P 
Sbjct: 94   FRRPGENNENQIECQDSQDFLKRFQLGFDRK---------DENPNTNPARSDTPASESPP 144

Query: 849  SPPGDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAV 670
            +PP D+DEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIP VV+YTAV
Sbjct: 145  APPEDSDEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPAVVIYTAV 204

Query: 669  VEGFCKAQKFEDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEAGH 490
            V+GFCKA K++DAVRIF+KMQ NG++PNAFSY  LI+GLC+GKRLEDA E C+EMLEAGH
Sbjct: 205  VQGFCKAHKYDDAVRIFRKMQGNGIIPNAFSYSSLIRGLCQGKRLEDALEFCLEMLEAGH 264

Query: 489  SPNTATFIGLVDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWEAI 310
            SPN  TF+ LVDGYC+EK LE+AQ +I A+RQKGF +DEKAVREYL+KKGPFLPL+WEAI
Sbjct: 265  SPNMTTFVDLVDGYCKEKSLEDAQSMIKAVRQKGFILDEKAVREYLDKKGPFLPLVWEAI 324

Query: 309  LGKKAS-KKSLF 277
            LGKKAS ++SLF
Sbjct: 325  LGKKASQRQSLF 336


>ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
            [Solanum lycopersicum]
          Length = 340

 Score =  341 bits (875), Expect = 4e-91
 Identities = 167/250 (66%), Positives = 202/250 (80%), Gaps = 5/250 (2%)
 Frame = -3

Query: 1020 FNRVGENGNPSSVRSEGDNFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIENVEPIS-- 847
            F R  EN        + ++FL+RF+LGFD K         E+N ++N +      P+S  
Sbjct: 96   FRRSSENNESQMKSQDSEDFLKRFQLGFDRK---------EENPNTNPKAESRDCPVSEA 146

Query: 846  ---PPGDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYT 676
               PP DADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVV+YT
Sbjct: 147  PPAPPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYT 206

Query: 675  AVVEGFCKAQKFEDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEA 496
            AVV+GFCKAQKF+DAVRIF+KMQ NG++PNAFSYGI+I+GL +GKRL+DA E C+EMLEA
Sbjct: 207  AVVDGFCKAQKFDDAVRIFRKMQGNGIIPNAFSYGIIIRGLSQGKRLDDALEFCLEMLEA 266

Query: 495  GHSPNTATFIGLVDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWE 316
            GHSPN  TF+ LVDG+C+EK LE+AQ +I  +RQKGF +D+KAVRE+L+KKGPFLP++WE
Sbjct: 267  GHSPNVVTFVTLVDGFCKEKSLEDAQNMIKTVRQKGFIVDDKAVREFLDKKGPFLPVVWE 326

Query: 315  AILGKKASKK 286
            AILGKKAS++
Sbjct: 327  AILGKKASQR 336


>ref|XP_006342489.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Solanum tuberosum]
          Length = 354

 Score =  340 bits (871), Expect = 1e-90
 Identities = 164/243 (67%), Positives = 198/243 (81%)
 Frame = -3

Query: 1014 RVGENGNPSSVRSEGDNFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIENVEPISPPGD 835
            R GEN        + ++FL+RF+LGFD K  N            +++   +  P +PP D
Sbjct: 108  RSGENNGGQMKSQDSEDFLKRFQLGFDRKEENPNTNPALHPKGESSDSPVSEAPPAPPED 167

Query: 834  ADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFC 655
            ADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVV+YTAVV+GF 
Sbjct: 168  ADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVDGFF 227

Query: 654  KAQKFEDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEAGHSPNTA 475
            KAQKF+DAVRIF+KMQ NG++PNAFSYGILI+GL +G RL+DA+E C+EMLEAGHSPN  
Sbjct: 228  KAQKFDDAVRIFRKMQGNGIIPNAFSYGILIRGLSQGNRLDDAFEFCLEMLEAGHSPNVV 287

Query: 474  TFIGLVDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWEAILGKKA 295
            TF+ LVDG+C+EK LE+AQ +I  +RQKGF +D+KAVREYL+KKGPFLP++WEAILGKKA
Sbjct: 288  TFVTLVDGFCKEKSLEDAQNMIKTVRQKGFIVDDKAVREYLDKKGPFLPVVWEAILGKKA 347

Query: 294  SKK 286
            S++
Sbjct: 348  SQR 350


>ref|XP_009784631.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Nicotiana sylvestris]
          Length = 314

 Score =  337 bits (864), Expect = 8e-90
 Identities = 168/250 (67%), Positives = 201/250 (80%), Gaps = 4/250 (1%)
 Frame = -3

Query: 1014 RVGENGNPSSVRSEGDNFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIE---NVEPISP 844
            R G+N        +  +FL+RF+LGFD K         ++N ++N  + +   +  P +P
Sbjct: 74   RPGQNNENQIKSLDNQDFLKRFQLGFDHK---------DENPNTNPARSDTPASESPPAP 124

Query: 843  PGDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVE 664
            P D+DEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFG MREKG IPEVV+YTAVVE
Sbjct: 125  PEDSDEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGRMREKGAIPEVVIYTAVVE 184

Query: 663  GFCKAQKFEDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEAGHSP 484
            GFCKA K++DAVRIF+KMQ NG++PNAFSY  LI+GLC+GKRLEDA E C+EMLEAG SP
Sbjct: 185  GFCKAHKYDDAVRIFRKMQGNGIIPNAFSYSSLIRGLCQGKRLEDALEFCLEMLEAGQSP 244

Query: 483  NTATFIGLVDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWEAILG 304
            N  TF+GLVDGYC+EK LE+AQ +I A+R KGF +DEKAVREYL+KKGPFLPL+WEAILG
Sbjct: 245  NMTTFVGLVDGYCKEKSLEDAQSMIKAVRHKGFILDEKAVREYLDKKGPFLPLVWEAILG 304

Query: 303  KKAS-KKSLF 277
            KKAS ++SLF
Sbjct: 305  KKASQRQSLF 314


>gb|EPS62438.1| hypothetical protein M569_12351, partial [Genlisea aurea]
          Length = 272

 Score =  335 bits (858), Expect = 4e-89
 Identities = 168/252 (66%), Positives = 203/252 (80%), Gaps = 2/252 (0%)
 Frame = -3

Query: 1026 PKFNRVGENGNP-SSVRSEGDN-FLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIENVEP 853
            PK +R+  +GNP ++  +E D+ FLERFKLGFD K     G   E   +   E+ E  +P
Sbjct: 21   PKSDRIRGSGNPRAAAAAESDSDFLERFKLGFDRKTTTPPGRVVESEKAGGEEEKEEQQP 80

Query: 852  ISPPGDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTA 673
            +SPP +ADEIF+KMKETGLIPNAVAMLDGLCKDGLVQ+A+KLFG MREKG+IP+VVVYTA
Sbjct: 81   LSPPENADEIFRKMKETGLIPNAVAMLDGLCKDGLVQDALKLFGTMREKGSIPDVVVYTA 140

Query: 672  VVEGFCKAQKFEDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEAG 493
            VVEGFCKAQK +DA+RIFKKM+SNG+ PNAFSY ILI+GLC GKRLEDA     EMLE G
Sbjct: 141  VVEGFCKAQKHDDAIRIFKKMKSNGIAPNAFSYQILIRGLCDGKRLEDASGFTAEMLETG 200

Query: 492  HSPNTATFIGLVDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWEA 313
            +SPN ATF GLV+G+C+EKGLEEA+ ++GAM+QKGF ++EKAVREYL+KKGPF   +WEA
Sbjct: 201  YSPNLATFTGLVNGWCQEKGLEEAKTLVGAMKQKGFSVEEKAVREYLDKKGPFSSPVWEA 260

Query: 312  ILGKKASKKSLF 277
            ILG K   +SLF
Sbjct: 261  ILGIKDYTRSLF 272


>ref|XP_010090734.1| hypothetical protein L484_013756 [Morus notabilis]
           gi|587850267|gb|EXB40453.1| hypothetical protein
           L484_013756 [Morus notabilis]
          Length = 306

 Score =  330 bits (847), Expect = 8e-88
 Identities = 162/231 (70%), Positives = 188/231 (81%)
 Frame = -3

Query: 978 SEGDNFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIENVEPISPPGDADEIFKKMKETG 799
           SE D+FLE+FKLG D    +  G+  +    +   K    +P  PP DADEIFKKMKETG
Sbjct: 77  SEDDSFLEKFKLGLDS---SKDGMQEKPRREAARPKPPLPQPPPPPEDADEIFKKMKETG 133

Query: 798 LIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAQKFEDAVRIF 619
           LIPNAVAMLDGLCKDGLVQEAMKLFGLM+EKGTIPEVV+YTAVV+GFCKAQK +DAVRIF
Sbjct: 134 LIPNAVAMLDGLCKDGLVQEAMKLFGLMKEKGTIPEVVIYTAVVDGFCKAQKLDDAVRIF 193

Query: 618 KKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEAGHSPNTATFIGLVDGYCRE 439
           +KMQSNG+ PNAFSY +L+QGLC GKRLED  E C+EMLEAGHSPN ATF+GLVDG C E
Sbjct: 194 RKMQSNGIEPNAFSYSVLVQGLCGGKRLEDGLEFCVEMLEAGHSPNVATFVGLVDGLCEE 253

Query: 438 KGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWEAILGKKASKK 286
           KG+EEAQ VIG +R KGF ++EKAVRE+L+KK  F P +WEAI GKKAS++
Sbjct: 254 KGVEEAQGVIGKLRDKGFLLNEKAVREFLDKKASFSPSVWEAIFGKKASQR 304


>ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citrus clementina]
           gi|557524309|gb|ESR35615.1| hypothetical protein
           CICLE_v10028759mg [Citrus clementina]
          Length = 344

 Score =  321 bits (822), Expect = 6e-85
 Identities = 160/238 (67%), Positives = 192/238 (80%), Gaps = 1/238 (0%)
 Frame = -3

Query: 996 NPSSVRSEGDNFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIENVEPIS-PPGDADEIF 820
           +P  V+S+ +NFL++FKL  D+K  N +    E       +K    EPIS PP +ADEIF
Sbjct: 108 SPDGVQSD-ENFLDQFKLAIDKKPDNPQ--QNESLGERQEQKPNRNEPISEPPQEADEIF 164

Query: 819 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAQKF 640
           KKMKETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVV+YTAVV+GFCKAQKF
Sbjct: 165 KKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKF 224

Query: 639 EDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEAGHSPNTATFIGL 460
           +DA RIF+KMQSNG+ PNAFSY +LIQGL K  +LE+A E CIEMLEAGHSPN  TF+GL
Sbjct: 225 DDAKRIFRKMQSNGIAPNAFSYNLLIQGLYKCNKLEEAVEYCIEMLEAGHSPNVTTFVGL 284

Query: 459 VDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWEAILGKKASKK 286
           VDG CREKG+E+AQ VI  +++KGF +++KAVRE+L+KK PF   +WEAI GKK S+K
Sbjct: 285 VDGLCREKGVEKAQSVIATLKEKGFLVNDKAVREFLDKKAPFSSSVWEAIFGKKTSQK 342


>ref|XP_012462310.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
           isoform X1 [Gossypium raimondii]
           gi|763812905|gb|KJB79757.1| hypothetical protein
           B456_013G065500 [Gossypium raimondii]
          Length = 341

 Score =  320 bits (821), Expect = 8e-85
 Identities = 157/231 (67%), Positives = 188/231 (81%), Gaps = 3/231 (1%)
 Frame = -3

Query: 966 NFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKI---ENVEPISPPGDADEIFKKMKETGL 796
           NFLE+FKLG + K          + V S +E +   E+ E +SPP DADEIFKKMKETGL
Sbjct: 119 NFLEKFKLGLENK---------RERVPSESEAMHRKEHEEKLSPPEDADEIFKKMKETGL 169

Query: 795 IPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAQKFEDAVRIFK 616
           IPNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVV+YTAVV+GFCKA K EDA RIF+
Sbjct: 170 IPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAHKLEDAKRIFR 229

Query: 615 KMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEAGHSPNTATFIGLVDGYCREK 436
           KMQS GV+PNAFSY +LIQGL K K L+DA E C+EM+EAGHSPN  TF+GLVDG C+EK
Sbjct: 230 KMQSKGVIPNAFSYTVLIQGLYKCKHLDDAIEFCLEMVEAGHSPNVTTFVGLVDGLCKEK 289

Query: 435 GLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWEAILGKKASKKS 283
           G+EEA  VIG ++QKGF +++KAVR++L+K+ PF PL+WEAI GKK S+K+
Sbjct: 290 GVEEAVNVIGTLKQKGFLVNDKAVRQFLDKRAPFSPLVWEAIFGKKTSQKA 340


>gb|KDO68436.1| hypothetical protein CISIN_1g047934mg [Citrus sinensis]
          Length = 344

 Score =  320 bits (820), Expect = 1e-84
 Identities = 159/238 (66%), Positives = 192/238 (80%), Gaps = 1/238 (0%)
 Frame = -3

Query: 996 NPSSVRSEGDNFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIENVEPIS-PPGDADEIF 820
           +P  V+S+ +NFL++FKL  D+K GN +    E       +K    EPIS PP +ADEIF
Sbjct: 108 SPDGVQSD-ENFLDQFKLAIDKKPGNPQ--QNESLGQRQEQKPNRNEPISEPPQEADEIF 164

Query: 819 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAQKF 640
           KKMKETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVV+YTAVV+GFCKAQKF
Sbjct: 165 KKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKF 224

Query: 639 EDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEAGHSPNTATFIGL 460
           +DA RIF+KMQSNG+ PNAFSY +LIQGL K  +LE+A E CIEMLEAGHSPN  TF+GL
Sbjct: 225 DDAKRIFRKMQSNGIAPNAFSYNLLIQGLYKCNKLEEAVEYCIEMLEAGHSPNVTTFVGL 284

Query: 459 VDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWEAILGKKASKK 286
           VDG CRE+G+E+AQ VI  +++KGF +++KAVRE+L+KK PF   +WEAI GKK  +K
Sbjct: 285 VDGLCRERGVEKAQSVIATLKEKGFLVNDKAVREFLDKKAPFSSSVWEAIFGKKTLQK 342


>ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like [Citrus sinensis]
          Length = 387

 Score =  320 bits (820), Expect = 1e-84
 Identities = 159/238 (66%), Positives = 192/238 (80%), Gaps = 1/238 (0%)
 Frame = -3

Query: 996 NPSSVRSEGDNFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIENVEPIS-PPGDADEIF 820
           +P  V+S+ +NFL++FKL  D+K GN +    E       +K    EPIS PP +ADEIF
Sbjct: 151 SPDGVQSD-ENFLDQFKLAIDKKPGNPQ--QNESLGQRQEQKPNRNEPISEPPQEADEIF 207

Query: 819 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAQKF 640
           KKMKETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVV+YTAVV+GFCKAQKF
Sbjct: 208 KKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKF 267

Query: 639 EDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEAGHSPNTATFIGL 460
           +DA RIF+KMQSNG+ PNAFSY +LIQGL K  +LE+A E CIEMLEAGHSPN  TF+GL
Sbjct: 268 DDAKRIFRKMQSNGIAPNAFSYNLLIQGLYKCNKLEEAVEYCIEMLEAGHSPNVTTFVGL 327

Query: 459 VDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWEAILGKKASKK 286
           VDG CRE+G+E+AQ VI  +++KGF +++KAVRE+L+KK PF   +WEAI GKK  +K
Sbjct: 328 VDGLCRERGVEKAQSVIATLKEKGFLVNDKAVREFLDKKAPFSSSVWEAIFGKKTLQK 385


>ref|XP_007043123.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao] gi|508707058|gb|EOX98954.1| Pentatricopeptide
            repeat superfamily protein, putative [Theobroma cacao]
          Length = 345

 Score =  318 bits (814), Expect = 5e-84
 Identities = 160/251 (63%), Positives = 193/251 (76%), Gaps = 2/251 (0%)
 Frame = -3

Query: 1029 FPKFN-RVGENGNPSSVRSEGD-NFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIENVE 856
            F  FN +   + N     S+ D NFLE+FKLG D K G       +   ++   + E  E
Sbjct: 97   FQSFNTKFASDPNRKREDSQSDENFLEKFKLGLDNKRGKQPS---DSEAAALLRRKEQEE 153

Query: 855  PISPPGDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYT 676
              SPP DADEIFKKMKETGLIPNAVAMLDGLCKDGL+QEAMKLFG MREKGTIPEVV+YT
Sbjct: 154  KPSPPQDADEIFKKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGSMREKGTIPEVVIYT 213

Query: 675  AVVEGFCKAQKFEDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEA 496
            AVV+GFCKA K +DA RIF+KMQS GV PN+FSY +LIQGL +  +L+DA E C+EMLEA
Sbjct: 214  AVVDGFCKAHKLDDAKRIFRKMQSKGVTPNSFSYIVLIQGLYRCNKLDDAIEFCLEMLEA 273

Query: 495  GHSPNTATFIGLVDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWE 316
            GHSPN  TF+GLVDG C+EKG+EEAQ VIG ++QKGF +++KAVR++L+KK PF PL+WE
Sbjct: 274  GHSPNVTTFVGLVDGLCKEKGVEEAQSVIGTLKQKGFVLNDKAVRQFLDKKAPFSPLVWE 333

Query: 315  AILGKKASKKS 283
            AI GKK S+K+
Sbjct: 334  AIFGKKPSQKT 344


>ref|XP_010270624.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Nelumbo nucifera] gi|720046844|ref|XP_010270625.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g38150-like [Nelumbo nucifera]
          Length = 407

 Score =  316 bits (810), Expect = 2e-83
 Identities = 163/251 (64%), Positives = 193/251 (76%), Gaps = 1/251 (0%)
 Frame = -3

Query: 1032 RFPKFNRVGEN-GNPSSVRSEGDNFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIENVE 856
            +FP     GE  G+P       ++FLE+ +L  +EK  NT     E + +   E     E
Sbjct: 166  QFPNRPMKGEKRGSPLD-----ESFLEKLRL-CEEKKKNTN----ETSPTQVTETDVKAE 215

Query: 855  PISPPGDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYT 676
            P S P DADEIF+KMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGT PEVV+YT
Sbjct: 216  PDSTPQDADEIFRKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTFPEVVIYT 275

Query: 675  AVVEGFCKAQKFEDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEA 496
            AVVEGFCKA+K +DA RIF+KMQ+NG+ PNAFSY + IQGL KGKRLEDA ++C+EMLEA
Sbjct: 276  AVVEGFCKAEKLDDAKRIFRKMQNNGISPNAFSYTVFIQGLYKGKRLEDAIDICVEMLEA 335

Query: 495  GHSPNTATFIGLVDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWE 316
            GHSPN  TF GLVD  CR+KG+EEA+  I  +R+KG+F+DEKA+REYL+KKGPF PLIWE
Sbjct: 336  GHSPNVTTFTGLVDAICRDKGVEEAKSTIERLREKGYFVDEKAIREYLDKKGPFSPLIWE 395

Query: 315  AILGKKASKKS 283
            A+ GKK SK S
Sbjct: 396  AVFGKKNSKLS 406


>gb|KCW55702.1| hypothetical protein EUGRSUZ_I01549 [Eucalyptus grandis]
            gi|629089450|gb|KCW55703.1| hypothetical protein
            EUGRSUZ_I01549 [Eucalyptus grandis]
            gi|629089451|gb|KCW55704.1| hypothetical protein
            EUGRSUZ_I01549 [Eucalyptus grandis]
          Length = 349

 Score =  315 bits (807), Expect = 3e-83
 Identities = 164/258 (63%), Positives = 196/258 (75%), Gaps = 8/258 (3%)
 Frame = -3

Query: 1026 PKFNRVGENGNPSSVRSEGDNFLERFKLGFDEK--------IGNTKGVSGEDNVSSNNEK 871
            P F   G   +PS      D+FLE+FKL FD++           T+    E+ V+SN   
Sbjct: 100  PNFRGEGVRRDPSD-----DSFLEKFKLSFDKRDKPEGDVASATTQPSQEENKVNSNQMA 154

Query: 870  IENVEPISPPGDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPE 691
             E   P+  P DADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKG+IPE
Sbjct: 155  NEGQPPL--PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGSIPE 212

Query: 690  VVVYTAVVEGFCKAQKFEDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCI 511
            VV+YTAVVEGFCKAQKF+DA RIF+KMQ+NG+ PNAFS+ +LIQGL +  RLEDA E C 
Sbjct: 213  VVIYTAVVEGFCKAQKFDDAKRIFRKMQNNGITPNAFSFTVLIQGLYRCDRLEDALEFCQ 272

Query: 510  EMLEAGHSPNTATFIGLVDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFL 331
            EM++AGHSPN  TF+GLV+G C++KG+EEAQ VI  +R+KG+FI+EKAVRE+LEKK PF 
Sbjct: 273  EMIDAGHSPNVMTFVGLVNGVCKQKGVEEAQTVINRLREKGYFINEKAVREFLEKKAPFS 332

Query: 330  PLIWEAILGKKASKKSLF 277
             ++WEAI GKK S  SLF
Sbjct: 333  SMVWEAIFGKKQS-HSLF 349


>ref|XP_010030710.1| PREDICTED: pentatricopeptide repeat-containing protein At5g48910
            [Eucalyptus grandis]
          Length = 1024

 Score =  315 bits (806), Expect = 4e-83
 Identities = 161/253 (63%), Positives = 193/253 (76%), Gaps = 8/253 (3%)
 Frame = -3

Query: 1026 PKFNRVGENGNPSSVRSEGDNFLERFKLGFDEK--------IGNTKGVSGEDNVSSNNEK 871
            P F   G   +PS      D+FLE+FKL FD++           T+    E+ V+SN   
Sbjct: 100  PNFRGEGVRRDPSD-----DSFLEKFKLSFDKRDKPEGDVASATTQPSQEENKVNSNQMA 154

Query: 870  IENVEPISPPGDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPE 691
             E   P+  P DADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKG+IPE
Sbjct: 155  NEGQPPL--PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGSIPE 212

Query: 690  VVVYTAVVEGFCKAQKFEDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCI 511
            VV+YTAVVEGFCKAQKF+DA RIF+KMQ+NG+ PNAFS+ +LIQGL +  RLEDA E C 
Sbjct: 213  VVIYTAVVEGFCKAQKFDDAKRIFRKMQNNGITPNAFSFTVLIQGLYRCDRLEDALEFCQ 272

Query: 510  EMLEAGHSPNTATFIGLVDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFL 331
            EM++AGHSPN  TF+GLV+G C++KG+EEAQ VI  +R+KG+FI+EKAVRE+LEKK PF 
Sbjct: 273  EMIDAGHSPNVMTFVGLVNGVCKQKGVEEAQTVINRLREKGYFINEKAVREFLEKKAPFS 332

Query: 330  PLIWEAILGKKAS 292
             ++WEAI GKK S
Sbjct: 333  SMVWEAIFGKKQS 345


>ref|XP_008235841.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
           [Prunus mume]
          Length = 317

 Score =  309 bits (792), Expect = 2e-81
 Identities = 156/238 (65%), Positives = 184/238 (77%), Gaps = 1/238 (0%)
 Frame = -3

Query: 996 NPSSVRSEGDNFLERFKLGFDEKIGNTKGVSGEDNVSSNNEKIENV-EPISPPGDADEIF 820
           NPS    +  +FLE+ KLG D+               S  EK + V EP  PP +ADEIF
Sbjct: 94  NPSPPLQDS-SFLEKLKLGLDK---------------SKREKPQEVDEPPQPPEEADEIF 137

Query: 819 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAQKF 640
           KKMKETGLIPNAVAMLDGLCKDGLVQ+AMKLFG MREKGTIPEVV+YTAVV+GFCKAQK 
Sbjct: 138 KKMKETGLIPNAVAMLDGLCKDGLVQDAMKLFGSMREKGTIPEVVIYTAVVDGFCKAQKL 197

Query: 639 EDAVRIFKKMQSNGVVPNAFSYGILIQGLCKGKRLEDAYELCIEMLEAGHSPNTATFIGL 460
           EDA RIF+KMQSNG++PNAFSY +LIQGL +  +LEDA E C EMLEAGHSPN ATF+GL
Sbjct: 198 EDAKRIFRKMQSNGIIPNAFSYTVLIQGLYRSNKLEDAVEFCAEMLEAGHSPNVATFVGL 257

Query: 459 VDGYCREKGLEEAQMVIGAMRQKGFFIDEKAVREYLEKKGPFLPLIWEAILGKKASKK 286
           VD  C+E  LEEA+ V+G ++QKG+ ++EKAVRE+L+KK PF P +WEAI GKK S+K
Sbjct: 258 VDTICKENDLEEAESVVGKLKQKGYLVNEKAVREFLDKKAPFSPTVWEAIFGKKKSQK 315


Top