BLASTX nr result
ID: Akebia27_contig00031701
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00031701 (1202 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CBI28420.3| unnamed protein product [Vitis vinifera]             314   4e-83
ref|XP_006443657.1| hypothetical protein CICLE_v10019256mg [Citr...  303   1e-79
ref|XP_002272744.1| PREDICTED: pentatricopeptide repeat-containi...  302   2e-79
gb|EXB41428.1| hypothetical protein L484_007578 [Morus notabilis]    301   5e-79
ref|XP_007020019.1| Pentatricopeptide repeat (PPR) superfamily p...  300   6e-79
emb|CAN66581.1| hypothetical protein VITISV_030261 [Vitis vinifera]  300   6e-79
ref|XP_006844721.1| hypothetical protein AMTR_s00016p00252780 [A...  300   1e-78
emb|CBI30729.3| unnamed protein product [Vitis vinifera]             296   9e-78
ref|XP_004139718.1| PREDICTED: pentatricopeptide repeat-containi...  296   1e-77
ref|XP_007029178.1| Pentatricopeptide repeat (PPR) superfamily p...  296   2e-77
ref|XP_004154482.1| PREDICTED: pentatricopeptide repeat-containi...  295   3e-77
ref|XP_002274514.2| PREDICTED: pentatricopeptide repeat-containi...  294   6e-77
ref|XP_004159154.1| PREDICTED: uncharacterized protein LOC101226...  292   2e-76
ref|XP_004145727.1| PREDICTED: uncharacterized protein LOC101212...  292   2e-76
ref|XP_006395538.1| hypothetical protein EUTSA_v10004197mg [Eutr...  290   1e-75
ref|XP_002320601.2| pentatricopeptide repeat-containing family p...  289   2e-75
ref|XP_002876985.1| pentatricopeptide repeat-containing protein ...  288   4e-75
ref|NP_181820.1| pentatricopeptide repeat-containing protein [Ar...  287   6e-75
ref|XP_002268530.1| PREDICTED: pentatricopeptide repeat-containi...
286 9e-75 ref|XP_002306741.1| pentatricopeptide repeat-containing family p... 286 1e-74 >emb|CBI28420.3| unnamed protein product [Vitis vinifera] Length = 631 Score = 314 bits (805), Expect = 4e-83 Identities = 161/368 (43%), Positives = 242/368 (65%), Gaps = 11/368 (2%) Frame = -3 Query: 1071 CNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQTF 892 C+ +E++QLH ++KT +F+H PF++++++ + S P I+D+ YA SIF + Sbjct: 26 CSAPQEVEQLHAFSLKTAIFNH-PFVSSRLL-----ALYSDP--KINDLGYARSIFDRIQ 77 Query: 891 DSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQLSSIEEG 712 S +NT+I+ + + +LF+ ++H+ LPD FT P V+K CA+L ++EG Sbjct: 78 RRSLIHWNTIIKCYVENQFSHDGIVLFHELVHE---YLPDNFTLPCVIKGCARLGVVQEG 134 Query: 711 EQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCE-----------NVVS 565 +QIH LK FGSD+FVQ SL++MY +CG I+ ARKVF+GM + N+VS Sbjct: 135 KQIHGLALKIGFGSDVFVQGSLVNMYSKCGEIDCARKVFDGMIDKDVVLWNSLIDGNLVS 194 Query: 564 WNSMIDGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGL 385 WN+MI+G++KSGD SA +F +MP ++V+WN +IAGY +A+K+F + G Sbjct: 195 WNAMINGYMKSGDFDSALELFYQMPIWDLVTWNLMIAGYELNGQFMDAVKMFFMMLKLGS 254 Query: 384 RPDEFTMVSIISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIYNASR 205 RP T+VS++SA+S L L G+ +H Y+ ++ F LDG LG +LI+MY+KCG I +A Sbjct: 255 RPSHATLVSVLSAVSGLAVLGKGRWIHSYMEKNGFELDGILGTSLIEMYAKCGCIESALT 314 Query: 204 VFEDIPNKNVGHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGL 25 VF I K VGHWT++IVG IHG A +L LF EM ++G+KPN + FIGVL+AC+HAGL Sbjct: 315 VFRAIQKKKVGHWTAIIVGLGIHGMANHALALFLEMCKTGLKPNAIIFIGVLNACNHAGL 374 Query: 24 VEEGLKHF 1 V++G ++F Sbjct: 375 VDDGRQYF 382 >ref|XP_006443657.1| hypothetical protein CICLE_v10019256mg [Citrus clementina] gi|568853066|ref|XP_006480188.1| PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like [Citrus sinensis] gi|557545919|gb|ESR56897.1| hypothetical protein CICLE_v10019256mg [Citrus clementina] Length = 642 Score = 303 bits (776), Expect = 1e-79 Identities = 164/373 (43%), Positives = 239/373 (64%), Gaps = 19/373 (5%) Frame = -3 Query: 1074 
QCNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQT 895 +C + RE+ Q+H IKTG DP A +I+ C S + D+ YA +F Q Sbjct: 27 KCKSMRELTQVHAHFIKTGQIR-DPLAAAEILRFCAVS-------DLGDLEYAHKVFTQI 78 Query: 894 FDSSSFLYNTLIRALTQV---DQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQLSS 724 + + F YNT+IRA ++ D + A L+FY M+ D +LP+KFTFP VLK+CA+ + Sbjct: 79 REPNCFSYNTIIRAFSECKDDDDSLHALLVFYQMVSDGL-VLPNKFTFPSVLKACAKTAR 137 Query: 723 IEEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCE----------- 577 + EG+Q+H I+K D FV ++L+ MY CG++++A ++F E Sbjct: 138 LREGKQVHGLIVKFGLVYDEFVVSNLVRMYVMCGDMDNAHRLFYKSVVEFGNNGLLLRDT 197 Query: 576 -----NVVSWNSMIDGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKL 412 V+ WN MIDG+V+ G+ ++R +FDEMP R++VSWN +I+GYA+ EA+++ Sbjct: 198 RRQEGYVILWNVMIDGYVRLGNFRASRALFDEMPQRSVVSWNVMISGYAQNGQFREAIEM 257 Query: 411 FLELKNSGLRPDEFTMVSIISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSK 232 FLE++N + P+ T+VS++ AIS LG L LGK VH Y ++ ++ LG+ALIDMYSK Sbjct: 258 FLEMQNGDVCPNYVTLVSVLPAISRLGALELGKWVHLYAEKNAIEINDILGSALIDMYSK 317 Query: 231 CGSIYNASRVFEDIPNKNVGHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGV 52 CGSI NA +VFE IP +N W++MI GFA+HG A+ +L FS M+++GVKP+ V +IG+ Sbjct: 318 CGSIENAIQVFERIPQRNAIAWSAMIGGFAMHGRAQDALDCFSRMEQAGVKPSDVVYIGL 377 Query: 51 LSACSHAGLVEEG 13 LSACSHAGLVEEG Sbjct: 378 LSACSHAGLVEEG 390 Score = 72.0 bits (175), Expect = 5e-10 Identities = 55/266 (20%), Positives = 119/266 (44%), Gaps = 3/266 (1%) Frame = -3 Query: 912 SIFHQTFDSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQ 733 ++F + S +N +I Q Q EA +F M + + P+ T VL + ++ Sbjct: 225 ALFDEMPQRSVVSWNVMISGYAQNGQFREAIEMFLEMQNGD--VCPNYVTLVSVLPAISR 282 Query: 732 LSSIEEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSM 553 L ++E G+ +H + K + + ++LI MY +CG S EN + Sbjct: 283 LGALELGKWVHLYAEKNAIEINDILGSALIDMYSKCG------------SIENAI----- 325 Query: 552 IDGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDE 373 ++F+ +P RN ++W+++I G+A +AL F ++ +G++P + Sbjct: 326 --------------QVFERIPQRNAIAWSAMIGGFAMHGRAQDALDCFSRMEQAGVKPSD 371 Query: 372 
FTMVSIISAISDLGFLSLGKCVHGYILRHEFSLDGGLG--AALIDMYSKCGSIYNASRVF 199 + ++SA S G + G+ + +++ + L+ + ++D+ + G + A + Sbjct: 372 VVYIGLLSACSHAGLVEEGRLMFNHMV-NVTGLEPRIEHYGCMVDLLGRAGLLEEAEELV 430 Query: 198 EDIP-NKNVGHWTSMIVGFAIHGFAE 124 ++P + W +++ HG E Sbjct: 431 LNMPIEPDDVIWKALLGACKTHGNIE 456 >ref|XP_002272744.1| PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Vitis vinifera] Length = 622 Score = 302 bits (773), Expect = 2e-79 Identities = 150/353 (42%), Positives = 227/353 (64%) Frame = -3 Query: 1074 QCNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQT 895 +C+ E++Q+H +KTGL D A+K++ C S S + YA ++F + Sbjct: 27 RCSNMEELRQIHGQMLKTGLIL-DEIPASKLLAFCASPNSGS-------LAYARTVFDRI 78 Query: 894 FDSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQLSSIEE 715 F ++F++NT+IR + +P EA LL++ ML+ + + +TFPF+LK+C+ +S++EE Sbjct: 79 FRPNTFMWNTMIRGYSNSKEPEEALLLYHHMLYH--SVPHNAYTFPFLLKACSSMSALEE 136 Query: 714 GEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMIDGFVK 535 +QIH I+K FGS+++ NSL+++Y + G+I+SAR +F+ + + VSWNSMIDG+ K Sbjct: 137 TQQIHAHIIKMGFGSEIYTTNSLLNVYSKSGDIKSARLLFDQVDQRDTVSWNSMIDGYTK 196 Query: 534 SGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEFTMVSI 355 G+I A +F+ MP RNI+SW S+I+G P EAL LF ++ +G++ D +VS Sbjct: 197 CGEIEMAYEIFNHMPERNIISWTSMISGCVGAGKPKEALNLFHRMQTAGIKLDNVALVST 256 Query: 354 ISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIYNASRVFEDIPNKNV 175 + A +DLG L GK +H YI +HE +D LG LIDMY+KCG + A VF + K V Sbjct: 257 LQACADLGVLDQGKWIHAYIKKHEIEIDPILGCVLIDMYAKCGDLEEAIEVFRKMEEKGV 316 Query: 174 GHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGLVEE 16 WT+MI G+AIHG +L F +MQ +GV+PN +TF G+L+ACSHAGLV E Sbjct: 317 SVWTAMISGYAIHGRGREALEWFMKMQTAGVEPNQMTFTGILTACSHAGLVHE 369 Score = 101 bits (251), Expect = 7e-19 Identities = 67/248 (27%), Positives = 115/248 (46%), Gaps = 37/248 (14%) Frame = -3 Query: 645 IHMYFRCGNIESARKVFEGMSCENVVSWN---SMIDGFV---KSGDIVSARRMFDEMPHR 484 +H+ RC N+E R++ M ++ S + F SG + AR +FD + Sbjct: 22 
LHLLQRCSNMEELRQIHGQMLKTGLILDEIPASKLLAFCASPNSGSLAYARTVFDRIFRP 81 Query: 483 NIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEFTMVSIISAISDLGFLSLGKCVH 304 N WN++I GY+ P+EAL L+ + + + +T ++ A S + L + +H Sbjct: 82 NTFMWNTMIRGYSNSKEPEEALLLYHHMLYHSVPHNAYTFPFLLKACSSMSALEETQQIH 141 Query: 303 GYILRHEF--------------SLDGGLGAA-----------------LIDMYSKCGSIY 217 +I++ F S G + +A +ID Y+KCG I Sbjct: 142 AHIIKMGFGSEIYTTNSLLNVYSKSGDIKSARLLFDQVDQRDTVSWNSMIDGYTKCGEIE 201 Query: 216 NASRVFEDIPNKNVGHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACS 37 A +F +P +N+ WTSMI G G + +L+LF MQ +G+K + V + L AC+ Sbjct: 202 MAYEIFNHMPERNIISWTSMISGCVGAGKPKEALNLFHRMQTAGIKLDNVALVSTLQACA 261 Query: 36 HAGLVEEG 13 G++++G Sbjct: 262 DLGVLDQG 269 Score = 87.0 bits (214), Expect = 1e-14 Identities = 64/271 (23%), Positives = 117/271 (43%), Gaps = 2/271 (0%) Frame = -3 Query: 930 DVNYALSIFHQTFDSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFV 751 ++ A IF+ + + + ++I +P EA LF+ M K+ D Sbjct: 199 EIEMAYEIFNHMPERNIISWTSMISGCVGAGKPKEALNLFHRMQTAGIKL--DNVALVST 256 Query: 750 LKSCAQLSSIEEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENV 571 L++CA L +++G+ IH +I K D + LI MY +CG Sbjct: 257 LQACADLGVLDQGKWIHAYIKKHEIEIDPILGCVLIDMYAKCG----------------- 299 Query: 570 VSWNSMIDGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNS 391 D+ A +F +M + + W ++I+GYA EAL+ F++++ + Sbjct: 300 --------------DLEEAIEVFRKMEEKGVSVWTAMISGYAIHGRGREALEWFMKMQTA 345 Query: 390 GLRPDEFTMVSIISAISDLGFLSLGKCVHGYILR-HEFSLDGGLGAALIDMYSKCGSIYN 214 G+ P++ T I++A S G + K + + R H F ++D+ + G + Sbjct: 346 GVEPNQMTFTGILTACSHAGLVHEAKLLFESMERIHGFKPSIEHYGCMVDLLGRAGLLKE 405 Query: 213 ASRVFEDIPNK-NVGHWTSMIVGFAIHGFAE 124 A + E++P K N W +++ IHG E Sbjct: 406 AEELIENMPVKPNAAIWGALLNACHIHGNLE 436 >gb|EXB41428.1| hypothetical protein L484_007578 [Morus notabilis] Length = 428 Score = 301 bits (770), Expect = 5e-79 Identities = 150/358 (41%), Positives = 226/358 (63%), Gaps = 1/358 (0%) Frame = -3 Query: 1071 CNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIF-HQT 895 C + R+++Q+H I++GL HD 
+ K+++ C +S +++YA +F HQ Sbjct: 32 CTSFRQLKQIHAKIIRSGL-SHDQLLLRKMLQFCSTS---------GNMDYAALVFRHQI 81 Query: 894 FDSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQLSSIEE 715 +F +N +IRA T P +A LLF LM + PDKFTFPFV+K+C S+ Sbjct: 82 PYPLTFTWNLMIRAYTLNASPRQALLLFTLMTS--RGFPPDKFTFPFVIKACTASSAFRP 139 Query: 714 GEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMIDGFVK 535 G+ +H +K F D+FVQN+L+ YF+CG+ S RKVF+ M N+VSW +M+ G V Sbjct: 140 GDAVHGLAIKARFSGDIFVQNTLMDFYFKCGDAHSGRKVFDKMRVRNLVSWTTMVTGLVG 199 Query: 534 SGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEFTMVSI 355 SGD+ +AR +F++MP +N+VSW +I GY + P+EA KLF ++ + P+EFT+VS+ Sbjct: 200 SGDLRAARAIFEQMPAKNVVSWTIMIDGYVEDRQPEEAFKLFRRMQLDNVSPNEFTLVSL 259 Query: 354 ISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIYNASRVFEDIPNKNV 175 + A ++LG L LG+ VH + L++ F LD G ALID YSKCGS+ +A RVF+ + K++ Sbjct: 260 LKACTELGSLKLGRWVHDFALKNGFELDVFFGTALIDTYSKCGSLEDARRVFDKMQAKSI 319 Query: 174 GHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGLVEEGLKHF 1 W SMI +HGF E +L LF+EM+R V+P+ +TF+G+LSAC V + K+F Sbjct: 320 ATWNSMITSLGVHGFGEEALALFAEMERQNVRPDEITFVGILSACLQKNSVSDCRKYF 377 >ref|XP_007020019.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] gi|508725347|gb|EOY17244.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] Length = 646 Score = 300 bits (769), Expect = 6e-79 Identities = 163/380 (42%), Positives = 233/380 (61%), Gaps = 22/380 (5%) Frame = -3 Query: 1074 QCNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQT 895 +C T R++ Q+H + +KTG H DP A +I++ C D++YA +F Q Sbjct: 28 RCKTMRDLHQVHAIVLKTGQIH-DPLAAAEILKFCSLGTH-------RDIDYARKVFRQM 79 Query: 894 FDSSSFLYNTLIRALTQVDQ------PVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQ 733 + + F +NT+IRALT+ D+ P+EA LF M+ D +LP++FTFP VLK+CA+ Sbjct: 80 GEPNCFSWNTIIRALTESDESNETNEPLEALFLFTEMVADGN-VLPNRFTFPSVLKACAR 138 Query: 732 LSSIEEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCE-------- 577 + EGEQ+H ++K F D FV ++L+ +Y CG +E A + M E Sbjct: 139 
TGKLPEGEQVHGLVVKFGFEKDEFVASNLVRVYVMCGAMEEAHILLNKMMVEFENGGKLV 198 Query: 576 --------NVVSWNSMIDGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEA 421 N+V WN MIDG+V+ GD+ +AR +FD+M R+++SWN +I+GYA+ EA Sbjct: 199 RDKRRIEGNIVLWNVMIDGYVRIGDLRTARELFDKMSLRSVISWNVMISGYAQNGYFKEA 258 Query: 420 LKLFLELKNSGLRPDEFTMVSIISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDM 241 +++F ++ +RP+ T+VS++ AIS LG L LGK VH Y ++E +D LG+ALIDM Sbjct: 259 IEMFRLMQIGEVRPNYVTLVSVLPAISRLGALELGKWVHLYAEKNEIEIDDVLGSALIDM 318 Query: 240 YSKCGSIYNASRVFEDIPNKNVGHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTF 61 YSKCGSI A +VFE I N W++MI G A+HG AE +L FS M+ GV P+ V + Sbjct: 319 YSKCGSIDKAVQVFERISKPNTITWSAMIGGLAMHGRAEGALDYFSRMELEGVTPSDVVY 378 Query: 60 IGVLSACSHAGLVEEGLKHF 1 IGVLSACSHAG VEEG F Sbjct: 379 IGVLSACSHAGFVEEGRLFF 398 Score = 67.4 bits (163), Expect = 1e-08 Identities = 59/280 (21%), Positives = 117/280 (41%), Gaps = 4/280 (1%) Frame = -3 Query: 936 IHDVNYALSIFHQTFDSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFP 757 I D+ A +F + S +N +I Q EA +F LM ++ P+ T Sbjct: 221 IGDLRTARELFDKMSLRSVISWNVMISGYAQNGYFKEAIEMFRLM--QIGEVRPNYVTLV 278 Query: 756 FVLKSCAQLSSIEEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCE 577 VL + ++L ++E G+ +H + K D + ++LI MY +CG+ Sbjct: 279 SVLPAISRLGALELGKWVHLYAEKNEIEIDDVLGSALIDMYSKCGS-------------- 324 Query: 576 NVVSWNSMIDGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELK 397 I A ++F+ + N ++W+++I G A + AL F ++ Sbjct: 325 -----------------IDKAVQVFERISKPNTITWSAMIGGLAMHGRAEGALDYFSRME 367 Query: 396 NSGLRPDEFTMVSIISAISDLGFLSLGKCVHGY---ILRHEFSLDGGLGAALIDMYSKCG 226 G+ P + + ++SA S GF+ G+ + ++ E L+ ++D+ + G Sbjct: 368 LEGVTPSDVVYIGVLSACSHAGFVEEGRLFFNHMVNVVGFEPRLEH--YGCMVDLLGRAG 425 Query: 225 SIYNASRVFEDIP-NKNVGHWTSMIVGFAIHGFAEASLHL 109 + A ++P + W +++ +HG E H+ Sbjct: 426 LLKEAEEFILNMPIEPDDVIWKALLGACKMHGNIEMGDHV 465 >emb|CAN66581.1| hypothetical protein VITISV_030261 [Vitis vinifera] Length = 622 Score = 300 bits (769), Expect = 6e-79 Identities = 150/353 (42%), Positives = 226/353 (64%) Frame = -3 Query: 1074 
QCNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQT 895 +C+ E++Q+H +KTGL D A+K++ C S S + YA ++F + Sbjct: 27 RCSNMEELRQIHGQMLKTGLIL-DEIPASKLLAFCASPNSGS-------LAYARTVFDRI 78 Query: 894 FDSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQLSSIEE 715 F ++F++NT+IR + +P EA LL++ ML+ + + +TFPF+LK+C+ +S+ EE Sbjct: 79 FRPNTFMWNTMIRGYSNSKEPEEALLLYHHMLYH--SVPHNAYTFPFLLKACSSMSASEE 136 Query: 714 GEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMIDGFVK 535 +QIH I+K FGS+++ NSL+++Y + G+I+SAR +F+ + + VSWNSMIDG+ K Sbjct: 137 TQQIHAHIIKMGFGSEIYTTNSLLNVYSKSGDIKSARLLFDQVDQRDTVSWNSMIDGYTK 196 Query: 534 SGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEFTMVSI 355 G+I A +F+ MP RNI+SW S+I+G P EAL LF ++ +G++ D +VS Sbjct: 197 CGEIEMAYEIFNHMPERNIISWTSMISGCVGAGKPKEALNLFHRMQTAGIKLDNVALVST 256 Query: 354 ISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIYNASRVFEDIPNKNV 175 + A +DLG L GK +H YI +HE +D LG LIDMY+KCG + A VF + K V Sbjct: 257 LQACADLGVLDQGKWIHAYIKKHEIEIDPILGCVLIDMYAKCGDLEEAIEVFRKMEEKGV 316 Query: 174 GHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGLVEE 16 WT+MI G+AIHG +L F +MQ +GV+PN +TF G+L+ACSHAGLV E Sbjct: 317 SVWTAMISGYAIHGRGREALEWFMKMQTAGVEPNQMTFTGILTACSHAGLVHE 369 Score = 99.0 bits (245), Expect = 4e-18 Identities = 66/248 (26%), Positives = 114/248 (45%), Gaps = 37/248 (14%) Frame = -3 Query: 645 IHMYFRCGNIESARKVFEGMSCENVVSWN---SMIDGFV---KSGDIVSARRMFDEMPHR 484 +H+ RC N+E R++ M ++ S + F SG + AR +FD + Sbjct: 22 LHLLQRCSNMEELRQIHGQMLKTGLILDEIPASKLLAFCASPNSGSLAYARTVFDRIFRP 81 Query: 483 NIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEFTMVSIISAISDLGFLSLGKCVH 304 N WN++I GY+ P+EAL L+ + + + +T ++ A S + + +H Sbjct: 82 NTFMWNTMIRGYSNSKEPEEALLLYHHMLYHSVPHNAYTFPFLLKACSSMSASEETQQIH 141 Query: 303 GYILRHEF--------------SLDGGLGAA-----------------LIDMYSKCGSIY 217 +I++ F S G + +A +ID Y+KCG I Sbjct: 142 AHIIKMGFGSEIYTTNSLLNVYSKSGDIKSARLLFDQVDQRDTVSWNSMIDGYTKCGEIE 201 Query: 216 NASRVFEDIPNKNVGHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACS 37 A +F +P +N+ WTSMI G G + +L+LF MQ +G+K + V + L AC+ 
Sbjct: 202 MAYEIFNHMPERNIISWTSMISGCVGAGKPKEALNLFHRMQTAGIKLDNVALVSTLQACA 261 Query: 36 HAGLVEEG 13 G++++G Sbjct: 262 DLGVLDQG 269 Score = 87.0 bits (214), Expect = 1e-14 Identities = 64/271 (23%), Positives = 117/271 (43%), Gaps = 2/271 (0%) Frame = -3 Query: 930 DVNYALSIFHQTFDSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFV 751 ++ A IF+ + + + ++I +P EA LF+ M K+ D Sbjct: 199 EIEMAYEIFNHMPERNIISWTSMISGCVGAGKPKEALNLFHRMQTAGIKL--DNVALVST 256 Query: 750 LKSCAQLSSIEEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENV 571 L++CA L +++G+ IH +I K D + LI MY +CG Sbjct: 257 LQACADLGVLDQGKWIHAYIKKHEIEIDPILGCVLIDMYAKCG----------------- 299 Query: 570 VSWNSMIDGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNS 391 D+ A +F +M + + W ++I+GYA EAL+ F++++ + Sbjct: 300 --------------DLEEAIEVFRKMEEKGVSVWTAMISGYAIHGRGREALEWFMKMQTA 345 Query: 390 GLRPDEFTMVSIISAISDLGFLSLGKCVHGYILR-HEFSLDGGLGAALIDMYSKCGSIYN 214 G+ P++ T I++A S G + K + + R H F ++D+ + G + Sbjct: 346 GVEPNQMTFTGILTACSHAGLVHEAKLLFESMERIHGFKPSIEHYGCMVDLLGRAGLLKE 405 Query: 213 ASRVFEDIPNK-NVGHWTSMIVGFAIHGFAE 124 A + E++P K N W +++ IHG E Sbjct: 406 AEELIENMPVKPNAAIWGALLNACHIHGNLE 436 >ref|XP_006844721.1| hypothetical protein AMTR_s00016p00252780 [Amborella trichopoda] gi|548847192|gb|ERN06396.1| hypothetical protein AMTR_s00016p00252780 [Amborella trichopoda] Length = 428 Score = 300 bits (767), Expect = 1e-78 Identities = 151/360 (41%), Positives = 232/360 (64%), Gaps = 2/360 (0%) Frame = -3 Query: 1074 QCNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHD-VNYALSIFHQ 898 +C+T+ + Q+H +TGL H D + TK++ C SIH +++A +F+Q Sbjct: 40 KCSTSNHLLQIHAHLFRTGL-HRDYILITKLINLC----------SIHQKIDHATLVFNQ 88 Query: 897 TFDSSSFLYNTLIRALTQVDQPVEAFLLFYLM-LHDPKKILPDKFTFPFVLKSCAQLSSI 721 + +F +NT+IRA + + P EA L++ LM +H LPDKFT+PFV+K+C SS+ Sbjct: 89 IENPLTFTWNTMIRAYFKSNYPEEAILMYNLMVIHG---FLPDKFTYPFVIKACVAFSSL 145 Query: 720 EEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMIDGF 541 E+G++IH +K D+F+QN+L+ +Y +C A K+F+ MS ++VVSW +M+ G Sbjct: 146 
EKGKEIHGRAIKAGMVPDIFLQNTLMELYMKCNEKTLAHKLFDKMSVKSVVSWTTMVAGL 205 Query: 540 VKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEFTMV 361 V GD+ SARR+FDEMP RN+VSW ++I GY R + P EAL+LF+ + + +RP+EFT+V Sbjct: 206 VSHGDMASARRVFDEMPERNVVSWTAMIHGYVRNNQPHEALELFILMLRANVRPNEFTIV 265 Query: 360 SIISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIYNASRVFEDIPNK 181 S++ + L L LG+ VH ++ + F L LG ALIDMYS CGSI +A VF+ + + Sbjct: 266 SLLLVCTSLNSLRLGRWVHEFMAKSGFELSVYLGTALIDMYSNCGSINDAKNVFDGMSER 325 Query: 180 NVGHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGLVEEGLKHF 1 +V W SMI +HG + +L++F M++ V+P+ +TF+GVL AC + GLVEEG +F Sbjct: 326 SVATWNSMITSLGVHGKGKEALNVFGAMEKGKVRPDDITFVGVLCACVNMGLVEEGGVYF 385 >emb|CBI30729.3| unnamed protein product [Vitis vinifera] Length = 506 Score = 296 bits (759), Expect = 9e-78 Identities = 152/353 (43%), Positives = 226/353 (64%), Gaps = 1/353 (0%) Frame = -3 Query: 1056 EIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQTFDSSSF 877 E+ Q H +K+GL H F A++++ S ++ + + YA SIF + + +S+ Sbjct: 22 ELHQAHAHILKSGLIH-STFAASRLIASVSTNSHAQA------IPYAHSIFSRIPNPNSY 74 Query: 876 LYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQLSSIEEGEQIHC 697 ++NT+IRA P A +F+ MLH +LPDK+TF F LKSC S +EEG QIH Sbjct: 75 MWNTIIRAYANSPTPEAALTIFHQMLH--ASVLPDKYTFTFALKSCGSFSGVEEGRQIHG 132 Query: 696 FILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMIDGFVKSGDI-V 520 +LKT G DLF+QN+LIH+Y CG IE AR + + M +VVSWN+++ + + G + + Sbjct: 133 HVLKTGLGDDLFIQNTLIHLYASCGCIEDARHLLDRMLERDVVSWNALLSAYAERGLMEL 192 Query: 519 SARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEFTMVSIISAIS 340 ++RR+F E P +N+VSWN++I GY+ E L LF +++++G++PD T+VS++SA + Sbjct: 193 ASRRVFGETPVKNVVSWNAMITGYSHAGRFSEVLVLFEDMQHAGVKPDNCTLVSVLSACA 252 Query: 339 DLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIYNASRVFEDIPNKNVGHWTS 160 +G LS G+ VH YI ++ S+DG + AL+DMYSKCGSI A VF K++ W S Sbjct: 253 HVGALSQGEWVHAYIDKNGISIDGFVATALVDMYSKCGSIEKALEVFNSCLRKDISTWNS 312 Query: 159 MIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGLVEEGLKHF 1 +I G + HG + +L +FSEM G KPN VTF+ VLSACS 
AGL++EG + F Sbjct: 313 IISGLSTHGSGQHALQIFSEMLVEGFKPNEVTFVCVLSACSRAGLLDEGREMF 365 Score = 109 bits (273), Expect = 2e-21 Identities = 78/288 (27%), Positives = 122/288 (42%), Gaps = 32/288 (11%) Frame = -3 Query: 780 LPDKFTFPFVLKSCAQLSSIEEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARK 601 + F P +L +SI E Q H ILK + LIH F + ++ Sbjct: 1 MSSSFPPPPILSFAEMATSISELHQAHAHILK----------SGLIHSTFAASRLIAS-- 48 Query: 600 VFEGMSCENVVSWNSMIDGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEA 421 VS NS I A +F +P+ N WN++I YA P+ A Sbjct: 49 ----------VSTNSHAQA------IPYAHSIFSRIPNPNSYMWNTIIRAYANSPTPEAA 92 Query: 420 LKLFLELKNSGLRPDEFTMVSIISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDM 241 L +F ++ ++ + PD++T + + + G+ +HG++L+ D + LI + Sbjct: 93 LTIFHQMLHASVLPDKYTFTFALKSCGSFSGVEEGRQIHGHVLKTGLGDDLFIQNTLIHL 152 Query: 240 YSKCGSIYNA--------------------------------SRVFEDIPNKNVGHWTSM 157 Y+ CG I +A RVF + P KNV W +M Sbjct: 153 YASCGCIEDARHLLDRMLERDVVSWNALLSAYAERGLMELASRRVFGETPVKNVVSWNAM 212 Query: 156 IVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGLVEEG 13 I G++ G L LF +MQ +GVKP+ T + VLSAC+H G + +G Sbjct: 213 ITGYSHAGRFSEVLVLFEDMQHAGVKPDNCTLVSVLSACAHVGALSQG 260 >ref|XP_004139718.1| PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like [Cucumis sativus] Length = 642 Score = 296 bits (758), Expect = 1e-77 Identities = 163/377 (43%), Positives = 239/377 (63%), Gaps = 20/377 (5%) Frame = -3 Query: 1071 CNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQTF 892 C T R+++QLH + IKTG DP A ++++ C S + D++YA ++F Q Sbjct: 29 CKTPRDLKQLHAIFIKTGQIQ-DPLTAAEVIKFCAFSSR--------DIDYARAVFRQMP 79 Query: 891 DSSSFLYNTLIRALTQVDQP---VEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQLSSI 721 + + F +NT++R L + + EA +LF ML D + + P++FTFP VLK+CA+ S + Sbjct: 80 EPNCFCWNTILRVLAETNDEHLQSEALMLFSAMLCDGR-VKPNRFTFPSVLKACARASRL 138 Query: 720 EEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVF-------EGMSCE----- 577 EG+QIH I+K F D FV ++L+ MY C +E A +F +G SC+ Sbjct: 139 REGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLFCKNVVDFDG-SCQMELDK 197 Query: 576 
-----NVVSWNSMIDGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKL 412 NVV WN MIDG V+ GDI SA+ +FDEMP R++VSWN +I+GYA+ EA+ L Sbjct: 198 RKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNVMISGYAQNGHFIEAINL 257 Query: 411 FLELKNSGLRPDEFTMVSIISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSK 232 F E+++S + P+ T+VS++ AI+ +G L LGK +H Y +++ +D LG+AL+DMYSK Sbjct: 258 FQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAGKNKIEIDDVLGSALVDMYSK 317 Query: 231 CGSIYNASRVFEDIPNKNVGHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGV 52 CGSI A +VFE +P +N W+++I FA+HG AE ++ F M ++GV PN V +IG+ Sbjct: 318 CGSIDEALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFHLMGKAGVTPNDVAYIGI 377 Query: 51 LSACSHAGLVEEGLKHF 1 LSACSHAGLVEEG F Sbjct: 378 LSACSHAGLVEEGRSFF 394 >ref|XP_007029178.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] gi|508717783|gb|EOY09680.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] Length = 626 Score = 296 bits (757), Expect = 2e-77 Identities = 150/357 (42%), Positives = 233/357 (65%) Frame = -3 Query: 1071 CNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQTF 892 C +++ +H I+T + D F A++++ C + P ++YA IF Q Sbjct: 30 CKNLSQLKIIHGHMIRTHIIF-DIFAASRLISLC-----TDPSFGTALLDYAFKIFSQIE 83 Query: 891 DSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQLSSIEEG 712 + F++N LI+ + P ++F + +L ILPD +FPF++++CAQL S++ G Sbjct: 84 TPNLFIFNALIKGFSACQNPHQSFHFYTQLLR--ANILPDNLSFPFLVRACAQLESLDMG 141 Query: 711 EQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMIDGFVKS 532 Q H I+K F S+++VQNSL+HMY CG+I++A +F+ M+ NVVSW SMI G K Sbjct: 142 IQAHGQIIKHGFESNVYVQNSLVHMYSTCGDIKAANAIFQRMTFLNVVSWTSMIAGLNKV 201 Query: 531 GDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEFTMVSII 352 GD+ AR++FD MP +N+V+W+ +I+GYA+ S ++A++LF L+ G++ +E MVS+I Sbjct: 202 GDVEMARKLFDTMPEKNLVTWSIMISGYAKNSYFEKAVELFQVLQEEGVQANETVMVSVI 261 Query: 351 SAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIYNASRVFEDIPNKNVG 172 S+ + LG + LG+ H YI R+ SL+ LG AL+DMY++CGSI A VFE++P ++V Sbjct: 262 SSCAHLGAIELGEKAHEYIFRNNLSLNVILGTALVDMYARCGSIEKAIGVFEELPERDVL 
321 Query: 171 HWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGLVEEGLKHF 1 WT++I G A+HG+AE +L FSEM +SG+KP ++F VLSACSH GLV +GL+ F Sbjct: 322 SWTALIAGLAMHGYAERALWFFSEMVKSGLKPRDISFTAVLSACSHGGLVGKGLELF 378 Score = 87.8 bits (216), Expect = 8e-15 Identities = 70/276 (25%), Positives = 131/276 (47%), Gaps = 7/276 (2%) Frame = -3 Query: 930 DVNYALSIFHQTFDSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFV 751 D+ A +IF + + + ++I L +V A LF M P+K L T+ + Sbjct: 172 DIKAANAIFQRMTFLNVVSWTSMIAGLNKVGDVEMARKLFDTM---PEKNL---VTWSIM 225 Query: 750 LKSCAQLSSIEEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARK----VFEGMS 583 + A+ S E+ ++ + + ++ V S+I G IE K +F Sbjct: 226 ISGYAKNSYFEKAVELFQVLQEEGVQANETVMVSVISSCAHLGAIELGEKAHEYIFRNNL 285 Query: 582 CENVVSWNSMIDGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLE 403 NV+ +++D + + G I A +F+E+P R+++SW +LIAG A + AL F E Sbjct: 286 SLNVILGTALVDMYARCGSIEKAIGVFEELPERDVLSWTALIAGLAMHGYAERALWFFSE 345 Query: 402 LKNSGLRPDEFTMVSIISAISDLGFLSLGKCVHGYILRHEFSLDGGLG--AALIDMYSKC 229 + SGL+P + + +++SA S G + G + G ++ +F ++ L ++D+ + Sbjct: 346 MVKSGLKPRDISFTAVLSACSHGGLVGKGLELFG-SMKRDFGIEPRLEHYGCVVDLLGRA 404 Query: 228 GSIYNASRVFEDIPNK-NVGHWTSMIVGFAIHGFAE 124 G + A + ++P K N W +++ IH AE Sbjct: 405 GKLAEAEKFVLEMPVKPNAPIWGALLGACRIHRNAE 440 >ref|XP_004154482.1| PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like [Cucumis sativus] Length = 642 Score = 295 bits (754), Expect = 3e-77 Identities = 163/377 (43%), Positives = 239/377 (63%), Gaps = 20/377 (5%) Frame = -3 Query: 1071 CNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQTF 892 C T R+++QLH + IKTG DP A ++++ C S + D++YA ++F Q Sbjct: 29 CKTPRDLKQLHAIFIKTGQIQ-DPLTAAEVIKFCAFSSR--------DIDYARAVFRQMP 79 Query: 891 DSSSFLYNTLIRALTQVDQP---VEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQLSSI 721 + + F +NT++R L + + EA +LF ML D + + P++FTFP VLK+CA+ S + Sbjct: 80 EPNCFCWNTILRILAETNDEHLQSEALMLFSAMLCDGR-VKPNRFTFPSVLKACARASRL 138 Query: 720 EEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVF-------EGMSCE----- 577 EG+QIH I+K F D FV ++L+ MY C +E A 
+F +G SC+ Sbjct: 139 REGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLFCKNVVDFDG-SCQMELDK 197 Query: 576 -----NVVSWNSMIDGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKL 412 NVV WN MIDG V+ GDI SA+ +FDEMP R++VSWN +I+GYA+ EA+ L Sbjct: 198 RKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPPRSVVSWNVMISGYAQNGHFIEAINL 257 Query: 411 FLELKNSGLRPDEFTMVSIISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSK 232 F E+++S + P+ T+VS++ AI+ +G L LGK +H Y +++ +D LG+AL+DMYSK Sbjct: 258 FQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAGKNKVEIDDVLGSALVDMYSK 317 Query: 231 CGSIYNASRVFEDIPNKNVGHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGV 52 CGSI A +VFE +P +N W+++I FA+HG AE ++ F M ++GV PN V +IG+ Sbjct: 318 CGSIDKALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFHLMGKAGVTPNDVAYIGI 377 Query: 51 LSACSHAGLVEEGLKHF 1 LSACSHAGLVEEG F Sbjct: 378 LSACSHAGLVEEGRSFF 394 >ref|XP_002274514.2| PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Vitis vinifera] Length = 616 Score = 294 bits (752), Expect = 6e-77 Identities = 153/360 (42%), Positives = 230/360 (63%), Gaps = 2/360 (0%) Frame = -3 Query: 1074 QCNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQT 895 +C + E++Q+H IKT L +H F ++++ C S S ++YA S+F + Sbjct: 15 KCKSLCELRQIHAQMIKTNLLNHQ-FTVSRLIAFCSLSGVSG------GLDYASSVFSRI 67 Query: 894 FDSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQLSSIEE 715 +SF++ LI+ + PVE+ +L+ ML +F+ P VLK+C +L + +E Sbjct: 68 QHPNSFIFFALIKGFSDTSNPVESLILYARMLSCLNYSSGVEFSIPSVLKACGKLLAFDE 127 Query: 714 GEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMIDGFVK 535 G Q+H +LKT+ D FV NS++ MY G IE AR+VF+ M +VVSWNSMI G++K Sbjct: 128 GRQVHGQVLKTHLWFDPFVGNSMVRMYIDFGEIELARRVFDRMPNRDVVSWNSMIAGYLK 187 Query: 534 SGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEFTMVSI 355 +G+I A+++F+ M +++V+W S+I+ Y + P +AL LF E+ + GLRPD +VS+ Sbjct: 188 AGEIELAKKVFETMSDKDVVTWTSMISAYVQNRCPMKALDLFREMLSLGLRPDGPAIVSV 247 Query: 354 ISAISDLGFLSLGKCVHGYILRHEFSLDGG-LGAALIDMYSKCGSIYNASRVFEDIPN-K 181 +SAI+DLGF+ GK +H Y+ ++ L G +G+ALIDMYSKCG I NA VF I + + Sbjct: 248 
LSAIADLGFVEEGKWLHAYVSMNKIELSSGFIGSALIDMYSKCGYIENAYHVFRSISHRR 307 Query: 180 NVGHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGLVEEGLKHF 1 N+G W SMI G AIHG A +L +F EM+R ++PN +TF+G+LS CSH GLVEEG +F Sbjct: 308 NIGDWNSMISGLAIHGLAREALDIFVEMERMDIEPNEITFLGLLSTCSHGGLVEEGQFYF 367 >ref|XP_004159154.1| PREDICTED: uncharacterized protein LOC101226880 [Cucumis sativus] Length = 1725 Score = 292 bits (748), Expect = 2e-76 Identities = 148/357 (41%), Positives = 225/357 (63%) Frame = -3 Query: 1071 CNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQTF 892 C + ++Q+H I++GL +D + K++ + + + YA+ +F+Q Sbjct: 37 CKNFKHLRQIHAKIIRSGL-SNDQLLTRKLIHLYSTHGR---------IAYAILLFYQIQ 86 Query: 891 DSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQLSSIEEG 712 + +F +N +IRA T +A +L+ M+ + I DKFTFPFV+K+C SI+ G Sbjct: 87 NPCTFTWNLIIRANTINGLSEQALMLYKNMVC--QGIAADKFTFPFVIKACTNFLSIDLG 144 Query: 711 EQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMIDGFVKS 532 + +H ++K F D+FVQN+LI YF+CG+ A KVFE M NVVSW ++I G + Sbjct: 145 KVVHGSLIKYGFSGDVFVQNNLIDFYFKCGHTRFALKVFEKMRVRNVVSWTTVISGLISC 204 Query: 531 GDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEFTMVSII 352 GD+ ARR+FDE+P +N+VSW ++I GY R P+EAL+LF ++ + P+E+TMVS+I Sbjct: 205 GDLQEARRIFDEIPSKNVVSWTAMINGYIRNQQPEEALELFKRMQAENIFPNEYTMVSLI 264 Query: 351 SAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIYNASRVFEDIPNKNVG 172 A +++G L+LG+ +H Y +++ + LG ALIDMYSKCGSI +A VFE +P K++ Sbjct: 265 KACTEMGILTLGRGIHDYAIKNCIEIGVYLGTALIDMYSKCGSIKDAIEVFETMPRKSLP 324 Query: 171 HWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGLVEEGLKHF 1 W SMI +HG + +L+LFSEM+R VKP+ +TFIGVL AC H V+EG +F Sbjct: 325 TWNSMITSLGVHGLGQEALNLFSEMERVNVKPDAITFIGVLCACVHIKNVKEGCAYF 381 Score = 173 bits (438), Expect = 2e-40 Identities = 97/298 (32%), Positives = 163/298 (54%), Gaps = 7/298 (2%) Frame = -3 Query: 873 YNTLIRALTQVDQPVEAFLLFYLMLHDPKKI-----LP-DKFTFPFVLKSCAQLSSIEEG 712 + ++I Q +Q A LLF L + ++ +P D VL +C+++S Sbjct: 1211 WTSMITGYVQNEQADNALLLFKDFLEEETEVEDGNNVPLDSVVMVSVLSACSRVSGKGIT 1270 Query: 
711 EQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMIDGFVKS 532 E +H F++K F + V N+L+ D + K Sbjct: 1271 EGVHGFVVKKGFDGSIGVGNTLM-------------------------------DAYAKC 1299 Query: 531 GDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLEL-KNSGLRPDEFTMVSI 355 G + ++++FD M ++ +SWNS+IA YA+ L EAL++F + ++ G+R + T+ ++ Sbjct: 1300 GQPLVSKKVFDWMEEKDDISWNSMIAVYAQSGLSGEALEVFHGMVRHVGVRYNAVTLSAV 1359 Query: 354 ISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIYNASRVFEDIPNKNV 175 + A + G L GKC+H +++ + + +G ++IDMY KCG + A + F+ + KNV Sbjct: 1360 LLACAHAGALRAGKCIHDQVIKMDLEYNVCVGTSIIDMYCKCGRVEMAKKTFDRMKEKNV 1419 Query: 174 GHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGLVEEGLKHF 1 WT+M+ G+ +HG A+ +L +F +M R+GVKPNY+TF+ VL+ACSHAGLVEEG F Sbjct: 1420 KSWTAMVAGYGMHGRAKEALDIFYKMVRAGVKPNYITFVSVLAACSHAGLVEEGWHWF 1477 Score = 138 bits (348), Expect = 4e-30 Identities = 97/319 (30%), Positives = 160/319 (50%), Gaps = 3/319 (0%) Frame = -3 Query: 960 FQSSPPKSIHDVNYALSIFHQTFDSSSF-LYNTLIRALTQVDQPVEAFLLFYLMLHDPKK 784 F SS + + + + F++ D S+ +N++I L + VEA F + Sbjct: 1080 FPSSRRRPVSLSSNLATWFYKYVDKSNVHSWNSVIADLARGGDSVEALRAFSSLRK--LG 1137 Query: 783 ILPDKFTFPFVLKSCAQLSSIEEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESAR 604 ++P + +FP +KSC+ L + G H F +DLFV ++LI MY +CG ++ AR Sbjct: 1138 LIPTRSSFPCTIKSCSALCDLVSGRMSHQQAFVFGFETDLFVSSALIDMYSKCGQLKDAR 1197 Query: 603 KVFEGMSCENVVSWNSMIDGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDE 424 +F+ + NVVSW SMI G+V++ +A +F ++ L +E Sbjct: 1198 ALFDEIPLRNVVSWTSMITGYVQNEQADNALLLF-------------------KDFLEEE 1238 Query: 423 ALKLFLELKNSGLRP-DEFTMVSIISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALI 247 E+++ P D MVS++SA S + + + VHG++++ F G+G L+ Sbjct: 1239 T-----EVEDGNNVPLDSVVMVSVLSACSRVSGKGITEGVHGFVVKKGFDGSIGVGNTLM 1293 Query: 246 DMYSKCGSIYNASRVFEDIPNKNVGHWTSMIVGFAIHGFAEASLHLFSEMQRS-GVKPNY 70 D Y+KCG + +VF+ + K+ W SMI +A G + +L +F M R GV+ N Sbjct: 1294 DAYAKCGQPLVSKKVFDWMEEKDDISWNSMIAVYAQSGLSGEALEVFHGMVRHVGVRYNA 1353 Query: 69 VTFIGVLSACSHAGLVEEG 13 VT VL AC+HAG + G Sbjct: 1354 VTLSAVLLACAHAGALRAG 1372 Score = 
75.5 bits (184), Expect = 4e-11 Identities = 60/293 (20%), Positives = 122/293 (41%), Gaps = 3/293 (1%) Frame = -3 Query: 909 IFHQTFDSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQL 730 +F + +N++I Q EA +F+ M+ + + T VL +CA Sbjct: 1308 VFDWMEEKDDISWNSMIAVYAQSGLSGEALEVFHGMVRHVG-VRYNAVTLSAVLLACAHA 1366 Query: 729 SSIEEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMI 550 ++ G+ IH ++K + ++ V S+I MY Sbjct: 1367 GALRAGKCIHDQVIKMDLEYNVCVGTSIIDMY---------------------------- 1398 Query: 549 DGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEF 370 K G + A++ FD M +N+ SW +++AGY EAL +F ++ +G++P+ Sbjct: 1399 ---CKCGRVEMAKKTFDRMKEKNVKSWTAMVAGYGMHGRAKEALDIFYKMVRAGVKPNYI 1455 Query: 369 TMVSIISAISDLGFLSLGKCVHGY-ILRHEFSLDGGLG--AALIDMYSKCGSIYNASRVF 199 T VS+++A S G + G H + ++H++ ++ G+ ++D++ + G + A Sbjct: 1456 TFVSVLAACSHAGLVEEGW--HWFNAMKHKYDIEPGIEHYGCMVDLFGRAGCLNEA---- 1509 Query: 198 EDIPNKNVGHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSAC 40 ++ ++R +KP++V + +L AC Sbjct: 1510 ------------------------------YNLIKRMKMKPDFVVWGSLLGAC 1532 >ref|XP_004145727.1| PREDICTED: uncharacterized protein LOC101212001 [Cucumis sativus] Length = 2598 Score = 292 bits (748), Expect = 2e-76 Identities = 148/357 (41%), Positives = 225/357 (63%) Frame = -3 Query: 1071 CNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQTF 892 C + ++Q+H I++GL +D + K++ + + + YA+ +F+Q Sbjct: 37 CKNFKHLRQIHAKIIRSGL-SNDQLLTRKLIHLYSTHGR---------IAYAILLFYQIQ 86 Query: 891 DSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQLSSIEEG 712 + +F +N +IRA T +A +L+ M+ + I DKFTFPFV+K+C SI+ G Sbjct: 87 NPCTFTWNLIIRANTINGLSEQALMLYKNMVC--QGIAADKFTFPFVIKACTNFLSIDLG 144 Query: 711 EQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMIDGFVKS 532 + +H ++K F D+FVQN+LI YF+CG+ A KVFE M NVVSW ++I G + Sbjct: 145 KVVHGSLIKYGFSGDVFVQNNLIDFYFKCGHTRFALKVFEKMRVRNVVSWTTVISGLISC 204 Query: 531 GDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEFTMVSII 352 GD+ ARR+FDE+P +N+VSW ++I GY R P+EAL+LF ++ + P+E+TMVS+I Sbjct: 205 
GDLQEARRIFDEIPSKNVVSWTAMINGYIRNQQPEEALELFKRMQAENIFPNEYTMVSLI 264 Query: 351 SAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIYNASRVFEDIPNKNVG 172 A +++G L+LG+ +H Y +++ + LG ALIDMYSKCGSI +A VFE +P K++ Sbjct: 265 KACTEMGILTLGRGIHDYAIKNCIEIGVYLGTALIDMYSKCGSIKDAIEVFETMPRKSLP 324 Query: 171 HWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGLVEEGLKHF 1 W SMI +HG + +L+LFSEM+R VKP+ +TFIGVL AC H V+EG +F Sbjct: 325 TWNSMITSLGVHGLGQEALNLFSEMERVNVKPDAITFIGVLCACVHIKNVKEGCAYF 381 Score = 173 bits (438), Expect = 2e-40 Identities = 97/298 (32%), Positives = 163/298 (54%), Gaps = 7/298 (2%) Frame = -3 Query: 873 YNTLIRALTQVDQPVEAFLLFYLMLHDPKKI-----LP-DKFTFPFVLKSCAQLSSIEEG 712 + ++I Q +Q A LLF L + ++ +P D VL +C+++S Sbjct: 2084 WTSMITGYVQNEQADNALLLFKDFLEEETEVEDGNNVPLDSVVMVSVLSACSRVSGKGIT 2143 Query: 711 EQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMIDGFVKS 532 E +H F++K F + V N+L+ D + K Sbjct: 2144 EGVHGFVVKKGFDGSIGVGNTLM-------------------------------DAYAKC 2172 Query: 531 GDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLEL-KNSGLRPDEFTMVSI 355 G + ++++FD M ++ +SWNS+IA YA+ L EAL++F + ++ G+R + T+ ++ Sbjct: 2173 GQPLVSKKVFDWMEEKDDISWNSMIAVYAQSGLSGEALEVFHGMVRHVGVRYNAVTLSAV 2232 Query: 354 ISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIYNASRVFEDIPNKNV 175 + A + G L GKC+H +++ + + +G ++IDMY KCG + A + F+ + KNV Sbjct: 2233 LLACAHAGALRAGKCIHDQVIKMDLEYNVCVGTSIIDMYCKCGRVEMAKKTFDRMKEKNV 2292 Query: 174 GHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGLVEEGLKHF 1 WT+M+ G+ +HG A+ +L +F +M R+GVKPNY+TF+ VL+ACSHAGLVEEG F Sbjct: 2293 KSWTAMVAGYGMHGRAKEALDIFYKMVRAGVKPNYITFVSVLAACSHAGLVEEGWHWF 2350 Score = 139 bits (349), Expect = 3e-30 Identities = 96/314 (30%), Positives = 157/314 (50%), Gaps = 3/314 (0%) Frame = -3 Query: 945 PKSIHDVNYALSIFHQTFDSSSF-LYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDK 769 P D + + F++ D S+ +N++I L + VEA F + ++P + Sbjct: 1958 PSGREDHSNLATWFYKYVDKSNVHSWNSVIADLARGGDSVEALRAFSSLRK--LGLIPTR 2015 Query: 768 FTFPFVLKSCAQLSSIEEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEG 589 +FP +KSC+ L + G H F +DLFV 
++LI MY +CG ++ AR +F+ Sbjct: 2016 SSFPCTIKSCSALCDLVSGRMSHQQAFVFGFETDLFVSSALIDMYSKCGQLKDARALFDE 2075 Query: 588 MSCENVVSWNSMIDGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLF 409 + NVVSW SMI G+V++ +A +F ++ L +E Sbjct: 2076 IPLRNVVSWTSMITGYVQNEQADNALLLF-------------------KDFLEEET---- 2112 Query: 408 LELKNSGLRP-DEFTMVSIISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSK 232 E+++ P D MVS++SA S + + + VHG++++ F G+G L+D Y+K Sbjct: 2113 -EVEDGNNVPLDSVVMVSVLSACSRVSGKGITEGVHGFVVKKGFDGSIGVGNTLMDAYAK 2171 Query: 231 CGSIYNASRVFEDIPNKNVGHWTSMIVGFAIHGFAEASLHLFSEMQRS-GVKPNYVTFIG 55 CG + +VF+ + K+ W SMI +A G + +L +F M R GV+ N VT Sbjct: 2172 CGQPLVSKKVFDWMEEKDDISWNSMIAVYAQSGLSGEALEVFHGMVRHVGVRYNAVTLSA 2231 Query: 54 VLSACSHAGLVEEG 13 VL AC+HAG + G Sbjct: 2232 VLLACAHAGALRAG 2245 Score = 94.7 bits (234), Expect = 7e-17 Identities = 65/192 (33%), Positives = 97/192 (50%), Gaps = 10/192 (5%) Frame = -3 Query: 582 CENVVSWNSMIDGFVKSGDIVS--ARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLF 409 C + +++NS++ G + S A + + N+ SWNS+IA AR EAL+ F Sbjct: 1944 CFDGITYNSILFGVPSGREDHSNLATWFYKYVDKSNVHSWNSVIADLARGGDSVEALRAF 2003 Query: 408 LELKNSGLRPDEFTMVSIISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKC 229 L+ GL P + I + S L L G+ H F D + +ALIDMYSKC Sbjct: 2004 SSLRKLGLIPTRSSFPCTIKSCSALCDLVSGRMSHQQAFVFGFETDLFVSSALIDMYSKC 2063 Query: 228 GSIYNASRVFEDIPNKNVGHWTSMIVGFAIHGFAEASLHLFSEM--------QRSGVKPN 73 G + +A +F++IP +NV WTSMI G+ + A+ +L LF + + V + Sbjct: 2064 GQLKDARALFDEIPLRNVVSWTSMITGYVQNEQADNALLLFKDFLEEETEVEDGNNVPLD 2123 Query: 72 YVTFIGVLSACS 37 V + VLSACS Sbjct: 2124 SVVMVSVLSACS 2135 Score = 75.5 bits (184), Expect = 4e-11 Identities = 60/293 (20%), Positives = 122/293 (41%), Gaps = 3/293 (1%) Frame = -3 Query: 909 IFHQTFDSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQL 730 +F + +N++I Q EA +F+ M+ + + T VL +CA Sbjct: 2181 VFDWMEEKDDISWNSMIAVYAQSGLSGEALEVFHGMVRHVG-VRYNAVTLSAVLLACAHA 2239 Query: 729 SSIEEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMI 550 ++ G+ IH ++K + ++ V S+I MY Sbjct: 2240 
GALRAGKCIHDQVIKMDLEYNVCVGTSIIDMY---------------------------- 2271 Query: 549 DGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEF 370 K G + A++ FD M +N+ SW +++AGY EAL +F ++ +G++P+ Sbjct: 2272 ---CKCGRVEMAKKTFDRMKEKNVKSWTAMVAGYGMHGRAKEALDIFYKMVRAGVKPNYI 2328 Query: 369 TMVSIISAISDLGFLSLGKCVHGY-ILRHEFSLDGGLG--AALIDMYSKCGSIYNASRVF 199 T VS+++A S G + G H + ++H++ ++ G+ ++D++ + G + A Sbjct: 2329 TFVSVLAACSHAGLVEEGW--HWFNAMKHKYDIEPGIEHYGCMVDLFGRAGCLNEA---- 2382 Query: 198 EDIPNKNVGHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSAC 40 ++ ++R +KP++V + +L AC Sbjct: 2383 ------------------------------YNLIKRMKMKPDFVVWGSLLGAC 2405 >ref|XP_006395538.1| hypothetical protein EUTSA_v10004197mg [Eutrema salsugineum] gi|557092177|gb|ESQ32824.1| hypothetical protein EUTSA_v10004197mg [Eutrema salsugineum] Length = 453 Score = 290 bits (741), Expect = 1e-75 Identities = 144/357 (40%), Positives = 229/357 (64%) Frame = -3 Query: 1071 CNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQTF 892 C+ +++Q+H I+ L + D + +++ S S+ + YA +F Q Sbjct: 30 CSNFSQLKQIHAKIIRYNLTN-DQLLVRQLI---------SVSSSLGETRYASLVFSQLQ 79 Query: 891 DSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQLSSIEEG 712 S+F +N +IR+L+ D+P EA LLF LML ++ DKFTFPFV+K+C SS+ G Sbjct: 80 SPSTFTWNLMIRSLSVNDKPREALLLFILMLSHQSQL--DKFTFPFVIKACLASSSLRLG 137 Query: 711 EQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMIDGFVKS 532 Q+H +K+ F SD+F QN+L+ +Y +CG + RKVF+ M +VSW +M+ G V + Sbjct: 138 TQVHGLAIKSGFFSDVFFQNTLMDLYLKCGKPDCGRKVFDKMPGRTIVSWTTMLYGLVSN 197 Query: 531 GDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEFTMVSII 352 + SA +F++MP RN+VSW ++I Y + PDEA +LF ++ ++P+EFT+VS++ Sbjct: 198 SQLDSAEIIFNQMPTRNVVSWTAMITAYVKNCRPDEAFQLFRRMQVDEVKPNEFTIVSML 257 Query: 351 SAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIYNASRVFEDIPNKNVG 172 A + LG LS+G+ VH Y ++ F LD LG ALIDMYSKCGS+ +A +VF+ + +K++ Sbjct: 258 QASTQLGSLSMGRWVHDYAHKNGFPLDCFLGTALIDMYSKCGSLQDAWKVFDAMQSKSLA 317 Query: 171 HWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGLVEEGLKHF 1 W 
SMI +HG E +L L+ +M+ +GV+P+ +TF+GVLSAC++ G V++GL++F Sbjct: 318 TWNSMITSLGVHGCGEEALDLYDQMEEAGVEPDAITFVGVLSACANIGNVKDGLRYF 374 >ref|XP_002320601.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550324522|gb|EEE98916.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 629 Score = 289 bits (739), Expect = 2e-75 Identities = 161/363 (44%), Positives = 233/363 (64%), Gaps = 9/363 (2%) Frame = -3 Query: 1074 QCNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQT 895 +C TTR ++Q+H IKTG HH P A ++++ S Q ++ YA F Q Sbjct: 24 RCKTTRHLKQIHAHFIKTGQIHH-PLAAAELLKFLTLSTQ-------REIKYARKFFSQI 75 Query: 894 FDSSSFLYNTLIRALTQVDQP-------VEAFLLFYLMLHDPKKILPDKFTFPFVLKSCA 736 + F +NT+IRAL D +EA L F ML D + P+KFTFP VLK+CA Sbjct: 76 HHPNCFSWNTIIRALADSDDDDLFHVNSLEALLYFSHMLTDGL-VEPNKFTFPCVLKACA 134 Query: 735 QLSSIEEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCE-NVVSWN 559 +L+ IEEG+Q+H F++K SD FV+++L+ +Y CG ++ A +F E NVV WN Sbjct: 135 KLARIEEGKQLHGFVVKLGLVSDEFVRSNLVRVYVMCGAMKDAHVLFYQTRLEGNVVLWN 194 Query: 558 SMIDGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRP 379 MIDG+V+ GD+ ++R +FD MP++++VSWN +I+G A+ EA+++F +++ + P Sbjct: 195 VMIDGYVRMGDLRASRELFDSMPNKSVVSWNVMISGCAQNGHFKEAIEMFHDMQLGDVHP 254 Query: 378 DEFTMVSIISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIYNASRVF 199 + T+VS++ A+S LG + LGK VH + ++E +D LG+ALIDMYSKCGSI A +VF Sbjct: 255 NYVTLVSVLPAVSRLGAIELGKWVHLFAEKNEIEIDDVLGSALIDMYSKCGSIDKAVQVF 314 Query: 198 EDIPN-KNVGHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGLV 22 E I N KN W+++I G A+HG A +L F MQ++GV P+ V +IGVLSACSHAGLV Sbjct: 315 EGIRNKKNPITWSAIIGGLAMHGRARDALDHFWRMQQAGVTPSDVVYIGVLSACSHAGLV 374 Query: 21 EEG 13 EEG Sbjct: 375 EEG 377 >ref|XP_002876985.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297322823|gb|EFH53244.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] Length = 451 Score = 288 bits (736), Expect = 4e-75 Identities = 147/359 (40%), Positives = 231/359 (64%), Gaps = 2/359 (0%) Frame = -3 Query: 1071 CNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQTF 892 C+ +++Q+HT IK L + D + +++ S S + YA +F+Q Sbjct: 30 CSNFSQLKQIHTKIIKHNLTN-DQLLVRQLI---------SVSSSFGETQYASLVFNQLQ 79 Query: 891 DSSSFLYNTLIRALTQVDQPVEAFLLFYLML-HDPKKILPDKFTFPFVLKSCAQLSSIEE 715 S+F +N +IR+L+ +P EA LLF LML H P+ DKFTFPFV+K+C SS+ Sbjct: 80 SPSTFTWNLMIRSLSLNHKPREALLLFILMLSHQPQF---DKFTFPFVIKACLASSSLRL 136 Query: 714 GEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMIDGFVK 535 G Q+H +K F +D+F QN+L+ +YF+CG + RKVF+ M ++VSW +M+ G V Sbjct: 137 GTQVHGLAIKAGFFNDVFFQNTLMDLYFKCGKPDCGRKVFDKMPGRSIVSWTTMLYGLVS 196 Query: 534 SGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEFTMVSI 355 + + SA +F++MP RN+VSW ++I Y + PDEA +LF ++ ++P+EFT+V++ Sbjct: 197 NSQLDSAEIVFNQMPTRNVVSWTAMITAYVKNRRPDEAFQLFRRMQVDDVKPNEFTIVNL 256 Query: 354 ISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIYNASRVFEDIPNKNV 175 + A + LG LS+G+ VH Y ++ F LD LG ALIDMYSKCGS+ +A +VF+ + +K++ Sbjct: 257 LQASTQLGSLSMGRWVHDYAHKNGFVLDCYLGTALIDMYSKCGSLQDARKVFDVMQSKSL 316 Query: 174 GHWTSMIVGFAIHGFAEASLHLFSEM-QRSGVKPNYVTFIGVLSACSHAGLVEEGLKHF 1 W SMI +HG E +L+LF EM + + V+P+ +TF+GVLSAC++ G V++GL++F Sbjct: 317 ATWNSMITSLGVHGCGEEALYLFEEMEEEASVEPDAITFVGVLSACANTGNVKDGLRYF 375 >ref|NP_181820.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75206274|sp|Q9SJG6.1|PP200_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g42920, chloroplastic; Flags: Precursor gi|4512663|gb|AAD21717.1| hypothetical protein [Arabidopsis thaliana] gi|20197867|gb|AAM15291.1| hypothetical protein [Arabidopsis thaliana] gi|110738441|dbj|BAF01146.1| hypothetical protein [Arabidopsis thaliana] gi|330255093|gb|AEC10187.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 559 Score = 287 bits (735), Expect = 6e-75 Identities = 147/358 (41%), Positives = 221/358 (61%) 
Frame = -3 Query: 1074 QCNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQT 895 QC+T RE++Q+H IKTGL D A++++ CC+S D+NYA +F + Sbjct: 34 QCSTMRELKQIHASLIKTGLIS-DTVTASRVLAFCCASPS--------DMNYAYLVFTRI 84 Query: 894 FDSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQLSSIEE 715 + F++NT+IR ++ P A +F ML + P + T+P V K+ +L + Sbjct: 85 NHKNPFVWNTIIRGFSRSSFPEMAISIFIDMLCSSPSVKPQRLTYPSVFKAYGRLGQARD 144 Query: 714 GEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMIDGFVK 535 G Q+H ++K D F++N+++HMY CG + A ++F GM +VV+WNSMI GF K Sbjct: 145 GRQLHGMVIKEGLEDDSFIRNTMLHMYVTCGCLIEAWRIFLGMIGFDVVAWNSMIMGFAK 204 Query: 534 SGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEFTMVSI 355 G I A+ +FDEMP RN VSWNS+I+G+ R +AL +F E++ ++PD FTMVS+ Sbjct: 205 CGLIDQAQNLFDEMPQRNGVSWNSMISGFVRNGRFKDALDMFREMQEKDVKPDGFTMVSL 264 Query: 354 ISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIYNASRVFEDIPNKNV 175 ++A + LG G+ +H YI+R+ F L+ + ALIDMY KCG I VFE P K + Sbjct: 265 LNACAYLGASEQGRWIHEYIVRNRFELNSIVVTALIDMYCKCGCIEEGLNVFECAPKKQL 324 Query: 174 GHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGLVEEGLKHF 1 W SMI+G A +GF E ++ LFSE++RSG++P+ V+FIGVL+AC+H+G V + F Sbjct: 325 SCWNSMILGLANNGFEERAMDLFSELERSGLEPDSVSFIGVLTACAHSGEVHRADEFF 382 >ref|XP_002268530.1| PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like [Vitis vinifera] Length = 631 Score = 286 bits (733), Expect = 9e-75 Identities = 155/368 (42%), Positives = 229/368 (62%), Gaps = 15/368 (4%) Frame = -3 Query: 1071 CNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQTF 892 C T ++++QLH IKT DP A +++ + S D++YA IF Sbjct: 21 CKTMQDLKQLHAQMIKTAQIR-DPLAAAELL-------RFSAVSDHRDLDYARKIFRSMH 72 Query: 891 DSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQLSSIEEG 712 + F YNTLIRAL++ + P +A L+F M+ D + P+ FTFP V K+C + + EG Sbjct: 73 RPNCFSYNTLIRALSESNDPCDALLVFIEMVEDCS-VEPNCFTFPSVFKACGRAERLREG 131 Query: 711 EQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGM----SCE----------- 577 Q+H +K SD FV ++++ MY CG +E A ++F C+ Sbjct: 132 
RQVHGLAVKFGLDSDEFVVSNVVRMYLSCGVMEDAHRLFYRRVFVDGCDGIRDKKRRVDG 191 Query: 576 NVVSWNSMIDGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELK 397 +VV WN MIDG+V+ G++ AR +FDEMP R++VSWN +IAGYA+ EA+++F E++ Sbjct: 192 DVVLWNVMIDGYVRIGELEVARNLFDEMPQRSVVSWNVMIAGYAQSGHFKEAVEVFREMQ 251 Query: 396 NSGLRPDEFTMVSIISAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIY 217 + + P+ T+VS++ A+S LG L LGK VH Y +R+ +D LG+ALIDMY+KCGSI Sbjct: 252 MAEVPPNYVTLVSVLPAMSRLGALELGKWVHLYAVRNNIGVDDVLGSALIDMYAKCGSIE 311 Query: 216 NASRVFEDIPNKNVGHWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACS 37 A +VFE +P +NV W+++I G A+HG A+ +L F +M+R+GV P+ VT+IG+LSACS Sbjct: 312 KALQVFEGLPKRNVVTWSTIIAGLAMHGRAKDTLDHFEDMERAGVMPSDVTYIGLLSACS 371 Query: 36 HAGLVEEG 13 HAGLV EG Sbjct: 372 HAGLVNEG 379 Score = 80.9 bits (198), Expect = 1e-12 Identities = 60/274 (21%), Positives = 126/274 (45%), Gaps = 3/274 (1%) Frame = -3 Query: 936 IHDVNYALSIFHQTFDSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFP 757 I ++ A ++F + S +N +I Q EA +F M ++ P+ T Sbjct: 206 IGELEVARNLFDEMPQRSVVSWNVMIAGYAQSGHFKEAVEVFREM--QMAEVPPNYVTLV 263 Query: 756 FVLKSCAQLSSIEEGEQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCE 577 VL + ++L ++E G+ +H + ++ N G D + ++LI MY +C Sbjct: 264 SVLPAMSRLGALELGKWVHLYAVRNNIGVDDVLGSALIDMYAKC---------------- 307 Query: 576 NVVSWNSMIDGFVKSGDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELK 397 G I A ++F+ +P RN+V+W+++IAG A + L F +++ Sbjct: 308 ---------------GSIEKALQVFEGLPKRNVVTWSTIIAGLAMHGRAKDTLDHFEDME 352 Query: 396 NSGLRPDEFTMVSIISAISDLGFLSLGKCVHGYILRHEFSLDGGLG--AALIDMYSKCGS 223 +G+ P + T + ++SA S G ++ G+ +++R L+ + ++D+ + G Sbjct: 353 RAGVMPSDVTYIGLLSACSHAGLVNEGRWFFDHMVRVS-GLEPRIEHYGCMVDLLGRAGL 411 Query: 222 IYNASRVFEDIPNK-NVGHWTSMIVGFAIHGFAE 124 + + + ++P K + W +++ +HG E Sbjct: 412 LEESEELILNMPIKPDDVIWKALLGACKMHGNVE 445 >ref|XP_002306741.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|222856190|gb|EEE93737.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 509 Score = 286 bits (732), 
Expect = 1e-74 Identities = 142/357 (39%), Positives = 228/357 (63%) Frame = -3 Query: 1071 CNTTREIQQLHTLTIKTGLFHHDPFIATKIVESCCSSFQSSPPKSIHDVNYALSIFHQTF 892 C + +++Q++H IKTGL D A++++ C S D+NYA +F Q Sbjct: 6 CTSMKDLQKIHAQLIKTGLAK-DTIAASRVLAFCTSP--------AGDINYAYLVFTQIR 56 Query: 891 DSSSFLYNTLIRALTQVDQPVEAFLLFYLMLHDPKKILPDKFTFPFVLKSCAQLSSIEEG 712 + + F++NT+IR +Q P A LF M+ P + T+P V K+ AQL EG Sbjct: 57 NPNLFVWNTIIRGFSQSSTPHNAISLFIDMMFTSPTTQPQRLTYPSVFKAYAQLGLAHEG 116 Query: 711 EQIHCFILKTNFGSDLFVQNSLIHMYFRCGNIESARKVFEGMSCENVVSWNSMIDGFVKS 532 Q+H ++K +D F+QN++++MY CG + A+++F+G + +VV+WN+MI G K Sbjct: 117 AQLHGRVIKLGLENDQFIQNTILNMYVNCGFLGEAQRIFDGATGFDVVTWNTMIIGLAKC 176 Query: 531 GDIVSARRMFDEMPHRNIVSWNSLIAGYARESLPDEALKLFLELKNSGLRPDEFTMVSII 352 G+I +RR+FD+M RN VSWNS+I+GY R+ EA++LF ++ G++P EFTMVS++ Sbjct: 177 GEIDKSRRLFDKMLLRNTVSWNSMISGYVRKGRFFEAMELFSRMQEEGIKPSEFTMVSLL 236 Query: 351 SAISDLGFLSLGKCVHGYILRHEFSLDGGLGAALIDMYSKCGSIYNASRVFEDIPNKNVG 172 +A + LG L G+ +H YI+++ F+L+ + A+IDMYSKCGSI A +VF+ P K + Sbjct: 237 NACACLGALRQGEWIHDYIVKNNFALNSIVITAIIDMYSKCGSIDKALQVFKSAPKKGLS 296 Query: 171 HWTSMIVGFAIHGFAEASLHLFSEMQRSGVKPNYVTFIGVLSACSHAGLVEEGLKHF 1 W S+I+G A+ G ++ LFS+++ S +KP++V+FIGVL+AC+HAG+V+ +F Sbjct: 297 CWNSLILGLAMSGRGNEAVRLFSKLESSNLKPDHVSFIGVLTACNHAGMVDRAKDYF 353