BLASTX nr result
ID: Catharanthus22_contig00015340
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00015340
         (1622 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

emb|CBI40732.3| unnamed protein product [Vitis vinifera]             574   e-161
emb|CAN84084.1| hypothetical protein VITISV_018999 [Vitis vinifera]  570   e-160
gb|EMJ11879.1| hypothetical protein PRUPE_ppa023340mg [Prunus pe...  560   e-157
gb|EOY15303.1| Pentatricopeptide repeat (PPR) superfamily protei...  553   e-154
ref|XP_006435387.1| hypothetical protein CICLE_v10000757mg [Citr...  551   e-154
ref|XP_006473809.1| PREDICTED: putative pentatricopeptide repeat...  550   e-154
gb|ESW30549.1| hypothetical protein PHAVU_002G162200g [Phaseolus...  509   e-141
ref|NP_199195.4| pentatricopeptide repeat-containing protein [Ar...  504   e-140
ref|XP_006279800.1| hypothetical protein CARUB_v10027962mg [Caps...  499   e-138
ref|XP_004145547.1| PREDICTED: putative pentatricopeptide repeat...  496   e-137
ref|XP_004486824.1| PREDICTED: putative pentatricopeptide repeat...  496   e-137
ref|XP_006403177.1| hypothetical protein EUTSA_v10003177mg [Eutr...  493   e-136
ref|XP_003597616.1| hypothetical protein MTR_2g100200 [Medicago ...  489   e-135
gb|EXC31210.1| hypothetical protein L484_005635 [Morus notabilis]    482   e-133
ref|XP_002307761.2| hypothetical protein POPTR_0005s26850g [Popu...  473   e-130
ref|XP_002865400.1| pentatricopeptide repeat-containing protein ...  472   e-130
dbj|BAB11311.1| unnamed protein product [Arabidopsis thaliana]       463   e-127
gb|EPS70746.1| hypothetical protein M569_04016, partial [Genlise...  462   e-127
ref|XP_006855725.1| hypothetical protein AMTR_s00044p00153760 [A...
435 e-119 ref|XP_006598344.1| PREDICTED: putative pentatricopeptide repeat... 412 e-112 >emb|CBI40732.3| unnamed protein product [Vitis vinifera] Length = 520 Score = 574 bits (1480), Expect = e-161 Identities = 275/446 (61%), Positives = 350/446 (78%) Frame = +1 Query: 4 KSAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKA 183 K+AIE ALTNV ++LTIDI+++V+NRG+L GEAMV FFNWAVKQP I KD+D+Y+VI+KA Sbjct: 75 KAAIELALTNVGIDLTIDIVSEVINRGNLGGEAMVIFFNWAVKQPTIPKDVDTYNVIIKA 134 Query: 184 LGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERG 363 LGRRK+ + +V +L DM GI P ETL IVMDSFI+ARQVSKAI++F NLE++ + Sbjct: 135 LGRRKFIEFVVKVLKDMHIQGISPNYETLSIVMDSFIKARQVSKAIEMFRNLEEFGGKCD 194 Query: 364 SETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLEA 543 +E+ N+LL+CLCQRSHVG A N MKG +PFN MTYN++IGGWSK+G+I E+ER L+A Sbjct: 195 TESLNVLLQCLCQRSHVGAANLFFNAMKGGIPFNCMTYNIIIGGWSKYGKIGEMERCLKA 254 Query: 544 MVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYNAMISNFTSVGN 723 MV G P+ LT+S+++EGLGRA +IDDAV++F +EE GC+ A VYNA+ISNF S + Sbjct: 255 MVADGFSPNCLTFSHLIEGLGRAGRIDDAVEVFHHMEETGCVPNACVYNALISNFISTRD 314 Query: 724 IDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVTS 903 DE L YYN M+ SNC PNMDTY +LI A LK R+VA+A+E+ DEM+ RG++PTTG +TS Sbjct: 315 FDECLKYYNFMVSSNCDPNMDTYTKLIVAFLKARKVADALEMLDEMVGRGMIPTTGAITS 374 Query: 904 FMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDSG 1083 F+EPLC YGPPHAA++IYKKARKVGC++S++ YK GKCGML ++W +MQ+SG Sbjct: 375 FIEPLCQYGPPHAAMMIYKKARKVGCRISLSAYKLLLMRLSRFGKCGMLLNLWDEMQESG 434 Query: 1084 YSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSRLTCXXXXXXXXXXXXVEMAY 1263 YSSDTEV+EY+INGLC+IG+L+ AVLVMEESL GFCPSRL VEMAY Sbjct: 435 YSSDTEVYEYVINGLCNIGQLDTAVLVMEESLHKGFCPSRLIRSKLNNKLLASNKVEMAY 494 Query: 1264 KLLLKIKVARKYENARTYWRAKGWHF 1341 KL LKIK+AR+ +NAR +WR GWHF Sbjct: 495 KLFLKIKIARQNDNARRFWRGNGWHF 520 >emb|CAN84084.1| hypothetical protein VITISV_018999 [Vitis vinifera] Length = 561 Score = 570 bits (1469), Expect = e-160 Identities = 275/446 (61%), Positives = 347/446 
(77%) Frame = +1 Query: 4 KSAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKA 183 K+AIE ALTNV ++LTIDI+++V NRG+L GEAMV FFNWAVKQP I KD+D+Y+VI+KA Sbjct: 116 KAAIELALTNVGIDLTIDIVSEVXNRGNLGGEAMVXFFNWAVKQPTIPKDVDTYNVIIKA 175 Query: 184 LGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERG 363 LGRRK+ + V +L DM GI P ETL IVMDSFI+ARQVSKAI++F NLE++ + Sbjct: 176 LGRRKFIEFXVXVLKDMHIQGISPNYETLSIVMDSFIKARQVSKAIEMFRNLEEFGGKCD 235 Query: 364 SETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLEA 543 +E+ N+LL+CLCQRSHVG A N MKG +PFN MTYN++IGGWSK+G+I E+ER L+A Sbjct: 236 TESLNVLLQCLCQRSHVGAANLFFNAMKGGIPFNCMTYNIIIGGWSKYGKIGEMERCLKA 295 Query: 544 MVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYNAMISNFTSVGN 723 MV G P+ LT+S+++EGLGRA +IDDAV++F +EE GC+ A VYNA+ISNF S + Sbjct: 296 MVADGFSPNCLTFSHLIEGLGRAGRIDDAVEVFHHMEETGCVPNACVYNALISNFISTRD 355 Query: 724 IDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVTS 903 DE L YYN M+ SNC PNMDTY +LI A LK R+VA+A+E+ DEM+ RG++PTTG +TS Sbjct: 356 FDECLKYYNFMVSSNCDPNMDTYTKLIVAFLKARKVADALEMLDEMVGRGMIPTTGAITS 415 Query: 904 FMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDSG 1083 F+EPLC YGPPHAA++IYKKARKVGC++S++ YK GKCGML ++W +MQ+SG Sbjct: 416 FIEPLCQYGPPHAAMMIYKKARKVGCRISLSAYKLLLMRLSRFGKCGMLLNLWDEMQESG 475 Query: 1084 YSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSRLTCXXXXXXXXXXXXVEMAY 1263 YSSDTEV+EY+INGLC+IG+L+ AVLVMEESL GFCPSRL VEMAY Sbjct: 476 YSSDTEVYEYVINGLCNIGQLDTAVLVMEESLXKGFCPSRLIRSKLNNKLLASNKVEMAY 535 Query: 1264 KLLLKIKVARKYENARTYWRAKGWHF 1341 KL LKIK AR+ +NAR +WR GWHF Sbjct: 536 KLFLKIKXARQNDNARRFWRGNGWHF 561 >gb|EMJ11879.1| hypothetical protein PRUPE_ppa023340mg [Prunus persica] Length = 562 Score = 560 bits (1442), Expect = e-157 Identities = 268/445 (60%), Positives = 341/445 (76%) Frame = +1 Query: 7 SAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKAL 186 +AIE AL N V+L++D++A+VVNRG L EAM+ FFNWA+++P I+K +++YH+ILKAL Sbjct: 118 AAIEHALDNGGVDLSVDVVAQVVNRGGLGAEAMLVFFNWAIRKPTIAKYIETYHIILKAL 177 Query: 187 
GRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERGS 366 GRRK+F HM+ +L M GI P ET+ IVMDSF+RA+ VSKAIQ+F NLE+ LE + Sbjct: 178 GRRKFFTHMMQILHHMRAQGISPNLETISIVMDSFVRAQHVSKAIQMFRNLEEIGLECDT 237 Query: 367 ETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLEAM 546 E+ N+LL+CLCQRSHVG A S LN +KGK+ FN TYN++IGGWS+ GR+SE+ER LEAM Sbjct: 238 ESLNLLLQCLCQRSHVGAANSFLNSVKGKIQFNGNTYNIIIGGWSRHGRVSEIERILEAM 297 Query: 547 VEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYNAMISNFTSVGNI 726 V G DS T+S+ILEGLGRA +IDDAV+IF ++ GCM VYNAMISNF SV N Sbjct: 298 VADGFSADSSTFSFILEGLGRAGRIDDAVEIFDSMKGKGCMPDTRVYNAMISNFISVRNF 357 Query: 727 DEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVTSF 906 DE + YY M ++C PN+DTY +LI+A LK R+VA A+E++DEML RG+VPTTG +TSF Sbjct: 358 DECVRYYKGMSSNSCDPNIDTYTKLIAAFLKARKVAGALEMFDEMLGRGLVPTTGTITSF 417 Query: 907 MEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDSGY 1086 +EPLC YGPP+AA++IYKKARKVGC++S++ YK GKCGML +IW MQ+ GY Sbjct: 418 IEPLCSYGPPYAAMMIYKKARKVGCRISLSAYKLLLMRLSRFGKCGMLLNIWEDMQECGY 477 Query: 1087 SSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSRLTCXXXXXXXXXXXXVEMAYK 1266 +SD EV++Y+INGLC+IG LENAVLVMEESLQ GFCPSRL VE AYK Sbjct: 478 ASDKEVYDYVINGLCNIGHLENAVLVMEESLQKGFCPSRLVYSKLNNKLLASNKVERAYK 537 Query: 1267 LLLKIKVARKYENARTYWRAKGWHF 1341 L LKIK AR+Y+NA+ +WR+KGWHF Sbjct: 538 LFLKIKHARRYDNAQRFWRSKGWHF 562 >gb|EOY15303.1| Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 1 [Theobroma cacao] gi|508723407|gb|EOY15304.1| Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 1 [Theobroma cacao] gi|508723408|gb|EOY15305.1| Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 1 [Theobroma cacao] gi|508723409|gb|EOY15306.1| Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 1 [Theobroma cacao] Length = 562 Score = 553 bits (1424), Expect = e-154 Identities = 269/446 (60%), Positives = 344/446 (77%) Frame = +1 Query: 4 KSAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKA 183 K+AIE AL+NV 
VEL+IDI+AKVVN G+L GEAMV FFNWA+KQP I++D+ SY++I+KA Sbjct: 117 KTAIEHALSNVPVELSIDIIAKVVNIGNLGGEAMVLFFNWAMKQPGIARDIHSYYIIIKA 176 Query: 184 LGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERG 363 LGRRK+FK M++ L DM K GI P ETL IVMDSFIRA++V KAI+ F NLE+ L+R Sbjct: 177 LGRRKFFKFMIETLHDMVKEGIKPDVETLSIVMDSFIRAQRVQKAIETFENLEELGLKRD 236 Query: 364 SETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLEA 543 +++ N+LL+CLC+R+HVG A SL N + GKV FN TYN++I GWSK GR+S++ER L+A Sbjct: 237 TKSLNVLLQCLCRRAHVGAANSLFNAVNGKVKFNCDTYNIMISGWSKLGRVSKIERILKA 296 Query: 544 MVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYNAMISNFTSVGN 723 M+ PD T+SY++EGLGRA +IDDAV+IF ++E GC+ VYNAMISNF SVGN Sbjct: 297 MIADEFTPDCSTFSYLIEGLGRAGRIDDAVEIFDHMKEKGCIPDTRVYNAMISNFISVGN 356 Query: 724 IDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVTS 903 DE + YY +L SN P++DTY +LISA LK + VA+A+E++DEML +GIVPTTG +TS Sbjct: 357 FDECMKYYKGLLNSNSDPDVDTYTKLISAFLKAQNVADALEIFDEMLVQGIVPTTGTLTS 416 Query: 904 FMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDSG 1083 F+EPLC YGPP+AA++ YKKARK GCK+S++ YK GKCGML +IW +MQ+SG Sbjct: 417 FVEPLCSYGPPYAAMMFYKKARKFGCKISLSAYKLLLMRLSRFGKCGMLLNIWDEMQESG 476 Query: 1084 YSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSRLTCXXXXXXXXXXXXVEMAY 1263 ++SD EV+E++INGLC+IG LENAVLVMEE+L+ GFCPSR+ VE AY Sbjct: 477 HTSDMEVYEHVINGLCNIGHLENAVLVMEEALRKGFCPSRVLYSKLNNKLLASNEVEKAY 536 Query: 1264 KLLLKIKVARKYENARTYWRAKGWHF 1341 KL LKIK AR+ ENAR YWRA GWHF Sbjct: 537 KLFLKIKNARRDENARRYWRANGWHF 562 >ref|XP_006435387.1| hypothetical protein CICLE_v10000757mg [Citrus clementina] gi|557537509|gb|ESR48627.1| hypothetical protein CICLE_v10000757mg [Citrus clementina] Length = 551 Score = 551 bits (1420), Expect = e-154 Identities = 264/446 (59%), Positives = 340/446 (76%) Frame = +1 Query: 4 KSAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKA 183 K IE AL NV+V+L++D++ KVVNRG+L GEAMV FFNWA+K PN++KD+ SY+VI+KA Sbjct: 106 KGVIEDALWNVNVDLSLDVVGKVVNRGNLSGEAMVLFFNWAIKHPNVAKDVKSYNVIVKA 165 Query: 184 
LGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERG 363 LGRRK+F M ++L DM+K G+ P ETL IVMDSFIRA QV KAIQ+ LED+ L+ Sbjct: 166 LGRRKFFDFMCNVLSDMAKEGVNPDLETLSIVMDSFIRAGQVYKAIQMLGRLEDFGLKFD 225 Query: 364 SETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLEA 543 +E+ N++L CLCQR HVG A SL N MKGK+ FN MTYN+VI GWSK G++ E+ER L+ Sbjct: 226 AESLNVVLWCLCQRLHVGAASSLFNSMKGKILFNVMTYNIVISGWSKLGQVVEMERVLKE 285 Query: 544 MVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYNAMISNFTSVGN 723 +V G PDSLT+S+++EGLGRA +IDDA+++F ++E GC YNA+ISN+ SVG+ Sbjct: 286 IVAEGFSPDSLTFSFLIEGLGRAGRIDDAIEVFDTMKEKGCGPDTNAYNAVISNYISVGD 345 Query: 724 IDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVTS 903 DE + YY M +NC PNMDTY RLIS LLK R+VA+A+E+++EML+RGIVP+TG +TS Sbjct: 346 FDECMKYYKGMSSNNCEPNMDTYTRLISGLLKSRKVADALEVFEEMLDRGIVPSTGTITS 405 Query: 904 FMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDSG 1083 F+EPLC YGPPHAA+++YKKARKVGCK+S+T YK GKCGML D+W +MQ+SG Sbjct: 406 FLEPLCSYGPPHAAMMMYKKARKVGCKLSLTAYKLLLRRLSGFGKCGMLLDLWHEMQESG 465 Query: 1084 YSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSRLTCXXXXXXXXXXXXVEMAY 1263 Y SD E++EY+I GLC+IG+LENAVLVMEESL+ GFCPSRL +E AY Sbjct: 466 YPSDGEIYEYVIAGLCNIGQLENAVLVMEESLRKGFCPSRLVYSKLSNKLLASNKLESAY 525 Query: 1264 KLLLKIKVARKYENARTYWRAKGWHF 1341 L KIK+AR+ + AR WR+KGWHF Sbjct: 526 NLFRKIKIARQNDYARRLWRSKGWHF 551 >ref|XP_006473809.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820-like [Citrus sinensis] Length = 558 Score = 550 bits (1418), Expect = e-154 Identities = 265/446 (59%), Positives = 339/446 (76%) Frame = +1 Query: 4 KSAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKA 183 K IE AL NV+V+L++D++ KVVNRG+L GEAMV FFNWA+K PN++KD+ SY+VI+KA Sbjct: 113 KGVIEDALWNVNVDLSLDVVGKVVNRGNLSGEAMVLFFNWAIKHPNVAKDVKSYNVIVKA 172 Query: 184 LGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERG 363 LGRRK+F M ++L DM+K G+ P ETL IVMDSFIRA QV KAIQ+ LED+ L+ Sbjct: 173 LGRRKFFDFMCNVLSDMAKEGVNPDLETLSIVMDSFIRAGQVYKAIQMLGRLEDFGLKFD 232 
Query: 364 SETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLEA 543 +E+ N++L CLCQR HVG A SL N MKGKV FN MTYN+VI GWSK G++ E+ER L+ Sbjct: 233 AESLNVVLWCLCQRLHVGAASSLFNSMKGKVLFNVMTYNIVISGWSKLGQVVEMERVLKE 292 Query: 544 MVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYNAMISNFTSVGN 723 +V G PDSLT+S+++EGLGRA +IDDA+++F ++E GC YNA+ISN+ SVG+ Sbjct: 293 IVAEGFSPDSLTFSFLIEGLGRAGRIDDAIEVFDTMKEKGCGPDTNAYNAVISNYISVGD 352 Query: 724 IDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVTS 903 DE + YY M NC PNMDTY RLIS LLK R+VA+A+E+++EML+RGIVP+TG +TS Sbjct: 353 FDECMKYYKGMSSYNCEPNMDTYTRLISGLLKSRKVADALEVFEEMLDRGIVPSTGTITS 412 Query: 904 FMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDSG 1083 F+EPLC YGPPHAA+++YKKARKVGCK+S+T YK GKCGML D+W +MQ+SG Sbjct: 413 FLEPLCSYGPPHAAMMMYKKARKVGCKLSLTAYKLLLRRLSGFGKCGMLLDLWHEMQESG 472 Query: 1084 YSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSRLTCXXXXXXXXXXXXVEMAY 1263 Y SD E++EY+I GLC+IG+LENAVLVMEESL+ GFCPSRL +E AY Sbjct: 473 YPSDGEIYEYVIAGLCNIGQLENAVLVMEESLRKGFCPSRLVYSKLSNKLLASNKLESAY 532 Query: 1264 KLLLKIKVARKYENARTYWRAKGWHF 1341 L KIK+AR+ + AR WR+KGWHF Sbjct: 533 NLFRKIKIARQNDYARRLWRSKGWHF 558 >gb|ESW30549.1| hypothetical protein PHAVU_002G162200g [Phaseolus vulgaris] Length = 549 Score = 509 bits (1311), Expect = e-141 Identities = 244/446 (54%), Positives = 320/446 (71%) Frame = +1 Query: 4 KSAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKA 183 K+AIE+AL+NV ++ ++IL KV+N G+L GE MVTFFNWAVK P I ++ SYHVI+KA Sbjct: 104 KAAIETALSNVGADVDVNILGKVLNNGNLSGEFMVTFFNWAVKLPGIPNEVGSYHVIVKA 163 Query: 184 LGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERG 363 LGRRK+F M+ +LCDM K GI L IV+DSF+RA VS+AIQIF NL+D + R Sbjct: 164 LGRRKFFVFMMGVLCDMRKCGINGDLLLLSIVIDSFVRAGHVSRAIQIFGNLDDLGVRRD 223 Query: 364 SETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLEA 543 +E N+LL CLC RSHVG A S+LN MKGKV F+ TYN+V GGWSK G++ EVER + Sbjct: 224 TEALNVLLSCLCHRSHVGAANSVLNSMKGKVCFDVGTYNVVAGGWSKIGKVGEVERIMRE 283 Query: 544 
MVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYNAMISNFTSVGN 723 M G+ D T+ +++E LGR ++D+AV++F + E C YNAMI NF SVG+ Sbjct: 284 MEVDGVGHDCRTFGFLMESLGRVGRMDEAVEVFCGMREKNCQPDTAAYNAMIFNFVSVGD 343 Query: 724 IDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVTS 903 +E + YY +ML NC P++DT++R+I+ L+VR+VA+A++++DEML RG+VP+ G++T+ Sbjct: 344 FEECIKYYKKMLSDNCEPDLDTFVRIITGFLRVRKVADALQMFDEMLRRGVVPSIGIITT 403 Query: 904 FMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDSG 1083 F++ LC YGPP+AAL+IYKKARK+GC +SM YK GKCG L IW +MQ+ G Sbjct: 404 FIKRLCSYGPPYAALVIYKKARKLGCMISMEAYKILLMRLSEVGKCGTLLSIWEEMQECG 463 Query: 1084 YSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSRLTCXXXXXXXXXXXXVEMAY 1263 YSSD EV+EYII+GLC++G+LENAVLVMEE+L GFCPSRL E AY Sbjct: 464 YSSDLEVYEYIISGLCNVGQLENAVLVMEEALHKGFCPSRLVYSKLSNRLLATEKTERAY 523 Query: 1264 KLLLKIKVARKYENARTYWRAKGWHF 1341 KL LKIK AR ENAR YWR+ GWHF Sbjct: 524 KLFLKIKHARSLENARNYWRSNGWHF 549 >ref|NP_199195.4| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|223635652|sp|P0C8R0.1|PP416_ARATH RecName: Full=Putative pentatricopeptide repeat-containing protein At5g43820 gi|332007631|gb|AED95014.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 546 Score = 504 bits (1298), Expect = e-140 Identities = 234/446 (52%), Positives = 328/446 (73%) Frame = +1 Query: 4 KSAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKA 183 KSAI+ +L+++ + L+IDI+A V+NRG+L GEAMVTFF+WAV++P ++KD+ SY VIL+A Sbjct: 101 KSAIQKSLSSLGIGLSIDIVADVLNRGNLSGEAMVTFFDWAVREPGVTKDVGSYSVILRA 160 Query: 184 LGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERG 363 LGRRK F M+D+L M G+ P E L I MDSF+R V +AI++F E + ++ Sbjct: 161 LGRRKLFSFMMDVLKGMVCEGVNPDLECLTIAMDSFVRVHYVRRAIELFEESESFGVKCS 220 Query: 364 SETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLEA 543 +E+FN LLRCLC+RSHV A S+ N KG +PF+S +YN++I GWSK G + E+E+ L+ Sbjct: 221 TESFNALLRCLCERSHVSAAKSVFNAKKGNIPFDSCSYNIMISGWSKLGEVEEMEKVLKE 280 Query: 544 
MVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYNAMISNFTSVGN 723 MVE G PD L+YS+++EGLGR +I+D+V+IF ++ G + A VYNAMI NF S + Sbjct: 281 MVESGFGPDCLSYSHLIEGLGRTGRINDSVEIFDNIKHKGNVPDANVYNAMICNFISARD 340 Query: 724 IDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVTS 903 DE + YY +ML C PN++TY +L+S L+K R+V++A+E+++EML+RG++PTTG+VTS Sbjct: 341 FDESMRYYRRMLDEECEPNLETYSKLVSGLIKGRKVSDALEIFEEMLSRGVLPTTGLVTS 400 Query: 904 FMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDSG 1083 F++PLC YGPPHAA++IY+K+RK GC++S + YK GKCGML ++W +MQ+SG Sbjct: 401 FLKPLCSYGPPHAAMVIYQKSRKAGCRISESAYKLLLKRLSRFGKCGMLLNVWDEMQESG 460 Query: 1084 YSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSRLTCXXXXXXXXXXXXVEMAY 1263 Y SD EV+EYI++GLC IG LENAVLVMEE+++ GFCP+R E+AY Sbjct: 461 YPSDVEVYEYIVDGLCIIGHLENAVLVMEEAMRKGFCPNRFVYSRLSSKLMASNKTELAY 520 Query: 1264 KLLLKIKVARKYENARTYWRAKGWHF 1341 KL LKIK AR ENAR++WR+ GWHF Sbjct: 521 KLFLKIKKARATENARSFWRSNGWHF 546 >ref|XP_006279800.1| hypothetical protein CARUB_v10027962mg [Capsella rubella] gi|482548504|gb|EOA12698.1| hypothetical protein CARUB_v10027962mg [Capsella rubella] Length = 547 Score = 499 bits (1286), Expect = e-138 Identities = 233/446 (52%), Positives = 324/446 (72%) Frame = +1 Query: 4 KSAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKA 183 K AI+ +L+++ + L+I+I+A VVNRG+L GEAMV+FFNWA+ +P +SKD+DSY VIL+A Sbjct: 102 KFAIQKSLSSLGIGLSIEIVADVVNRGNLSGEAMVSFFNWAICEPGVSKDVDSYCVILRA 161 Query: 184 LGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERG 363 LGRRK+F M+D+L M G+ P L I MDSF + V +AI++F E + + Sbjct: 162 LGRRKFFSFMMDVLRGMLCEGVKPDLRCLTIAMDSFTKVHYVRRAIELFEESESFGVNCN 221 Query: 364 SETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLEA 543 +E+FN LLRCLC+RSHV A S+ N KG +PF+ +TYN++I GWSK G I E+E+ L+ Sbjct: 222 TESFNALLRCLCERSHVTAAKSVFNSKKGNIPFDGLTYNVMISGWSKLGEIEEMEKVLKE 281 Query: 544 MVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYNAMISNFTSVGN 723 MVE G PD L+YS+++EGLGRA +I+D+V+IF ++ G + A VYNAMI NF S + Sbjct: 282 
MVESGFGPDCLSYSHLIEGLGRAGRINDSVEIFDNIKHKGSVPDANVYNAMICNFISARD 341 Query: 724 IDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVTS 903 DE ++YY +ML C PN++TY +L+S L+K R+V++A+E+++EML+RG +PTTG+VTS Sbjct: 342 FDESVMYYRRMLDEECEPNLETYSKLVSGLIKGRKVSDALEIFEEMLSRGFLPTTGLVTS 401 Query: 904 FMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDSG 1083 F++PLC YGPPHAA++IY+K+RK GCK+S + YK GKCGML ++W +MQ+ G Sbjct: 402 FLKPLCSYGPPHAAMVIYQKSRKAGCKISESAYKLLLKRLSRFGKCGMLLNVWDEMQECG 461 Query: 1084 YSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSRLTCXXXXXXXXXXXXVEMAY 1263 Y SD EV+EYI++GLC IG L+NAVLVMEE+++ GFCP+R E+AY Sbjct: 462 YPSDVEVYEYIVDGLCIIGHLDNAVLVMEEAMRKGFCPNRFVYSRLSSKLMASNKTELAY 521 Query: 1264 KLLLKIKVARKYENARTYWRAKGWHF 1341 KL LKIK AR ENAR +WR+ GWHF Sbjct: 522 KLFLKIKKARATENARRFWRSNGWHF 547 >ref|XP_004145547.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820-like [Cucumis sativus] Length = 572 Score = 496 bits (1277), Expect = e-137 Identities = 244/446 (54%), Positives = 320/446 (71%) Frame = +1 Query: 4 KSAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKA 183 K+AIE AL N V L+ D+++KV+N GSL EAMVTFF WA+KQP+I KD SY++ILKA Sbjct: 128 KTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASSYNIILKA 187 Query: 184 LGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERG 363 LGRR +F M+D+L +M++ G+ T E + IV+DS ++ QVSKA+Q F NL++ L+ Sbjct: 188 LGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLKEIGLKCD 247 Query: 364 SETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLEA 543 +ET NILL+C+C+RSHVG A S N KG +PFN MTYN+VIGGWS++GR EVE+ L+A Sbjct: 248 TETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGEVEQMLKA 307 Query: 544 MVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYNAMISNFTSVGN 723 M G PD LT++Y++E LGRA QIDDAVKIF +++E GC + YNAMISNF +G+ Sbjct: 308 MELDGFSPDCLTHTYLIECLGRANQIDDAVKIFDKMDENGCTPDVDAYNAMISNFICIGD 367 Query: 724 IDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVTS 903 D+ L YY +ML + C P+M+TY LI+ LK ++VA+A+E++DEM+ R I+PTTG +TS 
Sbjct: 368 FDQCLTYYERMLSNRCEPDMNTYSNLITGFLKAKKVADALEMFDEMVAR-IIPTTGAITS 426 Query: 904 FMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDSG 1083 F++ C YGPPHAA++IYKKARKVGC++S YK GK GML +IW +MQ+SG Sbjct: 427 FIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESG 486 Query: 1084 YSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSRLTCXXXXXXXXXXXXVEMAY 1263 Y D E +E+ I+ LC G+LENAVLVMEE L+ GF PSR T EMAY Sbjct: 487 YDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLACNRTEMAY 546 Query: 1264 KLLLKIKVARKYENARTYWRAKGWHF 1341 KL LKIKVAR EN + WRAKGWH+ Sbjct: 547 KLWLKIKVARHQENLQRCWRAKGWHY 572 >ref|XP_004486824.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820-like isoform X1 [Cicer arietinum] gi|502081302|ref|XP_004486825.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820-like isoform X2 [Cicer arietinum] Length = 539 Score = 496 bits (1276), Expect = e-137 Identities = 241/446 (54%), Positives = 318/446 (71%) Frame = +1 Query: 4 KSAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKA 183 K+ +E AL+ V V++ DI+ +V+N G+L GEAMVTFFNWA+KQP + D+ +YHVI+KA Sbjct: 98 KTTVEQALSGVCVDVNADIIGRVLNYGNLGGEAMVTFFNWALKQPMVPNDVGTYHVIVKA 157 Query: 184 LGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERG 363 LGRRK+F M+ +L DM GI L IV+DSF+ A VSKAIQ+F NL+D L+R Sbjct: 158 LGRRKFFVFMMQVLNDMRLNGIKADLFMLSIVIDSFVNAGHVSKAIQVFGNLDDLGLDRD 217 Query: 364 SETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLEA 543 +E N+LL CLC+R HVG A S+ N MKGKV FN TYN+V GGWSK GR++E+ER ++ Sbjct: 218 TEALNVLLSCLCRRCHVGAAASVFNSMKGKVIFNVATYNVVAGGWSKSGRVNEIERVMKE 277 Query: 544 MVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYNAMISNFTSVGN 723 M G PD TY++ LEGLGRA ++D+AV++F ++E YNAMI NF S+GN Sbjct: 278 MEVEGFSPDFTTYAFYLEGLGRAGRMDEAVQVFCNMKEKD----TTTYNAMIFNFISIGN 333 Query: 724 IDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVTS 903 DE + YYN+M NC PN+DTY R+I+A L+ R+VA+A+ ++DEML +G+VP TG ++S Sbjct: 334 
FDECMKYYNEMSSDNCEPNIDTYTRMITAFLRTRKVADALLMFDEMLRQGVVPPTGTISS 393 Query: 904 FMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDSG 1083 F++ LC YGPP+AA++IYKKARK+ CK+SM YK GKCG L +W +MQ+ G Sbjct: 394 FIKRLCSYGPPYAAMMIYKKARKLECKISMEAYKLLLMRLSKFGKCGTLLSVWQEMQECG 453 Query: 1084 YSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSRLTCXXXXXXXXXXXXVEMAY 1263 YSSD EV+EYII+GL +IG+LENAVLVMEE+L+ GFCPSRL E AY Sbjct: 454 YSSDIEVYEYIISGLYNIGQLENAVLVMEEALRKGFCPSRLVYSKLSNKLLASDKTERAY 513 Query: 1264 KLLLKIKVARKYENARTYWRAKGWHF 1341 +L LKIK AR +NAR+YWR+ GWHF Sbjct: 514 RLFLKIKHARALKNARSYWRSNGWHF 539 >ref|XP_006403177.1| hypothetical protein EUTSA_v10003177mg [Eutrema salsugineum] gi|557104290|gb|ESQ44630.1| hypothetical protein EUTSA_v10003177mg [Eutrema salsugineum] Length = 541 Score = 493 bits (1268), Expect = e-136 Identities = 232/446 (52%), Positives = 326/446 (73%) Frame = +1 Query: 4 KSAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKA 183 ++A ALT++ ++L+I+ ++ VV+RG+L GEAMVTFF+WA+++P +SKD++SY+VIL+A Sbjct: 104 ETATRKALTSLGIDLSIETVSNVVDRGNLSGEAMVTFFDWAIREPGVSKDVESYYVILRA 163 Query: 184 LGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERG 363 LGRRK+F M D+L +M + P + L I MDSF +AR V +AIQ+F ED+ ++ Sbjct: 164 LGRRKFFSFMTDVLREM----VNPDLKCLIIAMDSFAKARYVRRAIQLFEESEDFGVKCC 219 Query: 364 SETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLEA 543 +E+FN LL+CLC+RSHV A S+ N KGK+PF+ TYN++I GWSK G + E+E+ L+ Sbjct: 220 TESFNALLQCLCERSHVSAASSVFNAKKGKIPFDVCTYNIMISGWSKLGEVGEMEKVLKE 279 Query: 544 MVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYNAMISNFTSVGN 723 MVE G P+ L++SY++EGLGRA +++D+VKIF ++ + A VYNAMI NF + Sbjct: 280 MVESGFVPNGLSFSYLIEGLGRAGRVNDSVKIFDNMD----VPDANVYNAMICNFIFARD 335 Query: 724 IDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVTS 903 DE + YY +ML C PN +TY +L+S L+K R++A+A+E+Y+EML+RGIVPTTG+VTS Sbjct: 336 FDESVRYYRRMLDKGCEPNWETYSKLVSGLIKGRKIADALEIYEEMLSRGIVPTTGLVTS 395 Query: 904 FMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDSG 1083 F++PLC 
YGPPHAA++IY+KARK GC++S + YK GKCGML ++W +MQ+ Sbjct: 396 FLKPLCCYGPPHAAMVIYQKARKAGCRISQSAYKLLLKRLSGFGKCGMLLNVWDEMQECE 455 Query: 1084 YSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSRLTCXXXXXXXXXXXXVEMAY 1263 YSSD EV+EYI++GLC+IG LENAVLVMEE+++ GFCP+R EMAY Sbjct: 456 YSSDVEVYEYIVDGLCNIGHLENAVLVMEEAMRKGFCPNRFVYSRLSNKLMSSRKTEMAY 515 Query: 1264 KLLLKIKVARKYENARTYWRAKGWHF 1341 KL LKIK AR +NAR +WR GWHF Sbjct: 516 KLFLKIKEARLKDNARRFWRRNGWHF 541 >ref|XP_003597616.1| hypothetical protein MTR_2g100200 [Medicago truncatula] gi|124360397|gb|ABN08410.1| Pentatricopeptide repeat [Medicago truncatula] gi|355486664|gb|AES67867.1| hypothetical protein MTR_2g100200 [Medicago truncatula] Length = 527 Score = 489 bits (1260), Expect = e-135 Identities = 239/446 (53%), Positives = 317/446 (71%) Frame = +1 Query: 4 KSAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKA 183 K+AIE AL+NV +++ +DI+ KV+N G+L GEAMV FFNWA+KQP + +D+ SYHVI+KA Sbjct: 86 KAAIEQALSNVCIDVNVDIIGKVLNFGNLGGEAMVMFFNWALKQPMVPRDVGSYHVIVKA 145 Query: 184 LGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERG 363 LGRRK+F M+ +L +M GI L IV+DSF+ A VSKAIQ+F NL+D L R Sbjct: 146 LGRRKFFVFMMQVLDEMRLNGIKADLLMLSIVIDSFVNAGHVSKAIQLFGNLDDLGLCRD 205 Query: 364 SETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLEA 543 +E N+LL CLC+R HVG A S+ N MKGKV FN TYN+V+GGWSK GR++E+E+ ++ Sbjct: 206 TEVLNVLLSCLCRRCHVGAAASVFNSMKGKVSFNVDTYNVVVGGWSKLGRVNEIEKVMKE 265 Query: 544 MVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYNAMISNFTSVGN 723 M G PD T ++ LEGLGRA ++D+AV++F ++E +YNAMI NF S+G+ Sbjct: 266 MEVEGFSPDFNTLAFFLEGLGRAGRMDEAVEVFGSMKEKD----TAIYNAMIFNFISIGD 321 Query: 724 IDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVTS 903 D + YYN ML NC PN+ TY R+I+A L+ R+VA+A+ ++DEML +G+VP TG +TS Sbjct: 322 FDGFMKYYNGMLSDNCEPNIHTYSRMITAFLRTRKVADALLMFDEMLRQGVVPPTGTITS 381 Query: 904 FMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDSG 1083 F++ LC YGPP+AA++IYKK RK+ CK+SM YK GKCG L +W +MQ+ G Sbjct: 382 
FIKQLCSYGPPYAAMMIYKKTRKLECKISMEAYKILLMRLSKFGKCGSLLSVWQEMQECG 441 Query: 1084 YSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSRLTCXXXXXXXXXXXXVEMAY 1263 YSSD EV+EYII+GL +IG+LENAVLVMEE+L+ GFCPSRL E AY Sbjct: 442 YSSDVEVYEYIISGLYNIGQLENAVLVMEEALRKGFCPSRLVYSKLSNKLLASNLTERAY 501 Query: 1264 KLLLKIKVARKYENARTYWRAKGWHF 1341 +L LKIK AR +NAR+YWR GWHF Sbjct: 502 RLFLKIKHARSLKNARSYWRDNGWHF 527 >gb|EXC31210.1| hypothetical protein L484_005635 [Morus notabilis] Length = 591 Score = 482 bits (1241), Expect = e-133 Identities = 244/444 (54%), Positives = 316/444 (71%), Gaps = 1/444 (0%) Frame = +1 Query: 4 KSAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKA 183 K+AIE ALT+V VEL ++++ KVVNRG+LD + MV FFNWA++QP ISKD+D+YH+ILKA Sbjct: 120 KTAIEQALTDVDVELNVEVVGKVVNRGNLDDKKMVMFFNWAIRQPTISKDIDTYHIILKA 179 Query: 184 LGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERG 363 LGRRK+ MV++L + G+ P ETL IVMDS +RARQVSKAI+ F NL++ L+ Sbjct: 180 LGRRKFLNCMVEVLHQLRIEGVNPNLETLEIVMDSLVRARQVSKAIRTFRNLDELGLDCD 239 Query: 364 SETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLEA 543 +E+ N+LL CLC+RSHVG A SLL+ MKGK+PFN TYN+V+ GW +FGR+ E+ER LE Sbjct: 240 TESLNVLLECLCRRSHVGAANSLLHSMKGKIPFNGATYNIVMSGWCRFGRVGEMERILEM 299 Query: 544 MVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEM-GCMLTAEVYNAMISNFTSVG 720 MV G+DPD T S ++EGLGRA +IDDAVKIF +++E G + + VYNAMISN+ +VG Sbjct: 300 MVGDGIDPDGSTVSNLIEGLGRAGRIDDAVKIFEDMKEKNGWVPDSSVYNAMISNYIAVG 359 Query: 721 NIDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVT 900 + DE + YYN ML S C P++DTY +LI A LKVRRVA+A+EL+DEML+RG+VP+TG VT Sbjct: 360 DCDECVKYYNSMLSSACEPSIDTYTKLIGAFLKVRRVADALELFDEMLDRGVVPSTGTVT 419 Query: 901 SFMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDS 1080 SF+EPLC YGPPHAA+++YKKA+KVGC++S++ YK Sbjct: 420 SFIEPLCSYGPPHAAMMVYKKAKKVGCRISLSAYK------------------------- 454 Query: 1081 GYSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSRLTCXXXXXXXXXXXXVEMA 1260 ++ L G+LENAVLVMEE L+ GFCPSRL C VE+A Sbjct: 455 
----------LLLIRLSRFGQLENAVLVMEECLRKGFCPSRLICSKLNNKLLALNKVEIA 504 Query: 1261 YKLLLKIKVARKYENARTYWRAKG 1332 YKL LK+K AR +NAR YWRAKG Sbjct: 505 YKLFLKLKDARLEDNARRYWRAKG 528 >ref|XP_002307761.2| hypothetical protein POPTR_0005s26850g [Populus trichocarpa] gi|550339816|gb|EEE94757.2| hypothetical protein POPTR_0005s26850g [Populus trichocarpa] Length = 398 Score = 473 bits (1216), Expect = e-130 Identities = 223/368 (60%), Positives = 287/368 (77%), Gaps = 1/368 (0%) Frame = +1 Query: 103 MVTFFNWAVKQPNISKDLDSYHVILKALGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVM 282 M+ FFNWA+KQP ISKD+DSY+V+++ALGRRK+ MV L ++ G+ SET IV+ Sbjct: 1 MIMFFNWAIKQPMISKDVDSYNVVIRALGRRKFIDFMVKFLHELRVEGVSMNSETFSIVI 60 Query: 283 DSFIRARQVSKAIQIFYNLED-YALERGSETFNILLRCLCQRSHVGTACSLLNKMKGKVP 459 DS +RAR+V KAIQ+F NLE+ + ER +E+ N+LL+CLC+RSHVG A S N +KGK+P Sbjct: 61 DSLVRARRVYKAIQMFGNLEEEFGFERDAESLNVLLQCLCRRSHVGAANSYFNSVKGKIP 120 Query: 460 FNSMTYNLVIGGWSKFGRISEVERNLEAMVEIGLDPDSLTYSYILEGLGRAEQIDDAVKI 639 FN MTYN++IGGWSKFGR+SE++R E M E G PD L++SY+LEGLGRA +I+DAV I Sbjct: 121 FNCMTYNVIIGGWSKFGRVSEMQRVFEEMEEDGFSPDCLSFSYLLEGLGRAGKIEDAVMI 180 Query: 640 FRELEEMGCMLTAEVYNAMISNFTSVGNIDEGLIYYNQMLRSNCSPNMDTYIRLISALLK 819 F LEE GC+ VYNAMISNF SVGN DE + YY +L NC PN+DTY R+IS L+K Sbjct: 181 FGSLEEKGCVPDTNVYNAMISNFISVGNFDECMKYYRCLLSKNCDPNIDTYTRMISGLIK 240 Query: 820 VRRVAEAIELYDEMLNRGIVPTTGVVTSFMEPLCGYGPPHAALIIYKKARKVGCKVSMTT 999 +VA+A+E++DEML+RG+V TG VTSF+EPLC +GPPHAA++IY KARKVGCK+S++ Sbjct: 241 ASKVADALEMFDEMLDRGMVTKTGTVTSFIEPLCSFGPPHAAMVIYTKARKVGCKISLSA 300 Query: 1000 YKXXXXXXXXXGKCGMLFDIWAQMQDSGYSSDTEVFEYIINGLCSIGKLENAVLVMEESL 1179 YK GKCGM+ IW +MQ+SGYSSD EV+EY+I+GLC+IG+ ENAVLVMEES+ Sbjct: 301 YKLLLMRLSRFGKCGMMLKIWDEMQESGYSSDMEVYEYLISGLCNIGQFENAVLVMEESM 360 Query: 1180 QMGFCPSR 1203 + GFCPSR Sbjct: 361 RKGFCPSR 368 >ref|XP_002865400.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297311235|gb|EFH41659.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] Length = 675 Score = 472 bits (1215), Expect = e-130 Identities = 218/400 (54%), Positives = 307/400 (76%) Frame = +1 Query: 4 KSAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKA 183 KSAI++ L+++ ++L+IDI++ V+NRG+L GEAMVTFFNWA+++P +SKD+DSY VIL+A Sbjct: 96 KSAIQNCLSSLGIDLSIDIVSDVLNRGNLSGEAMVTFFNWAIREPGVSKDVDSYCVILRA 155 Query: 184 LGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERG 363 LGRRK+F M+D+L M G+ P L I MDSF+RA V +AI++F E Y ++ Sbjct: 156 LGRRKFFSFMMDVLRGMVCEGVNPDLRCLTIAMDSFVRAHYVRRAIELFEESESYGVKCS 215 Query: 364 SETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLEA 543 +E+FN LLRCLC+RSHV A S+ N KGK+PF+S +YN++I GWSK G I +E+ L+ Sbjct: 216 TESFNALLRCLCERSHVSAANSVFNAKKGKIPFDSCSYNIMISGWSKLGEIEGMEKVLKE 275 Query: 544 MVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYNAMISNFTSVGN 723 MVE G PD L+YS+++EGLGRA +I+D+V+IF ++ G +L A VYNAMI NF S + Sbjct: 276 MVEGGFVPDCLSYSHLIEGLGRAGRINDSVEIFDNMKHKGSVLDANVYNAMICNFISARD 335 Query: 724 IDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVTS 903 DE + YY +ML C PN++TY +L+S L+K R+V++A+E+++EML+RGI+PTTG+VTS Sbjct: 336 FDESMRYYRRMLDEECEPNLETYSKLVSGLIKGRKVSDALEIFEEMLSRGILPTTGLVTS 395 Query: 904 FMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDSG 1083 F++PLC YGPPHAA++IY+K+RK GC++S + YK GKCGML ++W +MQ+ G Sbjct: 396 FLKPLCSYGPPHAAMVIYQKSRKAGCRISESAYKLLLKRLSRFGKCGMLLNVWDEMQECG 455 Query: 1084 YSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSR 1203 Y SD EV+EYI++GLC IG LENAVLVMEE+++ GFCP+R Sbjct: 456 YPSDVEVYEYIVDGLCIIGHLENAVLVMEEAMRKGFCPNR 495 >dbj|BAB11311.1| unnamed protein product [Arabidopsis thaliana] Length = 680 Score = 463 bits (1191), Expect = e-127 Identities = 213/400 (53%), Positives = 303/400 (75%) Frame = +1 Query: 4 KSAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKA 183 KSAI+ +L+++ + L+IDI+A V+NRG+L GEAMVTFF+WAV++P ++KD+ SY VIL+A Sbjct: 101 KSAIQKSLSSLGIGLSIDIVADVLNRGNLSGEAMVTFFDWAVREPGVTKDVGSYSVILRA 160 Query: 184 LGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERG 363 
LGRRK F M+D+L M G+ P E L I MDSF+R V +AI++F E + ++ Sbjct: 161 LGRRKLFSFMMDVLKGMVCEGVNPDLECLTIAMDSFVRVHYVRRAIELFEESESFGVKCS 220 Query: 364 SETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLEA 543 +E+FN LLRCLC+RSHV A S+ N KG +PF+S +YN++I GWSK G + E+E+ L+ Sbjct: 221 TESFNALLRCLCERSHVSAAKSVFNAKKGNIPFDSCSYNIMISGWSKLGEVEEMEKVLKE 280 Query: 544 MVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYNAMISNFTSVGN 723 MVE G PD L+YS+++EGLGR +I+D+V+IF ++ G + A VYNAMI NF S + Sbjct: 281 MVESGFGPDCLSYSHLIEGLGRTGRINDSVEIFDNIKHKGNVPDANVYNAMICNFISARD 340 Query: 724 IDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVTS 903 DE + YY +ML C PN++TY +L+S L+K R+V++A+E+++EML+RG++PTTG+VTS Sbjct: 341 FDESMRYYRRMLDEECEPNLETYSKLVSGLIKGRKVSDALEIFEEMLSRGVLPTTGLVTS 400 Query: 904 FMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDSG 1083 F++PLC YGPPHAA++IY+K+RK GC++S + YK GKCGML ++W +MQ+SG Sbjct: 401 FLKPLCSYGPPHAAMVIYQKSRKAGCRISESAYKLLLKRLSRFGKCGMLLNVWDEMQESG 460 Query: 1084 YSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSR 1203 Y SD EV+EYI++GLC IG LENAVLVMEE+++ GFCP+R Sbjct: 461 YPSDVEVYEYIVDGLCIIGHLENAVLVMEEAMRKGFCPNR 500 >gb|EPS70746.1| hypothetical protein M569_04016, partial [Genlisea aurea] Length = 471 Score = 462 bits (1190), Expect = e-127 Identities = 226/448 (50%), Positives = 316/448 (70%), Gaps = 2/448 (0%) Frame = +1 Query: 4 KSAIESALTNVSVELTIDILAKVVNRGSLDGEAMVTFFNWAVKQPNISKDLDSYHVILKA 183 KSA+ +AL V VEL ++L KV+N G+L G+++V FFNWA+ QPN+S+ + Y+ +KA Sbjct: 27 KSAVFNALNGVQVELNDELLVKVMNEGNLSGDSIVLFFNWALDQPNVSEKVSIYNSTIKA 86 Query: 184 LGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALERG 363 LG+RK+FKHM+ +L M I P ++TLFIVM+S++RARQVSKA++IF LE Y + Sbjct: 87 LGKRKFFKHMMQVLNGMKDKAISPNADTLFIVMNSYLRARQVSKAVKIFGELEKYGFQSN 146 Query: 364 SETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMT-YNLVIGGWSKFGRISEVERNLE 540 S T ++ L CL + S+VG ACSL NK++ K + YN++IGGWSK GRIS+VER ++ Sbjct: 147 SGTISVALNCLSRHSYVGAACSLFNKLRQKSRQRDCSIYNIMIGGWSKMGRISQVERLVK 206 Query: 541 
AMVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMG-CMLTAEVYNAMISNFTSV 717 MV+ G+DPD +TYS+++E GRA ++D A++IF+ LEE G L+ EVYNA+I + + Sbjct: 207 LMVDDGVDPDCITYSHVIEVFGRAGRVDSAIEIFKHLEEKGNSTLSPEVYNAVIFSCLAN 266 Query: 718 GNIDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVV 897 DEGL YY +M R PN TY +IS+LL++R+VA+AIE++DEM+++ ++P+ G + Sbjct: 267 DKADEGLKYYEEMQRKGFDPNAKTYTGVISSLLRIRKVADAIEMFDEMVSKEMIPSAGTL 326 Query: 898 TSFMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQD 1077 T F+EPLC YGPPHAAL+IY +ARK GC VS + YK GKCGML IW ++ Sbjct: 327 TKFIEPLCRYGPPHAALMIYSRARKAGCLVSDSAYKLLLMRLGRFGKCGMLLKIW---EE 383 Query: 1078 SGYSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSRLTCXXXXXXXXXXXXVEM 1257 SGY D +V+EY+INGLC++GKLE AV+VME+ +Q G P ++ C VE+ Sbjct: 384 SGYPCDLKVYEYMINGLCNVGKLETAVVVMEDCVQRGLFPGKIICSKLKNKLMSSGKVEI 443 Query: 1258 AYKLLLKIKVARKYENARTYWRAKGWHF 1341 AYKL LK++ AR ENAR YWR+KGWHF Sbjct: 444 AYKLFLKLRNARAEENARRYWRSKGWHF 471 >ref|XP_006855725.1| hypothetical protein AMTR_s00044p00153760 [Amborella trichopoda] gi|548859512|gb|ERN17192.1| hypothetical protein AMTR_s00044p00153760 [Amborella trichopoda] Length = 413 Score = 435 bits (1119), Expect = e-119 Identities = 206/413 (49%), Positives = 288/413 (69%) Frame = +1 Query: 103 MVTFFNWAVKQPNISKDLDSYHVILKALGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVM 282 MVTFF+WA+ QP+ KDL +Y+++L++LGRRKYF HM +L M+K G P+ ET+ IVM Sbjct: 1 MVTFFSWAITQPSCPKDLQNYNILLRSLGRRKYFDHMERVLHHMNKEGPKPSLETMLIVM 60 Query: 283 DSFIRARQVSKAIQIFYNLEDYALERGSETFNILLRCLCQRSHVGTACSLLNKMKGKVPF 462 S+ RA +VSKAIQ F NLE++ L + FN+ L+ L +R HV A SLL+ +GK+PF Sbjct: 61 GSYSRAHRVSKAIQYFENLEEFGLPSDTGAFNVFLKSLSERGHVRVATSLLHTFEGKIPF 120 Query: 463 NSMTYNLVIGGWSKFGRISEVERNLEAMVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIF 642 ++ TY ++IGGWS+ GRISE E+ AM+ G PD T++Y+LEGLGRA +ID+A+ +F Sbjct: 121 DTTTYTILIGGWSRLGRISETEKIWAAMLSNGFQPDCSTFNYLLEGLGRAGRIDNAIAVF 180 Query: 643 RELEEMGCMLTAEVYNAMISNFTSVGNIDEGLIYYNQMLRSNCSPNMDTYIRLISALLKV 822 + E GC YNAMI NF S G ++E + YY M +C+P++ TY ++I A +KV Sbjct: 181 
ESMGEKGCPPNTSSYNAMICNFISCGALNECVKYYATMSEKHCAPDIVTYTKMIGAFIKV 240 Query: 823 RRVAEAIELYDEMLNRGIVPTTGVVTSFMEPLCGYGPPHAALIIYKKARKVGCKVSMTTY 1002 RVA+A+E++D ML RG++P+TG +TSF+EPLC +GPPHAAL IY+KA+KVGCK S+ Y Sbjct: 241 CRVADALEMFDSMLGRGVIPSTGTLTSFIEPLCKFGPPHAALEIYRKAKKVGCKFSVKAY 300 Query: 1003 KXXXXXXXXXGKCGMLFDIWAQMQDSGYSSDTEVFEYIINGLCSIGKLENAVLVMEESLQ 1182 K GKCG + +W M+ G+SSD EV+E +I+G C+IG+L+NAVL +EE+L Sbjct: 301 KLLLGRLARFGKCGTVLRVWDDMRTDGHSSDKEVYECVIDGFCNIGQLDNAVLALEEALS 360 Query: 1183 MGFCPSRLTCXXXXXXXXXXXXVEMAYKLLLKIKVARKYENARTYWRAKGWHF 1341 +GFCP+++ VE+AYKL +KIK AR+ E +R YW A GWHF Sbjct: 361 LGFCPNKVIYSKLNCKLLDASKVELAYKLYVKIKEARRNELSRKYWFANGWHF 413 >ref|XP_006598344.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820-like [Glycine max] Length = 482 Score = 412 bits (1058), Expect = e-112 Identities = 201/368 (54%), Positives = 262/368 (71%) Frame = +1 Query: 181 ALGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIFYNLEDYALER 360 ALGRRK+F M+D LCDM + I L +V+DSF+RA VS+AIQ+F NL+D + R Sbjct: 111 ALGRRKFFDFMMDALCDMRRNAIDGDLFMLSVVVDSFVRAGHVSRAIQVFGNLDDLGVRR 170 Query: 361 GSETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFGRISEVERNLE 540 +E N+LL CLC+RSHVG A S+LN MKGKV F+ TYN V GGWS+FGR+SEVER + Sbjct: 171 DTEALNVLLLCLCRRSHVGAANSVLNSMKGKVDFDVGTYNAVAGGWSRFGRVSEVERVMR 230 Query: 541 AMVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYNAMISNFTSVG 720 M GL PD T+ +++EGLGR ++D+AV+I ++EM C E YNA+I NF SVG Sbjct: 231 EMEADGLRPDCRTFGFLIEGLGREGRMDEAVEILCGMKEMNCQPDTETYNAVIFNFVSVG 290 Query: 721 NIDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNRGIVPTTGVVT 900 + +E + YYN+ML NC PN+DTY R+I+ L+ R+VA+A+ ++DEML RG+VP+TG +T Sbjct: 291 DFEECIKYYNRMLSDNCEPNLDTYARMINRFLRARKVADALLMFDEMLRRGVVPSTGTIT 350 Query: 901 SFMEPLCGYGPPHAALIIYKKARKVGCKVSMTTYKXXXXXXXXXGKCGMLFDIWAQMQDS 1080 +F++ LC YGPP+AAL+IYKKARK+GC +SM YK GKCG L IW +MQ+ Sbjct: 351 TFIKRLCSYGPPYAALMIYKKARKLGCVISMEAYKILLMRLSMVGKCGTLLSIWEEMQEC 410 Query: 1081 GYSSDTEVFEYIINGLCSIGKLENAVLVMEESLQMGFCPSRLTCXXXXXXXXXXXXVEMA 
1260 GYSSD EV+E II+GLC++G+LENAVLVMEE+L+ GFCPSRL E A Sbjct: 411 GYSSDLEVYECIISGLCNVGQLENAVLVMEEALRKGFCPSRLVYSKLSNRLLASDKSERA 470 Query: 1261 YKLLLKIK 1284 YKL LKIK Sbjct: 471 YKLFLKIK 478 Score = 61.6 bits (148), Expect = 9e-07 Identities = 54/277 (19%), Positives = 112/277 (40%) Frame = +1 Query: 151 DLDSYHVILKALGRRKYFKHMVDMLCDMSKWGIIPTSETLFIVMDSFIRARQVSKAIQIF 330 D ++ +++ LGR V++LC M + P +ET V+ +F+ + I+ Sbjct: 240 DCRTFGFLIEGLGREGRMDEAVEILCGMKEMNCQPDTETYNAVIFNFVSVGDFEECIKY- 298 Query: 331 YNLEDYALERGSETFNILLRCLCQRSHVGTACSLLNKMKGKVPFNSMTYNLVIGGWSKFG 510 +N +L C+ N TY +I + + Sbjct: 299 --------------YNRMLSDNCEP-------------------NLDTYARMINRFLRAR 325 Query: 511 RISEVERNLEAMVEIGLDPDSLTYSYILEGLGRAEQIDDAVKIFRELEEMGCMLTAEVYN 690 ++++ + M+ G+ P + T + ++ L A+ I+++ ++GC+++ E Y Sbjct: 326 KVADALLMFDEMLRRGVVPSTGTITTFIKRLCSYGPPYAALMIYKKARKLGCVISMEAYK 385 Query: 691 AMISNFTSVGNIDEGLIYYNQMLRSNCSPNMDTYIRLISALLKVRRVAEAIELYDEMLNR 870 ++ + VG L + +M S +++ Y +IS L V ++ A+ + +E L + Sbjct: 386 ILLMRLSMVGKCGTLLSIWEEMQECGYSSDLEVYECIISGLCNVGQLENAVLVMEEALRK 445 Query: 871 GIVPTTGVVTSFMEPLCGYGPPHAALIIYKKARKVGC 981 G P+ V + L A YK K+ C Sbjct: 446 GFCPSRLVYSKLSNRLLASDKSERA---YKLFLKIKC 479