BLASTX nr result
ID: Akebia27_contig00035404
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia27_contig00035404 (719 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006386647.1| hypothetical protein POPTR_0002s18000g [Popu... 346 4e-93 ref|XP_002301440.2| pentatricopeptide repeat-containing family p... 346 4e-93 ref|XP_002268415.1| PREDICTED: pentatricopeptide repeat-containi... 339 6e-91 emb|CBI38708.3| unnamed protein product [Vitis vinifera] 339 6e-91 ref|XP_007219034.1| hypothetical protein PRUPE_ppa004557mg [Prun... 338 1e-90 ref|XP_002515072.1| pentatricopeptide repeat-containing protein,... 327 2e-87 ref|XP_006444734.1| hypothetical protein CICLE_v10019806mg [Citr... 320 4e-85 ref|XP_004306649.1| PREDICTED: pentatricopeptide repeat-containi... 319 7e-85 ref|XP_006491362.1| PREDICTED: pentatricopeptide repeat-containi... 317 3e-84 ref|XP_003534128.1| PREDICTED: pentatricopeptide repeat-containi... 314 2e-83 ref|XP_007041542.1| Tetratricopeptide repeat (TPR)-like superfam... 306 3e-81 ref|XP_003614316.1| Pentatricopeptide repeat-containing protein ... 306 3e-81 gb|EXC13671.1| hypothetical protein L484_019632 [Morus notabilis] 305 8e-81 ref|XP_003619886.1| Pentatricopeptide repeat-containing protein ... 305 1e-80 ref|XP_004512775.1| PREDICTED: pentatricopeptide repeat-containi... 303 3e-80 gb|EYU37119.1| hypothetical protein MIMGU_mgv1a022987mg [Mimulus... 298 2e-78 ref|XP_007152653.1| hypothetical protein PHAVU_004G147700g [Phas... 298 2e-78 ref|XP_006357649.1| PREDICTED: pentatricopeptide repeat-containi... 294 2e-77 ref|XP_004243578.1| PREDICTED: pentatricopeptide repeat-containi... 291 1e-76 ref|XP_006306345.1| hypothetical protein CARUB_v10012229mg [Caps... 273 6e-71 >ref|XP_006386647.1| hypothetical protein POPTR_0002s18000g [Populus trichocarpa] gi|550345261|gb|ERP64444.1| hypothetical protein POPTR_0002s18000g [Populus trichocarpa] Length = 467 Score = 346 bits (888), Expect = 4e-93 Identities = 171/233 (73%), Positives = 196/233 (84%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 G + EAVSLF+N+P+FNCVNWTESFNTLL+I+VKE++ E AHR +LENS GWEVKSRIR+ Sbjct: 94 GQISEAVSLFKNIPKFNCVNWTESFNTLLQILVKESKLETAHRFFLENSCGWEVKSRIRA 153 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L+D LC RNRSDLAL IFQEM+ Q CYP+R++YR+LMRGLCEDGRLNEATHLLYSMF Sbjct: 154 LNLLLDVLCQRNRSDLALQIFQEMDYQGCYPNRDSYRILMRGLCEDGRLNEATHLLYSMF 213 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVE 560 WRISQKGSGED+V+YRTLL+ALCDNGQVEEAMEILGKI RK LKA KRYR L L Sbjct: 214 WRISQKGSGEDIVVYRTLLDALCDNGQVEEAMEILGKILRKGLKAPKRYRHRLDLSQCNN 273 Query: 561 GESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 E +E K LINEAL RGGVPSLASY +MA DLY EGK A++V DE +RG Sbjct: 274 CEDIEATKLLINEALIRGGVPSLASYTAMAVDLYCEGKTGQADKVLDETQERG 326 >ref|XP_002301440.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550345260|gb|EEE80713.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 503 Score = 346 bits (888), Expect = 4e-93 Identities = 171/233 (73%), Positives = 196/233 (84%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 G + EAVSLF+N+P+FNCVNWTESFNTLL+I+VKE++ E AHR +LENS GWEVKSRIR+ Sbjct: 94 GQISEAVSLFKNIPKFNCVNWTESFNTLLQILVKESKLETAHRFFLENSCGWEVKSRIRA 153 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L+D LC RNRSDLAL IFQEM+ Q CYP+R++YR+LMRGLCEDGRLNEATHLLYSMF Sbjct: 154 LNLLLDVLCQRNRSDLALQIFQEMDYQGCYPNRDSYRILMRGLCEDGRLNEATHLLYSMF 213 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVE 560 WRISQKGSGED+V+YRTLL+ALCDNGQVEEAMEILGKI RK LKA KRYR L L Sbjct: 214 WRISQKGSGEDIVVYRTLLDALCDNGQVEEAMEILGKILRKGLKAPKRYRHRLDLSQCNN 273 Query: 561 GESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 E +E K LINEAL RGGVPSLASY +MA DLY EGK A++V DE +RG Sbjct: 274 CEDIEATKLLINEALIRGGVPSLASYTAMAVDLYCEGKTGQADKVLDETQERG 326 >ref|XP_002268415.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05600-like [Vitis vinifera] Length = 503 Score = 339 bits (869), Expect = 6e-91 Identities = 168/233 (72%), Positives = 193/233 (82%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 G++DEAVSLF+ LPQFNCVNWT SFNTLL+I+VKE++ E A RL+LE+S GWEVKSRI S Sbjct: 94 GMVDEAVSLFKTLPQFNCVNWTGSFNTLLRILVKESKLETACRLFLEHSCGWEVKSRIGS 153 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L+DALC NRSDLAL +FQEM QCC PD+E+YR+LMRGLCEDGRLNEATHLLYSMF Sbjct: 154 LNLLMDALCQINRSDLALHVFQEMRYQCCSPDKESYRILMRGLCEDGRLNEATHLLYSMF 213 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVE 560 WRISQKG GED+ YRTLL+ALCDNG VEEA+EILGK+ +K LKA KR R L L Sbjct: 214 WRISQKGGGEDIAAYRTLLDALCDNGHVEEALEILGKVLKKGLKAPKRCRGHLDLSYCCN 273 Query: 561 GESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 GE +E KGLINEAL RGGVPS+ASY +MA DLYSE KI +AN+V DEM RG Sbjct: 274 GEDIERTKGLINEALIRGGVPSMASYSAMAIDLYSERKIGEANQVLDEMRDRG 326 >emb|CBI38708.3| unnamed protein product [Vitis vinifera] Length = 466 Score = 339 bits (869), Expect = 6e-91 Identities = 168/233 (72%), Positives = 193/233 (82%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 G++DEAVSLF+ LPQFNCVNWT SFNTLL+I+VKE++ E A RL+LE+S GWEVKSRI S Sbjct: 94 GMVDEAVSLFKTLPQFNCVNWTGSFNTLLRILVKESKLETACRLFLEHSCGWEVKSRIGS 153 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L+DALC NRSDLAL +FQEM QCC PD+E+YR+LMRGLCEDGRLNEATHLLYSMF Sbjct: 154 LNLLMDALCQINRSDLALHVFQEMRYQCCSPDKESYRILMRGLCEDGRLNEATHLLYSMF 213 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVE 560 WRISQKG GED+ YRTLL+ALCDNG VEEA+EILGK+ +K LKA KR R L L Sbjct: 214 WRISQKGGGEDIAAYRTLLDALCDNGHVEEALEILGKVLKKGLKAPKRCRGHLDLSYCCN 273 Query: 561 GESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 GE +E KGLINEAL RGGVPS+ASY +MA DLYSE KI +AN+V DEM RG Sbjct: 274 GEDIERTKGLINEALIRGGVPSMASYSAMAIDLYSERKIGEANQVLDEMRDRG 326 >ref|XP_007219034.1| hypothetical protein PRUPE_ppa004557mg [Prunus persica] gi|462415496|gb|EMJ20233.1| hypothetical protein PRUPE_ppa004557mg [Prunus persica] Length = 503 Score = 338 bits (866), Expect = 1e-90 Identities = 166/233 (71%), Positives = 195/233 (83%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 GLLDEAVSLF+N+ QFNCVNWT+SFNTLL+IMVKE++ EAAHR+++E+ GWEV SR+ S Sbjct: 94 GLLDEAVSLFKNISQFNCVNWTQSFNTLLEIMVKESKLEAAHRIFMEHCCGWEVSSRVPS 153 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L+ ALC + RSD+AL +FQEM+ Q C PDRE+YR+LMRGLCED RLNEATHLLYSMF Sbjct: 154 LNLLMLALCQKGRSDIALQVFQEMDYQSCNPDRESYRILMRGLCEDKRLNEATHLLYSMF 213 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVE 560 WRISQKG GEDVVIYRTLL+ALCDNGQVE+A+EILGKI RK LKA KR+R +L L Sbjct: 214 WRISQKGCGEDVVIYRTLLDALCDNGQVEDAVEILGKILRKGLKAPKRFRHNLDLSHYGN 273 Query: 561 GESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 GE EGIK LINEAL RGG+PSLASY +MA DLY E K+ +A+RV EM RG Sbjct: 274 GEDTEGIKRLINEALVRGGIPSLASYSAMAIDLYDENKVGEADRVLKEMQDRG 326 >ref|XP_002515072.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223545552|gb|EEF47056.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 504 Score = 327 bits (838), Expect = 2e-87 Identities = 158/233 (67%), Positives = 191/233 (81%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 GLL+EA+SLF+N+PQFNCVNWTESFNTLL+IMVKE++ EAAHRL+LE+S GWEVKSR+RS Sbjct: 94 GLLNEAISLFKNIPQFNCVNWTESFNTLLQIMVKESKLEAAHRLFLESSYGWEVKSRVRS 153 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L+D LC NRSD+AL +FQEMN Q CYPDR++YR++M GLC+DGRLNEATHLLYSMF Sbjct: 154 LNLLMDVLCQHNRSDVALQVFQEMNYQGCYPDRDSYRIVMMGLCKDGRLNEATHLLYSMF 213 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVE 560 WRISQKGSGED+VIYR L+ALCD G VE+A+E+LGKI RK LKA KR L L Sbjct: 214 WRISQKGSGEDIVIYRIFLDALCDIGMVEQALEVLGKILRKGLKAPKRCHPRLDLSNCNS 273 Query: 561 GESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 ++E K LINEAL RG +PSL+SY +MA D Y+EGK+ A++V DE RG Sbjct: 274 DGNIETTKHLINEALIRGAIPSLSSYTAMAVDFYAEGKLSQADKVLDETQDRG 326 >ref|XP_006444734.1| hypothetical protein CICLE_v10019806mg [Citrus clementina] gi|557546996|gb|ESR57974.1| hypothetical protein CICLE_v10019806mg [Citrus clementina] Length = 504 Score = 320 bits (819), Expect = 4e-85 Identities = 158/233 (67%), Positives = 190/233 (81%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 G L+EAVSLF+NL QFNCVNWT+SFNTLLK MVKE++ EAAH L+L + GWEVKSRI+S Sbjct: 94 GQLNEAVSLFKNLSQFNCVNWTQSFNTLLKEMVKESKLEAAHILFLRSCYGWEVKSRIQS 153 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L+D LC R RSDLAL +FQEM+ Q CYPDRE+Y +LM+GLC D RLNEATHLLYSMF Sbjct: 154 LNLLMDVLCQRRRSDLALHVFQEMDFQGCYPDRESYHILMKGLCNDRRLNEATHLLYSMF 213 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVE 560 WRISQKGSGED+VIYRTLL ALCD G++++AM+IL KI RK LKA K R + L + Sbjct: 214 WRISQKGSGEDIVIYRTLLFALCDQGKIQDAMQILEKILRKGLKAPKSRRHRIDLCPCND 273 Query: 561 GESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 GE +EG K LINEAL RGG+PSLASY +MA DLY+EG+I + ++V DEM +G Sbjct: 274 GEDIEGAKSLINEALIRGGIPSLASYSAMAIDLYNEGRIVEGDKVLDEMRTKG 326 >ref|XP_004306649.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05600-like [Fragaria vesca subsp. vesca] Length = 506 Score = 319 bits (817), Expect = 7e-85 Identities = 160/233 (68%), Positives = 191/233 (81%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 G LDEAVSLF+NL QFNCVNWT+SFNTL++IMV+E+R E A RL++E+ GWEV SR+RS Sbjct: 94 GQLDEAVSLFKNLSQFNCVNWTQSFNTLVEIMVEESRLEDACRLFVEHCCGWEVSSRVRS 153 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L+ ALC + RSD+AL +FQEM+ Q C PDRE+YR+LMRGLCED RLNEATHLLYSMF Sbjct: 154 LNLLMLALCQKGRSDIALHVFQEMDYQSCNPDRESYRILMRGLCEDRRLNEATHLLYSMF 213 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVE 560 WRISQKG ED+VIYRTLL+ALCDNG+VEEA+E+LGKI RK LKA K++R L L Sbjct: 214 WRISQKGCAEDIVIYRTLLDALCDNGKVEEAVEMLGKILRKGLKAPKKFRLQLDLSRYNY 273 Query: 561 GESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 GE EGIK LIN AL RGG PSLASY +MA DLY+E KI++A+ V +EM RG Sbjct: 274 GEDTEGIKRLINAALVRGGNPSLASYSAMAIDLYNENKINEADAVLNEMQDRG 326 >ref|XP_006491362.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05600-like [Citrus sinensis] Length = 504 Score = 317 bits (812), Expect = 3e-84 Identities = 157/233 (67%), Positives = 189/233 (81%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 G L+EAVSLF+NL QFNCVNWT+SFNTLLK MVKE++ EAAH L+L + GWEVKSRI+S Sbjct: 94 GQLNEAVSLFKNLSQFNCVNWTQSFNTLLKEMVKESKLEAAHILFLRSCYGWEVKSRIQS 153 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L+D LC RSDLAL +FQEM+ Q CYPDRE+Y +LM+GLC D RLNEATHLLYSMF Sbjct: 154 LNLLMDVLCQCRRSDLALHVFQEMDFQGCYPDRESYHILMKGLCNDRRLNEATHLLYSMF 213 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVE 560 WRISQKGSGED+VIYRTLL ALCD G++++AM+IL KI RK LKA K R + L + Sbjct: 214 WRISQKGSGEDIVIYRTLLFALCDQGKIQDAMQILEKILRKGLKAPKSRRHRIDLCPCND 273 Query: 561 GESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 GE +EG K LINEAL RGG+PSLASY +MA DLY+EG+I + ++V DEM +G Sbjct: 274 GEDIEGAKSLINEALIRGGIPSLASYSAMAVDLYNEGRIVEGDKVLDEMRTKG 326 >ref|XP_003534128.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05600-like [Glycine max] Length = 502 Score = 314 bits (804), Expect = 2e-83 Identities = 151/233 (64%), Positives = 194/233 (83%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 GL+DEA+SL++++P+FNCVNWTESFNT+L+IMVKE R E AHRL++E+S GWEV+S +R+ Sbjct: 94 GLVDEAISLYKSIPRFNCVNWTESFNTMLQIMVKENRLEIAHRLFVESSCGWEVRSLVRA 153 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L+ ALC ++RSDLAL +FQEM+ Q CYP+R++Y +LM+GLC+D RL+EATHLLYSMF Sbjct: 154 LNLLMYALCQKSRSDLALQLFQEMDYQSCYPNRDSYAILMKGLCQDRRLHEATHLLYSMF 213 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVE 560 WRISQKG+GED+V+YRTLL+ALCD G+ EEA EILGKI RK LKA KR L L + Sbjct: 214 WRISQKGNGEDIVVYRTLLDALCDAGKFEEAEEILGKILRKGLKAPKRCHSRLDLDQLSD 273 Query: 561 GESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 G+ +E K +I+EAL +G VPSLASY +MA DLYSEGKID+A++V EM RG Sbjct: 274 GKDIESAKRMIHEALIKGSVPSLASYNAMAVDLYSEGKIDEADKVIIEMQVRG 326 Score = 59.3 bits (142), Expect = 1e-06 Identities = 52/208 (25%), Positives = 96/208 (46%), Gaps = 2/208 (0%) Frame = +3 Query: 90 SFNTLLKIMVKEARFEAAHRLYLENSI-GWEVKSRIRSLNVLIDALCCRNRSDLALDIFQ 266 S+N + + E + + A ++ +E + G++ I V ALC ++ D A+ + + Sbjct: 298 SYNAMAVDLYSEGKIDEADKVIIEMQVRGFKPTHSIFEAKVA--ALCKVSKVDEAIKVIE 355 Query: 267 E-MNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMFWRISQKGSGEDVVIYRTLLEA 443 E M C P + Y +L++ LC G +T +L S+ S+ G D Y LLE Sbjct: 356 EDMVKVNCLPTAKVYNILLKNLCNVGN---STAILESLNKMSSKVGCTGDRDTYSILLEM 412 Query: 444 LCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVEGESLEGIKGLINEALTRGGVP 623 LC + EA ++L K+S K SL+ G E + L + +++G +P Sbjct: 413 LCGERRYLEASQLLEKMSIKSYWPCTNSYNSLIRGLCSIGRQYEAVMWL-EDMISQGKLP 471 Query: 624 SLASYGSMAADLYSEGKIDDANRVFDEM 707 ++ + S+A+ + KI ++ F + Sbjct: 472 EISVWNSLASLFCNSEKIKVSSETFSRL 499 >ref|XP_007041542.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma cacao] gi|508705477|gb|EOX97373.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma cacao] Length = 500 Score = 306 bits (785), Expect = 3e-81 Identities = 151/233 (64%), Positives = 189/233 (81%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 G+++EAV LF ++PQFNC+N+T+SF TLL IMVKE+ F+AA++L+LENS EVKSR++S Sbjct: 94 GMVNEAVDLFNSIPQFNCINFTQSFTTLLGIMVKESDFKAAYQLFLENSWRLEVKSRVKS 153 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L++ LC +SDLAL+IFQEM+ Q CYPDRE+YR+LM+GL +DGRLNEATHLLYSMF Sbjct: 154 LNLLMEGLCQFKKSDLALNIFQEMDFQGCYPDRESYRILMKGLSDDGRLNEATHLLYSMF 213 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVE 560 WRISQKGSGED+VIYR LL ALCDNG+VEEA+E+LGKI RK LKA K R + L Sbjct: 214 WRISQKGSGEDIVIYRILLYALCDNGKVEEALELLGKILRKGLKAPKSRRHQIDLSRCAN 273 Query: 561 GESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 GE E K LI EAL RGGVP++ SY +MA DLY+EG++D+A V DEM K+G Sbjct: 274 GEDSEATKRLITEALIRGGVPNMGSYSAMAIDLYNEGRVDEAETVLDEMRKKG 326 >ref|XP_003614316.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355515651|gb|AES97274.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 504 Score = 306 bits (785), Expect = 3e-81 Identities = 151/233 (64%), Positives = 190/233 (81%) Frame = +3 Query: 18 EGLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIR 197 EGL+DEA+SL++N+PQFNCVNWT+SFNTLL+IMV E + E AH L++E+S GWEVKSR++ Sbjct: 95 EGLVDEAISLYKNIPQFNCVNWTQSFNTLLEIMVNENKLEDAHSLFVESSCGWEVKSRVQ 154 Query: 198 SLNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSM 377 +LN+L+ ALC ++RSDLAL IFQEM+ Q CYP+RE+Y ++M+GLC+D RL+EATHLLYSM Sbjct: 155 ALNLLMYALCRKSRSDLALQIFQEMDYQGCYPNRESYLIVMKGLCQDKRLHEATHLLYSM 214 Query: 378 FWRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRV 557 FWRIS KG+GED+VIYRTLL+ALCDNG+ +EA+EILGKI RK LKA KR L + Sbjct: 215 FWRISLKGNGEDIVIYRTLLDALCDNGKFDEAVEILGKILRKGLKAPKRCYNRLDISQCG 274 Query: 558 EGESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKR 716 +G+ E K I+EAL RG VPS ASY SMA DLY EGKID+A++V EM R Sbjct: 275 DGKDAEVTKRWIHEALVRGSVPSTASYTSMAVDLYEEGKIDEADKVIIEMKDR 327 >gb|EXC13671.1| hypothetical protein L484_019632 [Morus notabilis] Length = 458 Score = 305 bits (782), Expect = 8e-81 Identities = 153/233 (65%), Positives = 188/233 (80%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 G+ DEA+SLF+++PQFNCVNWTESFNT+L+IMV E+RF+ A RL+L+NS WEV SRI+S Sbjct: 44 GMPDEALSLFKSIPQFNCVNWTESFNTILQIMVNESRFDDAGRLFLDNSSRWEVSSRIQS 103 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L+ LC + SD+AL +FQEM+ Q +PDRE+Y+VLMRGLC+DG+LNE HLLYSMF Sbjct: 104 LNLLMRTLCEKGCSDVALQVFQEMDYQGIHPDRESYQVLMRGLCQDGKLNEGIHLLYSMF 163 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVE 560 WRIS KGSGEDVVIYRTLL+ALCDNG+VEEA+EILGKI RK LKA KR R + L Sbjct: 164 WRISLKGSGEDVVIYRTLLDALCDNGKVEEAVEILGKILRKGLKAPKRCRIRIDLSQCSN 223 Query: 561 GESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 E +E IK LINEAL +GG+PSLASY +MA DLY+E I+D ++V EM RG Sbjct: 224 FEDVESIKRLINEALVKGGIPSLASYRAMAVDLYNENNINDGDKVLKEMQDRG 276 >ref|XP_003619886.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355494901|gb|AES76104.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 576 Score = 305 bits (780), Expect = 1e-80 Identities = 150/231 (64%), Positives = 189/231 (81%) Frame = +3 Query: 24 LLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRSL 203 L+DEA+SL++N+PQFNCVNWT+SFNTLLKIMV E + E AH L++E+S GWEVKSR+++L Sbjct: 77 LVDEAISLYKNIPQFNCVNWTQSFNTLLKIMVNENKLEDAHSLFVESSCGWEVKSRVQAL 136 Query: 204 NVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMFW 383 N+L+ ALC ++RSDLAL IFQEM+ Q CYP+RE+Y ++M+GLC+D RL+EATHLLYSMFW Sbjct: 137 NLLMYALCRKSRSDLALQIFQEMDYQDCYPNRESYLIVMKGLCQDKRLHEATHLLYSMFW 196 Query: 384 RISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVEG 563 RIS KG+GED+VIYRTLL+ALCDNG+ +EA+EILGKI RK LKA KR L + +G Sbjct: 197 RISLKGNGEDIVIYRTLLDALCDNGKFDEAVEILGKILRKGLKAPKRCYNRLDITQCGDG 256 Query: 564 ESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKR 716 + +E K I+EAL RG VPS ASY SMA DLY EGKID+A++V EM R Sbjct: 257 KDVEVTKRWIHEALVRGSVPSTASYTSMAVDLYEEGKIDEADKVIIEMKDR 307 >ref|XP_004512775.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05600-like [Cicer arietinum] Length = 502 Score = 303 bits (777), Expect = 3e-80 Identities = 149/233 (63%), Positives = 187/233 (80%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 GL+DEA+ L++N+P+FNCVNWT+SFNTLL+IMV E + E AH L++E+S GWEVKSR+++ Sbjct: 94 GLVDEAIGLYKNIPRFNCVNWTQSFNTLLEIMVNENKLEDAHSLFVESSCGWEVKSRVQA 153 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L+ ALC ++RSDLAL IFQEM+ Q CYP+R++Y +M+GLC D RLNEATHLLYSMF Sbjct: 154 LNLLMYALCQKSRSDLALQIFQEMDYQGCYPNRDSYLTVMKGLCRDKRLNEATHLLYSMF 213 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVE 560 WRIS KG+GED+VIYRTLL+ALCD+G EEA+EIL KI RK LKA KR L L Sbjct: 214 WRISLKGNGEDIVIYRTLLDALCDDGNFEEAVEILSKILRKGLKAPKRCYSRLDLSQCGN 273 Query: 561 GESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 G+ +E IK I+EAL RG VPS ASY +MA DLY+EGKI +A+ V DEM K+G Sbjct: 274 GKDVEDIKRWIHEALVRGSVPSTASYNAMAIDLYNEGKIGEADNVIDEMRKKG 326 >gb|EYU37119.1| hypothetical protein MIMGU_mgv1a022987mg [Mimulus guttatus] Length = 461 Score = 298 bits (762), Expect = 2e-78 Identities = 147/235 (62%), Positives = 183/235 (77%), Gaps = 2/235 (0%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 GL DEAVSLF LP+FNCVNWTESFNTLL+IMVK+++ E+A+ ++EN GWE+KSRIRS Sbjct: 96 GLFDEAVSLFHTLPEFNCVNWTESFNTLLEIMVKDSKLESAYHFFVENCHGWEIKSRIRS 155 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L+ ALC NRSDLAL +FQEM+ Q C DRE+Y++LM+GLCED RL EATHLLYSMF Sbjct: 156 LNLLMSALCRINRSDLALQVFQEMDYQWCCADRESYKILMKGLCEDRRLTEATHLLYSMF 215 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRG--R 554 W+IS+KG G DV IYRTLLEALCDNG+VEEA E+L K+ RK LKA +++R+ + L Sbjct: 216 WKISRKGCGADVSIYRTLLEALCDNGEVEEATEVLEKVLRKGLKAPRKFRKKIDLSQFYH 275 Query: 555 VEGESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 + + K LINEAL +GGV Y +MA DLYSEG+ID+ +V DEML+RG Sbjct: 276 KDEAGIYQTKALINEALIKGGVSGSDGYRAMAVDLYSEGRIDEGGKVLDEMLQRG 330 >ref|XP_007152653.1| hypothetical protein PHAVU_004G147700g [Phaseolus vulgaris] gi|561025962|gb|ESW24647.1| hypothetical protein PHAVU_004G147700g [Phaseolus vulgaris] Length = 502 Score = 298 bits (762), Expect = 2e-78 Identities = 143/233 (61%), Positives = 187/233 (80%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 G +DEA+SL++ +PQFNCVNWTESFNT+L+IMV E R E AH +++E+S GWEV+ RIR+ Sbjct: 94 GQVDEAMSLYKTIPQFNCVNWTESFNTILQIMVNENRLEMAHSIFVESSCGWEVRYRIRA 153 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L+ ALC ++RSDLAL +FQEM+ Q CYPDR++Y +LM+GLC+D RL+EATHLLYSMF Sbjct: 154 LNLLMYALCQKSRSDLALQLFQEMDYQSCYPDRDSYAILMKGLCQDKRLHEATHLLYSMF 213 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVE 560 WRISQKG+GED+V+YR LL+ALCD G++EEA+EILGKI RK LKA K L ++ Sbjct: 214 WRISQKGNGEDIVVYRILLDALCDAGKLEEAVEILGKILRKGLKAPKGCYNRSDLGQILD 273 Query: 561 GESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 +E K +I+EAL +G +PS+ASY +MA DLY+EGKID+A+ V EM RG Sbjct: 274 DNDIESAKRVIHEALIKGSIPSMASYNAMAVDLYTEGKIDEADTVIMEMQDRG 326 >ref|XP_006357649.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05600-like isoform X1 [Solanum tuberosum] Length = 503 Score = 294 bits (752), Expect = 2e-77 Identities = 146/239 (61%), Positives = 187/239 (78%), Gaps = 1/239 (0%) Frame = +3 Query: 6 RLYLE-GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEV 182 R Y++ GL +EA+ LF++LP+FNC+ WT S NTLL+I+V+E++ E+ ++L+LENS GWEV Sbjct: 88 RSYVQAGLTNEAIFLFKSLPEFNCIEWTRSLNTLLEILVEESKLESVYQLFLENSCGWEV 147 Query: 183 KSRIRSLNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATH 362 KSR LN+L++ALC RSDLAL IFQEM+ Q CYP++E+YR+LMRGLCE+ RLNEATH Sbjct: 148 KSRAHFLNLLMNALCRMKRSDLALHIFQEMSYQNCYPNKESYRILMRGLCEEKRLNEATH 207 Query: 363 LLYSMFWRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLV 542 LLYSMFWRISQKGSGEDVV+YR LLEALCDN + EEA++ILGK+ RK LKA K+ + + Sbjct: 208 LLYSMFWRISQKGSGEDVVVYRALLEALCDNEEGEEAIQILGKVLRKGLKAPKKCYKQID 267 Query: 543 LRGRVEGESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 L G E +K LINEAL +G VPS SY +M+ D Y+EGKID+ N+V EM RG Sbjct: 268 LTQCRNGSDTENMKVLINEALIKGIVPSSDSYRAMSVDFYAEGKIDEGNKVLKEMHDRG 326 >ref|XP_004243578.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05600-like isoform 1 [Solanum lycopersicum] gi|460396023|ref|XP_004243579.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05600-like isoform 2 [Solanum lycopersicum] Length = 503 Score = 291 bits (746), Expect = 1e-76 Identities = 142/233 (60%), Positives = 184/233 (78%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 GL +EA+ LF++LP+FNC+ WT S +TLL+I+V+E++ E+ ++L+LENS GWEVKSR Sbjct: 94 GLTNEAIFLFKSLPEFNCIEWTRSLSTLLEILVEESKLESVYQLFLENSCGWEVKSRAHF 153 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L++ALC RSDLAL IFQEM+ Q CYP++E+YR+LMRGLCE+ RLNEATHLLYSMF Sbjct: 154 LNLLMNALCRMKRSDLALHIFQEMSYQNCYPNKESYRILMRGLCEEKRLNEATHLLYSMF 213 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVE 560 WRISQKGSGEDVV+YR LLEALC+N + EEA++ILGK+ RK LKA + Y + + L Sbjct: 214 WRISQKGSGEDVVVYRALLEALCENEEGEEALQILGKVLRKGLKAPRSYYKQIDLTQCRN 273 Query: 561 GESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 G E +K LINEAL +G VPS SY +MA D Y+EGKID+ ++V EM +RG Sbjct: 274 GSDTENMKVLINEALIKGIVPSSDSYRAMAVDFYAEGKIDEGDKVLKEMHERG 326 >ref|XP_006306345.1| hypothetical protein CARUB_v10012229mg [Capsella rubella] gi|482575056|gb|EOA39243.1| hypothetical protein CARUB_v10012229mg [Capsella rubella] Length = 503 Score = 273 bits (697), Expect = 6e-71 Identities = 138/235 (58%), Positives = 176/235 (74%), Gaps = 2/235 (0%) Frame = +3 Query: 21 GLLDEAVSLFRNLPQFNCVNWTESFNTLLKIMVKEARFEAAHRLYLENSIGWEVKSRIRS 200 G LD+A+SLF++L +FNCVNWT SF+TLL+ MVKE+ EAA ++ + GWEV SRI + Sbjct: 94 GRLDDAISLFKSLHEFNCVNWTLSFDTLLQEMVKESELEAACHIFRKYCYGWEVNSRIMA 153 Query: 201 LNVLIDALCCRNRSDLALDIFQEMNDQCCYPDRETYRVLMRGLCEDGRLNEATHLLYSMF 380 LN+L+ LC NRSDLA +FQEMN Q CYPDR++YR+LM+G C +GRL+EATHLLYSMF Sbjct: 154 LNLLMKVLCQVNRSDLASQVFQEMNYQGCYPDRDSYRILMKGFCLEGRLDEATHLLYSMF 213 Query: 381 WRISQKGSGEDVVIYRTLLEALCDNGQVEEAMEILGKISRKRLKASKRYRQSLVLRGRVE 560 WRISQKGSGED+V+YR LL+ALCD G+V+EA+EILGKI RK LKA KR + G E Sbjct: 214 WRISQKGSGEDIVVYRILLDALCDAGEVDEAIEILGKILRKGLKAPKRCYHH-IEAGHWE 272 Query: 561 G--ESLEGIKGLINEALTRGGVPSLASYGSMAADLYSEGKIDDANRVFDEMLKRG 719 G E +E +K L+ E L RG +PSL SY +MA DL+ EGK+ + V M ++G Sbjct: 273 GNSEGIERVKRLLTETLIRGAIPSLDSYSAMATDLFGEGKLLEGEEVLLAMRRKG 327