BLASTX nr result
ID: Paeonia23_contig00044741
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00044741 (397 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_007224829.1| hypothetical protein PRUPE_ppa023452mg [Prun...   179   5e-43
ref|XP_002270695.1| PREDICTED: pentatricopeptide repeat-containi...   173   3e-41
emb|CAN74703.1| hypothetical protein VITISV_029224 [Vitis vinifera]   173   3e-41
ref|XP_006483222.1| PREDICTED: pentatricopeptide repeat-containi...   167   2e-39
ref|XP_006438631.1| hypothetical protein CICLE_v10033863mg, part...   167   2e-39
ref|XP_006344284.1| PREDICTED: pentatricopeptide repeat-containi...   158   6e-37
ref|XP_007046203.1| Pentatricopeptide repeat (PPR) superfamily p...   157   1e-36
ref|XP_004237212.1| PREDICTED: pentatricopeptide repeat-containi...   157   1e-36
ref|XP_002520126.1| pentatricopeptide repeat-containing protein,...   153   3e-35
ref|XP_006395761.1| hypothetical protein EUTSA_v10003832mg [Eutr...   146   3e-33
ref|XP_003519768.2| PREDICTED: pentatricopeptide repeat-containi...   144   1e-32
ref|XP_007157683.1| hypothetical protein PHAVU_002G089500g [Phas...   144   2e-32
ref|XP_002876827.1| pentatricopeptide repeat-containing protein ...   139   4e-31
ref|XP_003613604.1| Pentatricopeptide repeat-containing protein ...   138   9e-31
gb|ABK28160.1| unknown [Arabidopsis thaliana]                         137   2e-30
gb|ABE65422.1| pentatricopeptide repeat-containing protein [Arab...   137   2e-30
ref|NP_178378.1| pentatricopeptide repeat-containing protein [Ar...   137   2e-30
ref|XP_006290425.1| hypothetical protein CARUB_v10019221mg [Caps...   133   3e-29
ref|XP_004490048.1| PREDICTED: pentatricopeptide repeat-containi...
132 6e-29 gb|EXB63826.1| hypothetical protein L484_021099 [Morus notabilis] 131 8e-29 >ref|XP_007224829.1| hypothetical protein PRUPE_ppa023452mg [Prunus persica] gi|462421765|gb|EMJ26028.1| hypothetical protein PRUPE_ppa023452mg [Prunus persica] Length = 619 Score = 179 bits (453), Expect = 5e-43 Identities = 88/131 (67%), Positives = 104/131 (79%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI 184 KVF EMPERNLAS+NAVISGF HNGY EA +F+ +G +PNSVTIAS+L+ C V Sbjct: 89 KVFEEMPERNLASLNAVISGFLHNGYCTEALRLFKNVGPGGFRPNSVTIASMLSACGTVE 148 Query: 185 HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLL 364 HG++MHC A+KLGV++DVYV TSV+TMYSNCG L SA K+FE KN+VS NAFISGLL Sbjct: 149 HGMEMHCLAVKLGVESDVYVATSVLTMYSNCGGLFSAAKVFEEMPIKNIVSCNAFISGLL 208 Query: 365 QNGAPRVVLDV 397 QNG P VVLD+ Sbjct: 209 QNGVPHVVLDI 219 Score = 82.4 bits (202), Expect = 6e-14 Identities = 50/128 (39%), Positives = 75/128 (58%), Gaps = 5/128 (3%) Frame = +2 Query: 2 AKVFVEMPERNLASVNAVISGFSHNGYFREAFLVFREL-GVERLKPNSVTIASVLAGCEN 178 AKVF EMP +N+ S NA ISG NG +F+++ PNSVT+ SVL+ C + Sbjct: 186 AKVFEEMPIKNIVSCNAFISGLLQNGVPHVVLDIFKKMRACTGENPNSVTLLSVLSACAS 245 Query: 179 VIH---GLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLF-ERTRNKNVVSYNA 346 +++ G Q+H +K+ V+ D + T++V MYS CG A F E N+N+ ++NA Sbjct: 246 LLYLRFGKQVHGLMMKIEVELDTMLGTALVDMYSKCGCWQLAYGTFKELNENRNLFTWNA 305 Query: 347 FISGLLQN 370 ISG++ N Sbjct: 306 MISGMMLN 313 Score = 62.0 bits (149), Expect = 8e-08 Identities = 33/103 (32%), Positives = 53/103 (51%), Gaps = 3/103 (2%) Frame = +2 Query: 74 NGYFREAFLVFRELGVERLKPNSVTIASVLAGC---ENVIHGLQMHCWAIKLGVDADVYV 244 +G +R+A ++ +L L+P+ T +L C ++ H +H +K G ADVY Sbjct: 11 DGLYRDALCLYAQLHSASLRPHKFTFPPLLKACGKLQSAPHAQILHTHLMKTGFSADVYS 70 Query: 245 LTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLLQNG 373 T++ +Y + A K+FE +N+ S NA ISG L NG Sbjct: 71 ATALTDVYMKLHLIGDAVKVFEEMPERNLASLNAVISGFLHNG 113 >ref|XP_002270695.1| PREDICTED: pentatricopeptide repeat-containing protein At2g02750 [Vitis vinifera] 
gi|296086418|emb|CBI32007.3| unnamed protein product [Vitis vinifera] Length = 617 Score = 173 bits (438), Expect = 3e-41 Identities = 85/131 (64%), Positives = 104/131 (79%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI 184 KVF EMP RNL S+N ISGFS NGYFREA F+++G+ +PNSVTIASVL C +V Sbjct: 89 KVFEEMPHRNLPSLNVTISGFSRNGYFREALGAFKQVGLGNFRPNSVTIASVLPACASVE 148 Query: 185 HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLL 364 Q+HC AIKLGV++D+YV T+VVTMYSNCG+L+ A K+F++ +KNVVSYNAFISGLL Sbjct: 149 LDGQVHCLAIKLGVESDIYVATAVVTMYSNCGELVLAKKVFDQILDKNVVSYNAFISGLL 208 Query: 365 QNGAPRVVLDV 397 QNGAP +V DV Sbjct: 209 QNGAPHLVFDV 219 Score = 80.1 bits (196), Expect = 3e-13 Identities = 47/136 (34%), Positives = 83/136 (61%), Gaps = 5/136 (3%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLK-PNSVTIASVLAGCENV 181 KVF ++ ++N+ S NA ISG NG F VF++L + PNSVT+ S+L+ C + Sbjct: 187 KVFDQILDKNVVSYNAFISGLLQNGAPHLVFDVFKDLLESSGEVPNSVTLVSILSACSKL 246 Query: 182 IH---GLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLF-ERTRNKNVVSYNAF 349 ++ G Q+H +K+ ++ D V T++V MYS CG A +F E + ++N+V++N+ Sbjct: 247 LYIRFGRQIHGLVVKIEINFDTMVGTALVDMYSKCGCWHWAYGIFIELSGSRNLVTWNSM 306 Query: 350 ISGLLQNGAPRVVLDV 397 I+G++ NG + +++ Sbjct: 307 IAGMMLNGQSDIAVEL 322 Score = 57.4 bits (137), Expect = 2e-06 Identities = 37/121 (30%), Positives = 63/121 (52%), Gaps = 5/121 (4%) Frame = +2 Query: 26 ERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI---HGLQ 196 E + A+ N +ISGFS G EAF F ++ + + +I S+L C + G + Sbjct: 332 EPDSATWNTMISGFSQQGQVVEAFKFFHKMQSAGVIASLKSITSLLRACSALSALQSGKE 391 Query: 197 MHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNK--NVVSYNAFISGLLQN 370 +H I+ +D D ++ T+++ MY CG A ++F + + K + +NA ISG +N Sbjct: 392 IHGHTIRTNIDTDEFISTALIDMYMKCGHSYLARRVFCQFQIKPDDPAFWNAMISGYGRN 451 Query: 371 G 373 G Sbjct: 452 G 452 >emb|CAN74703.1| hypothetical protein VITISV_029224 [Vitis vinifera] Length = 677 Score = 173 bits (438), Expect = 3e-41 Identities = 85/131 (64%), Positives = 104/131 (79%) 
Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI 184 KVF EMP RNL S+N ISGFS NGYFREA F+++G+ +PNSVTIASVL C +V Sbjct: 149 KVFEEMPHRNLPSLNVTISGFSRNGYFREALGAFKQVGLGNFRPNSVTIASVLPACASVE 208 Query: 185 HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLL 364 Q+HC AIKLGV++D+YV T+VVTMYSNCG+L+ A K+F++ +KNVVSYNAFISGLL Sbjct: 209 LDGQVHCLAIKLGVESDIYVATAVVTMYSNCGELVLAKKVFDQILDKNVVSYNAFISGLL 268 Query: 365 QNGAPRVVLDV 397 QNGAP +V DV Sbjct: 269 QNGAPHLVFDV 279 Score = 80.1 bits (196), Expect = 3e-13 Identities = 47/136 (34%), Positives = 83/136 (61%), Gaps = 5/136 (3%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLK-PNSVTIASVLAGCENV 181 KVF ++ ++N+ S NA ISG NG F VF++L + PNSVT+ S+L+ C + Sbjct: 247 KVFDQILDKNVVSYNAFISGLLQNGAPHLVFDVFKDLLESSGEVPNSVTLVSILSACSKL 306 Query: 182 IH---GLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLF-ERTRNKNVVSYNAF 349 ++ G Q+H +K+ ++ D V T++V MYS CG A +F E + ++N+V++N+ Sbjct: 307 LYIRFGRQIHGLVVKIEINFDTMVGTALVDMYSKCGCWHWAYGIFIELSGSRNLVTWNSM 366 Query: 350 ISGLLQNGAPRVVLDV 397 I+G++ NG + +++ Sbjct: 367 IAGMMLNGQSDIAVEL 382 Score = 57.4 bits (137), Expect = 2e-06 Identities = 37/121 (30%), Positives = 63/121 (52%), Gaps = 5/121 (4%) Frame = +2 Query: 26 ERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI---HGLQ 196 E + A+ N +ISGFS G EAF F ++ + + +I S+L C + G + Sbjct: 392 EPDSATWNTMISGFSQQGQVVEAFKFFHKMQSAGVIASLKSITSLLRACSALSALQSGKE 451 Query: 197 MHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNK--NVVSYNAFISGLLQN 370 +H I+ +D D ++ T+++ MY CG A ++F + + K + +NA ISG +N Sbjct: 452 IHGHTIRTNIDTDEFISTALIDMYMKCGHSYLARRVFCQFQIKPDDPAFWNAMISGYGRN 511 Query: 371 G 373 G Sbjct: 512 G 512 >ref|XP_006483222.1| PREDICTED: pentatricopeptide repeat-containing protein At2g02750-like [Citrus sinensis] Length = 618 Score = 167 bits (422), Expect = 2e-39 Identities = 84/131 (64%), Positives = 101/131 (77%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI 184 KVF EMP+ NLAS+NA ISGFS NGY REA VF+E VE +PNSVT+AS 
L+ CE++ Sbjct: 91 KVFNEMPDWNLASLNAAISGFSQNGYVREALWVFKEAVVEVFRPNSVTVASALSACESLD 150 Query: 185 HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLL 364 HGLQMHC AIKLGV+ DVYV TS+VT+YSN ++ AT++F T KN+VSYNAF +GLL Sbjct: 151 HGLQMHCLAIKLGVEMDVYVATSLVTIYSNFKEIAVATRVFGETPGKNIVSYNAFFTGLL 210 Query: 365 QNGAPRVVLDV 397 NG P VVL V Sbjct: 211 NNGVPLVVLKV 221 Score = 77.4 bits (189), Expect = 2e-12 Identities = 44/137 (32%), Positives = 81/137 (59%), Gaps = 6/137 (4%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERL--KPNSVTIASVLAGCEN 178 +VF E P +N+ S NA +G +NG VF+++ E L +PNSVT SV++ C + Sbjct: 189 RVFGETPGKNIVSYNAFFTGLLNNGVPLVVLKVFKDMK-ECLSDEPNSVTFISVISACAS 247 Query: 179 VIH---GLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTR-NKNVVSYNA 346 +++ G Q+H +K+ +D V T++V MY CG+L A +F++ + +N++++N Sbjct: 248 LLYLQFGRQVHGLTLKIEKQSDTMVGTALVDMYLKCGRLPCAHNVFQKLKGGRNILTWNT 307 Query: 347 FISGLLQNGAPRVVLDV 397 I+G++ NG +++ Sbjct: 308 MIAGMMLNGRSEKAMEL 324 Score = 56.2 bits (134), Expect = 5e-06 Identities = 32/109 (29%), Positives = 52/109 (47%), Gaps = 3/109 (2%) Frame = +2 Query: 74 NGYFREAFLVFRELGVERLKPNSVTIASVLAGCENV---IHGLQMHCWAIKLGVDADVYV 244 NG+++EA ++ + L P+ T + C + I G +H IK G ++++ Sbjct: 13 NGFYKEALSLYSQQHSASLPPHKFTFPPLFKVCAKLKSSIQGQILHAHLIKTGFSSEIHA 72 Query: 245 LTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLLQNGAPRVVL 391 T++ MY + A K+F + N+ S NA ISG QNG R L Sbjct: 73 ATALTDMYMKLNLVSDALKVFNEMPDWNLASLNAAISGFSQNGYVREAL 121 Score = 56.2 bits (134), Expect = 5e-06 Identities = 36/117 (30%), Positives = 64/117 (54%), Gaps = 5/117 (4%) Frame = +2 Query: 38 ASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVIH---GLQMHCW 208 A+ N++ISGFS G EAF +F ++ + P+ + SVL+ C ++ G + H Sbjct: 338 ATWNSMISGFSQLGMRFEAFKLFEKMQSTGMVPSLKCVTSVLSACADLSALKLGKETHGH 397 Query: 209 AIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNK--NVVSYNAFISGLLQNG 373 I+ ++ D + T++++MY CG+ A + F++ K + +NA ISG +NG Sbjct: 398 VIRADLNKDESMATALISMYMKCGQPSWARRFFDQFEIKPDDPAFWNAMISGYGRNG 454 >ref|XP_006438631.1| hypothetical protein 
CICLE_v10033863mg, partial [Citrus clementina] gi|557540827|gb|ESR51871.1| hypothetical protein CICLE_v10033863mg, partial [Citrus clementina] Length = 592 Score = 167 bits (422), Expect = 2e-39 Identities = 84/131 (64%), Positives = 101/131 (77%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI 184 KVF EMP+ NLAS+NA ISGFS NGY REA VF+E VE +PNSVT+AS L+ CE++ Sbjct: 65 KVFNEMPDWNLASLNAAISGFSQNGYVREALRVFKEAVVEVFRPNSVTVASALSACESLD 124 Query: 185 HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLL 364 HGLQMHC AIKLGV+ DVYV TS+VT+YSN ++ AT++F T KN+VSYNAF +GLL Sbjct: 125 HGLQMHCLAIKLGVEMDVYVATSLVTIYSNFKEIAVATRVFGETPGKNIVSYNAFFTGLL 184 Query: 365 QNGAPRVVLDV 397 NG P VVL V Sbjct: 185 NNGVPLVVLKV 195 Score = 77.8 bits (190), Expect = 1e-12 Identities = 44/137 (32%), Positives = 82/137 (59%), Gaps = 6/137 (4%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERL--KPNSVTIASVLAGCEN 178 +VF E P +N+ S NA +G +NG VF+++ E L +PNSVT SV++ C + Sbjct: 163 RVFGETPGKNIVSYNAFFTGLLNNGVPLVVLKVFKDMK-ECLSDEPNSVTFISVISACAS 221 Query: 179 VIH---GLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTR-NKNVVSYNA 346 +++ G Q+H +K+ +D V T++V MY CG+L A +F++ + ++N++++N Sbjct: 222 LLYLQFGRQVHGLTLKIEKQSDTMVGTALVDMYLKCGRLPCAHNVFQKLKGSRNILTWNT 281 Query: 347 FISGLLQNGAPRVVLDV 397 I+G++ NG +++ Sbjct: 282 MIAGMMLNGRSEKAMEL 298 Score = 57.8 bits (138), Expect = 2e-06 Identities = 37/117 (31%), Positives = 65/117 (55%), Gaps = 5/117 (4%) Frame = +2 Query: 38 ASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVIH---GLQMHCW 208 A+ N++ISGFS G EAF +F ++ + P+ + SVL+ C ++ G + H Sbjct: 312 ATWNSMISGFSQLGMRFEAFKLFEKMQSTGMVPSLKCVTSVLSACADLSALKLGKETHGH 371 Query: 209 AIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNK--NVVSYNAFISGLLQNG 373 AI+ ++ D + T++++MY CG+ A + F++ K + +NA ISG +NG Sbjct: 372 AIRADLNKDESMATALISMYMKCGQPSWARRFFDQFEIKPDDPAFWNAMISGYGRNG 428 >ref|XP_006344284.1| PREDICTED: pentatricopeptide repeat-containing protein At2g02750-like [Solanum tuberosum] Length = 615 Score = 158 bits 
(400), Expect = 6e-37 Identities = 76/131 (58%), Positives = 101/131 (77%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI 184 KVF E+P+ N+AS+NA+ISG S NG +AF +F ++P+SVTIASVL+GC N+ Sbjct: 89 KVFDEIPQPNIASLNAIISGVSQNGCHVDAFRMFGLFSGLLIRPDSVTIASVLSGCVNIN 148 Query: 185 HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLL 364 HG+QMHCW IK+GV+ DVYV+TS+++MY NC +SAT+LF +NKNVV +NAFISG+L Sbjct: 149 HGVQMHCWGIKIGVEMDVYVVTSILSMYLNCVDCVSATRLFGLVKNKNVVCWNAFISGML 208 Query: 365 QNGAPRVVLDV 397 +NG VVLDV Sbjct: 209 RNGVEEVVLDV 219 Score = 65.9 bits (159), Expect = 6e-09 Identities = 42/126 (33%), Positives = 72/126 (57%), Gaps = 4/126 (3%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVL---AGCE 175 ++F + +N+ NA ISG NG VF+++ + +PN VT+ SVL A + Sbjct: 187 RLFGLVKNKNVVCWNAFISGMLRNGVEEVVLDVFKKMLLHE-EPNEVTLVSVLSATANLK 245 Query: 176 NVIHGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLF-ERTRNKNVVSYNAFI 352 NV G Q+H +K+ + A V T++V MYS C + A ++F E N+N++++N+ I Sbjct: 246 NVKFGRQVHGLIVKIELQARTMVGTALVDMYSKCCCWLCAYEIFKELGGNRNLITWNSMI 305 Query: 353 SGLLQN 370 +G++ N Sbjct: 306 AGMMLN 311 Score = 63.9 bits (154), Expect = 2e-08 Identities = 35/103 (33%), Positives = 50/103 (48%), Gaps = 3/103 (2%) Frame = +2 Query: 74 NGYFREAFLVFRELGVERLKPNSVTIASVLAGC---ENVIHGLQMHCWAIKLGVDADVYV 244 NG ++EA ++ +L L P T + C + G +H IK G + DVY Sbjct: 11 NGLYKEAINLYSQLHYSSLSPTKFTFPCLFKACAKLRTIPQGQVLHSHLIKHGFNTDVYA 70 Query: 245 LTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLLQNG 373 TS+ MY + SA K+F+ N+ S NA ISG+ QNG Sbjct: 71 ATSLTDMYMKFALVDSALKVFDEIPQPNIASLNAIISGVSQNG 113 Score = 57.0 bits (136), Expect = 3e-06 Identities = 39/132 (29%), Positives = 70/132 (53%), Gaps = 9/132 (6%) Frame = +2 Query: 5 KVFVEMPERNL----ASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGC 172 ++FVE+ L A+ N++I+GFS EAF FR++ + P+ TI S+L C Sbjct: 319 ELFVELELEGLQPDSAAWNSMITGFSLLQKESEAFKFFRKMLSAGVVPSVKTITSLLMVC 378 Query: 173 ENV---IHGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNK--NVVS 337 ++ G ++H + + 
D +++T+++ MY CG+ A K+F++ K + Sbjct: 379 SSLSSLCFGQEIHGYIFRTETIIDEFIVTAIIDMYMKCGQFPLARKVFDQLEVKYDDPAI 438 Query: 338 YNAFISGLLQNG 373 +N ISG +NG Sbjct: 439 WNVMISGFGRNG 450 >ref|XP_007046203.1| Pentatricopeptide repeat (PPR) superfamily protein, putative [Theobroma cacao] gi|508710138|gb|EOY02035.1| Pentatricopeptide repeat (PPR) superfamily protein, putative [Theobroma cacao] Length = 618 Score = 157 bits (398), Expect = 1e-36 Identities = 77/131 (58%), Positives = 101/131 (77%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI 184 KVF EMP RNLAS+N +ISGF NGY+ EA LVF+E+ +PNS+TIA+VL C+++ Sbjct: 89 KVFAEMPGRNLASLNTMISGFWRNGYWEEALLVFKEMIFGLSRPNSLTIATVLPACQSLE 148 Query: 185 HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLL 364 G+Q H A+KLGV+ DVYV TS++TMYS C +++ ATK+F + NKNVVSYNA +GLL Sbjct: 149 LGMQFHSLAVKLGVELDVYVATSLLTMYSKCEEIVLATKMFVKMTNKNVVSYNALATGLL 208 Query: 365 QNGAPRVVLDV 397 QNG PR+VL+V Sbjct: 209 QNGVPRMVLNV 219 Score = 69.3 bits (168), Expect = 5e-10 Identities = 40/128 (31%), Positives = 74/128 (57%), Gaps = 6/128 (4%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFREL--GVERLKPNSVTIASVLAGCEN 178 K+FV+M +N+ S NA+ +G NG R VF+E+ + +PN+VT+ +V++ C + Sbjct: 187 KMFVKMTNKNVVSYNALATGLLQNGVPRMVLNVFKEMRDSSQEKQPNTVTLVTVMSACAS 246 Query: 179 VIH---GLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLF-ERTRNKNVVSYNA 346 +++ G Q+H +K + + T++V MYS C +F E N+N++++N+ Sbjct: 247 LLYLQFGRQVHGVVMKAEMQFYTMIGTALVDMYSKCRAWRWGYDVFKEMDGNRNLITWNS 306 Query: 347 FISGLLQN 370 I+GL+ N Sbjct: 307 MIAGLMLN 314 Score = 58.9 bits (141), Expect = 7e-07 Identities = 36/125 (28%), Positives = 65/125 (52%), Gaps = 5/125 (4%) Frame = +2 Query: 38 ASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI---HGLQMHCW 208 A+ N++ISGFS G +AF F ++ ++P+ S+L C + G ++H Sbjct: 337 ATWNSMISGFSQLGKGFDAFKYFEKMQSAGVEPSLKCFTSLLPACSVLSALKQGKEIHGH 396 Query: 209 AIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNK--NVVSYNAFISGLLQNGAPR 382 A + G+ + ++ T+++ MY CG A K+F+ +K + +NA ISG +NG Sbjct: 397 
ATRSGISKEEFMATALIDMYMKCGHSSCARKIFDHFESKPDDPAFWNAMISGYGRNGENE 456 Query: 383 VVLDV 397 L++ Sbjct: 457 SALEI 461 >ref|XP_004237212.1| PREDICTED: pentatricopeptide repeat-containing protein At2g02750-like [Solanum lycopersicum] Length = 615 Score = 157 bits (397), Expect = 1e-36 Identities = 75/131 (57%), Positives = 100/131 (76%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI 184 KVF E+P+ N+AS+NA+ISG S NGY +AF +F ++P+SVTIASVL+GC + Sbjct: 89 KVFDEIPQPNIASLNAIISGVSQNGYHVDAFKMFGLFSGLLIRPDSVTIASVLSGCVRID 148 Query: 185 HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLL 364 HG+QMHCW IK+GV+ DVYV+ S+++MY NC +SAT+LF +NKNVV +NAFISG+L Sbjct: 149 HGVQMHCWGIKIGVEMDVYVVASILSMYLNCVDCVSATRLFGLVKNKNVVCWNAFISGML 208 Query: 365 QNGAPRVVLDV 397 +NG VVLDV Sbjct: 209 RNGEEEVVLDV 219 Score = 65.5 bits (158), Expect = 7e-09 Identities = 36/103 (34%), Positives = 51/103 (49%), Gaps = 3/103 (2%) Frame = +2 Query: 74 NGYFREAFLVFRELGVERLKPNSVTIASVLAGCEN---VIHGLQMHCWAIKLGVDADVYV 244 NG ++EA ++ +L L P T + C + G +H IK G + DVY Sbjct: 11 NGLYKEAINLYSQLHYSSLSPTKFTFPCLFKACAKLKFIPQGQILHSHLIKHGFNTDVYA 70 Query: 245 LTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLLQNG 373 TS+ MY G + SA K+F+ N+ S NA ISG+ QNG Sbjct: 71 ATSLTDMYMKFGLVESALKVFDEIPQPNIASLNAIISGVSQNG 113 Score = 61.6 bits (148), Expect = 1e-07 Identities = 39/126 (30%), Positives = 72/126 (57%), Gaps = 4/126 (3%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVL---AGCE 175 ++F + +N+ NA ISG NG VF+++ ++ +PN VT+ VL A + Sbjct: 187 RLFGLVKNKNVVCWNAFISGMLRNGEEEVVLDVFKKMLLDE-EPNEVTLVLVLSATANLK 245 Query: 176 NVIHGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLF-ERTRNKNVVSYNAFI 352 NV G Q+H +K+ + + V T+++ MYS C + A ++F E N+N++++N+ I Sbjct: 246 NVKFGRQVHGLIVKIELQSRTMVGTALLDMYSKCCCWLCAYEIFKELGGNRNLITWNSMI 305 Query: 353 SGLLQN 370 +G++ N Sbjct: 306 AGMMLN 311 >ref|XP_002520126.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223540618|gb|EEF42181.1| pentatricopeptide 
repeat-containing protein, putative [Ricinus communis] Length = 593 Score = 153 bits (386), Expect = 3e-35 Identities = 80/131 (61%), Positives = 93/131 (70%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI 184 KVF EMP+RN AS NA ISGFS G EA +VF+E+ +PNSVTIASVL C++V Sbjct: 192 KVFDEMPDRNQASFNATISGFSQKGCCMEALIVFKEMAFCGFRPNSVTIASVLPACDSVD 251 Query: 185 HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLL 364 +QMHC AIKLGV+ DVYV TS+VT YS CG L ATK+F N+ VVSYNAF+SGLL Sbjct: 252 LSVQMHCCAIKLGVEMDVYVATSLVTTYSGCGHLTLATKVFGEMPNRTVVSYNAFVSGLL 311 Query: 365 QNGAPRVVLDV 397 NG VVL V Sbjct: 312 HNGVTNVVLKV 322 Score = 92.4 bits (228), Expect = 6e-17 Identities = 51/136 (37%), Positives = 80/136 (58%), Gaps = 5/136 (3%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFREL-GVERLKPNSVTIASVLAGCENV 181 KVF EMP R + S NA +SG HNG VF+++ LKPNS+T+ SV+A C + Sbjct: 290 KVFGEMPNRTVVSYNAFVSGLLHNGVTNVVLKVFKDMREYSTLKPNSLTLVSVIAACSTL 349 Query: 182 IH---GLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLF-ERTRNKNVVSYNAF 349 ++ G+Q+H + K + D V TS+V MYS CG A +F E NKN++++N+ Sbjct: 350 LYIQFGMQVHVFLKKTQMGCDTMVGTSLVDMYSKCGYWKWAYNVFNEMNDNKNLITWNSM 409 Query: 350 ISGLLQNGAPRVVLDV 397 I+G++ N + +++ Sbjct: 410 IAGMMLNAQSQNAIEL 425 Score = 60.1 bits (144), Expect = 3e-07 Identities = 35/129 (27%), Positives = 70/129 (54%), Gaps = 5/129 (3%) Frame = +2 Query: 26 ERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVIH---GLQ 196 E + A+ N++ISGF EAF F+++ + + P+ ++ S+LA C ++ G + Sbjct: 435 EPDSATWNSMISGFEQLDKGVEAFKFFKKMQLSGMVPSLKSVTSLLAACASLTALQCGKE 494 Query: 197 MHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFER--TRNKNVVSYNAFISGLLQN 370 +H ++ ++ D ++ T ++ MY CG + ++F++ + K+ +NA ISG +N Sbjct: 495 IHGHVVRTNMNFDEFMATGLIDMYMKCGFSLWGQRVFDQFEIKPKDPAIWNALISGYARN 554 Query: 371 GAPRVVLDV 397 G V +V Sbjct: 555 GENESVFEV 563 Score = 59.7 bits (143), Expect = 4e-07 Identities = 32/103 (31%), Positives = 53/103 (51%), Gaps = 3/103 (2%) Frame = +2 Query: 74 
NGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVIHGLQ---MHCWAIKLGVDADVYV 244 NG+++EA + L + P+ T ++L C + LQ +H IK G + ++Y Sbjct: 114 NGFYKEAISLHSRLHSASIHPHQFTFPALLKACAKLNSPLQAQIIHTHLIKTGFNLNIYT 173 Query: 245 LTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLLQNG 373 T++ +MY L A K+F+ ++N S+NA ISG Q G Sbjct: 174 ATALTSMYMQLALLPDAMKVFDEMPDRNQASFNATISGFSQKG 216 >ref|XP_006395761.1| hypothetical protein EUTSA_v10003832mg [Eutrema salsugineum] gi|557092400|gb|ESQ33047.1| hypothetical protein EUTSA_v10003832mg [Eutrema salsugineum] Length = 618 Score = 146 bits (368), Expect = 3e-33 Identities = 72/131 (54%), Positives = 95/131 (72%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI 184 K+ EMPER +ASVNA +SG NG+ REAF +F E V NSVT+ASVL GC ++ Sbjct: 92 KLLDEMPERGIASVNAAVSGLMENGFTREAFRMFGEARVSGSGTNSVTVASVLGGCVDIE 151 Query: 185 HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLL 364 G+QMHC A+K G + DVYV TS+V+MYS CG+ I A K+FE+ +K+VV++NAFISGL+ Sbjct: 152 RGMQMHCLAMKSGFEMDVYVGTSLVSMYSRCGEWILAAKMFEKVPHKSVVTFNAFISGLM 211 Query: 365 QNGAPRVVLDV 397 +NG P +V V Sbjct: 212 ENGVPHLVPSV 222 Score = 67.0 bits (162), Expect = 3e-09 Identities = 43/139 (30%), Positives = 76/139 (54%), Gaps = 7/139 (5%) Frame = +2 Query: 2 AKVFVEMPERNLASVNAVISGFSHNG---YFREAFLVFRELGVERLKPNSVTIASVLAGC 172 AK+F ++P +++ + NA ISG NG F + R+ E +PN+VT+ + ++ C Sbjct: 189 AKMFEKVPHKSVVTFNAFISGLMENGVPHLVPSVFNLMRKFSSE--EPNAVTLINAISSC 246 Query: 173 E---NVIHGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLF-ERTRNKNVVSY 340 N+ +G Q+H K D V T+++ MYS C SA +F E +N++++ Sbjct: 247 ASLLNLQYGRQIHGLVTKREFRFDTMVGTALIDMYSKCRCWKSAYDVFTEMKDTRNLIAW 306 Query: 341 NAFISGLLQNGAPRVVLDV 397 N+ ISG++ NG V +++ Sbjct: 307 NSAISGMMINGQHEVAVEL 325 Score = 66.6 bits (161), Expect = 3e-09 Identities = 39/125 (31%), Positives = 71/125 (56%), Gaps = 5/125 (4%) Frame = +2 Query: 38 ASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCEN---VIHGLQMHCW 208 A+ N++ISGFS G EAF F+ + + + P+ + S+L+ C + V +G ++H Sbjct: 339 
ATWNSLISGFSQLGKVFEAFKFFQRMLLVVMVPSLKCLTSLLSACSDIWVVKNGKEIHGH 398 Query: 209 AIKLGVDADVYVLTSVVTMYSNCGKLISATKLFE--RTRNKNVVSYNAFISGLLQNGAPR 382 IK + D++V TS++ MY CG +SA ++F+ + K+ V +N ISG ++G Sbjct: 399 VIKATAERDIFVWTSLIDMYMKCGLSLSARRIFDGFEPKPKDPVFWNVMISGYGKHGECE 458 Query: 383 VVLDV 397 +++ Sbjct: 459 SAIEI 463 >ref|XP_003519768.2| PREDICTED: pentatricopeptide repeat-containing protein At2g02750-like [Glycine max] Length = 637 Score = 144 bits (364), Expect = 1e-32 Identities = 78/133 (58%), Positives = 96/133 (72%), Gaps = 2/133 (1%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENV- 181 K F EMP+ N+AS+NA +SGFS NG EA VFR G+ L+PNSVTIA +L G V Sbjct: 104 KAFDEMPQPNVASLNAALSGFSRNGRRGEALRVFRRAGLGPLRPNSVTIACML-GVPRVG 162 Query: 182 -IHGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISG 358 H MHC A+KLGV+ D YV TS+VT Y CG+++SA+K+FE K+VVSYNAF+SG Sbjct: 163 ANHVEMMHCCAVKLGVEFDAYVATSLVTAYCKCGEVVSASKVFEELPVKSVVSYNAFVSG 222 Query: 359 LLQNGAPRVVLDV 397 LLQNG PR+VLDV Sbjct: 223 LLQNGVPRLVLDV 235 Score = 78.6 bits (192), Expect = 8e-13 Identities = 50/141 (35%), Positives = 81/141 (57%), Gaps = 9/141 (6%) Frame = +2 Query: 2 AKVFVEMPERNLASVNAVISGFSHNGYFREAFLVFREL--GVE--RLKPNSVTIASVLAG 169 +KVF E+P +++ S NA +SG NG R VF+E+ G E K NSVT+ SVL+ Sbjct: 202 SKVFEELPVKSVVSYNAFVSGLLQNGVPRLVLDVFKEMMRGEECVECKLNSVTLVSVLSA 261 Query: 170 C---ENVIHGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFE--RTRNKNVV 334 C +++ G Q+H +KL V V+T++V MYS CG SA ++F +N++ Sbjct: 262 CGSLQSIRFGRQVHGVVVKLEAGDGVMVMTALVDMYSKCGFWRSAFEVFTGVEGNRRNLI 321 Query: 335 SYNAFISGLLQNGAPRVVLDV 397 ++N+ I+G++ N +D+ Sbjct: 322 TWNSMIAGMMLNKESERAVDM 342 >ref|XP_007157683.1| hypothetical protein PHAVU_002G089500g [Phaseolus vulgaris] gi|561031098|gb|ESW29677.1| hypothetical protein PHAVU_002G089500g [Phaseolus vulgaris] Length = 628 Score = 144 bits (362), Expect = 2e-32 Identities = 79/132 (59%), Positives = 97/132 (73%), Gaps = 1/132 (0%) Frame = +2 Query: 5 
KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCE-NV 181 KVF EMP+ N+AS+NA +SGFS NG EA VFR +G+ L+PNSVTIA +L V Sbjct: 94 KVFDEMPQPNVASLNAALSGFSLNGRSGEAIRVFRRIGLGPLRPNSVTIACMLGVPHVGV 153 Query: 182 IHGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGL 361 H MHC A+KLGV+ DVYV TS+VT+YS C +L+SATK+FE K+VVSYNAFISGL Sbjct: 154 NHVALMHCCALKLGVEFDVYVATSLVTVYSRCEELVSATKVFEELPVKSVVSYNAFISGL 213 Query: 362 LQNGAPRVVLDV 397 L+NG +VLDV Sbjct: 214 LKNGVFHLVLDV 225 Score = 80.9 bits (198), Expect = 2e-13 Identities = 51/130 (39%), Positives = 76/130 (58%), Gaps = 8/130 (6%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVE---RLKPNSVTIASVLAGC- 172 KVF E+P +++ S NA ISG NG F VFRE+ E K NSVT+ SVL+ C Sbjct: 193 KVFEELPVKSVVSYNAFISGLLKNGVFHLVLDVFREMMREVCLECKLNSVTLVSVLSACG 252 Query: 173 --ENVIHGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLF--ERTRNKNVVSY 340 ++V G Q+H +KL D V V+T++V MY CG SA +F ++N++++ Sbjct: 253 SLQSVRLGRQVHGLIVKLEADDGVMVVTALVDMYLKCGFWHSAFDVFTGAEGNSRNLITW 312 Query: 341 NAFISGLLQN 370 N+ I+G++ N Sbjct: 313 NSMIAGMMLN 322 Score = 60.1 bits (144), Expect = 3e-07 Identities = 35/117 (29%), Positives = 65/117 (55%), Gaps = 5/117 (4%) Frame = +2 Query: 38 ASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCEN---VIHGLQMHCW 208 A+ N++ISGF+ G EAF FRE+ + P + S+++ C + + HG ++H + Sbjct: 345 ATWNSMISGFAQQGVCGEAFKYFREMQSVGVAPCLKIVTSLMSMCADSSMLRHGKEIHGF 404 Query: 209 AIKLGVDADVYVLTSVVTMYSNCGKLISATKLFER--TRNKNVVSYNAFISGLLQNG 373 A++ ++ D ++ T++V MY CG A ++F + + + +NA I G +NG Sbjct: 405 ALRTDINRDDFLATALVDMYMKCGHASWAREVFNQFDAKPDDPAFWNAMIGGYGRNG 461 >ref|XP_002876827.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297322665|gb|EFH53086.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] Length = 611 Score = 139 bits (350), Expect = 4e-31 Identities = 69/131 (52%), Positives = 95/131 (72%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI 184 KV EMPER +ASVNA +SG NG+ R+AF +F + V NSVT+ASVL GC ++ Sbjct: 85 KVLDEMPERGIASVNAAVSGLLENGFSRDAFRMFGDARVSGSGMNSVTVASVLGGCGDIE 144 Query: 185 HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLL 364 G+QMHC A+K G + +VYV TS+V+MYS CG+ I A ++FE+ +K+VV+YNAFISGL+ Sbjct: 145 GGMQMHCLAMKSGFEMEVYVGTSLVSMYSRCGEWILAARMFEKVPHKSVVTYNAFISGLM 204 Query: 365 QNGAPRVVLDV 397 +NG +V +V Sbjct: 205 ENGVMHLVPNV 215 Score = 68.9 bits (167), Expect = 7e-10 Identities = 40/139 (28%), Positives = 75/139 (53%), Gaps = 7/139 (5%) Frame = +2 Query: 2 AKVFVEMPERNLASVNAVISGFSHNGYFR---EAFLVFRELGVERLKPNSVTIASVLAGC 172 A++F ++P +++ + NA ISG NG F + R+ E +PN VT + + C Sbjct: 182 ARMFEKVPHKSVVTYNAFISGLMENGVMHLVPNVFNLMRKFSSE--EPNDVTFVNAITAC 239 Query: 173 ENVI---HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRN-KNVVSY 340 +++ +G Q+H +K D V T+++ MYS C SA +F ++ +N++S+ Sbjct: 240 ASLLNLQYGRQLHGLVMKTEFQFDTMVGTALIDMYSKCRCWKSAYSVFTELKDTRNLISW 299 Query: 341 NAFISGLLQNGAPRVVLDV 397 N+ ISG++ NG +++ Sbjct: 300 NSVISGMMLNGQHETAVEL 318 Score = 67.4 bits (163), Expect = 2e-09 Identities = 49/171 (28%), Positives = 80/171 (46%), Gaps = 41/171 (23%) Frame = +2 Query: 8 VFVEMPE-RNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVT------------ 148 VF E+ + RNL S N+VISG NG A +F +L E LKP+S T Sbjct: 286 VFTELKDTRNLISWNSVISGMMLNGQHETAVELFEQLDSEGLKPDSATWNSLISGFSQLG 345 Query: 149 -----------------------IASVLAGCENVI---HGLQMHCWAIKLGVDADVYVLT 250 + S+L+ C ++ +G ++H IK + D++VLT Sbjct: 346 KVVEAFKFFERMLSVVMVPSLKCLTSLLSACSDIWTLKNGKEIHGHVIKAAAERDIFVLT 405 Query: 251 SVVTMYSNCGKLISATKLFER--TRNKNVVSYNAFISGLLQNGAPRVVLDV 397 S++ MY CG + A ++F+R + K+ V +N ISG ++G +++ Sbjct: 406 SLIDMYMKCGFSLLARRIFDRFEPKPKDPVFWNVMISGYGKHGECESAIEI 456 Score = 55.8 bits (133), Expect = 6e-06 Identities = 28/86 (32%), Positives = 45/86 (52%), Gaps = 3/86 (3%) Frame = +2 
Query: 134 PNSVTIASVLAGCE---NVIHGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKL 304 PN T +L C +V+ G +H +K G DV+ T++V+MY ++ A K+ Sbjct: 27 PNKFTFPPLLKSCAKLGDVVQGRILHAHVVKTGFFVDVFTATALVSMYMKVKQVTDALKV 86 Query: 305 FERTRNKNVVSYNAFISGLLQNGAPR 382 + + + S NA +SGLL+NG R Sbjct: 87 LDEMPERGIASVNAAVSGLLENGFSR 112 >ref|XP_003613604.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355514939|gb|AES96562.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 620 Score = 138 bits (347), Expect = 9e-31 Identities = 72/133 (54%), Positives = 95/133 (71%), Gaps = 2/133 (1%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAG--CEN 178 ++F EMP+ + + NAV+SG S NG +A +FR++G ++PNSVTI S+L+ +N Sbjct: 92 ELFDEMPQPTITAFNAVLSGLSRNGPRGQAVWLFRQIGFWNIRPNSVTIVSLLSARDVKN 151 Query: 179 VIHGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISG 358 H Q+HC A KLGV+ DVYV TS+VT YS CG L+S+ K+FE R KNVV+YNAF+SG Sbjct: 152 QSHVQQVHCLACKLGVEYDVYVSTSLVTAYSKCGVLVSSNKVFENLRVKNVVTYNAFMSG 211 Query: 359 LLQNGAPRVVLDV 397 LLQNG RVV DV Sbjct: 212 LLQNGFHRVVFDV 224 Score = 87.0 bits (214), Expect = 2e-15 Identities = 47/126 (37%), Positives = 77/126 (61%), Gaps = 4/126 (3%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVE-RLKPNSVTIASVLAGC--- 172 KVF + +N+ + NA +SG NG+ R F VF+++ + KPN VT+ SV++ C Sbjct: 192 KVFENLRVKNVVTYNAFMSGLLQNGFHRVVFDVFKDMTMNLEEKPNKVTLVSVVSACATL 251 Query: 173 ENVIHGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFI 352 N+ G Q+H ++KL V V+TS+V MYS CG SA +F R+ +N++++N+ I Sbjct: 252 SNIRLGKQVHGLSMKLEACDHVMVVTSLVDMYSKCGCWGSAFDVFSRSEKRNLITWNSMI 311 Query: 353 SGLLQN 370 +G++ N Sbjct: 312 AGMMMN 317 >gb|ABK28160.1| unknown [Arabidopsis thaliana] Length = 614 Score = 137 bits (345), Expect = 2e-30 Identities = 67/131 (51%), Positives = 94/131 (71%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI 184 KV EMPER +ASVNA +SG NG+ R+AF +F + V NSVT+ASVL GC ++ Sbjct: 87 
KVLDEMPERGIASVNAAVSGLLENGFCRDAFRMFGDARVSGSGMNSVTVASVLGGCGDIE 146 Query: 185 HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLL 364 G+Q+HC A+K G + +VYV TS+V+MYS CG+ + A ++FE+ +K+VV+YNAFISGL+ Sbjct: 147 GGMQLHCLAMKSGFEMEVYVGTSLVSMYSRCGEWVLAARMFEKVPHKSVVTYNAFISGLM 206 Query: 365 QNGAPRVVLDV 397 +NG +V V Sbjct: 207 ENGVMNLVPSV 217 Score = 65.5 bits (158), Expect = 7e-09 Identities = 40/139 (28%), Positives = 74/139 (53%), Gaps = 7/139 (5%) Frame = +2 Query: 2 AKVFVEMPERNLASVNAVISGFSHNGYFR---EAFLVFRELGVERLKPNSVTIASVLAGC 172 A++F ++P +++ + NA ISG NG F + R+ E +PN VT + + C Sbjct: 184 ARMFEKVPHKSVVTYNAFISGLMENGVMNLVPSVFNLMRKFSSE--EPNDVTFVNAITAC 241 Query: 173 E---NVIHGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRN-KNVVSY 340 N+ +G Q+H +K + V T+++ MYS C SA +F ++ +N++S+ Sbjct: 242 ASLLNLQYGRQLHGLVMKKEFQFETMVGTALIDMYSKCRCWKSAYIVFTELKDTRNLISW 301 Query: 341 NAFISGLLQNGAPRVVLDV 397 N+ ISG++ NG +++ Sbjct: 302 NSVISGMMINGQHETAVEL 320 Score = 65.5 bits (158), Expect = 7e-09 Identities = 39/125 (31%), Positives = 69/125 (55%), Gaps = 5/125 (4%) Frame = +2 Query: 38 ASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI---HGLQMHCW 208 A+ N++ISGFS G EAF F + + P+ + S+L+ C ++ +G ++H Sbjct: 334 ATWNSLISGFSQLGKVIEAFKFFERMLSVVMVPSLKCLTSLLSACSDIWTLKNGKEIHGH 393 Query: 209 AIKLGVDADVYVLTSVVTMYSNCGKLISATKLFER--TRNKNVVSYNAFISGLLQNGAPR 382 IK + D++VLTS++ MY CG A ++F+R + K+ V +N ISG ++G Sbjct: 394 VIKAAAERDIFVLTSLIDMYMKCGLSSWARRIFDRFEPKPKDPVFWNVMISGYGKHGECE 453 Query: 383 VVLDV 397 +++ Sbjct: 454 SAIEI 458 >gb|ABE65422.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 613 Score = 137 bits (345), Expect = 2e-30 Identities = 67/131 (51%), Positives = 94/131 (71%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI 184 KV EMPER +ASVNA +SG NG+ R+AF +F + V NSVT+ASVL GC ++ Sbjct: 87 KVLDEMPERGIASVNAAVSGLLENGFCRDAFRMFGDARVSGSGMNSVTVASVLGGCGDIE 146 Query: 185 HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLL 364 G+Q+HC A+K G + +VYV 
TS+V+MYS CG+ + A ++FE+ +K+VV+YNAFISGL+ Sbjct: 147 GGMQLHCLAMKSGFEMEVYVGTSLVSMYSRCGEWVLAARMFEKVPHKSVVTYNAFISGLM 206 Query: 365 QNGAPRVVLDV 397 +NG +V V Sbjct: 207 ENGVMNLVPSV 217 Score = 65.5 bits (158), Expect = 7e-09 Identities = 40/139 (28%), Positives = 74/139 (53%), Gaps = 7/139 (5%) Frame = +2 Query: 2 AKVFVEMPERNLASVNAVISGFSHNGYFR---EAFLVFRELGVERLKPNSVTIASVLAGC 172 A++F ++P +++ + NA ISG NG F + R+ E +PN VT + + C Sbjct: 184 ARMFEKVPHKSVVTYNAFISGLMENGVMNLVPSVFNLMRKFSSE--EPNDVTFVNAITAC 241 Query: 173 E---NVIHGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRN-KNVVSY 340 N+ +G Q+H +K + V T+++ MYS C SA +F ++ +N++S+ Sbjct: 242 ASLLNLQYGRQLHGLVMKKEFQFETMVGTALIDMYSKCRCWKSAYIVFTELKDTRNLISW 301 Query: 341 NAFISGLLQNGAPRVVLDV 397 N+ ISG++ NG +++ Sbjct: 302 NSVISGMMINGQHETAVEL 320 Score = 64.3 bits (155), Expect = 2e-08 Identities = 38/125 (30%), Positives = 69/125 (55%), Gaps = 5/125 (4%) Frame = +2 Query: 38 ASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI---HGLQMHCW 208 A+ N++ISGFS G EAF F + + P+ + S+L+ C ++ +G ++H Sbjct: 334 ATWNSLISGFSQLGKVIEAFKFFERMLSVVMVPSLKCLTSLLSACSDIWTLKNGKEIHGH 393 Query: 209 AIKLGVDADVYVLTSVVTMYSNCGKLISATKLFER--TRNKNVVSYNAFISGLLQNGAPR 382 IK + D++VLTS++ MY CG A ++F+R + ++ V +N ISG ++G Sbjct: 394 VIKAAAERDIFVLTSLIDMYMKCGLSSWARRIFDRFEPKPRDPVFWNVMISGYGKHGECE 453 Query: 383 VVLDV 397 +++ Sbjct: 454 SAIEI 458 >ref|NP_178378.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|218546778|sp|Q1PFA6.2|PP144_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g02750 gi|2947066|gb|AAC05347.1| hypothetical protein [Arabidopsis thaliana] gi|330250526|gb|AEC05620.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 613 Score = 137 bits (345), Expect = 2e-30 Identities = 67/131 (51%), Positives = 94/131 (71%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI 184 KV EMPER +ASVNA +SG NG+ R+AF +F + V NSVT+ASVL GC ++ Sbjct: 87 
KVLDEMPERGIASVNAAVSGLLENGFCRDAFRMFGDARVSGSGMNSVTVASVLGGCGDIE 146 Query: 185 HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLL 364 G+Q+HC A+K G + +VYV TS+V+MYS CG+ + A ++FE+ +K+VV+YNAFISGL+ Sbjct: 147 GGMQLHCLAMKSGFEMEVYVGTSLVSMYSRCGEWVLAARMFEKVPHKSVVTYNAFISGLM 206 Query: 365 QNGAPRVVLDV 397 +NG +V V Sbjct: 207 ENGVMNLVPSV 217 Score = 65.5 bits (158), Expect = 7e-09 Identities = 40/139 (28%), Positives = 74/139 (53%), Gaps = 7/139 (5%) Frame = +2 Query: 2 AKVFVEMPERNLASVNAVISGFSHNGYFR---EAFLVFRELGVERLKPNSVTIASVLAGC 172 A++F ++P +++ + NA ISG NG F + R+ E +PN VT + + C Sbjct: 184 ARMFEKVPHKSVVTYNAFISGLMENGVMNLVPSVFNLMRKFSSE--EPNDVTFVNAITAC 241 Query: 173 E---NVIHGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRN-KNVVSY 340 N+ +G Q+H +K + V T+++ MYS C SA +F ++ +N++S+ Sbjct: 242 ASLLNLQYGRQLHGLVMKKEFQFETMVGTALIDMYSKCRCWKSAYIVFTELKDTRNLISW 301 Query: 341 NAFISGLLQNGAPRVVLDV 397 N+ ISG++ NG +++ Sbjct: 302 NSVISGMMINGQHETAVEL 320 Score = 65.5 bits (158), Expect = 7e-09 Identities = 39/125 (31%), Positives = 69/125 (55%), Gaps = 5/125 (4%) Frame = +2 Query: 38 ASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI---HGLQMHCW 208 A+ N++ISGFS G EAF F + + P+ + S+L+ C ++ +G ++H Sbjct: 334 ATWNSLISGFSQLGKVIEAFKFFERMLSVVMVPSLKCLTSLLSACSDIWTLKNGKEIHGH 393 Query: 209 AIKLGVDADVYVLTSVVTMYSNCGKLISATKLFER--TRNKNVVSYNAFISGLLQNGAPR 382 IK + D++VLTS++ MY CG A ++F+R + K+ V +N ISG ++G Sbjct: 394 VIKAAAERDIFVLTSLIDMYMKCGLSSWARRIFDRFEPKPKDPVFWNVMISGYGKHGECE 453 Query: 383 VVLDV 397 +++ Sbjct: 454 SAIEI 458 >ref|XP_006290425.1| hypothetical protein CARUB_v10019221mg [Capsella rubella] gi|482559132|gb|EOA23323.1| hypothetical protein CARUB_v10019221mg [Capsella rubella] Length = 611 Score = 133 bits (334), Expect = 3e-29 Identities = 66/130 (50%), Positives = 91/130 (70%) Frame = +2 Query: 8 VFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVIH 187 V EMP+R++ASVNA +SG NG+ R+AF +F + V NSVT+ASVL C ++ Sbjct: 86 VLDEMPDRSIASVNAAVSGLLENGFCRDAFRMFGDARVSGSGTNSVTVASVLGSCGDIEG 145 Query: 188 
GLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLLQ 367 G Q+HC A+K G D +VYV TS+V+MYS CG+ + A K+FE +K+VV+YNAFISGL++ Sbjct: 146 GRQLHCLAMKSGFDMEVYVGTSLVSMYSRCGEWVLAAKMFENVPHKSVVTYNAFISGLME 205 Query: 368 NGAPRVVLDV 397 NG +V V Sbjct: 206 NGVMHLVPSV 215 Score = 68.9 bits (167), Expect = 7e-10 Identities = 42/139 (30%), Positives = 74/139 (53%), Gaps = 7/139 (5%) Frame = +2 Query: 2 AKVFVEMPERNLASVNAVISGFSHNGYFR---EAFLVFRELGVERLKPNSVTIASVLAGC 172 AK+F +P +++ + NA ISG NG F + R+ E PN VT + + C Sbjct: 182 AKMFENVPHKSVVTYNAFISGLMENGVMHLVPSVFNLMRKFSSE--VPNPVTFINAITAC 239 Query: 173 ENVI---HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRN-KNVVSY 340 +++ +G Q+H +K DV V T+++ MYS C SA +F ++ +N++S+ Sbjct: 240 ASLLNLQYGRQIHGLVMKKNFSFDVMVGTALIDMYSKCRCWKSAYSVFTELKDTRNLISW 299 Query: 341 NAFISGLLQNGAPRVVLDV 397 N+ ISG++ NG +++ Sbjct: 300 NSVISGMMINGQHETAVEL 318 Score = 67.4 bits (163), Expect = 2e-09 Identities = 40/125 (32%), Positives = 71/125 (56%), Gaps = 5/125 (4%) Frame = +2 Query: 38 ASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI---HGLQMHCW 208 A+ N++ISGFS G EAF F + + P+ + S+L+ C ++ +G ++H + Sbjct: 332 ATWNSLISGFSQLGKVVEAFKFFETMLLVVTIPSLKCLTSLLSACSDIWALKNGKEIHGY 391 Query: 209 AIKLGVDADVYVLTSVVTMYSNCGKLISATKLFER--TRNKNVVSYNAFISGLLQNGAPR 382 IK + D+YV TS++ MY CG + A ++F+R + K+ V +NA ISG ++G Sbjct: 392 VIKATAERDIYVSTSLIDMYMKCGFSLWARRIFDRFEPKPKDPVFWNAMISGYGKHGEYE 451 Query: 383 VVLDV 397 +++ Sbjct: 452 SAIEI 456 >ref|XP_004490048.1| PREDICTED: pentatricopeptide repeat-containing protein At2g02750-like [Cicer arietinum] Length = 863 Score = 132 bits (331), Expect = 6e-29 Identities = 67/133 (50%), Positives = 94/133 (70%), Gaps = 2/133 (1%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAG--CEN 178 K+F EM + + + NA +SG S NG +A +FR++G+ L+PNSVTIAS+L +N Sbjct: 99 KLFDEMSQPTITAFNAALSGLSRNGPPGQAIELFRQIGLRTLRPNSVTIASLLTARDTKN 158 Query: 179 VIHGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISG 358 H Q+HC A+KLGV+ DVYV TS++T YS G L+S+ +FE + 
+NVV+YNAF+SG Sbjct: 159 PSHVHQVHCLALKLGVENDVYVSTSLITCYSKSGNLVSSKNVFENSHVRNVVTYNAFMSG 218 Query: 359 LLQNGAPRVVLDV 397 LL NG PR+V+DV Sbjct: 219 LLPNGFPRIVVDV 231 Score = 78.6 bits (192), Expect = 8e-13 Identities = 46/125 (36%), Positives = 72/125 (57%), Gaps = 4/125 (3%) Frame = +2 Query: 8 VFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVE-RLKPNSVTIASVLAGCENVI 184 VF RN+ + NA +SG NG+ R VF+++ + KPN VT SV + C N++ Sbjct: 200 VFENSHVRNVVTYNAFMSGLLPNGFPRIVVDVFKDMMMSLEEKPNMVTFVSVFSACANLL 259 Query: 185 H---GLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFIS 355 + G Q+H ++KL V V+TS+V MYS CG SA +F +N++++N+ I+ Sbjct: 260 NIRLGKQVHGLSMKLEACDHVMVVTSLVDMYSKCGCWRSAFDVFNEGEKRNLITWNSMIA 319 Query: 356 GLLQN 370 GL+ N Sbjct: 320 GLMMN 324 Score = 61.2 bits (147), Expect = 1e-07 Identities = 37/117 (31%), Positives = 64/117 (54%), Gaps = 5/117 (4%) Frame = +2 Query: 38 ASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCEN---VIHGLQMHCW 208 A+ N +I GF+ G F EAF FR++ + P + S+L+ C + + G ++H + Sbjct: 347 ATWNTLIGGFAQKGLFLEAFKYFRKMQYFGVVPCLKIVTSILSVCADSSVLRSGKEIHGY 406 Query: 209 AIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNK--NVVSYNAFISGLLQNG 373 A+++ VD D + T++V MY CG + A +F++ K + +NA I G +NG Sbjct: 407 AVRICVDMDEFFATALVDMYMKCGCVSLARCIFDQFDEKPDDPAFWNAMIGGYGRNG 463 >gb|EXB63826.1| hypothetical protein L484_021099 [Morus notabilis] Length = 619 Score = 131 bits (330), Expect = 8e-29 Identities = 68/131 (51%), Positives = 92/131 (70%) Frame = +2 Query: 5 KVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI 184 KVF EMP R+LAS+NAV+SG + NG+F EA VFR +PNSVT+AS+L+ C +V Sbjct: 90 KVFDEMPHRSLASMNAVLSGLAQNGHFWEALDVFRSGRYGDFRPNSVTLASLLSTCGSVG 149 Query: 185 HGLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLL 364 ++CWA KLGV+ DVYV T+++++YS ++ A K+F NKN+VSYNA++SGLL Sbjct: 150 FAEILYCWATKLGVEKDVYVATAILSVYSKFKDMVLAAKVFNGMDNKNLVSYNAYVSGLL 209 Query: 365 QNGAPRVVLDV 397 QNG VL V Sbjct: 210 QNGFSLEVLAV 220 Score = 86.7 bits (213), Expect = 3e-15 Identities = 51/129 (39%), Positives = 81/129 (62%), Gaps = 
6/129 (4%) Frame = +2 Query: 2 AKVFVEMPERNLASVNAVISGFSHNGYFREAFLVFRELGVERL--KPNSVTIASVLAGCE 175 AKVF M +NL S NA +SG NG+ E VF+++ +E L +P+ VT+ S ++ C Sbjct: 187 AKVFNGMDNKNLVSYNAYVSGLLQNGFSLEVLAVFKQM-MEVLDERPSHVTLVSAISACA 245 Query: 176 NVIH---GLQMHCWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFER-TRNKNVVSYN 343 +++ G Q+H A+K G+ DV V T++V MYS CG A+ +FE + ++N+V++N Sbjct: 246 RLLYVLLGSQVHKLAMKFGLARDVMVGTALVDMYSKCGCWWRASNVFEELSGDRNLVTWN 305 Query: 344 AFISGLLQN 370 A ISG++ N Sbjct: 306 AIISGMMLN 314 Score = 56.6 bits (135), Expect = 3e-06 Identities = 34/127 (26%), Positives = 65/127 (51%), Gaps = 5/127 (3%) Frame = +2 Query: 32 NLASVNAVISGFSHNGYFREAFLVFRELGVERLKPNSVTIASVLAGCENVI---HGLQMH 202 +LA N++I GFS G EAF F+ + + PNS ++ S+L+ C + G ++H Sbjct: 336 DLAIWNSMIGGFSQLGKGTEAFKYFKLMQCYGISPNSKSMTSMLSACSGLSALRSGKEIH 395 Query: 203 CWAIKLGVDADVYVLTSVVTMYSNCGKLISATKLFE--RTRNKNVVSYNAFISGLLQNGA 376 +A ++ D + T+++ +Y CG A ++F + ++ +N ISG +NG Sbjct: 396 GYATRMHAKIDTVMATALIDLYMKCGHSSWARRVFYWFNVKPEDPAFWNVMISGYGRNGD 455 Query: 377 PRVVLDV 397 +++ Sbjct: 456 DESAVEI 462 Score = 56.2 bits (134), Expect = 5e-06 Identities = 33/109 (30%), Positives = 51/109 (46%), Gaps = 3/109 (2%) Frame = +2 Query: 80 YFREAFLVFRELGVERLKPNSVTIASVLAGCENV---IHGLQMHCWAIKLGVDADVYVLT 250 Y + FL + +P+ T +L C + HG +H +K G +D Y T Sbjct: 14 YKKALFLYSKSSSSSSRRPHRFTFPPLLKACAKLQAASHGQMLHTHLLKTGFSSDSYAAT 73 Query: 251 SVVTMYSNCGKLISATKLFERTRNKNVVSYNAFISGLLQNGAPRVVLDV 397 ++ MY N A K+F+ ++++ S NA +SGL QNG LDV Sbjct: 74 ALTGMYMNIRLFRDALKVFDEMPHRSLASMNAVLSGLAQNGHFWEALDV 122