BLASTX nr result
ID: Glycyrrhiza36_contig00023547
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza36_contig00023547
         (406 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

XP_012573667.1 PREDICTED: pentatricopeptide repeat-containing pr...   225   5e-68
KYP42158.1 Pentatricopeptide repeat-containing protein At3g12770...   202   4e-60
XP_007155935.1 hypothetical protein PHAVU_003G244800g [Phaseolus...   196   5e-57
KHN26464.1 Pentatricopeptide repeat-containing protein [Glycine ...   190   6e-56
XP_003551036.1 PREDICTED: pentatricopeptide repeat-containing pr...   190   8e-55
XP_014506479.1 PREDICTED: pentatricopeptide repeat-containing pr...   189   2e-54
XP_017410302.1 PREDICTED: putative pentatricopeptide repeat-cont...   186   2e-53
XP_016206998.1 PREDICTED: pentatricopeptide repeat-containing pr...   181   2e-51
XP_009366958.1 PREDICTED: pentatricopeptide repeat-containing pr...   181   3e-51
XP_015954901.1 PREDICTED: pentatricopeptide repeat-containing pr...   181   3e-51
XP_002277337.2 PREDICTED: pentatricopeptide repeat-containing pr...   180   4e-51
KRH04632.1 hypothetical protein GLYMA_17G175800 [Glycine max]         176   6e-51
GAV80726.1 PPR domain-containing protein/PPR_1 domain-containing...   179   7e-51
ONH96536.1 hypothetical protein PRUPE_7G135300 [Prunus persica]       177   5e-50
XP_008344308.1 PREDICTED: pentatricopeptide repeat-containing pr...   177   8e-50
XP_008340659.1 PREDICTED: pentatricopeptide repeat-containing pr...   179   8e-50
XP_007047218.1 PREDICTED: pentatricopeptide repeat-containing pr...   176   1e-49
XP_015890127.1 PREDICTED: pentatricopeptide repeat-containing pr...
173 2e-48 KDP31910.1 hypothetical protein JCGZ_12371 [Jatropha curcas] 162 4e-46 XP_010263222.1 PREDICTED: pentatricopeptide repeat-containing pr... 167 5e-46 >XP_012573667.1 PREDICTED: pentatricopeptide repeat-containing protein At3g16610-like [Cicer arietinum] Length = 653 Score = 225 bits (574), Expect = 5e-68 Identities = 109/134 (81%), Positives = 117/134 (87%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAGAALLTLYARC R D E VFR MD SDVVTWNAMILG+ID GLG LA ECFREMQ G Sbjct: 311 SAGAALLTLYARCNRLHDAEKVFRVMDDSDVVTWNAMILGYIDTGLGRLAFECFREMQ-G 369 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 RGV ID TTISTILPVCDLRCGKQIHAYV+KS+FDC V VYNAL+HMYSICGCI+YA S+ Sbjct: 370 RGVRIDQTTISTILPVCDLRCGKQIHAYVRKSNFDCAVGVYNALIHMYSICGCISYAYSL 429 Query: 363 FSTMAKKDLVTWNT 404 FSTM KKDL++WNT Sbjct: 430 FSTMVKKDLISWNT 443 Score = 55.5 bits (132), Expect = 3e-06 Identities = 39/119 (32%), Positives = 59/119 (49%), Gaps = 7/119 (5%) Frame = +3 Query: 66 VFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNIDHTTISTILPVC---- 233 VF + +V++W +I G+ +G +ALE FR+M + D +S IL C Sbjct: 227 VFEQIKDPNVISWTILISGYSGVGKHVVALEIFRDMVNVGMIIPDVDALSGILVSCKFLG 286 Query: 234 DLRCGKQIHAYVKKSDFDCMV---PVYNALVHMYSICGCIAYACSVFSTMAKKDLVTWN 401 +L G++IH Y K+ F V AL+ +Y+ C + A VF M D+VTWN Sbjct: 287 NLTSGREIHGYGLKNGFRNDVFYKSAGAALLTLYARCNRLHDAEKVFRVMDDSDVVTWN 345 >KYP42158.1 Pentatricopeptide repeat-containing protein At3g12770 family [Cajanus cajan] Length = 549 Score = 202 bits (515), Expect = 4e-60 Identities = 102/134 (76%), Positives = 109/134 (81%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAGAALLTLYA C R +VFRGMDK DVVTWNAMI G +D GLGHLALECFREMQ G Sbjct: 241 SAGAALLTLYAGCGRVDRAYSVFRGMDKGDVVTWNAMIFGLVDAGLGHLALECFREMQ-G 299 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 RGV ID TT+STILPVCDLR GKQ+HAYV K +F C VPV+NALVHMYSI GCI YA SV Sbjct: 300 
RGVEIDGTTVSTILPVCDLRRGKQLHAYVSKWNFSCGVPVFNALVHMYSIRGCIVYAHSV 359 Query: 363 FSTMAKKDLVTWNT 404 FS M KDLV+WNT Sbjct: 360 FSMMENKDLVSWNT 373 Score = 67.4 bits (163), Expect = 2e-10 Identities = 48/139 (34%), Positives = 72/139 (51%), Gaps = 10/139 (7%) Frame = +3 Query: 15 ALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVN 194 +LL +YA+C E VF M + DV +WN+++ G++ GL A+E F M++ Sbjct: 141 SLLGMYAKCGDMASAERVFGEMPQRDVFSWNSVMSGYVCNGLPDRAVEVFGVMKE--ECQ 198 Query: 195 IDHTTISTILP------VCDLRCGKQIHAYVKKSDFDCMVPVYN----ALVHMYSICGCI 344 D T +T++ + DL G++IH Y K C Y AL+ +Y+ CG + Sbjct: 199 PDVVTWNTLMDAYWKWFLGDLASGREIHGYGLK--IMCGDVFYRSAGAALLTLYAGCGRV 256 Query: 345 AYACSVFSTMAKKDLVTWN 401 A SVF M K D+VTWN Sbjct: 257 DRAYSVFRGMDKGDVVTWN 275 >XP_007155935.1 hypothetical protein PHAVU_003G244800g [Phaseolus vulgaris] ESW27929.1 hypothetical protein PHAVU_003G244800g [Phaseolus vulgaris] Length = 619 Score = 196 bits (497), Expect = 5e-57 Identities = 96/134 (71%), Positives = 110/134 (82%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAGAALL LYA C R + VFR MDKSDVVTWNAMI G +D+GLG LALECFREMQ+ Sbjct: 311 SAGAALLALYAGCGRLDRADVVFRRMDKSDVVTWNAMIFGLVDVGLGDLALECFREMQE- 369 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 RG+ ID TT++TILPVCDLRCGK++HAYV+K ++PV NALVHMYSI GCIAYAC+V Sbjct: 370 RGLRIDGTTVATILPVCDLRCGKEMHAYVRKCCLSSVIPVNNALVHMYSIRGCIAYACAV 429 Query: 363 FSTMAKKDLVTWNT 404 FSTM KDLV+WNT Sbjct: 430 FSTMVAKDLVSWNT 443 Score = 60.8 bits (146), Expect = 3e-08 Identities = 42/134 (31%), Positives = 66/134 (49%), Gaps = 6/134 (4%) Frame = +3 Query: 18 LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197 L+ Y R + + VF ++ +V++W +I G+ +G H++L FREM V+ Sbjct: 212 LMDAYCRMGKCCEAWRVFGEIEIPNVISWTILISGYASVGRHHVSLGIFREMVNVGMVSP 271 Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMV--PVYNALVHMYSICGCIAYACS 359 D +S +L C L G +IH Y K + + AL+ +Y+ CG + A Sbjct: 272 DVDALSGVLVSCRALGALASGMEIHGYGLKIMYGDVFYRSAGAALLALYAGCGRLDRADV 
331 Query: 360 VFSTMAKKDLVTWN 401 VF M K D+VTWN Sbjct: 332 VFRRMDKSDVVTWN 345 >KHN26464.1 Pentatricopeptide repeat-containing protein [Glycine soja] Length = 473 Score = 190 bits (482), Expect = 6e-56 Identities = 96/134 (71%), Positives = 109/134 (81%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAGAALL LYA R +NVF MDKSDVVTWNAMI G +D+GL LAL+CFREMQ G Sbjct: 165 SAGAALLMLYAGWGRLDCADNVFWRMDKSDVVTWNAMIFGLVDVGLVDLALDCFREMQ-G 223 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 RGV ID TIS+ILPVCDLRCGK+IHAYV+K +F ++PVYNAL+HMYSI GCIAYA SV Sbjct: 224 RGVGIDGRTISSILPVCDLRCGKEIHAYVRKCNFSGVIPVYNALIHMYSIRGCIAYAYSV 283 Query: 363 FSTMAKKDLVTWNT 404 FSTM +DLV+WNT Sbjct: 284 FSTMVARDLVSWNT 297 Score = 55.1 bits (131), Expect = 3e-06 Identities = 41/136 (30%), Positives = 66/136 (48%), Gaps = 8/136 (5%) Frame = +3 Query: 18 LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197 ++ Y R + + VF ++ +V++W +I G+ +G ++L FR+M V+ Sbjct: 66 VMDAYCRMGQCCEASRVFGEIEDPNVISWTILISGYAGVGRHDVSLGIFRQMVNVGMVSP 125 Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMVPVYN----ALVHMYSICGCIAYA 353 D +S +L C L GK+IH Y K C Y AL+ +Y+ G + A Sbjct: 126 DVDALSGVLVSCRHLGALASGKEIHGYGLK--IMCGDVFYRSAGAALLMLYAGWGRLDCA 183 Query: 354 CSVFSTMAKKDLVTWN 401 +VF M K D+VTWN Sbjct: 184 DNVFWRMDKSDVVTWN 199 >XP_003551036.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39350-like [Glycine max] Length = 619 Score = 190 bits (482), Expect = 8e-55 Identities = 96/134 (71%), Positives = 109/134 (81%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAGAALL LYA R +NVF MDKSDVVTWNAMI G +D+GL LAL+CFREMQ G Sbjct: 311 SAGAALLMLYAGWGRLDCADNVFWRMDKSDVVTWNAMIFGLVDVGLVDLALDCFREMQ-G 369 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 RGV ID TIS+ILPVCDLRCGK+IHAYV+K +F ++PVYNAL+HMYSI GCIAYA SV Sbjct: 370 RGVGIDGRTISSILPVCDLRCGKEIHAYVRKCNFSGVIPVYNALIHMYSIRGCIAYAYSV 429 Query: 363 FSTMAKKDLVTWNT 
404 FSTM +DLV+WNT Sbjct: 430 FSTMVARDLVSWNT 443 Score = 55.1 bits (131), Expect = 3e-06 Identities = 41/136 (30%), Positives = 66/136 (48%), Gaps = 8/136 (5%) Frame = +3 Query: 18 LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197 ++ Y R + + VF ++ +V++W +I G+ +G ++L FR+M V+ Sbjct: 212 VMDAYCRMGQCCEASRVFGEIEDPNVISWTILISGYAGVGRHDVSLGIFRQMVNVGMVSP 271 Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMVPVYN----ALVHMYSICGCIAYA 353 D +S +L C L GK+IH Y K C Y AL+ +Y+ G + A Sbjct: 272 DVDALSGVLVSCRHLGALASGKEIHGYGLK--IMCGDVFYRSAGAALLMLYAGWGRLDCA 329 Query: 354 CSVFSTMAKKDLVTWN 401 +VF M K D+VTWN Sbjct: 330 DNVFWRMDKSDVVTWN 345 >XP_014506479.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39350-like [Vigna radiata var. radiata] Length = 619 Score = 189 bits (479), Expect = 2e-54 Identities = 93/134 (69%), Positives = 108/134 (80%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAGAALLTLYA C R + VFR MDKSDVVTWNAMI G +D+G G LALECFR+MQ+ Sbjct: 311 SAGAALLTLYAGCGRLDRADIVFRRMDKSDVVTWNAMIFGLVDVGSGDLALECFRKMQE- 369 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 RGV ID TT+ST+LPVCDLRCGK++HAYV+K ++PV NALVHMYS+ GCIAYA +V Sbjct: 370 RGVRIDGTTVSTVLPVCDLRCGKEMHAYVRKCCLSSVIPVNNALVHMYSVRGCIAYAFAV 429 Query: 363 FSTMAKKDLVTWNT 404 FS M KDLV+WNT Sbjct: 430 FSAMLAKDLVSWNT 443 Score = 54.7 bits (130), Expect = 5e-06 Identities = 39/134 (29%), Positives = 64/134 (47%), Gaps = 6/134 (4%) Frame = +3 Query: 18 LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197 L+ Y R + + F ++ +V++W ++ G+ G ++L FR+M V+ Sbjct: 212 LMDAYCRMGKCCEAWRAFGEIEVPNVISWTILMSGYASAGRHDVSLGIFRKMMNVGMVSP 271 Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMV--PVYNALVHMYSICGCIAYACS 359 D T+S +L C L G +IH Y K + + AL+ +Y+ CG + A Sbjct: 272 DVDTLSGMLVSCRCLGALASGMEIHGYGLKIMYGDVFYRSAGAALLTLYAGCGRLDRADI 331 Query: 360 VFSTMAKKDLVTWN 401 VF M K D+VTWN Sbjct: 332 VFRRMDKSDVVTWN 345 >XP_017410302.1 PREDICTED: putative pentatricopeptide 
repeat-containing protein At1g17630 [Vigna angularis] KOM29543.1 hypothetical protein LR48_Vigan727s000200 [Vigna angularis] BAT75575.1 hypothetical protein VIGAN_01345300 [Vigna angularis var. angularis] Length = 619 Score = 186 bits (473), Expect = 2e-53 Identities = 94/134 (70%), Positives = 106/134 (79%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAGAALLTLYA C R + VFR MDKSDVVTWNAMI +D+G G LALECFREMQ+ Sbjct: 311 SAGAALLTLYAGCGRLDRADIVFRRMDKSDVVTWNAMIFCLVDVGSGDLALECFREMQE- 369 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 RGV ID TT+STILPVCDLRCGK++HAYV+K ++PV NALVHMYS+ GCIAYA V Sbjct: 370 RGVRIDGTTVSTILPVCDLRCGKEMHAYVRKCCLSSVIPVNNALVHMYSVRGCIAYAFVV 429 Query: 363 FSTMAKKDLVTWNT 404 FS M KDLV+WNT Sbjct: 430 FSAMVAKDLVSWNT 443 Score = 55.5 bits (132), Expect = 3e-06 Identities = 39/134 (29%), Positives = 63/134 (47%), Gaps = 6/134 (4%) Frame = +3 Query: 18 LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197 L+ Y R + + F ++ +V++W ++ G+ G ++L FREM V+ Sbjct: 212 LMDAYCRMGKCCEAWRAFGEIEVPNVISWTILLSGYASAGRHDVSLGIFREMMNVGMVSP 271 Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMV--PVYNALVHMYSICGCIAYACS 359 D +S +L C L G +IH Y K + + AL+ +Y+ CG + A Sbjct: 272 DVDALSGVLVSCRSLGALASGMEIHGYGLKIMYGDVFYRSAGAALLTLYAGCGRLDRADI 331 Query: 360 VFSTMAKKDLVTWN 401 VF M K D+VTWN Sbjct: 332 VFRRMDKSDVVTWN 345 >XP_016206998.1 PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Arachis ipaensis] Length = 609 Score = 181 bits (458), Expect = 2e-51 Identities = 89/134 (66%), Positives = 105/134 (78%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAGAALLTLYA C R D ENVF MDKSDVVTWNAMI G IDMGL + A++CF+EMQ Sbjct: 327 SAGAALLTLYANCGRLNDAENVFDRMDKSDVVTWNAMIYGLIDMGLANEAVQCFKEMQAS 386 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 V +D TT+ST+L CDLR GK++HAYV K ++ ++PV NAL+H YS CGCIAYA SV Sbjct: 387 
N-VKVDQTTVSTLLLACDLRRGKEMHAYVLKHRYNWVIPVCNALIHTYSKCGCIAYAYSV 445 Query: 363 FSTMAKKDLVTWNT 404 FSTMA +DLV+WNT Sbjct: 446 FSTMAVRDLVSWNT 459 Score = 58.5 bits (140), Expect = 2e-07 Identities = 42/134 (31%), Positives = 64/134 (47%), Gaps = 6/134 (4%) Frame = +3 Query: 18 LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197 ++ Y + + VF + +V++W +I G+ +G LAL FR+M V Sbjct: 228 MMDAYCKMGLCSEALRVFHQIKDPNVISWTTLISGYAGVGRHDLALGTFRDMVNFGMVLP 287 Query: 198 DHTTISTILPVC----DLRCGKQIHAY-VKKSDFDCMVPVYN-ALVHMYSICGCIAYACS 359 D ++S IL C L G ++H Y VK D AL+ +Y+ CG + A + Sbjct: 288 DVDSLSGILVSCRFLGSLTSGNEVHCYGVKVISGDAFYKSAGAALLTLYANCGRLNDAEN 347 Query: 360 VFSTMAKKDLVTWN 401 VF M K D+VTWN Sbjct: 348 VFDRMDKSDVVTWN 361 >XP_009366958.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39350 isoform X1 [Pyrus x bretschneideri] Length = 690 Score = 181 bits (460), Expect = 3e-51 Identities = 87/134 (64%), Positives = 108/134 (80%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAG ALL LYA C R +D NVFR M+ +DVV+WNAMILGFID+GL LALECFR+MQ+ Sbjct: 382 SAGPALLILYANCSRIQDAINVFRLMNPADVVSWNAMILGFIDLGLEDLALECFRKMQRA 441 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 + V +D TTIST+LP C+L+ GKQIHA+V+K+ FD + PV+NAL+HMY+ICGCI A SV Sbjct: 442 Q-VKVDQTTISTVLPTCNLKFGKQIHAFVRKNSFDLVAPVWNALIHMYAICGCIESAYSV 500 Query: 363 FSTMAKKDLVTWNT 404 FS M +DLVTWN+ Sbjct: 501 FSNMVHRDLVTWNS 514 Score = 59.3 bits (142), Expect = 1e-07 Identities = 39/134 (29%), Positives = 64/134 (47%), Gaps = 6/134 (4%) Frame = +3 Query: 18 LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197 ++ Y R + +F + + ++++W +I GF +G +L+ FR+M G V Sbjct: 283 VMDAYCRLGHCDKAKRIFEQIKEPNIISWTTLISGFSRIGNHESSLKIFRDMMDGSRVYP 342 Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKK--SDFDCMVPVYNALVHMYSICGCIAYACS 359 D ++S +L C L GK+IH Y K S AL+ +Y+ C I A + Sbjct: 343 DLDSLSAVLVSCRHLGSLLNGKEIHGYGIKIGSGIAFYSSAGPALLILYANCSRIQDAIN 402 Query: 360 VFSTMAKKDLVTWN 401 VF M 
D+V+WN Sbjct: 403 VFRLMNPADVVSWN 416 >XP_015954901.1 PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Arachis duranensis] Length = 641 Score = 181 bits (458), Expect = 3e-51 Identities = 89/134 (66%), Positives = 105/134 (78%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAGAALLTLYA C R D ENVF MDKSDVVTWNAMI G IDMGL + A++CF+EMQ Sbjct: 327 SAGAALLTLYANCGRLNDAENVFDRMDKSDVVTWNAMIYGLIDMGLANEAVQCFKEMQAS 386 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 V +D TT+ST+L CDLR GK++HAYV K ++ ++PV NAL+H YS CGCIAYA SV Sbjct: 387 N-VKVDQTTVSTLLLACDLRRGKEMHAYVLKHRYNWVIPVCNALIHTYSKCGCIAYAYSV 445 Query: 363 FSTMAKKDLVTWNT 404 FSTMA +DLV+WNT Sbjct: 446 FSTMAVRDLVSWNT 459 Score = 58.2 bits (139), Expect = 3e-07 Identities = 42/134 (31%), Positives = 64/134 (47%), Gaps = 6/134 (4%) Frame = +3 Query: 18 LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197 ++ Y + R + VF + +V++W +I G+ + LAL FR+M V Sbjct: 228 MMDAYCKMGRCSEALRVFHQIKDPNVISWTTLISGYAGVRRHDLALGTFRDMVNFGMVLP 287 Query: 198 DHTTISTILPVC----DLRCGKQIHAY-VKKSDFDCMVPVYN-ALVHMYSICGCIAYACS 359 D ++S IL C L G ++H Y VK D AL+ +Y+ CG + A + Sbjct: 288 DVDSLSGILVSCRFLGSLTSGNEVHCYGVKVISGDAFYKSAGAALLTLYANCGRLNDAEN 347 Query: 360 VFSTMAKKDLVTWN 401 VF M K D+VTWN Sbjct: 348 VFDRMDKSDVVTWN 361 >XP_002277337.2 PREDICTED: pentatricopeptide repeat-containing protein At5g39350-like [Vitis vinifera] Length = 634 Score = 180 bits (457), Expect = 4e-51 Identities = 82/134 (61%), Positives = 106/134 (79%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAGAALLT+Y +C+R +D NVF MD+ DVVTWNAMILGF+D+ +GHLALECF +MQ+ Sbjct: 330 SAGAALLTMYVKCKRIQDALNVFELMDRFDVVTWNAMILGFVDLEMGHLALECFSKMQRS 389 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 G+ + TIST+LP CDL+ GKQ+HAY+ K+ F ++PV+NAL+HMYS CGCI A S+ Sbjct: 390 -GIMNNQITISTVLPACDLKSGKQVHAYITKNSFSSVIPVWNALIHMYSKCGCIGTAYSI 448 Query: 363 
FSTMAKKDLVTWNT 404 FS M +DLV+WNT Sbjct: 449 FSNMISRDLVSWNT 462 >KRH04632.1 hypothetical protein GLYMA_17G175800 [Glycine max] Length = 456 Score = 176 bits (447), Expect = 6e-51 Identities = 85/115 (73%), Positives = 98/115 (85%) Frame = +3 Query: 60 ENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNIDHTTISTILPVCDL 239 +NVF MDKSDVVTWNAMI G +D+GL LAL+CFREMQ GRGV ID TIS+ILPVCDL Sbjct: 167 DNVFWRMDKSDVVTWNAMIFGLVDVGLVDLALDCFREMQ-GRGVGIDGRTISSILPVCDL 225 Query: 240 RCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSVFSTMAKKDLVTWNT 404 RCGK+IHAYV+K +F ++PVYNAL+HMYSI GCIAYA SVFSTM +DLV+WNT Sbjct: 226 RCGKEIHAYVRKCNFSGVIPVYNALIHMYSIRGCIAYAYSVFSTMVARDLVSWNT 280 >GAV80726.1 PPR domain-containing protein/PPR_1 domain-containing protein/PPR_2 domain-containing protein [Cephalotus follicularis] Length = 581 Score = 179 bits (453), Expect = 7e-51 Identities = 86/134 (64%), Positives = 105/134 (78%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 S+G ALLT+YA C R + +NVF +DKSDVVTWNAMILGF+D+GLGHLALECF +MQ+ Sbjct: 270 SSGPALLTMYANCGRIWEAKNVFELLDKSDVVTWNAMILGFVDLGLGHLALECFSDMQR- 328 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 RG D TTIST+LPVCDL GKQ+HA + +S D V V+NAL+HMYS CGCI A SV Sbjct: 329 RGFKNDDTTISTVLPVCDLTSGKQVHALIWRSHLDSAVSVWNALIHMYSKCGCIGSAYSV 388 Query: 363 FSTMAKKDLVTWNT 404 F++M +DLV+WNT Sbjct: 389 FTSMVTRDLVSWNT 402 >ONH96536.1 hypothetical protein PRUPE_7G135300 [Prunus persica] Length = 646 Score = 177 bits (450), Expect = 5e-50 Identities = 85/134 (63%), Positives = 106/134 (79%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAG ALLT+YA CRR D NVF+ M+ + VV+WNAMILGFID+GL LAL+ FR MQ+ Sbjct: 330 SAGPALLTMYANCRRIHDATNVFKLMNPAHVVSWNAMILGFIDLGLEDLALDSFRRMQRA 389 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 R +N+D TTISTILP C+L+ GKQIHA+++K FD +VPV+NAL+HMYS CGCI A SV Sbjct: 390 R-INVDQTTISTILPACNLKFGKQIHAFIRKISFDLVVPVWNALIHMYSKCGCIGSAYSV 448 Query: 363 
FSTMAKKDLVTWNT 404 FS M +DLV+WN+ Sbjct: 449 FSNMINRDLVSWNS 462 Score = 57.0 bits (136), Expect = 7e-07 Identities = 38/136 (27%), Positives = 66/136 (48%), Gaps = 8/136 (5%) Frame = +3 Query: 18 LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197 ++ Y R + +F + + ++++W +I G+ +G +L FR+M V+ Sbjct: 231 VMDAYCRMGHCNEATRIFEQIKEPNIISWTTLISGYSRIGSHEASLRIFRDMIGSSMVDP 290 Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMVPVYN----ALVHMYSICGCIAYA 353 D ++ST+L C L GK+IH Y K + + Y+ AL+ MY+ C I A Sbjct: 291 DLDSLSTVLVSCRHLGSLLNGKEIHGYGIKRESG--IAFYHSAGPALLTMYANCRRIHDA 348 Query: 354 CSVFSTMAKKDLVTWN 401 +VF M +V+WN Sbjct: 349 TNVFKLMNPAHVVSWN 364 >XP_008344308.1 PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like isoform X1 [Malus domestica] Length = 690 Score = 177 bits (450), Expect = 8e-50 Identities = 87/134 (64%), Positives = 105/134 (78%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAG ALL LYA C R +D NVFR M+ +DVV+WNAMILGFID+GL LALECFR+MQ+ Sbjct: 382 SAGPALLILYANCSRIQDAINVFRLMNPADVVSWNAMILGFIDLGLXDLALECFRKMQRA 441 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 + V D TTIST LP C+L+ GKQIHA+V+KS FD + PV+NAL+HMY+ CGCI A SV Sbjct: 442 Q-VKADQTTISTXLPTCNLKFGKQIHAFVRKSSFDLVAPVWNALIHMYAKCGCIESAYSV 500 Query: 363 FSTMAKKDLVTWNT 404 FS M +DLVTWN+ Sbjct: 501 FSNMVNRDLVTWNS 514 Score = 59.7 bits (143), Expect = 9e-08 Identities = 38/136 (27%), Positives = 66/136 (48%), Gaps = 8/136 (5%) Frame = +3 Query: 18 LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197 ++ Y R + + +F + ++++W +I GF +G +L+ FR+M G V Sbjct: 283 VMDAYCRLGHCDEAKRIFEQIKDPNIISWTTLISGFSRIGNHESSLKIFRDMMDGSRVYP 342 Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMVPVYN----ALVHMYSICGCIAYA 353 D ++S ++ C L GK+IH Y K + Y+ AL+ +Y+ C I A Sbjct: 343 DLDSLSAVJVSCRHLGSLLNGKEIHGYGIK--IGSXIAFYSSAGPALLILYANCSRIQDA 400 Query: 354 CSVFSTMAKKDLVTWN 401 +VF M D+V+WN Sbjct: 401 INVFRLMNPADVVSWN 416 >XP_008340659.1 PREDICTED: 
pentatricopeptide repeat-containing protein At5g39350-like [Malus domestica] Length = 817 Score = 179 bits (453), Expect = 8e-50 Identities = 87/134 (64%), Positives = 106/134 (79%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAG ALL LYA C R +D NVFR M+ +DVV+WNAMILGFID+GL LALECFR+MQ+ Sbjct: 382 SAGPALLILYANCSRIQDAINVFRLMNPADVVSWNAMILGFIDLGLEDLALECFRKMQRA 441 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 + V D TTIST+LP C+L+ GKQIHA+V+KS FD + PV+NAL+HMY+ CGCI A SV Sbjct: 442 Q-VKADQTTISTVLPTCNLKFGKQIHAFVRKSSFDLVAPVWNALIHMYAKCGCIESAYSV 500 Query: 363 FSTMAKKDLVTWNT 404 FS M +DLVTWN+ Sbjct: 501 FSNMVNRDLVTWNS 514 Score = 59.7 bits (143), Expect = 9e-08 Identities = 38/136 (27%), Positives = 66/136 (48%), Gaps = 8/136 (5%) Frame = +3 Query: 18 LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197 ++ Y R + + +F + ++++W +I GF +G +L+ FR+M G V Sbjct: 283 VMDAYCRLGHCDEAKRIFEQIKDPNIISWTTLISGFSRIGNHESSLKIFRDMMDGSRVYP 342 Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMVPVYN----ALVHMYSICGCIAYA 353 D ++S ++ C L GK+IH Y K + Y+ AL+ +Y+ C I A Sbjct: 343 DLDSLSAVJVSCRHLGSLLNGKEIHGYGIK--IGSXIAFYSSAGPALLILYANCSRIQDA 400 Query: 354 CSVFSTMAKKDLVTWN 401 +VF M D+V+WN Sbjct: 401 INVFRLMNPADVVSWN 416 >XP_007047218.1 PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic [Theobroma cacao] EOX91375.1 Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] Length = 635 Score = 176 bits (447), Expect = 1e-49 Identities = 83/134 (61%), Positives = 104/134 (77%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAG ALLTL+++C R RD N+F MDKSD VTWNAMILGF+D GLGH+A++CF EMQ+ Sbjct: 327 SAGPALLTLHSKCGRSRDAGNIFELMDKSDTVTWNAMILGFVDRGLGHMAVDCFGEMQR- 385 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 G+ D TTI T+LPVC+LR GKQ+HAY+++ D + P++NALVHMYS CG I A SV Sbjct: 386 MGIKNDQTTICTVLPVCELRQGKQLHAYIRRQYSDSICPIWNALVHMYSKCGSIGSAYSV 445 
Query: 363 FSTMAKKDLVTWNT 404 FS M +DLV+WNT Sbjct: 446 FSNMVARDLVSWNT 459 >XP_015890127.1 PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Ziziphus jujuba] Length = 635 Score = 173 bits (438), Expect = 2e-48 Identities = 82/134 (61%), Positives = 106/134 (79%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAGA LLT+YA+ R +D +NVF+ MD++DVVTWNAMILGF D+GL H ALECF +MQ+ Sbjct: 327 SAGATLLTMYAKYGRLQDAKNVFKLMDQADVVTWNAMILGFADVGLEHSALECFSKMQRA 386 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 G+ D TTIST+LPVCDL+ GKQIHA+++K FD + PV+NAL++MYS CGCI A V Sbjct: 387 -GIKNDRTTISTVLPVCDLKSGKQIHAFIRKGCFDLVTPVWNALIYMYSKCGCIRSASLV 445 Query: 363 FSTMAKKDLVTWNT 404 FS M +D+V+WN+ Sbjct: 446 FSNMLTRDVVSWNS 459 Score = 60.8 bits (146), Expect = 3e-08 Identities = 40/132 (30%), Positives = 62/132 (46%), Gaps = 4/132 (3%) Frame = +3 Query: 18 LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197 L+ +YA C +F + + +V W A+I + G+ + + EM GV+ Sbjct: 61 LVQMYADCDHLLSARILFDQLSQPNVFAWTAIIGFYSRHGMYQKCVRTYAEMSL-MGVSP 119 Query: 198 DHTTISTILPVCD----LRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSVF 365 D +L VC L+ G QIH V S F+ V N+L+ MYS C + A VF Sbjct: 120 DEYVFPKVLKVCAQSSCLKAGMQIHKDVITSGFEFSSEVCNSLIEMYSKCMDVQNAKRVF 179 Query: 366 STMAKKDLVTWN 401 + +DL++WN Sbjct: 180 DVIVGRDLLSWN 191 Score = 60.1 bits (144), Expect = 6e-08 Identities = 42/136 (30%), Positives = 65/136 (47%), Gaps = 6/136 (4%) Frame = +3 Query: 15 ALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVN 194 AL+ +Y++C R VF M DVV+WN+M+ GF GLG ALE +EM+Q + Sbjct: 428 ALIYMYSKCGCIRSASLVFSNMLTRDVVSWNSMMGGFRMHGLGQAALELLKEMRQS-ALE 486 Query: 195 IDHTTISTILPVCDLR--CGKQIHAYVKKSDFDCMVPV---YNALVHMYSICGCIAYACS 359 D T +++L C + + + K + + C+ P Y +V M + G + A S Sbjct: 487 PDSMTFTSVLSACSHSGLVNEGLEVFHKMTKYYCLTPSMEHYACIVDMLARAGRLQDAVS 546 Query: 360 VFSTM-AKKDLVTWNT 404 M + D W T Sbjct: 547 FIQNMPLEPDKSIWGT 562 Score = 57.8 bits (138), Expect = 4e-07 
Identities = 37/136 (27%), Positives = 69/136 (50%), Gaps = 8/136 (5%) Frame = +3 Query: 18 LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197 ++ Y + R + N+F + + ++++W +I G+ +G ++L FR+M ++ Sbjct: 228 VMDAYCQMRLCDEAWNIFERIKEPNIISWTTLIKGYSRIGNHEVSLRIFRDMISSGMISP 287 Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMVPVYNA----LVHMYSICGCIAYA 353 D +S +L C L G++IH+Y K + YN+ L+ MY+ G + A Sbjct: 288 DLDCLSGVLVSCRHLGSLSGGREIHSYGIK--MKSCIAFYNSAGATLLTMYAKYGRLQDA 345 Query: 354 CSVFSTMAKKDLVTWN 401 +VF M + D+VTWN Sbjct: 346 KNVFKLMDQADVVTWN 361 >KDP31910.1 hypothetical protein JCGZ_12371 [Jatropha curcas] Length = 377 Score = 162 bits (410), Expect = 4e-46 Identities = 84/135 (62%), Positives = 101/135 (74%), Gaps = 1/135 (0%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 SAG ALLT+YA+C + VF MDKSDVVTWNAMILGF+++ L LALECF MQ+ Sbjct: 57 SAGPALLTMYAKCGIIQYARFVFELMDKSDVVTWNAMILGFVELQLVQLALECFSGMQRS 116 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYV-KKSDFDCMVPVYNALVHMYSICGCIAYACS 359 GV D TTISTILPVC L+CGKQIHAY+ + S + +VPV++A++HMY GCI A S Sbjct: 117 -GVKNDQTTISTILPVCGLKCGKQIHAYILRSSSLNSVVPVWSAMIHMYCKSGCIRSAYS 175 Query: 360 VFSTMAKKDLVTWNT 404 VFS MA KD+VTWNT Sbjct: 176 VFSNMAVKDIVTWNT 190 >XP_010263222.1 PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Nelumbo nucifera] Length = 654 Score = 167 bits (422), Expect = 5e-46 Identities = 81/134 (60%), Positives = 103/134 (76%) Frame = +3 Query: 3 SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182 S+G ALLT+YA R RD NVF+ MDKSDVVTWNAMILG + +GLG LA++ REMQ Sbjct: 330 SSGPALLTVYATSGRLRDARNVFQLMDKSDVVTWNAMILGLVHLGLGDLAIKYVREMQS- 388 Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362 RG+ D TT+ST+LPVCDLR GKQIHAY++++ D + V+NAL++MYS CGCI A +V Sbjct: 389 RGLQYDETTVSTVLPVCDLRFGKQIHAYIRRNALDSAISVWNALINMYSKCGCIRSAYTV 448 Query: 363 FSTMAKKDLVTWNT 404 FS M +D+V+WNT Sbjct: 449 FSKMDSRDVVSWNT 462 Score = 58.5 bits (140), Expect = 2e-07 
Identities = 37/133 (27%), Positives = 66/133 (49%), Gaps = 4/133 (3%) Frame = +3 Query: 18 LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197 L+ +YA C +F + + +V W ++I + G+ + + EM+ +G+ Sbjct: 64 LVQMYAACNDLISARILFDELPRPNVFAWTSIISFYSRNGMFKECVRTYNEMKL-QGIGP 122 Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSVF 365 D +L C L G +IH + + + + V N+L+ MYS CG + A +F Sbjct: 123 DGYVFPKVLRACTQSLSLAEGIRIHKDIIELGAEHNLQVCNSLIDMYSKCGDVQTAQRIF 182 Query: 366 STMAKKDLVTWNT 404 + MA+KDL+TWN+ Sbjct: 183 NGMAEKDLLTWNS 195
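
Each alignment block in the report above begins with a statistics line of the form "Score = ... bits (...), Expect = ... Identities = a/n (p%), Positives = b/n (q%)[, Gaps = c/n (r%)] Frame = +f". The following is a minimal Python sketch of how such a line could be read back into numbers, for example to filter HSPs by E-value or to recompute percent identity. The function name parse_hsp and the returned dictionary keys are illustrative assumptions and not part of BLAST itself; the regular expression only targets the legacy text layout shown in this report and may need adjusting for other BLAST versions.

```python
import re

# Pattern for the per-HSP statistics line of a legacy BLASTX text report.
# The "Gaps" field is optional: it is only printed for gapped alignments.
HSP_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?"
    r"\s+Frame\s*=\s*(?P<frame>[+-][1-3])"
)


def parse_hsp(text):
    """Parse one HSP statistics line (hypothetical helper, not a BLAST API)."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP statistics found in text")
    evalue = m.group("evalue")
    if evalue.startswith(("e", "E")):
        # Legacy BLAST may print very small values as "e-100"; make it parseable.
        evalue = "1" + evalue
    length = int(m.group("length"))
    ident = int(m.group("ident"))
    pos = int(m.group("pos"))
    gaps = int(m.group("gaps")) if m.group("gaps") else 0
    return {
        "bits": float(m.group("bits")),
        "raw_score": int(m.group("raw")),
        "evalue": float(evalue),
        "identities": ident,
        "positives": pos,
        "gaps": gaps,
        "align_length": length,
        "frame": int(m.group("frame")),
        # Recomputed percentages, rounded to one decimal place.
        "pct_identity": round(100.0 * ident / length, 1),
        "pct_positive": round(100.0 * pos / length, 1),
    }


if __name__ == "__main__":
    line = ("Score = 225 bits (574), Expect = 5e-68 Identities = 109/134 (81%), "
            "Positives = 117/134 (87%) Frame = +3")
    print(parse_hsp(line))
```

The example string in the `__main__` block is the first HSP of the top hit (XP_012573667.1) copied from the report. The recomputed percentages are rounded to one decimal place, so they can differ slightly from the whole-number percentages printed by BLAST.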