BLASTX nr result
ID: Akebia24_contig00025915
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00025915 (458 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_002270963.2| PREDICTED: pentatricopeptide repeat-containi...  196   3e-48
emb|CBI29222.3| unnamed protein product [Vitis vinifera]             196   3e-48
ref|XP_007217153.1| hypothetical protein PRUPE_ppa001463mg [Prun...  195   5e-48
ref|XP_006351033.1| PREDICTED: pentatricopeptide repeat-containi...  194   1e-47
ref|XP_004249905.1| PREDICTED: pentatricopeptide repeat-containi...  191   9e-47
ref|XP_006432869.1| hypothetical protein CICLE_v10000274mg [Citr...  191   1e-46
ref|XP_002303480.2| pentatricopeptide repeat-containing family p...  173   3e-41
ref|XP_004149000.1| PREDICTED: pentatricopeptide repeat-containi...  168   8e-40
ref|XP_002519901.1| pentatricopeptide repeat-containing protein,...  162   5e-38
ref|XP_003548529.2| PREDICTED: pentatricopeptide repeat-containi...  161   8e-38
ref|XP_004166658.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...  161   8e-38
gb|AHB18408.1| pentatricopeptide repeat-containing protein [Goss...  161   1e-37
ref|XP_007040906.1| Tetratricopeptide repeat-like superfamily pr...  157   1e-36
sp|Q940A6.2|PP325_ARATH RecName: Full=Pentatricopeptide repeat-c...  156   2e-36
emb|CAA18631.1| putative protein [Arabidopsis thaliana] gi|72687...  156   2e-36
ref|NP_567587.1| pentatricopeptide repeat-containing protein [Ar...  156   2e-36
ref|XP_004509525.1| PREDICTED: pentatricopeptide repeat-containi...  156   3e-36
gb|EYU29134.1| hypothetical protein MIMGU_mgv1a001281mg [Mimulus...  155   6e-36
ref|XP_007156329.1| hypothetical protein PHAVU_003G277400g [Phas...  155   6e-36
ref|XP_006413978.1| hypothetical protein EUTSA_v10024401mg [Eutr...  152   4e-35

>ref|XP_002270963.2| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Vitis vinifera] Length = 1022 Score = 196 bits (498), Expect = 3e-48 Identities = 97/153 (63%), Positives = 120/153 (78%), Gaps = 1/153 (0%) Frame = -3 Query: 456 LIDGKLPALFEKSKDRHIEISLAIAD-SSVDESELTMASFDLLVHVYCTQFKNLGLGFAF 280 LID KLP LF K+RHIEI+ A+AD + V ES + +A+ DLL+HVYCTQF+N+G A Sbjct: 205 LIDRKLPVLFGDPKNRHIEIASAMADLNEVGESGVAVAAVDLLIHVYCTQFRNVGFRNAI 264 Query: 279 DVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFC 100 VFR L N+G+FP++KTC FLLSSLVKANEL++SY VF + +G SPDVY FSTAINAFC Sbjct: 265 GVFRFLANKGVFPTVKTCTFLLSSLVKANELEKSYWVFETMRQGVSPDVYLFSTAINAFC 324 Query: 99 KGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 KGG++E A QLF+ ME G+SP V+TYN LIHG Sbjct: 325 KGGKVEDAIQLFFDMEKLGVSPNVVTYNNLIHG 357
>emb|CBI29222.3| unnamed protein product [Vitis vinifera] Length = 826 Score = 196 bits (498), Expect = 3e-48 Identities = 97/153 (63%), Positives = 120/153 (78%), Gaps = 1/153 (0%) Frame = -3 Query: 456 LIDGKLPALFEKSKDRHIEISLAIAD-SSVDESELTMASFDLLVHVYCTQFKNLGLGFAF 280 LID KLP LF K+RHIEI+ A+AD + V ES + +A+ DLL+HVYCTQF+N+G A Sbjct: 138 LIDRKLPVLFGDPKNRHIEIASAMADLNEVGESGVAVAAVDLLIHVYCTQFRNVGFRNAI 197 Query: 279 DVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFC 100 VFR L N+G+FP++KTC FLLSSLVKANEL++SY VF + +G SPDVY FSTAINAFC Sbjct: 198 GVFRFLANKGVFPTVKTCTFLLSSLVKANELEKSYWVFETMRQGVSPDVYLFSTAINAFC 257 Query: 99 KGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 KGG++E A QLF+ ME G+SP V+TYN LIHG Sbjct: 258 KGGKVEDAIQLFFDMEKLGVSPNVVTYNNLIHG 290
>ref|XP_007217153.1| hypothetical protein PRUPE_ppa001463mg [Prunus persica] gi|462413303|gb|EMJ18352.1| hypothetical protein PRUPE_ppa001463mg [Prunus persica] Length = 821 Score = 195 bits (496), Expect = 5e-48 Identities = 91/153 (59%), Positives = 118/153 (77%), Gaps = 1/153 (0%) Frame = -3 Query: 456 LIDGKLPALFEKSKDRHIEISLAIAD-SSVDESELTMASFDLLVHVYCTQFKNLGLGFAF 
280 LIDG +P L+ RH+EI++A+ D ++V L + + DLL+HVYCTQFKN+G G+A Sbjct: 142 LIDGNVPVLYANHNQRHMEIAIAMLDLNTVSTQGLGVQALDLLIHVYCTQFKNMGFGYAI 201 Query: 279 DVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFC 100 D F + + +G+FPSLKTCNFLLSSLVKANEL +SY+VF ++CRG SPDVY F+TAINAFC Sbjct: 202 DAFVIFSKKGVFPSLKTCNFLLSSLVKANELHKSYDVFEVMCRGVSPDVYLFTTAINAFC 261 Query: 99 KGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 KGG+++ A LF KME GI P V+TYN +IHG Sbjct: 262 KGGKVDDAIGLFSKMEGLGIVPNVVTYNNIIHG 294 >ref|XP_006351033.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Solanum tuberosum] Length = 928 Score = 194 bits (493), Expect = 1e-47 Identities = 94/153 (61%), Positives = 121/153 (79%), Gaps = 1/153 (0%) Frame = -3 Query: 456 LIDGKLPALFEKSKDRHIEISLAIAD-SSVDESELTMASFDLLVHVYCTQFKNLGLGFAF 280 LIDGKLPALF+ S+ +H+E+++++A+ S V + + + +FDLL+H+ CTQFKN+G A Sbjct: 240 LIDGKLPALFDTSQQKHVEVAVSLAELSGVSDFGVAVRTFDLLLHLCCTQFKNVGFDAAL 299 Query: 279 DVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFC 100 DVFR L +RG++PSLKTCNFLLSSLVK NEL +SYEVF I+ G PDVY FSTAINAFC Sbjct: 300 DVFRSLASRGVYPSLKTCNFLLSSLVKENELWKSYEVFGILKDGVEPDVYLFSTAINAFC 359 Query: 99 KGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 KGG+++ A +LF KME GI P V+TYN LIHG Sbjct: 360 KGGKVDEAKELFRKMENIGIVPNVVTYNNLIHG 392 Score = 60.1 bits (144), Expect = 3e-07 Identities = 30/87 (34%), Positives = 55/87 (63%), Gaps = 1/87 (1%) Frame = -3 Query: 258 NRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICR-GFSPDVYSFSTAINAFCKGGRIE 82 ++GL + T L++ L KA++L++ ++F + R G +P++ ++T I AFC+ G ++ Sbjct: 691 SKGLVCDIYTYGALINGLCKADQLEKGRDLFHEMLRQGLAPNLIIYNTLIGAFCRNGNVK 750 Query: 81 VAAQLFYKMEVFGISPTVITYNTLIHG 1 A +L + GI P V+TY++LIHG Sbjct: 751 EALKLRDDIRSRGILPNVVTYSSLIHG 777 Score = 58.5 bits (140), Expect = 9e-07 Identities = 33/119 (27%), Positives = 62/119 (52%), Gaps = 1/119 (0%) Frame = -3 Query: 360 ELTMASFDLLVHVYCTQFKNLGLGFAFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDR 181 ++ +++ L+ +C K L AF + + +G+ P + T N LL L + + D Sbjct: 625 
QIDSMTYNTLICAFC---KEGNLDGAFMLREEMVKQGIAPDVSTYNVLLHGLGEKGKTDE 681 Query: 180 SYEVF-AIICRGFSPDVYSFSTAINAFCKGGRIEVAAQLFYKMEVFGISPTVITYNTLI 7 + ++ + +G D+Y++ IN CK ++E LF++M G++P +I YNTLI Sbjct: 682 ALLLWDECLSKGLVCDIYTYGALINGLCKADQLEKGRDLFHEMLRQGLAPNLIIYNTLI 740 >ref|XP_004249905.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Solanum lycopersicum] Length = 839 Score = 191 bits (485), Expect = 9e-47 Identities = 93/153 (60%), Positives = 120/153 (78%), Gaps = 1/153 (0%) Frame = -3 Query: 456 LIDGKLPALFEKSKDRHIEISLAIAD-SSVDESELTMASFDLLVHVYCTQFKNLGLGFAF 280 LIDGKLPALF+ + +H+E+++++A+ S V + + + +FDLL+H+ CTQFK++G A Sbjct: 151 LIDGKLPALFDSLQQKHVEVAVSLAELSGVSDFGVAVRTFDLLLHLCCTQFKSVGFDAAL 210 Query: 279 DVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFC 100 DVFR L +RG++PSLKTCNFLLSSLVK NEL +SYEVF I+ G PDVY FSTAINAFC Sbjct: 211 DVFRSLASRGVYPSLKTCNFLLSSLVKENELWKSYEVFEILKDGVKPDVYLFSTAINAFC 270 Query: 99 KGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 KGG++E A +LF KME GI P V+TYN LIHG Sbjct: 271 KGGKVEEAQELFRKMENMGILPNVVTYNNLIHG 303 Score = 60.1 bits (144), Expect = 3e-07 Identities = 30/87 (34%), Positives = 55/87 (63%), Gaps = 1/87 (1%) Frame = -3 Query: 258 NRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICR-GFSPDVYSFSTAINAFCKGGRIE 82 ++GL + T L++ L KA++L++ ++F + R G +P++ ++T I AFC+ G ++ Sbjct: 602 SKGLVCDIYTYGALINGLCKADQLEKGRDLFHEMLRQGLAPNLIVYNTLIGAFCRNGNVK 661 Query: 81 VAAQLFYKMEVFGISPTVITYNTLIHG 1 A +L + GI P V+TY++LIHG Sbjct: 662 EALKLRDDIRSRGILPNVVTYSSLIHG 688 >ref|XP_006432869.1| hypothetical protein CICLE_v10000274mg [Citrus clementina] gi|568835123|ref|XP_006471629.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X1 [Citrus sinensis] gi|568835125|ref|XP_006471630.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X2 [Citrus sinensis] gi|557534991|gb|ESR46109.1| hypothetical protein CICLE_v10000274mg [Citrus clementina] Length = 833 Score = 191 bits (484), 
Expect = 1e-46 Identities = 96/154 (62%), Positives = 116/154 (75%), Gaps = 2/154 (1%) Frame = -3 Query: 456 LIDGKLPALFEKSKD-RHIEISLAIADSSV-DESELTMASFDLLVHVYCTQFKNLGLGFA 283 LIDGK+P L+ + RHIEI+ + D +V E L + DLLVHVYCTQFKNLG G+A Sbjct: 143 LIDGKMPVLYASNPSIRHIEIASQMVDLNVTSEPALGVQIADLLVHVYCTQFKNLGFGYA 202 Query: 282 FDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAF 103 DVF + +N+G+FPSLKTCNFLL+SLVKANE+ + EVF +CRG SPDV+ FSTAINAF Sbjct: 203 IDVFSIFSNKGIFPSLKTCNFLLNSLVKANEVQKGIEVFETMCRGVSPDVFLFSTAINAF 262 Query: 102 CKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 CK GRIE A LF KME GI+P V+TYN +IHG Sbjct: 263 CKRGRIEDAIGLFTKMEELGIAPNVVTYNNIIHG 296 Score = 60.5 bits (145), Expect = 2e-07 Identities = 36/118 (30%), Positives = 66/118 (55%), Gaps = 1/118 (0%) Frame = -3 Query: 351 MASFDLLVHVYCTQFKNLGLGFAFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELD-RSY 175 + +++ ++H C +N L AF + + R + PSL T + L++ L+K + D ++ Sbjct: 287 VVTYNNIIHGLC---RNGRLYEAFHLKEKMVLREVEPSLITYSILINGLIKLEKFDDANF 343 Query: 174 EVFAIICRGFSPDVYSFSTAINAFCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 + + RGF P+ ++T I+ +CK G I A ++ M G+SP +T+N+LIHG Sbjct: 344 VLKEMSVRGFVPNYVVYNTLIDGYCKKGNISEALKIRDDMVSKGMSPNSVTFNSLIHG 401 >ref|XP_002303480.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550342907|gb|EEE78459.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 842 Score = 173 bits (438), Expect = 3e-41 Identities = 93/155 (60%), Positives = 113/155 (72%), Gaps = 3/155 (1%) Frame = -3 Query: 456 LIDGKLPALFEKS-KDRHIEISLAIADSS-VDESELTMASFDLLVHVYCTQFKNLGLGFA 283 LIDGK+PA + ++ + RH EI+ +AD + V E + + DLLVHVY TQFK+LG GFA Sbjct: 152 LIDGKVPAFYARNFESRHFEIAQIMADFNLVFEPVIGVKIADLLVHVYSTQFKHLGFGFA 211 Query: 282 FDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIIC-RGFSPDVYSFSTAINA 106 DVF LL +GLFPSLKTC FLLSSLVKANEL +SYEV+ IC G PDV+ FST INA Sbjct: 212 ADVFSLLAKKGLFPSLKTCTFLLSSLVKANELKKSYEVYDFICLGGIIPDVHLFSTMINA 271 Query: 105 FCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 FCKG R + A LF KME G++P V+TYN +IHG Sbjct: 
272 FCKGHREDDAIGLFSKMEKLGVAPNVVTYNNIIHG 306 >ref|XP_004149000.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucumis sativus] Length = 822 Score = 168 bits (425), Expect = 8e-40 Identities = 90/154 (58%), Positives = 108/154 (70%), Gaps = 2/154 (1%) Frame = -3 Query: 456 LIDGKLPALFEKSKDRHIEISLAI--ADSSVDESELTMASFDLLVHVYCTQFKNLGLGFA 283 LIDG LP L S+ HIEI+ A+ S V E T A FDLL+HVY TQF+NLG A Sbjct: 135 LIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQA-FDLLIHVYSTQFRNLGFSCA 193 Query: 282 FDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAF 103 DVF LL +G FPSLKTCNFLLSSLVKANE ++ EVF ++ G PDV+SF+ INA Sbjct: 194 VDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGACPDVFSFTNVINAL 253 Query: 102 CKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 CKGG++E A +LF KME GISP V+TYN +I+G Sbjct: 254 CKGGKMENAIELFMKMEKLGISPNVVTYNCIING 287 Score = 59.3 bits (142), Expect = 5e-07 Identities = 37/118 (31%), Positives = 67/118 (56%), Gaps = 1/118 (0%) Frame = -3 Query: 351 MASFDLLVHVYCTQFKNLGLGFAFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYE 172 + +++ +++ C +N L AF++ +T +G+ P+LKT L++ L+K N D+ Sbjct: 278 VVTYNCIINGLC---QNGRLDNAFELKEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNH 334 Query: 171 VF-AIICRGFSPDVYSFSTAINAFCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 V +I GF+P+V F+ I+ +CK G IE A ++ M I+PT +T +L+ G Sbjct: 335 VLDEMIGSGFNPNVVVFNNLIDGYCKMGNIEGALKIKDVMISKNITPTSVTLYSLMQG 392 >ref|XP_002519901.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223540947|gb|EEF42505.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 777 Score = 162 bits (410), Expect = 5e-38 Identities = 77/129 (59%), Positives = 98/129 (75%), Gaps = 1/129 (0%) Frame = -3 Query: 384 ADSSVDESELTMASFDLLVHVYCTQFKNLGLGFAFDVFRLLTNRGLFPSLKTCNFLLSSL 205 A ++ E + + DLL+HVY TQFK+LG G F++F LL N+GLFPSLKTCNFLLSSL Sbjct: 113 ASETLFEPAVAVTVVDLLIHVYSTQFKHLGFGVVFELFSLLANKGLFPSLKTCNFLLSSL 172 Query: 204 VKANELDRSYEVFAIICR-GFSPDVYSFSTAINAFCKGGRIEVAAQLFYKMEVFGISPTV 28 VKANE+ SY+VF I+C G +PDVY 
FST +NAFC GGR++ A +LF KME G++P V Sbjct: 173 VKANEVKMSYQVFDIMCHCGVTPDVYLFSTMVNAFCTGGRVDDAIELFRKMEKVGVAPNV 232 Query: 27 ITYNTLIHG 1 +TYN +IHG Sbjct: 233 VTYNNIIHG 241 >ref|XP_003548529.2| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Glycine max] Length = 840 Score = 161 bits (408), Expect = 8e-38 Identities = 81/156 (51%), Positives = 107/156 (68%), Gaps = 4/156 (2%) Frame = -3 Query: 456 LIDGKLPALFEKSK----DRHIEISLAIADSSVDESELTMASFDLLVHVYCTQFKNLGLG 289 LIDG +P K+ DR EI+ ++ + + E + DLL+H+ C+QFK LG Sbjct: 148 LIDGHVPTWSSKTTTSFHDRLREIASSMLELNQGSDEQRLGELDLLLHILCSQFKCLGSR 207 Query: 288 FAFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAIN 109 AFD+F + + RG+FP LKTCN LLSSLVKANEL +SYEVF + C+G +PDV++F+TAIN Sbjct: 208 CAFDIFVMFSKRGVFPCLKTCNLLLSSLVKANELHKSYEVFDLACQGVAPDVFTFTTAIN 267 Query: 108 AFCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 AFCKGGR+ A LF KME G+ P V+TYN +I G Sbjct: 268 AFCKGGRVGDAVDLFCKMEGLGVFPNVVTYNNVIDG 303 Score = 57.0 bits (136), Expect = 3e-06 Identities = 32/127 (25%), Positives = 64/127 (50%), Gaps = 4/127 (3%) Frame = -3 Query: 369 DESELTMASFDLLVHVYCTQFKNLGLGFAFDVFRL---LTNRGLFPSLKTCNFLLSSLVK 199 ++ EL+ +++L+ YC +G + F+L + +RG+ P+ T + L+ + Sbjct: 639 EKVELSSVVYNILIAAYCR------IGNVTEAFKLRDAMKSRGILPTCATYSSLIHGMCC 692 Query: 198 ANELDRSYEVFAIICR-GFSPDVYSFSTAINAFCKGGRIEVAAQLFYKMEVFGISPTVIT 22 +D + E+F + G P+V+ ++ I CK G++++ + +M GI P IT Sbjct: 693 IGRVDEAKEIFEEMRNEGLLPNVFCYTALIGGHCKLGQMDIVGSILLEMSSNGIRPNKIT 752 Query: 21 YNTLIHG 1 Y +I G Sbjct: 753 YTIMIDG 759 >ref|XP_004166658.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucumis sativus] Length = 799 Score = 161 bits (408), Expect = 8e-38 Identities = 86/154 (55%), Positives = 106/154 (68%), Gaps = 2/154 (1%) Frame = -3 Query: 456 LIDGKLPALFEKSKDRHIEISLAI--ADSSVDESELTMASFDLLVHVYCTQFKNLGLGFA 283 ++ G LP L S+ HIEI+ A+ S V E T A FDLL+HVY TQF+NLG A Sbjct: 112 
IVYGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQA-FDLLIHVYSTQFRNLGFSCA 170 Query: 282 FDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAF 103 DVF LL +G FPSLKTCNF LSSLVKANE ++ EVF ++ G PDV+SF+ INA Sbjct: 171 VDVFYLLARKGTFPSLKTCNFXLSSLVKANEFEKCCEVFRVMSEGACPDVFSFTNVINAL 230 Query: 102 CKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 CKGG++E A +LF KME GISP V+TYN +I+G Sbjct: 231 CKGGKMENAIELFMKMEKLGISPNVVTYNCIING 264 Score = 58.9 bits (141), Expect = 7e-07 Identities = 36/118 (30%), Positives = 67/118 (56%), Gaps = 1/118 (0%) Frame = -3 Query: 351 MASFDLLVHVYCTQFKNLGLGFAFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYE 172 + +++ +++ C +N L AF++ +T +G+ P+LKT L++ L+K N D+ Sbjct: 255 VVTYNCIINGLC---QNGRLDNAFELKEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNH 311 Query: 171 VF-AIICRGFSPDVYSFSTAINAFCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 + +I GF+P+V F+ I+ +CK G IE A ++ M I+PT +T +L+ G Sbjct: 312 ILDEMIGAGFNPNVVVFNNLIDGYCKMGNIEGALKIKDVMISKNITPTSVTLYSLMQG 369 >gb|AHB18408.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum] Length = 846 Score = 161 bits (407), Expect = 1e-37 Identities = 84/155 (54%), Positives = 110/155 (70%), Gaps = 3/155 (1%) Frame = -3 Query: 456 LIDGKLPALFEKSKD---RHIEISLAIADSSVDESELTMASFDLLVHVYCTQFKNLGLGF 286 LIDGKLP LF + HI+I++A+AD ++ S +A DLL+H+YCTQFKN+G + Sbjct: 168 LIDGKLP-LFSPNNPPTVNHIQIAIALAD--LNTSFKGVAGVDLLLHLYCTQFKNVGFTY 224 Query: 285 AFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINA 106 A DVF L +G+FPS KTCNF L+SL+KANE+ ++Y+VF + R S DVY +T IN Sbjct: 225 AIDVFFTLAYKGIFPSTKTCNFFLNSLLKANEVRKTYQVFETLSRSVSLDVYLCTTMING 284 Query: 105 FCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 FCKGGRI+ A LF +ME GISP V+TYN +IHG Sbjct: 285 FCKGGRIQDAMALFSRMENLGISPNVVTYNNIIHG 319 Score = 58.5 bits (140), Expect = 9e-07 Identities = 32/118 (27%), Positives = 69/118 (58%), Gaps = 1/118 (0%) Frame = -3 Query: 351 MASFDLLVHVYCTQFKNLGLGFAFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYE 172 + +++ ++H C K+ L AF + + +T +G+ SL T + L++ L+K ++ + + Sbjct: 310 
VVTYNNIIHGLC---KSGRLDEAFQIKQNMTKQGVDHSLITYSVLINGLIKLDKFEEANS 366 Query: 171 VFAIIC-RGFSPDVYSFSTAINAFCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 V + +GF+P+ + ++T I +CK I+ A ++ ++M G+ P +T+N L+HG Sbjct: 367 VLKEMSDKGFAPNEFVYNTLIAGYCKMENIDEALRIKHQMLSNGMKPNSVTFNLLMHG 424 >ref|XP_007040906.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|590680604|ref|XP_007040907.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|590680608|ref|XP_007040908.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|590680612|ref|XP_007040909.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|590680616|ref|XP_007040910.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|590680620|ref|XP_007040911.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508778151|gb|EOY25407.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508778152|gb|EOY25408.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508778153|gb|EOY25409.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508778154|gb|EOY25410.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508778155|gb|EOY25411.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508778156|gb|EOY25412.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] Length = 845 Score = 157 bits (397), Expect = 1e-36 Identities = 82/154 (53%), Positives = 107/154 (69%), Gaps = 2/154 (1%) Frame = -3 Query: 456 LIDGKLPALFEKSKD-RHIEISLAIAD-SSVDESELTMASFDLLVHVYCTQFKNLGLGFA 283 LIDGKLP + HI+I+ A+AD +++ + + D+L+H+YCTQFKN G A Sbjct: 156 
LIDGKLPLSSPNNTTIDHIQITTALADLNTLSKGVPRVMGVDMLLHLYCTQFKNAGFTSA 215 Query: 282 FDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAF 103 DVF L ++G+FPS KTCNF LSSLVKANEL ++Y+VF + R S DVY +T INAF Sbjct: 216 IDVFFTLADKGMFPSSKTCNFFLSSLVKANELQKTYQVFETLSRFVSLDVYLCTTMINAF 275 Query: 102 CKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 CKGGRI+ A LF +ME GI+P V+TYN +IHG Sbjct: 276 CKGGRIQDAFALFSRMENLGIAPNVVTYNNIIHG 309 >sp|Q940A6.2|PP325_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g19440, chloroplastic; Flags: Precursor Length = 838 Score = 156 bits (395), Expect = 2e-36 Identities = 78/153 (50%), Positives = 106/153 (69%), Gaps = 1/153 (0%) Frame = -3 Query: 456 LIDGKLPALFEKSKDRHIEISLAIADSSVD-ESELTMASFDLLVHVYCTQFKNLGLGFAF 280 LI+G +P L +D + I+ A+A S+ + E+ DLL+ VYCTQFK G A Sbjct: 165 LINGNVPVLPCGLRDSRVAIADAMASLSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLAL 224 Query: 279 DVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFC 100 DVF +L N+G+FPS TCN LL+SLV+ANE + E F ++C+G SPDVY F+TAINAFC Sbjct: 225 DVFPVLANKGMFPSKTTCNILLTSLVRANEFQKCCEAFDVVCKGVSPDVYLFTTAINAFC 284 Query: 99 KGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 KGG++E A +LF KME G++P V+T+NT+I G Sbjct: 285 KGGKVEEAVKLFSKMEEAGVAPNVVTFNTVIDG 317 >emb|CAA18631.1| putative protein [Arabidopsis thaliana] gi|7268739|emb|CAB78946.1| putative protein [Arabidopsis thaliana] Length = 814 Score = 156 bits (395), Expect = 2e-36 Identities = 78/153 (50%), Positives = 106/153 (69%), Gaps = 1/153 (0%) Frame = -3 Query: 456 LIDGKLPALFEKSKDRHIEISLAIADSSVD-ESELTMASFDLLVHVYCTQFKNLGLGFAF 280 LI+G +P L +D + I+ A+A S+ + E+ DLL+ VYCTQFK G A Sbjct: 141 LINGNVPVLPCGLRDSRVAIADAMASLSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLAL 200 Query: 279 DVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFC 100 DVF +L N+G+FPS TCN LL+SLV+ANE + E F ++C+G SPDVY F+TAINAFC Sbjct: 201 DVFPVLANKGMFPSKTTCNILLTSLVRANEFQKCCEAFDVVCKGVSPDVYLFTTAINAFC 260 Query: 99 KGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 KGG++E A +LF KME G++P V+T+NT+I G Sbjct: 261 KGGKVEEAVKLFSKMEEAGVAPNVVTFNTVIDG 293 
>ref|NP_567587.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|334186696|ref|NP_001190771.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|15810161|gb|AAL07224.1| unknown protein [Arabidopsis thaliana] gi|332658782|gb|AEE84182.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332658783|gb|AEE84183.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 825 Score = 156 bits (395), Expect = 2e-36 Identities = 78/153 (50%), Positives = 106/153 (69%), Gaps = 1/153 (0%) Frame = -3 Query: 456 LIDGKLPALFEKSKDRHIEISLAIADSSVD-ESELTMASFDLLVHVYCTQFKNLGLGFAF 280 LI+G +P L +D + I+ A+A S+ + E+ DLL+ VYCTQFK G A Sbjct: 152 LINGNVPVLPCGLRDSRVAIADAMASLSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLAL 211 Query: 279 DVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFC 100 DVF +L N+G+FPS TCN LL+SLV+ANE + E F ++C+G SPDVY F+TAINAFC Sbjct: 212 DVFPVLANKGMFPSKTTCNILLTSLVRANEFQKCCEAFDVVCKGVSPDVYLFTTAINAFC 271 Query: 99 KGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 KGG++E A +LF KME G++P V+T+NT+I G Sbjct: 272 KGGKVEEAVKLFSKMEEAGVAPNVVTFNTVIDG 304 >ref|XP_004509525.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X1 [Cicer arietinum] gi|502153968|ref|XP_004509526.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X2 [Cicer arietinum] gi|502153970|ref|XP_004509527.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X3 [Cicer arietinum] gi|502153972|ref|XP_004509528.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X4 [Cicer arietinum] gi|502153974|ref|XP_004509529.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X5 [Cicer arietinum] gi|502153976|ref|XP_004509530.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X6 [Cicer arietinum] 
gi|502153978|ref|XP_004509531.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X7 [Cicer arietinum] gi|502153980|ref|XP_004509532.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X8 [Cicer arietinum] gi|502153982|ref|XP_004509533.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X9 [Cicer arietinum] gi|502153984|ref|XP_004509534.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like isoform X10 [Cicer arietinum] Length = 835 Score = 156 bits (394), Expect = 3e-36 Identities = 86/156 (55%), Positives = 108/156 (69%), Gaps = 4/156 (2%) Frame = -3 Query: 456 LIDGKLPALFEKSKDRHIEISLAIADSSVDESELTMAS---FDLLVHVYCTQFKNLGLGF 286 LIDG + DR E+ A S ++ S LT S DLL+H+ C+QF++LG + Sbjct: 145 LIDGNVSTPLLNRDDRLSEM----ASSFLELSRLTERSHGELDLLLHILCSQFQHLGFHW 200 Query: 285 AFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICR-GFSPDVYSFSTAIN 109 AFD+F L T+ G+FPSLKTCNFLLSSLVK+NEL +SY VF ++CR G S DVY+FSTAIN Sbjct: 201 AFDIFTLFTSNGVFPSLKTCNFLLSSLVKSNELHKSYRVFDVVCRGGVSLDVYTFSTAIN 260 Query: 108 AFCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 AF KGG+I+ A LF KME G+ P V+TYN LI G Sbjct: 261 AFSKGGKIDDAVGLFSKMEEQGVLPNVVTYNNLIDG 296 >gb|EYU29134.1| hypothetical protein MIMGU_mgv1a001281mg [Mimulus guttatus] Length = 847 Score = 155 bits (392), Expect = 6e-36 Identities = 81/155 (52%), Positives = 109/155 (70%), Gaps = 3/155 (1%) Frame = -3 Query: 456 LIDGKLP-ALFEKSKDRHIEISLAIADSSVDESELTMAS--FDLLVHVYCTQFKNLGLGF 286 LID KLP +L + + H EI++ +AD+ + + FD+LVHVY T+FK+LGL Sbjct: 161 LIDRKLPVSLRDNVVNLHNEIAIVLADTFSGSEKFRSGNRGFDMLVHVYATEFKSLGLDA 220 Query: 285 AFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINA 106 A DVFRLL R L PS KTCNFL+S+LVKA+E ++SYE+F I+ R PDVY +STAINA Sbjct: 221 AMDVFRLLAGRRLVPSFKTCNFLMSTLVKADEHEKSYEIFLIVSRESLPDVYLYSTAINA 280 Query: 105 FCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 CKGG+++ AA LF M G++P V+TYN L++G Sbjct: 281 
LCKGGKVDEAAMLFKVMGNSGVAPNVVTYNNLMNG 315 Score = 56.6 bits (135), Expect = 4e-06 Identities = 31/96 (32%), Positives = 51/96 (53%), Gaps = 1/96 (1%) Frame = -3 Query: 285 AFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICR-GFSPDVYSFSTAIN 109 A +F + NRG+ P+L T + L+ L A L+ S +F + + G PDV ++ I Sbjct: 675 ALKLFDDMKNRGVKPTLATYSSLIHGLSNAGRLNDSKVLFDEMRKEGLMPDVVCYTALIG 734 Query: 108 AFCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 +CK G ++ A L +M +F + IT+ +IHG Sbjct: 735 GYCKLGHMDEARNLLQEMSLFNVKANKITFTVIIHG 770 >ref|XP_007156329.1| hypothetical protein PHAVU_003G277400g [Phaseolus vulgaris] gi|561029683|gb|ESW28323.1| hypothetical protein PHAVU_003G277400g [Phaseolus vulgaris] Length = 837 Score = 155 bits (392), Expect = 6e-36 Identities = 75/152 (49%), Positives = 105/152 (69%) Frame = -3 Query: 456 LIDGKLPALFEKSKDRHIEISLAIADSSVDESELTMASFDLLVHVYCTQFKNLGLGFAFD 277 LIDG +P F ++R EI+ ++ + + + DLL+++ C+++K+ G AFD Sbjct: 150 LIDGHVPTSFHDRENRLREIASSMLELN-QVLDTRHGELDLLLYILCSRYKDFGFRCAFD 208 Query: 276 VFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFCK 97 +F + + RG+FP LKTCNFLLSSLV ANEL +SYEVF + C+G PDV+ F+ AINAFCK Sbjct: 209 IFIMFSKRGVFPCLKTCNFLLSSLVTANELHKSYEVFDVTCQGVVPDVFMFTAAINAFCK 268 Query: 96 GGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 GGR+ A LF+KME G+SP V+TYN +I G Sbjct: 269 GGRVGDAVDLFHKMEKLGVSPNVVTYNNVIDG 300 >ref|XP_006413978.1| hypothetical protein EUTSA_v10024401mg [Eutrema salsugineum] gi|557115148|gb|ESQ55431.1| hypothetical protein EUTSA_v10024401mg [Eutrema salsugineum] Length = 837 Score = 152 bits (385), Expect = 4e-35 Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 3/155 (1%) Frame = -3 Query: 456 LIDGKLPALFEKSKDRHIEISLAIADSSVD---ESELTMASFDLLVHVYCTQFKNLGLGF 286 LI+G +P L + R +++A A +S+ + E+ M DLL+ VYCTQFK G Sbjct: 162 LINGNVPVLPSANDSRDGRVAIADAMASLSLCFDPEIRMRISDLLIEVYCTQFKRAGCYL 221 Query: 285 AFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINA 106 A D+F LL N+GLFPS TCN LL+SLV+ANE + E F +C+G SPDVY F+T INA Sbjct: 222 
ALDIFPLLANKGLFPSRTTCNILLTSLVRANEFQKCCEAFEAVCKGVSPDVYLFTTVINA 281 Query: 105 FCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1 +CK G++ A +LF KME G++P V+TYNT+I G Sbjct: 282 YCKRGKVGEAIELFSKMEEAGVAPNVVTYNTVIDG 316
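
The per-hit statistics in the report above (bit score, E-value, identities, frame) follow a fixed wording, so they can be pulled out with a regular expression even when the text has lost its line breaks. The following is a minimal sketch, not part of the BLAST output: the names HSP_RE, parse_hits and SAMPLE are hypothetical, the pattern assumes the legacy "Score = ... bits (...), Expect = ..." wording seen in this report, and SAMPLE is an abridged excerpt of two records from it (alignment rows replaced by "...").

```python
import re

# Statistics of one HSP as worded in this legacy BLASTX text report.
# Assumption: the "Gaps = ..." field is optional, as in the records above.
HSP_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<aln_len>\d+)\s+\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?"
    r"\s+Frame\s*=\s*(?P<frame>[+-]?\d)"
)


def parse_hits(text):
    """Return one dict per HSP found in a (possibly flattened) report."""
    hits = []
    # Subject records start with ">db|accession|"; anything before the
    # first record (header, summary table) is skipped.
    for chunk in re.split(r"(?:^|\s)>(?=\w+\|)", text)[1:]:
        subject_id = chunk.split(None, 1)[0]
        for m in HSP_RE.finditer(chunk):
            ev = m.group("evalue")
            if ev.startswith("e"):  # legacy BLAST may print "e-100"
                ev = "1" + ev
            hits.append({
                "subject": subject_id,
                "bits": float(m.group("bits")),
                "evalue": float(ev),
                "identity": int(m.group("ident")) / int(m.group("aln_len")),
                "frame": int(m.group("frame")),
            })
    return hits


# Abridged excerpt of two records from the report above.
SAMPLE = (
    ">ref|XP_006351033.1| PREDICTED: pentatricopeptide repeat-containing "
    "protein At4g19440, chloroplastic-like [Solanum tuberosum] Length = 928 "
    "Score = 194 bits (493), Expect = 1e-47 Identities = 94/153 (61%), "
    "Positives = 121/153 (79%), Gaps = 1/153 (0%) Frame = -3 ... "
    "Score = 60.1 bits (144), Expect = 3e-07 Identities = 30/87 (34%), "
    "Positives = 55/87 (63%), Gaps = 1/87 (1%) Frame = -3 ... "
    ">ref|XP_007156329.1| hypothetical protein PHAVU_003G277400g "
    "[Phaseolus vulgaris] Length = 837 Score = 155 bits (392), "
    "Expect = 6e-36 Identities = 75/152 (49%), Positives = 105/152 (69%) "
    "Frame = -3 ..."
)

if __name__ == "__main__":
    for hit in parse_hits(SAMPLE):
        print(hit["subject"], hit["bits"], hit["evalue"],
              round(hit["identity"], 3), hit["frame"])
```

Splitting on the ">" record markers first keeps each HSP attached to its subject, which matters for hits such as XP_006351033.1 that report more than one HSP. For routine work a dedicated BLAST parser or tabular output is likely a better choice than a hand-written pattern.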