BLASTX nr result
ID: Akebia24_contig00027703
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00027703
         (638 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_002282084.1| PREDICTED: pentatricopeptide repeat-containi...  301   8e-80
ref|XP_007037335.1| Tetratricopeptide repeat (TPR)-like superfam...  281   9e-74
gb|EXB44682.1| hypothetical protein L484_015939 [Morus notabilis]    269   6e-70
ref|XP_007211285.1| hypothetical protein PRUPE_ppa002699mg [Prun...  261   1e-67
ref|XP_002887548.1| hypothetical protein ARALYDRAFT_339650 [Arab...  253   3e-65
ref|XP_004137583.1| PREDICTED: pentatricopeptide repeat-containi...  252   6e-65
ref|XP_004301149.1| PREDICTED: pentatricopeptide repeat-containi...  250   3e-64
ref|XP_002511576.1| pentatricopeptide repeat-containing protein,...  246   5e-63
ref|XP_004240655.1| PREDICTED: pentatricopeptide repeat-containi...  245   7e-63
ref|XP_007155315.1| hypothetical protein PHAVU_003G190600g [Phas...  241   1e-61
ref|XP_006601432.1| PREDICTED: putative pentatricopeptide repeat...  240   3e-61
ref|NP_177601.1| pentatricopeptide repeat-containing protein [Ar...  239   4e-61
dbj|BAD93880.1| hypothetical protein [Arabidopsis thaliana] gi|6...  239   4e-61
ref|XP_006390406.1| hypothetical protein EUTSA_v10018254mg [Eutr...  235   7e-60
ref|XP_006301886.1| hypothetical protein CARUB_v10022358mg [Caps...  234   2e-59
emb|CBI15227.3| unnamed protein product [Vitis vinifera]             233   3e-59
gb|EPS64459.1| hypothetical protein M569_10321 [Genlisea aurea]      210   3e-52
ref|XP_003609069.1| Pentatricopeptide repeat protein [Medicago t...  189   5e-46
gb|EYU24377.1| hypothetical protein MIMGU_mgv1a020160mg [Mimulus...  171   2e-40
ref|XP_003621264.1| Pentatricopeptide repeat-containing protein ...  146   6e-33

>ref|XP_002282084.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74630-like [Vitis vinifera]
          Length = 643

 Score = 301 bits (772), Expect = 8e-80
 Identities = 144/212 (67%), Positives = 175/212 (82%)
 Frame = -1

Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459
           HAY  KTGL  DP IAGK             DYAR LFL  P+PD+FM+NTLIRGL+ES+
Sbjct: 25  HAYVCKTGLDTDPIIAGKLLLHSAVSVPDALDYARRLFLHFPNPDVFMHNTLIRGLAESD 84

Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279
           +PQNSL+T++EMRRR T   DSFSFAF+LKAAA+Y+SL SGIQLHCQ+++HGLDTH+FV
Sbjct: 85  TPQNSLITFVEMRRRLTAPLDSFSFAFLLKAAASYRSLESGIQLHCQAIVHGLDTHLFVG 144

Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRMPFRNLT 99
           TTLVSMY+E G +A A+K F+E+ +PNVV+WNA+VTACFRCGDVKGA+++FNRMPFRNLT
Sbjct: 145 TTLVSMYSECGFVAFAKKVFEEMFEPNVVAWNAVVTACFRCGDVKGADMMFNRMPFRNLT 204

Query: 98  SWNLMLAGYMKAGELEPAKRMFAEMSVKDPVS 3
           SWN+MLAGY KAGELE A+++F EM VKD VS
Sbjct: 205 SWNVMLAGYTKAGELELARKLFLEMPVKDDVS 236

>ref|XP_007037335.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma cacao]
 gi|508774580|gb|EOY21836.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma cacao]
          Length = 643

 Score = 281 bits (720), Expect = 9e-74
 Identities = 138/212 (65%), Positives = 167/212 (78%)
 Frame = -1

Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459
           HA   KTGL++DPFIAGK             DYAR  FL  P+PD+FM+NTLIRG SES+
Sbjct: 25  HASLVKTGLNSDPFIAGKLILHCAVTNSDVLDYARRFFLHFPNPDVFMHNTLIRGFSESS 84

Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279
           +PQNS+ T+I+MRR+S   PDSFSFAFVLKAA+NY SL +GIQLHCQ+LIHGLDTH+FV
Sbjct: 85  TPQNSIFTFIDMRRKSMVPPDSFSFAFVLKAASNYGSLRAGIQLHCQALIHGLDTHLFVG 144

Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRMPFRNLT 99
           TTL+SMY E G +  A+KAF+++ +PNVV+WNAIVTACFRCGDVKGA  +F+ MPF N T
Sbjct: 145 TTLISMYGECGSVCFAKKAFEQMLEPNVVAWNAIVTACFRCGDVKGARKMFDMMPFTNST 204

Query: 98
SWNLMLAGYMKAGELEPAKRMFAEMSVKDPVS 3 S N+MLAG+ KAGE+E AK+MF EM VKD VS Sbjct: 205 SSNVMLAGFAKAGEMELAKKMFWEMKVKDDVS 236 >gb|EXB44682.1| hypothetical protein L484_015939 [Morus notabilis] Length = 644 Score = 269 bits (687), Expect = 6e-70 Identities = 130/212 (61%), Positives = 159/212 (75%) Frame = -1 Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459 HA+A KTG+ +DP AGK DYAR L P+PD FMYNTLIRG SES+ Sbjct: 27 HAFACKTGVDSDPLFAGKLLLHSAVAISEGLDYARRLLSHFPNPDAFMYNTLIRGFSESD 86 Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279 +P N+ T+ EM R+S DSFSFAFVLKAAAN + L +G+QLHCQ+L GL+TH+FV Sbjct: 87 NPCNAFATFKEMHRKSVSQFDSFSFAFVLKAAANLRCLQAGVQLHCQALARGLNTHLFVG 146 Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRMPFRNLT 99 TTL+S YAE GC+ ARK FDE+PQPN V+WNAI+TACFRCGD++G E LF RMP RNLT Sbjct: 147 TTLISFYAECGCLEFARKMFDEIPQPNAVTWNAILTACFRCGDLEGGEALFERMPVRNLT 206 Query: 98 SWNLMLAGYMKAGELEPAKRMFAEMSVKDPVS 3 SW++MLAGY+KAGELE A+++F+ M VKD VS Sbjct: 207 SWDVMLAGYVKAGELELARKVFSRMPVKDDVS 238 >ref|XP_007211285.1| hypothetical protein PRUPE_ppa002699mg [Prunus persica] gi|462407020|gb|EMJ12484.1| hypothetical protein PRUPE_ppa002699mg [Prunus persica] Length = 643 Score = 261 bits (668), Expect = 1e-67 Identities = 125/212 (58%), Positives = 158/212 (74%) Frame = -1 Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459 HA+A KTGL P ++GK +YAR L L +PD FMYNTLIRG +ES+ Sbjct: 25 HAFACKTGLDAHPLVSGKLLLHCAVTISGALEYARRLLLHFRNPDAFMYNTLIRGFAESD 84 Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279 +P N+ ++EMRR+ DSFSFAF+LKAAAN +SL G+QLHCQ+L HGLDTH+FV Sbjct: 85 TPDNAFDVFVEMRRKLIDPLDSFSFAFILKAAANCRSLRDGMQLHCQALTHGLDTHLFVG 144 Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRMPFRNLT 99 TT++SMYAE G ++ ARK F+E+ PNVV+WNAI+TACFRCGDV+GAE +F+RMP RNLT Sbjct: 145 TTIISMYAECGIVSFARKVFEEMSDPNVVAWNAILTACFRCGDVEGAETMFDRMPLRNLT 204 Query: 98 SWNLMLAGYMKAGELEPAKRMFAEMSVKDPVS 3 SWN++LAGY+KA ELE AK+ F M +KD 
VS Sbjct: 205 SWNVLLAGYVKADELELAKKAFLRMPMKDDVS 236 >ref|XP_002887548.1| hypothetical protein ARALYDRAFT_339650 [Arabidopsis lyrata subsp. lyrata] gi|297333389|gb|EFH63807.1| hypothetical protein ARALYDRAFT_339650 [Arabidopsis lyrata subsp. lyrata] Length = 1221 Score = 253 bits (646), Expect = 3e-65 Identities = 119/212 (56%), Positives = 152/212 (71%) Frame = -1 Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459 H + K+G+ D + GK YAR L L P PD FM+NTL+RG SES+ Sbjct: 192 HGFFIKSGVDTDSYFIGKLILHCAISISDALPYARRLLLCFPEPDAFMFNTLVRGYSESD 251 Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279 P NS+ ++EM R+ PDSFSFAFV+KAAAN++SL +G Q+HCQ+L HGLD+H+FV Sbjct: 252 EPHNSVAVFVEMMRKGFIFPDSFSFAFVVKAAANFRSLRTGFQMHCQALKHGLDSHLFVA 311 Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRMPFRNLT 99 TTL+ MY E GC+ ARK FDE+PQPN+V+WNA+VTACFR DV GA +F++M RN T Sbjct: 312 TTLIGMYGECGCVGFARKVFDEMPQPNLVAWNAVVTACFRGNDVSGAREIFDKMLVRNHT 371 Query: 98 SWNLMLAGYMKAGELEPAKRMFAEMSVKDPVS 3 SWN+MLAGY+KAGELE AKR+F+EM +D VS Sbjct: 372 SWNVMLAGYIKAGELECAKRIFSEMPHRDDVS 403 >ref|XP_004137583.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74630-like [Cucumis sativus] gi|449487109|ref|XP_004157499.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74630-like [Cucumis sativus] Length = 642 Score = 252 bits (644), Expect = 6e-65 Identities = 121/207 (58%), Positives = 157/207 (75%) Frame = -1 Query: 623 KTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESNSPQNS 444 KT L++ P ++GK YAR LFL I +PD+FMYNTLIRGLS+S++P N+ Sbjct: 30 KTCLNSYPLVSGKLLLHCAVTLPDSLHYARRLFLDIRNPDVFMYNTLIRGLSDSDTPSNA 89 Query: 443 LLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVRTTLVS 264 L ++EMRR+S LPDSFSFAF+LKAAAN ++L +G+QLHC ++ +GLD+H+FV TTL+S Sbjct: 90 LQLFVEMRRKSVALPDSFSFAFLLKAAANCRALTNGLQLHCLAVGYGLDSHLFVGTTLIS 149 Query: 263 MYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRMPFRNLTSWNLM 84 MYAE C+ ARK FDE+ +PN+V+WNAIV ACFRC VK AE +F MP RNLTSWN+M Sbjct: 150 
MYAECACLVFARKVFDEMIEPNIVAWNAIVAACFRCEGVKDAEQVFRCMPIRNLTSWNIM 209 Query: 83 LAGYMKAGELEPAKRMFAEMSVKDPVS 3 LAGY KAGEL+ A+ +F +M +KD VS Sbjct: 210 LAGYTKAGELQLAREVFMKMPLKDDVS 236 Score = 57.0 bits (136), Expect = 5e-06 Identities = 41/146 (28%), Positives = 68/146 (46%) Frame = -1 Query: 539 ARHLFLQIPSPDIFMYNTLIRGLSESNSPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAA 360 AR +F+++P D ++T+I G + + + ++ + E+RR P+ S VL A A Sbjct: 222 AREVFMKMPLKDDVSWSTMIVGFAHNGNFNDAFAFFREVRREGMR-PNEVSLTGVLSACA 280 Query: 359 NYKSLNSGIQLHCQSLIHGLDTHIFVRTTLVSMYAESGCIASARKAFDELPQPNVVSWNA 180 + G LH G I V L+ Y++ G + AR FD + + + VSW A Sbjct: 281 QAGAFEFGRILHGFVEKSGFLQIISVNNALIDTYSKCGNLDMARLVFDNMLRRSAVSWTA 340 Query: 179 IVTACFRCGDVKGAEILFNRMPFRNL 102 ++ G + A LFN M N+ Sbjct: 341 MIAGMAMHGYGEEAIRLFNEMEESNI 366 >ref|XP_004301149.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74630-like [Fragaria vesca subsp. vesca] Length = 643 Score = 250 bits (638), Expect = 3e-64 Identities = 122/212 (57%), Positives = 152/212 (71%) Frame = -1 Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459 HA+ KTGL DP + GK YAR LFL P PD FMYNTLIRGL++S+ Sbjct: 25 HAFVHKTGLDTDPLVFGKLLLHCAVTISDLLVYARRLFLHFPYPDAFMYNTLIRGLADSD 84 Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279 +P N+LL + EMRR++ DSF+FAF LKAAAN +SL+ G QLHCQ+ IHGL +H+FV Sbjct: 85 NPHNALLLFREMRRKNMVSIDSFTFAFTLKAAANSRSLSGGTQLHCQAFIHGLYSHMFVG 144 Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRMPFRNLT 99 TTLVS+Y E G + ARK FDE+ +PNVV+WNA++TACFRCGDV+ AE +F MP R+ T Sbjct: 145 TTLVSVYGECGSVGHARKVFDEMTEPNVVAWNAVLTACFRCGDVEVAEEVFGSMPLRDST 204 Query: 98 SWNLMLAGYMKAGELEPAKRMFAEMSVKDPVS 3 SWN++LAGY+KAGEL A+ F M VKD VS Sbjct: 205 SWNIVLAGYVKAGELALAREAFWRMPVKDDVS 236 >ref|XP_002511576.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223550691|gb|EEF52178.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 438 Score = 246 bits (627), Expect = 5e-63 
Identities = 121/212 (57%), Positives = 153/212 (72%) Frame = -1 Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459 HA K GL ND + GK DYA LF P+PD+FMYNTLIRGL+ES+ Sbjct: 25 HANVLKIGLQNDLLLIGKLLLHCTIVLSDSIDYALSLFRDTPNPDVFMYNTLIRGLAESD 84 Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279 SPQ S+ T++E+R+ S PDSFSFAFVLKAAA +SL GIQLHCQ+ +GL+ H+FV Sbjct: 85 SPQKSIATFLELRKESALSPDSFSFAFVLKAAAYLRSLRGGIQLHCQAWKYGLNAHLFVG 144 Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRMPFRNLT 99 TTL+SMY E GC+ AR+ F E+ +PNV++WNA++ ACFR GDVK A +F+ M FR+LT Sbjct: 145 TTLISMYGECGCVGYARQVFGEMHEPNVIAWNAVIAACFRGGDVKEAGKMFSLMVFRDLT 204 Query: 98 SWNLMLAGYMKAGELEPAKRMFAEMSVKDPVS 3 SWN+MLAGY+K GEL+ A+ MF EM+VKD VS Sbjct: 205 SWNVMLAGYVKIGELQLAREMFLEMAVKDDVS 236 >ref|XP_004240655.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74630-like [Solanum lycopersicum] Length = 525 Score = 245 bits (626), Expect = 7e-63 Identities = 122/212 (57%), Positives = 154/212 (72%) Frame = -1 Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459 HA+ K+GL +P IAGK DYAR L + P+ D+FMYNTLIRG SES+ Sbjct: 25 HAFVYKSGLETNPLIAGKLLILGALQISDAIDYARRLLIHYPNSDVFMYNTLIRGESESD 84 Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279 SP+NS+ T+I M R+S PDSFSFAFVLKAAAN + L +G QLHCQ++ GLDTH+FV Sbjct: 85 SPKNSVSTFIYMLRQSYSPPDSFSFAFVLKAAANLRCLTTGFQLHCQAMTRGLDTHLFVG 144 Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRMPFRNLT 99 TT++SMYAE G + A K F ++PQPNVV+WNAI+TA R DV GA+ +F MPFRNLT Sbjct: 145 TTIISMYAECGFVEFAWKVFVQIPQPNVVAWNAILTAYLRGSDVSGADKVFGLMPFRNLT 204 Query: 98 SWNLMLAGYMKAGELEPAKRMFAEMSVKDPVS 3 +WN+MLAGY KAGELE A+R+F +M +D +S Sbjct: 205 TWNVMLAGYTKAGELERAERLFLQMPSRDDIS 236 >ref|XP_007155315.1| hypothetical protein PHAVU_003G190600g [Phaseolus vulgaris] gi|561028669|gb|ESW27309.1| hypothetical protein PHAVU_003G190600g [Phaseolus vulgaris] Length = 626 Score = 241 bits (615), Expect = 1e-61 Identities 
= 121/212 (57%), Positives = 149/212 (70%) Frame = -1 Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459 H + K+GL DPF+ GK Y+ LF P+PD FM+NTLIRGLS S Sbjct: 23 HGHILKSGLRTDPFVFGKLLLHCAVSISDALHYSLRLFHHFPNPDTFMHNTLIRGLSLSQ 82 Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279 +P SL +I++RR+ + PDSFSFAFVLK AN + L GIQLH Q+L HG DTHIFV Sbjct: 83 TPLLSLHPFIQLRRQPSLSPDSFSFAFVLKGVANSRQLRPGIQLHSQALHHGFDTHIFVG 142 Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRMPFRNLT 99 TTL+SMYAE G SAR+ FD++ +PNVV+WNA VTA FRCGDV+GA +F RMP RNLT Sbjct: 143 TTLISMYAECGDSVSARRVFDKMSEPNVVAWNAAVTASFRCGDVEGAGDVFGRMPLRNLT 202 Query: 98 SWNLMLAGYMKAGELEPAKRMFAEMSVKDPVS 3 SWN+MLAGY KAGEL A+R+F +M ++D VS Sbjct: 203 SWNVMLAGYAKAGELGLARRVFCDMPLRDEVS 234 >ref|XP_006601432.1| PREDICTED: putative pentatricopeptide repeat-containing protein At1g74580-like [Glycine max] Length = 1428 Score = 240 bits (612), Expect = 3e-61 Identities = 121/212 (57%), Positives = 147/212 (69%) Frame = -1 Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459 HA KTGLH DP + GK YA LF P+PD FM+NTLIR LS S Sbjct: 810 HAQICKTGLHTDPLVFGKLLLHCAITISDALHYALRLFHHFPNPDTFMHNTLIRSLSLSQ 869 Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279 +P +SL +I++RR+ T PDSF+FAF LKA AN + L GIQLH Q+ HG D HIFV Sbjct: 870 TPLSSLHPFIQLRRQPTLSPDSFTFAFALKAVANSRHLRPGIQLHSQAFRHGFDAHIFVG 929 Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRMPFRNLT 99 TTL+SMYAE G SAR+ FDE+ +PNVV+WNA++TA FRCGDV+GA+ +F MP RNLT Sbjct: 930 TTLISMYAECGDSGSARRVFDEMSEPNVVTWNAVLTAAFRCGDVEGAQDVFGCMPVRNLT 989 Query: 98 SWNLMLAGYMKAGELEPAKRMFAEMSVKDPVS 3 SWN MLAGY KAGEL A+R+F EM ++D VS Sbjct: 990 SWNGMLAGYAKAGELGLARRVFYEMPLRDEVS 1021 >ref|NP_177601.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75169836|sp|Q9CA54.1|PP122_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g74630 gi|12324801|gb|AAG52363.1|AC011765_15 hypothetical 
protein; 86841-88772 [Arabidopsis thaliana] gi|332197495|gb|AEE35616.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 643 Score = 239 bits (611), Expect = 4e-61 Identities = 113/212 (53%), Positives = 146/212 (68%) Frame = -1 Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459 H K G+ D + GK YAR L L P PD FM+NTL+RG SES+ Sbjct: 25 HGLFIKYGVDTDSYFTGKLILHCAISISDALPYARRLLLCFPEPDAFMFNTLVRGYSESD 84 Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279 P NS+ ++EM R+ PDSFSFAFV+KA N++SL +G Q+HCQ+L HGL++H+FV Sbjct: 85 EPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRTGFQMHCQALKHGLESHLFVG 144 Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRMPFRNLT 99 TTL+ MY GC+ ARK FDE+ QPN+V+WNA++TACFR DV GA +F++M RN T Sbjct: 145 TTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFRGNDVAGAREIFDKMLVRNHT 204 Query: 98 SWNLMLAGYMKAGELEPAKRMFAEMSVKDPVS 3 SWN+MLAGY+KAGELE AKR+F+EM +D VS Sbjct: 205 SWNVMLAGYIKAGELESAKRIFSEMPHRDDVS 236 Score = 61.2 bits (147), Expect = 2e-07 Identities = 48/177 (27%), Positives = 81/177 (45%), Gaps = 5/177 (2%) Frame = -1 Query: 539 ARHLFLQIPSPDIFMYNTLIRGLSESNSPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAA 360 A+ +F ++P D ++T+I G++ + S S L + E++R P+ S VL A + Sbjct: 222 AKRIFSEMPHRDDVSWSTMIVGIAHNGSFNESFLYFRELQRAGMS-PNEVSLTGVLSACS 280 Query: 359 NYKSLNSGIQLHCQSLIHGLDTHIFVRTTLVSMYAESGCIASARKAFDELPQPN-VVSWN 183 S G LH G + V L+ MY+ G + AR F+ + + +VSW Sbjct: 281 QSGSFEFGKILHGFVEKAGYSWIVSVNNALIDMYSRCGNVPMARLVFEGMQEKRCIVSWT 340 Query: 182 AIVTACFRCGDVKGAEILFNRMPFRNLT----SWNLMLAGYMKAGELEPAKRMFAEM 24 +++ G + A LFN M +T S+ +L AG +E + F+EM Sbjct: 341 SMIAGLAMHGQGEEAVRLFNEMTAYGVTPDGISFISLLHACSHAGLIEEGEDYFSEM 397 >dbj|BAD93880.1| hypothetical protein [Arabidopsis thaliana] gi|62318835|dbj|BAD93890.1| hypothetical protein [Arabidopsis thaliana] Length = 635 Score = 239 bits (611), Expect = 4e-61 Identities = 113/212 (53%), Positives = 146/212 (68%) Frame = -1 Query: 638 
HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459 H K G+ D + GK YAR L L P PD FM+NTL+RG SES+ Sbjct: 17 HGLFIKYGVDTDSYFTGKLILHCAISISDALPYARRLLLCFPEPDAFMFNTLVRGYSESD 76 Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279 P NS+ ++EM R+ PDSFSFAFV+KA N++SL +G Q+HCQ+L HGL++H+FV Sbjct: 77 EPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRTGFQMHCQALKHGLESHLFVG 136 Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRMPFRNLT 99 TTL+ MY GC+ ARK FDE+ QPN+V+WNA++TACFR DV GA +F++M RN T Sbjct: 137 TTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFRGNDVAGAREIFDKMLVRNHT 196 Query: 98 SWNLMLAGYMKAGELEPAKRMFAEMSVKDPVS 3 SWN+MLAGY+KAGELE AKR+F+EM +D VS Sbjct: 197 SWNVMLAGYIKAGELESAKRIFSEMPHRDDVS 228 Score = 61.2 bits (147), Expect = 2e-07 Identities = 48/177 (27%), Positives = 81/177 (45%), Gaps = 5/177 (2%) Frame = -1 Query: 539 ARHLFLQIPSPDIFMYNTLIRGLSESNSPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAA 360 A+ +F ++P D ++T+I G++ + S S L + E++R P+ S VL A + Sbjct: 214 AKRIFSEMPHRDDVSWSTMIVGIAHNGSFNESFLYFRELQRAGMS-PNEVSLTGVLSACS 272 Query: 359 NYKSLNSGIQLHCQSLIHGLDTHIFVRTTLVSMYAESGCIASARKAFDELPQPN-VVSWN 183 S G LH G + V L+ MY+ G + AR F+ + + +VSW Sbjct: 273 QSGSFEFGKILHGFVEKAGYSWIVSVNNALIDMYSRCGNVPMARLVFEGMQEKRCIVSWT 332 Query: 182 AIVTACFRCGDVKGAEILFNRMPFRNLT----SWNLMLAGYMKAGELEPAKRMFAEM 24 +++ G + A LFN M +T S+ +L AG +E + F+EM Sbjct: 333 SMIAGLAMHGQGEEAVRLFNEMTAYGVTPDGISFISLLHACSHAGLIEEGEDYFSEM 389 >ref|XP_006390406.1| hypothetical protein EUTSA_v10018254mg [Eutrema salsugineum] gi|557086840|gb|ESQ27692.1| hypothetical protein EUTSA_v10018254mg [Eutrema salsugineum] Length = 645 Score = 235 bits (600), Expect = 7e-60 Identities = 113/212 (53%), Positives = 149/212 (70%) Frame = -1 Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459 HA K+G+ + + GK YAR L L P PD FM+N LIRG SES+ Sbjct: 27 HASFIKSGVDTNSYFTGKLILQCAISIPDALPYARRLLLCFPHPDAFMFNALIRGYSESH 86 Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279 P NS+ ++EM R+ 
PDSFSFAFV+KAAA+++SL +G Q+HCQ+L HGLD+H+FV Sbjct: 87 EPHNSVAAFVEMMRKGLVFPDSFSFAFVVKAAASFQSLRTGFQMHCQALKHGLDSHLFVA 146 Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRMPFRNLT 99 TTL+S+Y E C+ ARK F+E+ QPN+V+WNA+VTACFR DV A+ +F++M R+ Sbjct: 147 TTLISLYGECRCVEFARKVFNEMRQPNLVAWNAVVTACFRGNDVAAAKEIFDKMLVRDHM 206 Query: 98 SWNLMLAGYMKAGELEPAKRMFAEMSVKDPVS 3 SWN+MLAGY KAGELE AKR+F+EM +KD VS Sbjct: 207 SWNVMLAGYTKAGELESAKRVFSEMPLKDDVS 238 >ref|XP_006301886.1| hypothetical protein CARUB_v10022358mg [Capsella rubella] gi|482570596|gb|EOA34784.1| hypothetical protein CARUB_v10022358mg [Capsella rubella] Length = 643 Score = 234 bits (596), Expect = 2e-59 Identities = 109/212 (51%), Positives = 146/212 (68%) Frame = -1 Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459 H + K+G+ +D + GK YAR L P PD FM+NTL+RG S S+ Sbjct: 25 HGFFIKSGVDSDSYFNGKLILQCAISVPGALPYARRLLFCFPEPDAFMFNTLVRGYSGSD 84 Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279 P+N++ ++EM R+ PDSFSFAFV+KA AN++SL +G Q+HCQ+L HGL++H+FV Sbjct: 85 EPRNAVSVFVEMMRKGFVFPDSFSFAFVVKATANFRSLRTGFQMHCQALKHGLESHLFVA 144 Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRMPFRNLT 99 TTL+ MY E GC+ ARK FDE+ QPN+V+WNA++TACFR D A +F+ M +N T Sbjct: 145 TTLIGMYGECGCVEFARKVFDEMHQPNLVAWNAVITACFRGNDFSKAREIFDNMLVKNHT 204 Query: 98 SWNLMLAGYMKAGELEPAKRMFAEMSVKDPVS 3 SWN+MLAGY KAGELE AKR+F+EM +D VS Sbjct: 205 SWNVMLAGYTKAGELESAKRIFSEMPHRDDVS 236 >emb|CBI15227.3| unnamed protein product [Vitis vinifera] Length = 491 Score = 233 bits (595), Expect = 3e-59 Identities = 113/174 (64%), Positives = 138/174 (79%) Frame = -1 Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459 HAY KTGL DP IAGK DYAR LFL P+PD+FM+NTLIRGL+ES+ Sbjct: 25 HAYVCKTGLDTDPIIAGKLLLHSAVSVPDALDYARRLFLHFPNPDVFMHNTLIRGLAESD 84 Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279 +PQNSL+T++EMRRR T DSFSFAF+LKAAA+Y+SL SGIQLHCQ+++HGLDTH+FV Sbjct: 85 
TPQNSLITFVEMRRRLTAPLDSFSFAFLLKAAASYRSLESGIQLHCQAIVHGLDTHLFVG 144 Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRM 117 TTLVSMY+E G +A A+K F+E+ +PNVV+WNA+VTACFRCGDVK A LF+ M Sbjct: 145 TTLVSMYSECGFVAFAKKVFEEMFEPNVVAWNAVVTACFRCGDVKEAIQLFHEM 198 >gb|EPS64459.1| hypothetical protein M569_10321 [Genlisea aurea] Length = 512 Score = 210 bits (534), Expect = 3e-52 Identities = 114/212 (53%), Positives = 140/212 (66%) Frame = -1 Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459 HA K GL +D F+AG DYAR LF + +PD FMYN LIRG ++S+ Sbjct: 23 HAGIVKIGLDSDRFVAGMLMLTSAVELSGGMDYARLLFRRFSAPDAFMYNALIRGFADSD 82 Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279 P+ S+ Y +M R + + DSFS AF LKAAAN L SG+QLH Q+L GLD HIFV Sbjct: 83 RPETSVSVYSDMLRAAAAV-DSFSLAFALKAAANSMCLKSGLQLHGQALARGLDAHIFVG 141 Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCGDVKGAEILFNRMPFRNLT 99 TTLVSMY E + A K FDE+P PNVV+WNA VTA R GDVKGAE FN +PF+NLT Sbjct: 142 TTLVSMYGECDRLEYADKVFDEIPDPNVVTWNAKVTAHLRYGDVKGAEKAFNSIPFKNLT 201 Query: 98 SWNLMLAGYMKAGELEPAKRMFAEMSVKDPVS 3 S+NLML+GY K GE+ A+R+F +M +D VS Sbjct: 202 SYNLMLSGYSKLGEIGLARRLFDQMPSRDDVS 233 >ref|XP_003609069.1| Pentatricopeptide repeat protein [Medicago truncatula] gi|355510124|gb|AES91266.1| Pentatricopeptide repeat protein [Medicago truncatula] Length = 611 Score = 189 bits (481), Expect = 5e-46 Identities = 105/214 (49%), Positives = 127/214 (59%), Gaps = 9/214 (4%) Frame = -1 Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXD-YARHLFLQIPSPDIFMYNTLIRGLSES 462 H + TGLH PF GK Y+ LF P+PD FMYNTLIR LS S Sbjct: 30 HTHLYVTGLHTHPFFFGKLLLNCAVSISDHVLNYSLRLFHHFPNPDTFMYNTLIRSLSHS 89 Query: 461 NSPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYK-SLNSGIQLHCQSLIHGLDTHIF 285 ++P +SL +I++ R T LPDSFSFAF LK AN S GIQLH + HG D HIF Sbjct: 90 STPLSSLQPFIQLLRHPTLLPDSFSFAFTLKGIANDGCSKRQGIQLHSHAFRHGFDDHIF 149 Query: 284 VRTTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACFRCG--DVKGA-----EILF 126 V TTL+SMYAE GC ARK FDE+ QPNVV+WNA+VTACFRCG V G E++F Sbjct: 150 
VGTTLISMYAECGCYEYARKVFDEMSQPNVVAWNAVVTACFRCGMWRVLGVSFGWREVVF 209 Query: 125 NRMPFRNLTSWNLMLAGYMKAGELEPAKRMFAEM 24 M R+ SW+ M+ G+ K+G A F E+ Sbjct: 210 CEMKMRDDASWSTMIVGFAKSGSFHDAFGFFKEL 243 >gb|EYU24377.1| hypothetical protein MIMGU_mgv1a020160mg [Mimulus guttatus] Length = 580 Score = 171 bits (433), Expect = 2e-40 Identities = 87/159 (54%), Positives = 107/159 (67%) Frame = -1 Query: 638 HAYASKTGLHNDPFIAGKXXXXXXXXXXXXXDYARHLFLQIPSPDIFMYNTLIRGLSESN 459 HA A+ TGL +DP +AGK DYAR L L P PD FMYNTLIRG S+S Sbjct: 25 HARAAVTGLDSDPLVAGKLLLHSAVHLSGALDYARRLLLHNPCPDTFMYNTLIRGFSDSA 84 Query: 458 SPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAANYKSLNSGIQLHCQSLIHGLDTHIFVR 279 SPQNS+ T+ M + DSFS AF LK+AAN + L +G QLH Q+L GL TH+FV Sbjct: 85 SPQNSVSTFSLMLQNLDSPVDSFSLAFTLKSAANMRCLRTGTQLHSQALTRGLHTHLFVG 144 Query: 278 TTLVSMYAESGCIASARKAFDELPQPNVVSWNAIVTACF 162 TTL+SMYAE GC+ A+ FDE+P+PN+V+WNA+VTA F Sbjct: 145 TTLISMYAECGCVEFAQNMFDEIPEPNIVTWNALVTAFF 183 >ref|XP_003621264.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355496279|gb|AES77482.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 519 Score = 146 bits (368), Expect = 6e-33 Identities = 78/180 (43%), Positives = 110/180 (61%) Frame = -1 Query: 542 YARHLFLQIPSPDIFMYNTLIRGLSESNSPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAA 363 YA LF QIP PD FMYN +IRG S+S +P ++ Y EM R DS++F FVLKA Sbjct: 60 YAHQLFAQIPQPDTFMYNVMIRGSSQSPNPLRAISLYTEMHRHFVK-GDSYTFPFVLKAC 118 Query: 362 ANYKSLNSGIQLHCQSLIHGLDTHIFVRTTLVSMYAESGCIASARKAFDELPQPNVVSWN 183 +N+G +H L G ++ VR TL+ +A+ G + A FD+ + +VV+W+ Sbjct: 119 TRLFWVNTGSAVHGMVLRLGFGSNAVVRNTLLVFHAKCGDLNVATSLFDDSCKGDVVAWS 178 Query: 182 AIVTACFRCGDVKGAEILFNRMPFRNLTSWNLMLAGYMKAGELEPAKRMFAEMSVKDPVS 3 +++ R GD+K A LFN MP R+L SWN+M+ GY+K GE+E A+ +F E VKD VS Sbjct: 179 SLIAGYARRGDLKVARKLFNEMPERDLVSWNVMITGYVKQGEMESARMLFDEAPVKDVVS 238 Score = 57.4 bits (137), Expect = 4e-06 Identities = 44/178 (24%), Positives = 77/178 (43%), Gaps = 5/178 (2%) Frame = -1 Query: 539 
ARHLFLQIPSPDIFMYNTLIRGLSESNSPQNSLLTYIEMRRRSTPLPDSFSFAFVLKAAA 360 AR LF + P D+ +N +I G + +L + EM R PD + +L A A Sbjct: 224 ARMLFDEAPVKDVVSWNAMIAGYVVCGLSKQALELFNEMCRAGV-FPDEVTLLSLLSACA 282 Query: 359 NYKSLNSGIQLHCQSLIHGLDT-HIFVRTTLVSMYAESGCIASARKAFDELPQPNVVSWN 183 + L +G ++H + + + + L+ MYA+ G I + F + +V+SWN Sbjct: 283 DLGDLENGKKVHAKVMEISMGKLSTLLGNALIDMYAKCGNIKESLDVFWSITDKDVISWN 342 Query: 182 AIVTACFRCGDVKGAEILFNRMPFRNLTSWNLMLAGYM----KAGELEPAKRMFAEMS 21 +++ G K + LF M + + G + AGE++ + F MS Sbjct: 343 SVIVGMALHGHGKESLSLFKMMQRTKICPNEITFVGVLVACSHAGEIDEGYKYFDLMS 400