Running local models on Macs gets faster with Ollama's MLX support

Apple Silicon Macs get a performance boost thanks to better unified memory usage.

calendar_today March 31, 2026 schedule 23:00 visibility 61 views

Running local models on Macs gets faster with Ollama's MLX support

Source: Ars Technica

Ollama, a runtime system for operating large language models on a local computer, has introduced support for Apple's open source MLX framework for machine learning. Additionally, Ollama says it has improved caching performance and now supports Nvidia's NVFP4 format for model compression, making for much more efficient memory usage in certain models.

Combined, these developments promise significantly improved performance on Macs with Apple Silicon chips (M1 or later)—and the timing couldn't be better, as local models are starting to gain steam in ways they haven't before outside researcher and hobbyist communities.

The recent runaway success of OpenClaw—which raced its way to over 300,000 stars on GitHub, made headlines with experiments like Moltbook and became an obsession in China in particular—has many people experimenting with running models on their machines.

Read full article

Comments

newspaper

Originally published at

Ars Technica

open_in_new Read Full Article

Technology

Here comes new Siri again

Apple has been on its back foot, AI-wise, for the past few years. But in a strange way, playing from behind might not be such a bad move. At WWDC on Monday, Apple appears to be getting ready to reintroduce us to the new Siri. Again. As a reminder...

The Verge 1 hours ago

Technology

The next YouTube phenomenon hitting the big screen

Hi, friends! Welcome to Installer No. 131, your guide to the best and Verge-iest stuff in the world. (If you're new here, welcome, happy last week of productivity before the World Cup starts, and also you can read all the old editions at the...

The Verge 1 hours ago

Technology

More than a decade later, the team behind N++ is back with a multiplayer sequel

Back in 2015, the two-person studio Metanet released N++, a brutally hard 2D platformer that was a decade in the making, building off of previous releases dating back to the freeware Flash title N. At the time, cofounder Raigan Burns issued some...

The Verge 13 hours ago

Technology

Highly reviewed speaker can be hacked over the air to infect connected devices

Seller of the Sound Blaster Katana V2X doesn't consider the behavior a vulnerability.

Ars Technica 16 hours ago

Technology

Startup Battlefield 200 applications officially close in 3 days

Applications for Startup Battlefield 200 officially close on June 8, 11:59 p.m. PT. Now’s not the time to wait any longer. Secure your shot at competing on the Disrupt Stage at TechCrunch Disrupt 2026 this October at San Francisco's Moscone West.

TechCrunch 17 hours ago

Technology

Small modular nuclear reactor reaches criticality in first test

The reactor, from a startup called Antares, isn't ready to generate power yet.

Ars Technica 17 hours ago

Technology

The most interesting startups right now want to get you off your phone

While the AI fundraising machine keeps breaking its own records, some founders are building in the other direction.  Mirror founder Brynn Putnam just raised money for Board, a startup focused on bringing people together through...

TechCrunch 19 hours ago

Running local models on Macs gets faster with Ollama's MLX support

Related Articles

Here comes new Siri again

The next YouTube phenomenon hitting the big screen

More than a decade later, the team behind N++ is back with a multiplayer sequel

Read More

Highly reviewed speaker can be hacked over the air to infect connected devices

Startup Battlefield 200 applications officially close in 3 days

Small modular nuclear reactor reaches criticality in first test

The most interesting startups right now want to get you off your phone