Ollama: Running Large Language Models Locally on Your MacBook Pro - A Game Changer

LLMs on your MacBook? Ollama makes it happen! This post explores running Large Language Models locally, with a focus on ease of use and M1 Pro performance. Discover the power of local AI.

The world of AI is rapidly evolving, and Large Language Models (LLMs) are at the forefront of this revolution. While cloud-based LLM APIs have been the primary way to access these powerful tools, a new wave of innovation is bringing LLMs to your local machine. Enter Ollama.

In this blog post, I'll share my first impressions of Ollama, a tool that makes it incredibly easy to run various LLMs directly on a MacBook Pro, focusing on my experience with the M1 Pro chip.

What is Ollama?

Ollama simplifies the process of setting up and running LLMs locally. It handles the complexities of dependencies, configurations, and optimizations, allowing you to focus on utilizing these models. This is a significant shift, as it opens up possibilities for:

  • Privacy: Process data locally without sending it to external servers.
  • Offline Access: Utilize LLMs even without an internet connection.
  • Performance: For certain tasks, local processing can be faster and more responsive.
  • Development: Easier experimentation and integration of LLMs into local applications (see the API sketch just after this list).

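To make that last point concrete, here is a minimal sketch of what local integration can look like. It assumes the Ollama server is running on its default port (11434) and that the llama2 model has already been downloaded; the prompt is just an example.

# Ask the local Ollama server for a one-off completion
# (assumes Ollama is running and llama2 has already been pulled)
curl http://localhost:11434/api/generate -d '{
  "model": "llama2",
  "prompt": "Explain unified memory on Apple Silicon in one sentence.",
  "stream": false
}'

The response comes back as JSON, so any local script or app can consume it without touching an external service.
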
Ollama on an M1 Pro MacBook Pro: A Sweet Spot

My MacBook Pro with the M1 Pro chip has proven to be a surprisingly capable platform for running LLMs locally. The Apple Silicon architecture, with its unified memory and powerful GPU, provides a significant performance boost for these kinds of workloads. Ollama leverages these capabilities effectively, making the experience smooth and responsive.

Ease of Use: Ollama Gets It Right

One of the most impressive aspects of Ollama is its ease of use. Here's a quick rundown of how simple it is to get started:

  1. Installation: Ollama provides a straightforward installation process for macOS (https://ollama.com/download/mac).
  2. Running a Model: To run a model, you simply use the ollama run command followed by the model name:

ollama run llama2

This command downloads the Llama 2 model if it isn't already on your machine and then starts it.

  3. Chatting with the Model: Once the model is running, you can interact with it directly from your terminal.
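
Beyond ollama run, a few companion commands have been handy for keeping local models organized. This is a quick sketch; the model names are just examples from my own experiments:

# Download a model without starting an interactive session
ollama pull mistral

# List the models installed locally
ollama list

# Remove a model you no longer need to free up disk space
ollama rm llama2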

Exploring Different Models

Ollama supports a growing number of LLMs. Here are a few I've experimented with on my M1 Pro MacBook Pro:

  • Llama 3.2: Meta's powerful, openly available LLM, which performs remarkably well locally.
  • Mistral: Known for its efficiency and strong performance.
  • Gemma 3 (gemma3:1b): Google's lightweight open model; the 1B-parameter variant is small and fast, making it a good fit for quick local experiments.

The performance and capabilities of these models can vary, and it's exciting to explore the unique strengths of each.
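
If you'd like to try the same lineup, these are the model tags I used with ollama run; they assume the models are published under these names in the Ollama library:

# The three models mentioned above, by their Ollama tags
ollama run llama3.2
ollama run mistral
ollama run gemma3:1b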

A Glimpse into Local LLM Power

Running these models locally has been eye-opening. The speed and responsiveness for tasks like text generation, code completion, and creative writing are impressive. It feels like having a powerful AI assistant readily available on your machine.

The Journey Continues

This is just the beginning of my exploration into local LLMs with Ollama. In future blog posts, I plan to dive deeper into:

  • Integrating Ollama with Visual Studio Code (it's a game changer).
  • Advanced Ollama usage and configuration.
  • Optimizing performance on Apple Silicon.
  • Specific use cases for local LLMs in development workflows.
  • Comparing different LLMs and their performance characteristics.

Stay tuned for more as we delve further into the exciting world of local LLMs!
