Overnight at Computex in Taipei, during his GTC keynote, NVIDIA CEO Jensen Huang announced a partnership with Microsoft. They're launching a new line of Windows laptops built around a chip that runs powerful AI right on the machine, no connection to a far-off datacenter required (NVIDIA, Axios).
If you're not deep in the AI world, that probably sounds like every other hardware announcement. But the phrase doing the heavy lifting is "local model," and it's worth understanding, because it changes where your data goes every time you use AI.
How AI works for most people today
When you type a question into a popular AI tool today, your words leave your device and travel to a company's servers. The AI does its thinking there, on hardware you'll never see, and the answer travels back to you.
Two things follow from that. You're renting the AI, so the moment you stop paying, it's gone. And you're handing over your data to use it. Your draft email, your contract, your company's numbers, all of it makes a round trip to someone else's computer. For casual use, nobody minds. For anything sensitive, that round trip is exactly the part people are nervous about.
What a local model actually means
A local model flips the setup. Instead of living in a datacenter, the AI lives on your own computer. Your documents, your emails, and your questions never leave the laptop, because the thinking happens right there on the machine in front of you.
The practical difference is simple. Your data never leaves the building. The same draft or question that used to make a round trip to a stranger's servers now stays on the device sitting on your desk. That's what NVIDIA and Microsoft just made possible for a normal laptop, with enough horsepower to run a serious model on-device.
The cost angle nobody mentions
There's a money story here too. Companies today often pay for AI by the seat, and for heavy users running AI all day, subscriptions to providers like OpenAI or Anthropic can climb into the thousands per employee per month.
With a local model, the AI comes bundled with the PC you already bought. You pay for the laptop once instead of paying a per-person fee forever. For a business with hundreds of employees, that's a very different math problem.
Why this is aimed squarely at Apple
NVIDIA isn't entering an empty room. Apple's Macs have quietly become the favorite machine for people who want to run AI fast, privately, and securely on their own hardware, thanks to strong chips and a lot of fast memory.
NVIDIA, with Microsoft and Windows behind it, is going straight at that audience (CNBC). The new laptops ship this fall from familiar names like Dell, HP, Lenovo, ASUS, and Microsoft's own Surface line. The pitch is the same one Apple has been making. Your AI, on your machine, under your control.
The other freedom is not being locked into one model
Privacy is the headline, but there's a second freedom hiding in this announcement, and it might matter more over time.
For the last few years, using AI has looked like the early days of computing, when an office shared a single machine that everyone lined up to use. You picked one company, learned its quirks, fed it your data, and built your habits around it. If a competitor shipped something better next month, switching meant starting over.
Putting a capable model on your own laptop is the same shift the personal computer made. The power moves from a shared system you borrow to one that's yours. And because these machines run open models, you get to be model-agnostic. You pick the best AI for the task in front of you, and swap it the day a better one shows up.
Running it safely is still an orchestration problem
Here's the part the announcement glosses over. Running a model on your laptop solves where the AI lives. It doesn't, on its own, solve what the AI is allowed to do.
A real task isn't just answering a question. It's checking your calendar, pulling a file, sending an email, updating a record. The moment AI starts doing actual work, it needs access to your accounts, which means your passwords and keys are in play. Keeping the model on your laptop means nothing if it still has to be handed your passwords, or if there's no record of what it touched.
That's the part we built General Input around. The AI does the work across your tools, but it never sees your passwords. They're encrypted at rest, injected only when a step needs them, and stripped before anything reaches the model. Every action it takes on your behalf is logged. Pair that with model-agnostic routing and you can keep a sensitive step on the open-weight model running locally, send the rest to whatever cloud model fits, and still guarantee no model in the chain ever held a credential or moved data without a record.
The takeaway
A local model is a real step forward for privacy, and these laptops will put it in a lot more hands. But the deeper lesson is about control. Don't bet your work on a single company's servers, and don't marry a single company's model. Own where your AI runs, and which one you run.