Hacker Newsnew | past | comments | ask | show | jobs | submit | herval's commentslogin

this is sorta like saying that being able to run your blog on your laptop will completely implode the cloud business

This is actually what happens.

I run my word processing software on my apple 2 (a total joke of a computer) instead of running it on the WANG.

I run my book keeping software on visicalc instead of the IBM.

I run my simulation software on my IBM PC (I even paid for the 8087!) instead of the VAX.

Moore's law has, at least so far, allowed the pioneers with toy computers to grow their toys big enough to solve "big boy" problems after some time has allowed the toy computers to be faster and the pioneers have scaled their crappy home-grown solution to solve their 60% of the problem that was originally solved by some enormous complex system.

Eventually the toy infrastructure gets expensive and solves 90-120% of the "big iron" problem space, but it also grows to cost as much as the big iron solution, but then a new generation of toy software and toy systems emerges to disrupt the "big iron" systems.

See also http://www.catb.org/jargon/html/W/wheel-of-reincarnation.htm...


Under appreciated requirement for this to work in post-cloud times: open source

If a vendor can SaaS a solution, then enterprise is generally happy (they don't want to have to hire folks for maintenance), and that completely locks out any ability to run locally.

Between enterprise's ambivalence and the obvious financial incentive to vendors, you get SaaS-only products.


You're right Moore's law has been holding up, but will hit a hard limit on process node size, so all scaling will be based on multiple cores. OTH, computing per watt spent has been plateauing. If the future bottlenecks are energy and cooling, that will require infrastructure-scale solutions. My bet is this is going to be real AI company moat.

https://www.riq.net.br/pub/computing-scaling/


It's a huge difference. If you had AI sufficiently good running locally on a phone, you could devise workflows for things like basic digital hygiene, technical assistance, and tedious tasks like inbox management, image sorting, device updates, and so on. Privacy and security gets a big boost past some local competence threshold, and we're nearly there.

Make the local AI competent enough to do good image generation and editing, realtime voice and music generation, handle agentic tasks with a framework like Hermes, and you can take your AI places to do tasks in contexts that are inaccessible to or inappropriate for cloud.

Frontier big platform models will be the best, but there's a level of "good enough" for local uses that we're already seeing flourish, and "good enough" for the average joe is almost here.


Phones and laptops are terrible devices for local AI, way too constrained by bad thermals and small batteries. MiniPC's (many of them using mobile hardware) don't have that particular issue, and can easily run on a 24/7 basis.

Phones are also a terrible place to run a radio, but there's a huge amount of benefit in figuring out how to do so.

That level of local AI is also more or less what you need for competent autonomous robots, too. If your household robots are orchestrated from your phone, the local security and cloud convenience converge on a single device. No extra servers, etc, reduced cost, all that - local AI is a massive market amplifier.

Let me speculate - we are going in the weird direction of no private property unless you're an overlord that rents his property to peasants. I like to call it the revenge of communism. See how the market behaves in the llm space - it's more viable to share infrastructure than to own it. Imagine the private car revolution in the US was a bus revolution.

We’ve been dreaming about this since the days of talking about wifi mesh networking, but it seems to never happen.

It's a little different because cloud and blogs didn't actively get in the way of your home compute. To wit, the various cost spikes for hardware.

People -- WANT -- this technology on their home devices and (apparently?) the providers of this tech don't seem to be running a profit so they probably don't want the maintenance tail on their side either.

I think it's a bit different. Inevitable that this becomes a household-run thing? Not likely.


The primary feature of a blog or any website is that it is available around the clock, that is the primary feature of cloud: around on the clock computer and network that scales on demand.

The primary feature of "AI" is to process information and reason with a natural language interface at speed, the primary feature of AI bigboys is to provide the machinery that runs the "models".

See the difference?


You severely underestimate how little the fraction of the performance and human labor of a frontier AI is in "the model".

Hosting a blog 24x7 on a laptop is trivial, except for hyperscaling to the front page of HN and Reddit.


Yeah, exactly, hosting on a laptop is trivial except for when it is not. However, I am using an AI on a mac mini just fine, Qwen 3.6 27B at Q6. Works just as good as STOA models for most things.

Running an LLM locally is theoretically viable. Running your blog on your laptop is never viable (unless you hook it up like a server). One just requires compute while the other a stable network.

tbh, my home network is pretty close to the stability of my host these days…

But my downtimes are a bit self-inflicted: changing ISPs which I can personally workaround but harder for a blog where one expects uptime.


More like implode proprietary blog hosting platforms and replace them with commodity VMs that can be used for blog hosting, among other things

Wouldn't arcade cabinets vs home video game consoles be a more apt comparison?

You have to consider that the enshittification factor is much higher now than in the cloud-for-free age.

Viruses are nature’s GPUs

Every time a big paradigm shift happens in technology, people try to find equivalences between brain/consciousness/humanness and the latest tech they can comprehend. We are “just” magic clay golems, mechanical contraptions not unlike a complex clock, convoluted electrical devices, inefficient analog computers, now just a form of “messy LLM” or “a stochastic parrot”

Same for the Vision Pro. Locking down a $3k media consumption device to a single user is such a greedy move, it's unbelieavable


this entire administration has been a constant stream of "important turning point for the US" moments


That’s true and it’s not over yet, wait till he reaches the thousand year reich bit.


Like hanging his face on banners at the DOJ while proclaiming he'd be entitled to a third term?


No, it comes after the mass rallies and deportations of opponents. This can get an awful lot worse.


I think most, perhaps all of those "important turning points" aren't really important turning points but just business as usual.


Then you know and understand nothing.


Is threatening an ally business as usual? Tell me about all the times that recent presidents threatened a NATO ally...


Things change. Allies just used to be threatened in private. Even today, the UK, Canada, and others are supporting the US and Israel in taking down the regime.

I’m not suggesting things haven’t or can’t change, but I am suggesting we haven’t seen any pivotal turning points, at least not yet.


We have, they were just a long time ago, and people are only just now noticing because Obama and Biden were relatively restrained and Trump I was simply incompetent.

But all the things that allow Trump to do that he's doing happened a long time ago


I get where you're coming from. Every US administration has been corrupt, flaunted the constitution, started illegal wars (at least in the last 100 years or so)

It does kind of drive me nuts that people don't remember Bush very well, and give Obama and Biden passes on their own crimes.

That said, i do honestly believe that Trump has taken the level of corruption and abject cruelty to a new level. But this was inevitable; both parties have spent 50 years building this reality. I won't be surprised when the next Democrat also deports millions and starts illegal wars.


I don't disagree with you, in general. My point here only was that I don't think the specific language used is correct. For me a turning point would be like, the Japanese declaring war on the United States and attacking Pearl Harbor, or Napoleon being defeated by the Duke, or the French Revolution, or something more along those lines. Bombing Iran (we've done stuff like that before), arresting Maduro - Noriega (sp?), federal vs state standoffs - yep done that before. Largely this is the routine mess of democracy, and it's heightened and more exposed because it's the United States of America and also because our republic has 340 million people from all over the world - there's going to be some differences of opinion.

Of course "this time" can be different for these things but I'm not sure I've seen anything I'd construe as a turning point or significant change or anything quite like that.


I think this ship has sailed pretty hard, by now. Pretty much any app you can possibly use, from iTerm to Slack, is sending data to third-party LLMs (sometimes explicitly, most times as small features here and there)


Control of where data goes is always an option. People just need to make that choice.


It’s just irrelevant for most users. These companies are getting more adoption than they can handle, no matter how clunky their desktop apps are. They’re optimizing for experimentation. Not performance.


While this may be true for casual users, for dev native products like Codex, the desktop experience actually matters a lot. When you are living in the tool for hours, latency, keyboard handling, file system access, and OS-level integration stop being “nice to have” and start affecting real productivity. web or Electron apps are fine for experimentation, but they hit a ceiling fast for serious workflows -- especially if the icp is mostly technical users


VSCode is arguably one of the most if not the most popular code editor these days…


And they're pretty much the only example of an embedded browser architecture actually performing tolerably and integrating well with the native environment.


Vscode’s performance is pretty bad. It’s “tolerable” just like any electron app.


Still good enough for the majority of the users.


Fair, I think I'm certainly in the minority. Especially now more then ever with an increasing amount of non-technical people exploring vibe coding, 'good enough' really is good enough for most users


[flagged]


Well unfortunately, that’s just how I write. None of my posts are LLM-generated, so I'm sorry they come across that way.


Apologies.


It's not irrelevant for developers neither for users. Tiktok has shown that users deeply care about the experience and they'll flock en-masse to something that has a good experience.


The experience in the claude app is fine.


More adoption? I don't think so... It feels to me that these models && tools are getting more verbose/consuming more tokens to compensate for a decrease in usage. I know my usage of these tools has fallen off a cliff as it become glaringly obvious they're useful in very limited scopes.

I think most people start off overusing these tools, then they find the few small things that genuinely improve their workflows which tend to be isolated and small tasks.

Moltbot et al, to me, seems like a psyop by these companies to get token consumption back to levels that justify the investments they need. The clock is ticking, they need more money.

I'd put my money on token prices doubling to tripling over the next 12-24 months.


> I'd put my money on token prices doubling to tripling over the next 12-24 months.

Chinese open weights models make this completely infeasible.


What do weights have to do with how much it costs to run inference? Inference is heavily subsidized, the economics of it don't make any sense.

Anthropic and OpenAI could open source their models and it wouldn't make it any cheaper to run those models.. You still need $500k in GPUs and a boatload of electricity to serve like 3 concurrent sessions at a decent tok/ps.

There are no open source models, Chinese or otherwise that are going to be able to be run profitably and give you productivity gains comparable to a foundation model. No matter what, running LLMs is expensive and the capex required per tok/ps is only increasing, and the models are only getting more compute intensive.

The hardware market literally has to crash for this to make any sense from a profitability standpoint, and I don't see that happening, therefor prices have to go up. You can't just lose billions year after year forever. None of this makes sense to me. This is simple math but everyone is literally delusional atm.


Open weights means that the current prices for inference of Chinese models are indicative of their cost to run because.

https://openrouter.ai/moonshotai/kimi-k2.5

It's a fantasy to believe that every single one of these 8 providers is serving at incredibly subsidized dumping prices 50% below cost and once that runs out suddenly you'll pay double for 1M of tokens for this model. It's incredibly competitive with Sonnet 4.5 for coding at 20% of the token price.

I encourage you to become more familiar with the market and stop overextrapolating purely based on rumored OpenAI numbers.


I'm not making any guesses, I happen to know for a fact what it costs. Please go try to sell inference and compete on price. You actually have no clue what you're talking about. I knew when I sent that response I was going to get "but Kimi!"


The numbers you stated sound off ($500k capex + electricity per 3 concurrent requests?). Especially now that the frontier has moved to ultra sparse MoE architectures. I’ve also read a couple of commodity inference providers claiming that their unit economics are profitable.


You're delusional, I didn't even include the labor the install and run the damn thing. More than 500k


Okay, so you are claiming "every single one of those 8 providers, along with all others who don't serve openrouter but are at similar price points, are subsidizing by more than 50%".

That's an incredibly bold claim that would need quite a bit of evidence, and just waving "$500k in gpus" isn't it. Especially when individuals are reporting more than enough tps at native int4 with <$80k setups, without any of the scaling benefits that commercial inference providers have.


Imagine thinking that $80k setups to run Kimi and serve a single user session is evidence that inference providers are running at cost, or even close to it. Or that this fact is some sort of proof that token pricing will come down. All you one-shotted llm dependents said the same thing about Deepseek.

I know you need to cope because your competency is 1:1 correlated to the quality and quantity of tokens you can afford, so have fun with your Think for me SaaS while you can afford it. You have no clue the amount of engineering that goes into provide inference at scale. I wasn't even including the cost of labor.


It directly disproves this wild claim

> You still need $500k in GPUs and a boatload of electricity to serve like 3 concurrent sessions at a decent tok/ps.

as being patent bullshit, after which the burden is squarely on you to back up the remainder of your claims.


You are literally telling me that an open source model costs $80k "at decent tok/ps (whatever that means)" to run a single session as proof something. How come people aren't dropping Anthropic for Kimi, it costs 10x less... You aren't a serious person worth engaging with.


It really is insane how far it's gone. All of the subsidization and free usage is deeply anticompetitive, and it is only a profitable decision if they can recoup all the losses. It's either a bubble and everything will crash, or within a few years once the supplier market settles, they will eventually start engaging in cartel-like behavior and ratchet up the price level to turn on the profits.


I suspect making the models more verbose is also a source of inflation. You’d expect an advanced model to nail down the problem succinctly, rather than spawning a swarm of agents that brute force something resembling an answer. Biggest scam ever.


80% of the world population back then is less than 50% of the current number of people working in farming, so the assertion isn’t wrong, even if fewer people are working on farming proportionally (as it should be, as more complex, desirable and higher paid options exist)


You might be underestimating complexity and pay. What is and isn’t desirable, and to whom, is also complicated.


We're at a point in the LLM curve where there's two huge, polarized groups of developers:

- the ones who don't see any value on AI for coding and dismiss it as a fad at every change they get

- the ones who are in love with the new tools and adopting as many as they can on their workflows

I know the arguments of the second bunch well. But very curious about what the "AI is a fad" bunch thinks will happen. Are we going to suddenly realize all these productivity gains people are claiming are all lies and go back to coding by typing characters on emacs and memorizing CS books? Will StackOverflow suddenly return as the most popular source of copy-paste code slop?


> Are we going to suddenly realize all these productivity gains people are claiming are all lies

I'll grant you that many have become adamant that LLMs suddenly, out of the blue, became useful just last week, which is much too soon to have any concrete data for, but coding agents in some shape have been around for quite a while and in the data we have there isn't offering of any suggestion of productivity gains yet.

And I'm not sure many are even claiming that they are more productive, just that the LLMs have allowed them to carry out a task faster. Here's the thing: At least my experience, coding was never the bottleneck. The bottleneck has always been the business people squabbling over what the customers and business need. They haven't yet figured out how to get past their egos.

The most promise for productivity seems to be from lone startup founders who aren't constrained by the squabbling found in a larger organization and can now get more done thanks to the task shortening. However, the economic conditions are not favourable to that environment right now. Consumers are feeling tapped out, marketing has become way harder, and, even when everything else is in place, nobody is going to consider your "SaaS" when they believe the foundational LLMs will be able to do the same thing tomorrow.


At first I was grumpy that the artisanal part of programming would go away. Now Im just happy to be giving my hands a break from RSI.


> Are we going to suddenly realize all these productivity gains people are claiming are all lies and go back to coding by typing characters on emacs and memorizing CS books?

If you have not learned CS, how do you expect to separate the LLM wheat from the chaff?

> Will StackOverflow suddenly return as the most popular source of copy-paste code slop?

Coding sites manually populated by humans are dead.


> If you have not learned CS, how do you expect to separate the LLM wheat from the chaff?

I didn't mention anything about learning CS. You can be a great engineer without having A* memorized line by line, no?


It doesnt really introduce anything that makes Vision Pro in any way better, though


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: