Hacker Newsnew | past | comments | ask | show | jobs | submit | vlovich123's commentslogin

> (which is good for Rust adherents, I figure).

As a Rust adherent, please do not put words in our mouths or set up unrealistic expectations for other people by linking together concepts at a very shallow level.

Language level memory safety has no answer for hardware security flaws which is what side channel attacks are. No programming language can provide memory privacy if another chip in your machine can read your memory. Just like no programming language can protect your application from a kernel vulnerability of the kernel it’s running on.


Damn. That wasn’t my intention at all, I was just pointing out that Rust has another reason to see wider adoption vis a vis the usual Valley advertising bullshit of deliberately conflating hardware security with software security. I personally give no fucks what something is written in, only that it’s written well enough that I don’t have to twist arms or babysit yet another sloppy piece of code in my enterprise.

But... it's rust.

The paper explicitly covers it that various memory COW/snapshot mechanisms are probably faster and safer than the zygote pattern. As it stands getting the zygote pattern correct and safe is something you have to plan for upfront. You can’t retrofit it which is why the paper mentions it has poor composability. Also the advantages of the zygote pattern can be overstated since the memory sharing benefit is minimal since it has to happen so early and modern OSes already transparently CoW duplicate pages in the background.

In what sense can you not retrofit the zygote pattern?

I recommend at least skimming the paper as it covers this. But essentially you can’t just inject a call at a random point in code to start being a zygote. It’s something you have to plan up front as to the exact point you’re going to fork and that you’re going to do it at the start of program before any threads have started or any files are open and before any locks have been acquired. It’s basically all the challenges of invoking fork at arbitrary points in time.

The reason to do a zygote in the first place could be solved with alternative special APIs that are safer and harder to misuse. But we have fork so there’s not as big of a demand despite the warts.


If the author is this concerned about security, I’m curious why rsync doesn’t just build with fil-c by default and skip the noise. Those who need the extra perf to do more than 1 gigabit/s can build it in “unsafe” mode.

Because Fil-C is not a serious project

If you make claims like that, you need to expand on them or at least provide some references.

It’s Fil’s side project that he uses to spend his extra creative energy and troll people on Twitter

Which version of rust are these in?


Something tells me a war with china isn’t going to be carriers duking it out but carriers filled to the brim with aviation and naval drones that seek and destroy enemy craft. As Iran has shown, you don’t need to attack the USA directly to destabilize its influence. The US market economical influence has been far more important for force projection and stabilizing trade than anything else and by all accounts Trump has pissed away allies on that front too. US force projection for trade stabilization is for minor things like protecting against pirates - you don’t need million dollar missiles for that.

Took me a while to find what you were referring to by gram. Arxiv paper from 9 days ago that's not properly indexed by search engines.

(G)enerative (R)ecursive re(A)soning (M)odels. They really wanted the acronym.

https://arxiv.org/html/2605.19376v1


I prefer GRRM but then that would imply a habit of not actually getting a final result

And then every time I ask it to hurry along it kills a Stark.

Version 8 had serious flaws and wasn't recieved well by users.

I am sorry, but there was no version 7 and 8.

Version 7 and 8 are well known viruses distributed by D&D software inc.


I'd really argue the bugs were introduced in version 5 but people were so excited by the promise of new features they sold well anyway.

Can we just say it was basically vibe coded, but with real humans in the loop?

Version 5 was when the source material dried up, and the hallucinations became more frequent and obvious.

As far as I remember there was a basic outline of major plot points and where all the major characters ended up (a prompt) and were left to fill in all the blanks.


> Can we just say it was basically vibe coded

I mean he references a "murder of ravens" several times. It's an unkindness of ravens and a murder of crows. Classic LLM mistake right up there with the emdash.


Thank you for the gold kind stranger.

Claude Opus 4.8 suggests "ReGRAM", which is less bad than GRAM.

Ouch.

As a fellow reader-in-waiting, I applaud that. GMTA :)


writing… (17 years)

That acronym is unacceptable. It's going to impede discussion and cause confusion for a long time if it doesn't die off immediately.

You think that's bad? I introduce you to LION, (evoLved sIgn mOmeNtum) [1]

[1] https://arxiv.org/pdf/2302.06675


Now I just hear the Voltron intro riff in my head

Those flying diecast lions hurt when they hit you as a kid

Not as much as when the leg broke off and you couldn't fix it, so you glue it in place and stop playing with it rather than ever tell your parents you broke it.

Between transformers, voltron, and borderline evil siblings it’s kinda of a miracle I made it from birth to now. But, hey, here we are and I love my brother… pretty sure he still stands me too.

not bad although archived. have any info why?

We're still talking about "zero-shot prompt" when the saying "X-shotted" ["One-shotted the difficult maze"] was already a well-established thing in daily vernacular. So now you constantly have to readjust your brain because whenever you read "zero-shot prompt" your mind goes "uh.. a zero-try attempt is a paradox and cannot exist".

Zero-shot, one-shot, few-shot etc. refers to how many examples you have to give.

It comes about from machine learning algorithms that could pick up on patterns from a small number of examples. Few shot means only a handful of examples to recognize something. One shot means only a single example. And zero shot means no examples. Of course, you have to indicate what you want somehow, but in the case of an LLM that's the prompt. Once LLMs were trained for instruction following, you didn't have to give any examples, you could just give a prompt describing what you want, and that was a zero-shot.


You're explaining something to me I already know. Hence the "readjust my brain".

I'm complaining about the LLM field co-opting a term that was already used in daily vernacular. Imagine if people in the LLM field made it so that saying the LLM made a "final answer" means that it got stuck in a loop. Now, whenever someone says an LLM gave a "final answer" we have to divine if they meant it is in a loop or gave the right answer after working through a few intermittent ones by itself.

Choosing to call it "X-shot" was a dumb move. And now we're stuck with it. No two ways about it.


> a zero-try attempt is a paradox and cannot exist

Have you tried applying L'Hôpital's Rule?


Zero shotting: there wasn't even an attempt.

Minus one shotting: you have to make one attempt for there to have been no attempt, and two attempts for there to have been one attempt.


You miss 100% of the shots you don't take

- Wayne Gretzky

  - altmanaltman

One shot: Taking a shot, just once.

Zero shot: Knowing you had a shot but choosing not to.

Minus one shot: Not even realizing there was a shot.


confusing indeed. I wondered "which RAM? nvram? dram? vram? dram? now what's g-ram?"

GPU RAM, clearly. At least that's where my mind went.

Pretty sure it's "GNU Is Not Unix Rapid Access Memory", actually

GPURAM is Probably Unix Rapid Access Memory

We already have VRAM for that purpose, thankfully.

  "Analysis" was right there

It's great if they also introduce KILOGRAM

Yeah, look what happened to GNU

Is this the right place to do everyone's favorite copypasta?:D

Sorry but I missed the joke, could you include me in the group? Honest question

I live to serve. For everyone's enjoyment:

https://stallman-copypasta.github.io/

GNU/Linux Copypasta

I'd just like to interject for a moment. What you're refering to as Linux, is in fact, GNU/Linux, or as I've recently taken to calling it, GNU plus Linux. Linux is not an operating system unto itself, but rather another free component of a fully functioning GNU system made useful by the GNU corelibs, shell utilities and vital system components comprising a full OS as defined by POSIX.

Many computer users run a modified version of the GNU system every day, without realizing it. Through a peculiar turn of events, the version of GNU which is widely used today is often called Linux, and many of its users are not aware that it is basically the GNU system, developed by the GNU Project.

There really is a Linux, and these people are using it, but it is just a part of the system they use. Linux is the kernel: the program in the system that allocates the machine's resources to the other programs that you run. The kernel is an essential part of an operating system, but useless by itself; it can only function in the context of a complete operating system. Linux is normally used in combination with the GNU operating system: the whole system is basically GNU with Linux added, or GNU/Linux. All the so-called Linux distributions are really distributions of GNU/Linux!


Exactly the kind of pedantry I enjoy

It's just an acronym. It's not gonna impede anything. Think of it as just a name - you either know what it refers to or you don't, you don't understand something from it's name, or it's acronym.

It's an acronym that matches an extremely common word, making it not easily searchable.

Like countless others. You just add a second term for context.

Random plug for Kagi, which got it for 'GRAM model llm' on the first try ;)

And to think, we could have had George RR Martins instead.

Speaking of things that never finish.

Just spell it GRRM but pronounce it “gram” if you have to reference it in spoken conversation.

Which will be pretty rare.


Grrm with a rolling r sounds better.

Pronounced like “groom” makes for a nice analogy with slimming down the model size too.

Or grim

Let's not forget about Yann LeCun's current area of research that's completely different from LLMs: Joint Embedding Predictive Architecture (JEPA)

If he gets that style to be more efficient (they're already competitive) it'll completely kill off LLMs

https://openreview.net/pdf?id=BZ5a1r-kVsf


I propose GRIM: Generative Recursive Indeterministic Impression Machine.

It is the 3rd list on Kagi when searching "gram models"

G return G

Of those 21% how many are time-dependent questions that are past the model’s training and requires research to verify? Like the “did Ukraine attack Russian in the past week” question?

Small alternative potential future changes that alter this analysis:

* At some point model capability reaches diminishing returns. Then inference >> training in the future but training >> inference now. It’s not a prisoner’s dilemma but a land grab to solidify market position and be one of the 2-3 firms left standing as dominant in the space. The model companies aren’t super sticky yet but they’re working on it.

* even if training remains >> inference, it’s possible to have multiple price points like they do today. If you need the most capable model you’ll be paying exponentially more per token to supplement the training cost even though the serving cost is marginal because most people will be satisfied with cheaper / less capable models for most tasks.

I buy that inference is a dropping line item while training is a growing one. There’s all sorts of things on the horizon that’ll be order of magnitudes improvements, from startups burning models into ASICs to get order of magnitudes more performance to alternate architectures like diffusion transformers that have orders of magnitude structural optimizations. It’s inevitable that it’ll come down even further from where we are. It’s possible model training also will go down but I’ve not seen any compelling research suggesting major “easy” reductions here.


The issue is that most tasks do not require frontier-level intelligence, but companies like OAI can really only profit off of the frontier. Capabilities from a year or two ago are so outdated that even OpenAI gives it away for free and there are many other models biting at their heels. In other words they are spending huge amounts of money to cash in on a depreciating asset.

So one possible future is that frontier-level training becomes so expensive and the use cases so sparse that it simply isn’t viable to keep going bigger.


Once the land grab is over, the market will consolidate and the winners will absorb the losers. Then the few winners will be the only ones with real capital to train frontier models and will have true pricing power. Similar to how social media companies or the gig-economy benefits from network effects, AI companies will benefit from having the lion's share of paying customers (that also constantly feed in more data to train the models on).

Unfortunately the 0.x version has pervaded because of community cargo culting claiming that versioning is easier with 0.x than with major version numbers > 0. Personally I find that hard to believe, especially given packages like Tokio and anyhow (still at v1) make it work and there’s others that are >v1.

That is to say 0.x doesn’t necessarily mean unmaintained, it can also mean “I don’t want to have to think about how to version APIs / make guarantees about APIs). Eg reqwest is very widely used and actively maintained yet is still at v0.13.


> claiming that versioning is easier with 0.x than with major version numbers > 0

I think it's less that versioning is claimed to be easier with 0.x versions, and more that some people have got into their heads that 1.0 signals either "permanently stable" or "no new versions for several years" and they don't want to commit to that yet.

I do wish more crates would 1.0 (and then 2.0, etc).


I don’t know what linters you use, but the ones I like are the ones that show you problems in the workspace stably, not just in the files that happen to be open and altering as files open and close.

You can always improve, but pretending like there’s an easy solution is lazy - if it was easy it would have been done.


This was bad wording on my part. I wrote "open" but that should have been "files in the workspace/project". Really, "open" WRT files is so overloaded already, they can be in the workspace, have an editor tab open for them, or have an active file handle, to name just three.

> You can always improve, but pretending like there’s an easy solution is lazy - if it was easy it would have been done.

I claimed that it is possible, not that it is easy.


Im highlighting that defining that sandbox policy cohesively in a way that works for all the different extensions types you’d want to support and doesn’t overwhelm the user with permission fatigue is difficult as to be impossible.

Browsers have a different problem - they protect different websites against each other. The IDE should probably protect you against extensions being able to access arbitrary files on disk, but even that’s difficult (eg a bundled linter often wants to read user defaults in a central location. But protecting even further than that is even harder, especially as here where the access was to the actual repo not anything else.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: