
You seem to be assuming that the full cost of the cluster is recouped by Grok 3. The real value will be in grok 5, 6, etc…

xAI also announced a few days ago they are starting an internal video game studio. How long before AI companies take over Hollywood and Disney? The value available to be captured is massive.

The cluster they’ve built is impressive compared to the competition, and grok 3 barely scratches what it’s capable of.



Yes. Why do we get these replies on HN that seem to consider only the most shallow, surface details? It could well be that xAI wins the AI race by betting on hardware first and foremost - new ideas are quickly copied by everyone, but a compute edge is hard to match.


The compute edge belongs to those like Google (TPU) and Amazon/Anthropic (Trainium) who are building their own accelerators and not paying NVIDIA's 1000% cost markups. Microsoft just announced it is experimenting with Cerebras wafer-scale chips for LLM inference, which also offer cost savings.

Microsoft is in the process of building optical links between existing datacenters to create meta-clusters, and I'd expect that others like Amazon and Meta may be doing the same.

Of course for Musk this is an irrational ego-driven pursuit, so he can throw as much money at it as he has available, but trying to sell AI when you're paying 10x the competition for FLOPs seems problematic, even if you are capable of building a competitive product.


Timing matters. A long term strategy for superior hardware might bear fruit too late.


I'm not sure about that - I expect AI is going to become a commodity market, so it doesn't matter how late you are if you've got a cheaper price.

In terms of who's got a lead on cheap (non-NVIDIA) hardware, I guess you have to give it to Google who are on their 6th generation TPU.


I wonder how Tesla's training computer Dojo is doing. Although I guess there's a reason for buying so much Nvidia hardware...


Curious where you saw the Microsoft/Cerebras experimentation noted online? That's very interesting.


It was mentioned in "Import AI", the newsletter from Anthropic's Jack Clark.

https://jack-clark.net/2025/02/17/import-ai-400-distillation...


DeepSeek just showed the compute edge is not that hard to match. They could have chosen to keep the gains proprietary but probably made good money playing the market instead, quants as they are.

https://centreforaileadership.org/resources/deepseeks_narrat...

If you’re using your compute capacity at 1.25% efficiency, you are not going to win because your iteration time is just going to be too long to stay competitive.


Software and algorithmic improvements diffuse faster than hardware, even with attempts to keep them secret. Maybe a company doubles the efficiency, but in 3 months, it's leaked and everyone is using it. And then the compute edge becomes that much more durable.


Optimisation efforts don’t negate investment in capacity but multiply output.


Sorry, you missed the point - DeepSeek tried some new software ideas, but they did not manage to secure the same computation capacity.


They achieved the same results for 1.25% of the computation cost... If they actually had that computation capacity, it would be game over for the AGI race by the same logic.
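To put that 1.25% figure in perspective, here's the implied multiplier as a quick back-of-envelope (the 1.25% is the number cited in this thread, not an independently verified figure):

```python
# If a rival gets comparable results for 1.25% of the compute cost,
# the implied efficiency multiplier is the reciprocal of that fraction.
efficiency_fraction = 0.0125  # 1.25%, the figure cited in-thread

multiplier = 1 / efficiency_fraction
print(multiplier)  # ~80x more effective compute per dollar
```

So by this logic, whoever holds the software edge is getting roughly 80 units of effective compute for every 1 the incumbent buys.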


> but a compute edge is hard to match.

xAI bought hardware off the open market. Their compute edge could disappear in a month if Google or Amazon wanted to raise their compute by a whole xAI.


Not if there’s a hardware shortage.


Ok, 2 months.

Remember, the new B200s have 2.2x the performance of xAI's current H100 "hardware edge", so matching it only takes an order half the size.

Or you could order the old H100 instead and avoid the B200 shortage.
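Rough arithmetic on that claim, taking the ~2.2x per-GPU figure at face value and assuming a hypothetical 100,000-GPU H100 cluster (an illustrative number, not a confirmed count):

```python
# How many B200s would it take to match a given H100 fleet,
# assuming the ~2.2x per-GPU speedup claimed above?
H100_COUNT = 100_000   # hypothetical cluster size, for illustration
B200_SPEEDUP = 2.2     # claimed per-GPU performance ratio vs H100

b200_equivalent = H100_COUNT / B200_SPEEDUP
print(round(b200_equivalent))  # ~45,455 GPUs, i.e. under half the order size
```

Under those assumptions, a competitor orders fewer than half as many GPUs to reach parity, which is the "2 months" point in a nutshell.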


[flagged]


There seems to be a coordinated effort to control the narrative. Grok3's release is pretty important, no matter what you think of it, and initially this story quickly fell off the front page, likely from malicious mass flagging.

One thing that's taken over Reddit and unfortunately has spread to the rest of the internet is people thinking of themselves as online activists, who are saving the world by controlling what people can talk about and steering the conversation in the direction they want it to go. It's becoming harder and harder to have a normal conversation without someone trying to derail it with their own personal crusade.


>Grok3's release is pretty important

How? After an enormous investment, the latest version of some software is a bit better than the previous versions of some software from its competitors, and will likely be worse than the future versions from its competitors. There's nothing novel about this.


They just started, the velocity of xAI is novel.

NVIDIA's CEO Jensen Huang: “Building a massive [supercomputer] factory in the short time that was done, that is superhuman. There's only one person in the world who could do that. What Elon and the xAI team did is singular. Never been done before.”


>only one person in the world who could do that. What Elon and the xAI team

That is literally more than one person.


One billionaire glazing another because it might enrich himself further hardly seems noteworthy. That quote is superfluous at best.


The largest supercluster in the world created in a short time frame is pretty important. A build that typically takes 4 years was cut down to 19 days. That's an incredible achievement and I, along with many others, think it's important.

https://nvidianews.nvidia.com/news/spectrum-x-ethernet-netwo...

https://www.tomshardware.com/pc-components/gpus/elon-musk-to...


Okay but that's obviously a nonsense claim. Find me a computer on the https://en.wikipedia.org/wiki/TOP500 that was built 4 years after the chips it uses debuted.

H100s aren't even 3 years old.


> There seems to be a coordinated effort to control the narrative.

Do you have any evidence for this? Who would want to coordinate such an effort, and how would they manipulate HN users to comment/vote in a certain way? I think it is far more plausible that some people on here have similar views.

> [people] controlling what people can talk about

That's called 'moderation' and protects communities against trolls and timewasters, no?

> and steering the conversation in the direction they want it to go

That's exactly what conversation is about, I'd say. Of course I want to talk about stuff that I am interested in, and convince others of my arguments. How is this unfortunate?


>Grok3's release is pretty important

Is it? It's Yet Another LLM, barely pipping competitors in cherry-picked comparisons. DeepSeek R1 was news entirely because of the minuscule resources it was trained on (with an innovative new approach), and this "pretty important" Grok release beats it in Chatbot Arena by a whole 3%.

We're at the point where this stuff isn't that big of news unless something really jumps ahead. Like all of the new Gemini models and approaches got zero attention on here. Which is fair because it's basically "Company with big money puts out slightly better model".

I'd say Grok 3 is getting exactly the normal attention, but there is a "Leave Britney Alone" contingent who need to run to the defence.


Noticed this also. It doesn’t feel organic.


I mean, the honest truth is something closer to:

We have no clue how all this is going to play out, what value is capturable, and what parts of a lead are likely to stay protected. This race is essentially the collective belief in a generationally big prize and no idea how it unlocks.

The problem with that for a comment section is it reduces ALL comments to gossip and guessing, which makes people feel stupid.


i think it's astroturfing


Reddit today feels like it's absolutely overrun by bots. So much of the comment content is so superficial and cookie-cutter I find it hard to believe it's all produced by human beings. A lot of it reads like the output of small cheap LLMs of the sort that would be used for spam bots.

Of course we know X, Facebook, and probably most other social media is also overrun by bots. I don't think you can assume that humans are on the other end anymore.


The point is that it is inefficient. Others achieved similar results much cheaper, meaning they can go much further. Compute is important, but model architecture and compute methods still outweigh it.


How quickly will Grok 4/5/6 be released? Of course you can choose to keep running older GPUs for years, but if you want bleeding edge performance then you need to upgrade, so I'm not sure how many model generations the cost can really be spread over.

Also, what isn't clear is how RL-based reasoning model training compute requirements compare to earlier models. OpenAI has announced that GPT-4.5 will be their last non-reasoning model, so it seems we're definitely at a transition point now.


At current efficiency? Not nearly as fast as DeepSeek 4 ;)


None of which explains this massive waste of money for zero gain.


It's not going to be from this unless it's forced upon us by the federal government. All the other companies are ahead and aren't just going to stop.


> xAI also announced a few days ago they are starting an internal video game studio.

Ha ha. I'm sure their play-to-claim airdrop idle game will be groundbreaking.



