The part of this that doesn’t jibe with me is the fact that they also released t...

gitfan86 · on Jan 27, 2025

What has changed is the perception that people like OpenAI/MSFT would have an edge on the competition because of their huge datacenters full of NVDA hardware. That is no longer true. People now believe that you can build very capable AI applications for far less money. So the perception is that the big guys no longer have an edge.

Tesla had already proven that to be wrong. Tesla's Hardware 3 is a 6 year old design, and it does amazingly well on less than 300 watts. And that was mostly trained on a 8k cluster.

nejsjsjsbsb · on Jan 27, 2025

The perception only makes sense if it is "that's it, pack up your stall" for AI.

I think what really happened is day to day trading noise. Nothing fundamentally changed, but traders believed other people believed it would.

plaidfuji · on Jan 27, 2025

I mean, I think they still do have an edge - ChatGPT is a great app and has strong consumer recognition already, very hard to displace.. and MSFT has a major installed base of enterprise customers who cannot readily switch cloud / productivity suite providers. So I guess they still have an edge it’s just nore of a traditional edge.

physicsguy · on Jan 27, 2025

Microsoft don't have to use OpenAI though, they could swap that out underneath for the business applications.

kavalg · on Jan 28, 2025

and it is even questionable whether "bundling" AI in every product is legal wrt anti-competitive laws (i.e. the IE case)

gitfan86 · on Jan 27, 2025

Yes, it is still a valid business model and I would expect MSFT to continue to make profits.

polygamous_bat · on Jan 27, 2025

> The part of this that doesn’t jibe with me is the fact that they also released this incredibly detailed technical report on their architecture and training strategy. The paper is well-written and has a lot of specifics. Exactly the opposite of what you would do if you had truly made an advancement of world-altering magnitude.

I disagree completely on this sentiment. This was in fact the trend for a century or more (see inventions ranging from the polio vaccine to "Attention is all you need" by Vaswani et. al.) before "Open"AI became the biggest player on the market due and Sam Altman tried to bag all the gains for himself. Hopefully, we can reverse course on this trend and go back to when world-changing innovations are shared openly so they can actually change the world.

evanjrowley · on Jan 27, 2025

Exactly. There's a strong case for being open about the advancements in AI. Secretive companies like Microsoft, OpenAI, and others are undercut by DeepSeek and any other company on the globe who wants to build on what they've published. Politically there are more reasons why China should not become the global center of AI and less reasons why the US should remain the center of it. Therefore, an approach that enables AI institutions worldwide makes more sense for China at this stage. The EU for example has even less reason now to form a dependency on OpenAI and Nvidia, which works to the advantage of China and Chinese AI companies.

kibibu · on Jan 28, 2025

Even the "Language Models are Unsupervised Multitask Learners" paper was pretty open; I'd say even more open than the R1 paper.

plaidfuji · on Jan 27, 2025

I’m not arguing for/against the altruistic ideal of sharing technological advancements with society, I’m just saying that having a great model architecture is really not a defensible value proposition for a business. Maybe more accurate to say publishing everything in detail indicates that it’s likely not a defensible advancement, not that it isn’t significant.

nicce · on Jan 28, 2025

Here is a great interview. They don’t seem to care that much about money. They are already profitable.

https://www.chinatalk.media/p/deepseek-ceo-interview-with-ch...

> Money has never been the problem for us; bans on shipments of advanced chips are the problem.

nejsjsjsbsb · on Jan 27, 2025

I always thought AMZN is the winner since I looked into Bedrock. When I saw Claude on there it added a fuck yeah, and now the best models being open just takes it to another level.

AMZN: no horse picked, we host anything

MSFT: Open AI

GOOGLE: Google AI

AMZN is in the strongest position.

amluto · on Jan 28, 2025

AWS’s usual most doesn’t really apply here. AWS is Hotel California — if your business and data is in AWS, the cost of moving any data-intensive portion out of AWS is absurd due to egress fees. But LLM inference is not data-transfer intensive at all — a relatively small number of bytes/tokens go to the model, it does a lot of compute, and a relatively small number of tokens come back. So a business that’s stuck in AWS can cost-effectively outsource their LLM inference to a competitor without any substantial egress fees.

RAG is kind of an exception, but RAG still splits the database part from the inference part, and the inference part is what needs lots of inference-time compute. AWS may still have a strong moat for the compute needed to build an embedding database in the first place.

Simple, cheap, low-compute inference on large amounts of data is another exception, but this use will strongly favor the “cheap” part, which means there may not be as much money in it for AWS. No one is about to do o3-style inference on each of 1M old business records.

mindwok · on Jan 28, 2025

You can also use Claude, Mistral, Llama and others on Google Vertex, similar to Bedrock.

fspeech · on Jan 27, 2025

You are not taking into account why people are willing to pay exceedingly high prices for GPUs now and that the underlying reason may have been taken away.

tokioyoyo · on Jan 27, 2025

Build trust by releasing your inferior product for free and as open as possible. Get attention, then release your superior product behind paywall. Name recognition is incredibly important within and outside of China.

Keep in mind, they’re still competing with Baidu, Tencent and other AI labs.