It's 1.

It means that a developer can use their relatively low-powered Apple device (with UMA) to develop for deployment on Nvidia's relatively high-powered systems.

That's nice to have for a range of reasons.
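
For anyone curious what that workflow looks like in practice, here's a minimal sketch using MLX's Python API (illustrative only; it assumes an MLX build with the CUDA backend on the Nvidia side). The point is that the script itself never mentions Metal or CUDA, so the same code runs on the laptop's GPU and on the deployment box:

    import mlx.core as mx

    # Use whatever GPU backend this MLX build targets:
    # Metal on an Apple Silicon laptop, CUDA on an NVIDIA machine.
    mx.set_default_device(mx.gpu)

    # A toy compute graph: matrix multiply followed by a softmax.
    a = mx.random.normal((4096, 4096))
    b = mx.random.normal((4096, 4096))
    probs = mx.softmax(a @ b, axis=-1)

    # MLX is lazy; eval() forces the computation on the selected device.
    mx.eval(probs)
    print(probs.shape, probs.dtype)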

If Apple cannot do their own implementation of CUDA due to copyright, the second-best option is this: getting developers to build for MLX (which runs on their laptops) and still get NVIDIA hardware support.

Apple should do a similar thing for AMD.


I thought that the US Supreme Court decision in Google v. Oracle and the Java reimplementation provided enough case precedent to allow companies to re-implement something like CUDA APIs?

https://www.theverge.com/2021/4/5/22367851/google-oracle-sup...

https://en.wikipedia.org/wiki/Google_LLC_v._Oracle_America,_....


Exactly, and see also ROCm/HIP, which is AMD's reimplementation of CUDA for their GPUs.

Reimplementation of CUDA C++, not CUDA.

CUDA is a set of four compilers (C, C++, Fortran, and Python JIT DSLs), a bytecode and two compiler backend libraries, a collection of compute libraries for the languages listed above, plugins for Eclipse and Visual Studio, and a graphical GPU debugger and profiler.


There's ZLUDA for AMD, which actually implements CUDA, but it's still quite immature.

It would be great for Apple if enough developers took this path and Apple could later release datacenter GPUs that support MLX without CUDA.

It's the other way around. If Apple released data center GPUs then developers might take that path. Apple has shown time and again they don't care for developers, so it's on them.

What is the performance penalty compared to a program in native CUDA?

"relatively high powered"? there's nothing faster out there.

Relative to what you can get in the cloud or on a desktop machine.

I wonder what Apple would have to do to make Metal + its processors run faster than Nvidia? I guess it's all about the interconnects, really.

Right now, for LLMs, the only limiting factor on Apple Silicon is memory bandwidth. There hasn’t been progress on this since the original M1 Ultra. And since abandoning UltraFusion, we won’t see progress here anytime soon either.
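
To put a rough number on the bandwidth argument (a back-of-the-envelope sketch with illustrative figures, not benchmarks): single-stream token generation is memory-bound because each generated token has to stream essentially all of the weights from memory, so the ceiling is roughly bandwidth divided by model size in bytes.

    # Rough upper bound for memory-bound single-stream decoding:
    #   tokens/sec <= memory_bandwidth / model_size_in_bytes
    # The numbers below are illustrative assumptions, not measurements.

    def max_tokens_per_sec(bandwidth_gb_s, params_billion, bytes_per_param):
        model_bytes = params_billion * 1e9 * bytes_per_param
        return bandwidth_gb_s * 1e9 / model_bytes

    # M1 Ultra class: ~800 GB/s unified memory bandwidth (spec figure),
    # running a 70B-parameter model quantized to ~4 bits (~0.5 bytes/param).
    print(max_tokens_per_sec(800, 70, 0.5))  # ~23 tokens/s ceiling

That's why adding compute without moving memory bandwidth doesn't help much for LLM inference.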

Have they abandoned UltraFusion? Last I’d heard, they’d just said something like “not all generations will get an Ultra chip” around the time the M4 showed up (the first M chip lacking an Ultra variation), which makes me think the M5 or M6 is fairly likely to get an Ultra.

This is like saying the only limiting factor on computers is the von Neumann bottleneck.

Is this true per watt?

It doesn't matter for a lot of applications. But fair enough, for a big part of them it is either essential or a nice-to-have. It's completely off the point, though, if we are comparing the fastest compute no matter what.

...fastest compute no matter watt

Relative to the Apple hardware, the Nvidia hardware is high-powered.

I appreciate that English is your second language after your Hungarian mother tongue. My comment reflects upon the low- and high-powered compute of the Apple vs. Nvidia hardware.
