Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Rex Computing (rexcomputing.com)
87 points by tambourine_man on May 29, 2024 | hide | past | favorite | 20 comments


Founder of REX Computing here; I highly recommend checking out my interview on the Microarch Club podcast linked elsewhere on the thread; will also answer questions on this thread if anyone has them.


The teaser reminds me a lot of other *failed* high performance/high efficiency architecture redesigns that failed because of the unreasonable effort required to squeeze out a useful fraction of the promised gains e.g. Transputer and Cell. Can you link to written documentation of how existing code can be ported? I doubt you can just recompile ffmpeg or libx264, but level of toolchain support can early adopters expect? Does it require manually partitioning code+data and mapping it to the on-chip network topology?


We had a basic LLVM backend that supported a slightly modified clang frontend and a basic ABI. We tried to make it drastically easier for both the programmer and compiler to handle memory by having all memory (code+data) be part of a global flat address space across the chip, with guarantees being made to the compiler by the NoC on the latency of all memory accesses across one or multiple chips. We tested this with very small programs that could fit in the local memory of up to two chips (128KB of memory), but in theory it could have scaled up to the 64 bit address space limit. Compilation time for programs was long, but fully automated, specifically to improve upon problems faced by Cell and other scratchpad memory architectures… some of our original funding in 2015 from DARPA was actually for automated scratchpad memory management techniques on Texas Instruments DSPs and Cell (our paper: https://dl.acm.org/doi/pdf/10.1145/2818950.2818966)

This was all designed a decade ago, and REX has been in effectively hibernation since the end of 2017 after successfully taping out our 16 core test chip back in 2016, but being unable to raise additional funding to continue. I have continued to work on architectures that have leveraged scratchpad memories in different ways, including on cryptocurrency and machine learning ASICs, including at my current startup, Positron AI (https://positron.ai)


This is inspiring stuff.

Was extremely interested in the Inmos Transputer in the 80s. Seems like an idea way ahead of its time… a bit like REX.

I find the parallels in design with the Actor concurrency model of Erlang, Elixir and Transputer/REX are very compelling.

Really hope something happens with this project or some spinoff from it.

The current interest in RISC-V is testament to the fact that it may still be viable.

I wish you great success. Wish there was a way to sponsor or crowd-source fund this.


VLIW has low latency. Why is that important for an inference engine?


This looks abandoned at first glance.

Maybe someone on HN has an update as to what happened to this project?


One of the founders said this when I asked about it:

https://news.ycombinator.com/item?id=39438579


Great! Here is the podcast episode he mentions in the comment you linked:

https://microarch.club/episodes/10/

> Thomas Sohmers joins to discuss dropping out of high school at age 17 to start a chip company, lessons from the successes and failures of past processor architectures, the history of VLIW, and the new AI hardware appliances he and his team are building at Positron AI.


Way more interesting than this old zombie company is that new podcast about cpu design. Cool!


I just found the ideas interesting and the contrast to Itanium's VLIW.

Maybe this was a better link: https://www.youtube.com/watch?v=ki6jVXZM2XU


I noticed that there hasn't been much publication or other activity since 2015, so for all practical purposes it's dead.

It would've been interesting to see what shook loose if they'd kept pushing.


Like the Mill CPU arch, looking for someone to buy the IP.


Vaguely reminds me of the Adapteva Epiphany RISC multi-processors from the old Parallella Kickstarter project, and presumably others, but that's the one I played with for a while.

I'm not sure how this project's interconnect differs, they do say theirs is revolutionary, maybe that's the difference.

[1] https://www.adapteva.com/epiphanyiii/ [2] https://parallella.org


This looks like the Cell CPU, how's it different?


Thomas addresses this in one of these two, I can't remember which, sorry:

https://www.youtube.com/watch?v=ki6jVXZM2XU

https://www.youtube.com/watch?v=DwBeSjWkcKc


Definitely addressed in the first link around the 10 minute mark: https://youtu.be/ki6jVXZM2XU?si=qm9nt0wtroZGul93&t=608


Darn - I got all excited that maybe Rexx was back with something interesting -0 although this is also exciting and new.

https://en.wikipedia.org/wiki/Rexx


Will a sub-$100 SBC with this chip on it be put on the market?


On first glance, with the routing part, it seems similar to Tenstorrent.


The premise of simplifying architecture, focus on memory, reliance on software, even the fact that you can stack a ton of chips per nose, all sounds very much like Groq. I wonder if this another case of multiple discovery.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: