Hacker Newsnew | past | comments | ask | show | jobs | submit | darkoob12's commentslogin

He is basically asking OpenAI to publish their methodology so we can understand the real state of AI in solving math problems.

I don't know how much novelty should you expect from IMO every year but i expect many of them be variation of the same problem.

These models are trained on all old problem and their various solutions.For LLM models, solving thses problems are as impressive as writing code.

There is no high generalization.


You should expect quite a bit of novelty from the IMO, given the constraint of high school level curriculum. The problem setters work very hard to avoid problems that are variations of other contests or solvable by routine methods. That's why this is a very exciting result--you can't just regurgitate homework problem solutions to get a high score at the IMO.

You shouldn't believe Big Tech on their PR statements.

They are decades behind in AI. I have been following AI research for a long time. You can find best papers published by Microsoft, Google, Facebook in past 15 years but not Apple. I don't know why but they didn't care about AI at all.

I would say this is PR to justify their AI state.


Apple used to be at the edge of AI. They shipped Siri before "AI assistant" went mainstream, they were one of the first to ship an actual NPU in consumer hardware and put neural networks into features people use. They were spearheading computational photography. They didn't publish research, they're fucking Apple, but they did do the work.

And then they just... gave up?

I don't know what happened to them. When AI breakthrough happened, I expected them to put up a fight. They never did.


> I don't know what happened to them.

Tim Cook happened. The fish rots from the head down.


>I don't know what happened to them. When AI breakthrough happened, I expected them to put up a fight. They never did.

Apple always had the luxury of time. They work heavily on integrating deeply into their ecosystems without worrying about the pace of the latest development. eg. Widgets were a 2023 feature for iOS. They do it late, but do it well.

The development in the LLM space was and is too fast for Apple to compete in. They usually pave their own path and stay in their lane as a leader. The impact on Apple's brand image will be tarnished if Google, Meta, OpenAI, MS all leapfrog Apple's models every 2-3 months. That's just not what the Apple brand is associated with.


If you're working on machine learning the most economic choice is Python.

But weiting a processing pipeline with Python is frustrating if you have worked with C# concurrency.

I figured the best option is Celery and you cannot do it without an external broker. Celery is a mess. I really hate it.


Agree. I think it's improved a bit but Celery is frustrating as the defacto job/queue solution. A lot of the defaults make it unreliable (it can lose jobs if workers crash or don't shutdown cleanly)

I'm hoping the existence of free-threading will push for more first-class concurrency primitives. Concurrent Futures is nice until you need a concurrent-safe data structure besides a queue


Agree that celery is a mess and it doesn't work well with async (Asyncio) python. I think version 6 maybe will support it sometime.

I also had a lot of problem due to async primitives with sqlalchemy - there's some tricky stuff with asyncio.gather vs TaskGroup and how sqlalchemy session works with it to be able to compose code easily.


I think in this structure people only think locally and they are not concerned with the overall mission of the company and do not actively think about morality of the mission or if they are following it.

In my experience, front-line and middle managers will penalize workers that stray from their explicit goals because they think something else more readily contributes to the company’s mission.

Kind of sounds like a traditional public company is a constitutional monarchy, not always the best but at least there's a balance of interests. While a private company could either be an autocracy or oligarchy where sucking up and playing tribal politics is the only way to survive.

Anyone tried setting up a modestly sized tech company where employees are randomly placed into various seniority roles at the start of each year? Of course considering capabilities and some business continuity concerns…

Could work with a bunch of similarly skilled people in a narrow niche


That's what David Graeber's Bullshit Jobs is all about! Modern companies as medieval-style fiefdoms where mid-level managers expand their domains to justify their salaries, not because the org demands it

> I think there is a good chance this behavior is unintended!

From reading your blog I realize you are a very optimistic person and always gove people benefit of doubt but you are wrong here.

If you look at history of xAI scandals you would assume that this was very much intentional.


The question is stupid and that's not the problem. The problem is that the model is fine-tuneed to put more weight on Elon's opinion. Assuming Elon has the truth it is supposed and instructed to find.

The behaviour is problematic, also Grok 4 might be relating "one word" answers to Elon's critique of ChatGPT, and might be seeking related context to that. Others demonstrated that slightly prompt wording changes can cause quite different behaviour. Access to the base model would be required to implicate fine-tuning Vs pre-training. Hopefully xAI will be checking the cause, fixing it, and reporting on it, unless it really is desired behaviour, like Commander Data learning from his Daddy, but I don't think users should have to put up with an arbitrary bias!

The question is not stupid, it's an alignment problem and should be fixed.

I've clarified my comment you replied to BTW.

I wonder how long it takes for Elon fans to flag this post.

I suspect that you never were truly interested in programming otherwise you wouldn't have preferred talking to several LLM models instead of writing code yourself.

Nobody forced you to switch LLM models until eventually one of them solve your problem.


It's mostly anout how Israel army controls the way journalists report the war or regime in west bank that walks, quacks, and swims like an apartheid but apparently they can't call it that.

Sadly no one will be able to document the carnage in gaza. They plan to create an internment camp in the south and move civilians into at after making sure they are not linked to Hamas. Then they are going to basically follow Trump's plan to clean Gaza by building new jewish settlements and kill anyone outside the internment camp. While doing that they will not allow independent journalists to go in gaza.


As much as there are barriers to reporters here, it seems less than most other conflicts. Its not like journalists have unrestricted access to the Ukraine/Russia front line. Access to other conflicts like Sudan or Myanmar are also very restricted in practise.

That doesn't appear to be correct. How have you reached that conclusion?

Israel has not granted access to journalists to report independently since October 2023.

There has been very limited escorted trips with external journalists but all tightly supervised by the IDF.

Journalists already in Gaza have been killed regularly and there are credible accusations that many are deliberately targeted by the IDF.


> Israel has not granted access to journalists to report independently since October 2023.

Has Russia granted access by independent journalists to russian occupied Ukraine in that time period? As far as i know the answer is no.

And even on the Ukraine side there has been significant restrictions

E.g. a quote from https://theintercept.com/2023/06/22/ukraine-war-journalists-...

“The Ukrainian government has made it virtually impossible for journalists to do real front line reportage.”

Maybe its hard to say which one is worse, but they seem to be at least in the same neighbourhood


If we define “worse” as higher journalist deaths, zero press freedom, no access, and active targeting, then Gaza is clearly worse for journalists right now.

Ukraine/Russia conflict is obviously extremely dangerous but it allows far more media access, transparency, and foreign presence.


> zero press freedom

According to the world press freedom index, Israel has the third highest press freedom of all middle eastern countries (Qatar and Cyprus are a bit higher, everyone else in the middle east is lower in most cases much lower).

https://en.wikipedia.org/wiki/World_Press_Freedom_Index#Rank...

I'm not saying its a paradise for reporters. There are clearly issues. But saying "zero press freedom" is a massive overstatement.


It's not an overstatement. If external journalists aren't allowed into Gaza on their own terms then it is a fact.

How you can have a sentence that includes the word "paradise" in it when referring to what's happening in Gaza is beyond me.


Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: