Pretty sure gpt-4 is doing image-to-llm directly while Gemini also runs the image through Google lens which is tailor-made for OCR. I think that's a good thing to create a better product overall but the performance isn't really comparable
Very-leftist here. They're becoming an everyone culture war thing because what they did was sufficiently brain-dead as to make everyone look bad. It was sufficiently skewed as to create a number of extremely racist images just because it was trying way too hard to avoid it.
That said, those of us in-the-know understand that this is just part of the cycle and they'll go back to being annoyingly bland and completely devoid of entertainment value soon.
Most people are attributing this to woke culture at google but it's possible it's just a cock up. I mean someone told it to use diverse images and forgot about historical plausibility.
This is an early product, and monkeying with the system prompts to try and prevent some classes of offensive results is sure to have the kind of unintended consequences seen with Gemini's image generation.
You can learn a lot about people's media diets based on how they perceive these issues with Gemini. Those who quickly proclaim that the product is a shameful embarrassment are almost universally very online, particularly in right wing spaces like twitter/x, even if they wouldn't consider themselves to be "anti-woke" personally.
Taking a step back, Google is clearly going to mess with the system prompt over time to correct this stuff, and bystanders' degree of fixation on this specific issue tends to suggest a certain set of politics. People have bemoaned how "woke" ChatGPT was in the past too, and OpenAI has spent a lot more time under public scrutiny iterating on their own system prompt.