OpenAI clearly states that they train on your data https://help.openai.com/en/ar...

lemming · on Jan 31, 2025

By default, we do not train on any inputs or outputs from our products for business users, including ChatGPT Team, ChatGPT Enterprise, and the API. We offer API customers a way to opt-in to share data with us, such as by providing feedback in the Playground, which we then use to improve our models. Unless they explicitly opt-in, organizations are opted out of data-sharing by default.

The business bit is confusing, I guess they see the API as a business product, but they do not train on API data.

therein · on Jan 31, 2025

So for posterity, in this subthread we found that OpenAI indeed trains on user data and it isn't something that only DeepSeek does.

lemming · on Jan 31, 2025

So for posterity, in this subthread we found that I can use OpenAI without them training on my data, whereas I cannot with DeepSeek.

therein · on Jan 31, 2025

What do you mean? They both say the same thing for usage through API. You can also use DeepSeek on your own compute.

lemming · on Jan 31, 2025

Where does DeepSeek say that about API usage? Their privacy policy says they store all data on servers in China, and their terms of use says that they can use any user data to improve their services. I can’t see anything where they say that they don’t train on API data.

pzo · on Jan 31, 2025

> Services for businesses, such as ChatGPT Team, ChatGPT Enterprise, and our API Platform > By default, we do not train on any inputs or outputs from our products for business users, including ChatGPT Team, ChatGPT Enterprise, and the API.

So on API they don't train by default, for other paid subscription they mention you can opt-out