Reddit’s set to rake in $60M per year in a deal with an unnamed AI company to train future models on its 20 years’ worth of user generated content
February 20, 2024
If you’ve ever posted to Reddit there’s a good chance you’re helping train the next generation of AI models with your own words, pictures, and memes, because the company’s selling access to its 20 years’ worth of content for a reported $60 million. I mean, chances are you’ve already been used to train AIs given that Reddit’s already featured pretty heavily in the training data for a bunch of different large language models (LLMs) and image generators, but at least now someone’s getting paid for it.
Generative AI models, such as ChatGPT and Stable Diffusion, need to be trained on databases comprising hundreds of millions of images, books, video clips, music, and so on. Sometimes, the source is publicly available and open to use by anyone, and sometimes AI companies simply ‘borrow’ what’s just lying around on the web. But there’s seldom any money handed over between the two bodies. Not so with Reddit, as it seems that it’s entered into a deal where for a healthy lump of cash each year, an AI model can use the site’s content for training.
We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept”, you consent to the use of ALL the cookies.
This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the ...
Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
Cookie
Duration
Description
cookielawinfo-checkbox-analytics
11 months
This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional
11 months
The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary
11 months
This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others
11 months
This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance
11 months
This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy
11 months
The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.