OpenAI Strikes a Reddit Deal to Train Its AI on Your Posts

Reddit, one of the internet’s largest discussion platforms, has entered into a significant agreement with OpenAI, the company behind ChatGPT. This deal grants OpenAI access to Reddit’s real-time content through its data API, enabling the integration of Reddit discussions into ChatGPT and other OpenAI products. The arrangement is reminiscent of Reddit’s earlier $60 million deal with Google, though financial details of this new partnership remain undisclosed.

The collaboration is set to benefit both parties. Reddit will leverage OpenAI’s large language models to develop new AI-powered features for its users and moderators. Additionally, OpenAI has committed to becoming an advertising partner on the Reddit platform, potentially opening up new revenue streams for the social media giant.

However, this partnership may face scrutiny from Reddit’s user base, known for its vocal opposition to certain management decisions. In June 2023, over 7,000 subreddits went dark in protest of Reddit’s API pricing changes. Similar concerns arose recently when users of Stack Overflow, another programming forum, faced suspensions for attempting to delete their posts following a partnership announcement with OpenAI.

Notably, the announcement does not mention the use of Reddit data for AI model training, a point that was explicitly stated in the Google deal. This omission may be an attempt to avoid potential backlash from users concerned about data privacy and usage.

The deal’s approval process involved OpenAI’s COO and independent Board of Directors, likely due to CEO Sam Altman’s position as a Reddit shareholder, which was disclosed in the announcement.

Reddit CEO Steve Huffman emphasized the platform’s value as an extensive archive of authentic human conversations, suggesting that integrating this content into ChatGPT aligns with the goal of creating a more connected internet and helping users find relevant information and communities.

It’s worth noting that Reddit has previously been cautious about data scraping for AI model training, even threatening to block Google’s web crawlers. Ironically, OpenAI had earlier accused the r/ChatGPT subreddit of copyright infringement for using the ChatGPT logo.

This partnership represents a significant development in the AI and social media landscape, potentially influencing how user-generated content is integrated into AI systems. However, it also raises questions about data usage, user privacy, and the evolving relationship between tech giants and online communities.

Latest articles