DropboxA practical blueprint for evaluating conversational AI at scale
A comprehensive guide detailing a structured, scalable evaluation framework for conversational AI, emphasizing dataset curation, actionable metrics with LLM judges, automated testing pipelines, and continuous improvement to ensure reliability and quality in real-world deployment.
OpenAISamsung and SK join OpenAI’s Stargate initiative to advance global AI infrastructure
Samsung and SK partner with OpenAI's Stargate initiative to enhance AI infrastructure in Korea through advanced memory chip production, AI data centers expansion, and integrating ChatGPT Enterprise capabilities.
OpenAILaunching Sora responsibly
Sora 2 launches with integrated safety features including AI content transparency, consent-based likeness control, teen protections, harmful content filtering, audio safeguards, and robust user control mechanisms to ensure responsible video generation and sharing.
OpenAISora 2 System Card
Sora 2 is an advanced video and audio generation model offering enhanced realism, physics accuracy, and safety measures for creative and responsible content creation.
OpenAISora 2 is here
Sora 2 launches as a breakthrough video and audio generation model offering advanced physical accuracy, synchronized dialogue, enhanced controllability, and a new social app enabling realistic cameos and creative collaboration while prioritizing user wellbeing and safety.
OpenAIThe Sora feed philosophy
Sora Feed combines creativity-driven ranking, user control, personalized recommendations, and robust safety measures to foster a safe, inspiring, and connected user experience.
All our product updates from Stripe Tour New York
Stripe Tour New York unveiled over 40 new product updates focusing on AI integration, payments innovation, embedded finance, stablecoins, and enhanced revenue tools to transform global commerce.
Introducing Open Issuance from Bridge: A new platform to launch your own stablecoin
Open Issuance by Bridge empowers businesses to create and manage their own customizable stablecoins with seamless liquidity integration and control over economics, enabling faster and cost-effective entry into the stablecoin market.
Snorkel AIEvaluating coding agent capabilities with Terminal-Bench: Snorkel’s role in building the next generation benchmark
Terminal-Bench, with significant contributions from Snorkel AI, sets a new standard for evaluating AI coding agents in complex terminal environments by providing real-world, end-to-end tasks and robust benchmarking frameworks.
MIT AIResponding to the climate impact of generative AI
Exploring innovative strategies and technological advancements to significantly reduce the growing carbon footprint of generative AI through improved hardware efficiency, smarter data center operations, renewable energy integration, and AI-driven solutions.
SalesforceRevolutionizing Data Cloud: Unleashing the Power of the New ML Recommendations System
Explore how Salesforce Personalization's engineering team revolutionized the Data Cloud with a flexible ML recommendations system, integrating multi-cluster architecture, automated NDCG evaluation in CI/CD pipelines, and AI-powered development workflows to deliver scalable, high-quality, and ethical personalized experiences.
OpenAIBuy it in ChatGPT: Instant Checkout and the Agentic Commerce Protocol
Introducing Instant Checkout powered by the open-source Agentic Commerce Protocol, enabling seamless AI-assisted purchases directly within ChatGPT while maintaining merchant control and secure transactions.