Gathering human feedback
Optimizing AI models through human feedback loop integration
Inside a SoundCloud Microservice
Exploring the inner workings of a microservice at SoundCloud
Better exploration with parameter noise
Improving learning efficiency through the use of parameter noise
Proximal Policy Optimization
Understanding Proximal Policy Optimization for Enhanced Reinforcement Learning
Robust adversarial inputs
Exploring strategies to defend against robust adversarial inputs in machine learning models
Hindsight Experience Replay
Exploring the benefits of hindsight experience replay in machine learning algorithms
Teacher–student curriculum learning
Exploring teacher-student curriculum learning methods in the context of education
Faster physics in Python
Optimizing Python code performance for physics simulations
Remote device sign-in
A method for signing in to a device without a keyboard using a game controller and onscreen keyboard.
A Better Model of Data Ownership
Defining ownership of datasets and ensuring the right teams own the right datasets for better data management.
Learning from human preferences
Leveraging insights from human preferences to enhance user experiences.
Learning to cooperate, compete, and communicate
Exploring the dynamics of cooperation, competition, and communication