How FlashAttention-2 Accelerates LLMs on NVIDIA H100 and A100 GPUs
Exploring the use of FlashAttention-2 on Lambda Cloud and comparing benchmark results between NVIDIA H100 and A100 GPUs for training GPT-3-style models.
How AI saves money and improves banking complaint handling
Using AI to handle banking complaints improves efficiency and customer satisfaction while saving money.
Teaching language models to reason algorithmically
This blog post explores an approach to teaching language models algorithmic reasoning capabilities through in-context learning and algorithmic prompting.
How to compare a noisy quantum processor to a classical computer
This blog post discusses how to compare a noisy quantum processor to a classical computer in terms of computational cost and introduces effective quantum volume, a measure of the computational cost of a quantum experiment.
Announcing the Preview of Amazon SageMaker Profiler: Track and visualize detailed hardware performance data for your model training workloads
Track and visualize detailed hardware performance data for your model training workloads with Amazon SageMaker Profiler.
How to help high schoolers prepare for the rise of artificial intelligence
A one-week summer program for high schoolers to foster a deeper understanding of machine-learning approaches in health.
Supporting sustainability, digital health, and the future of work
The MIT and Accenture Convergence Initiative for Industry and Technology selects three new research projects to support.
Encouragement Designs and Instrumental Variables for A/B Testing
Exploring encouragement designs and instrumental variables for A/B testing at Spotify.
Unleashing GitHub Codespaces templates to ignite your development
Discover how to maximize your development productivity by utilizing templating features in GitHub Codespaces.
Using MLflow AI Gateway and Llama 2 to Build Generative AI Apps
Building generative AI apps using MLflow AI Gateway and Llama 2, with a focus on Retrieval-Augmented Generation (RAG) applications.