engblogs

How we monitor internal coding agents for misalignment

A concise guide to detecting and preventing misalignment in internal coding agents through monitoring, telemetry, governance, and automated safety controls.

3/19/2026

AWS ML

Run NVIDIA Nemotron 3 Super on Amazon Bedrock

A detailed look at NVIDIA Nemotron 3 Super on Amazon Bedrock, covering its Hybrid Transformer-Mamba Mixture of Experts architecture, Latent MoE, serverless fully managed inference, open weights and datasets, and real-world use cases across software development, finance, cybersecurity, search, and retail, plus how to get started with the AWS CLI or SDK.

3/19/2026

Building an MCP Ecosystem at Pinterest

A concise technical overview of building and scaling the MCP ecosystem at Pinterest, detailing architecture, integration patterns, and developer tooling for interoperable components.

3/19/2026

AWS ML

Introducing V-RAG: revolutionizing AI-powered video production with Retrieval Augmented Generation

V-RAG introduces a retrieval-augmented approach to AI-powered video production that grounds generated videos in retrieved reference imagery via a vector database, enhancing accuracy, customization, scalability, and multimodal capabilities while reducing hallucination.

3/19/2026

AWS ML

Use RAG for video generation using Amazon Bedrock and Amazon Nova Reel

A VRAG-powered, multimodal pipeline that combines image retrieval, prompt-based video generation, and batch processing with Amazon Bedrock, Amazon Nova Reel, OpenSearch vector engine, and S3 to transform structured text and reference images into scalable, AI-generated videos.

3/19/2026

Databricks

Announcing General Availability of Real-Time Mode for Apache Spark Structured Streaming on Databricks

General availability (GA) of Real-Time Mode (RTM) for Spark Structured Streaming on Databricks delivers millisecond-level latency on a unified Spark engine, eliminating the need for a separate streaming engine such as Flink for real-time workloads.

3/19/2026

OpenAI

OpenAI to acquire Astral

Technical overview of the strategic and architectural implications of OpenAI's planned acquisition of Astral, focusing on integration, data flow, and platform interoperability.

3/19/2026

Stripe

Testing the impact of Adaptive Pricing across 1.5M subscription checkout sessions

An empirical analysis of Adaptive Pricing for subscriptions, showing how local currency pricing across 1.5M checkout sessions improves conversion, authorization, and lifetime value while stabilizing renewals amid fluctuating exchange rates.

3/19/2026

Duolingo

4 must-know particles for ending Japanese sentences

A concise, technically oriented guide to four Japanese sentence-final particles (ka, ne, yo, yo ne) and how their subtle nuances, tones, and social functions shape questions, agreement, and everyday conversation.

3/19/2026

AWS ML

Enforce data residency with Amazon Quick extensions for Microsoft Teams

A practical, step-by-step guide to enforcing data residency by deploying multi-Region Amazon Quick extensions for Microsoft Teams, using IAM Identity Center and Microsoft Entra ID to route users to Region-specific Quick resources while maintaining GDPR and data sovereignty compliance.

3/19/2026

AWS ML

Enhanced metrics for Amazon SageMaker AI endpoints: deeper visibility for better performance

Granular, configurable metrics for SageMaker AI endpoints enable per-model cost attribution, real-time GPU and resource visibility, and precise troubleshooting across container- and instance-level metrics.

3/19/2026

MIT AI

A better method for identifying overconfident large language models

Pairing cross-model disagreement with a total uncertainty metric that combines epistemic and aleatoric uncertainty detects overconfident, unreliable LLM predictions more reliably, using a diverse ensemble and fewer queries.

3/19/2026