engblogs

summaries of the latest blog articles from your favorite tech companies.
SalesforceSalesforce

Building AWS Bedrock Model Availability: Slashing AI Routing Discovery From Days to Minutes

Automates Bedrock model availability and cross-region routing for Agentforce across 30+ AWS accounts, slashing discovery from days to minutes with Lambda-based data collection, CloudWatch metrics, and Grafana-driven observability to improve latency, resilience, and compliance.

3/16/2026
AWS MLAWS ML

Introducing Disaggregated Inference on AWS powered by llm-d

Explores deploying disaggregated inference on AWS with llm-d, detailing prefill/decode separation, cache-aware routing, MoE and tiered KV caching, and deployment on SageMaker HyperPod and EKS.

3/16/2026
AWS MLAWS ML

Agentic AI in the Enterprise Part 2: Guidance by Persona

Guidance-driven blueprint for deploying agentic AI in enterprises, mapping leadership personas to governance, measurement, and workflow orchestration to turn autonomous agents into scalable, auditable business capability.

3/16/2026
Google CloudGoogle Cloud

Google Cloud and NVIDIA expand AI innovation across industries at GTC 2026

A technical overview of Google Cloud and NVIDIA's co-engineered AI Hypercomputer stack that accelerates agentic AI, large-model inference, and industry-scale workloads through G4 VMs, Vera Rubin NVL72 integration, Dynamo-enabled orchestration, and Vertex AI enhancements.

3/16/2026
Google CloudGoogle Cloud

BigQuery Studio is more useful than ever, with enhanced Gemini assistant

BigQuery Studio's Gemini-powered assistant delivers context-aware interoperability, intelligent resource discovery across projects, and instant job analysis to turn analytics overhead into proactive AI-assisted data workflows.

3/16/2026
Modular AIModular AI

Modular: Modular at NVIDIA GTC 2026: MAX on Blackwell, Mojo Kernel Porting, and DeepSeek V3 on B200

Modular's NVIDIA GTC 2026 showcase highlights MAX on Blackwell, a Mojo-based port of CUTLASS kernels to Mojo with 130.7 TFLOPS on B200, and DeepSeek V3 running in Modular Cloud, illustrating end-to-end diffusion pipelines, kernel optimization, and scalable inference.

3/16/2026
Google CloudGoogle Cloud

Ransomware Under Pressure: Tactics, Techniques, and Procedures in a Shifting Threat Landscape

Analyzes the 2025 ransomware landscape—TTP evolution, data-theft extortion trends, and shifts in the RaaS ecosystem, highlighting cross-platform deployments and implications for defense and recovery.

3/16/2026
AWS MLAWS ML

Build an offline feature store using Amazon SageMaker Unified Studio and SageMaker Catalog

An end-to-end guide to building a scalable, governed offline feature store with SageMaker Unified Studio and SageMaker Catalog, using Apache Iceberg-backed S3 Tables, Lake Formation access control, and a publish-subscribe workflow to enable cross-team collaboration and reproducible ML features.

3/16/2026
AWS MLAWS ML

How Workhuman built multi-tenant self-service reporting using Amazon Quick Sight embedded dashboards

Workhuman demonstrates a scalable approach to multi-tenant, self-service analytics by embedding Amazon QuickSight dashboards, enforcing strict data isolation with namespaces and row-level security, and automating deployment through a CI/CD pipeline to deliver customized insights across SaaS customers.

3/16/2026
DatabricksDatabricks

Breaking the Microbatch Barrier: The Architecture of Apache Spark Real-Time Mode

Apache Spark 4.1's Real-Time Mode rearchitects Structured Streaming from microbatches to non-blocking, concurrent stages, delivering millisecond latency while maintaining high throughput and fault tolerance.

3/16/2026
CloudflareCloudflare

From legacy architecture to Cloudflare One

A practical blueprint for de-risking large-scale migrations from legacy VPNs to Cloudflare One by combining Cloudflare's Zero Trust platform with CDW's tiered, risk-aware SASE approach to modernize security without downtime.

3/16/2026
Berkeley AIBerkeley AI

Identifying Interactions at Scale for LLMs

Introducing SPEX and ProxySPEX, scalable, sparsity-aware methods for discovering and attributing influential interactions across features, data, and internal model components in LLMs through efficient ablations.

3/13/2026