engblogs

summaries of the latest blog articles from your favorite tech companies.
DatabricksDatabricks

How 7‑Eleven Transformed Maintenance Technician Knowledge Access with Databricks Agent Bricks

A technical walkthrough of how 7-Eleven replaced scattered maintenance documents with a Databricks Agent Bricks powered AI assistant, integrating Unity Catalog, vector search with Embeddings Compute, and a Teams Bot to surface contextual manuals, diagrams and images in seconds while reducing downtime.

1/9/2026
DatabricksDatabricks

Thumbtack Powering Safe, Smart Home Services on Databricks with GenAI

Unifying GenAI on Databricks to deliver a safe, scalable home-services platform through a hybrid CNN-LLM workflow that enhances privacy, trust, and collaboration across Thumbtack’s data science and engineering stack.

1/9/2026
Apple MLApple ML

AgentBuilder: Exploring Scaffolds for Prototyping User Experiences of Interface Agents

Explores scaffolds and prototyping tools for shaping the user experiences of interface agents, via AgentBuilder design probes, requirements elicitation, and in-situ on-device multi-agent prototyping.

1/9/2026
Apple MLApple ML

AdaBoN: Adaptive Best-of-N Alignment

AdaBoN introduces a prompt-adaptive Best-of-N alignment framework featuring a two-stage algorithm: an initial exploratory phase to estimate per-prompt reward distributions, then adaptive budget allocation to improve LM-RM alignment while reducing inference latency across diverse prompts.

1/9/2026
OpenAIOpenAI

OpenAI and SoftBank Group partner with SB Energy

A concise technical overview of the cross-sector partnership between OpenAI, SoftBank Group, and SB Energy to explore AI-powered energy technologies.

1/9/2026
Fly.ioFly.io

Code And Let Live

Replace read-only ephemeral sandboxes with Sprites—durable, instantly bootable computers that support checkpoint/restore, ample storage, and global Anycast access for seamless development-to-prod workflows.

1/9/2026
AWS MLAWS ML

Accelerating LLM inference with post-training weight and activation using AWQ and GPTQ on Amazon SageMaker AI

An end-to-end guide to accelerating LLM inference via post-training weight-and-activation quantization using AWQ and GPTQ on Amazon SageMaker AI, reducing memory and latency without retraining.

1/9/2026
Snorkel AISnorkel AI

Introducing the Snorkel Agentic Coding Benchmark

Introducing the Snorkel Agentic Coding Benchmark—a real-world, end-to-end evaluation suite for AI coding agents that spans multiple languages, 100 multi-step tasks across four difficulty tiers, and rigorous long-horizon planning, error recovery, and sandboxed execution.

1/9/2026
Apple MLApple ML

Which Evaluation for Which Model? A Taxonomy for Speech Model Assessment

A principled taxonomy for evaluating speech models that maps evaluation types to model capabilities and task requirements, revealing gaps in prosody, interaction, and reasoning to guide benchmark design and practical model selection.

1/9/2026
Google CloudGoogle Cloud

How Hackensack Meridian Health de-risked network migration using VPC Flow Logs

Case study: Hackensack Meridian Health de-risks a large Google Cloud network migration by using VPC Flow Logs and Flow Analyzer to visualize Cloud Interconnect traffic from VLAN attachments via Sankey diagrams.

1/9/2026
Apple MLApple ML

Inferring Optical Tissue Properties from Photoplethysmography using Hybrid Amortized Inference

Introducing PPGen, a biophysical model that maps photoplethysmography signals to interpretable optical and physiological tissue parameters, and Hybrid Amortized Inference (HAI) for fast, robust, and scalable parameter estimation in wearables, balancing deep learning performance with clinical interpretability and hardware-aware design.

1/9/2026
Apple MLApple ML

Pretraining with Hierarchical Memories: Separating Long-Tail and Common Knowledge

A memory-augmented pretraining approach using hierarchical memories to separate long-tail knowledge from common knowledge, with a small anchor model accessing a large memory bank for memory-efficient inference and fine-tuning on resource-constrained devices.

1/9/2026