engblogs

summaries of the latest blog articles from your favorite tech companies.
Apple MLApple ML

NarrativeTrack: Evaluating Video Language Models Beyond the Frame

NarrativeTrack introduces a benchmark for evaluating temporally grounded narrative understanding in multimodal language models, using entity-centric reasoning and a Compositional Reasoning Progression framework that scales from entity existence to changes and ambiguity.

1/6/2026
Apple MLApple ML

Improving User Interface Generation Models from Designer Feedback

A designer-aligned pipeline for improving UI generation models that leverages feedback via commenting, sketching, and direct manipulation to finetune LLMs with multimodal data and human evaluations, outperforming traditional RLHF baselines.

1/6/2026
DatabricksDatabricks

Instructed Retriever: Unlocking System-Level Reasoning in Search Agents

Introducing the Instructed Retriever, a system-aware retrieval architecture that propagates system specifications (user instructions, labeled examples, and index schema) through query generation and retrieval to enable robust instruction-following, low-latency enterprise agent search.

1/6/2026
Google CloudGoogle Cloud

Reflecting on a year of transformation and mission impact together

One-year retrospective on Google Public Sector's AI-enabled cloud, security-first innovations, and agentic technologies like Gemini that accelerate government missions across defense, civilian agencies, and education.

1/6/2026
DuolingoDuolingo

Dear Duolingo: How do I support someone who’s learning a language?

Practical guide to designing language-learning support systems that empower learners through choice, forgiving feedback, and low-pressure practice.

1/6/2026
Jane StreetJane Street

Fun with Algebraic Effects - from Toy Examples to Hardcaml Simulations

A practical guide to replacing monads with algebraic effects in OCaml 5, using the Handled_effect library to implement clean, composable Hardcaml FPGA testbenches and simulations.

1/6/2026
DatabricksDatabricks

BCBS 239 Compliance in the Age of AI: Turning Regulatory Burden into Strategic Advantage

BCBS 239 compliance is reframed as an AI-powered, governance-driven, lakehouse-enabled transformation on Databricks that automates risk data aggregation, accelerates regulatory reporting, reduces costs, and scales for evolving rules like DORA and Basel IV.

1/5/2026
MIT AIMIT AI

MIT scientists investigate memorization risk in the age of clinical AI

MIT researchers reveal how AI trained on de-identified electronic health records (EHRs) can memorize patient data and outline practical privacy evaluation methods to curb leakage in clinical foundation models.

1/5/2026
MIT AIMIT AI

Using design to interpret the past and envision the future

A designer uses digital fabrication, AI, and material innovation to reinterpret Black architectural heritage, reconstruct historic spaces from sparse archives, and prototype future-oriented products and interfaces.

1/5/2026
Google CloudGoogle Cloud

Simplify VM OS agent management at scale: Introducing VM Extensions Manager

Policy-driven, centralized VM extensions management integrated into the Compute Engine API, enabling scalable OS agent management across fleets with VM Extensions Manager, standardized rollout through zonal and global policies and configurable rollout speeds.

1/5/2026
Google CloudGoogle Cloud

Supercharge your Cloud SQL for MySQL write performance with new optimized writes

Automated, real-time tuning of Cloud SQL for MySQL writes in the Enterprise Plus edition delivers up to 3x throughput improvements and lower latency for write-heavy OLTP workloads, with benchmarking via sysbench.

1/5/2026
Google CloudGoogle Cloud

Auto-ISAC and Google partner to boost automotive sector cybersecurity

Google Cloud joins Auto-ISAC as an Innovator Partner to bolster automotive cybersecurity with shared threat intelligence, OT/IT convergence, and cloud-enabled protection for software-defined vehicles.

1/5/2026