Apple MLNarrativeTrack: Evaluating Video Language Models Beyond the Frame
NarrativeTrack introduces a benchmark for evaluating temporally grounded narrative understanding in multimodal language models, using entity-centric reasoning and a Compositional Reasoning Progression framework that scales from entity existence to changes and ambiguity.
Apple MLImproving User Interface Generation Models from Designer Feedback
A designer-aligned pipeline for improving UI generation models that leverages feedback via commenting, sketching, and direct manipulation to finetune LLMs with multimodal data and human evaluations, outperforming traditional RLHF baselines.
DatabricksInstructed Retriever: Unlocking System-Level Reasoning in Search Agents
Introducing the Instructed Retriever, a system-aware retrieval architecture that propagates system specifications (user instructions, labeled examples, and index schema) through query generation and retrieval to enable robust instruction-following, low-latency enterprise agent search.
Google CloudReflecting on a year of transformation and mission impact together
One-year retrospective on Google Public Sector's AI-enabled cloud, security-first innovations, and agentic technologies like Gemini that accelerate government missions across defense, civilian agencies, and education.
Dear Duolingo: How do I support someone who’s learning a language?
Practical guide to designing language-learning support systems that empower learners through choice, forgiving feedback, and low-pressure practice.
Jane StreetFun with Algebraic Effects - from Toy Examples to Hardcaml Simulations
A practical guide to replacing monads with algebraic effects in OCaml 5, using the Handled_effect library to implement clean, composable Hardcaml FPGA testbenches and simulations.
DatabricksBCBS 239 Compliance in the Age of AI: Turning Regulatory Burden into Strategic Advantage
BCBS 239 compliance is reframed as an AI-powered, governance-driven, lakehouse-enabled transformation on Databricks that automates risk data aggregation, accelerates regulatory reporting, reduces costs, and scales for evolving rules like DORA and Basel IV.
MIT AIMIT scientists investigate memorization risk in the age of clinical AI
MIT researchers reveal how AI trained on de-identified electronic health records (EHRs) can memorize patient data and outline practical privacy evaluation methods to curb leakage in clinical foundation models.
MIT AIUsing design to interpret the past and envision the future
A designer uses digital fabrication, AI, and material innovation to reinterpret Black architectural heritage, reconstruct historic spaces from sparse archives, and prototype future-oriented products and interfaces.
Google CloudSimplify VM OS agent management at scale: Introducing VM Extensions Manager
Policy-driven, centralized VM extensions management integrated into the Compute Engine API, enabling scalable OS agent management across fleets with VM Extensions Manager, standardized rollout through zonal and global policies and configurable rollout speeds.
Google CloudSupercharge your Cloud SQL for MySQL write performance with new optimized writes
Automated, real-time tuning of Cloud SQL for MySQL writes in the Enterprise Plus edition delivers up to 3x throughput improvements and lower latency for write-heavy OLTP workloads, with benchmarking via sysbench.
Google CloudAuto-ISAC and Google partner to boost automotive sector cybersecurity
Google Cloud joins Auto-ISAC as an Innovator Partner to bolster automotive cybersecurity with shared threat intelligence, OT/IT convergence, and cloud-enabled protection for software-defined vehicles.