engblogs

summaries of the latest blog articles from your favorite tech companies.
Apple MLApple ML

Over-Searching in Search-Augmented Large Language Models

Systematic evaluation of over-searching in search-augmented LLMs, revealing when external search improves accuracy versus abstention failures, introducing Tokens Per Correctness (TPC), examining multi-turn dynamics and noisy retrieval, and outlining mitigation strategies together with the OverSearchQA benchmark.

1/12/2026
MetaMeta

CSS at Scale With StyleX

Explores StyleX, Meta's CSS-at-scale framework that blends CSS-in-JS ergonomics with static CSS performance, enabling atomic styling with deduplication to shrink bundle sizes across major products and highlight open-source adoption.

1/12/2026
AirbnbAirbnb

Pay As a Local

Explores strategies for integrating local payment methods and region-specific checkout flows to improve conversions and user experience.

1/12/2026
Two SigmaTwo Sigma

AI in Investment Management: 2026 Outlook (Part I)

Explores how AI will transform quantitative investment management in 2026, portraying AI as the operating system for quant research and investing, enabled by agentic AI and integrated workflows, with ongoing emphasis on human supervision and disciplined governance.

1/12/2026
Google CloudGoogle Cloud

AuraInspector: Auditing Salesforce Aura for Data Exposure

AuraInspector reveals how misconfigurations in Salesforce Aura and the GraphQL Aura controller enable data exposure, and provides an automated tool to audit access controls and remediate vulnerabilities.

1/12/2026
Apple MLApple ML

DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search

DeepMMSearch-R1 enables on-demand, multi-turn web searches for multimodal LLMs with image-driven query crafting, self-reflection-based refinement, and a two-stage training pipeline that includes the DeepMMSearchVQA dataset.

1/12/2026
Apple MLApple ML

Multivariate Conformal Prediction using Optimal Transport

A principled, distribution-free framework for multivariate conformal prediction that uses optimal transport to extend conformity scores to multidimensional outputs and analyzes the practical trade-offs of OT-based scoring.

1/12/2026
Apple MLApple ML

MoEs Are Stronger than You Think: Hyper-Parallel Inference Scaling with RoE

Explores hyper-parallel inference with RoE to turn Mixture-of-Experts into a dynamic, stochastic ensemble that samples multiple expert outputs per token for higher accuracy with reduced compute and no fine-tuning.

1/12/2026
AWS MLAWS ML

How Omada Health scaled patient care by fine-tuning Llama models on Amazon SageMaker AI

Omada Health scales personalized nutrition coaching by fine-tuning Llama 3.1 on Amazon SageMaker AI using QLoRA, enabling HIPAA-compliant, real-time nutrition education with LangSmith-based evaluation.

1/12/2026
Apple MLApple ML

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

MANZANO is a simple, scalable unified multimodal model that couples a hybrid image tokenizer with a shared vision encoder and dual adapters to support continuous image-to-text understanding and discrete text-to-image generation within a single autoregressive LLM, aided by an auxiliary diffusion decoder that translates image tokens to pixels, all trained under a unified recipe to achieve state-of-the-art results with minimal task conflicts.

1/11/2026
AWS MLAWS ML

Crossmodal search with Amazon Nova Multimodal Embeddings

Unified crossmodal search enabled by a single Amazon Nova Multimodal Embeddings model that maps text, images, audio, video, and documents into one shared vector space for end-to-end ecommerce retrieval using cosine similarity, S3 Vectors, and Bedrock-backed embeddings.

1/10/2026
Lambda LabsLambda Labs

NVIDIA's Vera Rubin NVL72 coming to Lambda's Superintelligence Cloud

Vera Rubin NVL72 racks join Lambda's Superclusters, delivering a 72-GPU NVLink domain to enable production-scale AI with model-parallel training and MoE-powered inference at higher efficiency.

1/9/2026