Yelp Engineering and Product Blog

How Yelp Built a Back-Testing Engine for Safer, Smarter Ad Budget Allocation

Samuele Mazzanti, Applied Scientist
Feb 2, 2026

Introduction Modern advertising platforms are fast-paced and interconnected: even small adjustments can have ripple effects on how ads are shown, how budgets are spent, and the value advertisers get from their ad spend. At Yelp, Ad Budget Allocation means splitting each campaign’s spend between on‑platform inventory (our website, mobile site, and app) and off‑platform inventory (the Yelp Ad Network). We optimize this split to meet advertisers’ performance goals while growing overall revenue. Due to the complexity of the budget allocation system and its feedback loop, even small changes can lead to unexpected system‑wide effects. To help us safely evaluate changes,...

S3 server access logs at scale

Nurdan Almazbekov, Infrastructure Security
Sep 26, 2025

Introduction Yelp heavily relies on Amazon S3 (Simple Storage Service) to store a wide variety of data, from images, logs, database backups, and more. Since data is stored on the cloud, we need to carefully manage how this data is accessed, secured, and eventually deleted—both to control costs and uphold high standards of security and compliance. One of the core challenges in managing S3 buckets is gaining visibility into who is accessing your data (known as S3 objects), how frequently, and for what purpose. Without robust logging, it’s difficult to troubleshoot access issues, respond to security incidents, and ensure we...

Exploring CHAOS: Building a Backend for Server-Driven UI

Jonathan Baird, Software Engineer; Xin Shen, Software Engineer
Jul 8, 2025

A little while ago, we published a blog post on CHAOS: Yelp’s Unified Framework for Server-Driven UI. We strongly recommend reading that post first to gain a solid understanding of SDUI and the goals of CHAOS. This post builds on those concepts to delve into the inner workings of the CHAOS backend and how it generates server-driven content. To briefly recap, CHAOS is a server-driven UI framework used at Yelp. When a client wants to display CHAOS-powered content, it sends a GraphQL query to the CHAOS API. The API processes the query, requests the CHAOS backend to construct the configuration,...

Revenue Automation Series: Testing an Integration with Third-Party System

Anukriti Mishra, Software Engineer; Chukwuemeka Okobi, Software Engineer
May 27, 2025

Background As described in the second blog post of Revenue Automation series, Revenue Data Pipeline processes a large amount of data via complex logic transformations to recognize revenue. Thus, developing a robust production testing and integration strategy was essential to the success of this project phase. The status quo testing process utilized the Redshift Connector for data synchronization once the report was generated and published to the data warehouse (Redshift). This introduced a latency of approximately 10 hours before the data was available in the data warehouse for verification. This delay impacted our ability to verify whether the changes were...

Nrtsearch 1.0.0: Incremental Backups, Lucene 10, and More

Sarthak Nandi and Andrew Prudhomme
May 8, 2025

It has been over 3 years since we published our Nrtsearch blog post and over 4 years since we started using Nrtsearch, our Lucene-based search engine, in production. We have since migrated over 90% of Elasticsearch traffic to Nrtsearch. We are excited to announce the release of Nrtsearch 1.0.0 with several new features and improvements from the initial release. Glossary EBS (Elastic Block Store): Network-attached block storage volumes in AWS. HNSW (Hierarchical Navigable Small World): A graph-based approximate nearest neighbor search technique. Lucene: An open-source search library used by Nrtsearch. S3: Cloud object storage offered in AWS. Scatter-gather: A pattern...

Journey to Zero Trust Access

Carlos B. Hernandez, Software Engineer; Adam Skalicky, Software Engineer
Apr 15, 2025

Glossary ZTA: zero trust architecture SAML: security assertion markup language (an SSO facilitation protocol) Devbox: a remote server used to develop software Zero Trust Access Remote Future Yelp is now a fully remote company, which means our employee base has become increasingly distributed across the world, making secure access to resources from anywhere a critical business function. Yelp historically used Ivanti Pulse Secure as the employee VPN, but due to the need for a more reliable solution, it became clear that a change was necessary to ensure secure and consistent access to internal resources. The Corporate Systems and Client Platform...

Revenue Automation Series: Building Revenue Data Pipeline

Yizheng Zhang, Software Engineer; Yirun Zhou, Software Engineer
Feb 19, 2025

Background As Yelp’s business continues to grow, the revenue streams have become more complex due to the increased number of transactions, new products and services. These changes over time have challenged the manual processes involved in Revenue Recognition. As described in the first post of the Revenue Automation Series, Yelp invested significant resources in modernizing its Billing System to fulfill the pre-requisite of automating the revenue recognition process. In this blog, we would like to share how we built the Revenue Data Pipeline that facilitates the third party integration with a Revenue Recognition SaaS solution, referred to hereafter as the...

Search Query Understanding with LLMs: From Ideation to Production

Loc Trinh, Software Engineer; Ali Rokni, Tech Lead; John Hawksley, Group Tech Lead
Feb 4, 2025

How we bring LLM intelligence to millions of daily searches at Yelp. From the moment a user enters a search query to when we present a list of results, understanding the user’s intent is crucial for meeting their needs. Were they looking for a general category of business for that evening, a particular dish or service, or one specific business nearby? Does the query contain nuanced location or attribute information? Is the query misspelled? Is their phrasing unusual, so that it might not align well with our business data? All of the above questions represent Natural Language Understanding tasks where...

Yelp

Engineering

Engineering Blog

How Yelp Built a Back-Testing Engine for Safer, Smarter Ad Budget Allocation

S3 server access logs at scale

Exploring CHAOS: Building a Backend for Server-Driven UI

Revenue Automation Series: Testing an Integration with Third-Party System

Nrtsearch 1.0.0: Incremental Backups, Lucene 10, and More

Journey to Zero Trust Access

Revenue Automation Series: Building Revenue Data Pipeline

Search Query Understanding with LLMs: From Ideation to Production

About

Discover

Yelp for Business Owners