Inside Ads, Besides Design: Ad Tech
2021|Type: Edited|Tag: Theory
The main players
Publisher: The owners or suppliers of digital advertising space, with a commercialisation team to increase revenue by traffic monetization.
Ad Networks and Exchanges: The platform bundles ad space from the publishers and sells it to advertisers. The goal is to consider multiple parties' interests to improve ad matching efficiency.
The concept
eCPM = r(a, u, c)
This function highly generalizes the revenue per ad impression from the ad network perspective. The expected revenue for the ad network to display an ad a each time a user u accesses or searches, under the situation of context c. It defines the core value of advertising.
Billing method for advertising
CPT, Cost per Time: The ad space is delivered to the advertiser exclusively and charged for an exclusive period.
CPM, Cost per 1000 Impressions: Shows the amount spent on an ad campaign, divided by impressions, multiplied by 1,000.
CPC, Cost per click: Shows how much, on average, each link click costs you.
CPS, Cost per sale / CPA, Cost per Action / ROI: Shows the amount spent by sales, action or ROI. The intent is to get close to the advertiser.
oCPM, Optimized Cost Per 1000 Impressions: The new approach that allows you to prioritize the marketing goal, and deliver ads towards these goals in the most effective way possible.
The conflict between the advertiser and the publisher
The gross margin equals the difference between average transaction value and CPS, multiplied by sales volume.
Publishers and advertisers are individually focused on the closest billing metrics and have opposite expectations, while all conversions in between are subject to uncertainty. This contradiction is the basis of logic in the ad network.
The commonality between the publisher and the ad network
The goal is to improve the efficiency of traffic monetisation in ads. Overall this can be summarized in terms of automation in ad delivery, and we usually look at five elements - budget, targeting, bids, creative and landing pages.
Automatic budget allocation. Set budgets for campaigns and accounts, and make decisions on whether volume or cost is guaranteed.
Automatic targeting audience matched. It has mainly experienced two stages. The first stage utilizes the second-party data for mining and extracts the advertising audience through label combination and label extraction to execute an ad delivery. The second stage utilizes the advertiser's first-party data to do the expansion and split by hierarchy to complete ad delivery, or furthermore, executes the bidding by sub-targeting the audience to directly impact the retrieval, scoring and reranking.
Automatic bidding. oCPX has emerged to integrate all of the budget control, conversion rate prediction, and bidding on the ad network side. Bidding directly has an impact on the real-time ranking of eCPMs.
Dynamic Creative. Upload all the advertising materials at once, the system automatically completes the combination of materials and testing.
Landing pages. Currently landing pages focus on industry demand tools as well as templates and are still at the cooling-off period of automation level.
The predictive modelling
Earlier ads were mainly cost per click or action.
eCPM = oCPA * CVR * CTR * 1000
Because the system can't know the true CVR and CTR until the ad is displayed, it is generally predicted through machine learning.
eCPM = oCPA * pCVR * pCTR * 1000
The bidding
ROI based on ad delivery, the common strategies of advertisers can be divided into three categories: conservative, conventional and aggressive. (Oceanengine divides the bidding scenarios into five categories: the upper limit of cost-controlled delivery, the balanced delivery, the climax buying prioritized, the climax buying and the conversion cost-optimised, and the climax buying)
The restriction
Budget constraint. The budget of campaigns or accounts set by the advertisers, the upper limit of the amount spent on advertising over some time.
Traffic constraint. The publishers also have limited ad spaces.
User experience. Advertising is not only the publishers, the advertisers, and the ad network but also the users. Ads that are focused on revenue the ad network will flood with overpriced, low-quality ads. This is the need to improve the user experience of ads, especially in ad formats. The usual approach is to quantify the user experience or introduce a price squeeze factor.
The Retrieval Phase
The goal
The retrieval phase is facing the whole ad library with a large search engine. User-interested-ad being recalled may be a variety of perspectives. In the next-Gen ad network, high quality but low quantity is the requirement of the retrieval. Before the next-Gen ad network, models meet low precision because simple and efficient is sufficient.
The measurement
Diversity of retrieval
The present
Before the next-Gen ad network, which will be by 2022, the approach was mainly targeting and supplemented by recall. Among them, the recall is a multi-channel recall. It can be roughly divided into TAG branch, ANN branch and new branches being explored.
TAG branch. That is, recall by label, on this basis combined with dynamic adjustment of weights and labels. mainly include:
ANN branch. That is, vectorized recall, the model is currently mainly the deep structured semantic models of user and item. That has two shortcomings. First, the expression of the embedding is limited; second, this model is originally based on business goals and does not use the embedding similarity of the item as an optimization constraint. The ANN requires the similarity of items before. This process can be u2i or u2i2i. At present, ANN branches mainly include:
The new branches being explored mainly include:
It is expected to use intelligent targeting with multi-channel recall to perform Top K and merge in the future. include:
Combined with retrieval and filtered rules, such as filtered by targeting, budget, etc. Take 10,000 to 20,000 to enter the next stage.
The disadvantage of this is that the overlap of multi-channel recall is high and resources are wasted. The retrieval model is Learn-to-Rank in the next-Gen ad network. This time we focus on the model mainly, supplemented by the retrieval. It will be demonstrated in the next stage.
How it works
The following describes how recall takes effect in the ad network after the audience targeting is selected by the advertiser.

The Scoring Phase
The goal
Balance of precision and speed: the algorithm of the scoring model requires high computing performance, and a relatively simple and fast algorithm is needed.
Correlation with the goal of the reranking model: from the perspective of the ad network, select the ads that are considered high-performance by the reranking model.
The measurement
How it works

To ensure system performance and prevent excessive processing time, each stage of the ad selection process incorporates a pre-sorting and truncation mechanism. This mechanism limits the number of ads proceeding to the next stage by applying a fixed upper bound. In the first two pre-sorting stages, offline CTR, CVR, and eCPM calculations are performed at the ad, account, and overall levels due to the absence of the LiteCxR model. The LiteCXR model is used for eCPM calculation in the third truncation stage.
Before the end of 2020, the coarse ranking system employed a pre-estimation algorithm based on post-impression value, namely LiteCXR. This algorithm estimated eCPM by separately predicting ad CTR and CVR after impression, incorporating price adjustment strategies. However, it faced several persistent issues:
The next-Gen ad system proposes a ranking learning algorithm based on fitting fine-ranking results. It utilizes the fine-ranking eCPM order as the optimization objective and employs a ranking learning LTR (learn to rank) model to ultimately select the ads with the highest estimated eCPM by fine ranking. This approach addresses the following challenges:
The primary goal of coarse ranking modelling remains to align the estimated eCPM of ads in coarse ranking with that of fine ranking. Several approaches can be explored, with a focus on:
The advantage of "value" over "order" is that:
The current preference at this stage in 2022 is to indirectly fit the values of the eCPM of the reranking model by directly learning the Click-though-conversion-rate of the reranking model or the eCPM divides bid values.
Use the Pointwise algorithm to train samples through value models: The training set and test set are randomly divided according to the request. Each request can construct one 9-tuple sample <user, ad, pCTR, pCVR1, pCVR2, bid1, bid2, eCPM1, eCPM2> or two 6-tuple samples <user, ad, pCTR, pCVR1, bid1, eCPM1> and <user, ad, pCTR, pCVR2, bid2, eCPM2>
Use the Pairwise algorithm to train samples through order models: The problems here focus on how to construct pairs, how to sample pairs, and how to set sample weights. However, there are probably a lot of reranking ads in the queue, which involves the sampling problem of training samples. After the demonstration, the positive and negative examples used were sampled from different intervals and the positive and negative examples were sampled from the same interval.
Use the Pointwise algorithm and the Pairwise algorithm to train samples through binary classification models.
The Ranking Phase
The goal
How it works

References
© 2024 Xiang PENG. All Rights Reserved.