From the Vault

 

 

Your SEO Fix

 

Popular
Add to Technorati Favorites


 

Ad Serving and User Performance Metrics Print E-mail

Using concepts for Ad Targeting - Click here for original Patent filing

 

This patent deals with serving Ads based upon keywords or moreover, topical ‘concepts’ to help and determine ad ‘relevancy’.  Yes, here I am once again discussing ‘relevance’ – I hope 2008 brings me a new hobby. This time I am looking ( thanks Bill ) at the Ad serving world. I do feel it is important as many of the concepts relating to establishing meaning or ‘themes’ to content/information/words is much the same in this area is it is in the ‘organic’ search processes.

There is a bunch of the usual mind numbers as far as the nuts and bolts of user account setting such as geo-location, budget, keywords desired, to the server side calls for number of ads to be displayed to size/real estate available for them. Of interest there is the checking for geo-location issues from say a search query to a regionally targeted ad (sounds like IP delivery er.. cloaking to me?).

 

What’s the point?

 

In describing the desire there is an interesting line;

an advertiser may attempt to target its ads to more narrow niche audiences, thereby increasing the likelihood of a positive response by the audience

And that’s is the crux of it; trying to define the Ads and the target audience as tightly as possible. By doing so the conversion rates should improve and (in theory) the advertiser can get more bang for their buck. Also, in a Pay-Per-Click model, the more targeted and higher quality the Ad, the more chance of getting the ‘click’ that the Ad server wants as per the business model. Tighter relevance and higher quality ads are an obvious benefit and motivation.

The parameters are defined by the advertiser at the time registering the ad who establishes, “one or more concepts to be used to target its ad, or indicate whether some concept indicator is relevant to its ad. So how is dinner being served?

 

candidate ads (that) have been determined to be relevant to the request using, at least, keyword targeting information.

 

If you already understand some of the concepts used in indexing and scoring of documents based on relevance in organic search indexes, much of the usual concepts are the same as far as keyword/phrase and the processing of ‘concepts’ is concerned. Of interest is that the ever popular ‘jaguar’ example is back (used in some recent relevance patents);

As another example, the query term "jaguar" could refer to the car by that name, the animal by that name, the NFL football team by that name, etc. If the user is interested in the animal, then the user might not be interested in search results which pertain to the car or NFL football team. Similarly, the user might not be interested in advertisements, targeted to the keyword "Jaguar," but that pertain to the car or NFL football team.”

So we are still looking at words as phrases or ideas/concepts as opposed to simplistic keyword models – which is termed as ‘concept similarity scores’. Most of the tightening or relevance pertains to this train of thought. See earlier ‘phrase based indexing and retrieval ‘ to get some ideas on these concepts or documentation on things such as ‘probabilistic latent semantic analysis’ - Latent Dirichlet Allocation - as they are a good core towards understand relevance in the indexing and retrieval process.

 

Also of interest in this method is that it;

tracks such performance and uses it to modify concept scores. Concept performance information management operations accept the performance of concepts in ad serving and may adjust concept performance information  accordingly. The concept performance information may include a number of entries, each including a concept  and at least one performance factor (such as a weight for example).

A performance factor may be tracked for one or more of (a) a concept source, (b) a concept in general, and (c) a specific keyword-concept relationship. Thus, for example, if an ad is served pursuant to a concept, from a concept source, because of the concept's association with a request keyword, one or more performance indicators of the ad (e.g., click-through, conversion, etc.) may be tracked and used to adjust a performance factor(s) of one or more of (a) the source of the concept (e.g., ODP, a classification technique such as a semantic classification technique for example, etc.), (b) a concept in general (e.g., across all source and/or all keywords), and (c) a keyword-concept relationship (to reflect the fact that the same concept may perform well when used for ad serving based on its association with one keyword, but may perform poorly for another keyword).

So, much like some other works lately, ‘performance’ or what some would call ‘click data’ or ‘bounce rates’ and so forth, come into play in the endeavour to find relevance in words/concepts. What people are doing is as important to determine intent as understanding the concepts of a give document/ad text. I can’t help but think some form of communication could not be somewhere to pass scoring elements from one ranking process to another.

 

Scoring Elements

 

The method discusses taking a core set of candidate documents that are determined by ‘(at least) keyword targeting information’ as well as further scoring is also achieved via;

  1. Ad performance information
  2. advertiser quality information
  3. ad price information
  4. an information retrieval score.

 

In many ways it is not only using the data to decide which Ads are to be placed in a given setting, but the order (ranking) of them as well assigning a ‘relative preference attribute’. The content of the target page is established via concept with a ‘concept vector’ attribute. Delivery of the Ads is based on understanding the delivery page concept and assigning Ads based from the similar concept attributes mentioned earlier.

 

 

Using past data for scoring

The scoring/relevance of the Ad, ( as per this method )  is also targeted/scored for relevance including such  factors such as tracked performance information. This area can include previous processed search queries to which the Ad would have been relevant. Targeting the ‘concepts’ can also include “other ads using the same or similar targeting criteria information”. So past runs of existing ads and comparison to other ads can come into play. The performance of  past Ads that have been served that are related to the ‘concept’ can be used as further scoring of the current set(s) of results to be scored.

This information can include advertisement selection/conversion data. The score can be increased or decreased based upon the performance being above or below a pre-defined threshold (performance level).  I must say they are more than a bit hazy on what a ‘conversion’ is, they have some variants – the most likely being;

The ratio of the number of conversions to the number of impressions of the ad (i.e., the number of times an ad is displayed) is commonly referred to as the conversion rate. If a conversion is defined to be able to occur within a predetermined time since the serving of an ad, one possible definition of the conversion rate might only consider ads that have been served more than the predetermined time in the past.

 

One can see how such scoring can also be used when deciding if a set of Ads appears in a search result page as opposed to a content ad placement (website). They can use relevant ads that have a higher historical click/conversion rate, to capture ‘clicks’ at a higher rate (conversion), in an advantageous location such as high traffic search segments (travel etc..). By increasing the likelihood of a conversion, they increase the revenue model. That’s what I would be thinking at least if it was my toy.

Less historically desirable or under-performing Ads, could be designated for a lower class of traffic locations. Sure, that could be a bit in the ‘conspiracy theory’ terrain, but it most certainly could be modelled that way.

 

Summary -

In simplest terms, much of the scoring is based upon;

(a) ad performance (b) ad price information, (c) advertiser quality (d) IR score

As always, we’re not setting the thresholds for each, so how ads are scored/ranked can vary. I can see how it certainly encourages a well written, well placed ($$) ad. If no one clicks your ad, they don’t make as much and the more they make on your ad (ad price) has its own metric. It’s only natural, oui? There is some more about the ‘similarity score’ but I don’t want to go to far into that with an older document as more recent methods may superseded to at least have had an effect on this particular process.

Published methods since this was filed seem more suited as far as ascertaining relevance. I would have to imagine that this system would have benefited from being able to attain a ‘similarity’ score from some of these other methods/models – or other merging. The underlying concepts derived from end user data as far as trying to achieve further ‘relevance’ further brings to light the use of such data as a scoring set. Also of interest is the combination of similarity scoring with ad performance scoring variables – something likely not commonly considered.

 

All Ads were not created equal.

(also see related parts; Part I, Part II and Part III and the summary on my Blog)  

 

Resources - this is part of a 3 part series - Summary; Relevance through end user metrics - Learning a probabilistic generative model for text - Ranking documents based upon large data sets - Using concepts for Ad Targeting.

Original Patent - Ranking documents based on large data sets

Patents of further interest - Predicting ad quality - Using estimated ad qualities for ad filtering, ranking and promotion - Using estimated ad qualities for ad filtering, ranking and promotion -

 
< Prev   Next >

Knowledge Base
Keyword or Phrase research and targeting practices

This article is not aimed at the advanced level reader here folks. I talk to many people that don't even spend a passing thought on the KW/P research - this posting is written to give the reader a foundation - not as an advanced guide. Of course, this being an in-exact science, there could be some gems in here for the ragged SEO warriors as well.

For a while now, I have been intending to give my perspective on Keyword/Phrase ( KW/P ) research and targeting. Why? Because I think it is an essential if not mandatory part of any SEO campaign. Each and every term you target will have a cost associate with it. This is where the rubber meets the road. In simplest terms, if I spend $500 on link building, $200 on content creation, $100 on site adjustments then I am looking at $800 invested. Once we reach the desired ranking, how long will it take to recoup that cash (including ongoing maintenance). In short, where is the ROI and what is the ‘break even’ point? That is a very simplistic example, but hopefully you get the idea.

Simply aiming to be #1 on Google is foolhardy since some terms contain little reward (traffic). Remember, cash that gets tied up chasing non-performing terms is money that could have been used elsewhere in your marketing endeavors. So, this is certainly an important step in the SEO process. Mistakes here can be very costly later on and in the over-all ‘big picture’ that is the sites financial health.

 

Read more...