티스토리 뷰

카테고리 없음

Thompson Sampling for CPA model

최영민85 2018. 10. 18. 11:56


In a CPA payment model, the expected value of an impression : 


Model 

Notation 

CTR Model 

of the training set is given by: 

 

Choose a simple prior of Wck , 

 

Then we use Laplace approximation which enables us to approximate the posterior with a Gaussian distribution.

: the local minimum of the log-posterior

Also we can derive the following covariance :

      

However, since the sampling time for every variable can be problematic that can leads to a problem of prediction time, we better go more in depth.

Let 


CVR Model 

of the training set is given by: 

 

Choose a simple prior of Wcv , Wd

 

Then we use Laplace approximation which enables us to approximate the posterior with a Gaussian distribution.

: the local minimum of the log-posterior

Also, we can derive the following covariance :

      





However, since the sampling time for every variable can be problematic that can leads to a problem of prediction time, we better go more in depth.

Let 


Thompson Sampling

1. Sample a set of parameters according to the posterior after seeing the training data: 

2. Choose the action that is optimal with respect to the sampled set of parameters: 

공지사항
최근에 올라온 글
최근에 달린 댓글
Total
Today
Yesterday
링크
TAG
more
«   2025/05   »
1 2 3
4 5 6 7 8 9 10
11 12 13 14 15 16 17
18 19 20 21 22 23 24
25 26 27 28 29 30 31
글 보관함