2 m B, A) Y# `5 c! `(1) Top-k kk Performance:% U) p5 A6 m8 v* J+ |7 x( V. w
(Precision@ k ) P @ k : = 1 k ∑ l ∈ rank k ( y ^ ) y l \text{(Precision@$k$)}\text{P}@k := \frac{1}{k}\sum_{l \in \text{rank}_k (\hat{\mathbf{y}})} \mathbf{y}_l1 p4 Z$ L# f9 n6 u8 H
(Precision@k)P@k:= $ [( k" Q, z- W& ]9 X
k 6 V; S k3 O! p# G" `. [& P8 y: e1 " @: l( h) j& O0 l - e5 \* s2 f. |7 H* F. W! f& T( j v$ e+ q# ?
l∈rank % S- }- v, N, s+ W
k6 h5 S8 G) c8 R2 {! D
( Y2 F% t6 c/ N/ t5 p7 N. R# w
( ) R% \' t d$ z$ ? Ny 7 W$ \% h; g& [" o0 ^1 z^* D- u. R; }+ L3 {! n' a
) m- ?' F& u3 r% y$ f$ @) l ) 3 u9 D" I! ?' g$ ^ r4 q* y H( O∑ + t C2 E: e. a ) ~% c: V$ W$ T; j y & y1 z, f9 b. }: H @
l: ^ W( ^- v9 Z, Q q5 P
7 e4 V0 J9 m- L. c- y* K
C* u; A8 l; O% j5 g1 k6 U6 U( }6 @# u; X# a6 h
(Discounted Cumulative Gain (贴现累积收益))DCG @ k : = ∑ l ∈ rank k ( y ^ ) y l log ( l + 1 ) \text{(Discounted Cumulative Gain (贴现累积收益))} \text{DCG}@k := \sum_{l \in \text{rank}_k(\hat{\mathbf{y}})} \frac{\mathbf{y}_l}{\log(l+1)} $ [& D+ x0 u) x* X( f) X, Q(Discounted Cumulative Gain (贴现累积收益))DCG@k:= # N' `" F& ^, T+ A7 R
l∈rank # M5 D# O6 f, x5 d: F% J4 T; \* Yk5 v+ |! o4 s! ?8 n
+ W, h# b9 w) N) ^0 W1 e ( ' {3 B3 E0 a! u0 u; x. X9 c
y $ Z- z) l; r( ], J- {3 N^ 2 U" X. i8 c# `( i, E I; }8 n } 9 Q& J2 _4 t4 N0 a ) . j8 T2 H( {( D) l6 n∑ 4 ]* M! c5 ^2 x1 r / ]. }( M g6 R+ e7 l( o ' s6 ?2 ~7 _9 A) U, c- Ilog(l+1)8 P2 x# ~5 G( [: D) x
y 9 u' S+ v( e$ ^" f: J. p1 Dl4 f6 i6 `* d2 {& p$ I
+ }- [# {6 g: X0 Q# t* l3 e* m - L* u @! V5 n/ s 7 L# C+ k: n4 |" [+ q/ D) o 6 v1 b4 |+ d2 V- h( S+ n1 g/ }2 j
(Normalized DCG)nDCG @ k : = DCG@ k ∑ l = 1 min ( k , ∣ ∣ y ∣ ∣ 0 ) 1 log ( l + 1 ) \text{(Normalized DCG)} \text{nDCG}@k := \frac{\text{DCG@$k$}}{\sum_{l=1}^{\min(k,||\mathbf{y}||_0)} \frac{1}{\log(l+1)}} 9 K, f6 b5 M- ~. l(Normalized DCG)nDCG@k:= $ @$ `% t' t) o+ G9 N
∑ 3 w1 u: u% G2 A% _7 ]6 b xl=1 ' o% @8 ?. c5 X5 bmin(k,∣∣y∣∣ 4 x( Y' L& r D0$ D2 \8 {/ X% ~$ X! L. A6 o
( ~, u9 Q7 Z% Z7 e0 d' B N( }+ ~
)8 u/ X4 p) Q. e1 s F
$ x5 p5 ~' `4 t) ]3 w9 B
9 l) |- b5 D9 ^# ^# `8 p
log(l+1)4 i+ l7 x" e3 w2 \5 J& |0 E. F
1 ( b& J, B1 k V2 a' y+ k9 Q) t4 r% |& ]- E$ n, V6 O
5 r7 z: P4 x9 B! O. l+ k/ s% QDCG@k ' ` y M! D" k; f9 {. I4 ~/ y4 k% @- O% q5 d+ y% C
% Z, @" w0 q- ?0 Q. R# R
- ^3 U3 S/ h6 Q5 A' g6 R
rank k ( y ) \text{rank}_k(\mathbf{y})rank ( L. o: \# [* }' }$ Dk6 N7 c& @5 w( B* a" u
6 D3 L' ?& S& Y2 g) d! C8 N7 z
(y)为逆序排列y \mathbf{y}y的前k个下标。Note: DCG公式里的分母实际上不是l,而是from 1 to k.7 i) E- X! A' H& I5 s6 C
2 m/ S% E( }; ]! h$ L" M* @靠后的标签按照对数比例地减小,说白了就是加权。至于为什么用log?两个事实:1. 平滑缩减; 2. Wang等人提供了理论支撑说明了log缩减方式的合理性。The authors show that for every pair of substantially different ranking functions, the nDCG can decide which one is better in a consistent manner. (看不懂,暂时不管) . C! D& m3 F' ]8 l4 |+ Q3 l2 S . Z+ r8 c' g1 H) I6 h C M(2) Top-k kk Propensity-score:9 R1 H" w& ^5 r& D& h; o+ f2 Q& a
a3 w1 I% ], O! I; U/ D有些数据集包含一些频度很高的标签(通常称之为head labels),可以通过简单地重复预测头部标签来实现高的P @ k \text{P}@kP@k。Propensity-score可以检查这种微不足道的行为。7 l1 O) ~$ A- G) s! D- Y
( Propensity-score Precision ) PSP @ k : = 1 k ∑ l ∈ rank k ( y ^ ) y l p l (\text{Propensity-score Precision}) \text{ PSP}@k := \frac{1}{k} \sum_{l\in \text{rank}_k(\hat{\mathbf{y}})} \frac{\mathbf{y}_l}{p_l} 4 F4 V, O! U3 c8 W7 x+ \9 x- D(Propensity-score Precision) PSP@k:= 6 J+ V' e) f5 {9 m
k! C q9 ] {% Z
1) H' B! y# g S7 r9 L& |
/ Y2 ]( n( _% } $ |1 d% h" i* Pl∈rank / M& @( [9 u! yk + L5 m2 A2 T$ d* @" x% _ * B6 q6 p: r( u$ Q) D ( ' S0 J/ ~ ]+ @y 9 I' r2 Y2 x1 t$ \$ h^ / ~1 v$ n! p/ o% w3 G E/ k& [! X) ]5 j) E
) # Y+ y2 _" H6 N0 }5 C) a& e# f- {9 `∑ * n! o! S# h3 c$ q$ ~. v- ~; ]0 ^! R" h% D5 f
8 y5 Q+ ]. V% l" s! R2 Z( xp ! h" I9 m* T: O- m6 b
l9 C8 j- q( z8 o: a
7 w. s t3 i9 P ( M& a' L) b, iy , i) n+ `! k5 A% nl" m+ G l+ ~: n! i* A( {
5 N, W: W7 {8 k. A4 b+ ` 0 A# x0 W) Z3 [, | A0 `$ g' y. b: x( B: J* b
% L5 [" _ A7 O2 s- W % U" u( S) u9 Z& c; ]0 H& l' VPSDCG @ k : = ∑ l ∈ rank k ( y ^ ) y l p l log ( l + 1 ) \text{PSDCG}@k := \sum_{l \in \text{rank}_k(\hat{\mathbf{y}})} \frac{\mathbf{y}_l}{p_l\log(l+1)} . F! s9 Y; L( e1 d7 R6 SPSDCG@k:= - Y5 e- o5 i# N0 Ul∈rank ) j- s9 _6 }, m, v. C' L; A, l
k& c6 y; b$ X W# Y* x
+ ~: ` V$ k* w! ]: }7 R" `
( * i$ P) M$ m/ p5 by / h" `: Q+ n: h6 _^3 e. B, S( b1 U
+ w/ ~2 @$ Y8 m6 H+ F ) 4 g* [& n4 _1 L' _∑ & L" i) U, ?# C$ S0 B. I$ v; ~$ z- h) R7 d
! t! G1 x& b, X5 M- R8 \p 2 d3 {2 W8 m, |8 H5 Fl / P3 ^/ `9 b- }5 L 5 j0 |/ ~) u3 a- r& C; ^/ h: w log(l+1) , A% X9 l) p- I& By 5 o p6 X5 E& ol5 T& C% o7 ?3 K+ u) _& O
9 b( @& J) e0 E) u6 N+ D! I& c) N d `- D- T0 T$ ]2 ~9 L' A
* R1 J) Q9 S7 W! i! @$ v+ W
1 n% w) _4 \+ o
7 `1 d9 q" I2 S$ F4 I( S4 ?
PSnDCG @ k : = PSDCG@ k ∑ l = 1 k 1 log ( l + 1 ) \text{PSnDCG}@k := \frac{\text{PSDCG@$k$}}{\sum_{l=1}^{k} \frac{1}{\log(l+1)}} # Q' r$ S6 f' @ c# yPSnDCG@k:= ' _9 m X) V/ U3 n5 r3 `( X
∑ # Z: w v# h0 v7 b9 J0 p
l=1 - k% ] p- r. x1 U( ck, G! N" R& n I+ V9 [
/ b+ r) Q* w' c" t+ E( Q3 z2 X! K% ?5 `5 U+ E5 Y; g/ ]2 s) i) @
log(l+1)9 J2 y* j! U+ w, e, @1 W4 v
1# F" `0 O' g1 O8 a
7 q# f3 t1 `* V8 D2 o # w9 I' ^$ ], O i2 ^9 BPSDCG@k; o1 ~ ]; m- O' d0 B" s- u
, Y5 `7 B- e! M; c& X) r2 p/ _
1 C+ ]( Z& P4 `) a3 \! |8 W* ~8 R
其中p l p_lp 7 H3 I; r: _, d1 D( |l' \6 \) g t3 H8 a
$ q8 A! v6 B5 B# l
为标签l ll的propensity-score,使得这种度量在missing label方面无偏差(unbiased)。0 P' \% }% E8 V7 j" a
Propensity-score强调在tail labels上的表现,而对预测head labels提供微弱的奖励。1 {6 i' j# {) W) i- P
———————————————— 4 ?# n2 y' M5 h4 B7 Z% Q* s6 l& \7 U版权声明:本文为CSDN博主「摆烂的-白兰地」的原创文章,遵循CC 4.0 BY-SA版权协议,转载请附上原文出处链接及本声明。 6 K2 V g3 R9 c原文链接:https://blog.csdn.net/wuyanxue/article/details/126805190 M" W. j' ]' c! I$ U
1 C/ g! N$ I% ?4 B6 ]2 l7 x1 r, ^
* _% O: ]5 E5 H7 d* b/ T" t4 A