INDEX
Explanations
mentions of Chinese actors and their works
New Auto-Interp
Negative Logits
794
-0.16
argas
-0.14
Narr
-0.14
sy
-0.13
ç¡
-0.13
795
-0.13
Sesso
-0.13
Sanity
-0.13
HCI
-0.13
796
-0.13
POSITIVE LOGITS
tvb
0.28
mainland
0.22
ATV
0.21
Cant
0.20
Canton
0.19
TV
0.19
Chow
0.19
CCTV
0.18
TV
0.18
tv
0.18
Activations Density 0.018%