INDEX
Explanations
references to controversies involving celebrities
Negative Logits
Caleb     -0.16
AFX       -0.15
NK        -0.15
IFn       -0.14
xcb       -0.14
Ariel     -0.14
erture    -0.14
ç´Ļ       -0.14
лини      -0.13
vamp      -0.13
Positive Logits
Cosby     0.52
Cos       0.38
.Cos      0.36
COS       0.35
cos       0.34
Cos       0.33
cos       0.30
(cos      0.29
Bill      0.29
_cos      0.28
Activations Density: 0.003%