INDEX
Explanations
actions and events related to competitions and achievements
New Auto-Interp
Negative Logits
kontakte
-0.17
oola
-0.15
Ì£
-0.15
odyn
-0.15
rzy
-0.14
ackers
-0.14
обоÑĢ
-0.14
arakter
-0.14
apons
-0.14
enda
-0.13
POSITIVE LOGITS
egl
0.19
ез
0.17
?,
0.14
OUCH
0.14
finishes
0.14
Rough
0.13
еко
0.13
ppo
0.13
éĶ
0.13
atable
0.13
Activations Density 0.175%