INDEX
Explanations
expressions of excitement and pride related to special events or upcoming shows
New Auto-Interp
Negative Logits
ylland
-0.17
odo
-0.16
ories
-0.15
loquent
-0.15
одо
-0.15
ledo
-0.15
å½¹
-0.15
ouden
-0.14
403
-0.14
_quit
-0.14
POSITIVE LOGITS
sted
0.15
Gym
0.15
å¨ľ
0.15
pedo
0.14
helicopt
0.14
akis
0.14
hel
0.14
vek
0.14
ãģ¿
0.13
_nth
0.13
Activations Density 0.016%