INDEX
Explanations
expressions of personal beliefs and self-identity
New Auto-Interp
Negative Logits
agh
-0.15
ivy
-0.14
rames
-0.14
Davidson
-0.14
Vide
-0.13
åºĦ
-0.13
borg
-0.13
okens
-0.13
ONY
-0.13
inventory
-0.13
POSITIVE LOGITS
ammo
0.17
shoot
0.17
shoot
0.17
Shoot
0.16
directors
0.15
shooting
0.15
Shoot
0.14
ìĬ
0.14
audiences
0.14
strugg
0.14
Activations Density 0.041%