INDEX
Explanations
mentions of the word "shots."
occurrences of the word "shots."
New Auto-Interp
Negative Logits
rador
-0.85
gres
-0.78
Myth
-0.71
ŃĶ
-0.70
ricular
-0.65
ding
-0.65
adian
-0.64
feasibility
-0.63
yrinth
-0.63
ply
-0.62
POSITIVE LOGITS
guns
0.97
gun
0.97
shots
0.97
hell
0.91
shot
0.90
shot
0.88
creen
0.87
Shots
0.86
fired
0.86
shots
0.85
Activations Density 0.014%