INDEX
Explanations
instances of tear gas being used
references to tear gas
Negative Logits
orea          -0.73
atar          -0.70
ammy          -0.69
ancial        -0.68
ItemTracker   -0.68
raviolet      -0.67
merce         -0.66
Tanz          -0.65
nobility      -0.65
yrinth        -0.64
Positive Logits
adoes         0.95
bow           0.91
bows          0.89
ful           0.84
iffs          0.83
tear          0.81
fully         0.81
away          0.79
itri          0.73
brid          0.72
Activations Density 0.010%