INDEX
Explanations
mentions of people being physically injured
mentions of injuries in various contexts
New Auto-Interp
Negative Logits
SpaceEngineers
-0.77
tz
-0.72
ularity
-0.70
perm
-0.68
gency
-0.65
etics
-0.65
graph
-0.64
AMA
-0.64
hist
-0.63
snipp
-0.62
POSITIVE LOGITS
jured
0.85
bystanders
0.81
Survivors
0.73
wounding
0.72
sustained
0.71
../
0.70
mond
0.69
injuring
0.68
nikov
0.68
monton
0.66
Activations Density 0.038%