INDEX
Explanations
instances of words related to situations or events that may cause concerns or discussions
terms related to safety and risk events
Negative Logits
ģĸ         -0.98
Adapt      -0.90
ahime      -0.85
guiName    -0.83
obook      -0.79
derived    -0.78
ESE        -0.76
ranch      -0.73
thood      -0.72
abase      -0.72
Positive Logits
bruises         1.43
gunshots        1.28
gunfire         1.27
tears           1.24
yelling         1.24
explosions      1.18
noise           1.17
cursing         1.17
stares          1.16
distractions    1.15
Activations Density: 0.322%