INDEX
Explanations
words or phrases related to large-scale incidents or catastrophes, particularly involving harm to multiple individuals
references to mass events or occurrences, especially those related to violence or disasters
Negative Logits
nos        -0.78
BILITIES   -0.78
yssey      -0.75
thy        -0.75
thens      -0.74
mins       -0.73
tis        -0.73
uana       -0.71
shire      -0.68
ilk        -0.68
Positive Logits
achusetts      1.47
quantities     0.85
achus          0.85
transit        0.78
aging          0.77
exodus         0.76
incarceration  0.72
eval           0.69
ablishment     0.69
istg           0.68
Activations Density 0.016%