INDEX
Explanations
mentions of physical aggression or forceful actions
occurrences of the word "lash" and its variations, indicating a focus on expressions of anger or criticism
New Auto-Interp
Negative Logits
ĨĴ
-0.82
ublic
-0.70
umer
-0.70
Friendly
-0.68
unknown
-0.68
ccess
-0.67
missions
-0.67
cession
-0.65
Private
-0.65
Sorceress
-0.64
POSITIVE LOGITS
lash
1.49
lashes
1.12
Lash
0.97
lashed
0.95
whip
0.82
uate
0.79
oons
0.77
whipping
0.76
furnace
0.76
bolt
0.74
Activations Density 0.006%