Explanations
intense emotions or aggressive behavior, particularly in discussions of conflict
Negative Logits
spin          -0.17
اÙĪØª        -0.15
PROFITS       -0.15
iš            -0.15
oman          -0.15
Ù쨧ÙĤ        -0.14
inan          -0.14
pled          -0.14
vsp           -0.14
NEGLIGENCE    -0.14
Positive Logits
aler          0.18
roit          0.17
GAME          0.15
acht          0.15
pity          0.15
жÑĥ          0.14
uet           0.14
Sanctuary     0.14
uby           0.14
effective     0.14
Activations Density 0.305%