INDEX
Explanations
expressions related to threats and conflict
expressions of conflict or threat
New Auto-Interp
Negative Logits
lately
-0.71
amiliar
-0.59
lyak
-0.59
erd
-0.56
artney
-0.54
bered
-0.54
emis
-0.54
Enlarge
-0.54
itled
-0.54
Entered
-0.53
POSITIVE LOGITS
if
1.64
anytime
1.39
unless
1.38
whenever
1.32
someday
1.30
whoever
1.26
whichever
1.26
wherever
1.24
hereafter
1.16
unless
1.12
Activations Density 0.729%