INDEX
Explanations
words related to violent or extreme actions
occurrences of the word "acts" in various contexts
Negative Logits
Token       Logit
env         -0.71
puting      -0.70
rame        -0.67
UTC         -0.65
OVER        -0.65
edin        -0.63
board       -0.63
Loren       -0.62
DEM         -0.61
reference   -0.61
Positive Logits
Token       Logit
acts        3.66
act         2.28
Acts        2.14
deeds       1.59
actions     1.58
acted       1.54
acts        1.41
behaves     1.37
Act         1.31
acting      1.24
Activations Density 0.012%
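
For readers unfamiliar with the density figure, here is a minimal sketch of how such a number could be computed, assuming it means the fraction of token positions on which the feature fires (activation above zero). The function name, the threshold, and the toy data are hypothetical and are not taken from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens on which the feature fires.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical toy data: 3 firing tokens out of 25,000 positions.
acts = [0.0] * 24_997 + [1.2, 0.4, 3.1]
density = activation_density(acts)
print(f"{density:.3%}")
```

Under this reading, a density of 0.012% corresponds to the feature firing on roughly 12 of every 100,000 tokens, which fits a feature tied to a narrow family of words such as "acts" and "deeds".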