INDEX
Explanations
words related to physical actions and interactions, particularly violent ones
actions and physical interactions, particularly involving conflict and intensity
New Auto-Interp
Negative Logits
ħĭ
-0.67
Effective
-0.67
Publication
-0.65
Ratings
-0.64
dayName
-0.63
Influence
-0.63
Scholars
-0.63
£ı
-0.63
nces
-0.62
ormons
-0.62
POSITIVE LOGITS
bed
0.70
animate
0.70
screaming
0.68
ankles
0.67
sewing
0.65
boarded
0.64
stairs
0.63
shack
0.62
bed
0.62
pup
0.62
Activations Density 3.893%