INDEX
Explanations
physical actions or events related to police and public interactions
New Auto-Interp
Negative Logits
ivals
-0.80
oday
-0.70
anguage
-0.69
hess
-0.66
ancies
-0.66
vae
-0.65
ships
-0.65
IUM
-0.65
Families
-0.65
requires
-0.64
POSITIVE LOGITS
stretched
0.90
hers
0.89
clenched
0.83
ilts
0.73
protr
0.73
slit
0.71
palms
0.70
swollen
0.69
socket
0.68
shoulder
0.68
Activations Density 0.199%