INDEX
Explanations
mentions of social issues like homelessness and obesity
terms related to homelessness and obesity
New Auto-Interp
Negative Logits
estone
-0.73
parts
-0.72
compan
-0.70
ube
-0.68
ussian
-0.67
icles
-0.66
antioxid
-0.66
iman
-0.64
maid
-0.62
azine
-0.62
POSITIVE LOGITS
prevention
0.92
yip
0.90
plagued
0.88
remission
0.85
lust
0.84
plag
0.84
worsened
0.81
Prevention
0.80
stricken
0.78
stemming
0.77
Activations Density 0.060%