Explanations
instances of the word "moderate"
references to moderate individuals or groups
Negative Logits
arium     -0.80
borne     -0.72
OGR       -0.71
chy       -0.71
stals     -0.70
ADRA      -0.70
tyard     -0.70
raltar    -0.70
metry     -0.68
ilage     -0.68
Positive Logits
erate     1.05
sized     0.93
minded    0.89
xual      0.85
(<        0.76
medi      0.75
leaning   0.73
(~        0.69
fare      0.65
lees      0.65
Activations Density 0.039%