INDEX
Explanations
instances of the word "the" and associated discussions regarding marginalized perspectives
New Auto-Interp
Negative Logits
namely
-0.79
elaide
-0.77
pin
-0.75
ÃĥÃĤ
-0.73
staking
-0.72
beam
-0.71
prepares
-0.68
zai
-0.68
erton
-0.68
replace
-0.68
POSITIVE LOGITS
slightest
1.42
entirety
1.27
entire
1.19
truth
1.10
same
1.10
whole
1.09
evils
1.07
possibility
1.07
realities
1.04
ses
1.01
Activations Density 0.290%