INDEX
Explanations
words related to the concept of "othering."
the term "other" or variations and mentions of Google
New Auto-Interp
Negative Logits
edded
-0.72
noon
-0.66
lished
-0.65
elig
-0.65
yrinth
-0.64
stacked
-0.64
ials
-0.63
ially
-0.63
consisted
-0.62
zero
-0.62
POSITIVE LOGITS
othe
1.32
otle
0.88
phe
0.88
zyme
0.84
ogle
0.80
azy
0.78
ogi
0.77
ocal
0.77
osit
0.75
ãĥĥãĥĪ
0.75
Activations Density 0.012%