Explanations
Instances of the word "hum" and its variations, indicating a focus on humility and related concepts.
Negative Logits
yonel     -0.19
hell      -0.17
ncia      -0.17
upo       -0.17
anje      -0.16
ень       -0.15
adem      -0.15
legate    -0.15
oler      -0.15
χή        -0.15
Positive Logits
pty       0.31
ankind    0.28
bug       0.27
iliated   0.27
iliate    0.26
mock      0.25
mus       0.25
mers      0.24
mer       0.23
bling     0.23
Activations Density 0.008%
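
One way to read the positive logits against the explanation is to treat each token as a possible continuation of "hum". The short sketch below does that by simple string concatenation. The continuation reading is an assumption made here for illustration, not something the dashboard states, and the helper name `complete` is hypothetical.

```python
# Positive-logit tokens copied from the list above, highest logit first.
POSITIVE_LOGIT_TOKENS = [
    "pty", "ankind", "bug", "iliated", "iliate",
    "mock", "mus", "mers", "mer", "bling",
]


def complete(prefix, suffixes):
    """Join a prefix with each candidate suffix token (plain concatenation)."""
    return [prefix + suffix for suffix in suffixes]


# Assumption: the feature fires on "hum" and boosts these tokens as the next piece.
completions = complete("hum", POSITIVE_LOGIT_TOKENS)
for word in completions:
    print(word)
```

Under that assumption only some of the resulting strings (such as "humiliate" and "humbling") relate to humility; others, such as "humankind" and "humbug", do not.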