INDEX
Explanations
references to a specific entity named "Lo" at varying levels of adoration and specificity
references to the term "Lo" in various contexts
New Auto-Interp
Negative Logits
manship
-0.85
itated
-0.79
pillar
-0.77
EMENT
-0.70
Equality
-0.70
rations
-0.69
itates
-0.68
chell
-0.68
itating
-0.67
ITAL
-0.66
POSITIVE LOGITS
veland
1.26
zzi
1.13
fty
1.11
ppy
1.07
vers
1.05
zzle
1.03
aned
1.01
vel
0.98
ven
0.98
pper
0.98
Activations Density 0.027%