INDEX
Explanations
instances of the word "os."
New Auto-Interp
Negative Logits
nakalista
-0.66
◀
-0.60
autorytatywna
-0.59
distanciation
-0.58
Dage
-0.58
bezeichneter
-0.58
препратки
-0.57
pkey
-0.56
-0.56
agré
-0.56
POSITIVE LOGITS
os
4.20
OS
3.03
os
2.54
Os
2.35
Os
2.23
OS
2.10
osk
1.54
osz
1.48
osy
1.34
osc
1.29
Activations Density 0.065%