Explanations
occurrences of apostrophized words, indicating possessive or contracted forms
Negative Logits
Token     Logit
s         -0.27
Ùĩ        -0.16
tti       -0.15
owski     -0.14
952       -0.14
нг        -0.14
YNAM      -0.14
ombat     -0.14
igner     -0.14
760       -0.13
Positive Logits
Token     Logit
richt     0.15
them      0.14
ık        0.13
ephir     0.13
waves     0.13
imately   0.12
-navbar   0.12
Doch      0.12
nyder     0.12
_scal     0.12
Activations Density 0.021%