INDEX
Explanations
phrases indicating continuation or abundance
New Auto-Interp
Negative Logits
elib
-0.16
.locals
-0.15
cf
-0.15
immers
-0.14
ramer
-0.14
ustr
-0.14
üst
-0.14
cf
-0.13
so
-0.13
elpers
-0.13
POSITIVE LOGITS
forth
0.54
forth
0.34
fourth
0.21
-on
0.19
nữa
0.18
etc
0.18
Fourth
0.17
Fourth
0.16
дал
0.16
далÑĸ
0.15
Activations Density 0.015%