INDEX
Explanations
instances of the word "half" in various contexts
New Auto-Interp
Negative Logits
erate
-0.15
las
-0.15
hence
-0.14
andler
-0.14
uled
-0.14
luv
-0.14
kla
-0.14
chema
-0.13
ieder
-0.13
ee
-0.13
POSITIVE LOGITS
dozen
0.18
/full
0.18
enger
0.15
wares
0.15
weg
0.14
pter
0.14
Trab
0.14
/all
0.14
ystone
0.14
iterals
0.14
Activations Density 0.042%