INDEX
Explanations
the presence of the word "ack" in various contexts
New Auto-Interp
Negative Logits
exter
-0.15
ONO
-0.15
å¥ij
-0.15
Comparable
-0.15
grate
-0.14
geist
-0.14
ogh
-0.14
ono
-0.13
excerpt
-0.13
exter
-0.13
POSITIVE LOGITS
Ħ
0.17
ombo
0.16
ifetime
0.16
utschen
0.15
ocha
0.15
ieved
0.14
nowled
0.14
nio
0.14
chio
0.14
slashes
0.14
Activations Density 0.013%