INDEX
Explanations
occurrences of the substring "som" in various forms
New Auto-Interp
Negative Logits
ivot
-0.17
ignum
-0.16
enan
-0.15
amps
-0.15
.scalablytyped
-0.15
HEET
-0.15
eos
-0.14
ungs
-0.14
fections
-0.14
anga
-0.14
POSITIVE LOGITS
erville
0.28
ewhere
0.27
ewhat
0.25
thing
0.25
mers
0.23
erset
0.22
brero
0.21
ething
0.21
etime
0.21
place
0.18
Activations Density 0.010%