INDEX
Explanations
phrases emphasizing one particular thing or concept
repeated references to "one thing."
New Auto-Interp
Negative Logits
mson
-0.73
iolet
-0.71
illon
-0.70
ocks
-0.69
ocked
-0.68
illus
-0.68
inders
-0.66
avascript
-0.66
zanne
-0.65
ICO
-0.65
POSITIVE LOGITS
thing
0.96
hundred
0.95
exception
0.92
Hundred
0.82
thousand
0.78
sided
0.77
eyed
0.73
guy
0.73
remaining
0.72
person
0.69
Activations Density 0.052%