INDEX
Explanations
the word "oo" with varying levels of activation
variations of the sound "oo"
New Auto-Interp
Negative Logits
代
-0.70
mus
-0.64
itates
-0.63
ewski
-0.63
Expend
-0.63
idates
-0.61
Luthor
-0.60
oblig
-0.59
adr
-0.59
izoph
-0.58
POSITIVE LOGITS
gey
1.25
gee
1.21
zing
1.21
zie
1.19
ey
1.16
ze
1.16
zy
1.11
zers
1.06
za
1.05
zer
1.05
Activations Density 0.067%