INDEX
Explanations
words related to the mouth or actions involving the mouth
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
228
+0.14
0.5%
597
+0.11
0.4%
1984
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
228
+0.14
0.03
791
+0.11
0.02
597
+0.11
0.02
Negative Logits
bamb
-0.60
PreAuthorize
-0.58
providedIn
-0.55
jaja
-0.53
meras
-0.52
Studi
-0.49
shadowRadius
-0.49
ExecuteAsync
-0.48
fusc
-0.48
erci
-0.48
POSITIVE LOGITS
mouth
1.35
Mouth
1.26
Mouth
1.20
mouths
1.18
mouth
1.04
lips
0.92
mout
0.88
lip
0.87
Lips
0.79
bouche
0.79
Activations Density 0.094%