INDEX
Explanations
the act of agreeing to something or making a decision to do something
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1334
+0.10
0.3%
161
+0.10
0.3%
120
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1334
+0.10
0.03
161
+0.10
0.02
120
+0.09
0.02
Negative Logits
jectures
-0.68
disait
-0.67
brille
-0.64
ledad
-0.61
yogur
-0.60
keramik
-0.60
constate
-0.59
quarelle
-0.59
ekos
-0.57
soigne
-0.57
POSITIVE LOGITS
Să
0.59
skimage
0.56
disagreeable
0.54
Tó
0.54
Genau
0.51
Misión
0.51
Ilustra
0.50
accept
0.50
Ár
0.49
Wichtig
0.49
Activations Density 0.119%