INDEX
Explanations
descriptions related to historical inventions and food ingredients
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
227
+0.16
0.5%
198
+0.14
0.4%
2034
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
227
+0.16
0.08
766
+0.14
0.05
1901
+0.10
0.04
Negative Logits
sergio
-0.77
<bos>
-0.77
javier
-0.76
felipe
-0.76
fernando
-0.68
gabri
-0.68
eduardo
-0.68
cristina
-0.68
JULIO
-0.67
jorge
-0.67
POSITIVE LOGITS
glaubte
0.56
underland
0.55
NUKAT
0.53
where
0.52
throughout
0.52
since
0.50
wherever
0.50
Mère
0.49
tamment
0.49
izarse
0.48
Activations Density 0.602%