INDEX
Explanations
descriptions related to philosophy and the human mind
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1108
+0.14
0.4%
184
+0.13
0.4%
1842
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
29
+0.14
0.04
1081
+0.13
0.05
1108
+0.09
0.06
Negative Logits
carrefour
-0.65
couverte
-0.53
pamph
-0.52
végétal
-0.52
madeus
-0.51
Joaqu
-0.49
bourg
-0.49
ritard
-0.49
Sénégal
-0.48
undred
-0.47
POSITIVE LOGITS
remainder
0.60
consists
0.53
ºC
0.52
remainder
0.52
consisting
0.50
reserved
0.49
°;
0.49
portion
0.48
comprises
0.47
separately
0.47
Activations Density 0.475%