INDEX
Explanations
terms related to dietary choices and statistics
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
2034
+0.11
0.3%
1150
+0.10
0.3%
1013
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1766
+0.11
0.04
1816
+0.10
0.04
1354
+0.08
0.05
Negative Logits
nece
-1.55
sovere
-1.47
volunte
-1.46
ftu
-1.35
viciss
-1.35
coö
-1.34
pamph
-1.34
desir
-1.33
dispen
-1.32
effe
-1.32
POSITIVE LOGITS
because
0.82
.
0.81
due
0.78
;
0.76
but
0.70
regarding
0.70
。
0.69
,
0.68
!
0.68
when
0.67
Activations Density 0.337%