INDEX
Explanations
names, dates, and duration mentions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
674
+0.11
0.3%
191
+0.07
0.2%
850
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
776
+0.11
0.04
418
+0.07
0.01
850
+0.07
0.03
Negative Logits
demag
-0.52
Zul
-0.49
Vind
-0.49
induct
-0.47
businessman
-0.47
caprice
-0.46
sophistic
-0.46
TheReal
-0.46
vexed
-0.46
Havel
-0.45
POSITIVE LOGITS
tagena
0.75
membrance
0.74
bandung
0.71
sappi
0.69
incess
0.68
credere
0.68
torner
0.67
parteci
0.67
prega
0.67
domani
0.65
Activations Density 0.119%