INDEX
Explanations
phrases related to historical events and periods
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
776
+0.12
0.3%
191
+0.07
0.2%
856
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
776
+0.12
0.06
1788
+0.07
0.04
1904
+0.07
0.04
Negative Logits
Warto
-0.50
?</
-0.49
!!</
-0.48
Conc
-0.48
netinet
-0.48
senza
-0.48
caprice
-0.47
desir
-0.47
<^
-0.46
sS
-0.46
POSITIVE LOGITS
decades
0.88
years
0.78
centuries
0.71
decade
0.69
silikon
0.67
kafe
0.67
years
0.67
YEARS
0.65
millennia
0.64
ekos
0.64
Activations Density 0.199%