INDEX
Explanations
refer to historical and geographical terms and locations, specifically discussing advancements and industries in the past
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.15
0.5%
1967
+0.13
0.4%
1108
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1819
+0.15
0.21
227
+0.13
0.18
1213
+0.11
0.16
Negative Logits
affez
-1.22
Ottobre
-1.18
robus
-1.14
ordina
-1.11
parteci
-1.08
sappi
-1.07
solidar
-1.06
TagMode
-1.06
vogli
-1.04
LIRE
-1.04
POSITIVE LOGITS
ineffec
1.26
unavoid
1.11
impelled
1.08
unspeak
1.08
McLaugh
1.05
Vaugh
1.03
shenan
1.02
McInt
1.00
sophistic
0.99
impra
0.98
Activations Density 4.430%