INDEX
Explanations
mentions of military-related terms and historical events
Neuron Alignment
Index   Value   % of L₁
1926    +0.14   0.5%
369     +0.13   0.5%
1575    +0.12   0.5%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1575    +0.14      0.04
369     +0.13      0.04
1926    +0.12      0.04
Negative Logits
?...        -0.54
manik       -0.54
encomp      -0.54
!...        -0.53
ciasc       -0.52
logarith    -0.52
depic       -0.51
amanda      -0.50
adona       -0.50
perchance   -0.50
Positive Logits
General     1.09
General     1.06
general     1.05
GENERAL     1.02
general     1.01
GENERAL     0.99
eneral      0.93
Général     0.87
Geral       0.87
général     0.84
Activations Density 0.071%