INDEX
Explanations
references to historical events, particularly wars and crises
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.34
1.3%
2019
+0.09
0.3%
690
+0.07
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
2019
+0.34
0.06
690
+0.09
0.03
62
+0.07
0.05
Negative Logits
<bos>
-2.53
encomp
-1.09
intersper
-1.07
endow
-0.89
unve
-0.89
/***
-0.87
indestru
-0.87
underval
-0.82
quitted
-0.81
<?
-0.80
POSITIVE LOGITS
CollectionUtils
0.60
Gnaden
0.58
gruntled
0.57
reportWebVitals
0.57
PhysRevLett
0.57
fillText
0.55
Tode
0.55
Herzen
0.54
caseros
0.53
ruinas
0.53
Activations Density 0.603%