INDEX
Explanations
references related to historical events and philosophical ideologies
Neuron Alignment
Index   Value   % of L₁
1842    +0.13   0.4%
394     +0.13   0.4%
764     +0.13   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1870    +0.13      0.06
1220    +0.13      0.07
1842    +0.13      0.08
Negative Logits
bewerken        -0.70
تضيفلها         -0.60
GraphicsUnit    -0.60
ویکیپدیای       -0.56
ValueStyle      -0.53
AndEndTag       -0.52
"..\..\..\      -0.52
Suara           -0.52
solicited       -0.50
تانيه           -0.50
Positive Logits
oliveira      0.74
surfact       0.65
Edizioni      0.63
felipe        0.62
mercu         0.62
urso          0.61
Mémoires      0.60
sulfu         0.59
liberality    0.59
alicante      0.58
Activations Density 1.476%