INDEX
Explanations
terms and concepts related to philosophy
Neuron Alignment
Index   Value   % of L₁
50      +0.20   1.1%
966     +0.11   0.6%
950     +0.11   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
966     +0.20      0.02
950     +0.11      0.02
251     +0.11      0.02
Negative Logits
<bos>         -2.64
AsUp          -0.74
writeField    -0.71
ⓧ             -0.69
convene       -0.67
inaugurate    -0.65
/***          -0.65
avert         -0.62
conserve      -0.62
IContainer    -0.61
Positive Logits
affor      1.26
increa     1.21
thut       1.14
wien       1.13
bandung    1.12
Juf        1.11
tew        1.10
yong       1.10
yoo        1.10
FFFF       1.08
Activations Density: 0.034%