INDEX
Explanations
terms related to laws, regulations, and procedures
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1870
+0.15
0.4%
1013
+0.13
0.4%
1705
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1705
+0.15
0.09
2006
+0.13
0.07
74
+0.11
0.05
Negative Logits
alkoh
-0.93
akut
-0.86
panik
-0.85
„,
-0.84
uhr
-0.84
glan
-0.84
kado
-0.83
ché
-0.83
hina
-0.82
kön
-0.81
POSITIVE LOGITS
consisting
0.73
which
0.68
whereby
0.65
containing
0.65
capable
0.63
that
0.62
for
0.62
called
0.60
based
0.59
awsze
0.59
Activations Density 0.572%