INDEX
Explanations
mentions of specific numbers or quantities
Neuron Alignment
Index   Value   % of L₁
776     +0.12   0.4%
1842    +0.12   0.4%
897     +0.12   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
2016    +0.12      0.07
16      +0.12      0.07
161     +0.12      0.03
Negative Logits
<bos>             -0.69
AssemblyCompany   -0.58
toHaveBeen        -0.53
Ikr               -0.50
tartalomajánló    -0.49
kaynağından       -0.48
eût               -0.48
CascadeType       -0.48
reú               -0.47
fordable          -0.47
Positive Logits
lagar        0.52
municipi     0.51
Seconde      0.51
település    0.50
Ə            0.50
whom         0.48
Ibidem       0.47
voglio       0.47
SBATCH       0.47
inverte      0.47
Activations Density: 0.574%