INDEX
Explanations
Spanish words and phrases
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.36
1.9%
2019
+0.15
0.8%
1506
+0.10
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
2019
+0.36
0.12
1265
+0.15
0.10
1921
+0.10
0.10
Negative Logits
<bos>
-2.61
<?
-0.77
ⓧ
-0.72
/***
-0.67
-0.64
spek
-0.62
/**
-0.61
dras
-0.59
kast
-0.58
elek
-0.58
POSITIVE LOGITS
unlaw
1.00
maneu
1.00
unwarran
0.98
toledo
0.94
perfon
0.94
increa
0.92
affor
0.91
tucson
0.90
chrysler
0.89
accla
0.89
Activations Density 1.519%