INDEX
Explanations
references to locations and proximity to attractions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.41
1.5%
599
+0.10
0.3%
738
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1446
+0.41
0.04
1120
+0.10
0.01
939
+0.09
0.04
Negative Logits
<bos>
-1.06
solidar
-0.73
pessi
-0.72
/***
-0.71
recipro
-0.66
-0.66
utop
-0.66
ⓧ
-0.66
lau
-0.64
psycholog
-0.63
POSITIVE LOGITS
déplo
1.05
fameux
1.01
bénéfice
0.97
saurait
0.96
triomphe
0.92
réjou
0.87
trésor
0.87
marié
0.86
vœ
0.86
règlement
0.85
Activations Density 0.239%