INDEX
Explanations
words related to geographical locations and potential future developments or projects
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
832
+0.09
0.2%
206
+0.07
0.2%
1108
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
184
+0.09
0.02
283
+0.07
0.02
304
+0.07
0.02
Negative Logits
on
-0.70
</strong>
-0.70
in
-0.69
at
-0.67
</b>
-0.66
he
-0.65
by
-0.65
to
-0.64
through
-0.64
de
-0.63
POSITIVE LOGITS
dises
1.85
haup
1.76
hcm
1.74
jaya
1.74
sappi
1.73
hina
1.68
napoli
1.61
santiago
1.59
umo
1.58
seiz
1.58
Activations Density 0.311%