INDEX
Explanations
phrases related to financial transactions
Neuron Alignment
Index    Value    % of L₁
1741     +0.17    0.5%
394      +0.13    0.4%
1265     +0.13    0.4%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1265     +0.17       0.07
382      +0.13       0.07
736      +0.13       0.07
Negative Logits
makro      -1.47
antik      -1.44
alkoh      -1.43
marte      -1.34
umo        -1.34
tuta       -1.31
Kategor    -1.31
maroc      -1.30
elek       -1.29
ananas     -1.29
Positive Logits
but        0.91
they       0.69
yes        0.67
we         0.66
however    0.66
nhưng      0.64
but        0.64
pero       0.63
and        0.63
he         0.62
Activations Density 0.410%