INDEX
Explanations
mention of financial transactions, political connections, and controversial figures
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
198
+0.13
0.4%
184
+0.13
0.4%
1150
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1662
+0.13
0.06
1136
+0.13
0.05
238
+0.12
0.04
Negative Logits
tricot
-0.72
chèvre
-0.65
broderie
-0.58
chemise
-0.57
artig
-0.56
menthe
-0.54
nage
-0.54
rivi
-0.53
matel
-0.52
toilette
-0.52
POSITIVE LOGITS
reportedly
0.54
CiNii
0.54
BERNAMA
0.53
Américas
0.53
chaired
0.53
katun
0.52
kasama
0.51
Conexion
0.51
allegedly
0.50
bahay
0.50
Activations Density 0.612%