INDEX
Explanations
references to financial transactions and monetary figures
Neuron Alignment
Index    Value    % of L₁
184      +0.21    0.7%
1842     +0.15    0.5%
1129     +0.12    0.4%
Correlated Neurons
Index    P. Corr.    Cos Sim.
184      +0.21       0.02
1842     +0.15       0.02
715      +0.12       0.01
Negative Logits
<bos>              -1.07
quegli             -0.82
awtextra           -0.81
expandindo         -0.75
autorytatywna      -0.70
Autoritní          -0.69
estekak            -0.69
WindowConstants    -0.66
autunno            -0.65
Geplaatst          -0.65
Positive Logits
Wtf                0.81
FTFY               0.71
Lmao               0.68
reconno            0.67
unspeak            0.66
Considerable       0.65
McLaugh            0.65
intersper          0.64
Whence             0.64
tolerably          0.63
Activations Density: 0.151%