INDEX
Explanations
ratings and reviews of products
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.28
1.2%
1177
+0.15
0.6%
1150
+0.14
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
736
+0.28
0.12
1150
+0.15
0.03
876
+0.14
-0.07
Negative Logits
<bos>
-3.09
ⓧ
-0.99
ideolog
-0.72
referenties
-0.71
-0.71
<",
-0.70
/***
-0.70
horm
-0.69
PerformLayout
-0.69
UnknownFields
-0.68
POSITIVE LOGITS
vété
1.35
maneu
1.29
impra
1.28
désol
1.28
malheureux
1.28
malheure
1.27
indestru
1.24
disreg
1.21
shenan
1.21
considér
1.21
Activations Density 2.653%