INDEX
Explanations
mentions of medals and achievements
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
32
+0.15
0.7%
1339
+0.14
0.7%
938
+0.13
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1363
+0.15
0.03
1896
+0.14
0.02
1515
+0.13
0.02
Negative Logits
<bos>
-1.59
WebElementEntity
-0.65
Personendaten
-0.62
illots
-0.62
bezeichneter
-0.60
nasel
-0.59
CodeDom
-0.59
Билгалдахарш
-0.58
Italijani
-0.58
оригіналу
-0.57
POSITIVE LOGITS
medal
1.08
Medal
1.05
Medal
1.00
medal
0.97
medals
0.89
Medals
0.81
medallion
0.78
Meda
0.77
paradiso
0.77
unden
0.73
Activations Density 0.439%