INDEX
Explanations
texts related to encouragement and promotion
Neuron Alignment
Index    Value    % of L₁
50       +0.19    1.1%
1392     +0.09    0.5%
220      +0.09    0.5%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1370     +0.19       0.03
1141     +0.09       0.03
220      +0.09       0.03
Negative Logits
Token           Logit
<bos>           -3.08
ⓧ               -0.98
(missing)       -0.95
<?              -0.91
/***            -0.84
/**             -0.80
/*              -0.75
glMatrixMode    -0.74
<?              -0.69
Vegeu           -0.69
Positive Logits
Token        Logit
stockholm    1.58
maroc        1.56
bandung      1.51
hcm          1.51
lele         1.48
wien         1.47
aen          1.46
meis         1.44
Minang       1.43
mef          1.42
Activations Density 0.126%