Explanations
mentions of physical injuries or medical emergencies, specifically bleeding occurrences
Neuron Alignment
Index   Value   % of L₁
410     +0.09   0.3%
1921    +0.07   0.2%
1573    +0.07   0.2%
Correlated Neurons
Index   P. Corr.   Cos Sim.
976     +0.09      0.03
690     +0.07      0.03
1343    +0.07      0.03
Negative Logits
<bos>           -0.99
attach          -0.60
ніж             -0.60
put             -0.59
want            -0.58
WSER            -0.58
have            -0.58
create          -0.57
<blockquote>    -0.57
raise           -0.57
Positive Logits
wien     1.49
aen      1.46
mef      1.45
fta      1.44
franz    1.42
squa     1.42
ftu      1.40
sii      1.40
fup      1.40
„,       1.39
Activations Density: 0.198%