INDEX
Explanations
mentions of injuries or accidents involving individuals, particularly in the context of entertainment or performance
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
98
+0.15
0.9%
365
+0.13
0.7%
280
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
438
+0.15
0.40
53
+0.13
0.42
56
+0.11
0.11
Negative Logits
ingale
-1.66
isance
-1.66
vel
-1.54
pleth
-1.47
pendicular
-1.42
ternal
-1.41
ry
-1.37
yset
-1.35
-1.34
vertisement
-1.31
POSITIVE LOGITS
ŀ
2.74
ĻĤ
2.71
³
2.59
»¿
2.59
Ł
2.50
↵
2.46
↵ ↵
2.46
↵
2.46
↵
2.46
č↵
2.46
Activations Density 4.670%