INDEX
Explanations
timestamps and numerical data within the text
Negative Logits
j     -0.21
u     -0.20
z     -0.19
q     -0.18
HR    -0.18
p     -0.17
a     -0.17
ft    -0.17
ir    -0.17
ts    -0.17
Positive Logits
pm    0.55
am    0.41
(pm   0.32
/pm   0.30
pm    0.29
_pm   0.26
pm    0.26
.pm   0.25
fm    0.22
_PM   0.21
Activations Density: 0.007%