INDEX
Explanations
emotional states and concepts related to remorse and hurt
New Auto-Interp
Negative Logits
heimer
-0.15
byt
-0.14
COPY
-0.14
appId
-0.14
ypad
-0.14
/sidebar
-0.14
simd
-0.13
llx
-0.13
yscale
-0.13
nameof
-0.13
POSITIVE LOGITS
ful
1.20
FUL
0.97
fully
0.96
full
0.93
fulness
0.91
FULL
0.78
-full
0.66
ful
0.65
Full
0.60
Ful
0.59
Activations Density 0.155%