INDEX
Explanations
expressions of self-reflection and critique in performance
New Auto-Interp
Negative Logits
915
-0.16
_hour
-0.16
.Hour
-0.15
utm
-0.15
oref
-0.14
abe
-0.14
nig
-0.14
بات
-0.14
.subplots
-0.13
icons
-0.13
POSITIVE LOGITS
sustain
0.22
regist
0.21
vibr
0.21
dynamics
0.21
cresc
0.19
attack
0.19
artic
0.19
technique
0.19
Technique
0.19
registers
0.18
Activations Density 0.039%