INDEX
Explanations
themes of humility and self-reflection
Negative Logits
ριν       -0.19
icari     -0.15
osite     -0.15
γερ       -0.15
icer      -0.14
ryn       -0.14
ула       -0.14
lag       -0.14
onet      -0.14
.Areas    -0.13
Positive Logits
humble        0.38
hum           0.36
humility      0.35
low           0.34
Hum           0.32
lowest        0.29
-low          0.28
LOW           0.27
Low           0.27
humiliation   0.26
Activations Density 0.127%