INDEX
Explanations
themes of loss, limited freedom, and the impact of societal pressures on individual lives
Negative Logits
anzi       -0.16
Slow       -0.14
innen      -0.13
Slow       -0.13
sluggish   -0.12
Lew        -0.12
Stable     -0.12
xCB        -0.12
uts        -0.12
Zug        -0.12
Positive Logits
short      1.05
short      0.91
-short     0.87
çŁŃ        0.85
Short      0.85
SHORT      0.84
Short      0.81
.short     0.76
_short     0.74
shorter    0.73
Activations Density 0.220%