INDEX
Explanations
references to motivation and discipline in the context of personal development
New Auto-Interp
Negative Logits
rette
-0.15
iec
-0.15
ãĥ¼ãĥ¬
-0.15
ibur
-0.15
affected
-0.14
aga
-0.14
bazen
-0.14
ima
-0.14
xen
-0.13
(?
-0.13
POSITIVE LOGITS
ALIGN
0.18
Alignment
0.18
Authentic
0.17
Alignment
0.17
congr
0.17
Purpose
0.17
Success
0.16
Align
0.16
Mas
0.16
alignment
0.16
Activations Density 0.260%