INDEX
Explanations
references to procrastination and laziness
Negative Logits
Token       Logit
Landing     -0.19
åѤ          -0.17
alte        -0.15
Sanity      -0.14
loneliness  -0.14
wares       -0.14
üven        -0.14
-UA         -0.14
Lon         -0.14
discreet    -0.14
Positive Logits
Token       Logit
lazy        0.45
lazy        0.42
Lazy        0.38
laz         0.38
Lazy        0.36
laz         0.35
slack       0.35
Laz         0.33
.lazy       0.31
æĩ          0.29
Activations Density 0.272%
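
The density figure above is not defined on this page. A minimal sketch, assuming it means the percentage of token positions at which the feature's activation is above zero, might look like the following. The function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical and used only for illustration.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the percentage of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature fires at all; the source page does not state its definition.
    """
    if not activations:
        # Avoid division by zero on an empty sample.
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical activations for four token positions; one of them is non-zero.
sample = [0.0, 0.0, 1.5, 0.0]
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under that reading, a value of 0.272% would mean the feature is active on roughly 1 in every 370 token positions.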