INDEX
Explanations
references to a "master" or "mastery" in various contexts
Negative Logits
Dagger    -0.16
Rodrig    -0.15
scre      -0.15
dagger    -0.14
Firm      -0.14
agna      -0.14
lush      -0.14
arch      -0.14
Stein     -0.14
quat      -0.14
Positive Logits
NotNull   0.17
hoff      0.16
pieces    0.16
941       0.15
990       0.15
mind      0.15
Ïĩα       0.15
loub      0.15
_mx       0.15
391       0.15
Activations Density 0.014%