INDEX
Explanations
references to significant events or changes related to "groundbreaking" projects or concepts
Negative Logits
æľĭ     -0.18
ÌĨ      -0.18
ett     -0.15
akes    -0.15
ipi     -0.14
cks     -0.14
ayo     -0.14
lite    -0.14
Kob     -0.14
mono    -0.14
Positive Logits
hog         0.31
truth       0.25
-breaking   0.25
_truth      0.23
breaking    0.23
truth       0.23
ground      0.23
breaking    0.23
Truth       0.22
-floor      0.21
Activations Density 0.009%