INDEX
Explanations
phrases related to tasks being completed or finished
instances of the word "done."
New Auto-Interp
Negative Logits
Å¡
-0.70
anth
-0.65
lement
-0.62
unin
-0.61
anta
-0.59
McCorm
-0.58
correspond
-0.56
Ann
-0.56
acas
-0.54
erald
-0.53
POSITIVE LOGITS
done
3.60
done
2.82
Done
2.29
Done
2.08
accomplished
1.77
undertaken
1.72
performed
1.68
finished
1.39
completed
1.38
achieved
1.34
Activations Density 0.021%