INDEX
Explanations
phrases indicating instructions or tasks that need to be done
the phrase "all you need to do is."
Negative Logits
Token         Logit
iates         -0.66
iture         -0.65
atural        -0.64
Yor           -0.63
ishers        -0.63
defic         -0.62
Registered    -0.62
itures        -0.61
eatures       -0.60
ised          -0.60
Positive Logits
Token         Logit
olation       0.90
ANGE          0.78
olated        0.77
Solitaire     0.76
:]            0.74
SS            0.73
olate         0.71
plain         0.70
opus          0.67
nt            0.66
Activations Density: 0.078%