INDEX
Explanations
the word "still" in various contexts
New Auto-Interp
Negative Logits
é¾
-0.75
alties
-0.74
incial
-0.72
tymology
-0.71
Course
-0.71
ASE
-0.71
bably
-0.69
livious
-0.67
cius
-0.66
abama
-0.64
POSITIVE LOGITS
birth
1.17
born
0.95
lifes
0.89
ness
0.84
heres
0.83
unanswered
0.78
intact
0.76
creen
0.76
cam
0.75
gray
0.74
Activations Density 0.028%