INDEX
Explanations
phrases indicating continuation or ongoing action
instances of the word "continued."
New Auto-Interp
Negative Logits
oster
-0.99
ard
-0.85
arta
-0.82
ramid
-0.80
liest
-0.77
ranch
-0.73
ards
-0.73
atana
-0.72
ainted
-0.72
ustomed
-0.72
POSITIVE LOGITS
unab
0.88
depress
0.73
onward
0.72
proble
0.70
tremend
0.69
ap
0.68
onwards
0.66
exha
0.65
thrust
0.65
sclerosis
0.65
Activations Density 0.033%