INDEX
Explanations
phrases related to ongoing actions or events
phrases indicating ongoing actions or situations
Negative Logits
atana     -0.73
idden     -0.73
aster     -0.72
ranch     -0.71
oster     -0.70
ixed      -0.70
worker    -0.70
ramid     -0.69
ool       -0.68
onomic    -0.67
Positive Logits
unab             0.86
ap               0.73
unanswered       0.71
uninterrupted    0.70
adolesc          0.69
unchanged        0.69
momentum         0.67
unravel          0.64
trending         0.64
tremend          0.64
Activations Density: 0.035%