INDEX
Explanations
phrases related to ongoing actions or processes
phrases that indicate ongoing actions or processes
New Auto-Interp
Negative Logits
idden
-0.79
ranch
-0.78
oster
-0.74
onomic
-0.74
worker
-0.73
aster
-0.72
arta
-0.70
ramid
-0.70
ool
-0.70
iman
-0.69
POSITIVE LOGITS
unab
0.90
unanswered
0.69
ap
0.67
uninterrupted
0.66
adolesc
0.65
tremend
0.65
momentum
0.64
unravel
0.64
unresolved
0.64
progressing
0.63
Activations Density 0.035%