INDEX
Explanations
phrases related to progress or advancement
phrases indicating gradual progress or incremental steps forward
Negative Logits
ghazi     -0.67
igslist   -0.61
htaking   -0.58
urated    -0.57
onduct    -0.57
unci      -0.56
ictions   -0.56
idth      -0.55
zeb       -0.55
cius      -0.55
Positive Logits
closer    1.36
worse     1.28
richer    1.28
better    1.24
cheaper   1.24
farther   1.23
louder    1.22
faster    1.21
higher    1.20
safer     1.20
Activations Density 0.497%
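
The negative and positive logit lists above can be read as the two ends of a single ranking of tokens by their logit value. The following is a minimal sketch of that ranking step, assuming each token has one scalar value; the dictionary `logit_effects` and the helper `top_tokens` are hypothetical names introduced for illustration, and the sample holds only a few pairs copied from the lists above. How the dashboard itself computes these values is not stated here.

```python
# Hypothetical sample: a few (token, value) pairs copied from the lists above.
# This is NOT the full vocabulary, only an illustration of the ranking step.
logit_effects = {
    "closer": 1.36,
    "worse": 1.28,
    "better": 1.24,
    "faster": 1.21,
    "ghazi": -0.67,
    "igslist": -0.61,
    "htaking": -0.58,
}


def top_tokens(effects, k, positive=True):
    """Return the k (token, value) pairs with the largest values,
    or the k most negative values when positive=False."""
    ranked = sorted(effects.items(), key=lambda item: item[1], reverse=positive)
    return ranked[:k]


top_positive = top_tokens(logit_effects, 3, positive=True)
top_negative = top_tokens(logit_effects, 3, positive=False)

print(top_positive)
print(top_negative)
```

In this sample the positive end is made up of comparative adjectives, which is in line with the explanations listed at the top, while the negative end consists of word fragments with no obvious shared meaning.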