INDEX
Explanations
phrases related to progress and achievement
New Auto-Interp
Negative Logits
belang
-0.15
already
-0.14
zajÃŃm
-0.14
richtig
-0.14
already
-0.14
Already
-0.14
.metro
-0.13
oje
-0.13
ENU
-0.13
owi
-0.13
POSITIVE LOGITS
everybody
0.20
slower
0.18
gradual
0.18
slow
0.18
slow
0.18
everyone
0.17
equally
0.17
similarly
0.17
Everybody
0.16
Everybody
0.16
Activations Density 0.026%