INDEX
Explanations
instances of repetition or recurrence in various contexts
New Auto-Interp
Negative Logits
still
-0.20
finally
-0.20
Still
-0.19
still
-0.17
indeed
-0.16
обо
-0.15
Still
-0.15
Finally
-0.14
any
-0.14
stále
-0.14
POSITIVE LOGITS
proven
0.20
demonstrated
0.20
failed
0.20
proved
0.19
fail
0.18
ovnÄĽ
0.18
FAILED
0.17
prove
0.17
proves
0.17
fails
0.16
Activations Density 0.030%