INDEX
Explanations
the occurrence of the word "first" and its variations in different contexts
Negative Logits
Token    Logit
aul      -0.17
for      -0.15
aret     -0.15
ider     -0.14
ook      -0.14
žit      -0.14
abil     -0.14
pul      -0.14
essen    -0.13
ara      -0.13
Positive Logits
Token    Logit
times    0.32
fois     0.25
vez      0.25
keer     0.23
TIMES    0.22
time     0.21
times    0.21
veces    0.20
Times    0.19
vezes    0.19
Activations Density 0.015%