INDEX
Explanations
instances of the word "today" and its variations
New Auto-Interp
Negative Logits
ic
-0.17
er
-0.16
ray
-0.16
s
-0.16
now
-0.16
su
-0.16
ussen
-0.16
jetzt
-0.15
leaf
-0.15
ses
-0.15
POSITIVE LOGITS
lerde
0.20
aday
0.20
ÑĪ
0.17
گار
0.17
Ùħار
0.17
cÃłng
0.16
jÅ¡ÃŃ
0.15
eÄį
0.15
-day
0.15
########.
0.15
Activations Density 0.037%