INDEX
Explanations
phrases and words related to time, specifically references to days, months, and yearly events
Negative Logits
рун      -0.17
overall  -0.17
overall  -0.16
illy     -0.15
angs     -0.14
nám      -0.13
obr      -0.13
naire    -0.13
xima     -0.13
ough     -0.13
Positive Logits
cky         0.15
istrovství  0.14
Leban       0.14
-ion        0.14
ertz        0.14
Dagger      0.14
indsight    0.14
365         0.14
Mvc         0.14
zug         0.13
Activations Density 0.044%