INDEX
Explanations
dates and related time expressions
New Auto-Interp
Negative Logits
Fung
-0.72
مرئيه
-0.69
Ambro
-0.65
avedra
-0.64
Svetlana
-0.64
hamdu
-0.64
ozat
-0.64
zecz
-0.62
Wiese
-0.62
június
-0.61
POSITIVE LOGITS
January
2.08
January
1.88
JANUARY
1.73
january
1.66
Jan
1.66
Januar
1.65
Jan
1.65
january
1.61
gennaio
1.57
janvier
1.55
Activations Density 0.071%