INDEX
Explanations
times mentioned in hours and minutes
phrases indicating a specific time or approximate temporal references
New Auto-Interp
Negative Logits
rog
-0.64
eming
-0.61
atro
-0.58
yet
-0.57
Masquerade
-0.56
Sent
-0.56
indeed
-0.55
oran
-0.54
Auth
-0.54
oleon
-0.54
POSITIVE LOGITS
midday
1.03
noon
1.00
1900
0.97
midnight
0.94
7000
0.94
3000
0.94
9000
0.94
1850
0.94
1400
0.93
1700
0.91
Activations Density 0.055%