Explanations
phrases indicating events or announcements that will happen in the near future
references to time-related phrases
Negative Logits
ulner       -0.79
cu          -0.73
ward        -0.72
WARD        -0.71
antz        -0.71
omas        -0.70
uates       -0.70
ohn         -0.70
agascar     -0.69
acter       -0.69
Positive Logits
meantime     1.47
vein         1.13
vicinity     1.06
evenings     1.01
midst        0.99
hopes        0.93
mornings     0.92
Philippines  0.92
afternoon    0.91
meanwhile    0.90
Activations Density: 0.179%