INDEX
Explanations
mentions of the time "tonight"
references to the word "tonight."
New Auto-Interp
Negative Logits
berman
-0.76
sac
-0.68
sed
-0.65
egal
-0.65
aments
-0.64
eer
-0.64
Journals
-0.63
hetti
-0.63
lasses
-0.63
Entity
-0.62
POSITIVE LOGITS
tonight
1.03
afternoon
1.02
evening
0.96
morning
0.94
night
0.90
Tonight
0.88
night
0.85
Evening
0.84
Tonight
0.82
nights
0.80
Activations Density 0.010%