Explanations
dates and days of the week
specific days and times mentioned in the text
Negative Logits
Token               Logit
onite               -0.64
anium               -0.63
ufact               -0.62
cs                  -0.58
=-=-=-=-=-=-=-=-    -0.57
VIDEOS              -0.56
ktop                -0.56
cause               -0.55
hop                 -0.55
etheless            -0.54
Positive Logits
Token               Logit
we                  0.75
afternoon           0.74
evening             0.72
,                   0.70
morning             0.68
however             0.67
they                0.67
ruary               0.66
meanwhile           0.65
2010                0.64
Activations Density: 0.148%