INDEX
Explanations
timestamps or dates related to the word "Last" in a specific context
references to recency or timeliness of events
New Auto-Interp
Negative Logits
mbuds
-0.68
onto
-0.67
cil
-0.63
obin
-0.63
OPLE
-0.63
urious
-0.62
sure
-0.60
aii
-0.59
plain
-0.58
odium
-0.58
POSITIVE LOGITS
updated
1.09
rites
1.05
edited
1.04
bumped
1.03
Updated
1.02
modified
0.96
seen
0.95
night
0.91
Seen
0.91
Modified
0.90
Activations Density 0.055%