INDEX
Explanations
time-related events or changes
temporal phrases indicating changes over time
New Auto-Interp
Negative Logits
Ĥª
-0.77
Ļ
-0.77
owship
-0.72
ãĤ¹
-0.65
ģ
-0.65
ĩ
-0.64
cherished
-0.63
voic
-0.63
Favorite
-0.62
Sons
-0.62
POSITIVE LOGITS
adjusting
0.96
accounting
0.91
subtract
0.90
noon
0.78
excluding
0.76
market
0.75
hattan
0.73
kefeller
0.72
adjustment
0.71
achev
0.71
Activations Density 0.125%