INDEX
Explanations
dates or time-related events
instances of the word "after" and related phrases indicating events or actions taken subsequently
Negative Logits
ardless       -0.85
ail           -0.76
odes          -0.67
olute         -0.67
ertain        -0.66
each          -0.65
atts          -0.64
isible        -0.63
etric         -0.63
ctive         -0.63
Positive Logits
actionDate    0.86
suspicions    0.72
Azerb         0.69
childhood     0.65
Johann        0.65
pledging      0.63
researching   0.63
initials      0.63
growth        0.62
necessity     0.62
Activations Density: 0.406%