INDEX
Explanations
the word "Throughout" followed by a date or a specific period of time
repetitive phrases indicating continuity or duration
New Auto-Interp
Negative Logits
nery
-0.71
potion
-0.69
umble
-0.66
JV
-0.65
que
-0.65
venge
-0.65
boy
-0.62
olk
-0.61
inals
-0.61
Haram
-0.61
POSITIVE LOGITS
eatures
0.83
ĸļ
0.81
Wage
0.79
Seasons
0.77
theless
0.77
itialized
0.77
Languages
0.71
ende
0.70
perty
0.68
ership
0.67
Activations Density 0.005%