INDEX
Explanations
dates or time-related terms
occurrences of the word "start" and related terminology
New Auto-Interp
Negative Logits
entirety
-0.66
majority
-0.65
æ©Ł
-0.61
illard
-0.61
keyboards
-0.60
surv
-0.60
selves
-0.60
compan
-0.60
bearer
-0.59
Bastard
-0.59
POSITIVE LOGITS
nings
1.27
ribune
0.92
ups
0.86
ners
0.84
up
0.80
date
0.78
UP
0.77
othal
0.77
strap
0.76
points
0.75
Activations Density 0.047%