INDEX
Explanations
specific phrases related to actions and events
phrases indicating actions or activities involving movement or change
New Auto-Interp
Negative Logits
anymore
-0.51
tan
-0.49
whichever
-0.48
base
-0.47
iddle
-0.47
yond
-0.47
Chapters
-0.47
MI
-0.47
entric
-0.46
affected
-0.46
POSITIVE LOGITS
yesterday
0.60
last
0.59
earlier
0.53
Thursday
0.52
captcha
0.49
extensively
0.49
Roz
0.48
Wednesday
0.48
Horowitz
0.48
ax
0.48
Activations Density 1.103%