INDEX
Explanations
phrases indicating spatial movement or directionality
New Auto-Interp
Negative Logits
ThroughAttribute
-0.64
resourceCulture
-0.60
WriteTagHelper
-0.57
ItemBackground
-0.56
LookAnd
-0.56
AssemblyCompany
-0.55
relâche
-0.53
argint
-0.52
AndEndTag
-0.51
SharedDtor
-0.51
POSITIVE LOGITS
past
0.49
front
0.48
into
0.46
yonder
0.43
near
0.43
past
0.42
toward
0.42
range
0.40
soci
0.39
towards
0.39
Activations Density 0.152%