INDEX
Explanations
words indicating movement or direction
expressions that indicate direction or progression
New Auto-Interp
Negative Logits
nz
-0.78
CD
-0.76
RC
-0.74
NZ
-0.70
cell
-0.69
cham
-0.69
umm
-0.67
named
-0.67
nai
-0.65
drivers
-0.65
POSITIVE LOGITS
toward
1.29
towards
1.09
ward
0.90
vernment
0.83
Towards
0.81
infinity
0.80
outheast
0.78
WARD
0.78
wards
0.78
fruition
0.74
Activations Density 0.011%