INDEX
Explanations
phrases related to direction or movement
multiple instances of the word "for" and phrases indicating direction or destination
Negative Logits
Token       Logit
FORE        -0.71
persisted   -0.70
nces        -0.68
belong      -0.65
Used        -0.65
inant       -0.64
applied     -0.64
complied    -0.64
Rated       -0.62
secondly    -0.62
Positive Logits
Token             Logit
Charlottesville   0.80
pex               0.77
retirement        0.77
代                0.76
Skydragon         0.74
Showdown          0.71
town              0.69
Downing           0.68
tnc               0.67
Munich            0.67
Activations Density: 0.066%