INDEX
Explanations
phrases related to physical distance or separation
occurrences of the word "away" and its contextual use
New Auto-Interp
Negative Logits
sshd
-0.68
Ellison
-0.67
Frog
-0.62
Butterfly
-0.61
Phillip
-0.59
Sentinel
-0.59
ific
-0.58
heels
-0.57
xual
-0.57
Minotaur
-0.56
POSITIVE LOGITS
fitting
0.82
aday
0.75
away
0.74
lier
0.72
irts
0.70
iances
0.70
lane
0.69
fits
0.69
lessly
0.67
irt
0.67
Activations Density 0.045%