INDEX
Explanations
mentions of the origin or source of something
the word "from" used repeatedly in various contexts
New Auto-Interp
Negative Logits
ratulations
-0.72
seek
-0.69
priority
-0.65
aqu
-0.65
few
-0.64
merce
-0.64
iar
-0.64
bably
-0.63
istar
-0.63
sat
-0.62
POSITIVE LOGITS
afar
1.34
whence
1.13
thence
1.07
abroad
0.96
scratch
0.86
anywhere
0.81
somewhere
0.80
elsewhere
0.79
inside
0.79
Frie
0.78
Activations Density 0.220%