INDEX
Explanations
words related to being surrounded or surroundedness
instances of the word "surrounded."
Negative Logits
uh          -0.68
eway        -0.66
hist        -0.64
der         -0.64
wee         -0.64
err         -0.63
odor        -0.63
bal         -0.62
oi          -0.62
nerv        -0.62
Positive Logits
surrounded  0.93
surround    0.89
htaking     0.87
surrounds   0.80
bys         0.80
ength       0.76
angelo      0.73
ouver       0.72
ecause      0.71
ournal      0.70
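For readers unfamiliar with these lists, the sketch below shows one common way such per-token logit effects are obtained: projecting a feature's output direction through the model's unembedding weights and ranking tokens by the result. The page does not state how its values were computed, so this is an assumption; the function name `top_logit_effects`, the toy vocabulary, and all numbers are hypothetical and unrelated to the values listed above.

```python
def top_logit_effects(direction, unembedding, vocab, k=10):
    """Rank tokens by the dot product of a feature direction with each
    token's unembedding vector (assumed method, toy-sized data)."""
    effects = []
    for token, weights in zip(vocab, unembedding):
        score = sum(d * w for d, w in zip(direction, weights))
        effects.append((token, score))
    ascending = sorted(effects, key=lambda pair: pair[1])
    negative = ascending[:k]          # most negative effects first
    positive = ascending[::-1][:k]    # most positive effects first
    return negative, positive


# Toy example: 3 hypothetical tokens, 2-dimensional hidden space.
vocab = ["tok_a", "tok_b", "tok_c"]
unembedding = [[0.5, 9.0], [-0.25, 9.0], [2.0, 9.0]]  # one row per token
direction = [1.0, 0.0]                                # hypothetical feature direction

negative, positive = top_logit_effects(direction, unembedding, vocab, k=1)
print(negative, positive)
```

Sorting once and slicing both ends keeps the two lists consistent with each other, which mirrors the paired negative/positive layout of the page.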
Activations Density 0.016%
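As a rough illustration of what a density figure like this usually means, the sketch below computes the fraction of token positions on which a feature fires. It assumes density is defined as "activation greater than a threshold of zero, divided by total tokens"; the page does not give its exact definition, and the function name `activation_density` and the sample activations are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 16 of 100,000 tokens.
sample = [0.0] * 99_984 + [1.5] * 16
density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low means the feature is silent on the vast majority of tokens, which is consistent with a feature tied to a single word family.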