INDEX
Explanations
occurrences of the word "friends" in the text
references to friends in social contexts
New Auto-Interp
Negative Logits
phas
-0.77
grim
-0.72
yss
-0.70
imon
-0.66
assion
-0.64
Ħ¢
-0.63
©¶æ
-0.63
aton
-0.62
acted
-0.62
chloride
-0.61
POSITIVE LOGITS
hips
1.20
folk
1.05
liest
0.94
liness
0.91
lier
0.90
acquaintances
0.89
hip
0.87
hall
0.82
friends
0.82
erv
0.81
Activations Density 0.050%