INDEX
Explanations
references to spending time with friends
references to friendship and social connections
New Auto-Interp
Negative Logits
rection
-0.72
Ħ¢
-0.71
©¶æ
-0.70
phas
-0.67
orney
-0.65
untreated
-0.63
elsius
-0.62
plet
-0.61
agnetic
-0.59
æ©Ł
-0.59
POSITIVE LOGITS
hips
1.27
liest
1.04
folk
1.00
liness
0.99
lier
0.96
hip
0.88
acquaintances
0.86
whom
0.81
who
0.80
ships
0.78
Activations Density 0.058%