INDEX
Explanations
mentions of groups or organizations called "Friends."
repeated mentions of the word "Friends," indicating a focus on social groups or organizations
New Auto-Interp
Negative Logits
Siren
-0.75
uid
-0.72
aneous
-0.71
aper
-0.68
apers
-0.67
concussion
-0.65
Hok
-0.64
Hurricanes
-0.61
Noon
-0.59
urities
-0.58
POSITIVE LOGITS
hips
1.00
Friends
0.97
liness
0.90
nect
0.88
itism
0.87
ington
0.86
liest
0.83
adelphia
0.80
ilial
0.80
emouth
0.76
Activations Density 0.050%