Explanations
words related to relationships and social connections
references to friendship
Negative Logits
cise      -0.68
consumer  -0.67
pex       -0.66
vich      -0.64
arde      -0.63
ktop      -0.62
por       -0.61
erate     -0.61
grim      -0.60
warning   -0.60
Positive Logits
friendships  0.85
friendship   0.85
ilial        0.79
buddy        0.75
banter       0.74
halla        0.73
partner      0.73
unsus        0.73
recomm       0.72
buddies      0.71
Activations Density: 0.026%