INDEX
Explanations
specific mentions of an entity called "Friends" in various contexts
mentions of the word "Friends" in various contexts, indicating a focus on friendship or community connections
New Auto-Interp
Negative Logits
bearer
-0.68
concussion
-0.65
OTT
-0.63
otal
-0.62
arcity
-0.60
artifacts
-0.59
OUT
-0.59
utilitarian
-0.59
ameron
-0.59
stall
-0.59
POSITIVE LOGITS
hips
1.19
Friends
1.07
liest
0.98
liness
0.92
Friends
0.88
busters
0.87
ships
0.84
ilial
0.81
lier
0.81
Friend
0.80
Activations Density 0.023%