INDEX
Explanations
references to close relationships or organizations related to companionship
mentions of the word "Friends" in various contexts
New Auto-Interp
Negative Logits
churn
-0.72
belt
-0.68
cylinder
-0.67
shedding
-0.66
tipped
-0.65
dep
-0.64
vein
-0.62
AMD
-0.62
ossus
-0.62
saturation
-0.60
POSITIVE LOGITS
Friends
4.02
Friends
2.95
friends
2.34
Friend
1.82
friends
1.81
Friendship
1.63
FRI
1.61
friend
1.59
pals
1.47
Friend
1.44
Activations Density 0.016%