INDEX
Explanations
terms related to social connections and friendships
New Auto-Interp
Negative Logits
baga
-0.97
Życiorys
-0.88
IsContent
-0.86
oriasis
-0.84
Mosley
-0.81
بيها
-0.80
uParam
-0.77
Theaters
-0.77
silenzio
-0.76
disponibilités
-0.76
POSITIVE LOGITS
Friends
1.58
friends
1.55
friends
1.50
FRIENDS
1.49
Friends
1.45
Friend
1.43
friend
1.38
Friend
1.35
FRIEND
1.34
friend
1.20
Activations Density 0.042%