INDEX
Explanations
references to friends
references to friends and social connections
New Auto-Interp
Negative Logits
hner
-0.68
phas
-0.68
uchin
-0.67
oted
-0.67
assion
-0.66
ITH
-0.65
ifer
-0.64
ruling
-0.63
vent
-0.63
arcity
-0.63
POSITIVE LOGITS
hips
1.00
friends
0.97
buddies
0.95
Friends
0.93
acquaintances
0.92
friends
0.87
Friends
0.85
folk
0.85
collaborators
0.83
hip
0.78
Activations Density 0.029%