INDEX
Explanations
phrases related to companionship or membership in a group
instances of the word "mate" and its variations in the context of relationships or partnerships
New Auto-Interp
Negative Logits
icum
-0.83
SIGN
-0.79
okin
-0.73
inyl
-0.71
istry
-0.70
icer
-0.69
kefeller
-0.69
acco
-0.69
si
-0.67
authorized
-0.67
POSITIVE LOGITS
mates
0.95
Roc
0.88
mate
0.81
Swap
0.78
Delete
0.75
bery
0.69
':
0.68
liness
0.68
Sylvia
0.67
nicknamed
0.66
Activations Density 0.017%