INDEX
Explanations
mentions of actions or interactions involving sports teammates
references to teammates and teamwork
New Auto-Interp
Negative Logits
brow
-0.80
osc
-0.74
atra
-0.74
tarian
-0.72
tarians
-0.68
consumer
-0.65
cot
-0.65
budget
-0.64
ulent
-0.64
skinned
-0.63
POSITIVE LOGITS
teammate
1.20
teammates
1.10
mates
1.04
colleagues
0.84
ubs
0.83
mates
0.81
classmates
0.74
colleague
0.73
AVG
0.72
Coco
0.71
Activations Density 0.023%