INDEX
Explanations
mentions of team members or people working together
references to teammates in sports or group settings
New Auto-Interp
Negative Logits
atra
-0.81
brow
-0.78
osc
-0.76
skinned
-0.71
warning
-0.69
isen
-0.67
tarians
-0.66
olia
-0.66
ifax
-0.65
ingo
-0.65
POSITIVE LOGITS
teammate
1.17
teammates
1.09
mates
1.00
ubs
0.91
mates
0.83
peak
0.78
colleagues
0.78
nicknamed
0.75
AVG
0.70
":[{"0.69
Activations Density 0.015%