Explanations
phrases related to collaboration and teamwork
Negative Logits
rong     -0.16
Tok      -0.16
éĩİ      -0.15
610      -0.14
/from    -0.13
330      -0.13
гов      -0.13
anter    -0.13
HI       -0.13
911      -0.13
Positive Logits
closely      0.40
alongside    0.26
together     0.21
towards      0.20
directly     0.20
close        0.19
hand         0.19
CLOSE        0.18
closer       0.18
toward       0.18
Activations Density 0.046%
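
The page does not say how the logit lists or the density figure are computed. The sketch below shows one common convention for feature dashboards of this kind, offered as an assumption rather than as this tool's method: a feature's logit effect on each token is the dot product of its output direction with that token's unembedding column, and the density is the fraction of tokens on which the feature has a positive activation. All names (`logit_effects`, `top_and_bottom`, `activation_density`) and the toy numbers are hypothetical.

```python
def logit_effects(direction, unembed):
    """Dot the feature direction with each token's unembedding column.

    direction: list of d_model floats (assumed feature output direction).
    unembed:   d_model rows, each a list of vocab_size floats.
    """
    vocab_size = len(unembed[0])
    return [
        sum(d * row[j] for d, row in zip(direction, unembed))
        for j in range(vocab_size)
    ]


def top_and_bottom(effects, vocab, k=10):
    """Return the k most negative and the k most positive (token, effect) pairs."""
    ranked = sorted(zip(vocab, effects), key=lambda pair: pair[1])  # ascending
    negative = ranked[:k]
    positive = ranked[::-1][:k]
    return negative, positive


def activation_density(activations):
    """Fraction of tokens on which the feature activation is positive."""
    fired = sum(1 for a in activations if a > 0)
    return fired / len(activations)


# Toy data, invented purely for illustration (not taken from the dashboard).
vocab = ["closely", "together", "rong", "the"]
unembed = [
    [0.4, 0.2, -0.2, 0.0],
    [0.0, 0.1, 0.1, 0.0],
]
direction = [1.0, 0.0]

effects = logit_effects(direction, unembed)
negative, positive = top_and_bottom(effects, vocab, k=2)
density = activation_density([0.0, 0.0, 1.3, 0.0])
```

Under this reading, a density of 0.046% would mean the feature fires on roughly 46 of every 100,000 tokens in whatever sample the dashboard used.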