INDEX
Explanations
actions related to collaboration and teamwork
New Auto-Interp
Negative Logits
/from
-0.15
dry
-0.15
rong
-0.15
éĩİ
-0.15
Tok
-0.14
ilder
-0.14
гов
-0.13
330
-0.13
oleon
-0.13
/down
-0.13
POSITIVE LOGITS
closely
0.37
alongside
0.28
close
0.24
together
0.24
closer
0.23
towards
0.22
CLOSE
0.22
closest
0.20
toward
0.20
hand
0.19
Activations Density 0.044%