INDEX
Explanations
references to team collaboration and communication in a work context
New Auto-Interp
Negative Logits
alian
-0.18
uli
-0.17
ulus
-0.16
اÙĩÙħ
-0.16
lew
-0.15
ategy
-0.14
pany
-0.14
exion
-0.13
everyone
-0.13
ãĢĤãĢĤ↵↵
-0.13
POSITIVE LOGITS
uh
0.24
sort
0.20
actually
0.19
kind
0.19
èĥ½å¤Ł
0.18
åij¢
0.17
um
0.17
really
0.16
kind
0.16
yeah
0.15
Activations Density 0.904%