INDEX
Explanations
references to organizational or team structures in a competitive context
New Auto-Interp
Negative Logits
ff
-0.15
opport
-0.15
hos
-0.15
ility
-0.14
лаб
-0.14
Setup
-0.13
INK
-0.13
reck
-0.13
pte
-0.13
conse
-0.13
POSITIVE LOGITS
emphasis
0.29
finishing
0.27
into
0.26
together
0.26
effort
0.25
aside
0.25
blame
0.24
brakes
0.23
_into
0.21
spin
0.21
Activations Density 0.028%