INDEX
Explanations
mentions of relationships and dynamics within organizations or systems
New Auto-Interp
Negative Logits
ople
-0.16
lessly
-0.16
inan
-0.16
erin
-0.15
รà¸ĵ
-0.15
nnen
-0.14
thon
-0.14
ilio
-0.14
žit
-0.14
rawl
-0.13
POSITIVE LOGITS
(++
0.16
timing
0.15
)
0.14
morgan
0.13
esz
0.13
Kauf
0.13
eson
0.13
Ãŀ
0.13
ãĥĥãĥī
0.13
_CHK
0.13
Activations Density 0.101%