INDEX
Explanations
references to positions or roles in organizations and boards
New Auto-Interp
Negative Logits
zk
-0.17
kon
-0.15
sembler
-0.14
æīĢ
-0.14
-append
-0.14
indr
-0.14
Counseling
-0.14
Ekon
-0.14
esk
-0.13
令
-0.13
POSITIVE LOGITS
board
0.22
ilon
0.20
boards
0.18
Board
0.16
Parents
0.15
ende
0.15
steering
0.15
governors
0.15
ujet
0.15
apiro
0.15
Activations Density 0.056%