Explanations
words and phrases related to unity, leadership, and organizational structure
Negative Logits
rus         -0.17
/fw         -0.16
outine      -0.15
朋          -0.14
aze         -0.14
dev         -0.14
rze         -0.13
gend        -0.13
ibu         -0.13
ginas       -0.13
Positive Logits
еди         0.17
istrovství  0.17
oya         0.16
<source     0.15
/shared     0.15
elik        0.15
GPC         0.14
arpa        0.14
born        0.14
esModule    0.14
Activations Density 0.157%