INDEX
Explanations
references to societal structures and community elements
New Auto-Interp
Negative Logits
edic
-0.17
па
-0.15
shan
-0.15
Insecta
-0.15
XM
-0.15
shake
-0.15
Tenn
-0.15
svc
-0.15
oku
-0.15
logy
-0.15
POSITIVE LOGITS
ides
0.21
it
0.20
in
0.19
ide
0.19
inen
0.18
ink
0.18
id
0.17
ins
0.17
its
0.17
itung
0.17
Activations Density 0.050%