INDEX
Explanations
references to group dynamics and cooperation
New Auto-Interp
Negative Logits
emarks
-0.17
zik
-0.17
serter
-0.16
obraz
-0.16
ovice
-0.16
ABC
-0.15
mrb
-0.14
vertis
-0.14
Discovery
-0.14
icolor
-0.14
POSITIVE LOGITS
geber
0.17
Carr
0.16
Weiner
0.16
Juda
0.15
ieri
0.15
anst
0.14
inger
0.14
/Internal
0.14
ëĵ
0.14
NonNull
0.14
Activations Density 0.009%