INDEX
Explanations
references to authority figures and hierarchical structures in a narrative context
New Auto-Interp
Negative Logits
surla
-0.84
nakalista
-0.78
MessageOf
-0.73
AsUp
-0.64
省市镇
-0.63
otomatig
-0.61
autorytatywna
-0.61
Ծանոթ
-0.59
sorts
-0.58
luents
-0.58
POSITIVE LOGITS
opponent
0.47
+#+#
0.34
investis
0.32
originally
0.30
démocr
0.29
whole
0.29
springfox
0.28
crowd
0.28
เศ
0.28
?>/
0.28
Activations Density 0.088%