INDEX
Explanations
references to specific individuals or characters associated with leadership and influence
New Auto-Interp
Negative Logits
ees
-0.24
eh
-0.23
ele
-0.20
ee
-0.20
ean
-0.20
tı
-0.20
eam
-0.20
stown
-0.20
ez
-0.20
ever
-0.19
POSITIVE LOGITS
awn
0.31
s
0.24
AWN
0.22
(es
0.22
øj
0.22
midt
0.22
mallow
0.21
à¥įà¤ļ
0.21
hh
0.21
ields
0.21
Activations Density 0.071%