INDEX
Explanations
references to servant leadership
New Auto-Interp
Negative Logits
er
-0.16
گاÙĩ
-0.16
es
-0.15
wig
-0.15
ed
-0.15
Services
-0.14
lag
-0.14
eg
-0.14
zung
-0.14
ubar
-0.14
POSITIVE LOGITS
illance
0.26
iced
0.25
itude
0.23
icing
0.19
ANTS
0.17
ants
0.17
izio
0.17
iteur
0.16
annah
0.16
atore
0.16
Activations Density 0.011%