Explanations
references to a specific character or entity associated with leadership
Negative Logits
Sri        -0.65
eled       -0.62
ERC        -0.62
Jian       -0.58
jazz       -0.58
els        -0.58
ainted     -0.58
Lovecraft  -0.57
ELS        -0.57
zza        -0.56
Positive Logits
dash       1.33
minster    1.01
rama       0.98
lein       0.92
hoe        0.90
stood      0.86
lund       0.86
meyer      0.84
ocket      0.84
wald       0.84
Activations Density: 0.006%