INDEX
Explanations
mentions of people in managerial or leadership roles
New Auto-Interp
Negative Logits
romy
-0.77
Leafs
-0.66
Reincarn
-0.63
Crimes
-0.63
Territories
-0.62
Revival
-0.62
Transit
-0.61
Sleeping
-0.60
Kingdoms
-0.59
Warfare
-0.59
POSITIVE LOGITS
esses
1.05
hip
0.98
urally
0.95
ess
0.91
alty
0.89
beware
0.86
ial
0.83
essing
0.81
hesis
0.80
hips
0.79
Activations Density 2.384%