INDEX
Explanations
names with "hart" or "hardt"
references to specific leaders and their associated names
New Auto-Interp
Negative Logits
Magikarp
-0.70
URI
-0.69
millenn
-0.68
ntil
-0.67
veter
-0.65
tampering
-0.65
sclerosis
-0.65
pty
-0.63
corros
-0.61
sarc
-0.60
POSITIVE LOGITS
hart
0.97
awi
0.93
ford
0.85
ãĤ¤ãĥĪ
0.82
enstein
0.81
hardt
0.80
Ducks
0.79
enberg
0.76
ilton
0.75
Jr
0.73
Activations Density 0.019%