INDEX
Explanations
names of people
references to specific individuals and their actions or roles
New Auto-Interp
Negative Logits
Zel
-0.95
ule
-0.89
cryptoc
-0.83
orthern
-0.83
629
-0.81
Tooth
-0.81
Phillies
-0.80
philos
-0.80
Chip
-0.80
raph
-0.80
POSITIVE LOGITS
Anderson
1.77
Anderson
1.71
Evans
1.09
Mund
1.00
Concord
0.92
Henderson
0.92
Andersen
0.91
und
0.91
Amanda
0.90
Ev
0.88
Activations Density 0.374%