INDEX
Explanations
personal names
references to specific individuals, particularly names
New Auto-Interp
Negative Logits
NAT
-0.77
Arsenal
-0.74
wolves
-0.73
YP
-0.70
Terror
-0.69
Spawn
-0.69
anski
-0.68
usky
-0.66
Arsenal
-0.66
spawn
-0.65
POSITIVE LOGITS
Bib
3.56
Gib
1.72
Jac
1.53
Jac
1.29
Bun
1.24
Tib
1.22
Heb
1.19
Nib
1.18
Rib
1.15
Pas
1.10
Activations Density 0.067%