INDEX
Explanations
names of individuals who have been arrested or charged with crimes
proper nouns and specific names related to individuals and events
Negative Logits
Token        Logit
parency      -0.72
isphere      -0.72
ashington    -0.69
ipedia       -0.68
ensical      -0.66
fters        -0.65
udos         -0.64
omics        -0.64
ysc          -0.64
igraph       -0.63
Positive Logits
Token        Logit
29           1.11
59           1.06
26           1.05
27           1.05
31           1.05
34           1.04
Jr           1.03
33           1.03
37           1.02
43           1.02
Activations Density 0.210%