Explanations
names of individuals involved in news stories or events
proper nouns, particularly names of individuals mentioned in criminal contexts
Negative Logits
mathemat        -0.82
guaranteeing    -0.81
cffff           -0.81
optim           -0.77
forecasting     -0.75
bookmark        -0.74
charism         -0.73
pmwiki          -0.72
pse             -0.71
conclud         -0.70
Positive Logits
Doe             1.24
Tsarnaev        1.07
Zimmerman       1.00
Bundy           0.96
Hernandez       0.92
Ramirez         0.92
Paddock         0.91
Martinez        0.87
Rodriguez       0.87
Nguyen          0.86
Activations Density: 0.428%