INDEX
Explanations
occurrences of specific nouns and their attributes related to individuals, actions, and circumstances in criminal news stories
New Auto-Interp
Negative Logits
asser
-0.07
bine
-0.07
Formatter
-0.07
OfString
-0.07
ãĥ³ãĥĢ
-0.07
erk
-0.07
stype
-0.07
utes
-0.07
ÑĥлÑĮÑĤа
-0.07
elere
-0.07
POSITIVE LOGITS
man
0.07
woman
0.07
echan
0.06
someone
0.06
male
0.06
young
0.06
ged
0.06
odka
0.06
ac
0.05
employee
0.05
Activations Density 0.013%