INDEX
Explanations
locations or entities mentioned in a news or crime report
instances of the word "the"
New Auto-Interp
Negative Logits
omics
-0.79
heit
-0.72
ç¥ŀ
-0.72
æĹ
-0.71
etically
-0.71
ipedia
-0.70
igue
-0.69
RGB
-0.69
ingly
-0.69
notes
-0.69
POSITIVE LOGITS
incident
1.16
latter
1.14
offending
1.05
remainder
1.03
altercation
1.02
alleged
1.01
same
0.99
arresting
0.99
ensuing
0.97
aforementioned
0.97
Activations Density 0.617%