INDEX
Explanations
proper nouns and specific entities mentioned in news articles
references to specific organizations, places, or notable entities
New Auto-Interp
Negative Logits
$.
-0.88
*.
-0.83
.</
-0.77
!.
-0.72
`.
-0.71
etc
-0.69
â̦.
-0.68
."
-0.68
..
-0.68
.<
-0.68
POSITIVE LOGITS
spokesman
0.99
spokeswoman
0.95
spokesperson
0.92
Tribune
0.75
historian
0.74
memorandum
0.74
ologist
0.73
Herald
0.71
watchdog
0.70
Gazette
0.70
Activations Density 0.793%