INDEX
Explanations
proper nouns related to locations, organizations, and events
proper nouns, particularly names of locations, organizations, and prominent figures
New Auto-Interp
Negative Logits
whilst
-0.78
â̦.
-0.74
chwitz
-0.74
nob
-0.73
..
-0.71
....
-0.71
[_
-0.71
*****
-0.71
..
-0.70
*.
-0.70
POSITIVE LOGITS
spokeswoman
1.08
spokesman
1.06
spokesperson
1.05
Tribune
0.97
duo
0.97
Gazette
0.96
group
0.96
lawmaker
0.96
incident
0.96
trio
0.94
Activations Density 0.359%