INDEX
Explanations
various words related to locations and events
proper nouns related to places, specifically names of towns or cities
Head Attr Weights
0:0.04
1:0.02
2:0.12
3:0.03
4:0.31
5:0.06
6:0.02
7:0.02
8:0.08
9:0.17
10:0.04
11:0.02
Negative Logits
olars: -1.32
np: -1.27
Plex: -1.26
doi: -1.25
href: -1.23
occup: -1.21
fw: -1.19
btn: -1.19
mosqu: -1.19
Redd: -1.19
Positive Logits
steen: 1.75
uration: 1.36
manslaughter: 1.36
Booker: 1.36
tein: 1.33
anders: 1.33
prose: 1.31
hire: 1.30
Kane: 1.27
blasphemy: 1.27
Activations Density: 0.002%