INDEX
Explanations
proper nouns, particularly names of individuals and notable entities
Head Attr Weights
0:0.03
1:0.10
2:0.02
3:0.02
4:0.03
5:0.40
6:0.02
7:0.01
8:0.04
9:0.14
10:0.10
11:0.03
Negative Logits
PN        -1.70
endemic   -1.59
metic     -1.56
Trop      -1.50
Balt      -1.43
unexpl    -1.43
apons     -1.43
xp        -1.41
ppo       -1.40
ngth      -1.38
Positive Logits
&             1.76
etts          1.71
ulously       1.69
and           1.55
\":           1.53
photographed  1.49
icipated      1.48
±             1.47
iott          1.44
ihara         1.42
Activations Density 0.078%