INDEX
Explanations
proper nouns related to specific people, places, or organizations
New Auto-Interp
Head Attr Weights
0:0.03
1:0.03
2:0.13
3:0.11
4:0.35
5:0.03
6:0.05
7:0.04
8:0.03
9:0.05
10:0.05
11:0.04
Negative Logits
erity
-1.76
thora
-1.56
indifference
-1.52
tein
-1.49
cannabin
-1.48
silence
-1.48
icum
-1.43
disregard
-1.41
therein
-1.40
,,,,
-1.39
POSITIVE LOGITS
���
2.06
alysed
1.57
Ranked
1.44
Org
1.40
¯¯¯¯
1.36
haps
1.35
Takes
1.34
��
1.32
Olympia
1.29
pires
1.28
Activations Density 0.001%