INDEX
Explanations
the presence of brief statements or descriptions of roles and their significance
New Auto-Interp
Head Attr Weights
0:0.09
1:0.02
2:0.04
3:0.04
4:0.05
5:0.03
6:0.22
7:0.05
8:0.09
9:0.27
10:0.03
11:0.02
Negative Logits
Ü
-4.16
�
-3.53
lycer
-3.52
Ö
-3.38
depos
-3.34
isot
-3.31
onna
-3.29
indo
-3.29
SUN
-3.24
cookie
-3.14
POSITIVE LOGITS
Vaughan
11.22
Vaughn
8.18
Vaugh
6.93
Finch
3.89
VILLE
3.84
Kur
3.74
Vernon
3.59
Harper
3.53
Hutchinson
3.53
Pence
3.53
Activations Density 0.001%