INDEX
Explanations
proper names and titles of individuals, often related to media or entertainment
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.05
3:0.06
4:0.04
5:0.04
6:0.33
7:0.12
8:0.05
9:0.08
10:0.09
11:0.05
Negative Logits
facult
-1.44
worldly
-1.36
yip
-1.30
fml
-1.27
ificantly
-1.23
BLIC
-1.17
iable
-1.15
igible
-1.13
Dhabi
-1.11
ensibly
-1.11
POSITIVE LOGITS
Bride
1.36
herself
1.27
rette
1.26
opol
1.20
xxx
1.17
��
1.16
Jen
1.14
aminer
1.14
sch
1.14
pson
1.13
Activations Density 0.045%