INDEX
Explanations
names and mentions of individuals, particularly in professional contexts
New Auto-Interp
Head Attr Weights
0:0.16
1:0.02
2:0.00
3:0.03
4:0.03
5:0.51
6:0.05
7:0.02
8:0.06
9:0.03
10:0.01
11:0.02
Negative Logits
Rih
-2.39
Peb
-2.38
ヴ
-2.35
Sarah
-2.29
Suff
-2.25
Jamaica
-2.22
Deborah
-2.22
Sao
-2.22
Nur
-2.19
Florence
-2.18
POSITIVE LOGITS
Companies
2.23
newsletters
2.08
wagen
2.07
warranties
2.06
mercial
2.02
tools
1.98
Interactive
1.94
constructive
1.92
anium
1.91
advising
1.91
Activations Density 0.007%