INDEX
Explanations
names of individuals or proper nouns related to specific people or entities
New Auto-Interp
Head Attr Weights
0:0.06
1:0.02
2:0.09
3:0.06
4:0.09
5:0.09
6:0.03
7:0.09
8:0.08
9:0.08
10:0.16
11:0.10
Negative Logits
suspic
-1.02
cryst
-0.84
reluct
-0.82
Rove
-0.80
pony
-0.80
millenn
-0.79
corrid
-0.79
Pg
-0.78
enthusi
-0.78
unfavorable
-0.78
POSITIVE LOGITS
oglu
1.08
III
1.05
ensis
1.03
Aut
0.97
angan
0.95
abis
0.90
uala
0.88
allas
0.88
Parish
0.86
uminium
0.86
Activations Density 0.585%