INDEX
Explanations
adjectives and adverbs that describe attributes or qualities
Head Attr Weights
0:0.02
1:0.03
2:0.04
3:0.42
4:0.02
5:0.02
6:0.04
7:0.15
8:0.04
9:0.08
10:0.06
11:0.03
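
The head attribution weights listed above can be ranked to see which attention head contributes most to this feature. The snippet below is a minimal sketch, not part of the original dashboard: the names `head_weights` and `rank_heads` are hypothetical, and the values are copied by hand from the listing. It assumes the weights are comparable shares; they sum to about 0.95 as displayed, presumably because of rounding.

```python
# Head attribution weights copied from the listing (head index -> weight).
# The dict and the helper below are illustrative names, not a real API.
head_weights = {
    0: 0.02, 1: 0.03, 2: 0.04, 3: 0.42,
    4: 0.02, 5: 0.02, 6: 0.04, 7: 0.15,
    8: 0.04, 9: 0.08, 10: 0.06, 11: 0.03,
}


def rank_heads(weights):
    """Return (head, weight) pairs sorted from largest to smallest weight."""
    return sorted(weights.items(), key=lambda kv: kv[1], reverse=True)


ranked = rank_heads(head_weights)
top_head, top_weight = ranked[0]
total = sum(head_weights.values())

# Report the dominant head and its share of the listed weight.
print(f"top head: {top_head}, weight: {top_weight:.2f}, "
      f"share of listed total: {top_weight / total:.1%}")
```

On these numbers, head 3 dominates, with head 7 a distant second.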
Negative Logits
oslav        -1.43
heast        -1.41
olate        -1.27
ationally    -1.26
auc          -1.26
ua           -1.24
uliffe       -1.16
azard        -1.16
abilia       -1.14
DPR          -1.12
Positive Logits
Regiment     1.33
ienne        1.19
Gray         1.13
76561        1.12
ansas        1.11
lied         1.10
clad         1.10
udeb         1.09
Lie          1.08
Jub          1.08
Activations Density 0.031%