INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.07
1:0.08
2:0.08
3:0.09
4:0.10
5:0.07
6:0.07
7:0.09
8:0.07
9:0.08
10:0.07
11:0.09
Negative Logits
-2.56
patronage
-2.53
pronouns
-2.38
custody
-2.32
LW
-2.30
mberg
-2.30
endorsements
-2.27
paws
-2.25
CASE
-2.25
esville
-2.25
POSITIVE LOGITS
Than
2.63
Bulgar
2.59
Mic
2.57
Sat
2.52
guiIcon
2.46
ttle
2.43
haps
2.43
hy
2.42
nown
2.39
ubby
2.37
Activations Density 0.000%
No Known Activations
This feature has no known activations.