INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.08
1:0.08
2:0.09
3:0.07
4:0.07
5:0.07
6:0.07
7:0.08
8:0.07
9:0.08
10:0.08
11:0.09
Negative Logits
omit          -2.55
omitted       -2.30
ewitness      -2.16
publish       -2.15
substituted   -2.13
discard       -2.05
swapped       -2.02
letter        -2.01
/*            -1.96
reprint       -1.96
Positive Logits
gging         2.06
Morales       2.06
Investigator  2.04
IER           2.03
Funds         1.98
jandro        1.98
uman          1.97
Els           1.94
Liberals      1.91
DonaldTrump   1.90
Activations Density 0.000%
No Known Activations
This feature has no known activations.