INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.07
2:0.09
3:0.08
4:0.08
5:0.08
6:0.09
7:0.07
8:0.09
9:0.07
10:0.07
11:0.07
Negative Logits
Grassley
-1.81
ibly
-1.75
compromise
-1.73
enegger
-1.71
lobb
-1.62
denomin
-1.60
pursu
-1.58
Burke
-1.58
consumers
-1.58
Lauder
-1.54
POSITIVE LOGITS
ipedia
2.09
duration
2.08
prison
1.93
rys
1.88
info
1.81
ilk
1.78
intern
1.71
num
1.68
1001
1.68
tro
1.68
Activations Density 0.000%
No Known Activations
This feature has no known activations.