INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.09
1:0.08
2:0.08
3:0.08
4:0.09
5:0.07
6:0.06
7:0.09
8:0.08
9:0.07
10:0.08
11:0.09
Negative Logits
Tatt           -1.86
Joined         -1.67
��             -1.66
Dra            -1.63
Legislation    -1.60
Dwarf          -1.59
Malk           -1.58
Spending       -1.55
Scot           -1.55
Philos         -1.53
Positive Logits
pload          2.10
ague           2.01
hots           1.78
await          1.76
apons          1.67
gor            1.67
cano           1.63
upid           1.62
ample          1.62
azel           1.61
Activations Density 0.000%
No Known Activations
This feature has no known activations.