Explanations
No Explanations Found
Head Attr Weights
0:0.09
1:0.05
2:0.09
3:0.08
4:0.09
5:0.08
6:0.07
7:0.07
8:0.08
9:0.08
10:0.08
11:0.08
Negative Logits
Inspect       -1.86
reditary      -1.77
experien      -1.63
Slay          -1.63
attendance    -1.62
alleg         -1.57
ⓘ             -1.55
perture       -1.54
census        -1.52
nutrit        -1.51
Positive Logits
dry           1.67
weet          1.61
�             1.60
flame         1.57
aldo          1.54
dist          1.54
eu            1.50
oliath        1.50
Asia          1.50
Fram          1.47
Activations Density 0.000%
This feature has no known activations.