INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.07
1:0.05
2:0.09
3:0.08
4:0.08
5:0.08
6:0.08
7:0.10
8:0.08
9:0.08
10:0.09
11:0.08
Negative Logits
=-=-=-=-=-=-=-=-  -1.55
Keynes  -1.46
ticking  -1.46
Rosenthal  -1.41
Product  -1.36
unfinished  -1.35
Guinness  -1.34
product  -1.34
curated  -1.33
ß  -1.32
Positive Logits
etheless  1.92
ornia  1.86
��  1.78
ledged  1.78
ioch  1.67
viol  1.66
erton  1.65
gered  1.60
ava  1.59
ihar  1.56
Activations Density 0.000%
No Known Activations
This feature has no known activations.