INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.08
2:0.08
3:0.06
4:0.07
5:0.08
6:0.08
7:0.08
8:0.08
9:0.09
10:0.08
11:0.08
Negative Logits
angular
-1.88
oots
-1.75
idable
-1.73
atin
-1.67
next
-1.65
api
-1.64
cod
-1.62
otos
-1.62
inders
-1.61
own
-1.58
POSITIVE LOGITS
¶
1.69
=================
1.67
criticised
1.67
Lange
1.61
quickShipAvailable
1.59
omission
1.58
nutshell
1.48
defamation
1.46
Anthrop
1.46
ⓘ
1.45
Activations Density 0.000%
No Known Activations
This feature has no known activations.