INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.06
2:0.08
3:0.08
4:0.09
5:0.09
6:0.09
7:0.07
8:0.08
9:0.08
10:0.07
11:0.08
Negative Logits
pas
-1.73
セ
-1.57
apon
-1.56
Lear
-1.50
Span
-1.50
eas
-1.50
Reviewer
-1.44
tim
-1.43
von
-1.43
ircraft
-1.41
POSITIVE LOGITS
competing
1.61
undecided
1.60
defect
1.47
moot
1.41
icts
1.40
Saud
1.39
elsius
1.34
automakers
1.34
testosterone
1.33
merits
1.33
Activations Density 0.000%
No Known Activations
This feature has no known activations.