INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.07
2:0.06
3:0.09
4:0.09
5:0.08
6:0.07
7:0.07
8:0.08
9:0.09
10:0.08
11:0.08
Negative Logits
endif
-1.66
uren
-1.64
td
-1.61
tis
-1.58
irlf
-1.58
icago
-1.56
rets
-1.53
rette
-1.53
rha
-1.52
ilus
-1.52
POSITIVE LOGITS
Coffin
1.72
hedon
1.71
Miliband
1.66
omorphic
1.63
Gillespie
1.54
Heist
1.53
女
1.53
Dickens
1.47
Kafka
1.45
volunt
1.42
Activations Density 0.000%
No Known Activations
This feature has no known activations.