INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.08
1:0.07
2:0.08
3:0.09
4:0.07
5:0.07
6:0.09
7:0.06
8:0.09
9:0.07
10:0.09
11:0.07
Negative Logits
Dota:-3.01
Frames:-2.99
Overwatch:-2.89
StarCraft:-2.74
Starcraft:-2.73
Arkham:-2.69
dystopian:-2.63
sych:-2.60
cereal:-2.56
Dream:-2.56
Positive Logits
nels:3.16
nc:2.95
tn:2.74
..............:2.69
nel:2.60
psey:2.57
unt:2.55
vt:2.54
arse:2.48
INC:2.48
Activations Density: 0.000%
No Known Activations
This feature has no known activations.