INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.05
2:0.08
3:0.09
4:0.08
5:0.08
6:0.08
7:0.07
8:0.08
9:0.08
10:0.09
11:0.09
Negative Logits
intermittent
-1.82
artif
-1.78
privile
-1.75
resp
-1.70
acknowled
-1.65
extingu
-1.65
ciating
-1.64
abnormal
-1.63
subsequ
-1.62
concess
-1.61
POSITIVE LOGITS
spin
1.76
Turtles
1.70
stars
1.70
birds
1.69
perture
1.61
atoon
1.60
mobile
1.58
moons
1.57
Ten
1.56
OTUS
1.55
Activations Density 0.000%
No Known Activations
This feature has no known activations.