INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.09
1:0.07
2:0.09
3:0.08
4:0.08
5:0.09
6:0.07
7:0.07
8:0.07
9:0.08
10:0.09
11:0.08
Negative Logits
Task
-3.07
asks
-2.66
missions
-2.61
Crowley
-2.60
obos
-2.55
Rogue
-2.53
amins
-2.50
gifts
-2.50
Pool
-2.48
Minecraft
-2.47
POSITIVE LOGITS
eleph
3.02
ERA
2.77
hypert
2.70
lever
2.65
ariat
2.63
∼
2.58
coronary
2.57
actu
2.57
)—
2.56
rupture
2.56
Activations Density 0.000%
No Known Activations
This feature has no known activations.