INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.07
2:0.09
3:0.09
4:0.07
5:0.08
6:0.08
7:0.08
8:0.08
9:0.08
10:0.08
11:0.07
Negative Logits
Trivia
-1.77
uations
-1.75
Cooldown
-1.66
Photos
-1.65
Shots
-1.65
Stars
-1.63
rences
-1.59
Tweet
-1.58
alls
-1.56
Shares
-1.52
POSITIVE LOGITS
fortun
1.92
streng
1.81
terday
1.73
��
1.69
zsche
1.65
largeDownload
1.64
ablishment
1.62
̶
1.62
corros
1.62
decency
1.61
Activations Density 0.000%
No Known Activations
This feature has no known activations.