INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.07
2:0.07
3:0.08
4:0.08
5:0.08
6:0.07
7:0.08
8:0.08
9:0.08
10:0.08
11:0.08
Negative Logits
shock
-1.59
Hurt
-1.58
sponsors
-1.51
buster
-1.46
Shade
-1.44
rgb
-1.43
SPONSORED
-1.43
osphere
-1.42
sil
-1.39
Sab
-1.39
POSITIVE LOGITS
tein
2.02
sqor
1.99
Parables
1.94
ichever
1.84
theless
1.69
utmost
1.62
vow
1.62
*/(
1.60
ouk
1.58
ongyang
1.57
Activations Density 0.000%
No Known Activations
This feature has no known activations.