INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.09
2:0.08
3:0.08
4:0.08
5:0.09
6:0.08
7:0.07
8:0.07
9:0.07
10:0.07
11:0.08
Negative Logits
canvas
-2.44
colored
-2.40
...
-2.38
Hawaiian
-2.37
HI
-2.36
reversible
-2.28
hod
-2.24
Fair
-2.24
descriptive
-2.24
Constantin
-2.23
POSITIVE LOGITS
gob
3.12
gobl
2.85
GBT
2.82
Breitbart
2.68
FactoryReloaded
2.67
FB
2.67
UCH
2.67
mber
2.64
龍契士
2.64
tf
2.57
Activations Density 0.000%
No Known Activations
This feature has no known activations.