INDEX
Explanations
instances of the word "f***" in various forms within the text
Head Attr Weights
0:0.02
1:0.03
2:0.08
3:0.26
4:0.02
5:0.02
6:0.05
7:0.22
8:0.07
9:0.05
10:0.07
11:0.04
Negative Logits
quickShipAvailable    -1.50
SEA                   -1.41
theless               -1.34
SPA                   -1.34
ItemImage             -1.32
vantage               -1.29
ggles                 -1.28
Lower                 -1.24
imester               -1.24
OTA                   -1.20
Positive Logits
scrap                 1.44
gigg                  1.28
rices                 1.25
ric                   1.23
bullshit              1.18
boobs                 1.18
culosis               1.15
ibaba                 1.14
scraps                1.14
haunted               1.14
Activations Density 0.012%