INDEX
Explanations
references to advertising or promotional content
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.09
3:0.08
4:0.07
5:0.04
6:0.14
7:0.18
8:0.05
9:0.06
10:0.08
11:0.10
Negative Logits
gaard
-1.37
opter
-1.36
scraping
-1.35
nings
-1.26
scrape
-1.26
�
-1.24
Americ
-1.19
rye
-1.19
Saharan
-1.19
rolls
-1.17
POSITIVE LOGITS
ilities
1.35
earances
1.35
ilan
1.33
andise
1.31
imum
1.22
��
1.22
raviolet
1.22
isSpecial
1.21
imon
1.17
Athletics
1.17
Activations Density 0.001%