INDEX
Explanations
advertisements within the text
instances of the word "advertisement"
New Auto-Interp
Negative Logits
hof
-0.76
ommel
-0.75
grass
-0.75
Normandy
-0.72
rum
-0.72
lands
-0.69
oids
-0.68
peak
-0.65
fully
-0.65
gger
-0.65
POSITIVE LOGITS
..........
1.27
ãħĭãħĭ
1.21
ãħĭ
1.00
allery
0.95
ileaks
0.82
sembly
0.81
teasp
0.80
veyard
0.76
aucuses
0.76
olulu
0.75
Activations Density 0.014%