INDEX
Explanations
mentions of the word "stone" in the text
references to the word "stone" in various contexts
New Auto-Interp
Negative Logits
oresc
-0.91
merce
-0.87
ersive
-0.83
ornia
-0.83
unct
-0.81
acy
-0.76
rha
-0.76
largeDownload
-0.76
orea
-0.75
inen
-0.75
POSITIVE LOGITS
stone
1.09
stones
1.02
fish
0.95
hill
0.94
castle
0.87
lake
0.86
works
0.86
rocks
0.85
bats
0.84
slab
0.83
Activations Density 0.017%