INDEX
Explanations
references to the word "Stone"
mentions of the word "Stone."
Negative Logits
Token        Logit
merce        -0.90
olulu        -0.85
oresc        -0.82
orescence    -0.78
ornia        -0.78
unct         -0.72
ntil         -0.72
uates        -0.72
unal         -0.71
ership       -0.70
Positive Logits
Token        Logit
hill         0.97
works        0.93
falls        0.91
lings        0.88
hook         0.87
house        0.84
castle       0.84
Age          0.84
ring         0.83
Cold         0.82
Activations Density: 0.011%