INDEX
Explanations
mentions of proper nouns related to the word 'Stone'
references to "Stone" and associated terms
Negative Logits
merce        -0.82
oresc        -0.82
ership       -0.79
orescence    -0.78
olulu        -0.76
ornia        -0.72
unct         -0.70
olitan       -0.69
ores         -0.66
NCT          -0.65
Positive Logits
hill         0.99
falls        0.96
castle       0.90
cold         0.87
works        0.85
wright       0.85
lake         0.85
fish         0.81
Soup         0.80
Cold         0.79
Activations Density 0.053%