INDEX
Explanations
mentions of the word "rock" in various contexts
mentions of rocks and related geological terms
Negative Logits
ufact        -0.98
rha          -0.77
rification   -0.73
oresc        -0.72
ples         -0.69
xus          -0.69
NOTICE       -0.66
ornia        -0.65
ntil         -0.65
mble         -0.64
Positive Logits
castle       0.94
er           0.89
ers          0.88
formations   0.84
papers       0.82
lake         0.82
stars        0.80
solid        0.80
stead        0.80
climbers     0.79
Activations Density 0.015%