INDEX
Explanations
mentions of geographical features, specifically cliffs
instances of the word "cliff" in various contexts
Negative Logits
Token               Logit
Interstitial        -1.04
rity                -0.89
apolis              -0.86
Role                -0.82
alg                 -0.79
ufact               -0.79
natureconservancy   -0.78
zona                -0.76
agara               -0.74
algia               -0.71
Positive Logits
Token               Logit
cliffs              0.93
cliff               0.92
dwellings           0.84
Dwell               0.79
stump               0.77
bluff               0.77
Cliff               0.75
ledge               0.75
Guth                0.74
swall               0.72
Activations Density 0.016%