INDEX
Explanations
words related to circular objects like "ring"
occurrences of the word "ring" in various contexts
New Auto-Interp
Negative Logits
matter
-0.74
ives
-0.73
icago
-0.72
essors
-0.67
Freeze
-0.57
Haas
-0.57
autop
-0.57
tical
-0.55
orial
-0.55
ISS
-0.55
POSITIVE LOGITS
tone
1.60
leader
1.48
leaders
1.39
tones
1.39
git
1.08
worm
1.07
wra
1.03
senal
0.93
bone
0.90
master
0.89
Activations Density 0.042%