INDEX
Explanations
mentions of the word "rim" with varying activations, possibly indicating a focus on terms related to the outer edge of something or basketball terminology
terms related to the concept of a "rim."
New Auto-Interp
Negative Logits
herty
-0.72
anders
-0.67
Interstitial
-0.66
orth
-0.63
CHO
-0.61
ggies
-0.60
Homes
-0.59
ence
-0.59
EE
-0.58
Strange
-0.58
POSITIVE LOGITS
Rim
1.25
senal
1.18
rim
1.09
med
0.95
assic
0.87
ned
0.84
ming
0.84
mast
0.80
fires
0.77
blaster
0.75
Activations Density 0.006%