INDEX
Explanations
words related to giving a new perspective or interpretation
instances of the word "spin" in various contexts
New Auto-Interp
Negative Logits
avis
-0.72
Admir
-0.71
Commodore
-0.67
ecause
-0.65
inances
-0.64
inez
-0.63
Scores
-0.62
ĨĴ
-0.62
enance
-0.62
Defenders
-0.59
POSITIVE LOGITS
ners
1.42
spin
1.09
spin
0.96
kered
0.91
eless
0.91
wheel
0.86
cher
0.82
rot
0.82
ball
0.80
ned
0.80
Activations Density 0.009%