INDEX
Explanations
words related to strong interests or fixations, often bordering on unhealthy obsessions
terms related to obsession and fixation
New Auto-Interp
Negative Logits
soType
-0.71
stood
-0.66
versible
-0.63
Medic
-0.61
eli
-0.60
RIS
-0.60
detrim
-0.59
comings
-0.59
ells
-0.58
RNC
-0.58
POSITIVE LOGITS
ishly
0.93
iously
0.84
obsess
0.81
fascination
0.79
fixation
0.76
fascinated
0.75
obsessed
0.75
obsession
0.72
obs
0.71
atically
0.71
Activations Density 0.068%