INDEX
Explanations
terms related to specific viewpoints or ideologies, particularly focusing on the suffix "-ist."
terms related to various ideologies, particularly those ending in 'ist'
Negative Logits
Token          Logit
ecause         -0.78
perty          -0.68
ilings         -0.67
sembly         -0.67
accompanied    -0.67
BOX            -0.66
speedy         -0.66
tesy           -0.66
upon           -0.66
smoot          -0.64
Positive Logits
Token          Logit
ess            0.93
geist          0.85
extraord       0.85
ãĤº            0.80
opol           0.77
otle           0.75
tendencies     0.75
essed          0.72
rior           0.72
ophe           0.71
Activations Density 0.027%