INDEX
Explanations
words that indicate professions or roles ending in 'ist' or 'ists'
New Auto-Interp
Negative Logits
FACE
-0.85
MENTS
-0.75
ç«
-0.75
é¾
-0.74
shown
-0.73
ãģ®éŃĶ
-0.72
ãĥīãĥ©
-0.71
zens
-0.66
ghazi
-0.65
APTER
-0.64
POSITIVE LOGITS
extraord
0.86
otle
0.84
ische
0.81
ophical
0.79
ischer
0.79
ophe
0.77
emi
0.77
ribution
0.75
atics
0.75
tendencies
0.74
Activations Density 0.032%