INDEX
Explanations
words related to occupations or roles within a community
words with the suffix "-er," indicating roles, professions, or characteristics
Negative Logits
SIZE        -0.78
emetery     -0.75
ãĥĥãĥī      -0.75
ī           -0.74
Lead        -0.67
ori         -0.67
chwitz      -0.65
ij          -0.65
ij士        -0.64
ĸļ士        -0.64
Positive Logits
extraord    1.55
beware      0.93
gery        0.90
stakes      0.84
izer        0.73
jack        0.72
digy        0.70
glers       0.69
oad         0.67
agonist     0.66
Activations Density 0.218%