INDEX
Explanations
words related to specific professions or occupations
words that denote roles or professions ending in specific suffixes
New Auto-Interp
Negative Logits
ASED
-0.71
increments
-0.68
Territories
-0.67
Warden
-0.67
conviction
-0.61
bilateral
-0.60
implication
-0.60
Farm
-0.58
Investments
-0.57
Thing
-0.57
POSITIVE LOGITS
hip
1.42
hips
1.22
paces
1.16
ervatives
1.06
mith
1.03
heet
0.98
peak
0.97
pace
0.96
'
0.93
hare
0.92
Activations Density 0.134%