INDEX
Explanations
mentions of education, being educated, or educating others
New Auto-Interp
Negative Logits
imposed
-0.64
apan
-0.64
ity
-0.63
launcher
-0.60
ettes
-0.60
dimension
-0.59
apa
-0.59
dimension
-0.58
containment
-0.58
hedral
-0.57
POSITIVE LOGITS
guesses
0.94
liter
0.83
otle
0.81
eering
0.77
ourgeois
0.77
llor
0.77
renheit
0.75
ĨĴ
0.74
habi
0.73
irect
0.70
Activations Density 0.052%