INDEX
Explanations
phrases indicating lack of success or disappointment despite effort
instances of the word "avail" and related concepts of availability and privilege
New Auto-Interp
Negative Logits
eric
-0.66
refined
-0.58
Patriarch
-0.58
odor
-0.57
coron
-0.57
Mercury
-0.57
Prin
-0.57
monarchy
-0.56
temperament
-0.54
Devon
-0.54
POSITIVE LOGITS
abilities
1.09
ĸļ
1.03
avail
1.00
urations
0.90
ãģĨ
0.88
heim
0.83
mathemat
0.80
iture
0.80
ifully
0.79
ments
0.79
Activations Density 0.028%