INDEX
Explanations
adjectives describing appearance or condition
descriptive adjectives indicating quality or similarity
New Auto-Interp
Negative Logits
mental
-0.75
ricular
-0.75
Childhood
-0.70
byter
-0.69
shire
-0.65
iculty
-0.63
selling
-0.61
igion
-0.60
ritical
-0.60
submission
-0.59
POSITIVE LOGITS
lifeless
0.79
bones
0.79
bley
0.79
suspic
0.74
sleek
0.73
ugly
0.72
shiny
0.70
pretty
0.69
differently
0.69
suspicious
0.69
Activations Density 0.144%