INDEX
Explanations
negative superlatives, specifically the word "least."
phrases indicating minimal or low conditions
New Auto-Interp
Negative Logits
ses
-0.73
halla
-0.72
clerosis
-0.70
krit
-0.68
prototype
-0.66
bj
-0.66
selves
-0.65
kas
-0.65
bind
-0.65
raltar
-0.64
POSITIVE LOGITS
imaginable
0.79
practicable
0.78
conceivable
0.77
toler
0.76
plausible
0.75
intrusive
0.74
amount
0.71
favourable
0.70
possible
0.70
conspicuous
0.69
Activations Density 0.016%