INDEX
Explanations
adjectives indicating strong contrast
the word "stark" in various contexts, often relating to contrasts or notable differences
Negative Logits
diligently     -0.69
ipop           -0.69
safely         -0.68
«ĺ             -0.67
APS            -0.66
hops           -0.66
annis          -0.66
uthor          -0.65
conservancy    -0.65
llular         -0.64
Positive Logits
ly             1.16
contrasts      1.16
contrast       1.04
stark          0.92
est            0.89
ness           0.84
er             0.81
difference     0.79
iary           0.78
naked          0.78
Activations Density 0.019%