INDEX
Explanations
adjectives describing quality or skill, particularly those with positive connotations
expressions of high quality or excellence
New Auto-Interp
Negative Logits
ople
-0.74
eters
-0.71
mber
-0.68
ocry
-0.67
othy
-0.67
pper
-0.67
aters
-0.64
Lyons
-0.63
hip
-0.63
Mellon
-0.62
POSITIVE LOGITS
enough
1.00
sword
0.83
sounding
0.82
luck
0.78
fits
0.75
bye
0.74
nat
0.72
karma
0.72
smelling
0.72
nosis
0.71
Activations Density 0.060%