INDEX
Explanations
mentions of the word "convenience" in a text
references to convenience
Negative Logits
ynt        -0.84
orns       -0.83
vae        -0.82
orn        -0.79
lem        -0.74
volent     -0.73
ewski      -0.72
ongyang    -0.71
orian      -0.70
oric       -0.70
Positive Logits
convenience      0.92
gratification    0.84
龍契士           0.81
store            0.80
ously            0.76
inconven         0.72
Stores           0.71
conven           0.70
pleasures        0.68
stores           0.67
Activations Density 0.024%