INDEX
Explanations
references to convenience
references to convenience and inconvenience
New Auto-Interp
Negative Logits
ynt
-0.87
orn
-0.77
orns
-0.76
vae
-0.74
amm
-0.68
zik
-0.67
interstitial
-0.67
Benz
-0.66
ewski
-0.65
hov
-0.64
POSITIVE LOGITS
convenience
1.05
gratification
0.87
conven
0.80
venient
0.79
inconven
0.77
convenient
0.76
ously
0.76
iences
0.76
familiarity
0.74
store
0.72
Activations Density 0.010%