INDEX
Explanations
adjectives related to strong negative emotions, particularly bitterness
references to bitterness in various contexts
Negative Logits
arlane    -0.80
upload    -0.78
allows    -0.73
Upload    -0.73
ulhu      -0.72
nesota    -0.71
aucus     -0.71
001       -0.70
gov       -0.69
urat      -0.69
Positive Logits
terness       0.94
bitter        0.91
ries          0.84
grapes        0.84
parting       0.81
bitterness    0.79
sour          0.77
sweet         0.77
tasting       0.77
heart         0.76
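
The logit lists above rank tokens by how strongly the feature pushes their output logits up or down. The source does not say how these values were computed. A common approach, assumed here, is to take the dot product of the feature's output direction with each token's unembedding vector and then keep the extremes. The sketch below illustrates that ranking step in plain Python. The names `logit_effects` and `top_and_bottom`, the 2-dimensional vectors, and all numbers are hypothetical and are not taken from the dashboard.

```python
# Toy sketch: rank vocabulary tokens by how much a feature direction
# raises or lowers their logits. All vectors below are made up.

def logit_effects(direction, unembed):
    """Dot each token's unembedding row with the feature direction."""
    return {
        tok: sum(w * d for w, d in zip(row, direction))
        for tok, row in unembed.items()
    }

def top_and_bottom(effects, k):
    """Return the k most positive and k most negative (token, effect) pairs."""
    ranked = sorted(effects.items(), key=lambda kv: kv[1], reverse=True)
    return ranked[:k], ranked[::-1][:k]

# Hypothetical 2-dimensional feature direction and unembedding rows.
direction = [1.0, 0.5]
unembed = {
    "bitter": [0.8, 0.2],
    "sweet": [0.6, 0.3],
    "upload": [-0.7, -0.1],
    "gov": [-0.5, -0.4],
}

effects = logit_effects(direction, unembed)
positive, negative = top_and_bottom(effects, k=2)

for tok, val in positive + negative:
    print(f"{tok:<10}{val:+.2f}")
```

In this toy setup the positive list starts with `bitter` and the negative list starts with `upload`, mirroring the shape of the lists above, though the real values come from a full-size model and vocabulary.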
Activations Density 0.036%
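
The density figure is read here as the percentage of sampled token positions on which the feature fires at all. That reading is an assumption, since the source gives only the number. Under it, the calculation can be sketched as follows. The function name `activation_density`, the zero threshold, and the sample data are hypothetical, and the sample is sized only so that it lands on the same percentage.

```python
def activation_density(activations, threshold=0.0):
    """Percentage of activations strictly above the threshold."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)

# Hypothetical sample: 100,000 token positions, 36 of which fire.
sample = [0.0] * 99_964 + [1.0] * 36
density = activation_density(sample)
print(f"{density:.3f}%")
```

A density this low means the feature is silent on almost every token, which is consistent with a narrow concept such as bitterness.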