INDEX
Explanations
words related to bitterness and conflict
New Auto-Interp
Negative Logits
nesota
-0.81
arlane
-0.73
upload
-0.73
ulhu
-0.71
iques
-0.70
aver
-0.70
utations
-0.69
allows
-0.67
aucus
-0.67
ansion
-0.67
POSITIVE LOGITS
bitter
0.91
terness
0.83
parting
0.83
bitterness
0.80
irony
0.77
disappointment
0.76
cold
0.74
spo
0.72
bitterly
0.71
streaks
0.70
Activations Density 8.730%