INDEX
Explanations
the word "bored" with different degrees of intensity
expressions of boredom
Negative Logits
annex        -0.68
Lomb         -0.67
Lucia        -0.67
Slovenia     -0.66
Kou          -0.65
Coch         -0.61
Luxembourg   -0.61
Flowers      -0.61
Ivanka       -0.60
Kamp         -0.60
Positive Logits
icion        0.88
oad          0.87
||||         0.86
bored        0.84
repetition   0.76
icip         0.75
hound        0.74
dit          0.71
bots         0.71
jong         0.71
Activations Density: 0.025%