INDEX
Explanations
quiet and secretive communication in the form of whispers
words and phrases related to quiet communication or subtle hints
New Auto-Interp
Negative Logits
ouf
-0.78
andals
-0.76
arcity
-0.74
onut
-0.73
unes
-0.73
folios
-0.73
bard
-0.71
ticket
-0.70
ocr
-0.69
aples
-0.69
POSITIVE LOGITS
whisper
1.39
whispers
1.21
whispering
1.18
whispered
1.05
aloud
1.00
softly
0.91
louder
0.86
omin
0.82
murm
0.78
uttered
0.77
Activations Density 0.014%