INDEX
Explanations
concepts related to amplifying voices and making them heard
New Auto-Interp
Negative Logits
phil
-0.16
WEEN
-0.15
DataReader
-0.15
ır
-0.14
amor
-0.14
odge
-0.14
hugs
-0.14
iro
-0.13
ost
-0.13
à¸Ńà¸ļ
-0.13
POSITIVE LOGITS
voice
0.38
voices
0.33
voice
0.30
Voice
0.27
vo
0.26
voices
0.25
_voice
0.24
louder
0.24
silenced
0.23
Voices
0.23
Activations Density 0.134%