INDEX
Explanations
mentions of opinions or statements being expressed verbally by individuals
words related to expressing opinions or concerns
Negative Logits
Token      Logit
adr        -0.75
ney        -0.70
neys       -0.68
ensation   -0.68
leans      -0.67
igm        -0.66
••••       -0.66
ionage     -0.64
eh         -0.64
dry        -0.63
Positive Logits
Token      Logit
voiced     1.34
voicing    1.21
voices     1.00
voice      0.96
vocal      0.94
voic       0.94
chords     0.89
unci       0.88
artic      0.86
conson     0.83
Activations Density 0.005%