INDEX
Explanations
expressions of anger and protest related to social or political issues
Negative Logits
ECS               -0.46
struk             -0.45
RRect             -0.44
횟                -0.43
Prom              -0.40
large             -0.39
apuestas          -0.39
Prom              -0.39
misu              -0.39
gam               -0.39
Positive Logits
protesting        0.87
disambiguazione   0.85
complains         0.85
protested         0.82
protesta          0.80
complaining       0.78
protes            0.77
protest           0.77
complained        0.75
protests          0.74
Activations Density: 0.229%