INDEX
Explanations
phrases related to activism and social causes
repeated phrases or expressions that emphasize a sentiment or idea indicating frustration or discontent
New Auto-Interp
Negative Logits
adolesc
-0.82
mathemat
-0.81
hemor
-0.80
anium
-0.79
Seym
-0.78
imitation
-0.74
fortun
-0.74
oscope
-0.74
captives
-0.72
Palestin
-0.71
POSITIVE LOGITS
ï¸
1.02
ï¸ı
0.90
Balt
0.82
own
0.78
Ru
0.77
nder
0.76
¯
0.75
女
0.74
ishable
0.73
wise
0.73
Activations Density 0.231%