INDEX
Explanations
concepts related to societal issues and challenges, particularly those involving injustice, rights, and labor
New Auto-Interp
Negative Logits
ãĥ´
-0.61
ãĤ±
-0.59
APTER
-0.57
ificant
-0.57
inosaur
-0.57
Mos
-0.56
ghazi
-0.56
itialized
-0.55
ãĤ¨ãĥ«
-0.54
MAC
-0.53
POSITIVE LOGITS
and
0.78
syndrome
0.77
or
0.75
rather
0.72
techniques
0.72
.
0.72
disorder
0.71
("0.69
tactics
0.67
therapy
0.66
Activations Density 0.272%