INDEX
Explanations
references to race and historical context related to oppression
Follows infinitive "to"
to + verb structures
New Auto-Interp
Negative Logits
beschik
-0.78
écout
-0.72
feroit
-0.69
déchir
-0.69
auroit
-0.68
ainfi
-0.67
avoient
-0.67
démocr
-0.65
équip
-0.64
arrêté
-0.64
POSITIVE LOGITS
romanti
1.01
legitim
0.90
glor
0.85
glorify
0.84
glam
0.84
legiti
0.78
glor
0.77
normalize
0.74
gloss
0.73
objec
0.72
Activations Density 0.486%