INDEX
Explanations
words and phrases related to historical injustices and social issues
New Auto-Interp
Negative Logits
ा:
-0.17
ãĢĬ
-0.17
Fcn
-0.16
leck
-0.16
bdb
-0.15
imuth
-0.15
ExecutionContext
-0.15
emek
-0.14
Rudd
-0.14
mund
-0.14
POSITIVE LOGITS
]
0.17
}
0.17
[]
0.17
"
0.16
iej
0.15
[s
0.14
ÂĶ
0.14
utor
0.14
phan
0.14
417
0.14
Activations Density 0.171%