INDEX
Explanations
key terms related to civil rights discussions and educational contexts
New Auto-Interp
Negative Logits
(“
-0.18
">//
-0.17
“[
-0.15
“â̦
-0.15
åŃĺäºİ
-0.15
â̦â̦ãĢĤ
-0.15
“
-0.14
----------------------------------------------------------------------------------------------------------------
-0.14
á»ķ
-0.14
weep
-0.14
POSITIVE LOGITS
,↵
0.26
.↵
0.25
).↵
0.23
",↵
0.23
\↵
0.22
',↵
0.21
ãĢĤ↵
0.21
ãĢģ↵
0.20
').↵
0.20
".↵
0.20
Activations Density 0.263%