INDEX
Explanations
legal terminology and references to civil rights
New Auto-Interp
Negative Logits
eç
-0.16
$core
-0.16
comprom
-0.15
agra
-0.15
dese
-0.15
asal
-0.14
äm
-0.14
indow
-0.14
alama
-0.14
cope
-0.14
POSITIVE LOGITS
mens
0.27
mens
0.26
knowledge
0.26
intent
0.25
reasonable
0.24
reasonably
0.24
actus
0.24
prox
0.23
acts
0.22
scient
0.21
Activations Density 0.288%