Explanations
references to individuals or entities involved in legal or social justice contexts
Negative Logits
Token       Logit
allas       -0.18
敷          -0.17
erif        -0.16
usp         -0.14
aspers      -0.14
Vale        -0.14
Animalia    -0.14
riere       -0.13
airo        -0.13
ime         -0.13
Positive Logits
Token           Logit
also            0.19
Specifically    0.18
then            0.18
thereby         0.17
specifically    0.16
also            0.16
719             0.15
zhou            0.14
Also            0.14
также           0.14
Activations Density 0.163%