INDEX
Explanations
topics related to animal rights and welfare
New Auto-Interp
Negative Logits
omo
-0.17
asal
-0.15
amongst
-0.14
haft
-0.14
ffen
-0.14
princip
-0.14
ansom
-0.14
challenge
-0.14
rase
-0.13
lose
-0.13
POSITIVE LOGITS
unei
0.17
hlas
0.16
.createObject
0.16
hasil
0.16
izin
0.15
ICES
0.15
vio
0.14
vik
0.14
ORED
0.14
(č↵
0.14
Activations Density 0.399%