INDEX
Explanations
references to elephants, particularly related to elephant poaching and ivory
references to elephants and issues related to them
Negative Logits
lly       -0.80
nder      -0.77
pring     -0.74
âĸ¬       -0.72
ername    -0.71
ndra      -0.70
lying     -0.70
HER       -0.69
nergy     -0.69
nerg      -0.68
Positive Logits
iasis      1.33
elephant   0.95
elephants  0.89
Haram      0.84
ivory      0.83
ota        0.81
opard      0.79
Elephant   0.77
herds      0.76
calf       0.75
Activations Density 0.043%