INDEX
Explanations
references to ivory
references to ivory and its trade
New Auto-Interp
Negative Logits
________________
-0.76
Rog
-0.68
dies
-0.67
uberty
-0.67
Rap
-0.67
tered
-0.66
drivers
-0.66
igating
-0.65
LAN
-0.64
UAL
-0.63
POSITIVE LOGITS
ivory
1.37
Ivory
1.11
poaching
0.91
elephant
0.90
pyramid
0.82
wana
0.79
elephants
0.77
toile
0.75
opard
0.72
otte
0.71
Activations Density 0.012%