INDEX
Explanations
mentions of animals or related terms
references to animals and their treatment or care
New Auto-Interp
Negative Logits
Sutherland
-0.73
ELD
-0.68
Compton
-0.67
STRUCT
-0.66
itudinal
-0.66
heit
-0.65
nder
-0.65
sonian
-0.65
Ack
-0.64
nce
-0.63
POSITIVE LOGITS
animals
1.04
carc
1.03
mammals
0.89
slaughtered
0.89
aclysm
0.86
species
0.86
Animals
0.85
domest
0.84
reptiles
0.81
folk
0.80
Activations Density 0.022%