INDEX
Explanations
references to elephants and rhinos, particularly in contexts discussing their existence or conservation
New Auto-Interp
Negative Logits
Poultry
-0.40
Chickens
-0.40
Chicken
-0.37
ingles
-0.36
IVEREF
-0.35
Binary
-0.35
findpost
-0.34
PMailer
-0.33
Wheat
-0.33
Wheat
-0.33
POSITIVE LOGITS
elephant
1.16
elephants
1.13
Elephant
1.05
Elephant
1.03
Elephants
0.93
elephant
0.91
elef
0.90
elefante
0.86
giraffe
0.79
phants
0.77
Activations Density 0.426%