INDEX
Explanations
references to bull and cow species, particularly in the context of animal interactions and characteristics
New Auto-Interp
Negative Logits
Prakash
-0.81
Alvin
-0.70
Nazareth
-0.70
Hama
-0.67
Farley
-0.65
Weldon
-0.65
newArray
-0.64
Hic
-0.63
Aless
-0.63
Champlain
-0.61
POSITIVE LOGITS
cows
1.70
cow
1.64
Cow
1.63
Cow
1.63
Cows
1.60
Cattle
1.52
Bull
1.48
bulls
1.45
bull
1.42
Bulls
1.41
Activations Density 0.022%