Explanations
herbivores and grazing animals
Negative Logits
মাছ  0.42
sharks  0.41
Animal  0.41
orpions  0.41
Dogs  0.40
shark  0.40
Dogs  0.40
Sharks  0.40
perros  0.38
कुत्तों  0.38
Positive Logits
herb  1.77
Herb  1.71
Herb  1.63
herbivores  1.58
herb  1.56
ung  1.12
graz  1.04
grazing  1.02
phyt  0.96
Ung  0.90
Activations Density: 0.023%