INDEX
Explanations
emotive words associated with animal behavior and feelings
New Auto-Interp
Negative Logits
animal
-0.56
animaux
-0.53
animal
-0.52
animali
-0.52
Pets
-0.52
pets
-0.51
Animal
-0.49
Animal
-0.49
ANIMAL
-0.48
hayvan
-0.48
POSITIVE LOGITS
breed
0.60
bred
0.60
conformation
0.58
Bred
0.57
temperament
0.56
Bred
0.56
ⓘ
0.54
Breed
0.53
Working
0.51
working
0.51
Activations Density 0.121%