INDEX
Explanations
mentions and discussions about dogs and their behaviors
Negative Logits
الحره         -0.86
Rial          -0.85
themſelves    -0.81
Holloway      -0.79
Transparency  -0.77
myſelf        -0.76
Eſ            -0.75
Reiche        -0.74
Camb          -0.74
Dami          -0.74
Positive Logits
dogs          1.79
dog           1.65
Dog           1.65
Dogs          1.60
DOG           1.56
Dog           1.52
Dogs          1.46
DOGS          1.39
dog           1.36
DOG           1.35
Activations Density: 0.042%