Explanations
mentions of the word "sparrow."
words related to species, particularly about extinction risks and animal references
Negative Logits
Univers    -0.72
phys       -0.72
Die        -0.69
glor       -0.68
Spo        -0.68
beating    -0.64
ju         -0.64
Dat        -0.61
instit     -0.61
Vin        -0.61
Positive Logits
arrow      4.72
arrow      1.43
ARR        1.30
arrows     1.29
Arrow      1.27
Arrows     1.15
aird       1.14
angles     1.14
ollow      1.14
angled     1.09
Activations Density 0.008%