INDEX
Explanations
references to specific names, likely focusing on surnames
proper nouns related to characters or entities in a specific context
Negative Logits
cele          -0.83
dent          -0.78
nai           -0.74
hospitalized  -0.72
amy           -0.69
herpes        -0.69
paramedics    -0.68
phot          -0.68
pus           -0.67
cath          -0.66
Positive Logits
Frog      3.10
Horn      2.05
Blossom   1.45
Crow      1.45
Cotton    1.31
Crusader  1.18
Fog       1.18
Worm      1.17
Rabbit    1.17
Egg       1.14
Activations Density 0.024%