INDEX
    Explanations

    instances of dogs and their descriptions in various contexts

    New Auto-Interp
    Head Attr Weights
    0:0.30
    1:0.02
    2:0.19
    3:0.10
    4:0.03
    5:0.09
    6:0.05
    7:0.06
    8:0.05
    9:0.02
    10:0.03
    11:0.02
    Negative Logits
     Gutenberg
    -2.45
     graphene
    -2.39
     eru
    -2.38
     Volcano
    -2.37
    adeon
    -2.31
     Bosh
    -2.31
    ineries
    -2.30
    ioxide
    -2.30
    wikipedia
    -2.28
    device
    -2.26
    POSITIVE LOGITS
     puppy
    5.02
     puppies
    4.92
     barking
    4.89
     leash
    4.73
     dogs
    4.61
     canine
    4.36
     pets
    4.25
    Dog
    4.23
     paws
    4.07
     breed
    4.06
    Act Density 0.299%

    No Known Activations