INDEX
    Explanations

    instances of the word "multiple"

    occurrences of the word "multiple."

    Negative Logits
    Token       Logit
    "spring"    -0.73
    "roit"      -0.70
    "NER"       -0.70
    "Prince"    -0.69
    "hers"      -0.68
    "OST"       -0.66
    "potion"    -0.66
    "kamp"      -0.66
    "IER"       -0.66
    "ampunk"    -0.66
    Positive Logits
    Token             Logit
    " sclerosis"      1.38
    "xes"             1.34
    " iterations"     1.12
    " simultaneous"   1.03
    " generations"    0.97
    "iating"          0.94
    " overlapping"    0.93
    " instances"      0.92
    " layers"         0.89
    " digits"         0.89
    Activation Density: 0.030%

    No Known Activations