INDEX
    Explanations

    references to the concept of "sense" in various contexts

    New Auto-Interp
    Negative Logits
    oust
    -0.07
    epar
    -0.07
    len
    -0.07
    lena
    -0.07
    rias
    -0.07
    hecy
    -0.07
    avis
    -0.06
    oster
    -0.06
    ILON
    -0.06
    .DataBindings
    -0.06
    POSITIVE LOGITS
     meaning
    0.11
     sense
    0.09
     meanings
    0.09
     meant
    0.08
    meaning
    0.08
    Mean
    0.08
     Meaning
    0.07
    Sense
    0.07
     mean
    0.07
    -mean
    0.07
    Act Density 0.007%

    No Known Activations