INDEX
    Explanations

    phrases related to comparison or different aspects of something

    expressions of multiple interpretations or perspectives on a topic

    New Auto-Interp
    Negative Logits
    bart
    -0.66
    tailed
    -0.63
    andel
    -0.62
    alsa
    -0.61
    ctors
    -0.61
    perty
    -0.61
    prus
    -0.60
    lish
    -0.60
    multiple
    -0.59
    mins
    -0.59
    POSITIVE LOGITS
     resembles
    0.75
     analogous
    0.72
     resembling
    0.71
    ,
    0.68
     reminiscent
    0.68
     resemble
    0.67
     mirror
    0.64
     embodies
    0.64
     it
    0.63
     this
    0.61
    Act Density 0.088%

    No Known Activations