INDEX
    Explanations

    expressions or phrases where something is described or paraphrased

    the word "it" used in various contexts

    New Auto-Interp
    Negative Logits
    hift
    -0.76
    ppa
    -0.71
    IELD
    -0.69
    adan
    -0.68
     Panama
    -0.63
     Flavoring
    -0.63
    icial
    -0.61
    424
    -0.60
    ept
    -0.59
     Java
    -0.59
    POSITIVE LOGITS
    self
    0.97
     bluntly
    0.80
    selves
    0.77
    chy
    0.73
    unes
    0.72
     succinct
    0.70
     mildly
    0.69
    anooga
    0.69
     sarcast
    0.68
     selves
    0.68
    Act Density 0.051%

    No Known Activations