INDEX
    Explanations

    terms related to providing information or instruction

    New Auto-Interp
    Negative Logits
    iny
    -0.18
    INY
    -0.17
    oras
    -0.16
    uliar
    -0.16
    ps
    -0.15
    fall
    -0.15
    pod
    -0.15
    bard
    -0.14
    ery
    -0.14
    lein
    -0.14
    POSITIVE LOGITS
    atics
    0.30
    ally
    0.29
    ative
    0.25
    ercial
    0.21
     about
    0.20
    atica
    0.20
    ATIVE
    0.19
    ALLY
    0.19
    acje
    0.18
    idable
    0.18
    Act Density 0.021%

    No Known Activations