INDEX
    Explanations

    phrases indicating proximity or closeness in terms of location or time

    New Auto-Interp
    Negative Logits
    ãĥ¥
    -0.18
    eted
    -0.16
    regunta
    -0.15
    IAL
    -0.15
    gan
    -0.15
    gregator
    -0.14
    entric
    -0.14
    ulong
    -0.14
    gnore
    -0.14
    adients
    -0.14
    POSITIVE LOGITS
     misses
    0.27
    ctic
    0.27
    ness
    0.27
    abouts
    0.25
     miss
    0.25
    -term
    0.25
    shore
    0.25
    ish
    0.24
    s
    0.24
    lier
    0.23
    Act Density 0.036%

    No Known Activations