INDEX
    Explanations

    phrases that express temporal or contextual specificity

    New Auto-Interp
    Negative Logits
    ocker
    -0.16
    ritel
    -0.15
     near
    -0.14
     Niet
    -0.14
    nze
    -0.14
    zet
    -0.14
    械
    -0.14
    ifer
    -0.14
     yet
    -0.14
    cape
    -0.13
    POSITIVE LOGITS
    este
    0.19
    abbo
    0.17
    ilik
    0.16
    reatest
    0.14
    Äįi
    0.14
    :disable
    0.14
    iators
    0.14
     gezocht
    0.14
    rippling
    0.14
    .osgi
    0.14
    Act Density 0.023%

    No Known Activations