INDEX
    Explanations

    references to time and occurrences of events

    New Auto-Interp
    Negative Logits
    ish
    -0.18
    uma
    -0.15
     latter
    -0.15
    ing
    -0.14
    uala
    -0.14
    ump
    -0.14
    otal
    -0.14
    bjerg
    -0.13
    ened
    -0.13
    ublic
    -0.13
    POSITIVE LOGITS
     round
    0.28
     around
    0.26
    Around
    0.26
    round
    0.26
    -round
    0.23
     Around
    0.23
    -ci
    0.22
    around
    0.22
    -around
    0.21
    Round
    0.21
    Act Density 0.030%

    No Known Activations