INDEX
    Explanations

    relative references to time or sequence of events

    New Auto-Interp
    Negative Logits
     disambiguazione
    -0.62
     sauvages
    -0.62
    UserScript
    -0.62
     uniformity
    -0.60
     artificiales
    -0.59
    thâu
    -0.57
    inburgh
    -0.56
     regionales
    -0.56
     Playback
    -0.55
    Diweddarwch
    -0.55
    POSITIVE LOGITS
     kasarigan
    0.56
    Geplaatst
    0.47
    cotta
    0.46
    nery
    0.45
    nth
    0.45
    sda
    0.45
    SequentialGroup
    0.44
     nth
    0.44
     shifted
    0.43
     next
    0.42
    Act Density 0.159%

    No Known Activations