    Explanations

    occurrences of the term "on" in various contexts

    Negative Logits
    addir       -0.15
    ición       -0.15
    achten      -0.15
    defaults    -0.14
    cly         -0.14
     eben       -0.14
    alm         -0.14
    amik        -0.14
    alem        -0.14
    ahoo        -0.13
    Positive Logits
     disc       0.17
    ÑĢик        0.16
    ews         0.16
    ÄĻd         0.15
    ilogue      0.15
    /off        0.15
    ocs         0.15
    ICES        0.15
     Mueller    0.14
    .vertx      0.14
    Activation Density 0.076%

    No Known Activations