    Explanations

    occurrences of the word "am"

    Negative Logits
    Token        Logit
    "readcr"     -0.16
    "velle"      -0.16
    "seite"      -0.15
    " Woche"     -0.15
    "ová"        -0.14
    "rena"       -0.14
    " flock"     -0.14
    "ческая"     -0.14
    "liste"      -0.14
    "ugar"       -0.14
    Positive Logits
    Token        Logit
    " Ende"      0.18
    "ti"         0.17
    "tier"       0.16
    "æľ"         0.16
    "ts"         0.15
    "ří"         0.15
    " mismo"     0.15
    " Tag"       0.14
    "éli"        0.14
    " Begin"     0.14
    Activation Density: 0.005%

    No Known Activations