INDEX
    Explanations

    variations of the word "ang" in various contexts

    New Auto-Interp
    Negative Logits
    leck
    -0.18
    anova
    -0.18
    laps
    -0.18
    ingo
    -0.17
    ãĥ£
    -0.17
     Nová
    -0.16
    ingt
    -0.15
    zd
    -0.15
    cken
    -0.15
    edo
    -0.15
    POSITIVE LOGITS
    aroo
    0.26
    ladesh
    0.22
    ements
    0.21
    ulate
    0.20
    rove
    0.20
    els
    0.19
    lish
    0.19
    rowth
    0.19
    ue
    0.19
    eline
    0.19
    Act Density 0.029%

    No Known Activations