INDEX
    Explanations

    occurrences of the word "for" and variations of related terms

    New Auto-Interp
    Negative Logits
    omid
    -0.15
    setMax
    -0.15
    fram
    -0.14
    ifen
    -0.14
    eel
    -0.14
    ův
    -0.13
    anst
    -0.13
    erti
    -0.13
    fort
    -0.13
     rég
    -0.13
    POSITIVE LOGITS
    reater
    0.15
    á»įng
    0.15
    ÄĻż
    0.15
    führ
    0.14
    é¾Ħ
    0.14
    iaux
    0.14
    LOCITY
    0.14
    Lib
    0.14
    yh
    0.14
    utom
    0.13
    Act Density 0.297%

    No Known Activations