INDEX
    Explanations

    adverbs of time/frequency

    Negative Logits
    Token               Logit
    " Faul"             -0.07
    ".details"          -0.07
    " contrary"         -0.07
    "chl"               -0.07
    "=result"           -0.07
    " such"             -0.07
    " Internacional"    -0.07
    "_weights"          -0.07
    "CH"                -0.07
    "amma"              -0.07
    Positive Logits
    Token               Logit
    "rám"               0.07
    " Nad"              0.06
    "++)↵"              0.06
    " graphics"         0.06
    " dann"             0.06
    "leaning"           0.06
    " cod"              0.06
    " halten"           0.05
    "elloworld"         0.05
    " Initi"            0.05
    Activation Density: 0.027%

    No Known Activations