INDEX
    Explanations

    conditional phrases and questions

    New Auto-Interp
    Negative Logits
    abant
    -0.16
    opic
    -0.16
    ibar
    -0.15
     Clement
    -0.14
    ini
    -0.14
     zad
    -0.14
    asmus
    -0.14
    tail
    -0.14
    adel
    -0.14
     behalf
    -0.13
    POSITIVE LOGITS
    alama
    0.17
    δι
    0.17
    CEE
    0.15
    IMAL
    0.14
    æķĻ
    0.14
    ãĥIJãĤ¤
    0.14
    RTL
    0.14
    675
    0.13
    ona
    0.13
     каÑĪ
    0.13
    Act Density 0.066%

    No Known Activations