INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Foot
    -0.07
     plugins
    -0.07
     удив
    -0.06
    アルバ
    -0.06
    (drop
    -0.06
     Appalach
    -0.06
    .guna
    -0.06
    jumlah
    -0.06
    คอม
    -0.06
    Duration
    -0.06
    POSITIVE LOGITS
     clear
    0.16
     clearly
    0.13
     clearer
    0.12
     Clear
    0.10
    clear
    0.09
    Clear
    0.09
     CLEAR
    0.09
    Clearly
    0.08
    -clear
    0.08
     clarity
    0.08
    Act Density 0.032%

    No Known Activations