INDEX
    Explanations

    foreign languages

    New Auto-Interp
    Negative Logits
     concent
    -0.08
    18
    -0.07
     free
    -0.06
     posterior
    -0.06
     contributions
    -0.06
    -0.06
     знаю
    -0.06
     CC
    -0.06
     coastal
    -0.06
    17
    -0.06
    POSITIVE LOGITS
    0.07
     first
    0.07
    .jsx
    0.07
    wpdb
    0.07
     помощи
    0.06
     druhou
    0.06
    (paths
    0.06
    /views
    0.06
     Puppet
    0.06
    ادی
    0.06
    Act Density 0.024%

    No Known Activations