    Explanations

    mentions of specific numbers or quantities

    Negative Logits
    <bos>             -0.69
     AssemblyCompany  -0.58
    toHaveBeen        -0.53
    Ikr               -0.50
     tartalomajánló   -0.49
     kaynağından      -0.48
     eût              -0.48
     CascadeType      -0.48
     reú              -0.47
    fordable          -0.47
    Positive Logits
     lagar      0.52
    municipi    0.51
     Seconde    0.51
    település   0.50
     Ə          0.50
     whom       0.48
     Ibidem     0.47
     voglio     0.47
    SBATCH      0.47
     inverte    0.47
    Act Density 0.574%

    No Known Activations