INDEX
    Explanations

    instances of numerical values

    Negative Logits
    Token             Logit
    "########."       -0.77
    "]]"              -0.74
    "ГЛА"             -0.70
    " transfieras"    -0.69
    "تقاوى"           -0.68
    " Paglinawan"     -0.68
    " Blok"           -0.67
    " متعلقه"         -0.66
    " Dati"           -0.66
    "]>="             -0.66
    Positive Logits
    Token             Logit
    " २०"             0.80
    "AndEndTag"       0.79
    " ২০"             0.68
    "wanzig"          0.68
    " veinte"         0.67
    "0"               0.67
    " coscienza"      0.66
    "entieth"         0.65
    " Ath"            0.64
    " vzduchu"        0.64
    Activation Density: 0.208%

    No Known Activations