INDEX
    Explanations

    original, standard, or plain states

    New Auto-Interp
    Negative Logits
     невероят
    0.65
     deftly
    0.65
     extraordin
    0.64
     सुरक्षा
    0.64
     subconsciously
    0.64
    hernalia
    0.63
     maravilloso
    0.63
     безопасности
    0.63
     Extraordinary
    0.62
     সংস্কৃতি
    0.62
    POSITIVE LOGITS
     traditional
    1.74
    traditional
    1.48
     standalone
    1.48
     pure
    1.35
     standard
    1.35
     Traditional
    1.34
    Traditional
    1.33
    传统的
    1.32
     conventional
    1.31
     plain
    1.29
    Act Density 4.501%

    No Known Activations