INDEX
    Explanations

    blank, null, default values

    New Auto-Interp
    Negative Logits
     ancora
    -0.81
     ainda
    -0.81
    wasi
    -0.80
    Pubs
    -0.76
    Jürgen
    -0.75
     모두
    -0.74
    ]=\
    -0.73
     and
    -0.73
    ganet
    -0.73
    //!
    
    -0.73
    POSITIVE LOGITS
    blank
    1.46
     blank
    1.40
     help
    1.11
     null
    1.10
    BLANK
    1.07
    default
    1.06
    null
    1.03
     helpt
    1.02
    Blank
    1.02
     default
    1.00
    Act Density 0.006%

    No Known Activations