INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    #+#
    -0.79
    tagHelperRunner
    -0.63
     Week
    -0.60
    Week
    -0.57
    новниш
    -0.54
    InputBorder
    -0.54
    Weeks
    -0.53
    InSection
    -0.53
     Signalez
    -0.52
     Woche
    -0.52
    POSITIVE LOGITS
     style
    0.59
     Majefty
    0.59
    出版年
    0.56
    eam
    0.56
    вање
    0.56
    dish
    0.56
     pleaſure
    0.56
     betweenstory
    0.55
     Esteban
    0.54
     Deo
    0.53
    Act Density 0.175%

    No Known Activations