INDEX
    Explanations

    numerical values and symbols associated with ratings or categories

    New Auto-Interp
    Negative Logits
     myſelf
    -1.17
     itſelf
    -1.08
     Monfieur
    -1.06
     Theſe
    -1.02
     raiſ
    -1.00
     purpoſe
    -0.97
     Jefus
    -0.96
    ſelf
    -0.96
     whoſe
    -0.93
     ainfi
    -0.93
    POSITIVE LOGITS
    classnames
    0.88
    0.58
     autorytatywna
    0.55
    toggleClass
    0.52
    arbon
    0.51
     makeStyles
    0.51
    ונות
    0.48
    ↵↵
    0.47
    classNames
    0.46
     n
    0.45
    Act Density 0.417%

    No Known Activations