INDEX
    Explanations

    mentions of notable figures in entertainment and politics

    Tokens preceding names

    New Auto-Interp
    Negative Logits
    -0.75
     the
    -0.63
     a
    -0.60
    ,
    -0.59
     et
    -0.57
    .
    -0.54
     all
    -0.53
     it
    -0.52
     more
    -0.51
     '
    -0.51
    POSITIVE LOGITS
    0.99
     Anſ
    0.99
    abetes
    0.98
    0.98
    mybatisplus
    0.97
     Efq
    0.92
     Diſ
    0.92
     Reſ
    0.90
     Administrativna
    0.87
    ArgsConstructor
    0.87
    Act Density 0.228%

    No Known Activations