INDEX
    Explanations

    prominent people's names or references to well-known individuals

    New Auto-Interp
    Negative Logits
    ledon
    -0.16
    ylko
    -0.15
    .chomp
    -0.15
    ottes
    -0.14
    nds
    -0.14
    SWG
    -0.14
    gger
    -0.14
    æľĭ
    -0.14
    inski
    -0.14
    ynes
    -0.14
    POSITIVE LOGITS
     himself
    0.15
     æĥ
    0.14
     sper
    0.14
    RICT
    0.14
     fug
    0.14
    _bio
    0.13
     Hague
    0.13
     bi
    0.13
    panic
    0.13
     caste
    0.13
    Act Density 0.042%

    No Known Activations