INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Theſe
    -1.08
     myſelf
    -1.07
     Monfieur
    -1.05
     ―――――
    -1.02
     purpoſe
    -1.00
     $_"
    -0.99
     Anſ
    -0.95
     itſelf
    -0.95
     auffi
    -0.94
    Geplaatst
    -0.94
    POSITIVE LOGITS
    ,
    0.99
    .
    0.97
    :
    0.93
    ;
    0.91
    0.88
    (
    0.82
    "
    0.81
    ?
    0.79
    /
    0.78
     (
    0.77
    Act Density 0.265%

    No Known Activations