INDEX
    Explanations

    mentions of the nervous system, drawing someone in, and ego or things people find likeable

    complex systems

    Negative Logits
    Token               Logit
    " متعلقه"           -0.92
    "########."         -0.89
    " مرئيه"            -0.85
    "Hochspringen"      -0.79
    " Theſe"            -0.78
    " Anſ"              -0.75
    " Untitled"         -0.74
    " OkHttpClient"     -0.72
    "énario"            -0.72
    "GEBURTSDATUM"      -0.72
    Positive Logits
    Token               Logit
    "<bos>"             0.80
    " I"                0.65
    " in"               0.59
    " and"              0.48
    "'"                 0.47
    " The"              0.47
                        0.47
    " ("                0.47
    " for"              0.47
    "bewah"             0.45
    Act Density 0.292%

    No Known Activations