INDEX
    Explanations

    nothing, as it has no significant activations

    Non-alphanumeric characters and the following word

    sequences following specific tokens

    New Auto-Interp
    Negative Logits
    TintMode
    -0.79
    -0.78
     שוליים
    -0.78
     houſe
    -0.77
    ArrowToggle
    -0.76
     '\\;'
    -0.76
     fubject
    -0.75
     Houſe
    -0.73
     Efq
    -0.72
     iſt
    -0.72
    POSITIVE LOGITS
    </blockquote>
    0.46
    optarg
    0.46
     cinque
    0.42
     náv
    0.42
    strftime
    0.41
    eni
    0.40
     le
    0.40
     grand
    0.40
     BufferedReader
    0.39
    ota
    0.39
    Act Density 0.015%

    No Known Activations