INDEX
    Explanations

    the word "elephant" whenever it appears in the text

    New Auto-Interp
    Negative Logits
    ValueStyle
    -0.58
     actionMode
    -0.54
    サギ
    -0.51
    /*++
    -0.50
     Bellamy
    -0.50
    messageInfo
    -0.49
     وتسجيلات
    -0.49
    )_{\
    -0.48
    からです
    -0.48
    rxjs
    -0.48
    POSITIVE LOGITS
     elephant
    3.72
     elephants
    3.27
     Elephant
    3.03
    elephant
    2.78
     Elephants
    2.67
    Elephant
    2.63
     elef
    1.91
     elefante
    1.87
    phants
    1.57
    🐘
    1.09
    Act Density 0.000%

    No Known Activations