    Explanations

    Mentions of the word "banana" and its variations.

    Negative Logits
    fjspx         -0.53
     giorgio      -0.48
     Савезне      -0.47
    }{*}{         -0.45
    ertale        -0.45
    *~            -0.45
    oult          -0.44
    /*            -0.44
    :]:           -0.43
    ')";          -0.43

    Positive Logits
     Banana       1.20
     banana       1.19
    Banana        1.14
     Bananas      1.03
    Bananas       1.02
    banana        1.00
     bananas      0.98
     banane       0.94
     banan        0.84
    🍌             0.73
    Act Density 0.002%

    No Known Activations