INDEX
    Explanations

    instances of the word "super."

    New Auto-Interp
    Negative Logits
     utafitiHapana
    -0.51
    adaptiveStyles
    -0.45
    фициальный
    -0.44
    kloped
    -0.44
     '+':
    -0.44
    edback
    -0.44
    fizz
    -0.43
     änd
    -0.43
     mukana
    -0.43
    ","\
    -0.42
    POSITIVE LOGITS
    super
    2.17
     super
    1.68
    Super
    1.42
     SUPER
    1.40
    SUPER
    1.36
     Super
    1.35
    uper
    1.17
     супер
    1.15
    Супер
    1.08
    超级
    1.08
    Act Density 0.035%

    No Known Activations