    Negative Logits
    Token                Logit
    "fjspx"              -0.59
    " Signalez"          -0.47
    "Portail"            -0.43
    "bootstrapcdn"       -0.42
    "OGND"               -0.41
    "tagHelperRunner"    -0.40
    " FetchType"         -0.40
    "GEBURTSDATUM"       -0.39
    "Gön"                -0.39
    "Chham"              -0.39
    Positive Logits
    Token                Logit
    " pancakes"          2.11
    " pancake"           2.02
    " Pancake"           1.95
    " Pancakes"          1.87
    "Pancake"            1.69
    "🥞"                 0.90
    " waffles"           0.83
    " crê"               0.76
    " Waffle"            0.74
    " waffle"            0.67
    Activation Density: 0.001%

    No Known Activations