INDEX
    Explanations

    instances of the word "strong" and its variations

    New Auto-Interp
    Negative Logits
    enumi
    -0.78
    upaten
    -0.75
    Loot
    -0.72
    kaido
    -0.71
     []:
    -0.71
     Maus
    -0.70
    jima
    -0.69
    atimes
    -0.68
     SHOPPING
    -0.67
     jPanel
    -0.65
    POSITIVE LOGITS
    strong
    1.64
     Strong
    1.61
    Strong
    1.57
    STRONG
    1.56
    strength
    1.56
     STRONG
    1.50
     strong
    1.48
     strength
    1.46
     Strength
    1.42
    Strength
    1.39
    Act Density 0.103%

    No Known Activations