INDEX
    Explanations

    occurrences of the word "gate."

    Negative Logits
    Token     Logit
    hovah     -0.47
    ľ         -0.44
    inary     -0.43
    uania     -0.42
    urgy      -0.41
    acebook   -0.40
    anian     -0.40
    ogun      -0.40
    uitive    -0.39
     Adin     -0.39
    Positive Logits
    Token     Logit
    way       0.56
    boro      0.49
    gate      0.48
    agher     0.47
     Papers   0.46
    eln       0.46
    forth     0.45
    leaf      0.45
    plot      0.44
    WAY       0.43
    Activation Density: 11.276%

    No Known Activations