INDEX
    Explanations

    phrases emphasizing a specific location or point in a discussion

    instances of the word "here," indicating a focus on emphasizing points or clarifying details within a context

    New Auto-Interp
    Negative Logits
     parap
    -0.66
    ews
    -0.64
     Gujar
    -0.60
     Mehran
    -0.59
     Doors
    -0.59
    visors
    -0.58
    ggle
    -0.58
     tongues
    -0.56
    uously
    -0.55
    amaz
    -0.54
    POSITIVE LOGITS
    abouts
    1.54
    tics
    1.54
    tical
    1.54
    tic
    1.20
    upon
    0.81
    with
    0.80
     guiActiveUn
    0.78
    after
    0.73
    from
    0.70
    here
    0.69
    Act Density 0.061%

    No Known Activations