INDEX
    Explanations

    repeated occurrences of the same element or pattern

    New Auto-Interp
    Negative Logits
    ·¸
    -1.66
    Ļ
    -1.57
    ķ
    -1.55
    actors
    -1.52
     wonders
    -1.40
     mention
    -1.38
    eyed
    -1.38
     plans
    -1.37
     predictions
    -1.36
    ¢
    -1.36
    POSITIVE LOGITS
    +^
    1.62
    ↵ ↵ 
    1.57
    %.
    1.56
     eds
    1.53
    zyk
    1.52
    igraph
    1.51
    /~
    1.47
    ary
    1.46
    phab
    1.45
    ki
    1.45
    Act Density 2.888%

    No Known Activations