INDEX
    Explanations

    phrases related to politics and political figures

    references to notable individuals and their achievements or contributions

    New Auto-Interp
    Negative Logits
    .",
    -0.59
     Decay
    -0.58
    ravings
    -0.54
    resy
    -0.54
     <-
    -0.52
    izoph
    -0.52
    :,
    -0.51
     Prelude
    -0.50
    rex
    -0.50
    ubi
    -0.50
    POSITIVE LOGITS
    })
    0.75
    )}
    0.70
    )—
    0.68
    )|
    0.66
    )]
    0.60
    )
    0.59
    )</
    0.59
    *)
    0.59
    interstitial
    0.55
     gram
    0.55
    Act Density 2.456%

    No Known Activations