INDEX
    Explanations

    locations, specifically cities and venues

    proper nouns, particularly names of places and events

    New Auto-Interp
    Negative Logits
    Reviewer
    -0.62
     âĢİ
    -0.58
    âĢİ
    -0.54
    Footnote
    -0.49
     Democr
    -0.47
     ours
    -0.47
    âĸĵ
    -0.46
     caution
    -0.46
     cybersecurity
    -0.45
    .''
    -0.44
    POSITIVE LOGITS
     TBA
    0.56
    apest
    0.54
     srf
    0.54
     Variant
    0.50
    idth
    0.48
    lectic
    0.48
    apeshifter
    0.46
    igion
    0.46
    igma
    0.46
    gins
    0.46
    Act Density 1.182%

    No Known Activations