INDEX
    Explanations

    references to individuals named "Oro" with varying levels of emphasis

    words related to powerful or influential figures

    New Auto-Interp
    Negative Logits
    ablishment
    -0.77
    ivities
    -0.74
     Corpus
    -0.68
     Extrem
    -0.65
    ivism
    -0.63
    Lost
    -0.61
    Temperature
    -0.60
    NER
    -0.59
    nings
    -0.58
     Sunshine
    -0.58
    POSITIVE LOGITS
    oro
    1.40
    zzi
    1.07
    vernment
    1.03
    annis
    0.89
    oros
    0.88
    eco
    0.87
    onto
    0.85
    omon
    0.84
    heim
    0.83
    oso
    0.81
    Act Density 0.009%

    No Known Activations