INDEX
    Explanations

    words related to high-ranking positions or entities

    references to levels of intensity or importance

    New Auto-Interp
    Negative Logits
     Lucia
    -0.80
     Andromeda
    -0.77
    Frames
    -0.77
    OPLE
    -0.76
     Predators
    -0.72
    Meg
    -0.69
    Woman
    -0.67
     Stars
    -0.67
    ELY
    -0.67
    iframe
    -0.66
    POSITIVE LOGITS
    level
    0.91
     levels
    0.83
     level
    0.82
    acies
    0.71
     diagrams
    0.69
     guidance
    0.68
    ahead
    0.67
    rise
    0.67
     Level
    0.65
    tenance
    0.64
    Act Density 0.015%

    No Known Activations