INDEX
    Explanations

    mentions of rankings or positions, particularly those starting with "Top" followed by a number

    references to ranking lists, specifically "Top X" lists related to various categories

    New Auto-Interp
    Negative Logits
    ufact
    -0.81
     Gaul
    -0.66
     fir
    -0.66
    issance
    -0.66
    oresc
    -0.65
    BILITY
    -0.64
     Advent
    -0.63
    ¯¯
    -0.63
     heightened
    -0.63
     stead
    -0.62
    POSITIVE LOGITS
    eka
    1.12
    Top
    0.96
    ographical
    0.94
    ography
    0.92
    most
    0.85
    ology
    0.85
    Boss
    0.82
     Top
    0.82
    mast
    0.82
    ICS
    0.81
    Act Density 0.023%

    No Known Activations