INDEX
    Explanations

    phrases related to abstract concepts, such as 'the most', 'the biggest', 'the worst'

    repeated references to specific measurements or quantities

    New Auto-Interp
    Negative Logits
    Allows
    -0.88
    rade
    -0.66
    ogie
    -0.64
    Raid
    -0.63
    SPONSORED
    -0.63
    ENCE
    -0.63
    olo
    -0.60
    TD
    -0.60
     Want
    -0.60
    ARDIS
    -0.59
    POSITIVE LOGITS
     earliest
    1.04
    ses
    1.02
     heaviest
    0.96
     finest
    0.95
     toughest
    0.93
     hottest
    0.90
     hars
    0.88
     strongest
    0.87
     greatest
    0.86
     hardest
    0.85
    Act Density 0.170%

    No Known Activations