    Explanations

    phrases related to length or duration

    the word "longer" in various contexts

    Negative Logits
    "Stars"           -0.84
    "elf"             -0.81
    "IRO"             -0.77
    "Ro"              -0.73
    "Sov"             -0.72
    "ery"             -0.72
    "Always"          -0.71
    "mir"             -0.69
    " Coordinator"    -0.69
    "eers"            -0.68

    Positive Logits
    " than"            1.27
    "than"             1.04
    " Than"            1.00
    " paced"           0.89
    " periods"         0.87
    " longer"          0.85
    " distances"       0.85
    " stretches"       0.84
    " lasting"         0.84
    "neck"             0.77
    Activation Density: 0.022%

    No Known Activations