INDEX
    Explanations

    mentions of time or specific time-related indicators

    Negative Logits
    Token            Logit
    "structure"      -0.46
    "Ũ"              -0.42
    "material"       -0.41
    "Necessary"      -0.41
    " neces"         -0.39
    " necessary"     -0.39
    " nothwendig"    -0.39
    " Necessary"     -0.38
    " listy"         -0.36
    "necessary"      -0.36

    Positive Logits
    Token            Logit
    "pm"             2.28
    "PM"             1.72
    " pm"            1.51
    " PM"            1.41
    "Pm"             1.34
    " Pm"            1.11
    "pms"            0.93
    "ppm"            0.82
    "pM"             0.81
    "rpm"            0.79
    Activation Density: 0.017%

    No Known Activations