INDEX
    Explanations

    phrases indicating first occurrences or historical milestones

    New Auto-Interp
    Negative Logits
    akan
    -0.19
    .ng
    -0.15
    assen
    -0.15
    latest
    -0.15
    alus
    -0.14
     latest
    -0.14
    ÑħÑĸд
    -0.14
    iž
    -0.14
    uchen
    -0.14
    μÎŃ
    -0.14
    POSITIVE LOGITS
     recorded
    0.31
    -record
    0.25
     Recorded
    0.24
    record
    0.22
     mention
    0.20
     documented
    0.20
     inkl
    0.20
    successful
    0.19
     successful
    0.19
    -known
    0.18
    Act Density 0.090%

    No Known Activations