INDEX
    Explanations

    references to current entities, dates, and events

    Negative Logits
    kowski
    -0.18
     initially
    -0.18
     soon
    -0.17
     originally
    -0.17
    анні
    -0.16
    dorf
    -0.16
    原本
    -0.15
     раньше
    -0.15
     previously
    -0.15
     first
    -0.15
    Positive Logits
     STILL
    0.28
     now
    0.28
     still
    0.27
    still
    0.26
     Still
    0.25
    Still
    0.23
    now
    0.22
    _now
    0.22
     artık
    0.21
     hâlâ
    0.21
    Act Density 0.211%

    No Known Activations