INDEX
    Explanations

    sections of text that contain no significant content or activations

    Tokens after dates or numbers

    New Auto-Interp
    Negative Logits
    ectoria
    -0.48
     võimal
    -0.47
    ništvo
    -0.47
     comuniques
    -0.45
     traer
    -0.45
    -0.44
     épis
    -0.44
     Urqu
    -0.43
    ayuno
    -0.43
     verrez
    -0.43
    POSITIVE LOGITS
     lenker
    0.76
     autorytatywna
    0.70
     EconPapers
    0.67
    MemoryWarning
    0.67
     محفوظة
    0.65
    typeorm
    0.65
     propOrder
    0.63
     nawr
    0.62
    Autoritní
    0.62
    期刊论文
    0.61
    Act Density 0.185%

    No Known Activations