INDEX
    Explanations

    frequent occurrences of specific articles and conjunctions used in context

    New Auto-Interp
    Negative Logits
    â̦
    -0.15
    عب
    -0.14
    âĢ
    -0.14
     tÃŃ
    -0.13
     during
    -0.13
    าà¸ģ
    -0.12
     brave
    -0.12
    U
    -0.12
    i
    -0.12
     stol
    -0.12
    POSITIVE LOGITS
    (éĩij
    0.16
    erif
    0.15
    vailability
    0.15
     ÑĦÑĥнда
    0.14
    CallCheck
    0.14
    ehir
    0.14
    ÅĻÃŃm
    0.14
    Slinky
    0.14
    STALL
    0.14
    emachine
    0.14
    Act Density 0.018%

    No Known Activations