INDEX
    Explanations

    reported speech and dialogue phrases

    New Auto-Interp
    Negative Logits
    ething
    -0.08
    ustos
    -0.08
    SPATH
    -0.08
    با
    -0.08
    ležit
    -0.08
    Ħìŀ¬
    -0.07
    laden
    -0.07
    çīĩ
    -0.07
    ColumnsMode
    -0.07
    eft
    -0.07
    POSITIVE LOGITS
    :
    0.08
     '
    0.06
    eger
    0.06
     ->
    0.06
    nic
    0.06
     num
    0.06
    ->
    0.06
    ajo
    0.06
     >>
    0.06
    atten
    0.06
    Act Density 0.027%

    No Known Activations