INDEX
    Explanations

    proper names and specific nouns

    New Auto-Interp
    Negative Logits
    neider
    -0.13
    âĢĮپدÛĮاÛĮ
    -0.13
    antino
    -0.13
    .nlm
    -0.13
    untos
    -0.12
    ê
    -0.12
    omik
    -0.12
    ombine
    -0.12
    _ASYNC
    -0.12
    QE
    -0.12
    POSITIVE LOGITS
    -w
    0.56
    _w
    0.47
    -W
    0.45
    .w
    0.43
    	w
    0.39
    *w
    0.38
    _W
    0.38
    >w
    0.36
    +w
    0.35
    ,w
    0.34
    Act Density 0.440%

    No Known Activations