INDEX
    Explanations

    proper nouns, particularly names and titles

    New Auto-Interp
    Negative Logits
    rsp
    -0.17
    rage
    -0.17
    imens
    -0.15
    ieron
    -0.15
    uras
    -0.14
    ITTER
    -0.14
    anford
    -0.14
    craft
    -0.14
    ipar
    -0.14
    imos
    -0.14
    POSITIVE LOGITS
    ulu
    0.17
    ãĥ³ãĥ
    0.16
    ngx
    0.16
    ongo
    0.16
    ambia
    0.16
    nesty
    0.16
    osen
    0.16
    okane
    0.15
     Leban
    0.15
    ToSelector
    0.15
    Act Density 0.127%

    No Known Activations