INDEX
    Explanations

    mentions of the name "Thor."

    Negative Logits
    ropoda    -0.17
    zer       -0.16
    archy     -0.15
    Ø´ÛĮ      -0.15
    gor       -0.14
     è©ķ      -0.14
    ائÙħ      -0.14
    erman     -0.14
    ÑĤ        -0.14
    ivre      -0.14
    Positive Logits
    acic      0.32
    nton      0.32
    OUGH      0.24
    arin      0.21
    bj        0.20
    wald      0.20
    sten      0.19
     Thor     0.19
    azine     0.18
    stein     0.18
    Act Density 0.003%

    No Known Activations