INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ilden
    -0.12
     ridic
    -0.09
    hib
    -0.09
    iode
    -0.09
    owie
    -0.09
    monkey
    -0.09
    alloca
    -0.08
    wan
    -0.08
     Ziel
    -0.08
     widgets
    -0.08
    POSITIVE LOGITS
     Sheldon
    0.11
     Japanese
    0.11
     British
    0.10
     stere
    0.10
    -wise
    0.10
     Anc
    0.10
     ancient
    0.10
     old
    0.09
     manner
    0.09
     Cyc
    0.09
    Act Density 0.266%

    No Known Activations