INDEX
    Negative Logits
    sterdam     -0.93
    vere        -0.89
    ©¶æ         -0.89
    fect        -0.86
    ften        -0.86
    thodox      -0.85
    eln         -0.84
    secution    -0.82
    endment     -0.81
    mediate     -0.80
    Positive Logits
     qualities      0.89
     landmarks      0.88
     iconic         0.85
     likeness       0.84
     personalities  0.80
     haunt          0.79
     fixtures       0.79
     figure         0.79
     sounding       0.78
     mascot         0.78
    Activation Density: 0.088%

    No Known Activations