    Explanations

    proper nouns

    Negative Logits
    Token               Logit
    "y"                 -0.87
    "n"                 -0.79
    "ه"                 -0.75
    "s"                 -0.69
    "ositol"            -0.68
    "i"                 -0.68
    "yntaxException"    -0.67
    " mellitus"         -0.65
    "k"                 -0.65
    "ی"                 -0.65

    Positive Logits
    Token               Logit
    " Sto"              0.57
    "busch"             0.55
    "veck"              0.54
    "logfile"           0.52
    " käyt"             0.52
    " Way"              0.51
    "undred"            0.51
    " Man"              0.50
    " Mc"               0.49
    " Ground"           0.49

    Act Density 0.224%

    No Known Activations