INDEX
    Explanations

    proper nouns, particularly names and places

    New Auto-Interp
    Negative Logits
     Wilk
    -0.88
    triangleq
    -0.82
    ViewFeatures
    -0.80
     Nim
    -0.77
    sprogramm
    -0.75
     coel
    -0.72
    BindingSource
    -0.71
    ühungen
    -0.68
     corporation
    -0.68
     Lopez
    -0.68
    POSITIVE LOGITS
     Somerville
    0.92
     Glou
    0.81
     Balzac
    0.79
    Kiki
    0.77
    PVC
    0.77
     NOU
    0.77
     Kiki
    0.77
     Fors
    0.77
     Stans
    0.75
     Jä
    0.75
    Act Density 2.104%

    No Known Activations