INDEX
    Explanations

    names of individuals or characters

    New Auto-Interp
    Head Attr Weights
    0:0.04
    1:0.05
    2:0.02
    3:0.03
    4:0.03
    5:0.37
    6:0.02
    7:0.01
    8:0.04
    9:0.12
    10:0.14
    11:0.05
    Negative Logits
    ��
    -1.77
     cens
    -1.59
     banning
    -1.53
     accredited
    -1.52
    CVE
    -1.48
     SPECIAL
    -1.47
     NYT
    -1.44
     inaccur
    -1.39
     forecasting
    -1.35
    aution
    -1.35
    POSITIVE LOGITS
    Kyle
    1.67
    lar
    1.65
    essa
    1.64
    [[
    1.64
     Samson
    1.64
     Ler
    1.60
    rider
    1.60
    sie
    1.59
    username
    1.56
    sylv
    1.55
    Act Density 0.169%

    No Known Activations