INDEX
    Explanations

    numerical data and statistics in the text

    New Auto-Interp
    Negative Logits
    emis
    -0.81
    inyl
    -0.75
    hement
    -0.72
    schild
    -0.69
    dor
    -0.67
    wards
    -0.67
    ettings
    -0.66
    owship
    -0.65
    oneliness
    -0.65
    ngth
    -0.65
    POSITIVE LOGITS
    Thread
    0.70
     reservation
    0.66
    à¼
    0.65
     Pastebin
    0.64
    notations
    0.63
    Narr
    0.59
    Links
    0.58
    Spoiler
    0.58
    ãĥĥãĥĪ
    0.58
    animous
    0.57
    Act Density 0.011%

    No Known Activations