INDEX
    Explanations

    advice on various topics such as diet, exercise, and file sharing

    Negative Logits
     Warfare        -0.76
    "]=>            -0.68
    Scale           -0.64
     Dictionary     -0.64
     Scrib          -0.64
     Bhar           -0.63
    rule            -0.63
    Dub             -0.62
    Mario           -0.61
    WARD            -0.61

    Positive Logits
     unable         0.95
     enrolled       0.92
     unsure         0.91
     wished         0.86
    having          0.82
     wish           0.82
     somehow        0.81
     experiencing   0.80
     possessed      0.78
     possesses      0.77
    Activation Density: 0.408%

    No Known Activations