    Negative Logits
    èª              -0.92
    uous            -0.83
    colo            -0.78
    ifts            -0.74
    å£              -0.71
    uously          -0.71
    oise            -0.70
    IBLE            -0.69
    ought           -0.69
    ä¹ĭ             -0.68
    Positive Logits
     Mayhem         1.09
     Madness        0.95
     Matters        0.85
     Mania          0.83
     Improvement    0.80
     Definition     0.79
     Interest       0.79
     Disorders      0.78
     Massacre       0.76
    achus           0.76
    Activation Density: 0.076%

    No Known Activations