INDEX
    Explanations

    verbs expressing action or judgement

    phrases related to criticism and accountability

    New Auto-Interp
    Negative Logits
    inger
    -0.72
    ilogy
    -0.68
    ciating
    -0.68
    talking
    -0.65
    Depending
    -0.64
    DragonMagazine
    -0.64
    Reporting
    -0.63
    gui
    -0.62
     thanking
    -0.60
    orget
    -0.58
    POSITIVE LOGITS
     insensitive
    0.77
    BuyableInstoreAndOnline
    0.74
     sake
    0.72
     improper
    0.70
    ãĤ¤ãĥĪ
    0.69
     insufficient
    0.68
     unsu
    0.65
     purposes
    0.65
     illegal
    0.64
     unconventional
    0.64
    Act Density 0.176%

    No Known Activations