INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    expandindo
    -0.71
    IVEREF
    -0.56
     démocr
    -0.53
     dilaporkan
    -0.53
     bezeichneter
    -0.53
     cuillère
    -0.51
     StatefulWidget
    -0.50
     grossa
    -0.49
     feroit
    -0.49
    berdayakan
    -0.47
    POSITIVE LOGITS
     mess
    1.05
     nightmare
    0.92
     disaster
    0.90
     cess
    0.86
     monstros
    0.83
     abomination
    0.81
     hell
    0.81
     monster
    0.77
     catastrophe
    0.77
     mon
    0.75
    Act Density 0.006%

    No Known Activations