INDEX
    Explanations

    phrases expressing thoughts, opinions, beliefs, or evaluations

    expressions of personal beliefs and thoughts

    New Auto-Interp
    Negative Logits
    ntil
    -0.61
    orio
    -0.59
    Merit
    -0.57
    endment
    -0.56
    imen
    -0.55
    Poké
    -0.54
     meanwhile
    -0.53
    eeper
    -0.53
    0000000
    -0.53
    uty
    -0.52
    POSITIVE LOGITS
     belongs
    1.26
     constitutes
    1.22
     deserves
    1.21
     could
    1.18
     exists
    1.16
     ought
    1.15
     will
    1.14
     represents
    1.13
     qualifies
    1.10
     would
    1.10
    Act Density 0.112%

    No Known Activations