INDEX
    Explanations

    phrases related to things being well-defined or well-described

    phrases indicating something that is well-defined or well-regarded

    New Auto-Interp
    Negative Logits
     Midnight
    -0.71
     Cutter
    -0.70
     Phi
    -0.69
     Bravo
    -0.69
     Crisis
    -0.69
     Compass
    -0.67
     Theft
    -0.67
     Indigo
    -0.66
     Tags
    -0.64
     Hancock
    -0.64
    POSITIVE LOGITS
    known
    1.37
    established
    1.33
    defined
    1.30
    enough
    1.28
    trained
    1.27
    earned
    1.26
    documented
    1.25
    connected
    1.24
    respected
    1.23
    intention
    1.23
    Act Density 0.032%

    No Known Activations