INDEX
    Explanations

    words related to evaluation or judgment on a scale

    the phrase "what makes" in various contexts

    Negative Logits
    Token             Logit
    " scrimmage"      -0.67
    "ban"             -0.67
    " nurs"           -0.67
    "76561"           -0.65
    "thia"            -0.65
    "---------"       -0.62
    " conditioning"   -0.57
    " Witch"          -0.55
    " Souls"          -0.55
    "herry"           -0.55
    Positive Logits
    Token             Logit
    "hift"            1.13
    " sure"           1.02
    "berra"           0.81
    "paio"            0.77
    "auri"            0.76
    "elling"          0.76
    "ailable"         0.74
    " landfall"       0.74
    "paces"           0.73
    "ebin"            0.73
    Activation Density: 0.129%

    No Known Activations