INDEX
    Explanations

    phrases emphasizing correctness or completion

    affirmations or confirmations within statements

    New Auto-Interp
    Negative Logits
     Krug
    -0.78
    moil
    -0.76
    ngth
    -0.74
     Agric
    -0.63
    alks
    -0.62
    ras
    -0.62
    hester
    -0.60
    cius
    -0.60
    clock
    -0.59
    plates
    -0.59
    POSITIVE LOGITS
    SPONSORED
    0.81
    ECA
    0.69
     coincidence
    0.68
    natureconservancy
    0.68
    UGH
    0.65
    POS
    0.65
    PLIED
    0.64
    ï¸
    0.63
     scary
    0.63
    soType
    0.62
    Act Density 0.298%

    No Known Activations