    Explanations

    instances where someone is explaining or noting something to others

    instances where explanations and assertions are being made

    Negative Logits
    Token     Logit
    Ther      -0.65
    raq       -0.59
    held      -0.56
    peg       -0.56
    respect   -0.55
    icum      -0.54
    bish      -0.54
    backer    -0.53
    aic       -0.53
    Pont      -0.53
    Positive Logits
    Token       Logit
    ©¶æ         0.84
     "â̦        0.83
     "[         0.81
     "...       0.81
    -+-+        0.71
     '[         0.70
     "(         0.69
    =\"         0.67
     quoting    0.65
     although   0.64
    Activation Density: 0.257%

    No Known Activations