INDEX
    Explanations

    phrases enclosed in quotation marks

    quotation marks and their contents

    Negative Logits
    terday      -0.74
     Nieto      -0.73
     Rica       -0.72
     upon       -0.68
     describ    -0.67
     jailed     -0.66
     relate     -0.64
     viewers    -0.63
     McGr       -0.63
     accompl    -0.62
    Positive Logits
    most        1.26
    official    1.25
    classic     1.24
    ultimate    1.16
    little      1.12
    original    1.09
    Ultimate    1.08
    best        1.08
    problem     1.06
    Golden      1.06
    Activation Density: 0.054%

    No Known Activations