INDEX
    Explanations

    phrases indicating important or noteworthy information

    phrases indicating quantities or occurrences

    New Auto-Interp
    Negative Logits
    emale
    -0.74
    vid
    -0.73
    raid
    -0.72
    eton
    -0.71
    ride
    -0.71
    ossier
    -0.71
    vest
    -0.69
    é¾įåĸļ士
    -0.68
    imaru
    -0.67
    querade
    -0.66
    POSITIVE LOGITS
     interesting
    1.18
     serious
    1.12
     nifty
    1.12
     surprises
    1.09
     surprising
    1.06
     semblance
    1.06
     pretty
    1.04
     intriguing
    1.03
     awfully
    1.03
     incredible
    1.03
    Act Density 0.090%

    No Known Activations