INDEX
    Explanations

    phrases emphasizing that something is more than what it appears to be

    themes of superficiality versus deeper meaning in various contexts

    New Auto-Interp
    Negative Logits
     unfocusedRange
    -0.70
    also
    -0.67
     unsus
    -0.64
    å§«
    -0.64
    ãģ¦
    -0.63
     ALSO
    -0.63
     concess
    -0.61
    éŃĶ
    -0.60
     Jury
    -0.60
    OTH
    -0.60
    POSITIVE LOGITS
     anymore
    0.90
     alone
    0.88
    ;
    0.76
     itself
    0.76
     superficial
    0.70
     decoration
    0.69
    ':
    0.69
     but
    0.68
    .;
    0.68
    .
    0.67
    Act Density 0.335%

    No Known Activations