INDEX
    Explanations

    the word "liked" followed by various subjects or objects

    mentions of preferences or feelings towards something, primarily focusing on the word "liked."

    New Auto-Interp
    Negative Logits
     Terminal
    -0.63
     extingu
    -0.62
     stage
    -0.61
     recovery
    -0.60
     Elim
    -0.58
     eviction
    -0.57
     mounting
    -0.56
     Express
    -0.56
    olve
    -0.55
     sovereignty
    -0.54
    POSITIVE LOGITS
     liked
    3.37
     disliked
    2.11
     loved
    1.95
     enjoyed
    1.71
     hated
    1.71
     likes
    1.67
     liking
    1.64
     admired
    1.62
     appreciated
    1.44
     Likes
    1.36
    Act Density 0.012%

    No Known Activations