INDEX
    Explanations

    phrases related to different forms of lying

    instances of the word "lie" in various contexts

    New Auto-Interp
    Negative Logits
     connecting
    -0.65
     pip
    -0.63
     access
    -0.61
     Vital
    -0.60
     tabs
    -0.59
     rounded
    -0.59
     stats
    -0.59
     pool
    -0.59
     upgraded
    -0.59
     scheduled
    -0.59
    POSITIVE LOGITS
    lie
    5.00
    lies
    2.16
    Lie
    1.66
    lied
    1.46
    lia
    1.44
    lio
    1.43
    li
    1.42
     Lie
    1.39
    lying
    1.23
    leigh
    1.19
    Act Density 0.011%

    No Known Activations