INDEX
    Explanations

    similes describing forceful or rough actions

    similes and comparisons involving the word "like."

    New Auto-Interp
    Negative Logits
    inion
    -0.82
    iets
    -0.81
    ulty
    -0.79
    hiba
    -0.77
    ilic
    -0.77
    elin
    -0.74
    ennes
    -0.71
    arcity
    -0.70
    inas
    -0.69
    ysical
    -0.69
    POSITIVE LOGITS
    lihood
    1.35
    liest
    1.04
    lier
    1.01
     clock
    0.86
     crazy
    0.85
     ours
    0.81
    liness
    0.80
     wildfire
    0.79
     minded
    0.75
    minded
    0.74
    Act Density 0.071%

    No Known Activations