INDEX
    Explanations

    phrases indicating there is more to a situation than what is initially visible or known

    phrases that indicate a minor aspect of a larger issue

    NEGATIVE LOGITS
    iors      -0.76
    Merit     -0.70
    forts     -0.69
    ials      -0.68
    owship    -0.66
    スト      -0.66
    BAT       -0.65
    except    -0.65
    FACE      -0.65
    fy        -0.64
    POSITIVE LOGITS
     iceberg   1.25
     scales    0.82
     scale     0.72
     finger    0.70
     toes      0.70
     rope      0.69
     Scale     0.67
     needles   0.62
    gall       0.62
     toe       0.61
    Activation Density: 0.092%

    No Known Activations