INDEX
    Explanations

    phrases related to qualities or characteristics of a particular entity or subject

    phrases that include the word "that," often in relation to specific qualities or characteristics of a subject

    New Auto-Interp
    Negative Logits
    "],
    -0.66
     Bott
    -0.64
     Explan
    -0.61
     Checking
    -0.61
     docking
    -0.59
    "],"
    -0.58
     Passive
    -0.57
     glasses
    -0.57
    .]
    -0.57
    engers
    -0.57
    POSITIVE LOGITS
     thri
    0.85
     spans
    0.82
    İĭ
    0.80
     celebrates
    0.80
     embraces
    0.79
     starved
    0.77
     lacks
    0.77
     sleeps
    0.75
    luaj
    0.75
    ĻĤ
    0.74
    Act Density 0.185%

    No Known Activations