INDEX
    Explanations

    instances where something is being limited, furthered, or only allowed to a certain extent

    words that indicate limitations or constraints

    Negative Logits
    )|        -0.64
    âĿ        -0.61
    },{"      -0.61
    },"       -0.60
    ilege     -0.60
    oir       -0.60
     Base     -0.59
    oko       -0.57
    onomy     -0.57
     Hold     -0.56
    Positive Logits
     preferring   1.15
     suggesting   1.05
     implying     0.97
     noting       0.97
     adding       0.93
     culminating  0.92
     spilling     0.90
     emphasizing  0.89
     prompting    0.89
     echoing      0.86
    Activation Density 0.421%

    No Known Activations