INDEX
    Explanations

    words related to physical attributes or characteristics

    negative phrases or contexts

    New Auto-Interp
    Negative Logits
     Expend
    -0.72
     materially
    -0.64
     opin
    -0.64
     Patreon
    -0.64
     Emmy
    -0.63
     RBI
    -0.63
     intel
    -0.62
     Instr
    -0.62
     Fiscal
    -0.61
    ité
    -0.61
    POSITIVE LOGITS
    based
    1.35
    shaped
    1.29
    like
    1.15
    type
    1.13
    sized
    1.13
    length
    1.12
    style
    1.11
    mounted
    1.08
    plate
    1.08
    clad
    1.07
    Act Density 0.102%

    No Known Activations