INDEX
    Explanations

    references to specific brands or products, particularly in the context of consumer behavior or marketing strategies

    New Auto-Interp
    Negative Logits
     she
    -0.17
     они
    -0.16
     they
    -0.15
     ils
    -0.15
    åħ¶
    -0.15
    ike
    -0.15
    wan
    -0.15
    rix
    -0.14
    imits
    -0.13
    ecute
    -0.13
    POSITIVE LOGITS
     it
    0.45
     It
    0.33
    It
    0.33
    	it
    0.28
    _it
    0.28
    ,it
    0.27
    	It
    0.25
    (it
    0.23
    .It
    0.23
    -it
    0.22
    Act Density 0.338%

    No Known Activations