INDEX
    Explanations

    The neuron detects expressions of appreciation or value (e.g. “appreciation for…”).

    New Auto-Interp
    Negative Logits
     towing
    -0.07
    _fixture
    -0.07
     cloud
    -0.07
     szy
    -0.07
     Dome
    -0.07
    EXIST
    -0.06
    หย
    -0.06
     Fall
    -0.06
     Cloud
    -0.06
    consistent
    -0.06
    POSITIVE LOGITS
     appreciate
    0.15
     appreciated
    0.12
     appreciation
    0.11
     apprec
    0.11
     admiration
    0.07
    0.07
     alır
    0.07
     depressed
    0.07
     употреб
    0.07
     이해
    0.07
    Act Density 0.012%

    No Known Activations