INDEX
    Explanations

    objective factual rational

    NEGATIVE LOGITS
    Token             Logit
    " enjoyable"      0.92
    " leisure"        0.91
    "柔らか"           0.90   (Japanese: "soft")
    " pleasurable"    0.90
    " romantic"       0.90
    " leisurely"      0.89
    " fanciful"       0.88
    " joyful"         0.87
    " carnival"       0.87
    " Spaß"           0.86   (German: "fun")
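    POSITIVE LOGITS
    Token             Logit
    "理性"             1.26   (Chinese/Japanese: "reason", "rationality")
    " scientific"     1.14
    " utilitarian"    1.14
    "scientific"      1.13
    " sterile"        1.10
    " coldly"         1.10
    "科学"             1.09   (Chinese/Japanese: "science")
    " рациона"        1.07   (Russian word fragment: "rationa-")
    " austere"        1.06
    "Scientific"      1.05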
    Activation Density: 1.979%

    No Known Activations