INDEX
    Explanations

    "Don't worry", "Luckily", "common feeling"

    New Auto-Interp
    Negative Logits
     on
    0.52
     cre
    0.48
     eat
    0.48
     everyone
    0.46
     vs
    0.46
     café
    0.46
     health
    0.45
     cena
    0.45
     dietitian
    0.45
     gold
    0.44
    POSITIVE LOGITS
    0.55
    0.54
    0.52
    Cutting
    0.52
     внутреннего
    0.51
    Manipulation
    0.50
    ర్‌
    0.50
    функциона
    0.50
     диамет
    0.49
    IVACON
    0.49
    Act Density 0.008%

    No Known Activations