INDEX
    Explanations

    phrases indicating significant changes or adaptations in behavior during crises

    New Auto-Interp
    Negative Logits
    ÃĸL
    -0.15
    ¹
    -0.15
    712
    -0.14
    agra
    -0.14
    romium
    -0.14
    vari
    -0.14
    mens
    -0.13
    οÏĤ
    -0.13
    cot
    -0.13
    大ä¼ļ
    -0.13
    POSITIVE LOGITS
    itmap
    0.16
    ODY
    0.15
    Unnamed
    0.15
    ifa
    0.14
    #ad
    0.14
    å±Ĭ
    0.14
    ategorical
    0.14
    oping
    0.14
    irl
    0.14
    ç·Ĵ
    0.14
    Act Density 0.420%

    No Known Activations