INDEX
Explanations
phrases indicating significant changes or adaptations in behavior during crises
New Auto-Interp
Negative Logits
ÃĸL
-0.15
¹
-0.15
712
-0.14
agra
-0.14
romium
-0.14
vari
-0.14
mens
-0.13
οÏĤ
-0.13
cot
-0.13
大ä¼ļ
-0.13
POSITIVE LOGITS
itmap
0.16
ODY
0.15
Unnamed
0.15
ifa
0.14
#ad
0.14
å±Ĭ
0.14
ategorical
0.14
oping
0.14
irl
0.14
ç·Ĵ
0.14
Activations Density 0.420%