INDEX
Explanations
phrases related to societal issues and governance
New Auto-Interp
Negative Logits
éĤ£äºĽ
-0.15
DETAIL
-0.15
slightest
-0.14
POSIT
-0.14
ä¸Ĭ涨
-0.14
empo
-0.14
iator
-0.13
crease
-0.13
ibe
-0.13
inaire
-0.13
POSITIVE LOGITS
significant
0.29
considerable
0.26
pronounced
0.26
frequent
0.25
strong
0.25
near
0.24
marked
0.22
fairly
0.22
far
0.22
little
0.22
Activations Density 0.387%