INDEX
Explanations
mentions of economic concepts and their impacts on society
NEGATIVE LOGITS
阅读次数 (read count)
-0.20
下载次数 (download count)
-0.16
监听页面 (monitor page)
-0.16
中文字幕 (Chinese subtitles)
-0.15
„N
-0.14
оби
-0.14
投稿日 (submission date)
-0.14
(![
-0.14
„J
-0.14
OrNil
-0.14
POSITIVE LOGITS
,
0.22
;↵
0.21
;
0.21
(
0.21
which
0.19
,↵
0.19
:↵
0.19
:
0.18
U+00A0 (non-breaking space)
0.17
Which
0.17
Activations Density 2.580%