INDEX
Explanations
phrases related to economic establishments and transactions
Negative Logits
Token    Logit
‐        -0.39
‐        -0.32
"'       -0.31
''       -0.30
''       -0.27
‐‐       -0.27
--       -0.27
,''      -0.26
.''      -0.26
'',      -0.26
Positive Logits
Token    Logit
«        1.27
«        1.12
(«       0.91
»        0.79
»        0.76
.»       0.69
»↵       0.68
»,       0.65
».       0.63
<<       0.59
Activations Density 0.067%