INDEX
Explanations
numeric data representing choices or preferences
Negative Logits
Económica
-1.03
pleaſure
-0.93
EconPapers
-0.93
navideña
-0.88
GenerationType
-0.88
defaultstate
-0.87
دانشنامهٔ
-0.86
otomatig
-0.85
fjspx
-0.82
ſtate
-0.82
Positive Logits
'
0.71
0.67
man
0.54
)
0.52
Ce
0.50
s
0.50
’
0.50
ce
0.48
\
0.48
</em>
0.47
Activations Density 0.182%