INDEX
Explanations
references to economic concepts and numerical values
New Auto-Interp
Negative Logits
Efq
-1.35
raiſ
-1.32
Jefus
-1.32
faſt
-1.32
ſche
-1.32
purpoſe
-1.31
itſelf
-1.31
myſelf
-1.27
greateſt
-1.24
houſe
-1.24
POSITIVE LOGITS
"+
1.12
'+
0.84
+
0.76
'+
0.76
("+0.69
(+
0.66
ve
0.60
>'+
0.58
na
0.56
from
0.56
Activations Density 0.288%