INDEX
Explanations
affirmative and negative responses
Negative Logits
'},             -0.58
>();            -0.57
DoubleQuotes    -0.56
>());           -0.55
"];             -0.55
)||             -0.53
'}}             -0.53
});             -0.52
'];             -0.52
'},             -0.51
Positive Logits
africains       0.77
secondaires     0.72
enfans          0.69
ſtate           0.68
extérieures     0.68
pédagogique     0.67
complètes       0.67
imprimée        0.67
énergé          0.66
houſe           0.66
Activations Density: 0.472%