INDEX
Explanations
words associated with scientific studies and data
New Auto-Interp
Negative Logits
coû
-0.65
traditionnels
-0.63
bä
-0.60
tourner
-0.59
actuels
-0.59
supérieurs
-0.59
rapides
-0.58
évidemment
-0.58
much
-0.57
couverts
-0.57
POSITIVE LOGITS
itſelf
0.93
Majefty
0.93
raiſ
0.91
'\\;'
0.88
myſelf
0.88
Efq
0.85
noft
0.85
crdi
0.84
scania
0.82
fhew
0.81
Activations Density 1.076%