INDEX
Explanations
quote marks and formatting-related elements in the text
Preceding tokens of quotations or formatting characters
Negative Logits
indépendante      -0.56
stället           -0.55
particulières     -0.54
vectorielle       -0.53
vectorielles      -0.53
resourceCulture   -0.52
médicaux          -0.52
nyttet            -0.52
enfans            -0.52
européennes       -0.51
Positive Logits
Continue   0.28
ities      0.28
pate       0.27
bes        0.27
rän        0.27
ism        0.26
waite      0.26
ting       0.26
ses        0.25
nery       0.25
Activations Density: 0.998%