INDEX
Explanations
instances of negation and statements of non-affirmation
New Auto-Interp
Negative Logits
}{@-0.65
snippetHide
-0.56
مشين
-0.55
informée
-0.54
estacks
-0.54
<<<<<<<<<<<<<<
-0.52
gegangen
-0.52
AspNetCore
-0.52
الحره
-0.51
etcode
-0.51
POSITIVE LOGITS
EnglishChoose
0.60
ритори
0.58
JspWriter
0.50
Personensuche
0.50
trajets
0.49
infantiles
0.47
Réponses
0.47
\{\\0.47
rhestr
0.47
Appell
0.47
Activations Density 0.008%