INDEX
Explanations
punctuation marks and exclamation points in the text
New Auto-Interp
Negative Logits
DoubleQuotes
-1.09
expandindo
-1.02
."));
-0.96
}');
-0.95
)');
-0.92
)");
-0.91
///</
-0.91
>=",
-0.90
]');
-0.89
)}</
-0.88
POSITIVE LOGITS
“
0.69
rendre
0.60
iconque
0.60
=\"
0.59
='
0.58
nationaux
0.57
adag
0.56
plotlib
0.56
...
0.55
CONTR
0.55
Activations Density 0.135%