INDEX
Explanations
numerical values or counts within the text
New Auto-Interp
Negative Logits
Демографія
-0.62
omyia
-0.56
autorytatywna
-0.55
hability
-0.52
垣
-0.52
الحره
-0.52
preuves
-0.52
-0.52
ICOLON
-0.51
Crumb
-0.51
POSITIVE LOGITS
RectangleBorder
0.88
Gweler
0.70
AndEndTag
0.69
tagHelperRunner
0.68
LookAnd
0.65
légales
0.63
%</
0.62
argout
0.61
.",
0.61
%";
0.59
Activations Density 0.064%