INDEX
Explanations
numeric comparisons or math-related expressions
New Auto-Interp
Negative Logits
glers
-0.49
Zel
-0.47
وء
-0.46
onn
-0.45
respectively
-0.44
respectively
-0.42
ghijkl
-0.42
cu
-0.42
eding
-0.42
lei
-0.41
POSITIVE LOGITS
étrangère
0.82
sauvages
0.78
aveug
0.76
étrangères
0.73
étranger
0.71
fermés
0.70
BufferException
0.69
Lightboxes
0.69
rrggbb
0.68
africaine
0.68
Activations Density 0.013%