INDEX
Explanations
comparative phrases indicating amounts or levels that are below a certain threshold
"Less than" expressions
Negative Logits
Token        Logit
feroit       -0.54
klientów     -0.50
oneofs       -0.47
inneces      -0.47
avoient      -0.46
spół         -0.46
pelanggan    -0.46
ainfi        -0.45
navideña     -0.45
çift         -0.44
Positive Logits
Token        Logit
Below        0.72
Below        0.69
below        0.67
NSCoder      0.65
$<           0.64
BELOW        0.63
below        0.61
(<           0.58
LessThan     0.58
未満          0.57
Activations Density: 0.452%
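
As a rough illustration of how a density figure like the one above could be computed, here is a minimal sketch. It assumes that density means the percentage of sampled tokens on which the feature's activation is above zero; the page does not state its exact definition. The function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above `threshold`.

    Assumption: "density" is the share of tokens on which the feature
    fires at all. The dashboard's own definition may differ.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Made-up activations for ten tokens; two of them are non-zero.
sample = [0.0, 0.0, 1.3, 0.0, 0.0, 0.0, 0.0, 0.2, 0.0, 0.0]
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under that assumption, a density of 0.452% would correspond to the feature firing on roughly 4 to 5 of every 1,000 tokens sampled.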