INDEX
Explanations
phrases indicating restrictions or limitations
New Auto-Interp
Negative Logits
abetes
-0.70
}\]
-0.63
__*/
-0.61
ientôt
-0.61
aika
-0.59
axter
-0.59
υτό
-0.58
IUrlHelper
-0.58
daß
-0.58
انيف
-0.58
POSITIVE LOGITS
beschränkt
0.54
confined
0.53
disambiguazione
0.48
merely
0.47
limited
0.46
confines
0.46
*:
0.45
confined
0.44
แค่
0.44
あく
0.44
Activations Density 0.337%