INDEX
Explanations
phrases related to communication and access to resources
Negative Logits
للمعارف          -0.54
ypress           -0.49
InternalFrame    -0.47
nombreux         -0.46
Shear            -0.44
estad            -0.44
Dominus          -0.42
nombreuses       -0.42
>=",             -0.42
큼               -0.41
Positive Logits
only             1.42
only             1.31
jedynie          1.23
seulement        1.21
Only             1.17
Only             1.15
lediglich        1.12
而已             1.11
лишь             1.07
ONLY             1.04
Activations Density: 0.603%