INDEX
Explanations
elements related to presence and social interactions in various contexts
Negative Logits
abandonné       -0.40
amizade         -0.38
préf            -0.35
veulent         -0.34
préfé           -0.34
conclusão       -0.34
sprache         -0.34
Independencia   -0.34
faveur          -0.33
licencia        -0.33
Positive Logits
Wikimedijinoj   0.93
DockStyle       0.79
ویکیپدی   0.77
ब्रेकडाउन   0.75
ValueStyle      0.73
otomatig        0.72
hoeddwyd        0.69
LookAnd         0.68
RTEX            0.67
Билгалдахарш    0.66
Activations Density: 0.218%