Explanations
phrases indicating joint actions or situations
Negative Logits
تانيه    -1.01
ویکیپدیا    -0.79
ArgsConstructor    -0.72
ChildScrollView    -0.71
الدولى    -0.68
ocratic    -0.66
HomeAsUpEnabled    -0.66
Geplaatst    -0.65
ocracy    -0.65
ificato    -0.65
Positive Logits
and    0.64
or    0.53
betweenstory    0.48
/    0.48
writes    0.48
motor    0.47
GMENT    0.46
uwagę    0.46
,    0.43
metal    0.42
Activations Density: 0.697%