INDEX
Explanations
references to participation in activities or projects
Negative Logits
for       -0.34
для       -0.25
for       -0.24
untuk     -0.23
για       -0.23
for       -0.22
für       -0.22
pentru    -0.21
voor      -0.20
برای      -0.20
Positive Logits
purposes  0.90
sake      0.80
purpose   0.50
reasons   0.47
PURPOSE   0.41
purpose   0.39
pur       0.35
Purpose   0.35
reason    0.35
Purpose   0.34
Activations Density 0.643%