INDEX
Explanations
themes related to human development and well-being
New Auto-Interp
Negative Logits
heimer
-0.17
.xtext
-0.16
linky
-0.15
exus
-0.15
ê¶Į
-0.15
оÑıÑĤ
-0.14
овеÑĢ
-0.14
bject
-0.14
arge
-0.14
appa
-0.14
POSITIVE LOGITS
besides
0.74
Besides
0.56
addition
0.54
aside
0.54
alongside
0.53
além
0.53
Besides
0.51
oltre
0.51
además
0.49
apart
0.47
Activations Density 0.227%