INDEX
Explanations
themes related to children and their treatment in society
New Auto-Interp
Negative Logits
oÄŁ
-0.17
endi
-0.14
妹
-0.14
macı
-0.14
athe
-0.14
öt
-0.14
оÑĤÑĢеб
-0.14
اÙĪÙĨد
-0.14
acades
-0.14
askell
-0.14
POSITIVE LOGITS
adult
0.78
adults
0.77
grown
0.75
adult
0.68
Adults
0.67
Adult
0.65
Adult
0.62
adulthood
0.59
adultos
0.58
grown
0.56
Activations Density 0.159%