INDEX
Explanations
references to youth and children's topics
New Auto-Interp
Negative Logits
apesh
-0.16
kup
-0.15
issan
-0.15
isseur
-0.15
kola
-0.14
Od
-0.14
isas
-0.14
Od
-0.14
thora
-0.14
apture
-0.14
POSITIVE LOGITS
åŃIJä¾Ľ
0.18
orph
0.16
kids
0.16
(children
0.16
childhood
0.15
mediator
0.15
Boy
0.15
boy
0.15
children
0.15
Childhood
0.15
Activations Density 1.028%