INDEX
Explanations
references to youth and youth-related topics
New Auto-Interp
Negative Logits
intra
-0.46
lans
-0.45
contraction
-0.44
incon
-0.43
ALERT
-0.42
sales
-0.42
feat
-0.41
EAR
-0.41
fare
-0.40
corner
-0.40
POSITIVE LOGITS
jeunesse
0.65
Youth
0.60
Youth
0.58
dafx
0.57
juventud
0.57
pieniądze
0.56
Jugend
0.56
Notwendigkeit
0.55
importanza
0.55
jóvenes
0.54
Activations Density 0.419%