INDEX
Explanations
references to childhood and upbringing experiences
New Auto-Interp
Negative Logits
Reſ
-0.81
ſta
-0.77
purpoſe
-0.77
perſon
-0.77
Inſ
-0.76
Diſ
-0.75
ſeveral
-0.75
ſame
-0.74
Theſe
-0.74
ſtand
-0.73
POSITIVE LOGITS
juventud
0.63
youth
0.62
infância
0.56
childhood
0.56
Cubit
0.55
الحره
0.54
ddelweddau
0.54
age
0.53
Jugend
0.53
youthful
0.52
Activations Density 0.150%