INDEX
Explanations
references to age and beginnings in personal narratives
Negative Logits
rats            -0.16
istrovství      -0.15
.modified       -0.15
ChangeListener  -0.14
╗               -0.14
αλ              -0.14
IVEN            -0.14
ipay            -0.14
thora           -0.14
lek             -0.13
Positive Logits
young    0.54
young    0.41
very     0.40
Young    0.39
Young    0.36
younger  0.35
jeune    0.34
tender   0.33
early    0.31
jeunes   0.31
Activations Density 0.047%