INDEX
Explanations
mention of age or generational comparison, especially focusing on the younger individuals
New Auto-Interp
Negative Logits
odor
-0.86
edIn
-0.84
orial
-0.79
vez
-0.77
utterstock
-0.75
eren
-0.75
square
-0.75
Gazette
-0.74
ayne
-0.74
alion
-0.73
POSITIVE LOGITS
than
1.29
generations
1.21
versions
1.19
sibling
0.99
Than
0.98
brother
0.90
sister
0.88
installments
0.87
siblings
0.86
puberty
0.86
Activations Density 0.473%