INDEX
Explanations
mentions of childhood and related experiences
New Auto-Interp
Negative Logits
apor
-0.17
vatel
-0.16
ãĤ
-0.15
ÙĨÚ¯
-0.15
eteria
-0.15
Fus
-0.14
ooter
-0.14
ìĦľëĬĶ
-0.14
ancient
-0.14
asion
-0.14
POSITIVE LOGITS
spent
0.23
innocence
0.22
hood
0.21
memories
0.21
experiences
0.19
years
0.19
sweetheart
0.19
summers
0.18
trauma
0.17
memory
0.17
Activations Density 0.022%