Explanations
phrases related to personal backgrounds and life experiences
Negative Logits
Token          Logit
top            -0.56
v              -0.52
top            -0.46
WriteLiteral   -0.46
currently      -0.46
over           -0.45
re             -0.45
-              -0.44
ts             -0.44
量             -0.43
Positive Logits
Token          Logit
childhood      1.27
Childhood      1.15
upbringing     1.14
Childhood      1.07
enfance        1.01
enfance        1.00
boyhood        0.99
Мексичка       0.94
anzia          0.88
Roskov         0.87
Activations Density: 0.107%