INDEX
Explanations
phrases related to personal background and identity
New Auto-Interp
Negative Logits
elper
-0.15
oden
-0.14
MCU
-0.14
newfound
-0.14
ê
-0.14
ö
-0.14
.meta
-0.14
ingles
-0.14
Pregn
-0.14
830
-0.14
POSITIVE LOGITS
upbringing
0.33
Raised
0.28
raised
0.27
early
0.26
childhood
0.26
Raised
0.25
growing
0.24
raised
0.23
environment
0.21
early
0.21
Activations Density 0.380%