INDEX
Explanations
references to children and their experiences
New Auto-Interp
Negative Logits
студент
-0.61
mahasiswa
-0.60
大学生
-0.54
obiety
-0.53
craper
-0.50
студен
-0.49
***!
-0.49
studenten
-0.49
Mahasiswa
-0.47
namor
-0.47
POSITIVE LOGITS
children
1.55
childrens
1.48
Children
1.43
Children
1.37
children
1.36
kids
1.35
childrens
1.32
child
1.31
Kids
1.30
Kids
1.30
Activations Density 0.470%