INDEX
Explanations
descriptions of childhood behavior, particularly tantrums and emotional expressions
New Auto-Interp
Negative Logits
illez
-0.17
ãİ
-0.14
551
-0.14
_VARS
-0.14
buzz
-0.14
ingles
-0.13
éľŀ
-0.13
Ward
-0.13
dog
-0.13
年度
-0.13
POSITIVE LOGITS
tantr
0.23
pac
0.18
pac
0.17
nap
0.15
nap
0.15
crying
0.15
disobed
0.15
sul
0.15
angel
0.15
sé
0.14
Activations Density 0.052%