INDEX
Explanations
occurrences of emotional expressions and interpersonal interactions
New Auto-Interp
Negative Logits
infer
-0.17
worthy
-0.16
iffer
-0.16
elles
-0.15
avel
-0.15
ovel
-0.15
andon
-0.14
ellers
-0.14
uls
-0.13
atty
-0.13
POSITIVE LOGITS
ç»ĵæŀľ
0.19
result
0.19
çµIJæŀľ
0.19
resulted
0.18
Result
0.17
ãĤ«ãĥ¼
0.17
results
0.17
Ergebn
0.16
resulting
0.16
Results
0.16
Activations Density 0.273%