Explanations
phrases that express emotional sentiments and interpersonal interactions
Negative Logits
fart      -0.16
iline     -0.15
quals     -0.15
boa       -0.15
Äijâu     -0.14
exactly   -0.14
bits      -0.14
uku       -0.13
asto      -0.13
OUNDS     -0.13
Positive Logits
those     0.27
Those     0.24
Those     0.23
those     0.22
ya        0.21
那些      0.21
cha       0.18
dem       0.18
CHA       0.18
cha       0.17
Activations Density 0.458%