INDEX
Explanations
mentions of strong emotional responses and opinions about experiences or content, particularly favorable ones
Negative Logits
ongo        -0.17
inch        -0.16
sar         -0.15
chia        -0.15
ān          -0.14
adel        -0.14
ALLE        -0.14
arshal      -0.14
dice        -0.13
agy         -0.13
Positive Logits
wdx         0.16
originally  0.15
Inflater    0.15
icode       0.14
boo         0.14
childs      0.14
지난        0.14
овор        0.14
815         0.14
/icons      0.14
Activations Density: 3.437%