INDEX
Explanations
phrases expressing strong emotional attachment or significance
New Auto-Interp
Negative Logits
hari
-0.76
âĶĤ
-0.60
Reaction
-0.59
background
-0.55
available
-0.55
DW
-0.54
æ©Ł
-0.54
CY
-0.53
Ze
-0.53
Rumble
-0.53
POSITIVE LOGITS
bye
0.78
ppa
0.76
goodbye
0.74
LOS
0.72
Goodbye
0.71
farewell
0.68
roads
0.67
ueller
0.61
ŃĶ
0.61
THANK
0.61
Activations Density 0.101%