INDEX
Explanations
phrases expressing the emotional significance or importance of something
expressions of significance or emotional weight associated with certain experiences or concepts
Negative Logits
hari          -0.71
âĶĤ           -0.64
Royale        -0.61
herd          -0.59
Reaction      -0.59
prism         -0.59
background    -0.57
æ©Ł           -0.56
atus          -0.55
kcal          -0.55
Positive Logits
bye           0.78
goodbye       0.71
farewell      0.69
sworth        0.69
Goodbye       0.67
enez          0.67
ŃĶ            0.67
roads         0.65
THANK         0.65
LOS           0.65
Activations Density 0.132%
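
Some token strings in the logit lists (for example âĶĤ and æ©Ł) look like the printable stand-ins that byte-level BPE tokenizers use for raw bytes. The source does not say which tokenizer produced them, so the following is only a sketch under the assumption that they follow the GPT-2 byte-to-unicode convention. The helper name `decode_token` is hypothetical and introduced here for illustration.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2 style table that maps each byte (0-255) to a printable character."""
    # Bytes that are already printable keep their own code point.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to code point 256 + n, in byte order.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: printable stand-in character -> original byte value.
_CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Map a displayed token back to text (ASSUMES GPT-2 byte-level encoding)."""
    raw = bytes(_CHAR_TO_BYTE[ch] for ch in token)
    # Tokens that are only part of a multi-byte character cannot be decoded alone,
    # so invalid sequences are replaced rather than raising an error.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["\u00e2\u0136\u0124", "\u00e6\u00a9\u0141", "bye"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, âĶĤ would decode to the box-drawing character │ and æ©Ł to the CJK character 機, while plain ASCII tokens such as bye are unchanged. A token that holds only part of a multi-byte character decodes to replacement characters and is best read alongside its neighbouring tokens.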