INDEX
Explanations
common phrases related to favoriting, rating, and sharing online content
mentions of favorites or preferred items
New Auto-Interp
Negative Logits
aea
-0.71
utical
-0.69
raud
-0.67
Frie
-0.66
aucas
-0.65
Centauri
-0.65
kaya
-0.65
lessness
-0.64
urg
-0.63
ufact
-0.63
POSITIVE LOGITS
favorite
1.56
favorite
1.17
Favorite
1.17
favourite
1.06
Favorite
0.96
favorites
0.95
¥µ
0.80
Favor
0.70
Reviewer
0.69
favored
0.68
Activations Density 0.016%