INDEX
Explanations
references to prominent individuals and their careers in entertainment
New Auto-Interp
Negative Logits
expandindo
-0.75
AssemblyCompany
-0.70
estekak
-0.65
виправивши
-0.64
########.
-0.64
unmute
-0.60
AssemblyProduct
-0.58
مشين
-0.58
#+#
-0.57
Dissertation
-0.57
POSITIVE LOGITS
stars
1.44
star
1.39
celebrity
1.36
celebrities
1.36
actor
1.25
stars
1.17
actors
1.17
actress
1.15
celebs
1.12
celebrity
1.02
Activations Density 0.307%