INDEX
Explanations
words related to celebrities or famous personalities
occurrences of the word "star" in various contexts
New Auto-Interp
Negative Logits
ufact
-0.79
rals
-0.78
PLA
-0.76
ilated
-0.76
Downloadha
-0.75
onom
-0.72
Legislation
-0.71
Measures
-0.71
-0.68
onomy
-0.67
POSITIVE LOGITS
star
3.62
stars
3.01
star
2.40
stars
2.10
superstar
1.91
Stars
1.87
Star
1.74
Star
1.74
starred
1.72
Stars
1.71
Activations Density 0.014%