Explanations
references to artists, musicians, and entertainment personalities
references to individuals with artistic or celebrity status, particularly related to music and entertainment
Negative Logits
Token        Logit
levant       -0.64
uliffe       -0.64
causation    -0.64
ctory        -0.63
arantine     -0.61
rompt        -0.60
DATA         -0.60
equival      -0.59
ahon         -0.59
plet         -0.59
Positive Logits
Token        Logit
awaits       1.24
joins        1.21
extraord     1.08
greets       1.07
celebrates   1.06
has          1.03
boasts       1.00
enjoys       0.99
continues    0.99
knows        0.98
Activations Density: 0.445%