Explanations
phrases related to famous or well-known figures and entities, particularly focusing on Hollywood celebrities
references to celebrities and social issues surrounding them
Negative Logits
Token       Logit
inexper     -0.57
yours       -0.56
ãĥ¯         -0.55
theirs      -0.54
Ń·          -0.53
inki        -0.53
ours        -0.53
).[         -0.51
latter      -0.50
ãħĭãħĭ      -0.49
Positive Logits
Token        Logit
devoted      0.78
relating     0.74
dedicated    0.73
regarding    0.72
debating     0.69
advocating   0.67
documenting  0.66
mourning     0.64
discussing   0.63
regulating   0.62
Activations Density: 1.246%