INDEX
Explanations
names of celebrities, particularly Alec Baldwin and Conor McGregor
names of prominent individuals mentioned in the context of entertainment and media
New Auto-Interp
Negative Logits
ening
-0.77
trap
-0.71
bang
-0.66
pring
-0.66
lets
-0.64
liga
-0.64
grades
-0.64
yg
-0.64
ened
-0.63
children
-0.63
POSITIVE LOGITS
Conan
0.99
ovic
0.96
inition
0.79
Cumm
0.78
DIT
0.77
ventions
0.77
Guinness
0.77
thia
0.76
Fallon
0.74
Weir
0.72
Activations Density 0.015%