Explanations
phrases related to specific individuals named "Andre"
mentions of a specific individual, particularly focused on DeAndre Jordan and related names
Negative Logits
Token        Logit
yip          -0.70
warranted    -0.62
verty        -0.60
merit        -0.59
abouts       -0.59
risks        -0.55
srfAttach    -0.55
lesbians     -0.55
warrant      -0.54
Wonders      -0.54
Positive Logits
Token        Logit
ciating      1.10
orable       0.86
ij士         0.84
escent       0.84
issance      0.80
rency        0.79
icio         0.78
enment       0.78
alion        0.77
liction      0.76
Activations Density: 0.114%