INDEX
Explanations
mentions of the platform "Youtube"
references to YouTube and phrases indicating group settings or numeric thresholds
New Auto-Interp
Negative Logits
ity
-0.75
ly
-0.73
iosity
-0.66
iants
-0.66
iens
-0.65
gency
-0.64
lers
-0.64
tain
-0.64
phrine
-0.63
gling
-0.63
POSITIVE LOGITS
chnology
1.03
sembly
0.88
yip
0.81
obser
0.77
izoph
0.76
selves
0.75
ocre
0.75
apons
0.75
essage
0.74
merce
0.70
Activations Density 0.028%