INDEX
Explanations
references to a specific brand or product name
references to the word "Pro" and its variations in various contexts
New Auto-Interp
Negative Logits
ãĤ¼ãĤ¦ãĤ¹
-0.79
halls
-0.77
gow
-0.72
Titanic
-0.70
Cornell
-0.70
straw
-0.69
ablishment
-0.69
needles
-0.68
spears
-0.67
owship
-0.66
POSITIVE LOGITS
digy
1.41
posal
1.20
secution
1.20
verbs
1.11
gression
1.10
poses
1.08
portion
1.07
blems
1.03
pose
1.02
secut
1.01
Activations Density 0.014%