INDEX
Explanations
verbs or phrases indicating a distinction or uniqueness
questions about differentiation or what makes something unique
New Auto-Interp
Negative Logits
abad
-0.85
bill
-0.79
lance
-0.77
lex
-0.75
lla
-0.72
ixel
-0.69
estern
-0.68
endix
-0.67
ammy
-0.66
jab
-0.66
POSITIVE LOGITS
motiv
0.80
pires
0.78
motivating
0.75
Unleashed
0.70
distinguishes
0.70
rament
0.69
motivate
0.68
GOODMAN
0.66
piring
0.66
SPONSORED
0.64
Activations Density 0.098%