INDEX
Explanations
mentions of improvement or comparison for different scenarios, with a preference towards the concept of "better"
the word "better" and its variations
New Auto-Interp
Negative Logits
Pione
-0.73
ategory
-0.66
cha
-0.65
sup
-0.64
ette
-0.63
cano
-0.62
amine
-0.61
kaya
-0.61
umo
-0.60
ums
-0.59
POSITIVE LOGITS
than
1.31
than
1.09
suited
1.08
behaved
1.04
Than
1.03
ment
0.95
acquainted
0.94
equipped
0.86
ments
0.81
luck
0.80
Activations Density 0.067%