INDEX
Explanations
phrases comparing or diminishing one thing to another
phrases that diminish the significance of something by suggesting it is "nothing more than" a trivial or lesser version of itself
Negative Logits
ahime    -0.83
ode      -0.80
anta     -0.77
enser    -0.70
NCT      -0.68
20439    -0.65
oris     -0.65
arts     -0.65
oren     -0.63
rones    -0.62
Positive Logits
mediocre       0.70
rudimentary    0.69
filler         0.67
anke           0.66
cosmetic       0.65
subsistence    0.64
superficial    0.62
pure           0.61
hors           0.60
a              0.59
Activations Density 0.059%
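
The density figure above can be read as the share of tokens on which the feature fires at all. The following is a minimal sketch under that assumption; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the dashboard, whose exact definition of density is not stated here.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens on which the feature
    is active; the dashboard may define it differently.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 1 token out of 10,000.
sample = [0.0] * 9999 + [1.7]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.059% would correspond to the feature being active on roughly 6 tokens out of every 10,000.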