INDEX
Explanations
phrases that involve negating the existence of certain concepts or items
phrases asserting the non-existence of certain concepts or entities
Negative Logits
Token        Logit
uay          -0.80
rik          -0.71
alde         -0.70
Flavoring    -0.70
ipedia       -0.69
olate        -0.69
igi          -0.69
venue        -0.68
osterone     -0.68
Murd         -0.67
Positive Logits
Token        Logit
thing        1.15
luck         0.99
thing        0.82
luxury       0.77
fate         0.69
fancy        0.67
designation  0.66
exact        0.65
pesky        0.64
ities        0.64
Activations Density 0.040%
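
As a rough illustration of what a density figure like 0.040% could mean, the sketch below assumes that activation density is the fraction of token positions at which the feature's activation is above zero. That definition, the function name `activation_density`, and the sample data are assumptions made for illustration and are not taken from this page.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Fraction of positions whose activation exceeds `threshold` (assumed definition)."""
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 4 of which activate the feature.
sample = [0.0] * 9996 + [1.2, 0.7, 3.1, 0.4]
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.040%"
```

Under this assumed definition, a density of 0.040% would correspond to the feature firing on roughly 4 of every 10,000 tokens.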