INDEX
Explanations
positive adjectives followed by nouns
expressions emphasizing the word "such" in various contexts
New Auto-Interp
Negative Logits
ertodd
-0.82
kick
-0.69
ARM
-0.69
ntil
-0.68
TeX
-0.68
Murd
-0.67
ysc
-0.64
·
-0.62
Drum
-0.62
iliate
-0.62
POSITIVE LOGITS
ties
0.72
ities
0.70
abundantly
0.65
awful
0.65
consequential
0.64
thin
0.62
specificity
0.61
minded
0.60
vered
0.60
sums
0.60
Activations Density 0.047%