INDEX
Explanations
adjectives related to usefulness
instances of the word "useful."
New Auto-Interp
Negative Logits
Fever
-0.73
olina
-0.71
BU
-0.65
opers
-0.64
buck
-0.62
Bom
-0.62
otide
-0.61
uph
-0.60
eree
-0.60
bow
-0.59
POSITIVE LOGITS
idiots
0.88
tips
0.86
guiActiveUn
0.79
glers
0.79
fully
0.79
insights
0.78
tools
0.77
tip
0.76
useful
0.74
NESS
0.71
Activations Density 0.027%