INDEX
Explanations
words related to value or worth
concepts related to value and enjoyment
Negative Logits
Token         Logit
ept           -0.78
ebus          -0.69
Aval          -0.68
sup           -0.65
á             -0.64
yer           -0.63
ping          -0.62
chu           -0.62
opathy        -0.61
avis          -0.59
Positive Logits
Token         Logit
worthwhile    1.08
terday        1.03
etheless      0.85
endeavour     0.80
ItemTracker   0.80
allery        0.79
guiActiveUn   0.79
traged        0.78
enjoyable     0.77
newcom        0.76
Activations Density: 0.008%