INDEX
Explanations
words related to valuable items
words related to value and valuation
New Auto-Interp
Negative Logits
merce
-0.84
gypt
-0.60
wom
-0.58
fronts
-0.58
tale
-0.58
Seventh
-0.57
selves
-0.56
Hera
-0.56
enegger
-0.54
rotated
-0.53
POSITIVE LOGITS
ãĤ£
0.77
oso
0.76
rations
0.74
anamo
0.71
athan
0.69
ão
0.68
idated
0.67
iant
0.67
otten
0.66
emale
0.65
Activations Density 0.077%