INDEX
Explanations
expressions related to change or transformation
New Auto-Interp
Negative Logits
ety
-0.71
tumblr
-0.68
anecd
-0.68
ital
-0.68
via
-0.66
PLUS
-0.65
ivals
-0.64
xual
-0.63
iga
-0.62
outine
-0.62
POSITIVE LOGITS
swung
0.91
swinging
0.83
tipping
0.81
inning
0.78
hatt
0.76
peeled
0.75
bowed
0.75
ayers
0.73
tightening
0.73
urned
0.73
Activations Density 0.227%