INDEX
Explanations
words related to the texture or form of materials, particularly those that are flour-based
New Auto-Interp
Negative Logits
irt
-0.18
BOUND
-0.17
empl
-0.17
bound
-0.16
ew
-0.16
eg
-0.15
irting
-0.15
IRT
-0.15
eworthy
-0.15
ÑĢÑĥб
-0.15
POSITIVE LOGITS
ours
0.24
uted
0.23
ax
0.20
ammable
0.20
anges
0.20
axes
0.19
asks
0.19
ange
0.19
utes
0.18
anged
0.18
Activations Density 0.008%