INDEX
Explanations
words related to bumping or impacting
New Auto-Interp
Negative Logits
undo
-0.15
park
-0.15
UTION
-0.15
RY
-0.15
onne
-0.15
ipped
-0.15
hn
-0.15
ر
-0.14
iser
-0.14
ë´ī
-0.14
POSITIVE LOGITS
sters
0.25
ster
0.22
kins
0.21
site
0.20
stead
0.20
ylon
0.19
urge
0.19
ers
0.18
shaft
0.18
ing
0.17
Activations Density 0.059%