INDEX
Explanations
terms related to physical impact or damage, especially those associated with crushing
Negative Logits
rick     -0.16
869      -0.15
ste      -0.15
zial     -0.15
ting     -0.15
alloca   -0.14
osen     -0.14
jar      -0.14
Drum     -0.14
úb       -0.14
Positive Logits
crush    0.19
Crush    0.18
fold     0.15
Voyage   0.15
erd      0.15
iform    0.14
eness    0.14
inks     0.14
IFIED    0.14
ingly    0.14
Activations Density: 0.020%