INDEX
Explanations
references to cutting or shredding actions and related terms
New Auto-Interp
Negative Logits
state
-0.45
benessere
-0.42
STATE
-0.42
Schaefer
-0.40
câte
-0.40
encontraban
-0.40
estado
-0.39
who
-0.39
synes
-0.39
răsp
-0.39
POSITIVE LOGITS
Cutting
1.22
Cutting
1.21
cutting
1.12
cutting
1.11
cuts
1.08
cut
1.08
CUT
1.05
cuts
1.02
Cuts
1.02
Cut
1.01
Activations Density 0.333%