INDEX
Explanations
instances of the word "break" and its variations
New Auto-Interp
Negative Logits
vale
-0.18
craft
-0.17
aura
-0.17
asca
-0.16
istics
-0.16
ror
-0.15
discrete
-0.15
Dispose
-0.15
UFFIX
-0.14
unu
-0.14
POSITIVE LOGITS
age
0.27
neck
0.23
away
0.21
fast
0.20
ages
0.20
heart
0.19
water
0.19
aldi
0.18
dance
0.17
-even
0.17
Activations Density 0.044%