INDEX
Explanations
actions related to the act of shaving
terms related to shaving and haircuts
New Auto-Interp
Negative Logits
alez
-0.81
encia
-0.76
Guilty
-0.68
Crom
-0.66
gom
-0.66
Miranda
-0.64
ologue
-0.62
Cass
-0.62
phis
-0.62
iege
-0.61
POSITIVE LOGITS
shaving
1.22
shave
1.20
shaved
1.05
utic
0.90
utical
0.83
utics
0.83
scars
0.82
blades
0.81
trimmed
0.80
rette
0.78
Activations Density 0.007%