INDEX
Explanations
verbs or phrases related to things being split or divided
instances of the word "separated" and its variations
Negative Logits
vous      -0.71
×Ķ        -0.70
WN        -0.67
OD        -0.67
cycl      -0.65
TL        -0.65
nz        -0.65
Briggs    -0.63
Gos       -0.63
enos      -0.63
Positive Logits
separating    0.91
separ         0.84
separ         0.83
separated     0.83
sexes         0.81
separates     0.76
detach        0.74
ĨĴ            0.73
ively         0.73
icut          0.73
Activations Density 0.017%
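
The page does not define the density figure. A minimal sketch, assuming it is the fraction of sampled token positions on which the feature's activation is above zero; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen only to reproduce a value of the same size:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 17 of 100,000 token positions.
sample = [1.0] * 17 + [0.0] * 99_983
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.017%
```

Under this reading, a density of 0.017% means the feature is active on roughly 17 of every 100,000 tokens, which fits a feature tied to a narrow word family such as "separate".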