INDEX
Explanations
instances of the word "back" and its variations
New Auto-Interp
Negative Logits
ycz
-0.17
ãĥ¡ãĥ³ãĥĪ
-0.16
pekt
-0.16
ÑĸлÑĮÑĪ
-0.15
pector
-0.15
agli
-0.15
ilded
-0.14
Supplement
-0.14
ould
-0.14
ominator
-0.14
POSITIVE LOGITS
lash
0.28
ed
0.26
yard
0.26
story
0.26
ward
0.24
wards
0.23
ups
0.23
stage
0.23
bone
0.22
draft
0.22
Activations Density 0.012%