INDEX
Explanations
words related to actions or items associated with a specific function or purpose
words related to actions or processes, especially those that denote physical movement or transfer
New Auto-Interp
Negative Logits
lin
-0.63
kale
-0.63
socks
-0.62
hiding
-0.62
stocks
-0.61
Elf
-0.60
sequ
-0.59
mel
-0.59
Luk
-0.57
recess
-0.57
POSITIVE LOGITS
ction
4.77
ctions
3.09
ctive
2.18
ctor
1.71
ct
1.58
ctory
1.47
ctors
1.23
cture
1.22
gment
1.16
tion
1.12
Activations Density 0.016%