INDEX
Explanations
words related to emotions or actions emphasizing a sense of intensity or importance
words related to different forms of the word "enemies" or expressions of opposition
New Auto-Interp
Negative Logits
yip
-0.77
apest
-0.75
é¾įå
-0.75
jri
-0.73
cules
-0.62
ancial
-0.60
incon
-0.60
appropriate
-0.59
hops
-0.58
swick
-0.57
POSITIVE LOGITS
ciating
1.08
achment
0.98
enment
0.89
ãĤ¨ãĥ«
0.75
emy
0.73
ĸļ
0.73
yll
0.67
iasm
0.67
emies
0.65
ached
0.64
Activations Density 0.065%