INDEX
Explanations
words related to possession or attribution
New Auto-Interp
Negative Logits
issance
-0.79
invariably
-0.73
utic
-0.72
edient
-0.72
qqa
-0.70
always
-0.70
essa
-0.70
usterity
-0.69
atu
-0.66
ani
-0.66
POSITIVE LOGITS
mentioning
0.80
mention
0.73
handedly
0.66
hasht
0.64
mentions
0.64
jokes
0.64
scratch
0.63
remotely
0.62
nick
0.61
curs
0.61
Activations Density 0.317%