INDEX
Explanations
words related to giving up or relinquishing ownership
past tense and participial forms of verbs
Negative Logits
iosity    -0.64
alys      -0.63
ritical   -0.63
ses       -0.63
acio      -0.60
itect     -0.60
itri      -0.59
owitz     -0.59
ivari     -0.59
dinand    -0.59
Positive Logits
ModLoader  0.65
oute       0.63
pty        0.61
aukee      0.61
owship     0.60
dit        0.59
nces       0.57
hazard     0.57
vows       0.56
Yose       0.56
Activations Density: 0.185%