INDEX
Explanations
possessive forms or references to ownership
New Auto-Interp
Negative Logits
ieux
-0.16
Nichols
-0.16
icult
-0.15
boss
-0.14
aved
-0.14
unlock
-0.14
enburg
-0.14
emperor
-0.14
ights
-0.14
ishly
-0.14
POSITIVE LOGITS
Lair
0.21
bane
0.19
Tale
0.19
Den
0.17
Apprentice
0.17
Gate
0.17
Choice
0.17
Cove
0.17
sing
0.16
Gamb
0.16
Activations Density 0.034%