INDEX
Explanations
possessive forms of nouns
Negative Logits
hart      -0.16
iven      -0.15
DW        -0.14
igner     -0.14
erties    -0.14
vess      -0.14
ž         -0.14
iking     -0.14
egra      -0.14
票        -0.13
Positive Logits
Law       0.18
Revenge   0.17
Corner    0.17
'         0.17
Landing   0.16
Choice    0.16
Place     0.16
aurus     0.16
orne      0.15
nyder     0.15
Activations Density: 0.072%