Explanations
phrases emphasizing ownership or relationships, particularly indicating possession or familial connections
Negative Logits
fern      -0.17
rava      -0.14
yg        -0.14
-php      -0.14
orted     -0.14
oling     -0.13
ãģŁãģĦ    -0.13
quare     -0.13
ÑģÑĤи     -0.13
äge       -0.13
Positive Logits
ways      0.26
Ways      0.19
how       0.18
象        0.16
å©·       0.15
526       0.14
dem       0.14
ptic      0.14
Giles     0.14
konkrét   0.14
Activations Density 0.055%