INDEX
Explanations
phrases related to individual ownership and personal agency
New Auto-Interp
Negative Logits
Poor
-0.16
Nor
-0.15
Worm
-0.14
ovan
-0.14
missed
-0.14
ç§»åΰ
-0.13
POSITORY
-0.13
NOP
-0.13
idy
-0.13
Homes
-0.13
POSITIVE LOGITS
alone
0.22
alone
0.17
Alone
0.17
enis
0.16
olin
0.15
å¢
0.15
227
0.14
quires
0.14
gle
0.14
-alone
0.14
Activations Density 0.036%