INDEX
Explanations
possessive pronouns that indicate ownership or association
New Auto-Interp
Negative Logits
orden
-0.15
isc
-0.15
ol
-0.14
295
-0.14
ems
-0.13
ither
-0.13
pi
-0.13
iset
-0.13
467
-0.13
info
-0.13
POSITIVE LOGITS
UCT
0.16
EFR
0.16
INA
0.15
اث
0.15
IFn
0.15
rog
0.14
oins
0.14
846
0.14
associated
0.14
accompanying
0.14
Activations Density 0.041%