INDEX
Explanations
pronouns and words that indicate possession or speech related to individuals
Negative Logits
hower    -0.16
Budd     -0.15
巡       -0.15
Cort     -0.15
aptors   -0.14
ανδ      -0.14
aptor    -0.13
ラック   -0.13
sour     -0.13
stown    -0.13
Positive Logits
client    0.90
clients   0.84
client    0.78
Client    0.77
Clients   0.76
Client    0.73
-client   0.70
_client   0.68
CLIENT    0.67
(client   0.66
Activations Density 0.011%