INDEX
Explanations
specific mentions of "your" in relation to computer processes or setups
possessive pronouns referring to the reader or user
New Auto-Interp
Negative Logits
apo
-0.84
ilts
-0.80
trak
-0.74
oft
-0.73
vous
-0.72
Originally
-0.72
forth
-0.71
Goes
-0.71
Uriel
-0.70
Shapiro
-0.69
POSITIVE LOGITS
own
1.49
favourite
1.12
favorite
1.08
ocard
1.00
surroundings
1.00
opponent
0.98
imagination
0.95
fingertips
0.94
adversary
0.92
desired
0.90
Activations Density 0.118%