INDEX
Explanations
personal pronouns indicating possession or connection
instances of the word "your."
New Auto-Interp
Negative Logits
ilts
-0.76
trak
-0.73
ylum
-0.73
raq
-0.72
Lago
-0.71
ega
-0.70
76561
-0.69
asia
-0.68
rix
-0.68
oldemort
-0.67
POSITIVE LOGITS
own
1.22
mileage
1.01
favorite
1.01
favourite
1.00
selves
0.89
opponent
0.87
choice
0.86
guys
0.84
ths
0.82
imagination
0.82
Activations Density 0.109%