Explanations
mentions of personal possession or body parts
references to personal ownership or possessive terms concerning health and well-being
Negative Logits
Token         Logit
ilts          -0.81
apo           -0.79
Uriel         -0.77
trak          -0.73
vous          -0.73
wik           -0.71
Originally    -0.70
Goes          -0.66
raq           -0.66
witch         -0.66
Positive Logits
Token         Logit
own           1.55
favourite     1.11
favorite      1.09
surroundings  1.02
opponent      1.00
adversary     0.97
ocard         0.97
spouse        0.93
fingertips    0.92
imagination   0.92
Activations Density: 0.115%