INDEX
Explanations
possessive pronouns followed by an adjective
variations of the word 'its'
Negative Logits
Token         Logit
basketball    -0.80
Unsure        -0.77
tsy           -0.76
rette         -0.73
stad          -0.72
Gallery       -0.71
ooo           -0.70
pc            -0.68
Hastings      -0.68
behind        -0.67
Positive Logits
Token         Logit
own           1.35
ELF           1.22
predecessor   1.19
namesake      1.03
predecessors  1.01
respective    0.91
usefulness    0.87
rightful      0.85
newest        0.84
entirety      0.84
Activations Density: 0.128%