INDEX
Explanations
mentions of specific individuals participating in various activities or roles
instances of the word "played"
New Auto-Interp
Negative Logits
ibel
-0.74
ordinate
-0.69
ortium
-0.68
plet
-0.65
icon
-0.64
owder
-0.64
brow
-0.64
attribute
-0.64
ciples
-0.63
lad
-0.63
POSITIVE LOGITS
Plays
1.02
played
1.00
Playing
0.95
Played
0.93
wright
0.92
Parenthood
0.86
ername
0.83
Piano
0.82
plays
0.78
GROUND
0.78
Activations Density 0.035%